BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 048416
         (495 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255569810|ref|XP_002525869.1| conserved hypothetical protein [Ricinus communis]
 gi|223534874|gb|EEF36563.1| conserved hypothetical protein [Ricinus communis]
          Length = 520

 Score =  767 bits (1980), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/492 (78%), Positives = 435/492 (88%), Gaps = 2/492 (0%)

Query: 5   PPIKLKQSANSTAAAASS--AAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQS 62
           PPIKLKQS N TA   +S  ++QDARLLVRET+RISANLA++     +       +   S
Sbjct: 3   PPIKLKQSPNPTAQTLTSTTSSQDARLLVRETLRISANLASAPPAESLALLDNNISNDSS 62

Query: 63  ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQA 122
             +++     VE++FVDSSLRLIC E+IDGR+W YVAE D  G+FK+NS R++SLQ+PQA
Sbjct: 63  NKNKSRKLGFVEEEFVDSSLRLICCEEIDGRKWKYVAENDGFGRFKKNSFRSVSLQNPQA 122

Query: 123 PAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
           P EELMSFIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+SVGVS++
Sbjct: 123 PVEELMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTKTLLSSVGVSKS 182

Query: 183 RATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
            A PGAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV
Sbjct: 183 TAIPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
           PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN+ADLLGTGL
Sbjct: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVADLLGTGL 302

Query: 303 SIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
           SIMISKRNPSL+TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG VPSL+
Sbjct: 303 SIMISKRNPSLVTTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGRVPSLQ 362

Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
           EGN +ENIFSFPWL+++P+V+GSRFKDAFQDP A+LA+EP FEKERY+VTYNP+K KVYA
Sbjct: 363 EGNTRENIFSFPWLKERPIVIGSRFKDAFQDPSAFLAIEPLFEKERYIVTYNPTKDKVYA 422

Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGES 482
           LLKDQAK+DDILKAAFHA+VLLH INSS+  RS +LKQ ++D+SN +L+   LEAHI ES
Sbjct: 423 LLKDQAKADDILKAAFHAHVLLHFINSSSKSRSSSLKQQQHDHSNYILSANYLEAHIAES 482

Query: 483 CKVVSTALWGFQ 494
           CK+VS+    F+
Sbjct: 483 CKMVSSLYGSFK 494


>gi|359486217|ref|XP_003633414.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
 gi|297739449|emb|CBI29631.3| unnamed protein product [Vitis vinifera]
          Length = 504

 Score =  759 bits (1961), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/491 (77%), Positives = 427/491 (86%), Gaps = 15/491 (3%)

Query: 5   PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
           PP+KLKQS +    A S   QDARLL RET+RISA+LA+S +P  V           + S
Sbjct: 2   PPMKLKQSPSPQTLATS---QDARLLARETLRISASLASSPAPSAV-----------ASS 47

Query: 65  DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDR-SGKFKRNSIRAISLQSPQAP 123
            E     +V++QFVD+SLRLIC E+I+GRRWNYV + +  +G+ +  S+RA+  Q+PQAP
Sbjct: 48  PEARNLGIVDEQFVDASLRLICCEEINGRRWNYVVDNEGGNGQGRNGSLRAVCSQTPQAP 107

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
            +ELMSFIRSYVVP+GFPDSVTPSYVPYM+WRALKHFFGGAMGVFTTQTLLNSVGVSRNR
Sbjct: 108 VDELMSFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 167

Query: 184 ATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           + PGAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLR AGDLLMELGAGVELATAAVP
Sbjct: 168 SAPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVP 227

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
           HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS
Sbjct: 228 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 287

Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
           IMISKRNPSL+TTFALLSCGYV SSYQEVKSVVLHTLNRARFSVA++SFLKTG VPSL+E
Sbjct: 288 IMISKRNPSLVTTFALLSCGYVLSSYQEVKSVVLHTLNRARFSVAIDSFLKTGRVPSLQE 347

Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
           GN++ENIFSFPWL DKP+VLGSRFKDAFQD  +YL++EP FEKERY+VTYNPSKGKVYAL
Sbjct: 348 GNMKENIFSFPWLEDKPIVLGSRFKDAFQDAGSYLSIEPLFEKERYLVTYNPSKGKVYAL 407

Query: 424 LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESC 483
           LKDQAKSDDILKAAFHA++LLH I SSNG  S   KQ +YD+++LM +TADL AHI ESC
Sbjct: 408 LKDQAKSDDILKAAFHAHMLLHFIVSSNGSESFPTKQQDYDSTHLMPSTADLAAHIAESC 467

Query: 484 KVVSTALWGFQ 494
           K+VST+   F+
Sbjct: 468 KMVSTSYGLFK 478


>gi|449431896|ref|XP_004133736.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
 gi|449478104|ref|XP_004155224.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 507

 Score =  732 bits (1890), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/489 (72%), Positives = 417/489 (85%), Gaps = 9/489 (1%)

Query: 6   PIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSESD 65
           PI LKQS NS A + +S+ + AR+LVRET+R++ANLA  S P+      +        + 
Sbjct: 3   PINLKQSPNSAAKSVASSTE-ARILVRETLRVTANLA--SPPLD-----SLPPTTSPAAP 54

Query: 66  ETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAE 125
             T   ++++ F+DS+ RLIC E+IDGRRWNYVA+   SGK K  SIRA+ LQ+PQAP +
Sbjct: 55  HATNLPILDNHFLDSTSRLICCEEIDGRRWNYVADIQPSGKLKNASIRALCLQTPQAPID 114

Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
           E+MSFIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGV+RN+AT
Sbjct: 115 EMMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVARNKAT 174

Query: 186 PGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
           PGA+AINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAA PHL
Sbjct: 175 PGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAAPHL 234

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
           FLPLACAANVAKNV+AVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM
Sbjct: 235 FLPLACAANVAKNVSAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 294

Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
           ISKRNPSL+TTF LLSCGY+FSSYQEV+SVVLHTLNRARF+VAVE+FLKTG VPSL++GN
Sbjct: 295 ISKRNPSLVTTFGLLSCGYIFSSYQEVRSVVLHTLNRARFNVAVETFLKTGRVPSLQKGN 354

Query: 366 LQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLK 425
           + E I SFPWL++ P+VLG RFKDAFQD  ++LA+EP F++E+Y+VTYN +KGKVYALLK
Sbjct: 355 MNERILSFPWLKENPIVLGPRFKDAFQDAGSFLAIEPLFDREKYLVTYNQTKGKVYALLK 414

Query: 426 DQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESCKV 485
           DQAKSDDI+KAAFHA+VLLH I SS+G ++   KQ +  +S++  TT +LEA I  SCK+
Sbjct: 415 DQAKSDDIIKAAFHAHVLLHFIRSSSGAQNSNQKQVDAFSSSVP-TTTNLEAEIAASCKM 473

Query: 486 VSTALWGFQ 494
           VS +   F+
Sbjct: 474 VSNSYEIFK 482


>gi|357474239|ref|XP_003607404.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
 gi|355508459|gb|AES89601.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
          Length = 490

 Score =  694 bits (1792), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/491 (71%), Positives = 403/491 (82%), Gaps = 33/491 (6%)

Query: 6   PIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSESD 65
           P+ +K+    T   +S ++QD  +L+RET+RISA LA+S+ P+                D
Sbjct: 6   PLNVKKQ---TPNISSPSSQD--ILIRETLRISAQLASSTPPL---------------LD 45

Query: 66  ETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAE 125
            TT           S+LR IC E+IDGRRWNYVAE + SG+FK NS R++SLQ+P+ P +
Sbjct: 46  TTTP----------STLRFICCEEIDGRRWNYVAETNASGQFKNNSFRSLSLQTPKPPLD 95

Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
           E+MSFI SYVVP+GFP +VTPSYVPYMTWRALKHFFGGAMGVFTTQTLL+SVGVSRNRAT
Sbjct: 96  EMMSFITSYVVPEGFPHTVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRAT 155

Query: 186 PGAVA--INWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           P A A  INWILKDGAGRVGKM+FARQGKKFDYDLKQLRF GDLLMELGAGVELATAA+P
Sbjct: 156 PAAAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDLLMELGAGVELATAAMP 215

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
           HLFLPLACAANV KNV AVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTG S
Sbjct: 216 HLFLPLACAANVLKNVGAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGFS 275

Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
           IMI+K NPSLITTF+LLSCGY+ SSY+EVKSVVLHTLN ARFSVAV+SFLKTG VP+LRE
Sbjct: 276 IMIAKSNPSLITTFSLLSCGYILSSYKEVKSVVLHTLNSARFSVAVDSFLKTGQVPTLRE 335

Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
           GNL E+IFSFPW +D+PVVLGSR KDAFQ P AY+A+EP F+KERY+VTYNPSK KV+A+
Sbjct: 336 GNLNEDIFSFPW-KDRPVVLGSRIKDAFQSPSAYVAIEPLFDKERYIVTYNPSKSKVHAV 394

Query: 424 LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESC 483
           LKDQAKSDDILKAAFHA+VL + I S N  +  + KQ +  NS++MLT ADLEA I +SC
Sbjct: 395 LKDQAKSDDILKAAFHAHVLSNFIKSLNESKGSSWKQGDDLNSSVMLTAADLEACIADSC 454

Query: 484 KVVSTALWGFQ 494
           KVV+ A W F+
Sbjct: 455 KVVTNAYWLFK 465


>gi|356563749|ref|XP_003550122.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 497

 Score =  677 bits (1746), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/496 (69%), Positives = 400/496 (80%), Gaps = 38/496 (7%)

Query: 4   PPPIKL--KQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQ 61
           P P+K+  KQSANST    ++   +  +LVRETMRISANLA++              LL+
Sbjct: 3   PNPMKMNMKQSANST----TTTIANKEILVRETMRISANLASTP-------------LLE 45

Query: 62  SESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISL--QS 119
           +    +              + +IC E++DGRRW YVAE D  G FK+NS R + L   +
Sbjct: 46  APPPPSPI------------VGIICCEELDGRRWKYVAESDARGGFKKNSFRPVRLNFHN 93

Query: 120 PQ-APAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
           P+  P  E+++F+ SYVVP+GFPDSV PSYVPYMTWRALKHFFGGAMGVFTTQTLL+SVG
Sbjct: 94  PRDHPLHEVLAFVTSYVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVG 153

Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
           VSRNRA PGAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELA
Sbjct: 154 VSRNRAAPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 213

Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
           TAAVPHLFLPLACAANV KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL
Sbjct: 214 TAAVPHLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 273

Query: 299 GTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
           GTGLSI+I+KRNPSL+TTF+LLSCGY+ SSY+EVKSVVLHTLN  RFSVAVESFL+TG V
Sbjct: 274 GTGLSILIAKRNPSLVTTFSLLSCGYILSSYREVKSVVLHTLNCGRFSVAVESFLRTGQV 333

Query: 359 PSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKG 418
           P+L+EGN+ ENIFSFPW +D+PVVLGSR K+AFQDP AY A+EP F++ERY+VTYNPSK 
Sbjct: 334 PTLQEGNMNENIFSFPW-KDRPVVLGSRIKEAFQDPSAYFAIEPLFDRERYIVTYNPSKH 392

Query: 419 KVYALLKDQAKSDDILKAAFHAYVLL-HCINSSNGHRSLALKQHEYDNSNLMLTTADLEA 477
           KVYA+LKDQAKSDDILKAAFHA+VLL + I S N   + +LKQ E D SN+  T AD+EA
Sbjct: 393 KVYAVLKDQAKSDDILKAAFHAHVLLFNLIKSWNESNASSLKQRE-DLSNMTHTVADIEA 451

Query: 478 HIGESCKVVSTALWGF 493
            +  +CK V+ + +GF
Sbjct: 452 CMAGTCKTVADS-YGF 466


>gi|18423070|ref|NP_568713.1| uncharacterized protein [Arabidopsis thaliana]
 gi|16604643|gb|AAL24114.1| unknown protein [Arabidopsis thaliana]
 gi|23296703|gb|AAN13151.1| unknown protein [Arabidopsis thaliana]
 gi|332008476|gb|AED95859.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 497

 Score =  675 bits (1741), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/490 (69%), Positives = 396/490 (80%), Gaps = 23/490 (4%)

Query: 5   PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
           P +KL   +     A+ S     RLL RET+RISA+LA  S P+              + 
Sbjct: 2   PSVKLTHHSPPDTIASDSV----RLLSRETLRISASLA--SPPV--------------DD 41

Query: 65  DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
               +    + QF+ S+LRLIC E+IDGRR+ YVAE D SG+FK+NS+RAISL+SPQ P 
Sbjct: 42  LPPHSPPPPDSQFLHSTLRLICCEEIDGRRFKYVAESDGSGRFKKNSVRAISLESPQTPF 101

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           +E+ SF+RSYVVP+GFP SV  SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG SRN +
Sbjct: 102 DEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSS 161

Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
              AVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH
Sbjct: 162 ASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 221

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
           LFLPLACAANV KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL+GTG SI
Sbjct: 222 LFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI 281

Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
           +ISKRNPSL+TTF LLSCGY+ SSYQEV+SVVLHTLNRARF+VAVESFLKTG VPSL+EG
Sbjct: 282 LISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQEG 341

Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
           N+QE IF+FPW+ D+PV+LG+RFKDAFQDP  Y+AV+PFF+KERY+VTY+P+KGKVYALL
Sbjct: 342 NIQEKIFTFPWVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALL 401

Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESCK 484
           K QA SDDILKAAFHA+VLLH +N S      +++Q    +     T  +LE+ I ESC+
Sbjct: 402 KHQANSDDILKAAFHAHVLLHFMNQSKDGNPRSVEQL---DPAFAPTEYELESRIAESCE 458

Query: 485 VVSTALWGFQ 494
           +VST+   F+
Sbjct: 459 MVSTSYGVFK 468


>gi|356522111|ref|XP_003529693.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 488

 Score =  672 bits (1734), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/492 (68%), Positives = 388/492 (78%), Gaps = 37/492 (7%)

Query: 6   PIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSESD 65
           P  +KQSANST    ++   +  +LVRETMRISANLA+                      
Sbjct: 3   PNPMKQSANST----TTTIANKEILVRETMRISANLASPPQ------------------- 39

Query: 66  ETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQS--PQAP 123
                           + +IC E++DGRRW Y+AE D  G FK+NS   +SL S  P+ P
Sbjct: 40  ---------PPPPSPIVGIICCEEMDGRRWKYLAESDGFGGFKKNSFLPVSLNSNHPRDP 90

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
             E++SF+ SYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLL+SVGV RNR
Sbjct: 91  LHEVLSFVTSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVCRNR 150

Query: 184 ATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           A PGAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF GDLLMELGAGVELATAAVP
Sbjct: 151 AAPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDLLMELGAGVELATAAVP 210

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
           HLFLPLACAANV KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS
Sbjct: 211 HLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 270

Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
           I+I+KRNPSL+TTF+LLSCGY+ SSY+EVKSVVLHTLN  RFSVAVE FL TG VP+L+E
Sbjct: 271 ILIAKRNPSLVTTFSLLSCGYILSSYREVKSVVLHTLNCGRFSVAVEHFLMTGQVPTLQE 330

Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
           GN+ ENIFSFPW +D+PVVLGSR K+AFQDP AY+A+EP F++ERY+VTYNPSK KVYA+
Sbjct: 331 GNMNENIFSFPW-KDRPVVLGSRIKEAFQDPSAYVAIEPLFDRERYIVTYNPSKHKVYAV 389

Query: 424 LKDQAKSDDILKAAFHAYVLLHCINSS-NGHRSLALKQHEYDNSNLMLTTADLEAHIGES 482
           LKDQAKSDDILKAAFHA+VL   +  S N +++ +LKQ E D SN+  T AD+EA I  +
Sbjct: 390 LKDQAKSDDILKAAFHAHVLFFSLMKSLNENKASSLKQRE-DLSNMTHTVADIEARIAGT 448

Query: 483 CKVVSTALWGFQ 494
           CK V+ +   F+
Sbjct: 449 CKTVADSYGCFK 460


>gi|147768659|emb|CAN60615.1| hypothetical protein VITISV_003254 [Vitis vinifera]
          Length = 470

 Score =  672 bits (1734), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/491 (70%), Positives = 393/491 (80%), Gaps = 49/491 (9%)

Query: 5   PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
           PP+KLKQS +    A S   QDARLL RET+RISA+LA+S +P  V           + S
Sbjct: 2   PPMKLKQSPSPQTLATS---QDARLLARETLRISASLASSPAPSAV-----------ASS 47

Query: 65  DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDR-SGKFKRNSIRAISLQSPQAP 123
            E     +V++QFVD+SLRLIC E+I+GRRWNYV + +  +G+ +  S+RA+  Q+PQAP
Sbjct: 48  PEARNLGIVDEQFVDASLRLICCEEINGRRWNYVVDNEGGNGQGRNGSLRAVCSQTPQAP 107

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
            +ELMSFIRSYVVP+GFPDSVTPSYVPYM+WRALKHFFGGAMGVFTTQTLLNSVGVSRNR
Sbjct: 108 VDELMSFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 167

Query: 184 ATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           + PGAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLR AGDLLMELGAGVELATAAVP
Sbjct: 168 SAPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVP 227

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
           HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL      
Sbjct: 228 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL------ 281

Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
                                       VKSVVLHTLNRAR+SVA++ FLKTG VPSL+E
Sbjct: 282 ----------------------------VKSVVLHTLNRARYSVAIDCFLKTGRVPSLQE 313

Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
           GN++ENIFSFPWL DKP+VLGSRFKDAFQD  +YL++EP FEKERY+VTYNPSKGKVYAL
Sbjct: 314 GNMKENIFSFPWLEDKPIVLGSRFKDAFQDAGSYLSIEPLFEKERYLVTYNPSKGKVYAL 373

Query: 424 LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESC 483
           LKDQAKSDDILKAAFHA++LLH I SSNG  S   KQ +YD+++LM +TADL AHI ESC
Sbjct: 374 LKDQAKSDDILKAAFHAHMLLHFIVSSNGSESFPTKQQDYDSTHLMPSTADLAAHIAESC 433

Query: 484 KVVSTALWGFQ 494
           K+VST+   F+
Sbjct: 434 KMVSTSYGLFK 444


>gi|297795749|ref|XP_002865759.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
 gi|297311594|gb|EFH42018.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
          Length = 499

 Score =  671 bits (1732), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/490 (70%), Positives = 398/490 (81%), Gaps = 20/490 (4%)

Query: 5   PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
           P +KLK  +     ++SS+    RLL RET+RISA+LA  S P+              + 
Sbjct: 2   PSVKLKHHSPPDTISSSSSDS-VRLLSRETLRISASLA--SPPV--------------DD 44

Query: 65  DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
               +    + QF+ S+LRLIC E+IDGRR+ YVAE D SG+FK+NS+RAISL+SPQ P 
Sbjct: 45  LPPHSPPSPDSQFLHSTLRLICCEEIDGRRFKYVAESDGSGRFKKNSVRAISLESPQTPF 104

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           +E+ SF+RSYVVP+GFP SV  SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG SRN +
Sbjct: 105 DEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSS 164

Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
              AVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH
Sbjct: 165 ASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 224

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
           LFLPLACAANV KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL+GTG SI
Sbjct: 225 LFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI 284

Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
           +ISKRNPSL+TTF LLSCGY+ SSYQEV+SVVLHTLNRARF+VAVESFLKTG VPSL+EG
Sbjct: 285 LISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQEG 344

Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
           N+QE IF+FPW+ D+PV+LG+RFKDAFQDP  Y+AV+PFF+KERY+VTY+P+KGKVYALL
Sbjct: 345 NIQEKIFTFPWVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALL 404

Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESCK 484
           K QA SDDILKAAFHA+VLLH +N S       ++Q    +     T  +LE+ I ESC+
Sbjct: 405 KHQANSDDILKAAFHAHVLLHFMNQSKDGTPRLVEQL---DPAFAPTEYELESRIAESCE 461

Query: 485 VVSTALWGFQ 494
           +VST+   F+
Sbjct: 462 MVSTSYAIFK 471


>gi|414879331|tpg|DAA56462.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
 gi|414879332|tpg|DAA56463.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
          Length = 503

 Score =  628 bits (1620), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 315/500 (63%), Positives = 381/500 (76%), Gaps = 35/500 (7%)

Query: 5   PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
           P + LK+SA  T    +    D RL VR+ MR +     + +P        A+T  Q+ +
Sbjct: 3   PAVGLKRSATQTI---TLPPPDTRLAVRDVMRSTIPSQPAVAP--------ASTERQAPA 51

Query: 65  DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAE------RDRSGKFKRNS-IRAISL 117
                F              +C E++DGRRW+YV +      R R G     + +RA+ L
Sbjct: 52  AALQGF--------------LCLEEVDGRRWSYVVDGGAAKGRSRGGAVPAGAAVRAVPL 97

Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
           QSP  PAEE+M+FIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+SV
Sbjct: 98  QSPLPPAEEIMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSV 157

Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
           GVS+++ TPGA+AINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+ DLL+E+GAG+EL
Sbjct: 158 GVSQSKVTPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIEL 217

Query: 238 ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL 297
            TAA P  FLP+AC ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIADL
Sbjct: 218 TTAAFPQFFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADL 277

Query: 298 LGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
           LGTGLSI ISKRNPSL+T+FA LSCGY+ SSYQEV+SVVL+TLNRARF+VAV+SF+KTGH
Sbjct: 278 LGTGLSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGH 337

Query: 358 VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSK 417
           +PSL+EGN QE IF+ PW R +PV +GSRF +AFQ+P ++LA+ P FE ERY+VTYNP+K
Sbjct: 338 IPSLKEGNSQETIFNPPW-RHEPVAIGSRFGEAFQEPASFLAIRPLFEDERYMVTYNPTK 396

Query: 418 GKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHR-SLALKQHEYDNSNLMLT-TADL 475
            KVYALLKDQAKSDDI+KAAFHA+VLLH IN+S+  R     KQ   D S  + +   D 
Sbjct: 397 DKVYALLKDQAKSDDIIKAAFHAHVLLHFINASHARRLKQKQKQANPDRSEYLYSRNMDF 456

Query: 476 EAHIGESCKVVSTALWGFQE 495
            AHI ESCK+VS++   F++
Sbjct: 457 LAHIAESCKIVSSSYGTFKK 476


>gi|224129512|ref|XP_002328735.1| predicted protein [Populus trichocarpa]
 gi|222839033|gb|EEE77384.1| predicted protein [Populus trichocarpa]
          Length = 398

 Score =  624 bits (1608), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 309/364 (84%), Positives = 338/364 (92%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           +ELM FIRSYVVP+GFPDSV PSYVPYMTWRALKHFFGGAMGVFTT+TLLNSVGVS+++A
Sbjct: 3   QELMGFIRSYVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTKTLLNSVGVSKSQA 62

Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
            PGAVAINWILKDGAGRVGKM++ARQGKKFDYDLKQLRFAGDLLMELGA VELATAAVPH
Sbjct: 63  IPGAVAINWILKDGAGRVGKMLYARQGKKFDYDLKQLRFAGDLLMELGAAVELATAAVPH 122

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
           LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN+ADLLGTGLSI
Sbjct: 123 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVADLLGTGLSI 182

Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
           MISKRNPSL+TTFALLSCGYV SSYQEVKSVVLHTLN ARFSVAVESFLKTG VPSL EG
Sbjct: 183 MISKRNPSLVTTFALLSCGYVLSSYQEVKSVVLHTLNGARFSVAVESFLKTGQVPSLHEG 242

Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
           N++ENIF+FPWL+D+ + LG RFKDAFQDP AYLA+EP FE+ERY+VTYNPSK KVYALL
Sbjct: 243 NVKENIFNFPWLKDRSISLGPRFKDAFQDPRAYLAIEPLFERERYIVTYNPSKDKVYALL 302

Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESCK 484
           KDQAK DDI+KAAFHA+VL+H I+SSN + S   KQ EY +SN +L+ ADLE+HI ESCK
Sbjct: 303 KDQAKPDDIVKAAFHAHVLMHFIHSSNNNCSSPSKQQEYGHSNFILSAADLESHIAESCK 362

Query: 485 VVST 488
           +VST
Sbjct: 363 MVST 366


>gi|226502346|ref|NP_001144348.1| uncharacterized protein LOC100277253 [Zea mays]
 gi|195640526|gb|ACG39731.1| hypothetical protein [Zea mays]
          Length = 503

 Score =  623 bits (1607), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 313/500 (62%), Positives = 379/500 (75%), Gaps = 35/500 (7%)

Query: 5   PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
           P + LK+SA  T    +    D RL VR+ MR +     + +P        A+T  Q+ +
Sbjct: 3   PAVGLKRSATQTI---TLPPPDTRLAVRDVMRSTIPSQPAVAP--------ASTERQAPA 51

Query: 65  DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAE------RDRSGKFKRNS-IRAISL 117
                F              +C  ++DG RW+YV +      R R G     + +RA+ L
Sbjct: 52  AALQGF--------------LCLXEVDGXRWSYVVDGGAAKGRSRGGAVPAGAAVRAVPL 97

Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
           QSP  PAEE+M+FIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+SV
Sbjct: 98  QSPLPPAEEIMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSV 157

Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
           GVS+++ TPGA+AINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+ DLL+E+GAG+EL
Sbjct: 158 GVSQSKVTPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIEL 217

Query: 238 ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL 297
            TAA P  FLP+AC ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIADL
Sbjct: 218 TTAAFPQFFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADL 277

Query: 298 LGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
           LGTGLSI ISKRNPSL+T+FA LSCGY+ SSYQEV+SVVL+TLNRARF+VAV+SF+KTGH
Sbjct: 278 LGTGLSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGH 337

Query: 358 VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSK 417
           +PSL+EGN QE IF+ PW R +PV +GSRF +AFQ+P ++LA+ P FE ERY+VTYNP+K
Sbjct: 338 IPSLKEGNSQETIFNPPW-RHEPVAIGSRFGEAFQEPASFLAIRPLFEDERYMVTYNPTK 396

Query: 418 GKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHR-SLALKQHEYDNSNLMLT-TADL 475
            KVYALLKDQAKSDDI+KAAFHA+VLLH IN+S+  R     KQ   D S  + +   D 
Sbjct: 397 DKVYALLKDQAKSDDIIKAAFHAHVLLHFINASHARRLKQKQKQANPDRSEYLYSRNMDF 456

Query: 476 EAHIGESCKVVSTALWGFQE 495
            AHI ESCK+VS++   F++
Sbjct: 457 LAHIAESCKIVSSSYGTFKK 476


>gi|357133637|ref|XP_003568430.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Brachypodium
           distachyon]
          Length = 521

 Score =  616 bits (1588), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 314/493 (63%), Positives = 370/493 (75%), Gaps = 50/493 (10%)

Query: 26  DARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSESDETTAFRLVEDQFVDSSLRLI 85
           DAR+ VRE +R++   A       V    A A  +                       ++
Sbjct: 27  DARVAVREAVRVAVREAEQPPTARVSAVQAPAAAVDG---------------------VL 65

Query: 86  CSEDIDGRRWNYVAERDRS--GKFKRNS--------------IRAISLQSPQAPAEELMS 129
           C E++DGRRW+YV E   S  GK  R S               RA+ LQSP  P EE+MS
Sbjct: 66  CLEEVDGRRWSYVVEGAASSAGKPGRVSARGRGGSAAPVGATFRAVPLQSPLPPVEEIMS 125

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           FIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAM VFTT+ LLNSVGVS++RAT GAV
Sbjct: 126 FIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRATSGAV 185

Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           AINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA PHLFLPL
Sbjct: 186 AINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHLFLPL 245

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR 309
           AC ANV KNVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTG+SI+ISKR
Sbjct: 246 ACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISKR 305

Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
           NPSL+ +FA LSCGY+ SSY+EV+SVVL+TLN ARF+VAV+SF+KTGHVPSL+EGNL+E 
Sbjct: 306 NPSLVASFAFLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEGNLEET 365

Query: 370 IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAK 429
           IF+ PW R +PV +GSRF +AFQ+P ++++  P FE ERY+VTYNP+K KVYALLKDQAK
Sbjct: 366 IFNPPW-RHQPVAIGSRFGEAFQEPASFVSTSPLFEDERYIVTYNPTKDKVYALLKDQAK 424

Query: 430 SDDILKAAFHAYVLLHCINSSNGHRSLALKQH--------EYDNSNLMLTTADLEAHIGE 481
            DDILKAAFHA+VLLH IN+S  H +L  ++H        +Y N N +    D   HI E
Sbjct: 425 PDDILKAAFHAHVLLHFINAS--HANLNARKHMNSNRGSYQYVNPNPL--NMDFLPHIEE 480

Query: 482 SCKVVSTALWGFQ 494
           SCK+V+++   F+
Sbjct: 481 SCKIVTSSYGIFK 493


>gi|222619643|gb|EEE55775.1| hypothetical protein OsJ_04341 [Oryza sativa Japonica Group]
          Length = 508

 Score =  613 bits (1582), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 293/425 (68%), Positives = 354/425 (83%), Gaps = 18/425 (4%)

Query: 84  LICSEDIDGRRWNYVAERDRSGKFKRN------------SIRAISLQSPQAPAEELMSFI 131
            +C E++DGRRW+YV +  +     R             S+RA+ LQSP  PAEE+M+FI
Sbjct: 63  FLCLEEVDGRRWSYVVDGGQGKGKGRGRGRSGAAVPMGASVRAVPLQSPLPPAEEVMAFI 122

Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
           RSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+SVGVS+++ TPGA+AI
Sbjct: 123 RSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPGAIAI 182

Query: 192 NWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 251
           NWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+ DLL+E+GAG+ELATAA P  FLP+AC
Sbjct: 183 NWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQFFLPMAC 242

Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNP 311
            ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIADLLGTGLSI ISKRNP
Sbjct: 243 VANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNP 302

Query: 312 SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
           SL+T+FA LSCGY++SSY EV+SVVL+TLNRARF+VAV+SF+KTGH+PSL+EGN QE IF
Sbjct: 303 SLVTSFAFLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 362

Query: 372 SFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSD 431
           + PW R +PV +GSRF +AFQ+P +++A+ P FE ERY+VTYNP+K KVYALLKDQAKSD
Sbjct: 363 NPPW-RHEPVAIGSRFGEAFQEPASFVAIRPLFEDERYMVTYNPAKDKVYALLKDQAKSD 421

Query: 432 DILKAAFHAYVLLHCINSSNGHRSLALKQHE-YDNSNLMLTTADLEAHIGESCKVVSTAL 490
           DI+KAAFHA+VLLH IN+S+  + +  ++ + Y N   M    D  AHI ESCK+VS++ 
Sbjct: 422 DIIKAAFHAHVLLHFINASHARKQMNSRRSDPYGNPCNM----DFMAHIAESCKIVSSSY 477

Query: 491 WGFQE 495
             F++
Sbjct: 478 GTFKK 482


>gi|56784316|dbj|BAD82242.1| unknown protein [Oryza sativa Japonica Group]
 gi|56785239|dbj|BAD82127.1| unknown protein [Oryza sativa Japonica Group]
          Length = 508

 Score =  613 bits (1582), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 293/425 (68%), Positives = 354/425 (83%), Gaps = 18/425 (4%)

Query: 84  LICSEDIDGRRWNYVAERDRSGKFKRN------------SIRAISLQSPQAPAEELMSFI 131
            +C E++DGRRW+YV +  +     R             S+RA+ LQSP  PAEE+M+FI
Sbjct: 63  FLCLEEVDGRRWSYVVDGGQGKGKGRGRGRSGAAVPMGASVRAVPLQSPLPPAEEVMAFI 122

Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
           RSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+SVGVS+++ TPGA+AI
Sbjct: 123 RSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPGAIAI 182

Query: 192 NWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 251
           NWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+ DLL+E+GAG+ELATAA P  FLP+AC
Sbjct: 183 NWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQFFLPMAC 242

Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNP 311
            ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIADLLGTGLSI ISKRNP
Sbjct: 243 VANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNP 302

Query: 312 SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
           SL+T+FA LSCGY++SSY EV+SVVL+TLNRARF+VAV+SF+KTGH+PSL+EGN QE IF
Sbjct: 303 SLVTSFAFLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 362

Query: 372 SFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSD 431
           + PW R +PV +GSRF +AFQ+P +++A+ P FE ERY+VTYNP+K KVYALLKDQAKSD
Sbjct: 363 NPPW-RHEPVAIGSRFGEAFQEPASFVAIRPLFEDERYMVTYNPAKDKVYALLKDQAKSD 421

Query: 432 DILKAAFHAYVLLHCINSSNGHRSLALKQHE-YDNSNLMLTTADLEAHIGESCKVVSTAL 490
           DI+KAAFHA+VLLH IN+S+  + +  ++ + Y N   M    D  AHI ESCK+VS++ 
Sbjct: 422 DIIKAAFHAHVLLHFINASHARKQMNSRRSDPYGNPCNM----DFMAHIAESCKIVSSSY 477

Query: 491 WGFQE 495
             F++
Sbjct: 478 GTFKK 482


>gi|326490149|dbj|BAJ94148.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 519

 Score =  612 bits (1579), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 299/432 (69%), Positives = 352/432 (81%), Gaps = 21/432 (4%)

Query: 84  LICSEDIDGRRWNYVAERDRS--GKFKRNS--------------IRAISLQSPQAPAEEL 127
           ++C E+++GRRW+YV E   S  GK  R S               RA+ LQSP  PAEE+
Sbjct: 62  VLCFEEVEGRRWSYVVEGAASSAGKTGRVSARARGGSATPVGATFRAVPLQSPLPPAEEI 121

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           MSFIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAM VFTT+ LLNSVGVS++RAT G
Sbjct: 122 MSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRATSG 181

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           AVAINWIL+DGAGRVGKM+FARQGKKFDYDLKQLRF+GDL+MELGAG+ELATAA P LFL
Sbjct: 182 AVAINWILRDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLMMELGAGIELATAAFPQLFL 241

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
           PLAC ANV KNVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTG+SI++S
Sbjct: 242 PLACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILMS 301

Query: 308 KRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
           KRNPSL+ +FA+LSCGY+ SSYQEV+SVVL+TLN ARF+VAV+SF+KTGHVPSL+EGNL+
Sbjct: 302 KRNPSLVASFAVLSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEGNLE 361

Query: 368 ENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQ 427
           E IF+ PW R +PV +GSRF +AFQ+P +++A  P FE ERY+VTYNP+K KVYALLKDQ
Sbjct: 362 ETIFNPPW-RHQPVAIGSRFGEAFQEPASFVATRPLFEDERYIVTYNPAKDKVYALLKDQ 420

Query: 428 AKSDDILKAAFHAYVLLHCINSSNGH----RSLALKQHEYDNSNLMLTTADLEAHIGESC 483
           AK DDILKAAFHA+VLLH IN+S+ +    + +   Q  Y   N      D  AHI ESC
Sbjct: 421 AKQDDILKAAFHAHVLLHFINASHANLKARKRMNSDQGSYQYVNPNPLNMDFLAHIEESC 480

Query: 484 KVVSTALWGFQE 495
           K+V+++   F+ 
Sbjct: 481 KIVTSSYGVFKR 492


>gi|357126248|ref|XP_003564800.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
           distachyon]
          Length = 506

 Score =  611 bits (1576), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 305/500 (61%), Positives = 371/500 (74%), Gaps = 32/500 (6%)

Query: 5   PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
           P + LK+S  +   A     QDAR  VRE +R +      +   G     A A  L+   
Sbjct: 3   PTVGLKRSGTTQTIAVPPPPQDARFAVREAVRSTIATPPPAEAPGAPARVAPAPALEG-- 60

Query: 65  DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRN---------SIRAI 115
                               +C E++DGRRW YV +                   S+RA+
Sbjct: 61  -------------------FLCLEEVDGRRWKYVVDEAPGKGKGGRGGSAVPLGASVRAV 101

Query: 116 SLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLN 175
            LQSP  PAEE+M+F+RSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+
Sbjct: 102 PLQSPLPPAEEIMAFVRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLS 161

Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGV 235
           SVGVS+++ TPGA+AINWILKDGAGRVGKM+FARQGKKFD DLKQLRF+ DLL+E+GAG+
Sbjct: 162 SVGVSQSKVTPGAIAINWILKDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGI 221

Query: 236 ELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 295
           ELATAA P  FLP+AC ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIA
Sbjct: 222 ELATAAFPQFFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIA 281

Query: 296 DLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
           DLLGTGLSI I+KRNPSL+T+FALLSCGY+ SSYQEV+SVVL+TLNRARF+VAV+SF+KT
Sbjct: 282 DLLGTGLSIYITKRNPSLVTSFALLSCGYLMSSYQEVRSVVLNTLNRARFTVAVDSFIKT 341

Query: 356 GHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNP 415
           G+VPSL+EGN QE +F  PW R +PV +GSRF +AFQ+P +++A  P FE ERY+VTYNP
Sbjct: 342 GYVPSLKEGNSQETLFIHPW-RHEPVAIGSRFGEAFQEPVSFVATRPLFEDERYMVTYNP 400

Query: 416 SKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADL 475
           +K KVYALLKDQAK DD+LKAAFHA+VLLH IN+S+  + +   + ++  +NL     D 
Sbjct: 401 TKDKVYALLKDQAKPDDVLKAAFHAHVLLHFINASHARKRMNTNRSDH-YANLHPRNMDF 459

Query: 476 EAHIGESCKVVSTALWGFQE 495
            AHI ESCK+VS +   F++
Sbjct: 460 LAHIAESCKLVSASYGTFKK 479


>gi|226492898|ref|NP_001145431.1| uncharacterized protein LOC100278802 [Zea mays]
 gi|195656119|gb|ACG47527.1| hypothetical protein [Zea mays]
 gi|224035549|gb|ACN36850.1| unknown [Zea mays]
 gi|413945393|gb|AFW78042.1| hypothetical protein ZEAMMB73_132907 [Zea mays]
          Length = 511

 Score =  609 bits (1571), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 296/423 (69%), Positives = 354/423 (83%), Gaps = 17/423 (4%)

Query: 84  LICSEDIDGRRWNYVAERDRSGKFK--RNSI----RAISLQSPQAPAEELMSFIRSYVVP 137
           ++C E++DGRRW+YV +   +      R S+    +A+ LQSP  P EE+MSFIRSYVVP
Sbjct: 68  VLCLEEVDGRRWSYVVDAAGAAVKAKGRASVGGAFKAVPLQSPMPPVEEIMSFIRSYVVP 127

Query: 138 DGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
           +GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLLNSVGVS++RA  GAVAINWILKD
Sbjct: 128 EGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRAASGAVAINWILKD 187

Query: 198 GAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAK 257
           GAGRVGKM+FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA P LFLP+AC ANV K
Sbjct: 188 GAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLPMACIANVVK 247

Query: 258 NVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTF 317
           NVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTG+SI+ISK NPSL+T+F
Sbjct: 248 NVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISKSNPSLVTSF 307

Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR 377
           A LSCGY+ SSY EV+SVVL+TLNRARF+VAV+SF++TG+VPSL++GN QE IF+ PW R
Sbjct: 308 AFLSCGYLLSSYHEVRSVVLNTLNRARFTVAVDSFIRTGYVPSLKDGNSQETIFNPPW-R 366

Query: 378 DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAA 437
            +PV +GSRF DAFQ+P ++LA++P FE ERY+VTYNP+K KVYALLKDQAK DDILKAA
Sbjct: 367 HEPVAIGSRFGDAFQEPASFLAIKPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAA 426

Query: 438 FHAYVLLHCINSSNGH-----RSLALKQHEYDNSNLMLTTADLEAHIGESCKVVSTALWG 492
           FHA+VLLH IN+S+ +     R  + + ++YD  N+     D   HI ESCK+V ++   
Sbjct: 427 FHAHVLLHFINASHANLNARKRMNSNRSYQYDPLNM-----DFVPHIEESCKIVMSSYGV 481

Query: 493 FQE 495
           F++
Sbjct: 482 FKK 484


>gi|242090561|ref|XP_002441113.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
 gi|241946398|gb|EES19543.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
          Length = 517

 Score =  600 bits (1547), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 292/423 (69%), Positives = 355/423 (83%), Gaps = 17/423 (4%)

Query: 84  LICSEDIDGRRWNYVAERDRSGKFK--RNSI----RAISLQSPQAPAEELMSFIRSYVVP 137
           ++C E++DGRRW+YV +   +      R S+    +A+ LQSP  P EE+MSFIRSYVVP
Sbjct: 74  VLCLEEVDGRRWSYVVDAAGAAVKAKGRASVGGAFKAVPLQSPLPPVEEIMSFIRSYVVP 133

Query: 138 DGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
           +GFP+SVTPSYVPYMTWRA+KHFFGGAMGVFTT+TLLNSVGVS++RA  GAVAINWILKD
Sbjct: 134 EGFPESVTPSYVPYMTWRAMKHFFGGAMGVFTTRTLLNSVGVSQSRAASGAVAINWILKD 193

Query: 198 GAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAK 257
           GAGRVGKM+FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA P LFLP+AC ANV K
Sbjct: 194 GAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLPMACIANVVK 253

Query: 258 NVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTF 317
           NVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTG+SI+ISK NPSL+T+F
Sbjct: 254 NVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISKSNPSLVTSF 313

Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR 377
           A+LSCGY+ SSY EV+SVVL+TLNRARF+VAV+SF++TG+VPSL++GN QE IF+ PW R
Sbjct: 314 AVLSCGYLLSSYYEVRSVVLNTLNRARFTVAVDSFIRTGYVPSLKDGNSQETIFNPPW-R 372

Query: 378 DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAA 437
            +PV +GSRF +AFQ+P +++A++P FE ERY+VTYNP+K KVYALLKDQAK DDILKAA
Sbjct: 373 HEPVAIGSRFGEAFQEPASFIAIKPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAA 432

Query: 438 FHAYVLLHCINSSNGH-----RSLALKQHEYDNSNLMLTTADLEAHIGESCKVVSTALWG 492
           FHA+VLLH IN+S+ +     R  A + ++Y+  N+     D   HI ESCK+V ++   
Sbjct: 433 FHAHVLLHFINASHANLNARKRMNANRSYQYNPLNM-----DFVPHIEESCKIVMSSYGV 487

Query: 493 FQE 495
           F++
Sbjct: 488 FKK 490


>gi|297604521|ref|NP_001055571.2| Os05g0419200 [Oryza sativa Japonica Group]
 gi|53982666|gb|AAV25645.1| unknown protein [Oryza sativa Japonica Group]
 gi|222631625|gb|EEE63757.1| hypothetical protein OsJ_18576 [Oryza sativa Japonica Group]
 gi|255676375|dbj|BAF17485.2| Os05g0419200 [Oryza sativa Japonica Group]
          Length = 415

 Score =  597 bits (1540), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 286/388 (73%), Positives = 334/388 (86%), Gaps = 6/388 (1%)

Query: 111 SIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT 170
           + +A+ LQSP  P EE+MSFIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT
Sbjct: 4   TFKAVPLQSPLPPVEEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT 63

Query: 171 QTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLME 230
           +TLLNSVGV+++RAT GAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+GDLLME
Sbjct: 64  RTLLNSVGVAQSRATSGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLME 123

Query: 231 LGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGEC 290
           LGAG+ELATAA P LFLP+AC ANV KNVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE 
Sbjct: 124 LGAGIELATAAFPQLFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGES 183

Query: 291 VGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVE 350
           VGNIADLLGTGLSI+ISKRNPSL+T+FA LSCGY+ SSY EV+SVVL+TLN ARF+VAV+
Sbjct: 184 VGNIADLLGTGLSILISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVD 243

Query: 351 SFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYV 410
           SF+K+GHVPSL+EGN QE IF+ PW R +PV +GSRF +AFQ+P +++A+ P FE ERY+
Sbjct: 244 SFIKSGHVPSLKEGNSQETIFNPPW-RHQPVAIGSRFGEAFQEPASFVAIRPLFEDERYI 302

Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGH---RSLALKQHEYDNSN 467
           VTYNP+K KVYALLKDQAK DDILKAAFHA+VLLH IN+S+ +   R        Y N+N
Sbjct: 303 VTYNPTKDKVYALLKDQAKPDDILKAAFHAHVLLHFINASHANLNARKRMNSNRSYQNAN 362

Query: 468 LMLTTADLEAHIGESCKVVSTALWGFQE 495
            +    D   HI ESCK+V+++   F++
Sbjct: 363 PL--NMDFIPHIAESCKIVTSSYGVFKK 388


>gi|8978258|dbj|BAA98149.1| unnamed protein product [Arabidopsis thaliana]
          Length = 407

 Score =  588 bits (1517), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 289/370 (78%), Positives = 326/370 (88%), Gaps = 3/370 (0%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           +E+ SF+RSYVVP+GFP SV  SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG SRN +
Sbjct: 12  QEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSS 71

Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
              AVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH
Sbjct: 72  ASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 131

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
           LFLPLACAANV KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL+GTG SI
Sbjct: 132 LFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI 191

Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
           +ISKRNPSL+TTF LLSCGY+ SSYQEV+SVVLHTLNRARF+VAVESFLKTG VPSL+EG
Sbjct: 192 LISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQEG 251

Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
           N+QE IF+FPW+ D+PV+LG+RFKDAFQDP  Y+AV+PFF+KERY+VTY+P+KGKVYALL
Sbjct: 252 NIQEKIFTFPWVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALL 311

Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESCK 484
           K QA SDDILKAAFHA+VLLH +N S      +++Q +        T  +LE+ I ESC+
Sbjct: 312 KHQANSDDILKAAFHAHVLLHFMNQSKDGNPRSVEQLD---PAFAPTEYELESRIAESCE 368

Query: 485 VVSTALWGFQ 494
           +VST+   F+
Sbjct: 369 MVSTSYGVFK 378


>gi|226505832|ref|NP_001140509.1| uncharacterized protein LOC100272571 [Zea mays]
 gi|194699768|gb|ACF83968.1| unknown [Zea mays]
          Length = 491

 Score =  585 bits (1508), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 302/503 (60%), Positives = 368/503 (73%), Gaps = 53/503 (10%)

Query: 5   PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
           P ++LK+S+  T    +    DARL VR+ MR +     + +P        AA L     
Sbjct: 3   PAVRLKRSSTQTI---TLPPPDARLAVRDVMRSTIPSRPAEAPASTERPAPAAALQG--- 56

Query: 65  DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRN---------SIRAI 115
                               +C E++DGRRW+YV +   S K + +         S+RA+
Sbjct: 57  -------------------FLCLEEVDGRRWSYVVDGG-SAKGRGSGGVAVPAGASVRAV 96

Query: 116 SLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLN 175
            LQSP  PAEE+MSFIRSYVVP+GFPDSVTPSYVPYM WRALKHFFGGAMGVFTT+TLL+
Sbjct: 97  PLQSPLPPAEEIMSFIRSYVVPEGFPDSVTPSYVPYMNWRALKHFFGGAMGVFTTRTLLS 156

Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGV 235
           SVGVS++             KDGAGRVGKM+FARQGKKFD DLKQLRF+ DLL+E+GAG+
Sbjct: 157 SVGVSQS-------------KDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGI 203

Query: 236 ELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 295
           EL TAA P  FLP+AC ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIA
Sbjct: 204 ELTTAAFPQFFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIA 263

Query: 296 DLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
           DLLGTGLSI ISKRNPSL+T+FA LSCGY+ SSYQEV+SVVL+TLNRARF+VAV+SF+KT
Sbjct: 264 DLLGTGLSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKT 323

Query: 356 GHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNP 415
           GH+PSL+EGN QE IF+ PW R + V +GSRF +AFQ+P ++LA+ P FE ERY+VTYNP
Sbjct: 324 GHIPSLKEGNSQETIFNPPW-RHETVAIGSRFGEAFQEPASFLAIRPLFEDERYMVTYNP 382

Query: 416 SKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYD---NSNLMLTT 472
           +K KVYALLKDQAKSDDI+KAAFHA+VLLH IN+S+  R L LKQ + +   + +L    
Sbjct: 383 TKDKVYALLKDQAKSDDIIKAAFHAHVLLHFINASHARR-LKLKQKQANPDQSEHLYSRN 441

Query: 473 ADLEAHIGESCKVVSTALWGFQE 495
            D  AHI +SCK+VS++   F++
Sbjct: 442 VDFLAHIAKSCKIVSSSYGTFKK 464


>gi|218196816|gb|EEC79243.1| hypothetical protein OsI_19997 [Oryza sativa Indica Group]
          Length = 401

 Score =  583 bits (1503), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 279/374 (74%), Positives = 325/374 (86%), Gaps = 6/374 (1%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           +E+MSFIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLLNSVGV+++RA
Sbjct: 4   QEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRA 63

Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
           T GAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA P 
Sbjct: 64  TSGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 123

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
           LFLP+AC ANV KNVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTGLSI
Sbjct: 124 LFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGLSI 183

Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
           +ISKRNPSL+T+FA LSCGY+ SSY EV+SVVL+TLN ARF+VAV+SF+K+GHVPSL+EG
Sbjct: 184 LISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPSLKEG 243

Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
           N QE IF+ PW R +PV +GSRF +AFQ+P +++A+ P FE ERY+VTYNP+K KVYALL
Sbjct: 244 NSQETIFNPPW-RHQPVAIGSRFGEAFQEPASFVAIRPLFEDERYIVTYNPTKDKVYALL 302

Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGH---RSLALKQHEYDNSNLMLTTADLEAHIGE 481
           KDQAK DDILKAAFHA+VLLH IN+S+ +   R        Y N+N +    D   HI E
Sbjct: 303 KDQAKPDDILKAAFHAHVLLHFINASHANLNARKRMNSNRSYQNANPL--NMDFIPHIAE 360

Query: 482 SCKVVSTALWGFQE 495
           SCK+V+++   F++
Sbjct: 361 SCKIVTSSYGVFKK 374


>gi|357133639|ref|XP_003568431.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Brachypodium
           distachyon]
          Length = 508

 Score =  583 bits (1503), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 302/493 (61%), Positives = 358/493 (72%), Gaps = 63/493 (12%)

Query: 26  DARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSESDETTAFRLVEDQFVDSSLRLI 85
           DAR+ VRE +R++   A       V    A A  +                       ++
Sbjct: 27  DARVAVREAVRVAVREAEQPPTARVSAVQAPAAAVDG---------------------VL 65

Query: 86  CSEDIDGRRWNYVAERDRS--GKFKRNS--------------IRAISLQSPQAPAEELMS 129
           C E++DGRRW+YV E   S  GK  R S               RA+ LQSP  P EE+MS
Sbjct: 66  CLEEVDGRRWSYVVEGAASSAGKPGRVSARGRGGSAAPVGATFRAVPLQSPLPPVEEIMS 125

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           FIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAM VFTT+ LLNSVGVS++R      
Sbjct: 126 FIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSR------ 179

Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
                  DGAGRVGKM+FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA PHLFLPL
Sbjct: 180 -------DGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHLFLPL 232

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR 309
           AC ANV KNVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTG+SI+ISKR
Sbjct: 233 ACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISKR 292

Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
           NPSL+ +FA LSCGY+ SSY+EV+SVVL+TLN ARF+VAV+SF+KTGHVPSL+EGNL+E 
Sbjct: 293 NPSLVASFAFLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEGNLEET 352

Query: 370 IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAK 429
           IF+ PW R +PV +GSRF +AFQ+P ++++  P FE ERY+VTYNP+K KVYALLKDQAK
Sbjct: 353 IFNPPW-RHQPVAIGSRFGEAFQEPASFVSTSPLFEDERYIVTYNPTKDKVYALLKDQAK 411

Query: 430 SDDILKAAFHAYVLLHCINSSNGHRSLALKQH--------EYDNSNLMLTTADLEAHIGE 481
            DDILKAAFHA+VLLH IN+S  H +L  ++H        +Y N N +    D   HI E
Sbjct: 412 PDDILKAAFHAHVLLHFINAS--HANLNARKHMNSNRGSYQYVNPNPL--NMDFLPHIEE 467

Query: 482 SCKVVSTALWGFQ 494
           SCK+V+++   F+
Sbjct: 468 SCKIVTSSYGIFK 480


>gi|168024902|ref|XP_001764974.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683783|gb|EDQ70190.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 215/359 (59%), Positives = 284/359 (79%), Gaps = 4/359 (1%)

Query: 94  RWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMT 153
           +W YV+   RSG+   N+++ +  Q  + P  EL+SF+++Y+VP+G+PDSV PSY PYM 
Sbjct: 2   QWKYVSNLGRSGE---NTMQLMYAQQHRDPLAELVSFVKAYIVPEGYPDSVAPSYTPYMQ 58

Query: 154 WRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKK 213
           WRA+++FFGGAM VFTT++L++++GVSR  +   AVA+NW++KDGAGR+GKM+FAR GKK
Sbjct: 59  WRAVQYFFGGAMSVFTTRSLMHALGVSRGTSASSAVAVNWVIKDGAGRIGKMLFARHGKK 118

Query: 214 FDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYK 273
           FD DLKQLRF G +LM+LGAGVEL T AVPHLFLPLACAANVAKNVAAVTS+STR PIYK
Sbjct: 119 FDVDLKQLRFKGAMLMQLGAGVELTTMAVPHLFLPLACAANVAKNVAAVTSSSTRAPIYK 178

Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVK 333
           AFA+ ENIGDVTAKGEC+ NIADLLGTGL I ISKR+PSL+ TF +LSCGY+ SS+ E+K
Sbjct: 179 AFARRENIGDVTAKGECISNIADLLGTGLGIFISKRSPSLVATFCVLSCGYMISSFNEIK 238

Query: 334 SVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQD 393
           SV L TLNRARF V V+SFL+TG VPS+ + N +E++ + PW ++KP+ LGSR   AF +
Sbjct: 239 SVRLATLNRARFGVVVQSFLETGQVPSIEDANRRESVITLPW-QEKPLELGSRVGQAFSN 297

Query: 394 PDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNG 452
           P  ++  +  F+KE Y+VTY P+K + Y ++K+ A SDD+++A F A+V+LH +    G
Sbjct: 298 PREFIRTQEQFQKENYLVTYRPNKKRAYVVIKESASSDDVVRATFQAHVMLHMLRGLEG 356


>gi|168019221|ref|XP_001762143.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686547|gb|EDQ72935.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 390

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 216/384 (56%), Positives = 287/384 (74%), Gaps = 17/384 (4%)

Query: 112 IRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQ 171
           ++ +  Q  + P  EL+SF++SY+VP+ +PDSV PSY PYM WRA+++ FGGAM VFTT+
Sbjct: 1   MQVMHAQQHRDPVAELVSFVKSYIVPENYPDSVAPSYTPYMQWRAVQYLFGGAMSVFTTR 60

Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMEL 231
           +L++++GVSR  +   AVA+NW++KDGAGR+GKMIFAR GKKFD DLKQLRF G +LM+L
Sbjct: 61  SLMHALGVSRGASASSAVAVNWVIKDGAGRIGKMIFARHGKKFDVDLKQLRFKGAMLMQL 120

Query: 232 GAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECV 291
           GAGVELAT AVPHLFLPLACAANVAKNVAAVTS+STR PIYKAFA+ ENIGDVTAKGEC+
Sbjct: 121 GAGVELATMAVPHLFLPLACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECI 180

Query: 292 GNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
            NIADL+GTGL I ISK++PSL+ TF +LSCGY+ SS+ E+KSV L TLNRARF V V+S
Sbjct: 181 SNIADLMGTGLGIFISKKSPSLVATFCVLSCGYMISSFNEIKSVRLATLNRARFGVVVQS 240

Query: 352 FLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
           FL+TG VPS+ + N +E++ + PW ++KP+ LGSR   A+ +P  +L     F+KE+Y+V
Sbjct: 241 FLETGKVPSVEDANRRESVITLPW-QEKPLELGSRVAQAYSNPKEFLRAREQFQKEKYLV 299

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLT 471
           TY+  K + Y ++K+ A SDD+++A F A+V+LH +    G RS              L 
Sbjct: 300 TYSLKKKRAYVVIKESASSDDVVRATFQAHVMLHIL---RGLRS-------------TLI 343

Query: 472 TADLEAHIGESCKVVSTALWGFQE 495
           T+++ A + ES K+       F+E
Sbjct: 344 TSNVGAIVEESTKLTHDLYDSFKE 367


>gi|302764074|ref|XP_002965458.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
 gi|300166272|gb|EFJ32878.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
          Length = 430

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 210/403 (52%), Positives = 282/403 (69%), Gaps = 20/403 (4%)

Query: 49  GVGGGGAAATLLQSESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFK 108
           G+GGG       Q  SD  TA R+ + +   S+L   C E I   +W YVA        K
Sbjct: 3   GLGGG-------QCASDGGTAARIADARATRSTL---CLERIGDVQWKYVA--------K 44

Query: 109 RNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVF 168
              +++  L   Q+   E+M F+RS+VVP+GFPDSV P Y PYM WR L++FFGGAMGVF
Sbjct: 45  GQVLQSEKLFPHQSALAEIMDFVRSHVVPEGFPDSVLPCYTPYMQWRCLQYFFGGAMGVF 104

Query: 169 TTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLL 228
           TT++LL+++GV RN A+  A+A+NW++KDGAGR+GKMIFAR GKKFD  LKQ+RFAGD+L
Sbjct: 105 TTRSLLHALGV-RNGASSSAIAVNWVVKDGAGRIGKMIFARHGKKFDCHLKQIRFAGDVL 163

Query: 229 MELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG 288
           M+L A +ELAT+A P  FLPLAC AN+ KNVAAV STSTR PIYKAFA+ ENIGD+TAKG
Sbjct: 164 MQLAAALELATSATPQFFLPLACIANIGKNVAAVASTSTRAPIYKAFARRENIGDITAKG 223

Query: 289 ECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
           EC+GNIADLLGTG+ I++SK  P +   F+ L+ GYV+SS++EVK++ L TLNR RF +A
Sbjct: 224 ECIGNIADLLGTGMGILMSKNFP-VFAPFSALAFGYVYSSFREVKAIQLPTLNRHRFGIA 282

Query: 349 VESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKER 408
           V++FL+TG VP L E N +E I   P    + + LG+R  DAF  P  Y  ++  F+ E 
Sbjct: 283 VDTFLETGKVPGLVEANERERIIVGPQWSLEKLELGARVSDAFSAPQDYQYIDSLFKGEN 342

Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSN 451
           Y+V+YNP + + Y +LK++A  DD+L+AAFH  + ++  +  +
Sbjct: 343 YLVSYNPKRQRTYVVLKERANGDDVLRAAFHGQLFVNIFDKCS 385


>gi|302835848|ref|XP_002949485.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
           nagariensis]
 gi|300265312|gb|EFJ49504.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
           nagariensis]
          Length = 495

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 159/334 (47%), Positives = 219/334 (65%), Gaps = 6/334 (1%)

Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
           Q   +   + I +Y++P+GFPDSV P Y PYM WR +++FFGGA+ VFTTQ+LL ++GV+
Sbjct: 99  QTHEDHFRALIHAYLLPNGFPDSVGPQYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVA 158

Query: 181 RNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATA 240
                  A AINW++KDGAGR+G+++FAR G++ D +LKQ R AGDLLME GA +ELAT 
Sbjct: 159 GRFQGEAAAAINWVIKDGAGRLGRLLFARWGRELDCELKQFRLAGDLLMEAGAALELATV 218

Query: 241 AVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
             P  FLPLAC AN++KN+AAV ++STR PIY+ FA   N+ D+TAKGE V N+AD+LGT
Sbjct: 219 YAPPAFLPLACTANLSKNLAAVAASSTRAPIYRTFALQNNLADITAKGESVANLADILGT 278

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
              I +S+     + TF +LS GY+ SS +EV SV L  +NRAR + A + +L  G +P 
Sbjct: 279 VAGIALSRMKLPRMPTFCVLSAGYLLSSRKEVDSVELPYMNRARLAYATQRYLSDGFIPG 338

Query: 361 LREGNLQENIFSFPWLRDKP--VVLGSRFKDAFQDPDAYL-AVEPFFEKE-RYVVTYNPS 416
           + E N  E +   PW R     +VLG+  + A   P     AV+ F E E RY+VTY P 
Sbjct: 339 VAEANHNEPL--LPWGRHNQNRLVLGASVESACAGPSELAHAVQRFPEPEYRYLVTYRPD 396

Query: 417 KGKVYALLKDQAKSDDILKAAFHAYVLLHCINSS 450
             K Y LL++ A S D L+A+F  +V LH ++ +
Sbjct: 397 TKKAYVLLREGATSMDCLQASFFGHVFLHLLDGA 430


>gi|307105811|gb|EFN54059.1| hypothetical protein CHLNCDRAFT_8919, partial [Chlorella
           variabilis]
          Length = 325

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 152/324 (46%), Positives = 210/324 (64%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           + L   +  Y++P  FP SV P Y  YM  R++++FFGGAM VFTT++LL S+GV+   +
Sbjct: 1   QRLGDLLADYLLPQDFPHSVAPQYSDYMRARSVQYFFGGAMSVFTTRSLLASLGVANKHS 60

Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
           +  A AINW++KDGAGR+G+ +FAR G++ D +LKQ R  GD+LME GA +EL+T  +P 
Sbjct: 61  SEAAAAINWVVKDGAGRLGRFLFARWGRELDCELKQFRLMGDVLMETGAALELSTVLMPR 120

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
           +FLPLAC AN+AKN+AAVT++STR PIY+ FAK  N+ DVTAKGE V N+AD++GT   I
Sbjct: 121 MFLPLACTANLAKNLAAVTASSTRAPIYRTFAKQNNLADVTAKGESVANLADVVGTAFGI 180

Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
            ++K N  ++ TFA LS GY+ +S +EV SVVL  LNRAR S    +F  TG VP   EG
Sbjct: 181 ALAKANLPVLPTFAALSVGYLIASRREVDSVVLPYLNRARLSYTTRAFYSTGRVPETLEG 240

Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
           N +E +  +    +  VVLG+  ++A   P         F   +Y +TY P   K YALL
Sbjct: 241 NYREPLMPWSDPHNGRVVLGATVEEACAGPQQLHDALAAFSGRQYALTYRPDTRKCYALL 300

Query: 425 KDQAKSDDILKAAFHAYVLLHCIN 448
           K  A    + +AA  A+ LL  ++
Sbjct: 301 KQGASPRSVQQAAMDAHALLWMLD 324


>gi|413949258|gb|AFW81907.1| hypothetical protein ZEAMMB73_025152 [Zea mays]
          Length = 253

 Score =  277 bits (708), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 131/178 (73%), Positives = 149/178 (83%), Gaps = 6/178 (3%)

Query: 84  LICSEDIDGRRWNYVAERDRSGKFKRN------SIRAISLQSPQAPAEELMSFIRSYVVP 137
           ++C E++DGRRW+YV +   +    +       S + + LQSP  P EE+MSFIRSYVVP
Sbjct: 69  VLCLEEVDGRRWSYVVDAAGAAVKAKGRASVGASFKVVPLQSPLPPVEEIMSFIRSYVVP 128

Query: 138 DGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
           +GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLLNSVGVS++RA  GAVAINWILKD
Sbjct: 129 EGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRAASGAVAINWILKD 188

Query: 198 GAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANV 255
           GAGRVGKM FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA P  FLP+AC ANV
Sbjct: 189 GAGRVGKMFFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQFFLPMACIANV 246


>gi|388520739|gb|AFK48431.1| unknown [Medicago truncatula]
          Length = 236

 Score =  275 bits (704), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 142/207 (68%), Positives = 160/207 (77%), Gaps = 25/207 (12%)

Query: 29  LLVRETMRISANLATSSSPIGVGGGGAAATLLQSESDETTAFRLVEDQFVDSSLRLICSE 88
           +L+RET+RISA LA+SS+P           LL S++            F+ SS R IC E
Sbjct: 33  ILIRETLRISAELASSSTP-----------LLPSDT-----------TFIGSS-RFICCE 69

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
            IDGRRWNYVA+ D SGKF  NS R++SLQ+P+ P +E +SF+RSYVVP+GFPDSVTPSY
Sbjct: 70  QIDGRRWNYVADTDASGKFNNNSFRSLSLQTPKPPLDEFISFVRSYVVPEGFPDSVTPSY 129

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR--ATPGAVAINWILKDGAGRVGKMI 206
           VPYMTWRALKHFFGGAMGVFTTQTLL SVGVSRNR      AVAINWIL DGAGRVGKM+
Sbjct: 130 VPYMTWRALKHFFGGAMGVFTTQTLLTSVGVSRNRATPAAAAVAINWILNDGAGRVGKML 189

Query: 207 FARQGKKFDYDLKQLRFAGDLLMELGA 233
           FARQGKKFDYDLKQLRFAGDLLMELG 
Sbjct: 190 FARQGKKFDYDLKQLRFAGDLLMELGC 216


>gi|449016609|dbj|BAM80011.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 496

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 133/347 (38%), Positives = 199/347 (57%), Gaps = 12/347 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +PDG+PDSV P Y  Y  WR    FFGG +GVF+TQ+LL +VGV R  A P A A+ W++
Sbjct: 125 LPDGYPDSVGPDYAEYTKWRGRAFFFGGMVGVFSTQSLLLAVGVGRPSAAPIAAALQWVI 184

Query: 196 KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANV 255
           +DG GR G+M+F++ G  FD + KQ R A   ++ +   +E  T   PHLFLPLA  AN+
Sbjct: 185 RDGLGRAGRMLFSQVGTGFDAETKQYRLAAAFVLNVSCALEALTPLFPHLFLPLASLANM 244

Query: 256 AKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS--- 312
           AK  + V + STR  IY++F + EN+GD+TAK E VG   DLLGT + +++S+ +     
Sbjct: 245 AKGASTVAAASTRGAIYRSFMRRENLGDITAKQETVGVAGDLLGTAVGVILSRASSHSHR 304

Query: 313 -LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK-TGHVPSLREGNLQENI 370
             +  F   S G++ S+Y EVKSV L TLNR R  + + ++L+    V  +R+ N +E +
Sbjct: 305 LSVIAFIGASLGHLISAYYEVKSVELRTLNRQRAHMLIWNYLEDQDRVLPIRDVNRRERL 364

Query: 371 FSFPW---LRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQ 427
              PW   L    V +G+R  DA  D DA   +   +  E+Y++ Y  + G+V  +L++ 
Sbjct: 365 VYRPWLDSLHAPNVEIGARLSDAAPDADALKFLLRLYRDEKYMMNY--AGGRVRIVLRND 422

Query: 428 AKSDDILKAAFH--AYVLLHCINSSNGHRSLALKQHEYDNSNLMLTT 472
           A   D LKA F   A+  L+  +      + AL +  Y  +N +  +
Sbjct: 423 ATHQDQLKAYFQSRAFWRLYMTSGERFSPNRALIEDSYRATNQLFPS 469


>gi|452824810|gb|EME31810.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
          Length = 636

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 117/345 (33%), Positives = 190/345 (55%), Gaps = 10/345 (2%)

Query: 106 KFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAM 165
           +F  + I+    +  +   E+L+ ++    +PDG+P+SVT  Y+ Y  WRA+++     M
Sbjct: 250 QFSDDMIQEGRFKDDRPIFEKLIDWLSRVYLPDGYPNSVTKDYLSYSIWRAIQNLAASVM 309

Query: 166 GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFA 224
           GVF T+ LL  +G+ R      + AI+W+LKDG G +GK+ +    G +FD D K  R A
Sbjct: 310 GVFATEALLFGLGLGRKSTPATSAAISWVLKDGLGYIGKVFYGSIAGNQFDVDPKSWRLA 369

Query: 225 GDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV 284
            D + + G  +E+ T   P  FL LA  AN  K VAA+T T+TR  IYK+ A  EN GD+
Sbjct: 370 ADAVEDAGGVLEIITPLFPGHFLLLASVANAMKGVAAMTGTATRHAIYKSLALHENQGDI 429

Query: 285 TAKGECVGNIADLLGTGLSIMISKRNP----SLITTFALLSCGYVFSSYQEVKSVVLHTL 340
             KGE  G    ++G GL I+ S +      +L++ +AL    ++F++++ +  V   TL
Sbjct: 430 ATKGESQGVTCKMVGLGLGIVASSKMGQNYFALLSAYALGCFVHLFANWKSLSCVQFATL 489

Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFPW--LRDKPVVLGSRFKDAFQDPDAYL 398
           NR R S+A++ FL+TG+VPS  E + +E +   PW    D+ +V+G+  K+ FQ+     
Sbjct: 490 NRQRCSLALQEFLQTGNVPSPYEVSRREKVVVPPWKGYLDR-IVVGANLKECFQNGRELR 548

Query: 399 AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                +   RY++T   +  K+  +L +  +  DI +A   A+ L
Sbjct: 549 EAVALYRDCRYILTQRDN--KILIVLHESCRPQDIFQAFMQAHKL 591


>gi|452823212|gb|EME30224.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
          Length = 849

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 113/303 (37%), Positives = 176/303 (58%), Gaps = 10/303 (3%)

Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
           FP+ V+  Y  +  WR  +      +GVF TQ LL ++G+   R    A A +W+LKDG 
Sbjct: 423 FPEGVSQDYYSFTMWRVFQRLVSSTVGVFGTQALLLALGIKAGRIGRAA-ATSWVLKDGL 481

Query: 200 GRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
           GR+GKM++A Q G+ FD D K+ RF   LL  LG G+E+ T   P  FL  A  AN  K 
Sbjct: 482 GRLGKMVWASQMGRDFDADPKRWRFRSALLYALGNGLEIVTQIFPASFLLFATLANSMKQ 541

Query: 259 VAAVTSTSTRTPIYKAFA-KGENIGDVTAKGECVGNIADLLGTGLSIMISK----RNPSL 313
           ++ +T+++TR  +YK FA + ENI D+TAKGE    +ADLLG  L I +SK      P++
Sbjct: 542 ISMLTASATRNAMYKNFAGRSENIADITAKGEAQIVVADLLGMALGIQLSKLIGTSRPNV 601

Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF 373
           +T ++ LS   +++ Y+E+++V   TLN  R S+ V+ F++ G+VP   E + +ENIF  
Sbjct: 602 LTAYSFLSVLDIYAFYKELRAVQFRTLNYERSSMIVDYFVRRGYVPRPDEVSQRENIFLG 661

Query: 374 PWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDI 433
           P  R     + +   DA   P    ++   F+ E+++VT   + G+   +L++ A++ DI
Sbjct: 662 P--RYDVRSMFASLSDAVSCPQELNSLIKIFKGEQFMVTRGWN-GQYRIVLREHARNGDI 718

Query: 434 LKA 436
           L+A
Sbjct: 719 LRA 721


>gi|225462223|ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 111/342 (32%), Positives = 172/342 (50%), Gaps = 32/342 (9%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           F+R  ++P+GFP SVT  Y+ Y  WR ++       GV  TQ LL +VG+ +  A P A 
Sbjct: 285 FLR-LMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKG-AIPTAA 342

Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           A+NW+LKDG G + K++ ++ G+ FD   K  R   DLL     G+E+ T A PH FL +
Sbjct: 343 AVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLI 402

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVG----NIADLLGTGLSIM 305
              A   ++ AA+   STR+  Y  FA   N  +V AKGE  G    +I  +LG  L+  
Sbjct: 403 GAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANC 462

Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
           I    P    +F +++  ++F + +  +S+ L TLN  R S+    +L +G VPS++E N
Sbjct: 463 IGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVN 522

Query: 366 LQENIFS-FPWLRDKP-----------------------VVLGSRFKDAFQDPDAYLAVE 401
            +E +F   P L  KP                       + LGS+  +     +  LA+ 
Sbjct: 523 EEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALF 582

Query: 402 PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
             +  E Y++T +  KG+ + +LK+     D+LK+ FH   L
Sbjct: 583 DLYRNEAYILTEH--KGRFFVILKESCSPQDMLKSVFHVNYL 622


>gi|296082804|emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 111/342 (32%), Positives = 172/342 (50%), Gaps = 32/342 (9%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           F+R  ++P+GFP SVT  Y+ Y  WR ++       GV  TQ LL +VG+ +  A P A 
Sbjct: 83  FLR-LMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKG-AIPTAA 140

Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           A+NW+LKDG G + K++ ++ G+ FD   K  R   DLL     G+E+ T A PH FL +
Sbjct: 141 AVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLI 200

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVG----NIADLLGTGLSIM 305
              A   ++ AA+   STR+  Y  FA   N  +V AKGE  G    +I  +LG  L+  
Sbjct: 201 GAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANC 260

Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
           I    P    +F +++  ++F + +  +S+ L TLN  R S+    +L +G VPS++E N
Sbjct: 261 IGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVN 320

Query: 366 LQENIFS-FPWLRDKP-----------------------VVLGSRFKDAFQDPDAYLAVE 401
            +E +F   P L  KP                       + LGS+  +     +  LA+ 
Sbjct: 321 EEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALF 380

Query: 402 PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
             +  E Y++T +  KG+ + +LK+     D+LK+ FH   L
Sbjct: 381 DLYRNEAYILTEH--KGRFFVILKESCSPQDMLKSVFHVNYL 420


>gi|302816607|ref|XP_002989982.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
 gi|300142293|gb|EFJ08995.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
          Length = 440

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 121/395 (30%), Positives = 193/395 (48%), Gaps = 37/395 (9%)

Query: 100 ERDRSGKFKRNSIRAISL---QSPQAPAEELMSF----IRSYVVPDGFPDSVTPSYVPYM 152
           E++  GK   ++ +A ++   +S +   +E++S+    +++ ++P G+P +V+  Y  Y 
Sbjct: 11  EKNSKGKSSPSTDKAGAVSTGRSRKLGVDEIVSWCLDALKTTMLPIGYPKTVSDDYTEYT 70

Query: 153 TWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGK 212
            WR  +       GV TTQ LL +VG+ +  A P A AINW+LKDG G + K++ ++ G+
Sbjct: 71  LWRMGQIIASQISGVLTTQALLYAVGLGKG-AIPTAAAINWVLKDGIGYLSKIVLSKYGR 129

Query: 213 KFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIY 272
            FD   K  R   DL+ +   G+EL T   PHLF+PL   A   K+ A +   +TR+  Y
Sbjct: 130 HFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFVPLTAVAGAGKSAAGLIQAATRSCFY 189

Query: 273 KAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR----NPSLITTFALLSCGYVFSS 328
             FA   N  +V AKGE  G ++  LG  L I IS +     P L+ +F  ++  +V  +
Sbjct: 190 AGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGISSKVGSSGPLLVLSFGTVTAVHVLCN 249

Query: 329 YQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS---FPWLRDKPVV--- 382
            +  ++V LHTLN  R  + +  +L T  +PS+R+ N  E IF+   F   +  PV    
Sbjct: 250 LKSYQAVQLHTLNPYRTGLIMAEYLTTQRIPSVRDVNSVEPIFTVADFFSSKKAPVSTHL 309

Query: 383 ------------------LGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
                             LG  F +A     A  A+   +  E Y++TY+ S      +L
Sbjct: 310 SISCESKCAAADIEKNLKLGVSFGEAVASRGAAQALLDLYSGEEYLLTYDDSHC-FKVVL 368

Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALK 459
           K  A   D+L+A   A  L H  N  +  +   LK
Sbjct: 369 KKSASPHDMLRAVVQASYLRHIHNQQSLSKDELLK 403


>gi|302771033|ref|XP_002968935.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
 gi|300163440|gb|EFJ30051.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
          Length = 389

 Score =  177 bits (450), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 112/353 (31%), Positives = 172/353 (48%), Gaps = 26/353 (7%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           +++ ++P G+P +V+  Y  Y  WR  +       GV TTQ LL +VG+ +  A P A A
Sbjct: 8   LKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISGVLTTQALLYAVGLGKG-AIPTAAA 66

Query: 191 INWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA 250
           INW+LKDG G + K++ ++ G+ FD   K  R   DL+ +   G+EL T   PHLF+PL 
Sbjct: 67  INWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFVPLT 126

Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR- 309
             A   K+ A +   +TR+  Y  FA   N  +V AKGE  G ++  LG  L I IS + 
Sbjct: 127 AVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGISSKV 186

Query: 310 ---NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
               P L+ +F  ++  +V  + +  ++V LHTLN  R  + +  +L T  +PS+R+ N 
Sbjct: 187 GSSGPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPYRTGLIMAEYLTTQRIPSVRDVNS 246

Query: 367 QENIFSFPWL--------------------RDKPVVLGSRFKDAFQDPDAYLAVEPFFEK 406
            E IF+                         +K + LG  F +A     A  A+   +  
Sbjct: 247 VEPIFTVADFFSSKKAPLSISCESKCAAADIEKNLKLGVSFGEAVASRGAAQALLDLYSG 306

Query: 407 ERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALK 459
           E Y++TY+ S      +LK  A   D+L+A   A  L H  N  +  +   LK
Sbjct: 307 EEYLLTYDDSHC-FKVVLKKSASPHDMLRAVVQASYLRHIHNQQSLSKDELLK 358


>gi|334184613|ref|NP_001189649.1| uncharacterized protein [Arabidopsis thaliana]
 gi|330253412|gb|AEC08506.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 432

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 117/356 (32%), Positives = 189/356 (53%), Gaps = 23/356 (6%)

Query: 123 PAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
           P + + SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+   
Sbjct: 59  PQKMVESFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL--- 114

Query: 183 RATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAA 241
           R TP  A  ++WILKDG   VGK+I +  G + D + K+ R   D+L +LG G+EL +  
Sbjct: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPL 174

Query: 242 VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTG 301
            PHLFL +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + ++ G G
Sbjct: 175 CPHLFLEMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIG 234

Query: 302 LSIM----ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
             I     I       +   ++LS  +V+S  ++++ V ++TLN  R ++ V +FLKTG 
Sbjct: 235 AGIQLASTICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGK 294

Query: 358 VPSLREGNLQENIFSFPWLRDKP------VVLGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
           VPS  +   QE++  FP   ++P      V +G     A + P     ++  F +E++++
Sbjct: 295 VPSPPDLRFQEDLM-FP---ERPIQDAGNVKVGRALHKAVK-PSEVQRLKQVFVEEKFLL 349

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
           ++  S   +  +L+  A  +D L+    A  +       N    + L Q  YD  N
Sbjct: 350 SHGKSWTDM--VLEHDATGEDALRGWLVAAYVKSMTKIYNDPDDIIL-QDAYDKMN 402


>gi|18402585|ref|NP_565718.1| uncharacterized protein [Arabidopsis thaliana]
 gi|20197897|gb|AAD20664.2| expressed protein [Arabidopsis thaliana]
 gi|21553562|gb|AAM62655.1| unknown [Arabidopsis thaliana]
 gi|109134157|gb|ABG25076.1| At2g31190 [Arabidopsis thaliana]
 gi|330253411|gb|AEC08505.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 433

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 117/356 (32%), Positives = 189/356 (53%), Gaps = 23/356 (6%)

Query: 123 PAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
           P + + SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+   
Sbjct: 60  PQKMVESFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL--- 115

Query: 183 RATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAA 241
           R TP  A  ++WILKDG   VGK+I +  G + D + K+ R   D+L +LG G+EL +  
Sbjct: 116 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPL 175

Query: 242 VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTG 301
            PHLFL +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + ++ G G
Sbjct: 176 CPHLFLEMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIG 235

Query: 302 LSIM----ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
             I     I       +   ++LS  +V+S  ++++ V ++TLN  R ++ V +FLKTG 
Sbjct: 236 AGIQLASTICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGK 295

Query: 358 VPSLREGNLQENIFSFPWLRDKP------VVLGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
           VPS  +   QE++  FP   ++P      V +G     A + P     ++  F +E++++
Sbjct: 296 VPSPPDLRFQEDLM-FP---ERPIQDAGNVKVGRALHKAVK-PSEVQRLKQVFVEEKFLL 350

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
           ++  S   +  +L+  A  +D L+    A  +       N    + L Q  YD  N
Sbjct: 351 SHGKSWTDM--VLEHDATGEDALRGWLVAAYVKSMTKIYNDPDDIIL-QDAYDKMN 403


>gi|449018940|dbj|BAM82342.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 738

 Score =  174 bits (442), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 93/237 (39%), Positives = 140/237 (59%), Gaps = 6/237 (2%)

Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
            P+ VT  Y  +  WR  +      +GVF TQ LL ++G+        A  I+W+LKDG 
Sbjct: 379 IPEDVTADYYEFAKWRVFQRLMSSTVGVFGTQALLLALGIKSGNKISQAATISWVLKDGL 438

Query: 200 GRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
            RVGKM++A Q G+  D D K+ RF   LL  LG G+E+AT  VP  FL +A  AN AK 
Sbjct: 439 SRVGKMLWASQLGRDMDADPKRFRFGSALLYSLGNGLEIATRIVPQSFLLVATLANTAKL 498

Query: 259 VAAVTSTSTRTPIYKAFA-KGENIGDVTAKGECVGNIADLLGTGLSIMISK----RNPSL 313
            + +T+++TR  +Y++FA + ENI D+TAKGE    IADL G  L I +SK      P +
Sbjct: 499 CSMLTASATRNAMYRSFADRNENIADITAKGEAQITIADLGGMALGIQLSKIIGTSRPKV 558

Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           + T+ +LS   +++ ++E++ +   TLN  R S+ +E +LKTG VP+ ++ +  E +
Sbjct: 559 VATYLILSTIDLYAIWKELRVIEFRTLNLQRSSLVIEHYLKTGCVPTPKQVSAAERV 615


>gi|38344695|emb|CAE02373.2| OSJNBb0096E05.17 [Oryza sativa Japonica Group]
          Length = 593

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 107/333 (32%), Positives = 180/333 (54%), Gaps = 27/333 (8%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++PDG+PDSV+  Y+ Y  WR ++       GV +TQ LL +VG+ +  A P A A+NW+
Sbjct: 196 LLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 254

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K++ ++ G+ FD + K  R   DLL     G+E+ T   PHLF+P+  AA 
Sbjct: 255 LKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPHLFVPIGAAAG 314

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI 314
             ++ AA+   +TR+  Y  FA   N  +V AKGE  G ++  LG  L I ++ R  S +
Sbjct: 315 AGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGIMLGIALANRIGSSV 374

Query: 315 T----TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           +    +FA ++  +++ + +  +S+ L TLN  R S+    +L +G VPS++E N +E +
Sbjct: 375 SLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSVKEVNDEEPL 434

Query: 371 F--------------SFPWLRD------KPVVLGSRFKDAFQDPDAYLAVEPFFEKERYV 410
           F                P  +D      + + LGS+  +  ++ +   A+   ++ E+Y+
Sbjct: 435 FLNLSIGTSRKESKILSPQAKDAAEIICRRLQLGSKLSEIIENKEDACALFDLYKNEQYL 494

Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           +T    KGK   +LK+ +  +D+LK+ FH   L
Sbjct: 495 LTEY--KGKFCVILKEGSSPEDMLKSLFHVNYL 525


>gi|297744143|emb|CBI37113.3| unnamed protein product [Vitis vinifera]
          Length = 424

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 188/352 (53%), Gaps = 17/352 (4%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+   R TP  
Sbjct: 57  SFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL---RPTPAQ 112

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++W+LKDG   VGK+I +  G + D + K  R   D L +LG G+E+ +   PHLFL
Sbjct: 113 ATAVSWVLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFL 172

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I ++
Sbjct: 173 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLA 232

Query: 308 KRNPS----LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
               S     +    LLS  +V+S  +E+++  ++TLN  R ++ V  F+KTG + S  +
Sbjct: 233 STICSSMQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPAD 292

Query: 364 GNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
              QE+ IF+   + D   V   R       P     ++  F +E++++ +    GK + 
Sbjct: 293 LRYQEDLIFTGRLIEDAGNVKVGRDLHKVVKPSKLCKLKGVFPEEKFLLCH----GKKWT 348

Query: 423 --LLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTT 472
             +L+  A  +D L+    A  + + +  ++   S+ + Q  YD  N + +T
Sbjct: 349 DMVLEHDATGEDALRGWLVAAYVAN-LEKTDHEPSMTVLQEAYDKMNSVFST 399


>gi|225438149|ref|XP_002273202.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
          Length = 420

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 112/350 (32%), Positives = 186/350 (53%), Gaps = 13/350 (3%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+   R TP  
Sbjct: 53  SFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL---RPTPAQ 108

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++W+LKDG   VGK+I +  G + D + K  R   D L +LG G+E+ +   PHLFL
Sbjct: 109 ATAVSWVLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFL 168

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I ++
Sbjct: 169 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLA 228

Query: 308 KRNPS----LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
               S     +    LLS  +V+S  +E+++  ++TLN  R ++ V  F+KTG + S  +
Sbjct: 229 STICSSMQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPAD 288

Query: 364 GNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
              QE+ IF+   + D   V   R       P     ++  F +E++++ +   K     
Sbjct: 289 LRYQEDLIFTGRLIEDAGNVKVGRDLHKVVKPSKLCKLKGVFPEEKFLLCHG--KKWTDM 346

Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTT 472
           +L+  A  +D L+    A  + + +  ++   S+ + Q  YD  N + +T
Sbjct: 347 VLEHDATGEDALRGWLVAAYVAN-LEKTDHEPSMTVLQEAYDKMNSVFST 395


>gi|255557849|ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
 gi|223541000|gb|EEF42558.1| conserved hypothetical protein [Ricinus communis]
          Length = 541

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 109/341 (31%), Positives = 177/341 (51%), Gaps = 31/341 (9%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           F+R  ++P+G+P SVT  Y+ Y  WR ++       GV  TQ LL ++G+ +  A P A 
Sbjct: 132 FVR-LMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 189

Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           AINW+LKDG G + K++ ++ G+ FD + K  R   DLL     G+E+ T A PHLF+ +
Sbjct: 190 AINWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLEILTPAFPHLFVFI 249

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD----LLGTGLSIM 305
             AA   ++ AA+   +TR+  Y  FA   N  +V AKGE  G ++     +LG GL+  
Sbjct: 250 GAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIGLANC 309

Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
           I    P  + +F++++  ++F + +  +S+ L TLN  R S+    +L +G  P +++ N
Sbjct: 310 IGSSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPIKDVN 369

Query: 366 LQENIFS--FPWLR--DKPVV-------------------LGSRFKDAFQDPDAYLAVEP 402
            +E +F   FP  +  DKP +                   LGS+  D     +  LA+  
Sbjct: 370 DEEPLFPAVFPHFKSADKPSLVVLSLEARDAATEIERRLQLGSKLSDVVNSKEDVLALFN 429

Query: 403 FFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
            ++ E Y++T    KG+   +LK+   + D+LKA F    L
Sbjct: 430 LYKDEGYILT--EYKGRFCVVLKESCSAQDMLKALFQVNYL 468


>gi|449455818|ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 611

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 113/346 (32%), Positives = 177/346 (51%), Gaps = 32/346 (9%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++P+GFPDSVT  Y+ Y  WR ++       GV  TQ LL +VG+ +  A P A A+NW+
Sbjct: 211 MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKG-AIPTAAAVNWV 269

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K+  ++ G+ FD   K  R   DLL     G+E+ T A P  F+ +  AA 
Sbjct: 270 LKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAG 329

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS-- 312
             ++ AA+   +TR+  Y  FA   N  +V AKGE  G ++  +G  L I ++ R  S  
Sbjct: 330 AGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSST 389

Query: 313 --LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
              +  F++++  ++F + +  KS+ L TLN  R S+    +L +G VPS+++ N +E +
Sbjct: 390 SLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPL 449

Query: 371 F-SFP--------WLRD---------------KPVVLGSRFKDAFQDPDAYLAVEPFFEK 406
           F + P        W RD               K + LGS+  D     +  L +   F K
Sbjct: 450 FPAVPLLNRKAPDWSRDFLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 509

Query: 407 ERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNG 452
           E Y+++ +  +GK   +LK+ A   D+LKA FH    LH +  + G
Sbjct: 510 ENYILSEH--RGKYCVMLKESASPVDMLKAVFHVNY-LHWLERNAG 552


>gi|168005499|ref|XP_001755448.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693576|gb|EDQ79928.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 503

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 113/419 (26%), Positives = 202/419 (48%), Gaps = 38/419 (9%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFI---RSYVVPDGFPDSVT 145
           ++ G RW Y+ +     +F   +++    ++     + +  F+   +  V+PDG+P SVT
Sbjct: 48  EVKGGRWKYLVKHPERDEFVVQTVKNSEEEALAYEEDVITRFVELGKQIVLPDGYPQSVT 107

Query: 146 PSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKM 205
             Y  Y  WR  +       GV TTQ LL +VG+ +  A P A A+NW+L+DG G + K+
Sbjct: 108 DDYFEYTLWRMGQVIASQISGVLTTQALLYAVGLGKG-AIPTAAAVNWVLRDGIGYLSKI 166

Query: 206 IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTST 265
           + ++ G+ FD   K  R   DL+  +  G+EL T A PHLF+ LA AA   ++ + +   
Sbjct: 167 VLSKYGRHFDVHPKGWRLVSDLIENMSYGLELLTPAFPHLFVYLAAAAGAGRSASGLIQA 226

Query: 266 STRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS----KRNPSLITTFALLS 321
           +T++      A   N  ++ AKGE  G ++  +G  L I +S       P L+ TF  ++
Sbjct: 227 ATKSCFNAGMAANRNFAEIIAKGEAQGMVSKSIGIALGIAVSAYVGSTGPLLVGTFFGVT 286

Query: 322 CGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR---- 377
             ++F + +  ++V L TLN  R S+ +  + ++G+V S+RE N +E IF F +      
Sbjct: 287 ALHIFCNLKSYQAVQLRTLNPYRASLVLAEYFRSGNVVSVREVNGEEPIFPFSFFNFRLR 346

Query: 378 --------------------DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSK 417
                               +  +  G+ F  A +  +   A+   ++ ERY+++ +   
Sbjct: 347 GSKGDGTLLSEESKAVAHSIENHLEFGATFGAAVKTREEADALLDIYKSERYLLSQHDD- 405

Query: 418 GKVYALLKDQAKSDDILKAAFHAYVLLHCINSS-NGHRSLALKQHEYDNSNLMLTTADL 475
            ++  +LK+ A   D+L+A   A  L +   +    HR+L    H+  N  ++  + D 
Sbjct: 406 -RLLVVLKEGATPRDMLRAMMQAIYLDYLQGTGFTSHRNL---MHDSGNGGVLRVSHDF 460


>gi|226491780|ref|NP_001140263.1| uncharacterized protein LOC100272306 [Zea mays]
 gi|194698746|gb|ACF83457.1| unknown [Zea mays]
          Length = 451

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 102/332 (30%), Positives = 175/332 (52%), Gaps = 26/332 (7%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++PDG+P SV+  Y+ Y  WR ++       GV +TQ LL +VG+ +  A P A A+NW+
Sbjct: 55  LLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 113

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K++ ++ G+ FD + K  R   D L  +  G+E+ T   PHLF+P+  AA 
Sbjct: 114 LKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPHLFVPIGAAAG 173

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI----MISKRN 310
             ++ A++  ++TR+  Y  FA   N  +V AKGE  G ++  +G GL I     I    
Sbjct: 174 AGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGIALANHIGSSV 233

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           P  + +FA ++  +++ + +  +S+ L TLN  R S+    +L +G VPS++E N +E +
Sbjct: 234 PLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSVKEVNDEEPL 293

Query: 371 F---SFPWLRDKPVV----------------LGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
           F   S    R   ++                +GS+  +  +  +   A+   ++ E+Y++
Sbjct: 294 FFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACALFDLYKNEQYLL 353

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                KGK   +LK+ +  +D+LK+ FH   L
Sbjct: 354 M--DYKGKFCIVLKEGSSPEDMLKSLFHVNYL 383


>gi|302775588|ref|XP_002971211.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
 gi|300161193|gb|EFJ27809.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
          Length = 399

 Score =  167 bits (424), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 105/308 (34%), Positives = 174/308 (56%), Gaps = 18/308 (5%)

Query: 137 PDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWIL 195
           P G+P SV   Y+ Y  +RAL+HF    + VF+TQ+L+ + G+   R TP  A  I+W+L
Sbjct: 49  PVGYPQSVGEGYLTYSQFRALQHFSSAILSVFSTQSLVFAAGL---RPTPAQATVISWVL 105

Query: 196 KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANV 255
           KDG   +GK++ +R G + D + K+ R   D+L + G  +E+ + A P  FL +A  AN+
Sbjct: 106 KDGMQHIGKLVGSRMGARMDSEPKRWRIFADVLYDAGTALEMISPACPQYFLGVAGLANL 165

Query: 256 AKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI-MISKRNPSL- 313
           AK VA VT+ +TR P+Y AFAK  N+ D+ AKGE +  +A++ G G+ I + S    SL 
Sbjct: 166 AKGVAMVTARATRLPLYTAFAKEGNLSDLYAKGEAMSTLANVAGLGVGIRLASSACTSLQ 225

Query: 314 --ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
             +    LLS  +++S  +E+++V L TLN  R ++ V  +L+ G VPS  E   +E + 
Sbjct: 226 GKLLVAPLLSVIHLYSVMEEMRAVPLDTLNAQRTAMLVADYLELGRVPSPAEIRYRERLI 285

Query: 372 SFPWLRDKPVVLGSRFK---DAFQDPDAYL-AVEPFFEKERYVVTYNPSKGKVYALLKDQ 427
               L  KP V     +   D  Q P   +  ++  F+ ER+++ ++ +  ++  +L   
Sbjct: 286 ----LPVKPNVHAGSVRIANDILQIPSGKIRELKHRFQSERFLLDFHETSTEM--VLHHS 339

Query: 428 AKSDDILK 435
           A  +D+++
Sbjct: 340 ASGEDVVR 347


>gi|168013684|ref|XP_001759421.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689351|gb|EDQ75723.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 712

 Score =  167 bits (424), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 97/297 (32%), Positives = 162/297 (54%), Gaps = 23/297 (7%)

Query: 92  GRRWNYV--AERDRSGKFKRNSIRAISLQSPQAPAEELMSFI-------RSYVVPDGFPD 142
           GR+W YV   E+ R+ +F       +SLQ+ +A  E +   +       R  +     PD
Sbjct: 230 GRQWQYVFDGEQIRAVEFD-----GVSLQNGEARNESVSGQVAGSVLKLRQTLQRIFVPD 284

Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRV 202
            V P Y+ Y+ W+ +  F    +    TQ +L +VGV   R  P A A+NW++KDG GR+
Sbjct: 285 QVWPHYLVYLKWKLVHRFLSSVLHFQCTQAMLWAVGVGAKRRLPAAAALNWVMKDGVGRL 344

Query: 203 GKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAA 261
           GK++F A  G+ FD DLK++RF   +L     G+E+ T   P  FL LA  AN+AK++A 
Sbjct: 345 GKLLFTANFGRTFDSDLKRVRFWTSVLFSSSVGLEILTPLFPQHFLLLATLANIAKSIAY 404

Query: 262 VTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK---RNPSL----- 313
               +T + I+++FA G+N+ D++AKG     +AD LG   ++ +S+     P +     
Sbjct: 405 AAYLATSSAIHRSFALGDNLADISAKGMAQTVVADNLGLAAAVCLSQFIHNFPKIEKFLP 464

Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           +  + +L+   +F+ YQ++++V L TLN+ R  + +  ++K   +PS  E +  EN+
Sbjct: 465 LAMYPVLASAELFAIYQQLQAVHLQTLNKERLEIIINIWVKDRRIPSFEEVSKIENV 521


>gi|302756337|ref|XP_002961592.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
 gi|300170251|gb|EFJ36852.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
          Length = 399

 Score =  167 bits (424), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 105/308 (34%), Positives = 174/308 (56%), Gaps = 18/308 (5%)

Query: 137 PDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWIL 195
           P G+P SV   Y+ Y  +RAL+HF    + VF+TQ+L+ + G+   R TP  A  I+W+L
Sbjct: 49  PVGYPQSVGEGYLTYSQFRALQHFSSAILSVFSTQSLVFAAGL---RPTPAQATVISWVL 105

Query: 196 KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANV 255
           KDG   +GK++ +R G + D + K+ R   D+L + G  +E+ + A P  FL +A  AN+
Sbjct: 106 KDGMQHIGKLVGSRMGARMDSEPKRWRIFADVLYDAGTALEIISPACPQYFLGVAGLANL 165

Query: 256 AKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI-MISKRNPSL- 313
           AK VA VT+ +TR P+Y AFAK  N+ D+ AKGE +  +A++ G G+ I + S    SL 
Sbjct: 166 AKGVAMVTARATRLPLYTAFAKEGNLSDLYAKGEAMSTLANVAGLGVGIRLASSACTSLQ 225

Query: 314 --ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
             +    LLS  +++S  +E+++V L TLN  R ++ V  +L+ G VPS  E   +E + 
Sbjct: 226 GKLLVAPLLSVIHLYSVMEEMRAVPLDTLNAQRTAMLVADYLELGRVPSPAEIRYRERLI 285

Query: 372 SFPWLRDKPVVLGSRFK---DAFQDPDAYL-AVEPFFEKERYVVTYNPSKGKVYALLKDQ 427
               L  KP V     +   D  Q P   +  ++  F+ ER+++ ++ +  ++  +L   
Sbjct: 286 ----LPVKPNVHAGSVRIANDILQIPSGKIRELKHRFQSERFLLDFHETSTEM--VLHHS 339

Query: 428 AKSDDILK 435
           A  +D+++
Sbjct: 340 ASGEDVVR 347


>gi|414879009|tpg|DAA56140.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
          Length = 593

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 102/332 (30%), Positives = 175/332 (52%), Gaps = 26/332 (7%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++PDG+P SV+  Y+ Y  WR ++       GV +TQ LL +VG+ +  A P A A+NW+
Sbjct: 197 LLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 255

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K++ ++ G+ FD + K  R   D L  +  G+E+ T   PHLF+P+  AA 
Sbjct: 256 LKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPHLFVPIGAAAG 315

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI----MISKRN 310
             ++ A++  ++TR+  Y  FA   N  +V AKGE  G ++  +G GL I     I    
Sbjct: 316 AGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGIALANHIGSSV 375

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           P  + +FA ++  +++ + +  +S+ L TLN  R S+    +L +G VPS++E N +E +
Sbjct: 376 PLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSVKEVNDEEPL 435

Query: 371 F---SFPWLRDKPVV----------------LGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
           F   S    R   ++                +GS+  +  +  +   A+   ++ E+Y++
Sbjct: 436 FFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACALFDLYKNEQYLL 495

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                KGK   +LK+ +  +D+LK+ FH   L
Sbjct: 496 M--DYKGKFCIVLKEGSSPEDMLKSLFHVNYL 525


>gi|356540900|ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
          Length = 592

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 103/337 (30%), Positives = 171/337 (50%), Gaps = 31/337 (9%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++P+GFP+SVT  Y+ Y  WRA++       GV  TQ+LL +VG+ +  A P A AINW+
Sbjct: 185 MLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG-AIPTAAAINWV 243

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K++ +  G+ FD + K  R   DLL     G+E++T A P  F+ +   A 
Sbjct: 244 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMSTPACPQFFVLIGAVAG 303

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN----IADLLGTGLSIMISKRN 310
            +++ A++   STR+  +  FA   N  +V AKGE  G     I  +LG GL   I    
Sbjct: 304 ASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIVLGIGLGNCIGSST 363

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           P ++ +F +L+  +++ + +  +S+ L TLN  R S+    +L +G  P ++E N +E +
Sbjct: 364 PLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 423

Query: 371 F-SFPWLR-----------------------DKPVVLGSRFKDAFQDPDAYLAVEPFFEK 406
           F + P L                        +  + LGS+  +     +  LA+   ++ 
Sbjct: 424 FPAVPILNATFASKAQSFALSSEAKDAAAEIEHRLQLGSKLSEIVNSKEDVLALFGLYKN 483

Query: 407 ERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           E Y+++ +   GK   +LK++    D+LKA F    L
Sbjct: 484 EGYILSEH--MGKYSVVLKEKCSQLDMLKALFQVNYL 518


>gi|242055389|ref|XP_002456840.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
 gi|241928815|gb|EES01960.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
          Length = 569

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 103/333 (30%), Positives = 176/333 (52%), Gaps = 27/333 (8%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++PDG+P SV+  Y+ Y  WR ++       GV +TQ LL +VG+ R  A P A A+NW+
Sbjct: 172 LLPDGYPHSVSSDYMNYSLWRGVQGVASQISGVLSTQALLYAVGLGRG-AIPTAAAVNWV 230

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K++ ++ G+ FD + K  R   D L  +  G+E+ T   PHLF+P+  AA 
Sbjct: 231 LKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPHLFVPIGAAAG 290

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI----MISKRN 310
            +++ A++   +TR+  Y  FA   N  +V AKGE  G ++  +G GL I     I    
Sbjct: 291 ASRSAASLIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGIALANQIGSSV 350

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           P  + +FA ++  +++ + +  +S+ L TLN  R S+    +L +G VPS++E N +E +
Sbjct: 351 PLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSVKEVNDEEPL 410

Query: 371 FSFPWL--------------RD------KPVVLGSRFKDAFQDPDAYLAVEPFFEKERYV 410
           F    L              +D      + + +GS+  +  +  +   A+   ++ E+Y+
Sbjct: 411 FFNLSLGASCQQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACALFDLYKNEQYL 470

Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           +     KGK   +LK+ +  +D+LK+ FH   L
Sbjct: 471 LM--DYKGKFCIVLKEGSSPEDMLKSLFHVNYL 501


>gi|222628554|gb|EEE60686.1| hypothetical protein OsJ_14164 [Oryza sativa Japonica Group]
          Length = 609

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 106/349 (30%), Positives = 179/349 (51%), Gaps = 43/349 (12%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++PDG+PDSV+  Y+ Y  WR ++       GV +TQ LL +VG+ +  A P A A+NW+
Sbjct: 196 LLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 254

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K++ ++ G+ FD + K  R   DLL     G+E+ T   PHLF+P+  AA 
Sbjct: 255 LKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPHLFVPIGAAAG 314

Query: 255 ----------------VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
                           + +N   +   +TR+  Y  FA   N  +V AKGE  G ++  L
Sbjct: 315 AGRSAAALIQTIFTHFMLENANHMRKAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFL 374

Query: 299 GTGLSIMISKRNPSLIT----TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
           G  L I ++ R  S ++    +FA ++  +++ + +  +S+ L TLN  R S+    +L 
Sbjct: 375 GIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLL 434

Query: 355 TGHVPSLREGNLQENIF--------------SFPWLRD------KPVVLGSRFKDAFQDP 394
           +G VPS++E N +E +F                P  +D      + + LGS+  +  ++ 
Sbjct: 435 SGQVPSVKEVNDEEPLFLNLSIGTSRKESKILSPQAKDAAEIICRRLQLGSKLSEIIENK 494

Query: 395 DAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           +   A+   ++ E+Y++T    KGK   +LK+ +  +D+LK+ FH   L
Sbjct: 495 EDACALFDLYKNEQYLLT--EYKGKFCVILKEGSSPEDMLKSLFHVNYL 541


>gi|449016577|dbj|BAM79979.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 719

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 107/339 (31%), Positives = 176/339 (51%), Gaps = 19/339 (5%)

Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
           S ++  E L +++    +P G+P +VT  Y+ +  WR L++     MGVF+T+ LL  +G
Sbjct: 326 SERSLVERLRNWLIQVYLPHGYPHTVTQDYLAFTLWRLLQNTAASVMGVFSTEALLLGLG 385

Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVEL 237
           + +N  +  A A+ WILKDG G  GK+ +A   GK+FD D +  R   DLL +L   +E+
Sbjct: 386 LGKN-VSGTAQAVQWILKDGFGHAGKIAYAAVAGKQFDMDPRSWRIMSDLLEDLAGVLEI 444

Query: 238 ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL 297
            T      FL LA  A   + +AA+T T+TR  I+K+ A  EN GD+  KGE  G    L
Sbjct: 445 ITPIFSGQFLVLASIATTMRAIAAMTGTATRHAIFKSMALAENQGDLATKGESQGVTTKL 504

Query: 298 LGTGLSIMISKR----NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
           +G GL I +SKR      +L+  + + +  ++ ++++ ++ V   TLNR R ++ +++F+
Sbjct: 505 IGLGLGITLSKRIGQDYTNLLIAYGIAAVIHLAANFKAMQCVEFSTLNRQRMALLLDAFM 564

Query: 354 -----------KTGHVPSLREGNLQENIFSFPWLRDKP-VVLGSRFKDAFQDPDAYLAVE 401
                      +   +P+ RE   +E     PW   +P +V G+  +   +    +L   
Sbjct: 565 SARAQTSRQASRQLTLPTPREICAKERFMIPPWRDYRPEIVFGASLQRV-KHARHFLHCA 623

Query: 402 PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
             +  ERY++   PS   V   L+D A   D LKA  +A
Sbjct: 624 ALYRNERYIIYAEPSTKHVLVFLRDGATVKDQLKAFMNA 662


>gi|449502048|ref|XP_004161529.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 428

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 106/340 (31%), Positives = 183/340 (53%), Gaps = 12/340 (3%)

Query: 103 RSGKFKRNSIRAISLQSPQAPAEELM--SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHF 160
           R  +F+ +   ++ +     PA + +  SF+ ++  P G+P SV+  Y+ Y  +RAL+H 
Sbjct: 33  RRCQFQPDGHLSVKIIDDSRPAIQRIVDSFLNTFF-PSGYPYSVSEGYLRYTQFRALQHV 91

Query: 161 FGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
              A+ V +TQ+LL + G+    A   A  ++W+LKDG   VGK+I +  G + D + K+
Sbjct: 92  TSAALSVLSTQSLLFAAGLRPTAAQ--ATVVSWVLKDGMQHVGKLICSNLGARMDSEPKR 149

Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
            R   D+L +LGAG+E+ +   PHLFL +A   N AK +A V + +TR PIY +FAK  N
Sbjct: 150 WRVIADVLYDLGAGLEVISPLCPHLFLEMAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 209

Query: 281 IGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI----TTFALLSCGYVFSSYQEVKSVV 336
           + D+ AKGE +  + +++G G  + ++    S I        LLS  +V+   +++++  
Sbjct: 210 LSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLSIVHVYCVVEQMRATP 269

Query: 337 LHTLNRARFSVAVESFLKTGHVPSLREGNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPD 395
           ++TLN  R ++ V  F+K G +PS  +   QE+ +F    + +   V   R       P 
Sbjct: 270 INTLNPQRTAMIVADFVKAGRIPSPADIRYQEDLVFPGRLIEEAGNVKVGRALHEVIKPS 329

Query: 396 AYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILK 435
             + ++  F  E++V+  N SK  V  +L+  A  +D L+
Sbjct: 330 KLVEMKQIFPGEKFVL--NQSKKWVDMVLEHDASGEDALR 367


>gi|449462449|ref|XP_004148953.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 428

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 106/340 (31%), Positives = 183/340 (53%), Gaps = 12/340 (3%)

Query: 103 RSGKFKRNSIRAISLQSPQAPAEELM--SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHF 160
           R  +F+ +   ++ +     PA + +  SF+ ++  P G+P SV+  Y+ Y  +RAL+H 
Sbjct: 33  RRCQFQPDGHLSVKIIDDSRPAIQRIVDSFLNTFF-PSGYPYSVSEGYLRYTQFRALQHV 91

Query: 161 FGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
              A+ V +TQ+LL + G+    A   A  ++W+LKDG   VGK+I +  G + D + K+
Sbjct: 92  TSAALSVLSTQSLLFAAGLRPTAAQ--ATVVSWVLKDGMQHVGKLICSNLGARMDSEPKR 149

Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
            R   D+L +LGAG+E+ +   PHLFL +A   N AK +A V + +TR PIY +FAK  N
Sbjct: 150 WRVIADVLYDLGAGLEVISPLCPHLFLEMAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 209

Query: 281 IGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI----TTFALLSCGYVFSSYQEVKSVV 336
           + D+ AKGE +  + +++G G  + ++    S I        LLS  +V+   +++++  
Sbjct: 210 LSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLSIVHVYCVVEQMRATP 269

Query: 337 LHTLNRARFSVAVESFLKTGHVPSLREGNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPD 395
           ++TLN  R ++ V  F+K G +PS  +   QE+ +F    + +   V   R       P 
Sbjct: 270 INTLNPQRTAMIVADFVKAGRIPSPADIRYQEDLVFPGRLIEEAGNVKVGRALHEVIKPS 329

Query: 396 AYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILK 435
             + ++  F  E++V+  N SK  V  +L+  A  +D L+
Sbjct: 330 KLVEMKQIFPGEKFVL--NQSKKWVDMVLEHDASGEDALR 367


>gi|357484331|ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
 gi|355513788|gb|AES95411.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
          Length = 630

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 107/360 (29%), Positives = 177/360 (49%), Gaps = 35/360 (9%)

Query: 116 SLQSPQAPA---EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQT 172
           SL S Q P     +   F+   ++P+GFP+SVT  Y+ Y  WR ++       GV  TQ 
Sbjct: 157 SLNSSQVPTFLYNKCREFVVRLMLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQA 216

Query: 173 LLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELG 232
           LL +VG+ +  A P A AINW+LKDG G + K++ +  G+ FD + K  R   DLL    
Sbjct: 217 LLYAVGLGKG-AIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAA 275

Query: 233 AGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVG 292
            G+E+ T A PHLF+P+   A  +++ A++   STR+  +  FA   N  +V AKGE  G
Sbjct: 276 FGLEMCTPAFPHLFVPIGAFAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQG 335

Query: 293 NIADL----LGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
            ++      +G GL   I    P ++ +F +++  +++ + +  +S+ L TLN  R S+ 
Sbjct: 336 MVSRFIGIGIGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQSIQLRTLNPHRASLV 395

Query: 349 VESFLKTGHVPSLREGNLQENIFSFPWLRDKPVV-------------------------L 383
              +L +G  P ++E N +E +F    + + P                           L
Sbjct: 396 FSEYLLSGQAPPVKEVNAEEPLFPAVPILNAPFANKETQSIVLSSEAKDAAVEIESRLQL 455

Query: 384 GSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           GS+  +   + +  LA+   ++ E Y+++ +   GK   +LK+     D+LKA F    L
Sbjct: 456 GSKLSEIINNKEEVLALFSLYKNEGYILSEH--TGKFCVVLKETCSQLDMLKALFQVNYL 513


>gi|357167308|ref|XP_003581100.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
           distachyon]
          Length = 604

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 104/333 (31%), Positives = 176/333 (52%), Gaps = 27/333 (8%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++PDG+P+SV+  Y+ Y  WR L+       GV +TQ LL +VG+ +    P A AINW+
Sbjct: 206 LLPDGYPNSVSSDYLHYSLWRGLQGVASQISGVLSTQALLYAVGLGKG-VIPTAAAINWV 264

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K++ ++ G+ FD + K+ R   DLL     G+E+ T   PHLF+P+  AA 
Sbjct: 265 LKDGLGYLSKILLSKFGRHFDVNPKRWRLFADLLENTAYGLEILTPVFPHLFVPIGAAAG 324

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR----N 310
            A++ AA+   +TR+  Y  FA   N  +V AKGE  G ++  LG  L I ++ +     
Sbjct: 325 AARSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGIMLGIALANQIGSSV 384

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           P  + +FA ++  +++ + +  +S+ L TLN  R S+    +L +G V S++E N +E +
Sbjct: 385 PLALISFAGVTVVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVSSVKEVNDEEPL 444

Query: 371 F----SFPWLRDKPVV----------------LGSRFKDAFQDPDAYLAVEPFFEKERYV 410
           F        L+   ++                LGS+  +   + +   A+   ++ E+Y+
Sbjct: 445 FLNSSMGAKLKQNKILSAEAKDAADRICRRLQLGSKLSEIIDNEEDACALFDLYKDEQYL 504

Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           +     KGK   +LK+ +  +D+LK+ FH   L
Sbjct: 505 LLEY--KGKFCVVLKEGSSPEDMLKSVFHVNYL 535


>gi|168027023|ref|XP_001766030.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682673|gb|EDQ69089.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 397

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 178/328 (54%), Gaps = 14/328 (4%)

Query: 117 LQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNS 176
           L+  + P +  +    S   P G+P+SV   Y+ Y  +RA++HF    M V +TQ+LL +
Sbjct: 12  LKETRPPVQIAVDSFLSRFFPLGYPNSVAEGYLDYSKYRAVQHFASAVMHVLSTQSLLFA 71

Query: 177 VGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGV 235
            G+   R TP  A  ++W+LKDG   VGK++ +R G + D + K+ R   D + ++GAG+
Sbjct: 72  AGL---RPTPAQATIVSWVLKDGMQHVGKLLCSRMGARMDSEPKRWRIFADFMYDVGAGL 128

Query: 236 ELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 295
           E+ +   P  FLP+A  AN+AK +A V++ +TR P+Y +F K  N+ D+ AKGE +  ++
Sbjct: 129 EVISPLCPQHFLPVAGMANLAKGMALVSARATRLPVYSSFVKEGNLSDLYAKGEAISTLS 188

Query: 296 DLLGTGLSIMISKRNPSLI----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
           ++LG G+ I ++    S I        +LS  +++   QE+++  ++TLN  R ++ V  
Sbjct: 189 NVLGLGVGIHLASNFASTIQGKLIWAPILSAIHLYCVQQEMRAAPINTLNSQRTAMLVAD 248

Query: 352 FLKTGHVPSLREGNLQENIFSFPWLRDKP---VVLGSRFKDAFQDPDAYLAVEPFFEKER 408
           F+KTG V S  E   +E +   P   +K    V +G+  +     P     ++  F+ E+
Sbjct: 249 FVKTGSVSSPAELRYRERLI-LPVGLEKDAGNVRVGAPLQSGIGKPSVMAELKEKFQGEQ 307

Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKA 436
           +++ +      +  +L  +A  +D ++ 
Sbjct: 308 FLLHFGEQATDM--VLHQKASGEDAVRG 333


>gi|224082810|ref|XP_002306848.1| predicted protein [Populus trichocarpa]
 gi|222856297|gb|EEE93844.1| predicted protein [Populus trichocarpa]
          Length = 428

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 115/347 (33%), Positives = 183/347 (52%), Gaps = 17/347 (4%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+   R TP  
Sbjct: 61  SFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL---RPTPAQ 116

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++WILKDG    GK+I +  G + D + K+ R   D+L +LG G+E+ +   PHLFL
Sbjct: 117 ATAVSWILKDGMQHAGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFL 176

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI- 306
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + ++LG G+ I + 
Sbjct: 177 EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGVGIQLA 236

Query: 307 SKRNPSLITTFA---LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
           S    S+   F    LLS  +V    +E+++  ++TLN  R ++ V  F+KTG + S  +
Sbjct: 237 STVCSSMQGKFVAGPLLSIVHVCCVIEEMRATPVNTLNPQRTAMVVADFVKTGKISSPAD 296

Query: 364 GNLQENIFSFPWLRDK---PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKV 420
               E++  FP    +    V +G     A + P     ++  F  E+++++  P     
Sbjct: 297 LRYHEDLL-FPGRLIENAGNVKVGQALHRAVR-PSKLRELKEIFPGEKFILS--PGNKWT 352

Query: 421 YALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
             +L+  A  +D L+A   A        SS+   S+ L Q  Y+  N
Sbjct: 353 DLVLEQNASGEDALRAWLVAAYASSMKKSSHESTSVTL-QDAYEKMN 398


>gi|296087683|emb|CBI34939.3| unnamed protein product [Vitis vinifera]
          Length = 559

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 100/329 (30%), Positives = 170/329 (51%), Gaps = 23/329 (6%)

Query: 133 SYVVPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
           S  V D F P  V+ +Y+ Y+ W+ L   F  A+ V  TQ +  ++G+  +R+ P A A+
Sbjct: 155 SSAVRDFFIPQKVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPSAAAL 214

Query: 192 NWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA 250
           NW+LKDG GR+ + I+ A     FD +LK++RF+  +L  L  GVEL T A P+ FL LA
Sbjct: 215 NWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELLTPAFPNYFLLLA 274

Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMI 306
             AN+AK V      +T + ++++FA  +N+G+V+AK +    C  N+  +L   L+I+ 
Sbjct: 275 TIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCFDNLGLMLAAVLNIL- 333

Query: 307 SKRNPSLI-----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
            K N  L+       + + S   +F  YQ +K V L TL + R  + +  +++ G+VPS 
Sbjct: 334 CKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLEIILNMWIQLGYVPSP 393

Query: 362 REGNLQENIFSFPWLRDK---PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYN---- 414
            E + +E I SFPW + +   P+ +G           + L ++    ++ Y +       
Sbjct: 394 AEVSKEEGI-SFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSLSSEDFYFICMEACRR 452

Query: 415 ---PSKGKVYALLKDQAKSDDILKAAFHA 440
              P +  +   L++ A + D++     A
Sbjct: 453 LTRPEQQGILLCLREGASTTDVISGLLQA 481


>gi|225452488|ref|XP_002278851.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
          Length = 514

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 100/329 (30%), Positives = 170/329 (51%), Gaps = 23/329 (6%)

Query: 133 SYVVPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
           S  V D F P  V+ +Y+ Y+ W+ L   F  A+ V  TQ +  ++G+  +R+ P A A+
Sbjct: 110 SSAVRDFFIPQKVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPSAAAL 169

Query: 192 NWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA 250
           NW+LKDG GR+ + I+ A     FD +LK++RF+  +L  L  GVEL T A P+ FL LA
Sbjct: 170 NWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELLTPAFPNYFLLLA 229

Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMI 306
             AN+AK V      +T + ++++FA  +N+G+V+AK +    C  N+  +L   L+I+ 
Sbjct: 230 TIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCFDNLGLMLAAVLNIL- 288

Query: 307 SKRNPSLI-----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
            K N  L+       + + S   +F  YQ +K V L TL + R  + +  +++ G+VPS 
Sbjct: 289 CKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLEIILNMWIQLGYVPSP 348

Query: 362 REGNLQENIFSFPWLRDK---PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYN---- 414
            E + +E I SFPW + +   P+ +G           + L ++    ++ Y +       
Sbjct: 349 AEVSKEEGI-SFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSLSSEDFYFICMEACRR 407

Query: 415 ---PSKGKVYALLKDQAKSDDILKAAFHA 440
              P +  +   L++ A + D++     A
Sbjct: 408 LTRPEQQGILLCLREGASTTDVISGLLQA 436


>gi|255567274|ref|XP_002524618.1| conserved hypothetical protein [Ricinus communis]
 gi|223536171|gb|EEF37826.1| conserved hypothetical protein [Ricinus communis]
          Length = 436

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 103/309 (33%), Positives = 171/309 (55%), Gaps = 17/309 (5%)

Query: 137 PDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWIL 195
           P G+P SV   Y+ Y  +RAL+H    A+ V +TQ+LL + G+   R TP  A A++W+L
Sbjct: 76  PSGYPYSVNEGYLRYTQFRALQHLSSAALSVLSTQSLLFAAGL---RPTPAQATAVSWVL 132

Query: 196 KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANV 255
           KDG   VGK+I +  G + D + K+ R   D+L +LG G+E+ +   PHLFL +A   N 
Sbjct: 133 KDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFLEVAGLGNF 192

Query: 256 AKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI- 314
           AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G+ I ++    S I 
Sbjct: 193 AKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGVGIQLASTVCSSIQ 252

Query: 315 ---TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN-I 370
                  LLS  +VFS  +E+++  ++TLN  R ++ V  ++K G + S  +   +E+ I
Sbjct: 253 GKLVVGPLLSILHVFSVVEEMRAAPVNTLNPQRTAMIVADYVKAGKISSPADLRYREDLI 312

Query: 371 FSFPWLRDK-PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA--LLKDQ 427
           F    + D   V +G     AF  P     ++  F +E++++    ++G  +   +L+  
Sbjct: 313 FPGRLIEDAGNVKVGRALHKAFT-PSKVHELKDIFPEEKFLL----NRGNTWTDMVLEQN 367

Query: 428 AKSDDILKA 436
           A  +D L+ 
Sbjct: 368 ASGEDALRG 376


>gi|449018781|dbj|BAM82183.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 631

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 85/234 (36%), Positives = 138/234 (58%), Gaps = 7/234 (2%)

Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
           +L+  +RS  +P+GFP+SV PSY+ Y  WR  ++       V +T  LL +VG+S   +T
Sbjct: 233 QLVLQLRSVFLPEGFPESVGPSYLRYSLWRGAQNVVSAMTAVLSTHALLAAVGLS---ST 289

Query: 186 PGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
           P A A +W+LKDG G++GK++ AR+G+ FD D K+ R + DLL + G  +E+ T   P  
Sbjct: 290 PVAAATSWVLKDGIGQLGKLLTARRGRDFDADPKRYRLSSDLLYDAGLSLEILTPLFPQY 349

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
           FL LA   N  K+VA     + R  +  +F    N+GD++AK +    + +L G GL I+
Sbjct: 350 FLALAALGNFTKSVAITVGMACRNSVLSSFVLRGNLGDISAKNDAQNVVTNLTGMGLGIV 409

Query: 306 ISKR---NPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
            ++     PS+ +  FA L+  Y   +YQ +K+V L+ +NR R ++  +SF+++
Sbjct: 410 TARALPPKPSVRLGAFAALTALYSVLNYQSMKAVKLNVVNRQRGAIIADSFVRS 463


>gi|374921923|gb|AFA26139.1| hypothetical protein, partial [Lolium perenne]
          Length = 109

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 72/110 (65%), Positives = 95/110 (86%), Gaps = 1/110 (0%)

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
           G+SI+ISKRNPSL+ +FA+LSCGY+ SSYQEV+SVVL+TLN ARF+VAV+SF+KTGHVPS
Sbjct: 1   GMSILISKRNPSLVASFAVLSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFVKTGHVPS 60

Query: 361 LREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYV 410
           L+EGN +E +F+ PW R +PV +GSRF +AFQ+P +++A  P FE ERY+
Sbjct: 61  LKEGNSEETVFNPPW-RHQPVAIGSRFGEAFQEPASFVATRPLFEDERYL 109


>gi|388508448|gb|AFK42290.1| unknown [Lotus japonicus]
          Length = 414

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 107/329 (32%), Positives = 170/329 (51%), Gaps = 12/329 (3%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +R+++H    A+ V +TQ+LL + G+   R TP  
Sbjct: 47  SFMNKFF-PSGYPYSVNEGYLRYTQFRSVQHVTSAALSVLSTQSLLFAAGL---RPTPAQ 102

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++WILKDG   VGK+I +  G + D + K+ R   D+L +LG G+E+ +   PHLFL
Sbjct: 103 ATAVSWILKDGMQHVGKLICSNWGARMDSEPKRWRLIADVLYDLGTGLEVLSPLCPHLFL 162

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM-- 305
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE    + +++G G+ I   
Sbjct: 163 QMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLCNVIGIGVGIQLA 222

Query: 306 --ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
             I       +    LLS  +++S  +E+++  ++TLN  R ++ V  FLK G V S  +
Sbjct: 223 STICASVQGKMVVGPLLSILHIYSVSEEMRATPINTLNPRRTAMIVTDFLKAGVVSSPAD 282

Query: 364 GNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
              +E+ +F      D   V   R       P   L +E     E++++  N     V  
Sbjct: 283 LRYREDLLFRVRLTEDAGNVRVGRALHEVIKPSRLLELEQVLPGEKFLL--NRGGKCVDM 340

Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSN 451
           +L+  A  +D L+    A       NSS+
Sbjct: 341 VLEQDASGEDALRGWLVAAYAAQIENSSH 369


>gi|160333925|ref|NP_001103923.1| uncharacterized protein LOC555936 [Danio rerio]
 gi|126631351|gb|AAI34047.1| Zgc:162613 protein [Danio rerio]
          Length = 432

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 162/323 (50%), Gaps = 11/323 (3%)

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
           A  +    +S  +P G+P+SV+  Y+ Y  W  ++ F     G   TQ  L  VGV    
Sbjct: 41  ANSVSGVFKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQE 100

Query: 184 ATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           AT  A  I W+L+DG G +G+++FA  +G K D + K+ R   D+L ++   +E+A    
Sbjct: 101 ATVAAATITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHF 160

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLG 299
           P  F  + C A + K++  V   +TR  +    A+  N+ D++AK    E + N+A LL 
Sbjct: 161 PPFFTIIVCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLV 220

Query: 300 TGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
           +   I +   NP L    F L +  ++F++Y+ V+SVV+ TLN AR S+ +  +L  G V
Sbjct: 221 SLALIPLVTDNPLLTFILFYLFTVLHLFANYRAVRSVVMETLNEARLSIVLHQYLLHGQV 280

Query: 359 PSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDP-DAYLAVEPFFEKERYVVTYNPSK 417
            S  E N +E +F   + R  P+ LG R  D   +P D  LA++     + Y++      
Sbjct: 281 LSPAEANQREPVF-LSFRRTVPIKLGVRLGDLIHEPGDLQLALKN--NSKPYLIGIK--N 335

Query: 418 GKVYALLKDQAKSDDILKAAFHA 440
           G V   L   A  DD +KA   A
Sbjct: 336 GAVCVCLGSDASVDDEIKAVCQA 358


>gi|356572258|ref|XP_003554286.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 419

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 106/345 (30%), Positives = 181/345 (52%), Gaps = 12/345 (3%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +RAL+H    A+ V +TQ+LL + G+   R TP  
Sbjct: 51  SFMNKFF-PSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGL---RPTPAQ 106

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++WILKDG   VGK+I +  G + D + K+ R   D+L ++G G+E+ +   PHLFL
Sbjct: 107 ATAVSWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADVLYDIGIGLEVLSPLCPHLFL 166

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE    + +++G G+ I ++
Sbjct: 167 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVVGIGVGIQLA 226

Query: 308 KRNPSLI----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
               + I        LLS  +++S  +E+++  ++TLN  R ++ V  FLK G V S  +
Sbjct: 227 STICASIQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPAD 286

Query: 364 GNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
              ++N +F+     D   V   +       P   L ++  F +E++++ +      +  
Sbjct: 287 LRYRDNLLFNVQVKEDTGNVRVGKNVHKVIKPSRLLELKQVFPEEKFLLNFG--NKCIDM 344

Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
           +L+  A  +D L+    A       +SS+   + ++    Y+  N
Sbjct: 345 VLEQDASGEDALRGWLVAAYAARTESSSHELSATSVLHEAYEKMN 389


>gi|414586417|tpg|DAA36988.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 439

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 114/380 (30%), Positives = 185/380 (48%), Gaps = 18/380 (4%)

Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
           D +G   R SI    +Q  +   ++L+    +   P G+P SV   Y+ Y  +R L+HF 
Sbjct: 40  DAAGSGSR-SISVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFS 98

Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
              + V +TQ+LL + G+   R TP  A A++WILKDG    GK+I +  G + D + K 
Sbjct: 99  SAMLHVLSTQSLLFAAGL---RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKS 155

Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
            R   D+L +LG  +E+ +   P LFL +A   N AK +A V + +TR PIY +FAK  N
Sbjct: 156 WRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 215

Query: 281 IGDVTAKGECVGNIADLLGTGLSI-----MISKRNPSLITTFALLSCGYVFSSYQEVKSV 335
           + D+ AKGE +  + +++G G  I     + S     LI    LLS  +++   QE+++ 
Sbjct: 216 LSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG-PLLSAVHIYGVIQEMRAT 274

Query: 336 VLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP---WLRDKPVVLGSRFKDAFQ 392
            ++TLN  R ++ V  F+K+G V S  E   +E++  FP         V +G   +  F 
Sbjct: 275 PVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLL-FPNRVIEEAGSVKIGQPLRKVF- 332

Query: 393 DPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNG 452
            P     +   F  E+++++      + Y +L+  A  +D L+    A        S  G
Sbjct: 333 SPRLVEQLRATFPGEKFLLSQK--SNRTYMVLERSASGEDALRGWLVAAFASEMERSGAG 390

Query: 453 HRSLALKQHEYDNSNLMLTT 472
            R  A+    Y+    +  T
Sbjct: 391 SRDDAVLSEAYEKMERVFPT 410


>gi|297826561|ref|XP_002881163.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327002|gb|EFH57422.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 88/238 (36%), Positives = 140/238 (58%), Gaps = 9/238 (3%)

Query: 123 PAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
           P + + SF+  +  P G+P SV   Y+ Y  +RAL+HF   A+ V +TQ+LL + G+   
Sbjct: 59  PQKMVESFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL--- 114

Query: 183 RATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAA 241
           R TP  A  ++WILKDG   VGK+I +  G + D + K+ R   D+L +LG G+EL +  
Sbjct: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPL 174

Query: 242 VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTG 301
            PHLFL +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + ++ G G
Sbjct: 175 CPHLFLEMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIG 234

Query: 302 LSIMISKRNPS----LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
             I ++    S     +   ++LS  +V+S  ++++ V ++TLN  R ++ V ++LK 
Sbjct: 235 AGIQLASTICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANYLKV 292


>gi|340711243|ref|XP_003394188.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus terrestris]
          Length = 464

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 98/332 (29%), Positives = 163/332 (49%), Gaps = 24/332 (7%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
            +S ++   +P G+PDSV P Y  Y  W  ++ F    MG  TT +++  VGV    ATP
Sbjct: 45  FISIVKEVFLPQGYPDSVHPDYTSYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEASATP 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
            A AI WILKDG G +G+++FA   G   D   K+ R   D+L +L  G+EL        
Sbjct: 105 LAAAITWILKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDLAMGLELFLPYFSSY 164

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTG 301
            L + C +   K++  V   +TR  + +  A   N+ DV+AK      CV  IA  +G  
Sbjct: 165 SLGILCISTAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGI- 223

Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
           L + I      ++  +  L   +++++Y  VK++ L++LN  R ++ V+S++    +P  
Sbjct: 224 LILSIFHNGRYIMELYLFLVMIHLYANYMAVKALCLNSLNEDRLALIVKSYISNEVIPKP 283

Query: 362 REGNLQENIFSFPWLRDKPV--VLGSRFK-----------DAFQDPDAYLAVEPFFEKER 408
            E N +E++     L  KP   + G   K           D     D  LA++ F ++ +
Sbjct: 284 EEVNKKESVL----LLTKPTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDR-K 338

Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
           Y+++ +     ++  LK  A+  D+L+A FHA
Sbjct: 339 YLISIDVQNKTIFICLKKDAQPYDVLEAYFHA 370


>gi|242076506|ref|XP_002448189.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
 gi|241939372|gb|EES12517.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
          Length = 439

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 110/353 (31%), Positives = 177/353 (50%), Gaps = 19/353 (5%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +RAL+HF    + V +TQ+LL + G+   R TP  
Sbjct: 68  SFLNKFF-PSGYPYSVNEGYLTYTRFRALQHFSSAMLHVLSTQSLLFAAGL---RPTPAQ 123

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++WILKDG    GK+I +  G + D + K  R   D+L +LG  +E+ +   P LFL
Sbjct: 124 ATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFL 183

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI--- 304
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I   
Sbjct: 184 EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 243

Query: 305 --MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
             + S     LI    LLS  +++   QE+++  ++TLN  R ++ V  F+K+G V S  
Sbjct: 244 STVCSTTQGKLIAG-PLLSAVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPA 302

Query: 363 EGNLQENIFSFP---WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
           E   +E++  FP         V +G   +     P     +   F  E++++T      K
Sbjct: 303 ELRYREDLL-FPNRVIEEAGSVKIGQPLRRVL-SPSLVEQLRATFPDEKFLLTQK--SNK 358

Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTT 472
            Y +L+  A  +D L+    A        S    R   LK+  Y+ + ++  +
Sbjct: 359 TYMVLEQSASGEDALRGWLVAAFASEMERSGIVSRDTVLKE-AYEKTKMVFPS 410


>gi|356505015|ref|XP_003521288.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 415

 Score =  158 bits (399), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 101/314 (32%), Positives = 167/314 (53%), Gaps = 12/314 (3%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +RA++H    A+ V +TQ+LL + G+   R TP  
Sbjct: 47  SFMNKFF-PSGYPYSVNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGL---RPTPAQ 102

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++WILKDG   VGK+I +  G + D + K+ R   D L ++G G+E+ +   PHLFL
Sbjct: 103 ATAVSWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFL 162

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM-- 305
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE    + +++G G+ I   
Sbjct: 163 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLA 222

Query: 306 --ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
             I       +    LLS  +++S  +E+++  ++TLN  R ++ V  FLK G V S  +
Sbjct: 223 STICASMQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPAD 282

Query: 364 GNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
              +EN +F+     D   V   +       P   L ++  F +E++++ +      +  
Sbjct: 283 LRYRENLLFNVHVKEDAGNVRVGKDVHKVIKPSRLLELKQVFPEEKFLLNFG--NKCIDM 340

Query: 423 LLKDQAKSDDILKA 436
           +L+  A  +D L+ 
Sbjct: 341 VLEQDASGEDALRG 354


>gi|301622642|ref|XP_002940632.1| PREDICTED: UPF0420 protein C16orf58 [Xenopus (Silurana) tropicalis]
          Length = 473

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 111/345 (32%), Positives = 177/345 (51%), Gaps = 27/345 (7%)

Query: 116 SLQSPQAPAEE----LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQ 171
           SL   + PAE     L  F  S ++P G+PDSV+  Y+ Y  W  ++ F     G   TQ
Sbjct: 50  SLVCERHPAEHGYRSLRDFFVSLLLPHGYPDSVSKDYLAYQLWDTIQAFSSSVTGSLATQ 109

Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLME 230
           +LL   GV  + AT  A  I WIL+DG G VG+++FA  +G + D D K+ R   DLL +
Sbjct: 110 SLLRGSGVGDSTATVAAATITWILRDGTGMVGRILFAWMKGSRLDCDAKRWRLFADLLND 169

Query: 231 LGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG-- 288
           +   +E+     P  F    CAA V K +  V   +TR  I    AK +N+ DV AK   
Sbjct: 170 IAIFMEIMAPVFPSCFTLTVCAAGVFKCIVGVAGGATRAAITVHQAKRDNMADVCAKDGS 229

Query: 289 -ECVGNIADLLGTGLSI-MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFS 346
            E + N+A LL +   + ++S    +    F LL+  +++++Y+ V+SV++ TLN++R S
Sbjct: 230 QETLVNLAGLLVSLFLVPLVSDSMMATYLLFFLLTILHLYANYRAVRSVIMETLNQSRLS 289

Query: 347 VAVESFLKTGHVPSLREGNLQENIFSFPWLRDK-PVVLGSRFKDAFQDPDAYLAVE-PFF 404
           + +  +LK G +      NL+E +   P L  + P+ LG         P +YLA   P  
Sbjct: 290 IVLHHYLKEGKILDPEAANLREPL--LPGLGSQAPISLGV--------PLSYLASSVPEL 339

Query: 405 EKER------YVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           E+ +      Y++ +    G +  +L ++A S D+++A  HA +L
Sbjct: 340 EQLKKDNHSLYLLGWREDTGSLAVVLHERASSVDMIRACVHAEML 384


>gi|307205774|gb|EFN84004.1| UPF0420 protein C16orf58 [Harpegnathos saltator]
          Length = 466

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 107/395 (27%), Positives = 194/395 (49%), Gaps = 26/395 (6%)

Query: 107 FKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMG 166
           F  N +++ S +S Q      +SF++   +P G+PDSV   Y+ Y  W  ++ F    MG
Sbjct: 24  FSVNEVKSESTES-QDIYFRFVSFVKEVFLPQGYPDSVRADYLAYQIWDTVQAFASTIMG 82

Query: 167 VFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAG 225
             TT +++  +GV  + ATP A AI WILK G G +G ++FA   G + D   K+ R   
Sbjct: 83  TLTTHSVMQGIGVGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTQLDGQCKKWRLFA 142

Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
           D+L ++  G+EL    +    +P+ C + + K+V  V   +TR  + +  A   N+ DV+
Sbjct: 143 DILNDIAMGLELMVPYLSPYSVPILCTSTIMKSVVGVAGGATRAALTQHQAIRNNLADVS 202

Query: 286 AKGE----CVGNIADLLGTG-LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
           AK      CV  IA  +G   LS+    R   LI  +  L   +++++Y  VK++ L TL
Sbjct: 203 AKDSSQETCVNLIASFVGIFVLSLFHDGR--YLIELYLFLVAVHLYANYSAVKALCLDTL 260

Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKP--------VVLGSRFKDAFQ 392
           N  R ++ +++++    +   ++ N +E++F    L   P        + +G+ F    +
Sbjct: 261 NEDRLALIIKNYMMNERISEPQKINKEESVF----LLGNPSKEICGFNIKIGASFASLLK 316

Query: 393 -DPDAYLAVE---PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL-LHCI 447
            +  + + +E    FFE  RY++T +     ++ +LK   K  ++L+  F+A++  L+  
Sbjct: 317 RNVISSMEMEFLLKFFEDRRYLITIDIETRTIFIILKKDIKPIEVLEGYFYAFMCGLYIC 376

Query: 448 NSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGES 482
            +     +L L+   ++ S  +L T  L      S
Sbjct: 377 MAKQISVNLLLRSETFEISYPLLNTYILHKKFANS 411


>gi|225579041|ref|NP_663565.2| UPF0420 protein C16orf58 homolog [Mus musculus]
 gi|74204001|dbj|BAE29003.1| unnamed protein product [Mus musculus]
 gi|148685697|gb|EDL17644.1| cDNA sequence BC017158, isoform CRA_b [Mus musculus]
          Length = 466

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 114/386 (29%), Positives = 189/386 (48%), Gaps = 24/386 (6%)

Query: 72  LVEDQFVDSSLRLICSEDIDG-------RRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
           L  +QF   + R  CS   DG       RRW +++         +    A     P A A
Sbjct: 11  LCTEQFGSGAPR-GCSAAADGSLQWDGARRWGWLSR----APIAKPGQHAGGGGGPWA-A 64

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
              +S +RS ++P GFPDSV+P Y+PY  W +++ F     G   TQ +L  +GV   +A
Sbjct: 65  LTALSGLRSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKA 124

Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           +  A    W++KD  G +G++IFA  +G K D + KQ R   D+L ++   +E+     P
Sbjct: 125 SVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYP 184

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGT 300
             F      +N+AK +  V   +TR  +    A+  N+ DV+AK    E V N+A LL +
Sbjct: 185 IFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVS 244

Query: 301 GLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
            L + +    PSL +  F LL+  +++++Y+ V+++VL TLN +R  + +E FL+ G V 
Sbjct: 245 LLMLPLVSDCPSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVL 304

Query: 360 SLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSK 417
                N  E +++  W    P +  S          +   ++   E   E Y++ +N S+
Sbjct: 305 EPASANQMEPLWTGFW----PSLSLSLGVPLHHLVSSVSELKQLVEGHHEPYLLCWNKSR 360

Query: 418 GKVYALLKDQAKSDDILKAAFHAYVL 443
            +V   L  +A  + +L+AA H  +L
Sbjct: 361 NQVRVALSQEAGPETVLRAATHGLIL 386


>gi|356495287|ref|XP_003516510.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
          Length = 649

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 106/337 (31%), Positives = 170/337 (50%), Gaps = 31/337 (9%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++P+GFP+SVT  Y+ Y  WRA++       GV  TQ+LL +VG+ +  A P A AINW+
Sbjct: 186 MLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG-AIPTAAAINWV 244

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K++ +  G+ FD D K  R   DLL     G+E+ T A P  F+ +   A 
Sbjct: 245 LKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAAFGLEMCTPAFPQFFVLIGAVAG 304

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI----MISKRN 310
            +++ A++   STR+  +  FA   N  +V AKGE  G  +  +G GL I     I    
Sbjct: 305 ASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLGIGLGNCIGSST 364

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           P ++ +F +L+  +++ + +  +S+ L TLN  R S+    +L +G  P ++E N +E +
Sbjct: 365 PLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 424

Query: 371 F-SFPWL------RDKPVVLGSRFKDA-----------------FQDPDAYLAVEPFFEK 406
           F + P L      + + +VL S  KDA                     +  LA+   ++ 
Sbjct: 425 FPAVPILNATFANKAQSIVLSSEAKDAAAEIEHRLQLGSKLSEIVNSKEDVLALFGLYKN 484

Query: 407 ERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           E Y+++     GK   +LK+     D+LKA F    L
Sbjct: 485 EGYILS--EYMGKFCVVLKENCSQQDMLKALFQVNYL 519


>gi|328872765|gb|EGG21132.1| DUF647 family protein [Dictyostelium fasciculatum]
          Length = 534

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 96/291 (32%), Positives = 149/291 (51%), Gaps = 8/291 (2%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           ++L    +   +P G+P SVT  Y  YM W  L++  G    V +T  LL SVGV  + +
Sbjct: 109 QKLTKMFQKLFLPKGYPQSVTHDYSQYMKWMFLQNTLGSTTYVLSTHALLTSVGVGLSTS 168

Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
            P A AI+W+LKDG G    ++FA +     DY+LK  +F GD+L  +G  +E+ T  VP
Sbjct: 169 LPFAAAISWVLKDGLGASALVLFASKYSTSLDYNLKSFKFRGDILHNVGVMLEMLTPFVP 228

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
             FLPLA  +N++K +A +   +TR  + ++F+  EN+GD+TAK +     + L G GL 
Sbjct: 229 GYFLPLASVSNLSKGLAGLIYGATRASLNRSFSLQENLGDITAKYQSQSMASYLTGMGLG 288

Query: 304 IMISKRNPSLITTFAL-----LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
             I      + TT  L     +S  ++  S + +KS+ L TLN  R  + ++ +L+   +
Sbjct: 289 TTIGLLMSPMTTTANLCAVFIISLLHMSCSVKSLKSICLKTLNSQRSKIIIDHWLREKII 348

Query: 359 PSLREGNLQENIFSFPWLRDKPVV-LGSRFKDAFQDPDAYLAVEPFFEKER 408
            S    N  EN F F    D P +  GS     F+  D   +V  + +  R
Sbjct: 349 LSPEIVNKNEN-FMFSSSTDTPKIHFGSSIDQCFKGQDEIDSVMEYIQNNR 398


>gi|74201602|dbj|BAE28428.1| unnamed protein product [Mus musculus]
          Length = 466

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 114/386 (29%), Positives = 189/386 (48%), Gaps = 24/386 (6%)

Query: 72  LVEDQFVDSSLRLICSEDIDG-------RRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
           L  +QF   + R  CS   DG       RRW +++         +    A     P A A
Sbjct: 11  LCTEQFGSGAPR-GCSAAADGSLQWDGARRWCWLSR----APIAKPGQHAGGGGGPWA-A 64

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
              +S +RS ++P GFPDSV+P Y+PY  W +++ F     G   TQ +L  +GV   +A
Sbjct: 65  LTALSGLRSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKA 124

Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           +  A    W++KD  G +G++IFA  +G K D + KQ R   D+L ++   +E+     P
Sbjct: 125 SVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYP 184

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGT 300
             F      +N+AK +  V   +TR  +    A+  N+ DV+AK    E V N+A LL +
Sbjct: 185 IFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVS 244

Query: 301 GLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
            L + +    PSL +  F LL+  +++++Y+ V+++VL TLN +R  + +E FL+ G V 
Sbjct: 245 LLMLPLVSDCPSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVL 304

Query: 360 SLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSK 417
                N  E +++  W    P +  S          +   ++   E   E Y++ +N S+
Sbjct: 305 EPASANQMEPLWTGFW----PSLSLSLGVPLHHLVSSVSELKQLVEGHHEPYLLCWNKSR 360

Query: 418 GKVYALLKDQAKSDDILKAAFHAYVL 443
            +V   L  +A  + +L+AA H  +L
Sbjct: 361 NQVRVALSQEAGPETVLRAATHGLIL 386


>gi|330805147|ref|XP_003290548.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
 gi|325079335|gb|EGC32940.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
          Length = 493

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 94/276 (34%), Positives = 154/276 (55%), Gaps = 9/276 (3%)

Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV-GVSRNRATPGAVA 190
           +SY +P G+P+SV+P Y+ Y  W   ++  G    V +T  LL +V G+S   A P A A
Sbjct: 97  KSYFLPKGYPNSVSPDYIHYYKWLFCQNVVGSVTYVLSTHALLTTVVGMSVASALPFAAA 156

Query: 191 INWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           I+W+LKDG G    + FA +     D+DLK+ +F GD+L  LG  +E+ T  +   FLPL
Sbjct: 157 ISWVLKDGLGAFALVFFANKFSTLLDFDLKKYKFRGDILHNLGVLLEMCTPFIQGYFLPL 216

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL----SIM 305
           A  +N++K +A +   STR  + K+F+  +NIGD+TAK +     A L+G G+     ++
Sbjct: 217 ASISNLSKGLAGLIYGSTRASLNKSFSIKDNIGDITAKYQSQSMAAYLMGMGIGSTFGLL 276

Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR-EG 364
           +S    S I+    LS  ++F  Y+ VKS+ L +LN+ R S+ ++++L    + + +   
Sbjct: 277 LSGSTFSGISIVFSLSFVHMFLGYKAVKSINLKSLNKQRLSIIIDNWLLNSEILNSKIVN 336

Query: 365 NLQENIFSFPWLRDKPVVLG--SRFKDAFQDPDAYL 398
           N +E IF   + ++  + LG  S  K+   +P   L
Sbjct: 337 NNEEFIFDSKYNKEPHIKLGLSSDDKEFLNNPQEIL 372


>gi|356497988|ref|XP_003517837.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 473

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 94/345 (27%), Positives = 177/345 (51%), Gaps = 18/345 (5%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVT  YVP+  W  L+        + +TQ LL+++GV    AT       W L
Sbjct: 93  VPEGFPSSVTADYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFL 152

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G +G ++F   QG   D + K  R   DL+ +LG  ++L +   P  F+ + C  +
Sbjct: 153 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGS 212

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR---NP 311
           ++++   V S +TR  + + FA  +N  D++AK      +A ++G  L +++++    +P
Sbjct: 213 ISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVARLTIGHP 272

Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
            ++  +F  L+  +++++Y+ V+ + L++LN  R S+ ++ F++TG V S ++ + QE++
Sbjct: 273 LAIWFSFLSLTVFHMYANYRAVRCLALNSLNPERSSILLQHFMETGQVLSPKQVSSQEHV 332

Query: 371 FSFPWLR---------DKPVVLGSRFK--DAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
               ++           K V LG R    D  +  +  L   P++ K +Y++     KG 
Sbjct: 333 LPIQFISWSSKKANCLHKKVCLGMRISSFDNMEIKEHLLCAAPYYTKAKYLLV--ERKGI 390

Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYD 464
           +  ++   + + D+LK+ FHA VL   ++ S    S   K  +Y 
Sbjct: 391 IDVIVHKDSNAADVLKSFFHALVLAKNVHKSKSLHSEGQKWIDYQ 435


>gi|226494744|ref|NP_001146435.1| uncharacterized protein LOC100280018 [Zea mays]
 gi|219887187|gb|ACL53968.1| unknown [Zea mays]
          Length = 439

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 113/380 (29%), Positives = 184/380 (48%), Gaps = 18/380 (4%)

Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
           D +G   R SI    +Q  +   ++L+    +   P G+P SV   Y+ Y  +R L+HF 
Sbjct: 40  DAAGSGSR-SISVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFS 98

Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
              + V +TQ+LL + G+   R TP  A A++WILKDG    GK+I +  G + D + K 
Sbjct: 99  SAMLHVLSTQSLLFAAGL---RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKS 155

Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
                D+L +LG  +E+ +   P LFL +A   N AK +A V + +TR PIY +FAK  N
Sbjct: 156 WTILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 215

Query: 281 IGDVTAKGECVGNIADLLGTGLSI-----MISKRNPSLITTFALLSCGYVFSSYQEVKSV 335
           + D+ AKGE +  + +++G G  I     + S     LI    LLS  +++   QE+++ 
Sbjct: 216 LSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG-PLLSAVHIYGVIQEMRAT 274

Query: 336 VLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP---WLRDKPVVLGSRFKDAFQ 392
            ++TLN  R ++ V  F+K+G V S  E   +E++  FP         V +G   +  F 
Sbjct: 275 PVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLL-FPNRVIEEAGSVKIGQPLRKVF- 332

Query: 393 DPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNG 452
            P     +   F  E+++++      + Y +L+  A  +D L+    A        S  G
Sbjct: 333 SPRLVEQLRATFPGEKFLLSQK--SNRTYMVLERSASGEDALRGWLVAAFASEMERSGAG 390

Query: 453 HRSLALKQHEYDNSNLMLTT 472
            R  A+    Y+    +  T
Sbjct: 391 SRDDAVLSEAYEKMERVFPT 410


>gi|81879541|sp|Q91W34.1|CP058_MOUSE RecName: Full=UPF0420 protein C16orf58 homolog
 gi|16877856|gb|AAH17158.1| CDNA sequence BC017158 [Mus musculus]
          Length = 466

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 113/386 (29%), Positives = 188/386 (48%), Gaps = 24/386 (6%)

Query: 72  LVEDQFVDSSLRLICSEDIDG-------RRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
           L  +QF   + R  CS   DG       RRW +++         +    A     P A A
Sbjct: 11  LCTEQFGSGAPR-GCSAAADGSLQWDGARRWGWLSR----APIAKPGQHAGGGGGPWA-A 64

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
              +S +RS ++P GFPDSV+P Y+PY  W +++ F     G   TQ +L  +GV   +A
Sbjct: 65  LTALSGLRSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKA 124

Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           +  A    W++KD  G +G++I A  +G K D + KQ R   D+L ++   +E+     P
Sbjct: 125 SVSAATSTWLVKDSTGMLGRIILAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYP 184

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGT 300
             F      +N+AK +  V   +TR  +    A+  N+ DV+AK    E V N+A LL +
Sbjct: 185 IFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVS 244

Query: 301 GLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
            L + +    PSL +  F LL+  +++++Y+ V+++VL TLN +R  + +E FL+ G V 
Sbjct: 245 LLMLPLVSDCPSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVL 304

Query: 360 SLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSK 417
                N  E +++  W    P +  S          +   ++   E   E Y++ +N S+
Sbjct: 305 EPASANQMEPLWTGFW----PSLSLSLGVPLHHLVSSVSELKQLVEGHHEPYLLCWNKSR 360

Query: 418 GKVYALLKDQAKSDDILKAAFHAYVL 443
            +V   L  +A  + +L+AA H  +L
Sbjct: 361 NQVQVALSQEAGPETVLRAATHGLIL 386


>gi|383859063|ref|XP_003705017.1| PREDICTED: UPF0420 protein C16orf58-like [Megachile rotundata]
          Length = 464

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 97/332 (29%), Positives = 164/332 (49%), Gaps = 24/332 (7%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
            +S ++   +P G+PDSV P Y  Y  W  ++ F    MG  TT +++  VGV    ATP
Sbjct: 45  FVSILKEVFLPQGYPDSVHPDYTAYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEPAATP 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
            A AI WILKDG G +G+++FA   G   D   K+ R   D+L ++  G+EL        
Sbjct: 105 LAAAITWILKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDVAMGLELLLPYYSSY 164

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTG 301
            L + C +   K++  V   +TR  + +  A   N+ DV+AK      CV  IA L+G  
Sbjct: 165 SLAILCVSTAMKSIVGVAGGATRAAVTQHQALQNNLADVSAKDGSQETCVNLIASLVGI- 223

Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
           L + I      ++  +  L   +++++Y  VKS+ L++LN  R ++ V+S++    +P  
Sbjct: 224 LILSIFHNGRYIMELYFFLIIVHLYANYSAVKSLCLNSLNEDRLALIVKSYITNEVIPEP 283

Query: 362 REGNLQENIFSFPWLRDKP------------VVLGSRFK-DAFQDPDAYLAVEPFFEKER 408
            E N +E++     L  KP            V L +  K +     D  L+++ F ++ +
Sbjct: 284 EEVNKKESVL----LLTKPTMSIYGFNIKMGVSLAALIKNNIISTSDTELSLKLFLDR-K 338

Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
           Y+++ +     ++   K  A+  D+L+A FHA
Sbjct: 339 YLISIDVQNKTIFICFKKDAQPCDVLEAYFHA 370


>gi|255639094|gb|ACU19847.1| unknown [Glycine max]
          Length = 302

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 90/248 (36%), Positives = 141/248 (56%), Gaps = 9/248 (3%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +RA++H    A+ V +TQ+LL + G+   R TP  
Sbjct: 47  SFMNKFF-PSGYPYSVNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGL---RPTPAQ 102

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++WILKDG   VGK+I +  G + D + K+ R   D L ++G G+E+ +   PHLFL
Sbjct: 103 ATAVSWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFL 162

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM-- 305
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE    + +++G G+ I   
Sbjct: 163 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLA 222

Query: 306 --ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
             I       +    LLS  +++S  +E+++  ++TLN  R ++ V  FLK G V S  +
Sbjct: 223 STICASMQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPAD 282

Query: 364 GNLQENIF 371
              +EN+ 
Sbjct: 283 LRYRENLL 290


>gi|115459438|ref|NP_001053319.1| Os04g0517300 [Oryza sativa Japonica Group]
 gi|113564890|dbj|BAF15233.1| Os04g0517300 [Oryza sativa Japonica Group]
 gi|116310213|emb|CAH67223.1| OSIGBa0145M07.5 [Oryza sativa Indica Group]
 gi|215697660|dbj|BAG91654.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 108/347 (31%), Positives = 180/347 (51%), Gaps = 22/347 (6%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +RAL+HF    + V +TQ+LL + G+   R TP  
Sbjct: 62  SFLNKFF-PSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGL---RPTPAQ 117

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++WILKDG    GK+I +  G + D + K  R   D+L + G  +E+ +   P LFL
Sbjct: 118 ATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFL 177

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI--- 304
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I   
Sbjct: 178 EVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 237

Query: 305 --MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
             + S     LI    LLS  +++   QE+++  ++TLN  R ++ V  F+K+G V S  
Sbjct: 238 STVCSTTQGKLIAG-PLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPA 296

Query: 363 EGNLQENIFSFP---WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
           E   +E++  FP         V +G   +     P     ++  F KE+++++   +   
Sbjct: 297 ELRYREDLL-FPNRLIEEAGSVKIGQPVRRVL-SPQRIEQLKATFSKEKFLLSRKDNSA- 353

Query: 420 VYALLKDQAKSDDILK----AAFHAYVLLHCINSSNGHRSLALKQHE 462
            Y +L+  A  +D L+    AAF + +    + S +   ++A ++ E
Sbjct: 354 -YMVLEQSATGEDALRGWLVAAFASEMERSGVGSGDTVLNVAYERME 399


>gi|357164706|ref|XP_003580140.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
           distachyon]
          Length = 437

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 102/339 (30%), Positives = 171/339 (50%), Gaps = 14/339 (4%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +RAL+HF    + V +TQ+LL + G+   R TP  
Sbjct: 66  SFLNKFF-PSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGL---RPTPAQ 121

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++WILKDG    GK+I +  G + D + K  R   D+L + G  ++  +   P LFL
Sbjct: 122 ATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRIFADVLYDFGTALDFISPLCPQLFL 181

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL----GTGLS 303
            +A   N AK +A V + +TR PIY AFAK  N+ D+ AKGE +  +++++    G GLS
Sbjct: 182 EVAGLGNFAKGMAVVAARATRLPIYSAFAKEGNLSDLFAKGEAISTLSNVMGIGAGIGLS 241

Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
             +       +    LLS  +++   QE++++ +++LN  R ++ V  F+K+G V S  E
Sbjct: 242 STVCSTTQGKLIIGPLLSAVHLYGVVQEMRAIPINSLNPQRTAMVVADFIKSGKVSSPAE 301

Query: 364 GNLQENIFSFP--WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVY 421
              +E++  FP   + +   V   +       P     +   +  E+++++   +  K Y
Sbjct: 302 LRYREDLL-FPNRLIEEAGSVKVGQPLCRILSPQRVEQLRATYPNEKFLLSLKSN--KTY 358

Query: 422 ALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ 460
            +L+  A  +D L+    A        S  G R   L +
Sbjct: 359 MVLEQSATGEDALRGWLVAAFASEMEKSGVGSRDSVLNE 397


>gi|38568058|emb|CAE05447.3| OSJNBa0073E02.7 [Oryza sativa Japonica Group]
 gi|125549033|gb|EAY94855.1| hypothetical protein OsI_16651 [Oryza sativa Indica Group]
 gi|125590999|gb|EAZ31349.1| hypothetical protein OsJ_15471 [Oryza sativa Japonica Group]
          Length = 380

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 108/347 (31%), Positives = 180/347 (51%), Gaps = 22/347 (6%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
           SF+  +  P G+P SV   Y+ Y  +RAL+HF    + V +TQ+LL + G+   R TP  
Sbjct: 4   SFLNKFF-PSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGL---RPTPAQ 59

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A A++WILKDG    GK+I +  G + D + K  R   D+L + G  +E+ +   P LFL
Sbjct: 60  ATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFL 119

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI--- 304
            +A   N AK +A V + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I   
Sbjct: 120 EVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 179

Query: 305 --MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
             + S     LI    LLS  +++   QE+++  ++TLN  R ++ V  F+K+G V S  
Sbjct: 180 STVCSTTQGKLIAG-PLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPA 238

Query: 363 EGNLQENIFSFP---WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
           E   +E++  FP         V +G   +     P     ++  F KE+++++   +   
Sbjct: 239 ELRYREDLL-FPNRLIEEAGSVKIGQPVRRVL-SPQRIEQLKATFSKEKFLLSRKDNSA- 295

Query: 420 VYALLKDQAKSDDILK----AAFHAYVLLHCINSSNGHRSLALKQHE 462
            Y +L+  A  +D L+    AAF + +    + S +   ++A ++ E
Sbjct: 296 -YMVLEQSATGEDALRGWLVAAFASEMERSGVGSGDTVLNVAYERME 341


>gi|42565615|ref|NP_190175.2| uncharacterized protein [Arabidopsis thaliana]
 gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis thaliana]
 gi|30794095|gb|AAP40490.1| unknown protein [Arabidopsis thaliana]
 gi|110739240|dbj|BAF01534.1| hypothetical protein [Arabidopsis thaliana]
 gi|332644566|gb|AEE78087.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 608

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 98/355 (27%), Positives = 178/355 (50%), Gaps = 35/355 (9%)

Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
           +P+    +  + +  +++P+GFP+SVT  Y+ Y  WR ++       GV  TQ+LL +VG
Sbjct: 182 TPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVG 241

Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
           + +  A P A AINW+LKDG G + K++ ++ G+ FD   K  R   DLL     G+E+ 
Sbjct: 242 LGKG-AIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEML 300

Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
           T   P  F+ +  AA   ++ AA+   +TR+     FA   N  +V AKGE  G ++  +
Sbjct: 301 TPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSV 360

Query: 299 GTGLSIMISK---RNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
           G  L I+++     + SL +  F +++  +++++ +  + + L TLN  R S+    +L 
Sbjct: 361 GILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLI 420

Query: 355 TGHVPSLREGNLQENIFSFPWLR--------------------------DKPVVLGSRFK 388
           +G  P ++E N +E +  FP +R                          ++ + LGS+  
Sbjct: 421 SGQAPLIKEVNDEEPL--FPTVRFSNMKSPEKLQDFVLSSEAKAAAADIEERLQLGSKLS 478

Query: 389 DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           D   + +  +A+   +  E Y++T +  KG+   +LK+ +   D+L++ F    L
Sbjct: 479 DVIHNKEEAIALFDLYRNEGYILTEH--KGRFCVMLKESSTPQDMLRSLFQVNYL 531


>gi|322787029|gb|EFZ13253.1| hypothetical protein SINV_10096 [Solenopsis invicta]
          Length = 468

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 99/344 (28%), Positives = 161/344 (46%), Gaps = 24/344 (6%)

Query: 115 ISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLL 174
           +SL    A    + S  +   +P G+PDSV P Y  Y  W  ++ F     G  TT +++
Sbjct: 31  LSLTESLAIRSRITSVFKQIFLPQGYPDSVHPDYTAYQIWDTVQAFASTITGTLTTHSIM 90

Query: 175 NSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGA 233
             +GV  + ATP A AI WILK G G +G ++FA   G K D   K+ R   D+L ++  
Sbjct: 91  KGIGVGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTKLDGQCKKWRLFADILDDIAK 150

Query: 234 GVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----E 289
           G+EL           + C +   K++  V   +TRT +    A   N+ DV+AK      
Sbjct: 151 GIELLVPFFSSYSTVILCISTTMKSIVGVAGGATRTALINHQAIQNNVADVSAKDGSQET 210

Query: 290 CVGNIADLLGTG-LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
           CV  IA   G   LS     R   L+  +  L   +++++Y  VK++ L TLN  R ++ 
Sbjct: 211 CVNLIASFFGIFILSFFHDGR--YLLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAII 268

Query: 349 VESFLKTGHVPSLREGNLQENIFSFPWLRDKP--------VVLGSRFKDAFQDPDAYLA- 399
           V++++    +P   + N +E+IF    L   P        + +G  F    +     L  
Sbjct: 269 VKNYMINEQIPKPGKVNKEESIF----LLTNPGRNVHGFNIKIGVSFTSILKRNIISLTE 324

Query: 400 ---VEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
              +  FFE  +Y++T +  K  ++ +LK   +S +IL+  F+A
Sbjct: 325 MEFLLKFFENRKYIITIDIEKKNIFIILKKDIQSIEILEGYFYA 368


>gi|348510102|ref|XP_003442585.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oreochromis niloticus]
          Length = 433

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 174/363 (47%), Gaps = 36/363 (9%)

Query: 95  WNYVA-----ERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYV 149
           W YV+     ER R G                +    +    +S  +P G+P+SV+  Y+
Sbjct: 20  WKYVSKDGVMERRRDGS------------EGHSGGNSVFGVFKSVFLPQGYPESVSDDYL 67

Query: 150 PYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA- 208
            Y  W  ++ F     G   TQ  L   GV    AT  A  + W+ KDG G +G+++FA 
Sbjct: 68  QYQFWDTMQAFSSSLSGTLATQASLRGFGVGNQEATVAAATVTWLFKDGTGMLGRILFAW 127

Query: 209 RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTR 268
           R+G K D + K+ R   D+L ++   +E+     P  F  + C A + K++  V   +TR
Sbjct: 128 RKGSKLDSEAKKWRLFADVLNDIAMFMEILAPYFPACFTFIVCTAGIFKSIVGVAGGATR 187

Query: 269 TPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCGY 324
             +    A+ +N+ D++AK    E + N+A LL + + I +   NP L +  F L +  +
Sbjct: 188 AALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLILIPLVTDNPVLTLCLFFLFTVLH 247

Query: 325 VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG 384
           +F++Y+ V+SVV+ T N AR S+ ++ +LK   + S  E N +E +F F + +  P+ LG
Sbjct: 248 LFANYKAVRSVVMETFNEARLSIVLQQYLKDKRILSPPEANQREPVF-FEFGQTVPMKLG 306

Query: 385 SRFKDAFQDP-DAYLAVE----PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFH 439
            R ++  Q P D  LA++    P+    R         G V   L  +A + D ++A   
Sbjct: 307 VRLQEIAQSPEDLQLALKENSMPYLLGIR--------NGCVCVCLGPEASASDEIRAMCQ 358

Query: 440 AYV 442
           A V
Sbjct: 359 AVV 361


>gi|157422883|gb|AAI53381.1| Zgc:162613 protein [Danio rerio]
          Length = 432

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 161/323 (49%), Gaps = 11/323 (3%)

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
           A  +    +S  +P G+P+SV+  Y+ Y  W  ++ F     G   TQ  L  VGV    
Sbjct: 41  ANSVSGVFKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQE 100

Query: 184 ATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           AT  A  I W+L+DG G +G+++FA  +G K D + K+ R   D+L ++   +E+A    
Sbjct: 101 ATVAAATITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHF 160

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLG 299
           P  F  + C A + K++  V   +TR  +    A+  N+ D++AK    E + N+A LL 
Sbjct: 161 PPFFTLIVCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLV 220

Query: 300 TGLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
           +   I +   NP      F L +  ++F++Y+ V+SVV+ TLN AR S+ +  +L  G V
Sbjct: 221 SLALIPLVTDNPLLTFILFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLHQYLLHGQV 280

Query: 359 PSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDP-DAYLAVEPFFEKERYVVTYNPSK 417
            S  E N +E +F   + R  P+ LG R  D   +P D  LA++     + Y++      
Sbjct: 281 LSPAEANQREPVF-LSFRRTVPIKLGVRLGDIIHEPGDLQLALKN--NSKPYLIGIK--N 335

Query: 418 GKVYALLKDQAKSDDILKAAFHA 440
           G V   L   A  DD +KA   A
Sbjct: 336 GAVCVCLGSDASVDDEIKAVCQA 358


>gi|356545865|ref|XP_003541354.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 431

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 110/376 (29%), Positives = 187/376 (49%), Gaps = 26/376 (6%)

Query: 103 RSGKFKRNSIRAISLQSPQAPAEELM--SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHF 160
           R  +F+ N   +++L     P    +  SF+  +  P G+P SV   Y+ Y  +RAL+H 
Sbjct: 36  RRFQFEPNGQLSVTLVGDSRPLYHRIVDSFLNKFF-PSGYPYSVNEGYLRYTQFRALQHT 94

Query: 161 FGGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLK 219
              A+ V +TQ+LL + G+   R TP  A A++W+LKDG   +G +I ++ G + D + K
Sbjct: 95  ASAALSVLSTQSLLFAAGL---RPTPAQATAVSWVLKDGMQHLGNIICSKLGARMDSEPK 151

Query: 220 QLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGE 279
           + R   D+L +LG G+E+ +   P  FL +A   N+AK ++ V S +TR PIY +FAK  
Sbjct: 152 RWRILADVLYDLGTGLEVLSPLCPQFFLEMAGIGNLAKGMSIVASRATRLPIYSSFAKEG 211

Query: 280 NIGDVTAKGECVGNIADLLGTGLSIMISK----RNPSLITTFALLSCGYVFSSYQEVKSV 335
           N+ D+ A+GE +  + +++G G+ I ++          +    LLS  +++S  +E+++ 
Sbjct: 212 NLSDLFARGEAISTLFNVVGIGIGIQLASTVCASMQGKLVVGPLLSIIHIYSVSEEMRAT 271

Query: 336 VLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP--WLRDKPVVLGSRFKDAFQD 393
            ++TLN  R ++ V  FLK G V S  +   +E++  FP   + D   V   R       
Sbjct: 272 PVNTLNPQRTAMIVADFLKAGSVSSPADLRYREDLL-FPRRLIEDAGNVRVGRDLHKVIK 330

Query: 394 PDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGH 453
           P   L  +  F  E++++  N     +  +L+  A   D L+    A   +    SS   
Sbjct: 331 PSRLLESKQVFPGEKFIL--NGDNRCIDMVLEQDAIGKDALRGWLVASYAVQIGKSS--- 385

Query: 454 RSLALKQHEYDNSNLM 469
                  HE   S L+
Sbjct: 386 -------HELSTSTLL 394


>gi|380030107|ref|XP_003698699.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
           [Apis florea]
          Length = 464

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 101/332 (30%), Positives = 163/332 (49%), Gaps = 24/332 (7%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L+S I+   +P GFPDSV P Y  Y  W  ++ F    MG  TT +++  VGV    ATP
Sbjct: 45  LISIIKEVFLPQGFPDSVHPDYTSYQVWDTVQAFASTIMGTLTTHSIMQGVGVGEAAATP 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
            A AI WILKDG G VG+++FA   G   D   K+ R   D+L +L  G+EL        
Sbjct: 105 LAAAITWILKDGTGMVGRIMFAWWNGTDLDGQCKKWRLFADILNDLAMGLELLLPYFSSY 164

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTG 301
            L + C +   K++  V   +TR  +    A   N+ DV+AK      CV  IA  +G  
Sbjct: 165 SLGILCISTAMKSIVGVAGGATRAALTNIKALQNNLADVSAKDGSQETCVNLIASFVGI- 223

Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
           L + I      ++  +  L   +++++Y  VK++ L++LN  R ++ V+S++    +P  
Sbjct: 224 LILSIFHNGQYIMELYLFLVIIHLYANYSAVKALCLNSLNEDRLALIVKSYISNEVIPKP 283

Query: 362 REGNLQENIFSFPWLRDKPVVLGSRF-------------KDAFQDPDAYLAVEPFFEKER 408
            E N +E++     L  KP +    F             KD     D  L+++ F ++ +
Sbjct: 284 EEVNKKESVL----LLTKPTMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFLDR-K 338

Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
           Y+++ +     ++  LK  A+  D+L+A FHA
Sbjct: 339 YLISIDVQNKTIFICLKKDAQPYDVLEAYFHA 370


>gi|297815746|ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321594|gb|EFH52015.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 613

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 178/355 (50%), Gaps = 35/355 (9%)

Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
           +P+    +  + +  +++P+GFP+SVT  Y+ Y  WR ++       GV  TQ+LL +VG
Sbjct: 188 TPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVLATQSLLYAVG 247

Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
           + +  A P A AINW+LKDG G + K++ ++ G+ FD   K  R   DLL     G+E+ 
Sbjct: 248 LGKG-AIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEML 306

Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
           T   P  F+ +  AA   ++ AA+   +TR+     FA   N  +V AKGE  G ++  +
Sbjct: 307 TPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSM 366

Query: 299 GTGLSIMISK---RNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
           G  L I+++     + SL +  F +++  +++++ +  + + L TLN  R S+    +L 
Sbjct: 367 GILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLI 426

Query: 355 TGHVPSLREGNLQENIFSFPWLR--------------------------DKPVVLGSRFK 388
           +G  P ++E N +E +  FP +R                          ++ + LGS+  
Sbjct: 427 SGQAPLIKEVNDEEPL--FPTVRFLNMKSPEKLQDFVLSSEAKAAAEDIEERLQLGSKLS 484

Query: 389 DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           D   + +  +A+   +  E Y++T +  +G+   +LK+ +   D+L++ F    L
Sbjct: 485 DVIHNKEEAIALFDLYRNEGYILTEH--RGRFCVMLKESSTPQDMLRSLFQVNYL 537


>gi|298709448|emb|CBJ31354.1| putative glutamate/ornithine acetyltransferase (ISS) [Ectocarpus
           siliculosus]
          Length = 789

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 84/241 (34%), Positives = 131/241 (54%), Gaps = 21/241 (8%)

Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
            P+ VTP Y  +  WR  + F  G + VF TQ LL ++G+   R    A A++W+LKD  
Sbjct: 186 IPEDVTPDYYHFTFWRMFQRFVAGTINVFGTQALLLALGIKAKRLG-AAAAMSWVLKDAL 244

Query: 200 GRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
           G+ G++++A + G++FD D K+ RF   LL   G G+E+ T   P  FL LA  AN  K 
Sbjct: 245 GKFGRILWASKMGRRFDSDAKRWRFRSSLLYAAGNGLEIVTYVFPASFLVLAAMANSFKQ 304

Query: 259 VAAVTSTSTRTPIYKAFAKGENIGDVTAK---------------GECVGNIADLLGT--- 300
           ++ +TS++TR  IY++FA+GENIGD+TAK               GE    + D+LG    
Sbjct: 305 MSMLTSSATRNTIYRSFARGENIGDITAKACPPAAAMPNAKKHEGEAQIAVVDVLGMLSG 364

Query: 301 -GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
            GL  ++     +++  + +LS   + + Y E+++V    LN  R  + V  F+ +G  P
Sbjct: 365 IGLCTLVGTSRQTIVAAYVILSLVDISAIYNEIRAVCFSVLNHERTHLLVRDFVASGSDP 424

Query: 360 S 360
           S
Sbjct: 425 S 425


>gi|332030173|gb|EGI69967.1| UPF0420 protein C16orf58-like protein [Acromyrmex echinatior]
          Length = 471

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 92/345 (26%), Positives = 164/345 (47%), Gaps = 22/345 (6%)

Query: 113 RAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQT 172
           R   L   Q     + S  +   +P G+P+SV P Y  Y  W  ++ F     G  TT +
Sbjct: 35  RKFGLTESQTVHSRITSVFKQIFLPQGYPNSVHPDYTAYQIWDTVQAFASTITGTLTTHS 94

Query: 173 LLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMEL 231
           ++  +GV  + ATP A AI WILK G G +G ++FA   G + D   K+ R   D+L ++
Sbjct: 95  IMKGIGVGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTELDGQCKKWRLFADILDDI 154

Query: 232 GAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG--- 288
             G+EL         + + C +   K++  V   +TRT +    A   N+ DV+AK    
Sbjct: 155 AKGIELLVPYFSSYSVIILCISTTMKSIVGVAGGATRTALIHHQAIQNNVADVSAKDGSQ 214

Query: 289 -ECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSV 347
             CV  IA   G  + + +      ++  +  L   +++++Y  VK++ L TLN  R ++
Sbjct: 215 ETCVNLIASFFGIFI-LSLFHDGQYVLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAI 273

Query: 348 AVESFLKTGHVPSLREGNLQENIFSFPWLRDKP--------VVLGSRFKDAFQ-DPDAYL 398
            V+ ++    +P   + N +E++F    L   P        + +G  F +  + +  ++ 
Sbjct: 274 IVKKYMMNEQIPEPWKVNKEESVF----LLKNPTRDIHGFDIKIGVSFANILKRNIISFT 329

Query: 399 AVE---PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
            +E    FFE  +Y++  +  K  ++ +LK   +S +IL+A F+A
Sbjct: 330 EMEFLLKFFENRKYIILIDIEKRNIFVILKKDIQSVEILEAYFYA 374


>gi|350411769|ref|XP_003489447.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus impatiens]
          Length = 464

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 99/332 (29%), Positives = 164/332 (49%), Gaps = 24/332 (7%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
            +S ++   +P G+PDSV P Y PY  W  ++ F    MG  TT +++  VGV    ATP
Sbjct: 45  FISIVKEVFLPQGYPDSVHPDYTPYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEAAATP 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
            A AI WILKDG G +G+++FA   G   D   K+ R   D+L +L  G+EL        
Sbjct: 105 LAAAITWILKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDLAMGLELFLPYFSSY 164

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTG 301
            L + C +   K++  V   +TR  + +  A   N+ DV+AK      CV  IA  +G  
Sbjct: 165 SLGILCISTAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGI- 223

Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
           L + I      ++  +  L   +++++Y  VK++ L++LN  R ++ V+S++    +P  
Sbjct: 224 LILSIFHNGRYIMELYLFLVMIHLYANYMAVKALCLNSLNEDRLALIVKSYISNEVIPKP 283

Query: 362 REGNLQENIFSFPWLRDKPV--VLGSRFK-----------DAFQDPDAYLAVEPFFEKER 408
            E N +E++     L  KP   + G   K           D     D  LA++ F ++ +
Sbjct: 284 EEVNKKESVL----LLTKPTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDR-K 338

Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
           Y+++ +     ++  LK  A+  D+L+A FHA
Sbjct: 339 YLISIDVQNKTIFICLKKDAQPYDVLEAYFHA 370


>gi|410903033|ref|XP_003964998.1| PREDICTED: UPF0420 protein C16orf58 homolog [Takifugu rubripes]
          Length = 442

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 158/311 (50%), Gaps = 25/311 (8%)

Query: 95  WNYVA-----ERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYV 149
           W YVA     ER R G   R +               ++   +S  +P G+P+SV+  Y+
Sbjct: 21  WKYVANDGAIERRREGGESRGN--------------SVVGAFKSVFLPQGYPESVSGDYL 66

Query: 150 PYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA- 208
            Y  W  ++ FF    G   T+  L  VGV    AT  A  + W+LKDG G  G+++FA 
Sbjct: 67  RYQFWDTVQAFFSSLSGTLATRASLRGVGVGNQEATVAAATMTWLLKDGTGMCGRILFAW 126

Query: 209 RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTR 268
           R+G K D + K+ R   D+L ++   +E+     P  F  + C + + K++  V   +TR
Sbjct: 127 RKGTKLDSEAKKWRLFADILNDVAMFMEILAPYFPAFFTLVMCVSGIFKSLVGVAGGATR 186

Query: 269 TPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCGY 324
             +    A+ +N+ D++AK    E + N+A LL + + I     N +L +  F L +  +
Sbjct: 187 AALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLVLIRFVSDNAALTLGLFFLFAVLH 246

Query: 325 VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG 384
           +F++Y+ V+SVV+ TLN AR SV ++ +L+ G + S  E N +E +F   + +  P+ LG
Sbjct: 247 LFANYKAVRSVVMETLNEARLSVVLQQYLENGQILSPLEANEREPVF-MDFKKVVPIRLG 305

Query: 385 SRFKDAFQDPD 395
            R ++  Q  +
Sbjct: 306 VRLQEVIQSSE 316


>gi|405974674|gb|EKC39300.1| UPF0420 protein C16orf58-like protein [Crassostrea gigas]
          Length = 444

 Score =  151 bits (382), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 172/343 (50%), Gaps = 9/343 (2%)

Query: 106 KFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAM 165
           K +++ ++ + L SP+  +  L  F     +P G+P+SV+  Y+ Y  W  ++ F     
Sbjct: 18  KTEKDKLKTVDL-SPKWTS--LSQFFTVVFLPQGYPESVSSDYLQYQIWDTIQAFASSIT 74

Query: 166 GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFA 224
           G    Q LL  VGV    AT  A  + WILKDG G +G+++FA  QG   D D K+ R  
Sbjct: 75  GTLAAQALLKGVGVGDESATVMAATLTWILKDGTGMIGRILFAWMQGTNLDCDAKRWRLF 134

Query: 225 GDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV 284
            D+L ++   +E+        F P+ C A V K++  V   +TR  + +  A+  N+ DV
Sbjct: 135 ADILNDMAIFMEILAPYFKAYFTPIVCTAGVCKSIVGVAGGATRAALTQHQARRNNMADV 194

Query: 285 TAKG---ECVGNIADLLGT-GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
           +AK    E + N+A L+ +  L  +++ ++  + + F L +  ++F++Y  V SVV+ TL
Sbjct: 195 SAKDGSQETLVNLAALVCSLTLVPLVTGKDVLIWSLFLLFTLLHLFANYSAVTSVVMETL 254

Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAV 400
           N+AR  + V  +L+TG V SL+E N +E +  +   R   V LG+     F         
Sbjct: 255 NQARLHILVHHYLQTGEVLSLQEVNYKEPVL-WKTRRKLNVHLGTSIGKVFSRASILEES 313

Query: 401 EPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
              ++   Y++  N   G V  +L   + + D L + F A V+
Sbjct: 314 IALYKDSNYLLDINLQTGCVSIILSPLSTTMDHLTSCFQAEVI 356


>gi|452819618|gb|EME26673.1| glutamate N-acetyltransferase isoform 2 [Galdieria sulphuraria]
 gi|452819619|gb|EME26674.1| glutamate N-acetyltransferase isoform 1 [Galdieria sulphuraria]
          Length = 540

 Score =  151 bits (381), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 165/326 (50%), Gaps = 18/326 (5%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           E++ S + ++ +P+GFP+SV  SY+ Y  WR +++       V +TQ LL + GV R  +
Sbjct: 134 EQITSSLSTFFLPEGFPESVGDSYLQYSIWRGIQNIISNISQVLSTQALLLAAGVGRTHS 193

Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
              A A  W  KDG G+VG++  A  G ++D D K+ R   DLL ++G  +E+ + A   
Sbjct: 194 ETLAAATAWAWKDGIGQVGRLFTAVLGNQYDADPKRWRLVSDLLYDIGLSLEILSPAFSK 253

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
            FL +A   N++K++A     S R  + K F + EN+G+++AK +    +  +LGT + +
Sbjct: 254 YFLFMAALGNISKSIAFAIGISCRYSVLKTFVRRENLGEISAKNDAQNVVTGMLGTLIGV 313

Query: 305 MISKRNPSL----ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            +SK  P+     +  F   +  Y F +Y+ ++   L T+NR R ++   S+L   +VPS
Sbjct: 314 WMSKCLPAAPKVRLGVFLFFTAIYSFFNYKSMQVAELTTINRQRGNILAHSYLMFQNVPS 373

Query: 361 LREGNLQE-------NIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTY 413
           +   N  E        +F +P +R     LG    +     +        F KE+Y+++ 
Sbjct: 374 VSMSNSYERFVPLFPTVFQYPHVR-----LGVGLTELSISIEKIANYTRQFSKEKYILSL 428

Query: 414 NPSKGKVYALLKDQAKSDDILKAAFH 439
             S G++   L    +S D+L+   H
Sbjct: 429 --SDGEIRVALHRNHESVDLLQLLLH 452


>gi|432867528|ref|XP_004071227.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oryzias latipes]
          Length = 423

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 99/329 (30%), Positives = 162/329 (49%), Gaps = 11/329 (3%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L+   +S  +P G+P+SV+  Y+ Y  W  L+ F         TQ  L  VGV    AT 
Sbjct: 41  LVGAFKSVFLPQGYPESVSEDYLQYQFWDTLQAFASTLSWTLVTQASLKGVGVGNQDATV 100

Query: 187 GAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
            A  + WILKDG G  G+++FA Q G + D + K+ R   D+L ++   +E+     P  
Sbjct: 101 AAATVTWILKDGTGMFGRILFAWQKGSRLDSEAKKWRLFADVLNDIAMFIEILAPFFPSF 160

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGL 302
           F  + C A V K +  V   +TR  +    A+ +N+ D++AK    E + N+A L+ + +
Sbjct: 161 FTLIVCTAGVFKAIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLVVSLI 220

Query: 303 SIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
            I +   N SL +  F L +  ++F++Y+ V+SVV+ T N+AR S+ ++ +LK   + S 
Sbjct: 221 LIPLITDNSSLSLGLFFLFTILHLFANYKAVRSVVMETFNQARLSIVLQQYLKDKRILSP 280

Query: 362 REGNLQENIFSFPWLRDKPVVLGSRFKDAFQDP-DAYLAVEPFFEKERYVVTYNPSKGKV 420
            E N +E +F   +    P+ LG + +D  Q P D +LA+                 G +
Sbjct: 281 FEANRREPVF-LEFKITTPIKLGVKLQDVVQSPEDFHLAL----RNNSMAYLLGVRNGCI 335

Query: 421 YALLKDQAKSDDILKAAFHAYVLLHCINS 449
              L   A   D ++AA  A  L + + S
Sbjct: 336 CVCLGSGASVHDEIRAACQALCLSNMLKS 364


>gi|281201954|gb|EFA76161.1| DUF647 family protein [Polysphondylium pallidum PN500]
          Length = 430

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/381 (26%), Positives = 179/381 (46%), Gaps = 33/381 (8%)

Query: 117 LQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNS 176
           +QS  +     M  +RS  +P G+PDSVT  Y+ Y  W   +       GV  T+ +L  
Sbjct: 33  VQSKISKHSGAMELLRSMFLPAGYPDSVTSDYMSYQVWDTAQALSSSLSGVLATRAMLRG 92

Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
            GV    A+ G+  + WIL+DGAG +G++ FA + G++ D + K  RF  D+L ++G  +
Sbjct: 93  YGVGNAAASVGSAVLQWILRDGAGMIGRIYFAWKTGRQLDCNAKTWRFMADILNDIGMTL 152

Query: 236 ELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECV 291
           E+ +     LF+PL+    V+K++  V    T+  +   FAK +N+GDV+AK       V
Sbjct: 153 EMLSPLSNVLFMPLSVCGIVSKSICGVAGGCTKASLTMHFAKNDNMGDVSAKDGSQETAV 212

Query: 292 GNIADLLGTGLSIMISKRNPSLITT-------FALLSCGYVFSSYQEVKSVVLHTLNRAR 344
           G +   LG  LS  I   +    T        F +L   +++ +Y+ V  VVL ++NR R
Sbjct: 213 GLVGMFLGAALSASIPDESEQTTTATVAIWMLFFILIAFHLYCNYRAVTGVVLTSINRYR 272

Query: 345 FSVAVESFLKTGHVPSLREGNLQENIFSFPWLRD--KPVVLGSRFKDAFQDPDAYLAVEP 402
            ++  + +L+ G +P+  + + +ENI      RD  +P+ LG+   +           + 
Sbjct: 273 ATLIYKHYLQYGEIPTPIQVSKRENIL----FRDDKQPIYLGASLLNIINLNKQQQQQQQ 328

Query: 403 FFE--------------KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCIN 448
             E              + RY+  +   + ++Y  LK    + D+++A  ++  +LH   
Sbjct: 329 SKEIRESKTNSIIRHKLENRYIDIWQFQQ-EIYISLKVHCTAVDLIQAYLYSVAILHQTK 387

Query: 449 SSNGHRSLALKQHEYDNSNLM 469
             N    + L   +  N  L+
Sbjct: 388 QQNNRAFIDLVTSKSFNDRLL 408


>gi|345492163|ref|XP_001602603.2| PREDICTED: UPF0420 protein C16orf58 homolog [Nasonia vitripennis]
          Length = 466

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 100/332 (30%), Positives = 159/332 (47%), Gaps = 24/332 (7%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGA 188
           SF +   +P GFPDSV   YVPY  W  L+ F     G  TT +++  VGV  + ATP A
Sbjct: 48  SFFKEVFLPQGFPDSVHKDYVPYQIWDTLQAFASTINGTLTTHSIMRGVGVGESTATPLA 107

Query: 189 VAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
            AI WILKDG G +G++IFA  QG   D   K+ R   D+L +    +E+    +     
Sbjct: 108 AAITWILKDGTGMIGRIIFAWWQGNNLDSQCKKWRLFADILNDAAMALEVTVPYISTFSS 167

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT--GLSIM 305
              C     K +  +   +TRT I +  A  +N+ DV+AK    G + +L G+  G+ I+
Sbjct: 168 YTLCLTTGMKAIVGIAGGATRTAIMQHHAIKDNMADVSAKEHSQGTLVNLAGSIVGILIL 227

Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
           +       I    LL   ++ S+Y  V S+ L+TLN  R  + +E ++    + ++ + N
Sbjct: 228 LLINEKFFIYVCLLLVIVHIVSNYFAVTSLELNTLNEDRLCLIIEDYILNHTMSNVEDIN 287

Query: 366 LQENIFSFPWLRDKP--------VVLGSRFKDAF-------QDPDAYLAVEPFFEKERYV 410
            +E++     L +KP        + +G  F+          +D    LA+   F++ +Y+
Sbjct: 288 SRESVII---LLEKPAKKIFGFDIEIGVSFESVLKMNLINTKDVRFLLAL---FQRRKYL 341

Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYV 442
              +    K++ +L   A  D ILKA FH  V
Sbjct: 342 PVMDLKAKKIHIILSKDADDDIILKAYFHCCV 373


>gi|66816549|ref|XP_642284.1| DUF647 family protein [Dictyostelium discoideum AX4]
 gi|60470350|gb|EAL68330.1| DUF647 family protein [Dictyostelium discoideum AX4]
          Length = 567

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 87/276 (31%), Positives = 151/276 (54%), Gaps = 9/276 (3%)

Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV-GVSRNRA 184
           +L   I+   +P G+P +VT  Y+ Y  W  +++  G    V +T  LL SV G+S   A
Sbjct: 118 KLFDSIKKLFLPKGYPSTVTSDYIGYQKWMFIQNVIGSTTYVLSTHALLTSVIGISSTSA 177

Query: 185 TPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
            P A AI+W+LKDG G    ++FA +    FDYD+K+ +F GDLL   G  +E+ T    
Sbjct: 178 LPFAAAISWVLKDGLGATALVLFANKYSTLFDYDIKKYKFRGDLLHNFGVLLEMFTPFFQ 237

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL----LG 299
             FLPLA  +N++K +A +   STR  + ++F+  ENIGD+TAK +     A L    +G
Sbjct: 238 QFFLPLASISNLSKGLAGLIYGSTRASLNRSFSLKENIGDITAKYQSQSMAAYLSGMAIG 297

Query: 300 TGLSIMISKRNPSLITTFAL---LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
           + + +++S    + ++  ++   LS  ++F     +KS+ L TLN+ R S+ ++++LK+ 
Sbjct: 298 SSIGMILSVFPLTSVSNLSMVFGLSFIHIFCGINAIKSIDLKTLNQQRTSILIDNWLKSN 357

Query: 357 HVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQ 392
            + + +  N +E   +        + LG   +++F+
Sbjct: 358 LILTSKIVNQEEKFITNDKYLKPNIKLGCTIEESFE 393


>gi|225423472|ref|XP_002274189.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
 gi|297738087|emb|CBI27288.3| unnamed protein product [Vitis vinifera]
          Length = 437

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 91/347 (26%), Positives = 175/347 (50%), Gaps = 22/347 (6%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP Y P+  W +L+        + +TQ LL+++GV    AT       W L
Sbjct: 56  VPEGFPSSVTPDYAPFQLWDSLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFL 115

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G +G ++F   QG   D + K  R   DL+ +LG  ++L +   P  F+ + C  +
Sbjct: 116 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVFIVCLGS 175

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK----RN 310
           ++++   V S +TR  + + FA   N  D++AK      +A ++G    +++++     +
Sbjct: 176 LSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMVGMAFGMLLARITMGHS 235

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
            ++  +F  L+  +++++Y+ V+ ++L TLN  R S+ +E F++TG V S +E +  E++
Sbjct: 236 VAIWFSFLSLTMFHMYANYRAVRCLILTTLNSERSSILIEHFMETGQVLSPQEVSKMEHV 295

Query: 371 ---FSFPWLRDKPVVLGSRFK--------DAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
              ++  W       L +R          D  +  D   + E  ++K +Y++     KG 
Sbjct: 296 LPLWASSWSLKNAKSLHTRIHLGVRVSSLDKLEVTDLLHSAESHYKKAKYLLL--ERKGI 353

Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSN----GHRSLALKQHE 462
           +  ++   + + D+L++  HA V+ + ++ S       RS   KQ+E
Sbjct: 354 ISVIMHKDSTASDVLQSFIHALVMANLVDKSIYIHLESRSWMDKQYE 400


>gi|354505561|ref|XP_003514836.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cricetulus griseus]
 gi|344256981|gb|EGW13085.1| UPF0420 protein C16orf58-like [Cricetulus griseus]
          Length = 466

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 182/361 (50%), Gaps = 17/361 (4%)

Query: 91  DG-RRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYV 149
           DG RRW +++    S   +     A     P A A   +S +R+ ++P GFPDSV+  Y+
Sbjct: 35  DGARRWGWLSGNPASEPGQD----AGGGGGPWA-ALPALSGLRAVLLPQGFPDSVSTDYL 89

Query: 150 PYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA- 208
           PY  W +++ F     G   TQ +L  +GV   +A+  A    W+LKD  G +G++IFA 
Sbjct: 90  PYQLWDSVQAFASSLSGSLATQAVLQGLGVGDAKASVSAATSTWLLKDSTGMLGRIIFAW 149

Query: 209 RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTR 268
            +G K D + KQ R   D+L ++   +E+     P  F      +N+AK +  V   +TR
Sbjct: 150 WKGSKLDCNAKQWRLFADVLNDIAMFLEIMAPMYPIFFTMTVSISNLAKCIVGVAGGATR 209

Query: 269 TPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCGY 324
             +    A+  N+ DV+AK    E V N+A LL + L + +    PSL +  F LL+  +
Sbjct: 210 AALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDCPSLSLGCFILLTALH 269

Query: 325 VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG 384
           ++++Y+ V+++VL TLN +R  + ++ FL+ G V +    N  E +++  W    P +  
Sbjct: 270 IYANYRAVRALVLETLNESRLQLLLKHFLQRGEVLAPASANQMEPLWTGFW----PSLSL 325

Query: 385 SRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYV 442
           S          +   +E   E  +E Y++ +N S+ +V   L   A  + +L+AA H  V
Sbjct: 326 SLGVPLHHLVSSVFELEQLVEGHQEPYLLCWNQSQNQVQVALSQVAGPEAVLRAATHGLV 385

Query: 443 L 443
           L
Sbjct: 386 L 386


>gi|255562731|ref|XP_002522371.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
           communis]
 gi|223538449|gb|EEF40055.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
           communis]
          Length = 523

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 90/254 (35%), Positives = 143/254 (56%), Gaps = 13/254 (5%)

Query: 136 VPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           V D F P  VT +Y+ Y+ W+ L   F  A+ V  TQ +  +VG+  +R+ P A A+NW+
Sbjct: 126 VRDFFIPKQVTENYMDYVKWKLLHRVFSSALQVLATQAMFRAVGIGYSRSLPSAAALNWV 185

Query: 195 LKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAA 253
           LKDG GR+ + I+ A     FD +LK++RF+  +L  LG G+EL T A P  FL LA  A
Sbjct: 186 LKDGLGRLSRCIYTASLASAFDTNLKRVRFSTSILFSLGIGLELLTPAFPKQFLLLATIA 245

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS---KRN 310
           N+AK ++     +T + I+++FA  +N+G+V+AK +      D LG  L+ +++   K N
Sbjct: 246 NIAKQISLACHLATGSAIHRSFAVADNLGEVSAKAQIQTVCFDNLGLMLAALLNMLFKNN 305

Query: 311 PSLI-----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
             L+       + + S   +F  YQ +K V L TL + R  + ++S++  G VP+  E +
Sbjct: 306 QRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLEMILDSWIALGRVPTPAEVS 365

Query: 366 LQENIFSFPWLRDK 379
            +E    F +LR K
Sbjct: 366 KKEG---FEFLRTK 376


>gi|440913323|gb|ELR62787.1| hypothetical protein M91_19503, partial [Bos grunniens mutus]
          Length = 437

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 162/315 (51%), Gaps = 11/315 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV   +A+  A    W++
Sbjct: 47  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 106

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KD  G +G+++FA   G K D + KQ R   D+L ++   +E+    +P  F    C +N
Sbjct: 107 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCISN 166

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNP 311
           +AK +  V   +TR  +    A+  N+ DV+AK    E + N+A LL + L + +   +P
Sbjct: 167 LAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPMVSDSP 226

Query: 312 SL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           SL +  F  L+  +++++Y+ V+++V+ TLN  R  + +  FL+ G V      N  E +
Sbjct: 227 SLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQRGEVLGPTSANQMEPL 286

Query: 371 FSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQA 428
           ++  W     + LG+          +   ++   E  +E Y++ ++ S+ +V  +L   A
Sbjct: 287 WTGFW-SSVSLSLGAPLHHV---TSSVSELQQLVEGHQEPYLLRWDQSRNQVQVVLSQMA 342

Query: 429 KSDDILKAAFHAYVL 443
             + IL+AA H  VL
Sbjct: 343 GPEAILRAATHGLVL 357


>gi|328778772|ref|XP_394441.3| PREDICTED: UPF0420 protein C16orf58 homolog [Apis mellifera]
          Length = 464

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 97/330 (29%), Positives = 162/330 (49%), Gaps = 20/330 (6%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L+S I+   +P G+PDSV P Y  Y  W  ++ F    MG  TT +++  VGV    ATP
Sbjct: 45  LISIIKEVFLPQGYPDSVHPDYTSYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEAAATP 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
            A AI WILKDG G VG+++FA   G   D   K+ R   D+L +L  G+EL        
Sbjct: 105 LAAAITWILKDGTGMVGRIMFAWWNGTDLDGQCKKWRLFADILNDLAMGLELLLPYFSSY 164

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTG 301
            L + C +   K++  V   +TR  + +  A   N+ DV+AK      CV  IA  +G  
Sbjct: 165 SLGILCISTAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGI- 223

Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
           L + I      ++  +  L   +++++Y  VK++ L++LN  R ++ V+S++    +P  
Sbjct: 224 LILSIFHNGQYIMELYLFLVIIHLYANYSAVKALCLNSLNEDRLALIVKSYISNEVIPKP 283

Query: 362 REGNLQENIFSFPWLRDKPVVLGSRF-----------KDAFQDPDAYLAVEPFFEKERYV 410
            E N +E++      +    + G              KD     D  L+++ F ++ +Y+
Sbjct: 284 EEVNKKESVLLLT--KSTMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFLDR-KYL 340

Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
           ++ +     ++  LK  A+  D+L+A FHA
Sbjct: 341 ISIDVQNKTIFICLKKDAQPYDVLEAYFHA 370


>gi|219129733|ref|XP_002185036.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403531|gb|EEC43483.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 627

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 121/453 (26%), Positives = 204/453 (45%), Gaps = 46/453 (10%)

Query: 33  ETMRISANLATSSSPIGVGGGGAAATLLQSESDETTAFRLVEDQFVDSSL----RLICSE 88
           E+ R   + A  S+   V  G  +A L  S  ++T+        F +  +    R +   
Sbjct: 86  ESFRDDRDSALVSASDLVFNGSTSAALTCSPEEQTSRTSQSSATFSEVDVLYGRRAVLVY 145

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPD----GFPDSV 144
           D    R+  V+E++R     +    A+     +A    L  FI + ++P       P  V
Sbjct: 146 DPLQERYVKVSEKNRVADSTKQESVAL-----RARRSSLARFITTKILPRLSLAFLPSGV 200

Query: 145 TPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV--AINWILKDGAGRV 202
           T  Y  ++ WR L+ F    + VF TQ+LL ++ +  + +  GA+  A+NW+LKD  G++
Sbjct: 201 TNDYYRFVRWRILQRFVNANLNVFGTQSLLLALRIKSSASQLGALSAALNWVLKDALGKI 260

Query: 203 GKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAA 261
            +M++A R G++FD D K+ RF    +   G G+E+ T   P LFL  A  AN  K ++ 
Sbjct: 261 VRMLWASRMGRRFDSDAKRWRFRSSFVFAAGNGLEIITYVFPSLFLLWATLANCCKQISM 320

Query: 262 VTSTSTRTPIYKAFAKG--ENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS----LIT 315
           +TS+STRT IY +F  G  ENIGD+TAKGE    I DLLG    + +S+   +    ++ 
Sbjct: 321 LTSSSTRTSIYNSFRDGSRENIGDITAKGEAQIAIVDLLGIASGVTLSRTVGTSIRAVLA 380

Query: 316 TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH------------------ 357
            +  L    +   Y ++++V    +N  R    V  F +                     
Sbjct: 381 VYVTLQAIEIVCVYHQLRAVTYRVMNFERMISVVADFCQARQGPKDGLEGLAASCTTPTP 440

Query: 358 --VPSLREGNLQENIFSFP-WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYN 414
             +P+ +    QE IF  P  L  + +  GS    A   PD    +   F++ER+++   
Sbjct: 441 AGIPTPQTLASQERIFLPPKHLTRRAIAFGS-IGRARLSPDELGTLLEIFKRERFILVVG 499

Query: 415 PSKGKVYALLKDQAKS--DDILKAAFHAYVLLH 445
            +       +   AK   D + +   + +++LH
Sbjct: 500 KNVKHPRPFMAKTAKQNEDPVSRIQENCHIVLH 532


>gi|301096265|ref|XP_002897230.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262107315|gb|EEY65367.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 474

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 193/391 (49%), Gaps = 39/391 (9%)

Query: 83  RLICSEDIDGRRWNYVAERDR---SGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDG 139
           R +C+ +   R   YV + D+   +G+ + ++   ++L       +E+        +P G
Sbjct: 34  RPVCANEAPRR---YVQQGDQLLSTGQGRASNAGCLTLSGLSRELQEMF-------LPAG 83

Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
           +PDSV+  Y+ +  W  L+       GV  TQ++L SVGV  ++ATP A A+ W+L+DG+
Sbjct: 84  YPDSVSEDYLGFQFWDTLQAMCSYLRGVLATQSVLQSVGVGDDKATPLAAALQWVLRDGS 143

Query: 200 GRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
           G +G + FA   G KFD ++K+ R   D++ ++G  +++     P L   + C ++V K 
Sbjct: 144 GMIGGLTFAYFVGPKFDVNVKRWRLFADVVNDVGLTLDMVAPYFPTLVTEVLCVSSVCKA 203

Query: 259 VAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM----ISKRNPSLI 314
           +  V + +TR+ +   FAK +N+ D  AK         L G    +     ++    ++ 
Sbjct: 204 MCGVAAGATRSSLMTHFAKRDNMADCAAKEGSQETAVKLFGLMFGMYFANAVNSSPHAVW 263

Query: 315 TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP------------SLR 362
             F +L+  +V+++Y  V  + + T+N +R  + V+ FL +G+ P            S+R
Sbjct: 264 MAFLVLTLVHVYANYNAVSCLCIPTVNCSRGLILVQRFLMSGNKPGKENVADHQGRYSIR 323

Query: 363 EGNLQENIFSFPWLR-DKPVVLGSRFKDAF---QDPDAYLAVEPFFEKERYVVTYNPSKG 418
             N +E IF  P L     +++GS    A     D D  L +   +E E Y++T      
Sbjct: 324 SVNQEEPIFKDPVLPVTSHLMMGSELHQAVPTSTDLDRLLQI---YEDETYLLTV--VGD 378

Query: 419 KVYALLKDQAKSDDILKAAFHAYVLLHCINS 449
           +V+ + + +AK  D L+A F A ++L  + S
Sbjct: 379 RVHVMFEAEAKPTDELRAFFQAVLVLQALAS 409


>gi|356502614|ref|XP_003520113.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
           [Glycine max]
          Length = 486

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 95/345 (27%), Positives = 172/345 (49%), Gaps = 18/345 (5%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVT  YVP+  W  L         + +TQ LL ++GV    AT       W L
Sbjct: 106 VPEGFPSSVTADYVPFQIWDLLXGLSTYIRTMLSTQALLIAIGVEEKSATVIGATFQWFL 165

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G +G ++F   QG   D   K  R   DL+ +LG  ++L +   P  F+ + C  +
Sbjct: 166 RDLTGMLGGILFTFYQGSNLDSYAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGS 225

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR---NP 311
           ++++   V S +TR  + + FA  +N  D++AK      +A ++G  L +++++    +P
Sbjct: 226 ISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVARLTIGHP 285

Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
            ++  +F  L+  +++++Y+ V+ + L++LN  R S+  + F++TG V S  + + QE+I
Sbjct: 286 LAIWFSFLSLTVFHMYANYRAVQXLTLNSLNPERSSILFQHFMETGQVLSPEQVSSQEHI 345

Query: 371 FSFPWLR---------DKPVVLGSRFK--DAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
               ++           K V LG R    D  +  +  L+  P++ K +Y++     KG 
Sbjct: 346 LPIQFISWSSKKANCLHKKVNLGMRISSFDNMEIKEHLLSAAPYYTKAKYLLV--ERKGI 403

Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYD 464
           +  ++   + + D+LK  FHA VL   ++ S    S   K  +Y 
Sbjct: 404 IDVIVHKDSNAADVLKLFFHALVLAKNVHKSKSVHSEGQKWIDYQ 448


>gi|348585092|ref|XP_003478306.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cavia porcellus]
          Length = 468

 Score =  148 bits (373), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 93/332 (28%), Positives = 174/332 (52%), Gaps = 12/332 (3%)

Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
           SP A A   +S +++ ++P GFPDSV+P Y+PY  W +++ F         T  ++  +G
Sbjct: 62  SPMA-ASPPVSGLKAVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSSALATHAVMLGLG 120

Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
           V    A+  A    W+++D  G +G+++FA  +G K D ++KQ R   D+L +    +E+
Sbjct: 121 VGNAEASLAAATTTWLVRDSTGMLGRIVFAWWKGSKLDCNVKQWRLFADILNDAAMFLEI 180

Query: 238 ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNI 294
           A   +P  F      +N+AK + +V   +TR  +    A+  N+ DV+AK    E + N+
Sbjct: 181 AAPTLPFCFTVTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNL 240

Query: 295 ADLLGTGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
           A LL + + + +    PS  +  F LL+  +++++Y+ V+++V+ TLN +R  + ++ FL
Sbjct: 241 AGLLFSLVMLPLVSGCPSFSLGCFFLLTGLHIYANYRAVRALVMETLNESRLQLVLKHFL 300

Query: 354 KTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVV 411
           + G +      N  E +++  W     + LG    D      +   ++   E  +E Y++
Sbjct: 301 QRGEILDTTSANQMEPLWTGFW-PSLSLSLGVPLHDLI---SSVSELQQLVEGHQEPYLL 356

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           ++N S+ KV  +L   A  + +L+AA H  +L
Sbjct: 357 SWNQSRNKVQVVLSHVAGPEAVLRAATHGLLL 388


>gi|357120354|ref|XP_003561892.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
          Length = 452

 Score =  148 bits (373), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 104/377 (27%), Positives = 183/377 (48%), Gaps = 27/377 (7%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W  L+        + +TQ LL+++GV    AT       W L
Sbjct: 72  VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLSAIGVGEKSATVIGATFQWFL 131

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G +G ++FA  QG   D + K  R   D + +LG  ++L +   P   + + C  +
Sbjct: 132 RDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 191

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK--RNPS 312
           ++++   V S +TR  + + FA  +N  D++AK      +A + G GL ++++   R   
Sbjct: 192 LSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMSGMGLGMLLAHVTRGHD 251

Query: 313 LI--TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           L+   +F  L+  +++++Y+ V+S+ L TLN  R S+ ++ FL+ G V + ++ + QE+I
Sbjct: 252 LVVWVSFLSLTIFHMYANYKAVQSLSLVTLNYERTSILLQYFLEDGEVRTPQQVSKQEHI 311

Query: 371 FSF--PWLR-------DKPVVLGSRFKDAFQDPDAYL--AVEPFFEKERYVVTYNPSKGK 419
             F   W R        + V LG++        D  L      ++    Y +     +G 
Sbjct: 312 LPFWSSWRRLLRTKLPHELVHLGAK-ASMLTHSDMLLISKTRSYYTDANYFLL--DKEGS 368

Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEY--DNSNLMLTTADLEA 477
           V+  +  QA + D+LK+  H  VL      S   +S   + H++  D  N+ ++   +E 
Sbjct: 369 VHIFIHKQAAATDVLKSFVHGLVLARLTRKS---KSCHAEAHQWMDDKYNIFISKLKMEG 425

Query: 478 HIGE---SCKVVSTALW 491
           +  E   S  +V  A W
Sbjct: 426 YSTERLLSHSIVWRAHW 442


>gi|356524511|ref|XP_003530872.1| PREDICTED: uncharacterized protein LOC100783222 [Glycine max]
          Length = 513

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 84/253 (33%), Positives = 145/253 (57%), Gaps = 12/253 (4%)

Query: 130 FIRSY--VVPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           ++R++  VV D F P  VT +YV Y+ W+ L   F  A+ V  TQ +  ++GV  + + P
Sbjct: 104 WLRAFGSVVWDFFIPRGVTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSNSLP 163

Query: 187 GAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
            A A+NW+LKDG GR+ + I+ A     FD +LK++RF   +L     G+EL T A P  
Sbjct: 164 SAAALNWVLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAFPRC 223

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
           FL LA  AN++K ++     +TR+ ++++FA G+N+G+++AK +      D+LG  L+ +
Sbjct: 224 FLLLATIANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLMLAAL 283

Query: 306 IS------KRNPSLITTFA--LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
           ++      +R  + +  F     +   +F  YQ +K+V L TL + R  + + ++++ G+
Sbjct: 284 VNLWIENHRRQQAGLHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIECGY 343

Query: 358 VPSLREGNLQENI 370
           VPS  E + +E I
Sbjct: 344 VPSPAEVSEKEGI 356


>gi|431902902|gb|ELK09111.1| hypothetical protein PAL_GLEAN10000828 [Pteropus alecto]
          Length = 468

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 95/327 (29%), Positives = 169/327 (51%), Gaps = 11/327 (3%)

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
           A   +S + +  +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV    
Sbjct: 66  ASPFLSRLLTVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAN 125

Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           A+  A    W++KD AG +G+++FA  +G K D + KQ R   D+L ++   +E+     
Sbjct: 126 ASVSAATATWLVKDSAGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIY 185

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLG 299
           P  F  + C +N+AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL 
Sbjct: 186 PICFTMIVCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLV 245

Query: 300 TGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
           + L + +    PS  +  F LL+  +++++Y+ V+++V+ TLN  R  + ++ FL++G V
Sbjct: 246 SLLMLPLVSECPSFSLGCFFLLTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQSGEV 305

Query: 359 PSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPS 416
                 N  E +++  W     + LG               ++   E  +E Y++ ++ S
Sbjct: 306 LDPTSANKMEPLWTGFW-PSPSLFLGVPLHRLISR---VFELQQLIEGHQEPYLLHWDQS 361

Query: 417 KGKVYALLKDQAKSDDILKAAFHAYVL 443
           + +V  +L   A ++ IL+AA H  VL
Sbjct: 362 QNQVQVVLSQMAGTETILRAATHGLVL 388


>gi|348671592|gb|EGZ11413.1| hypothetical protein PHYSODRAFT_338124 [Phytophthora sojae]
          Length = 478

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 96/338 (28%), Positives = 168/338 (49%), Gaps = 26/338 (7%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           ++   +P G+PDSV+  Y+ +  W  L+       GV  TQ++L SVGV  ++ATP A A
Sbjct: 79  LQEMFLPAGYPDSVSEDYLSFQFWDTLQAMCSYLRGVLATQSVLQSVGVGDDKATPLAAA 138

Query: 191 INWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           + W+L+DG+G +G + FA   G KFD ++K+ R   D++ ++G  +++     P L   +
Sbjct: 139 LQWVLRDGSGMIGGLTFAYLVGPKFDVNVKRWRLFADVINDVGLTLDMVAPYFPTLVTEV 198

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM---- 305
            C ++V K +  V + +TR+ +   FAK +N+ D  AK         L G    +     
Sbjct: 199 LCVSSVCKTMCGVAAGATRSSLMTHFAKKDNMADCAAKEGSQETAVKLFGLMFGMYFANA 258

Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP------ 359
           ++    ++   F LL+  +V+++Y  V  + + T+N +R  + V  FL +G+ P      
Sbjct: 259 VNSSPQAVWVAFLLLTVIHVYANYNAVSCLCIPTVNCSRGLILVRRFLASGNKPGKENVA 318

Query: 360 ------SLREGNLQENIFSFPWLR-DKPVVLGSRFKDAF---QDPDAYLAVEPFFEKERY 409
                 S+R  N +E IF  P L     +V+G+    A     D D  L +   +  E Y
Sbjct: 319 DHNGRYSIRSVNQEEPIFKDPVLPVTSHLVMGAELHQAVPSSHDLDRLLHI---YSDEAY 375

Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCI 447
           ++T      +V  + + +AK  D L+A F A ++L  +
Sbjct: 376 LLTV--VGDRVLVMFQAEAKPKDELRAFFQAVLVLEAL 411


>gi|219119716|ref|XP_002180612.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408085|gb|EEC48020.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 408

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 103/325 (31%), Positives = 164/325 (50%), Gaps = 18/325 (5%)

Query: 134 YVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN--RATPGAVAI 191
           + +P  FP SV P Y  +  +  L    G A  V +TQTLL ++GV  N   A+  A A+
Sbjct: 29  HFLPAQFPASVAPGYAQFCGFCFLASVSGSAGMVLSTQTLLLAIGVVGNAHSASVMAGAL 88

Query: 192 NWILKDGAGRVGKMIFA-RQG--KKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFL 247
           NW+LKDG G++G +IFA R G  ++FD + K+ R  G L ++  + +++ +   P  + L
Sbjct: 89  NWVLKDGIGQLGGVIFASRMGETRRFDANPKKWRMMGALCLDSASLLDILSPFAPSAVVL 148

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
           PLAC AN  KN+  +T  ++R  +++A  +  N+GDVTAK       A LLGTG+ I +S
Sbjct: 149 PLACFANALKNIGYLTVGASRAALHQALTRAGNLGDVTAKAGSQSIAAGLLGTGVGIGLS 208

Query: 308 KRNPSLITTFALLSCG----YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
                    F L  CG    +   ++  ++ V LH  NR R  + ++ +L+ G V S RE
Sbjct: 209 AAMGHDAGNFVLGFCGLSLIHQLCNFAALQHVDLHHFNRHRLYLVLDHYLRHGKVLSPRE 268

Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQ-----DPDAYLAVEPFFEKERYVVTYNPSKG 418
              +E+   FP LR+           A        P+    +      E Y++    + G
Sbjct: 269 IATKESF--FPLLREDDTASWLSIGSALTAISPGGPNELTQLLKACPDESYIINCT-NDG 325

Query: 419 KVYALLKDQAKSDDILKAAFHAYVL 443
           K++ +    A+ +D++K  FHA  L
Sbjct: 326 KIHVVFLQDAQGEDLVKGMFHACFL 350


>gi|395514351|ref|XP_003761381.1| PREDICTED: UPF0420 protein C16orf58 homolog [Sarcophilus harrisii]
          Length = 455

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 164/320 (51%), Gaps = 13/320 (4%)

Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
           ++  +P GFPDSV+P Y+PY  W  ++ F     G   TQ +L  VGV    A+  A  I
Sbjct: 62  QAVFLPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVLQGVGVGDAAASVSAATI 121

Query: 192 NWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA 250
            W+LKDG G +G++ FA   G K D D K+ R   D+L +    +E+   A P  F  + 
Sbjct: 122 TWLLKDGTGMLGRIGFAWLMGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPSCFTLIV 181

Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMIS 307
           C +++AK +  V   +TR  +    A+ +N+ DV+AK    E + N+A LL + L + + 
Sbjct: 182 CVSSLAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLVSLLLLPLV 241

Query: 308 KRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
              P L +  F L +  +++++Y+ V++V++ T N  R  + ++ F++ G V +    N 
Sbjct: 242 SERPRLSLAAFFLFTALHLYANYRAVRAVIMETFNETRLYLVLKHFIQWGEVLTPPSANR 301

Query: 367 QENIFS--FPWLR-DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
            E +++   P L     V L S    AF+     L       ++ Y++ +N  +GK+   
Sbjct: 302 MEPLWTGCLPALSLSLGVALHSLVSSAFE-----LQQVSEGHQQPYLLRWNQCRGKIQVT 356

Query: 424 LKDQAKSDDILKAAFHAYVL 443
           L  +A +  IL+AA H  +L
Sbjct: 357 LGQEASAGVILRAATHGLLL 376


>gi|78486556|ref|NP_001030594.1| UPF0420 protein C16orf58 homolog [Rattus norvegicus]
 gi|123781627|sp|Q499P8.1|CP058_RAT RecName: Full=UPF0420 protein C16orf58 homolog
 gi|71682994|gb|AAH99813.1| Similar to cDNA sequence BC017158 [Rattus norvegicus]
          Length = 466

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 110/393 (27%), Positives = 188/393 (47%), Gaps = 24/393 (6%)

Query: 66  ETTAFR--LVEDQFVDSSLRLICSEDIDGR-RWNYVAERDRSGKFKRNSIRAISLQS--- 119
           +T + R  L  +QF   + R  CS   DG  +W+      R G F R SI          
Sbjct: 3   DTASLRAPLCTEQFGSGAPR-SCSAAADGSLQWDRA---QRWGWFSRASITKPGQHEGGG 58

Query: 120 --PQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
             P A A   +S +RS ++P GFPDSV+P Y+ Y  W +++ F     G   TQ +L  +
Sbjct: 59  RGPWA-ALTTLSGLRSVLLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGL 117

Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVE 236
           GV   +A+  A    W++KD  G +G++IFA  +G K D + KQ R   D+L +    +E
Sbjct: 118 GVGNAKASVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLE 177

Query: 237 LATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 296
           +     P  F      +N+AK +  V   +TR  +    A+  N+ DV+AK      + +
Sbjct: 178 IMAPMYPIFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVN 237

Query: 297 LLGTGLSIMISKRNPSLIT----TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
           L G  +S+++       ++     F LL+  +++++Y+ V+++VL TLN +R  + ++ F
Sbjct: 238 LAGLLVSLLMLPLVSDCLSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHF 297

Query: 353 LKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYV 410
           L+ G V      N  E +++  W    P +  S          +   ++   E  +E Y+
Sbjct: 298 LQRGEVLEPASANQMEPLWTGFW----PSLSLSLGVPLHHLVSSVSELKQLVEGHQEPYL 353

Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           + +N S+ +V   L   A  + +L+AA H  +L
Sbjct: 354 LCWNQSQNQVQVALSQVAGPETVLRAATHGLIL 386


>gi|356566391|ref|XP_003551415.1| PREDICTED: uncharacterized protein LOC100783455 [Glycine max]
          Length = 525

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 83/246 (33%), Positives = 139/246 (56%), Gaps = 10/246 (4%)

Query: 135 VVPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINW 193
           VV D F P  VT +YV Y+ W+ L   F  A+ V  TQ +  ++GV  + + P A A+NW
Sbjct: 122 VVWDFFIPRGVTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSCSLPSAAALNW 181

Query: 194 ILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA 252
           +LKDG GR+ + I+ A     FD +LK++RF   +L     G+EL T A P  FL LA  
Sbjct: 182 VLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAFPRCFLLLATI 241

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS----- 307
           AN++K ++     +TR+ ++++FA G+N+G+++AK +      D+LG  L+ +++     
Sbjct: 242 ANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLMLAALVNLWIEN 301

Query: 308 -KRNPSLITTFA--LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
            +R  +    F     +   +F  YQ +K+V L TL + R  + + ++++ G+VPS  E 
Sbjct: 302 HRRQQAGFHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIECGYVPSPAEV 361

Query: 365 NLQENI 370
           + +E I
Sbjct: 362 SEKEGI 367


>gi|149067630|gb|EDM17182.1| similar to cDNA sequence BC017158, isoform CRA_b [Rattus
           norvegicus]
          Length = 466

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 110/393 (27%), Positives = 189/393 (48%), Gaps = 24/393 (6%)

Query: 66  ETTAFR--LVEDQFVDSSLRLICSEDIDGR-RWNYVAERDRSGKFKRNSIRAISL----- 117
           +T + R  L  +QF   + R  CS   DG  +W+      R G F R SI          
Sbjct: 3   DTASLRAPLCTEQFGSGAPR-SCSAAADGSLQWDRA---QRWGWFSRASITKPGQHEGGG 58

Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
           + P A A   +S +RS ++P GFPDSV+P Y+ Y  W +++ F     G   TQ +L  +
Sbjct: 59  RGPWA-ALTTLSGLRSVLLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGL 117

Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVE 236
           GV   +A+  A    W++KD  G +G++IFA  +G K D + KQ R   D+L +    +E
Sbjct: 118 GVGNAKASVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLE 177

Query: 237 LATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 296
           +     P  F      +N+AK +  V   +TR  +    A+  N+ DV+AK      + +
Sbjct: 178 IMAPMYPIFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVN 237

Query: 297 LLGTGLSIMISKRNPSLIT----TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
           L G  +S+++       ++     F LL+  +++++Y+ V+++VL TLN +R  + ++ F
Sbjct: 238 LAGLLVSLLMLPLVSDCLSLSFGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHF 297

Query: 353 LKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYV 410
           L+ G V      N  E +++  W    P +  S          +   ++   E  +E Y+
Sbjct: 298 LQRGEVLEPASANQMEPLWTGFW----PSLSLSLGVPLHHLVSSVSELKQLVEGHQEPYL 353

Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           + +N S+ +V   L   A  + +L+AA H  +L
Sbjct: 354 LCWNQSQNQVQVALSQVAGPETVLRAATHGLIL 386


>gi|412993492|emb|CCO14003.1| predicted protein [Bathycoccus prasinos]
          Length = 584

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 103/333 (30%), Positives = 161/333 (48%), Gaps = 39/333 (11%)

Query: 140 FPD--SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
           FP   SVT  Y  Y  +R  +      M V  TQ +L ++G+  +R+ P A A+NW+LKD
Sbjct: 173 FPQKTSVTSDYWDYAKYRFTQRIASSCMTVLATQQMLAAIGLGASRSIPAAAALNWVLKD 232

Query: 198 GAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA 256
           G GR+GK+  A   G+ FD D+K+LRF+  L+      +E  T   P  FL +A  ANV 
Sbjct: 233 GLGRLGKLSVATNFGRSFDSDVKRLRFSSSLVYTGAVFIETITPFFPKHFLAIATVANVG 292

Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLIT- 315
           K++   T    R PI K FA  EN+G++ AK      +AD +G  L + + K     ++ 
Sbjct: 293 KSIGITTGNVVRPPIQKTFALEENLGEIAAKTSAQQVLADNIGLALGVSVMKFQTKFVSL 352

Query: 316 ---------TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
                     F +L+   + S ++++KSV L T+N+ R  +  E+F++   + +  +   
Sbjct: 353 ALQRAVPLALFPILASIDLVSIHKQLKSVQLRTVNKERAEIIAEAFVREKKLATQLQVAE 412

Query: 367 QENIFSFPWLRDKPVV----------------LGSRFKDAFQDPDAYLAVEPFFEKERYV 410
           +E +  FP   D   +                L S FKD    P   L+ EP+ +K+R V
Sbjct: 413 KERLL-FPARLDASALPLRVTSLGNACSTLDALASAFKDDVHRP-YVLSYEPYVQKKRTV 470

Query: 411 VT------YNPSKGKVYAL--LKDQAKSDDILK 435
           ++        P K K  AL  L  +A S D+L+
Sbjct: 471 LSGIFMRDRKPLKYKGRALLALSHRATSKDVLQ 503


>gi|449484579|ref|XP_004156921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 443

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 85/241 (35%), Positives = 134/241 (55%), Gaps = 5/241 (2%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++P+GFPDSVT  Y+ Y  WR ++       GV  TQ LL +VG+ +  A P A A+NW+
Sbjct: 181 MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKG-AIPTAAAVNWV 239

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K+  ++ G+ FD   K  R   DLL     G+E+ T A P  F+ +  AA 
Sbjct: 240 LKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAG 299

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS-- 312
             ++ AA+   +TR+  Y  FA   N  +V AKGE  G ++  +G  L I ++ R  S  
Sbjct: 300 AGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSST 359

Query: 313 --LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
              +  F++++  ++F + +  KS+ L TLN  R S+    +L +G VPS+++ N +E +
Sbjct: 360 SLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPL 419

Query: 371 F 371
           F
Sbjct: 420 F 420


>gi|449460628|ref|XP_004148047.1| PREDICTED: UPF0420 protein C16orf58-like [Cucumis sativus]
          Length = 523

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 83/246 (33%), Positives = 138/246 (56%), Gaps = 12/246 (4%)

Query: 136 VPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           V D F P +V+  YV Y+ W+ L   F  A+ V  TQ +  ++GV  +R+   A A+NW+
Sbjct: 120 VKDFFIPKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAIGVGHSRSLASAAALNWV 179

Query: 195 LKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAA 253
           LKDG GR+ + +F A     FD +LK++RF+  +L  L  GVEL T A P  FL LA  A
Sbjct: 180 LKDGLGRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVELLTPAFPQYFLLLASIA 239

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKR 309
           N+ K ++     ST + ++++FA  +N+G+V+AK +    C  N+  +L   L+ + SK 
Sbjct: 240 NIVKQISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFDNLGLVLAAFLNFL-SKN 298

Query: 310 NPSL-----ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
           +  L        + + +   +F +YQ +K V L TL + R  + + ++++ G+VP+  E 
Sbjct: 299 DRRLQAALPFVVYPIFAAMDLFGTYQGLKHVHLQTLTKDRLEIILSNWIEQGYVPTPAEV 358

Query: 365 NLQENI 370
           + +E I
Sbjct: 359 SEREGI 364


>gi|74215416|dbj|BAE41911.1| unnamed protein product [Mus musculus]
          Length = 337

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 158/312 (50%), Gaps = 18/312 (5%)

Query: 72  LVEDQFVDSSLRLICSEDIDG-------RRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
           L  +QF   + R  CS   DG       RRW +++         +    A     P A A
Sbjct: 11  LCTEQFGSGAPR-GCSAAADGSLQWDGARRWGWLSR----APIAKPGQHAGGGGGPWA-A 64

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
              +S +RS ++P GFPDSV+P Y+PY  W +++ F     G   TQ +L  +GV   +A
Sbjct: 65  LTALSGLRSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKA 124

Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           +  A    W++KD  G +G++IFA  +G K D + KQ R   D+L ++   +E+     P
Sbjct: 125 SVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYP 184

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGT 300
             F      +N+AK +  V   +TR  +    A+  N+ DV+AK    E V N+A LL +
Sbjct: 185 IFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVS 244

Query: 301 GLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
            L + +    PSL +  F LL+  +++++Y+ V+++VL TLN +R  + +E FL+ G V 
Sbjct: 245 LLMLPLVSDCPSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVL 304

Query: 360 SLREGNLQENIF 371
                N  E ++
Sbjct: 305 EPASANQMEPLW 316


>gi|449509553|ref|XP_004163622.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58-like
           [Cucumis sativus]
          Length = 523

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 80/240 (33%), Positives = 135/240 (56%), Gaps = 11/240 (4%)

Query: 141 PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAG 200
           P +V+  YV Y+ W+ L   F  A+ V  TQ +  ++GV  +R+   A A+NW+LKDG G
Sbjct: 126 PKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAIGVGHSRSLASAAALNWVLKDGLG 185

Query: 201 RVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNV 259
           R+ + +F A     FD +LK++RF+  +L  L  GVEL T A P  FL LA  AN+ K +
Sbjct: 186 RLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVELLTPAFPQYFLLLASIANIVKQI 245

Query: 260 AAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKRNPSL-- 313
           +     ST + ++++FA  +N+G+V+AK +    C  N+  +L   L+ + SK +  L  
Sbjct: 246 SLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFDNLGLVLAAFLNFL-SKNDRRLQA 304

Query: 314 ---ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
                 + + +   +F +YQ +K V L TL + R  + + ++++ G+VP+  E + +E I
Sbjct: 305 ALPFVVYPIFAAMDLFGTYQGLKHVHLQTLTKDRLEIILSNWIEQGYVPTPAEVSEREGI 364


>gi|149725803|ref|XP_001495510.1| PREDICTED: UPF0420 protein C16orf58 homolog [Equus caballus]
          Length = 468

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 109/385 (28%), Positives = 189/385 (49%), Gaps = 18/385 (4%)

Query: 71  RLVEDQFVDSSLRLICSEDIDGR-RWNYVAER--DRSGKF--KRNSIRAISLQSPQAPAE 125
           RL  +Q+   + R  C+  +DG  +W     R    SG F  K          +P A A 
Sbjct: 10  RLCSEQYGSGAARGCCAA-VDGSLQWEAWGWRWWGFSGAFTAKPEGPDGGGEGAP-ATAS 67

Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
             +S + +  +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV   +A+
Sbjct: 68  APLSRLLALFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGDAKAS 127

Query: 186 PGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
             A    W++KD  G +G++IFA  +G K D + K  R   D+L ++   +E+     P 
Sbjct: 128 VSAATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKHWRLFADILNDVAMFLEIMAPVYPI 187

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTG 301
            F  + C +N+AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL + 
Sbjct: 188 CFTMIVCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSL 247

Query: 302 LSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
           L + +    PS  +  F L +  +++++Y+ V+++V+ TLN  R  + ++ FL+ G V  
Sbjct: 248 LMLPLVSGCPSFSLGCFFLFTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQRGEVLG 307

Query: 361 LREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKG 418
               N  E +++  W    P V  S      +   +   ++   E  +E Y++ +N S+ 
Sbjct: 308 PTSANRMEPLWTGFW----PSVSLSLGVPLHRLISSVSELQQLVEGHQEPYLLRWNQSQN 363

Query: 419 KVYALLKDQAKSDDILKAAFHAYVL 443
           +V  +L   A  + +L+AA H  VL
Sbjct: 364 QVQVVLSQMAGPETVLRAATHGLVL 388


>gi|345801540|ref|XP_547054.3| PREDICTED: UPF0420 protein C16orf58 homolog [Canis lupus
           familiaris]
          Length = 468

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 94/315 (29%), Positives = 163/315 (51%), Gaps = 11/315 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV    A+  A    W++
Sbjct: 78  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAEASVSAATATWLV 137

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KD  G +G++IFA  +G K D + KQ R   D+L ++   +E+     P  F    C +N
Sbjct: 138 KDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTMTVCISN 197

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNP 311
           +AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L + +    P
Sbjct: 198 LAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSACP 257

Query: 312 SL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           S  ++ F  L+  +++++Y+ V+++VL TLN  R  + ++ FL+ G V      N  E +
Sbjct: 258 SFSLSCFFFLTALHIYANYRAVRALVLETLNEDRLRLVLKHFLQRGEVLDPTSANQMEPL 317

Query: 371 FSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQA 428
           ++  W    P +  S      +   +   ++   E  +E Y++ ++ SK +V  +L  +A
Sbjct: 318 WTGFW----PSLSLSLGVPLHRLISSVFELQQLVEGHQEPYLLHWDKSKSQVQVVLSQKA 373

Query: 429 KSDDILKAAFHAYVL 443
             + IL+AA H  VL
Sbjct: 374 GPETILRAATHGLVL 388


>gi|198431163|ref|XP_002119233.1| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Ciona
           intestinalis]
          Length = 414

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 101/348 (29%), Positives = 162/348 (46%), Gaps = 33/348 (9%)

Query: 117 LQSPQAPAEELMSF------IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT 170
           L   Q+P +    F       +   +P G+P+SV+  Y+ Y  W  ++ F     G   T
Sbjct: 29  LVQKQSPDKSFFKFSGFTQIFKDIFLPKGYPESVSEDYLSYQIWDTVQAFCSSITGTLAT 88

Query: 171 QTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLM 229
             +L   GV    A   +  I W+LKDG G +G++ FA  +G   D + KQ R   D++ 
Sbjct: 89  HAVLKGSGVGDETANVASATITWLLKDGTGMLGRIAFAYFKGSSLDCNAKQWRLFADVMN 148

Query: 230 ELGAGVELATAAVPHLFLPLA-CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG 288
           +    ++L    +P  +  L  C A +A ++  V    TR  +    AK  N+ DV+AK 
Sbjct: 149 DCAIFIQLVAPVLPKAYFTLVMCLAGLAFSLVGVAGGCTRAALTMHQAKCNNMADVSAKD 208

Query: 289 ---ECVGNIADLLGTGLSIM--ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRA 343
              E + N+A LL TGL IM  ++   P +   F + +  +VF++Y+ V SVV+   NR 
Sbjct: 209 GSQETLVNLAALL-TGLVIMPIVATNIPLMWLLFLIFTFLHVFANYKAVASVVMELFNRN 267

Query: 344 RFSVAVESFLKTGHVPSLREGNLQENIF---SFPWLRDKPVVLGSRFKDAFQDPDAYLAV 400
           R S  ++ F     V + +E N +E IF   S+ WL      LG + +      D  L  
Sbjct: 268 RLSKVIKVFCDMNKVLTPKEANEKEAIFFPPSYSWLN-----LGCKLE------DLGLTS 316

Query: 401 EPFFEK-----ERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
             F +K      RY++  N   G++   L     + DIL+A + +Y+L
Sbjct: 317 SEFHQKVDDGGVRYLIKRNQKTGQIMVSLNHSCTASDILEAFYESYLL 364


>gi|414586418|tpg|DAA36989.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 297

 Score =  144 bits (364), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 89/263 (33%), Positives = 142/263 (53%), Gaps = 11/263 (4%)

Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
           D +G   R SI    +Q  +   ++L+    +   P G+P SV   Y+ Y  +R L+HF 
Sbjct: 40  DAAGSGSR-SISVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFS 98

Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
              + V +TQ+LL + G+   R TP  A A++WILKDG    GK+I +  G + D + K 
Sbjct: 99  SAMLHVLSTQSLLFAAGL---RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKS 155

Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
            R   D+L +LG  +E+ +   P LFL +A   N AK +A V + +TR PIY +FAK  N
Sbjct: 156 WRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 215

Query: 281 IGDVTAKGECVGNIADLLGTGLSI-----MISKRNPSLITTFALLSCGYVFSSYQEVKSV 335
           + D+ AKGE +  + +++G G  I     + S     LI    LLS  +++   QE+++ 
Sbjct: 216 LSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG-PLLSAVHIYGVIQEMRAT 274

Query: 336 VLHTLNRARFSVAVESFLKTGHV 358
            ++TLN  R ++ V  F+K  ++
Sbjct: 275 PVNTLNPQRTAMIVADFIKVRYL 297


>gi|357487621|ref|XP_003614098.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
 gi|355515433|gb|AES97056.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
          Length = 749

 Score =  144 bits (364), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 87/330 (26%), Positives = 171/330 (51%), Gaps = 19/330 (5%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           F++++V P+GFP SVTP YVP+  W  L+        + +TQ LL+++GV    AT    
Sbjct: 363 FLQAFV-PEGFPSSVTPDYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGA 421

Query: 190 AINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
              W L+D  G +G ++F   QG   D + K  R   DL+ +LG  ++L +   P  F+ 
Sbjct: 422 TFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVF 481

Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK 308
           + C  +++++   V S +TR  + + FA  +N  D++AK      +A ++G  L +++++
Sbjct: 482 IVCLGSISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVAR 541

Query: 309 RNP----SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
                  ++  +F  L+  +++++Y+ V+ + L++LN  R S+ ++ F+++  V S ++ 
Sbjct: 542 ITTGHPLAIWFSFLSLTLFHMYANYRAVQCLSLNSLNPERSSILLQRFMESRQVLSPKQV 601

Query: 365 NLQENIFSFPWLR---------DKPVVLGSRFK--DAFQDPDAYLAVEPFFEKERYVVTY 413
           +  E++                D  V LG+R    D  +  +  L++  ++ K +Y++  
Sbjct: 602 SPLEHVLPIQLTSMSSKKVNSLDSKVRLGTRISSFDEMEIKEHLLSLASYYAKAKYILV- 660

Query: 414 NPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
              KG +  ++   + S D+LK+  HA VL
Sbjct: 661 -EKKGIIDVIVHKDSSSADVLKSFIHALVL 689


>gi|197101105|ref|NP_001126133.1| UPF0420 protein C16orf58 homolog [Pongo abelii]
 gi|75041556|sp|Q5R8F6.1|CP058_PONAB RecName: Full=UPF0420 protein C16orf58 homolog
 gi|55730464|emb|CAH91954.1| hypothetical protein [Pongo abelii]
          Length = 468

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 96/362 (26%), Positives = 181/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     +  + +P AP+  L S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDSGEVGAPGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F  G  G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASGLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKGECVGNIADLLG----TGLSIMISKRNPSLITTFALLSCG 323
           R  +    A+  N+ DV+AK      + +L+G      +  ++S      +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+++V+ TLN  R  + ++ +L+ G V +    N  E +++  W    P   
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLNPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    IL+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|395842960|ref|XP_003794274.1| PREDICTED: UPF0420 protein C16orf58 homolog [Otolemur garnettii]
          Length = 468

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 97/326 (29%), Positives = 168/326 (51%), Gaps = 17/326 (5%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           +S +++  +P GFPDSV+P Y+PY  W +++ F     G   TQ +L  +GV   +A+  
Sbjct: 70  LSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAKASVS 129

Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A    W++KD  G +G+++FA  +G K D + KQ R   D+L ++   +E+     P  F
Sbjct: 130 AATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPICF 189

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
                 +N+AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L 
Sbjct: 190 TMTVSTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLI 249

Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
           + +    PS  +  F  L+  +++++Y+ V+++V+ TLN+ R  + ++ FL+ G V    
Sbjct: 250 LPLVSDCPSFSLGCFFFLTALHIYANYRAVRAIVMETLNQGRLRLVLKHFLQRGEVLDPT 309

Query: 363 EGNLQENIFSFPWLRDKP-----VVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSK 417
             N  E +++    R  P     V L       F   D    VE    +E Y++ ++ S+
Sbjct: 310 SANKMEPLWTG--FRSSPSLSLGVPLHHLISSVF---DLQQLVEG--HQEPYLLYWDQSR 362

Query: 418 GKVYALLKDQAKSDDILKAAFHAYVL 443
            +V  +L   A S+ IL+AA H  VL
Sbjct: 363 NQVQVVLSQMAASETILRAATHGLVL 388


>gi|357505287|ref|XP_003622932.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
 gi|355497947|gb|AES79150.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
          Length = 590

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 132/236 (55%), Gaps = 10/236 (4%)

Query: 135 VVPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINW 193
           VV D F P  V  +YV ++ W+ L   F  A+ V  TQ +  ++GV  + + P A A+NW
Sbjct: 110 VVRDFFIPREVNENYVDFVKWKLLHRVFSSALQVLATQAMFTAIGVGYSSSLPSAAALNW 169

Query: 194 ILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA 252
           +LKDG GR+ K I+ A     FD +LK++RF   +L     G+EL T   P  FL LA  
Sbjct: 170 VLKDGLGRLSKCIYTASLASAFDTNLKRVRFTTSVLFVGSIGLELLTPTFPRFFLLLATI 229

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI-----S 307
           AN+AK ++     +TR+ ++++FAK +N+G+++AK +      D+LG  L+ ++     +
Sbjct: 230 ANIAKQISLACYLATRSAVHQSFAKADNLGEISAKAQIQTVCFDILGLMLAALVNMWIGN 289

Query: 308 KRNPSL---ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            + P        + L +   +F  YQ +K V L TL + R  + ++++++ G+VPS
Sbjct: 290 HQRPQAGLHFFVYPLFAFMDLFGIYQGLKHVHLKTLTKDRLEIILKTWIEFGYVPS 345


>gi|344294431|ref|XP_003418921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Loxodonta africana]
          Length = 468

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 94/321 (29%), Positives = 167/321 (52%), Gaps = 7/321 (2%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           +S + +  +P GFPDSV+P Y+PY  W +++ F     G   TQ +L  +GV   +A+  
Sbjct: 70  LSGLLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGALATQAVLLGIGVGDAKASVS 129

Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A    W++KD  G +G++IFA  +G K D + KQ R   D+L ++   +E+    +P  F
Sbjct: 130 AATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVLPICF 189

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
               C +N+AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L 
Sbjct: 190 TITVCTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249

Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
           + +    PS+ +  F  L+  +++++Y+ V+++V+ TLN  R  + ++ FL+ G V S  
Sbjct: 250 LPLVSDCPSVSLGCFFFLTALHIYANYRAVRALVMETLNEGRLWLILKHFLQKGEVLSPT 309

Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
             N  E +++  W     + LG           + L       +E Y++ ++ S+ +V  
Sbjct: 310 SANQMEPLWTGFW-PSLSLSLGVPLH-CLISTVSELQQLIGGHQEPYLLHWDQSRNQVQV 367

Query: 423 LLKDQAKSDDILKAAFHAYVL 443
           +L   A  + +L+AA H  VL
Sbjct: 368 VLSQTAGPETVLRAATHGLVL 388


>gi|417401458|gb|JAA47615.1| Hypothetical protein [Desmodus rotundus]
          Length = 468

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 96/323 (29%), Positives = 166/323 (51%), Gaps = 11/323 (3%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           +S + +  +P GFPDSV+P Y+ Y  W +++ F     G   T  +L  +GV   +A+  
Sbjct: 70  LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129

Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A    WI+KD  G +G++IFA  +G K D + KQ R   D+L ++   +E+     P  F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCF 189

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
               C +N+AK V +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L 
Sbjct: 190 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249

Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
           + +    PS  +  F LL+  +V+++Y+ V+++V+ TLN  R  + ++ FL+ G V    
Sbjct: 250 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 309

Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKV 420
             N  E +++  W     + LG+          +   ++   E  +E Y++ ++ S+ +V
Sbjct: 310 SANQMEPLWTGFW-PSLSLSLGAPLHRLI---SSVFELQQLVEGHREPYLLHWDQSQNRV 365

Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
             +L   A  + IL+AA H  VL
Sbjct: 366 QVVLSQMAGPETILRAATHGLVL 388


>gi|307191039|gb|EFN74793.1| UPF0420 protein C16orf58-like protein [Camponotus floridanus]
          Length = 825

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 90/337 (26%), Positives = 163/337 (48%), Gaps = 22/337 (6%)

Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
           Q+    ++S ++   +P G+PDSV   Y  Y  W  ++      M    T +++  +GV 
Sbjct: 394 QSICSRIVSIVKEIFLPQGYPDSVHSDYTAYQIWDTIQASASTIMNTLATHSIMIGIGVG 453

Query: 181 RNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELAT 239
            + ATP A AI WILK+GAG VG ++FA   G K D   K+ R   D+L ++ AG+EL  
Sbjct: 454 ESNATPLAAAITWILKNGAGMVGSIMFAWWNGTKLDGQCKKWRLFADILNDMAAGMELLV 513

Query: 240 AAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIA 295
           +      + + C + + K++  V   +TR  + +  A   N+ DV AK      CV  IA
Sbjct: 514 SYFSSYSVVILCISTIMKSMVGVAGGATRVALTQHQAIRNNLADVAAKDGSQEMCVNLIA 573

Query: 296 DLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
             +G  +  +   R   LI  +  L   +++++Y  VK + L  LN  R ++ ++ ++  
Sbjct: 574 SFIGIFILSLFHDR-QYLIELYLFLVAVHLYANYSAVKVLCLDILNEDRLALIIKYYMIN 632

Query: 356 GHVPSLREGNLQENIFSFPWLRDKP--------VVLGSRFKDAF-QDPDAYLAVE---PF 403
             +P  +  N +E++     L D P        + +G  F     ++  + + +E    F
Sbjct: 633 ERIPEPKRLNKEESVS----LLDNPTKSICGFNIKIGVSFASVLRKNVISSMEMEFLLKF 688

Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
           FE  +Y++  +     ++ +LK   ++ +IL+A F+A
Sbjct: 689 FEDRKYLIAIDVKNRNIFVILKKDIQATEILEAYFYA 725


>gi|145349487|ref|XP_001419164.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579395|gb|ABO97457.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 269

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/233 (33%), Positives = 123/233 (52%), Gaps = 13/233 (5%)

Query: 141 PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAG 200
           P SVT  Y  Y  +R  +      + VF TQ +L +VG+  +R  P A A+NW+LKDG G
Sbjct: 24  PSSVTNDYWEYAKFRFAQRIASSCISVFATQQMLTAVGLGASRTLPAAAAVNWVLKDGLG 83

Query: 201 RVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNV 259
           R+GK+ + A  G+ FD D+K+ RF   ++ +  + +E+ T   P  FLPLA  AN+ K+V
Sbjct: 84  RLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYFPQHFLPLATVANIGKSV 143

Query: 260 AAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK----------- 308
              T+   R PI + F   EN+ +V AK      +AD +G  L++  +K           
Sbjct: 144 GITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLALAVGAAKAVSKVATVRPE 203

Query: 309 -RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            R    +  F  L+   +   Y+E+K+V L T+N+ R  +  E+F+K   +PS
Sbjct: 204 IRRALPVFAFGPLAALDLICIYKELKAVQLRTINKERAEIIAEAFVKEREIPS 256


>gi|417401377|gb|JAA47577.1| Hypothetical protein [Desmodus rotundus]
          Length = 464

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 96/323 (29%), Positives = 166/323 (51%), Gaps = 11/323 (3%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           +S + +  +P GFPDSV+P Y+ Y  W +++ F     G   T  +L  +GV   +A+  
Sbjct: 70  LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129

Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A    WI+KD  G +G++IFA  +G K D + KQ R   D+L ++   +E+     P  F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCF 189

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
               C +N+AK V +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L 
Sbjct: 190 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249

Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
           + +    PS  +  F LL+  +V+++Y+ V+++V+ TLN  R  + ++ FL+ G V    
Sbjct: 250 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 309

Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKV 420
             N  E +++  W     + LG+          +   ++   E  +E Y++ ++ S+ +V
Sbjct: 310 SANQMEPLWTGFW-PSLSLSLGAPLHRLI---SSVFELQQLVEGHREPYLLHWDQSQNRV 365

Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
             +L   A  + IL+AA H  VL
Sbjct: 366 QVVLSQMAGPETILRAATHGLVL 388


>gi|168022981|ref|XP_001764017.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162684756|gb|EDQ71156.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 437

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 92/332 (27%), Positives = 163/332 (49%), Gaps = 21/332 (6%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           I S  +P+G+P+SVTP Y+P+ TW  L+        + +TQ LL  +GV    AT     
Sbjct: 49  ILSAFLPEGYPNSVTPDYLPFQTWDTLQGLSTYIRSMLSTQALLGGIGVGETTATVVGAT 108

Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
             W ++D  G +G ++F   QG   D   KQ R   DLL ++G  ++L +   P  F+ +
Sbjct: 109 FLWFMRDFTGMLGSILFTLHQGSNLDSSAKQWRLTADLLNDVGMLMDLVSPLFPGAFVTI 168

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR 309
            C  ++A++V  V S +TR  + + FA  +N  DV+AK       A ++G  L +++++ 
Sbjct: 169 LCIGSMARSVTGVASGATRAALTQHFALRKNAADVSAKEGSQETAATMVGMLLGMVLARL 228

Query: 310 NP----SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
                 +L  +F LL+  +++++Y+ V+S+ L +LN  R S+ + +F +   VP+ RE  
Sbjct: 229 TAENVVALWVSFLLLTAFHMYANYRAVRSLCLTSLNAERTSIVLSAFKEGRKVPTPREVA 288

Query: 366 LQENIF----SFPWLRD--------KPVVLGSRFKDAFQDP--DAYLAVEPFFEKERYVV 411
            QE++       PW +         K +  G +      D    ++L++   + ++ Y++
Sbjct: 289 DQEHLLPRIPRLPWQKQRAGDAWLPKSIRFGVQISSLKLDDGWKSFLSLVERYSRDPYLM 348

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
              P    V+ +L   A   D L    HA  L
Sbjct: 349 V--PKGAIVHVVLHKLATPQDFLHGYVHALCL 378


>gi|452823170|gb|EME30183.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
          Length = 519

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 95/329 (28%), Positives = 163/329 (49%), Gaps = 16/329 (4%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
            +++  + S  +P+    +V+  Y  + T+  L++     +GV +TQ LL +VG+S ++ 
Sbjct: 160 RKILDLLNSSFLPENCKKTVSKDYFWFTTFNFLQNVVNCCIGVISTQQLLRAVGLSASQG 219

Query: 185 TPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
              + ++NW+LKDG GR+G ++F    G +FD D K+ R  GD+L   G G E+ +  +P
Sbjct: 220 IGKSASLNWVLKDGIGRLGAIVFGSFIGNRFDADPKRWRLWGDILYAFGIGTEIISPLLP 279

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNI-ADLLG 299
             FL +A  AN+ K  + +    T   I ++FAK EN GD++AK    E + N+    LG
Sbjct: 280 RYFLLVASLANMVKATSYMLRLPTTAAIRRSFAKRENFGDISAKANSQEVLSNLFGTFLG 339

Query: 300 TGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
            G SI++      L   + +    +   +Y  VK + L TLN  R  +   S+ K+  V 
Sbjct: 340 IGFSIILGHSWWQLSVAYFVYFIMFFLLNYLGVKGLHLRTLNLQRSLLLGWSYWKSSQVS 399

Query: 360 SLREGNLQEN-----IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYN 414
           ++ E N +EN     I SF     + +  G   +D         A+   + + +Y++T  
Sbjct: 400 TVEEVNYRENVILPSIMSF----GRYIRFGVSLRDIHLTGSELAALLDKYRQAKYLLTVR 455

Query: 415 PSKGKVYALLKDQAKSDDILKAAFHAYVL 443
              G+VY +   +A+  D L+A   A  L
Sbjct: 456 --HGQVYIVFHMEARVKDELQAILQATFL 482


>gi|326527127|dbj|BAK04505.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 181/377 (48%), Gaps = 27/377 (7%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W  L+        + +TQ LL ++GV    AT       W L
Sbjct: 67  VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 126

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G +G ++FA  QG   D + K  R   D + +LG  ++L +   P   + + C  +
Sbjct: 127 RDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 186

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK--RNPS 312
           ++++   V S +TR  + + FA   N  D++AK      +A + G GL ++++   R   
Sbjct: 187 LSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMSGMGLGMLLAHVTRGHD 246

Query: 313 LI--TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           L+   +F  L+  +++++Y+ V+S+ L TLN  R S+ ++ F++ G V + ++ + QE+I
Sbjct: 247 LVVWVSFLSLTVFHMYANYKAVQSLSLSTLNYERTSILLQYFMEHGEVLTPQQVSKQEHI 306

Query: 371 FSF--PW-------LRDKPVVLGSRFKDAFQDPDAYLAVE--PFFEKERYVVTYNPSKGK 419
             F   W       L  + V LG++        D  L  +   ++    Y +      G 
Sbjct: 307 LPFWSSWRKLLRVKLPHELVNLGAK-ASMLTHSDMLLIAKTRSYYTNANYFLL--DKDGS 363

Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEY--DNSNLMLTTADLEA 477
           V   +  QA + D+LK+  H  VL   +  S   +S  ++ H++  +  N  ++   +E 
Sbjct: 364 VCIFIHKQAVATDVLKSFIHGLVLARLVPKS---KSCHMEAHQWMDEKYNTFISKLKVEG 420

Query: 478 HIGE---SCKVVSTALW 491
           +  E   S  +V  A W
Sbjct: 421 YSTERLLSHSIVWKAHW 437


>gi|42562046|ref|NP_172832.3| uncharacterized protein [Arabidopsis thaliana]
 gi|28393739|gb|AAO42280.1| unknown protein [Arabidopsis thaliana]
 gi|28973405|gb|AAO64027.1| unknown protein [Arabidopsis thaliana]
 gi|332190945|gb|AEE29066.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 440

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 106/372 (28%), Positives = 180/372 (48%), Gaps = 28/372 (7%)

Query: 107 FKRNSIRAISLQSPQAPAEELMSFIRSYV---VPDGFPDSVTPSYVPYMTWRALKHFFGG 163
           FK  +I A S  S Q  A       R  +   VP+GFP SVTP YV +  W  L+     
Sbjct: 26  FKTATITASSSLSIQRSANRFNHVWRRVLQAFVPEGFPGSVTPDYVGFQLWDTLQGLSTY 85

Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLR 222
              + +TQ LL+++GV    AT       W L+D  G +G ++F   QG   D + K  R
Sbjct: 86  TKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWR 145

Query: 223 FAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
              DL+ ++G  ++L +   P  F+ + C  +++++   V S +TR  + + FA  +N  
Sbjct: 146 LVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAA 205

Query: 283 DVTAKGECVGNIADLLGTGLSIMISK---RNPSLI-TTFALLSCGYVFSSYQEVKSVVLH 338
           D++AK      +A ++G  L +++++    NP  I  +F  L+  +++++Y+ V+ +VL+
Sbjct: 206 DISAKEGSQETMATMMGMSLGMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLN 265

Query: 339 TLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPW---LRD-------KPVVLGSRFK 388
           +LN  R S+ +  F++TG V S  + +  E +    W   LR        K V LG R  
Sbjct: 266 SLNFERSSILLTHFIQTGQVLSPEQVSSMEGVLPL-WATSLRSTNSKPLHKRVQLGVRVS 324

Query: 389 -----DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                D  Q  +   A    ++  +Y++ +   KG V  +L   +K  D+LK+  HA VL
Sbjct: 325 SLPRLDMLQLLNGVGASS--YKNAKYLLAH--IKGNVSVILHKDSKPADVLKSYIHAIVL 380

Query: 444 LHCINSSNGHRS 455
            + +  S    S
Sbjct: 381 ANLMEKSTSFYS 392


>gi|114662190|ref|XP_001159530.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan troglodytes]
 gi|397472000|ref|XP_003807550.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan paniscus]
 gi|410212432|gb|JAA03435.1| chromosome 16 open reading frame 58 [Pan troglodytes]
 gi|410264128|gb|JAA20030.1| chromosome 16 open reading frame 58 [Pan troglodytes]
 gi|410292910|gb|JAA25055.1| chromosome 16 open reading frame 58 [Pan troglodytes]
 gi|410340381|gb|JAA39137.1| chromosome 16 open reading frame 58 [Pan troglodytes]
          Length = 468

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +P AP+  L S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+++V+ TLN  R  + ++ +L+ G V      N  E +++  W    P   
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    IL+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|207080128|ref|NP_001128778.1| DKFZP468D197 protein [Pongo abelii]
 gi|55727564|emb|CAH90537.1| hypothetical protein [Pongo abelii]
          Length = 466

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 182/361 (50%), Gaps = 15/361 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     +  + +P AP+  L S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDSGEVGAPGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR---NPSLITTFALLSCGY 324
           R  +    A+  N+ DV+AK      + +L+G  +S+++       P L   F  L+  +
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCPGL-GCFFFLTALH 269

Query: 325 VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG 384
           ++++Y+ V+++V+ TLN  R  + ++ +L+ G V +    N  E +++  W    P    
Sbjct: 270 IYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLNPTAANRMEPLWTGFW----PAPSL 325

Query: 385 SRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYV 442
           S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    IL+AA H  +
Sbjct: 326 SLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGLM 385

Query: 443 L 443
           L
Sbjct: 386 L 386


>gi|327286048|ref|XP_003227743.1| PREDICTED: UPF0420 protein C16orf58-like [Anolis carolinensis]
          Length = 420

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 99/322 (30%), Positives = 155/322 (48%), Gaps = 41/322 (12%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           F  +  +P G+P+SV+P Y+ Y  W  ++ F     G   TQ +L  VGV    +T  A 
Sbjct: 48  FFTAVFLPQGYPESVSPDYLAYQIWDTVQAFASSITGTLATQAVLKGVGVGDETSTVTAA 107

Query: 190 AINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
            + WILKDG G +G+++FA  +G K D D KQ R   D+L ++   +E+   A P  F  
Sbjct: 108 TVTWILKDGTGMLGRILFAWSKGSKLDCDAKQWRLFADVLNDVAILMEILAPAFPACFTL 167

Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIM 305
           + C +   K +  V   +TR  +    A+ +N+ DV+AK    E + N+A LL + L I 
Sbjct: 168 IVCISGFFKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLFSLLLIP 227

Query: 306 ISKRNPSLI-TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
           +   N  L  + + L +  +++++Y+ V++V + TLNRAR  + ++SFLK+G VP     
Sbjct: 228 LVADNLRLTYSLYGLFTILHLYANYRAVRAVCMETLNRARLCLVLQSFLKSGEVPG---- 283

Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNP---SKGKVY 421
                                        PD     EP    +  +  Y P   S G V 
Sbjct: 284 -----------------------------PDVTNPQEPLLLVDFSITIYVPWSSSTGTVS 314

Query: 422 ALLKDQAKSDDILKAAFHAYVL 443
            LL  Q +S D++KA  HA +L
Sbjct: 315 ILLHQQRRSVDVIKACSHALLL 336


>gi|426381987|ref|XP_004057610.1| PREDICTED: UPF0420 protein C16orf58 homolog [Gorilla gorilla
           gorilla]
          Length = 468

 Score =  142 bits (357), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +P AP+  L S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLILPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+++V+ TLN  R  + ++ +L+ G V      N  E +++  W    P   
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    IL+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|402908238|ref|XP_003916859.1| PREDICTED: UPF0420 protein C16orf58 homolog [Papio anubis]
          Length = 468

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 181/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +P+A A   +S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPRA-ASPPLSGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+S+V+ TLN  R  + ++ +L+ G V      N  E +++  W    P   
Sbjct: 271 HIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ +N S+ +V  +L   A    +L+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWNQSRNQVQVVLNQNAGPKTVLRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|218194530|gb|EEC76957.1| hypothetical protein OsI_15243 [Oryza sativa Indica Group]
          Length = 836

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 94/304 (30%), Positives = 159/304 (52%), Gaps = 27/304 (8%)

Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRF 223
           A G      LL +VG+ +  A P A A+NW+LKDG G + K++ ++ G+ FD + K  R 
Sbjct: 468 AKGEAQGMALLYAVGLGKG-AIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRL 526

Query: 224 AGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGD 283
             DLL     G+E+ T   PHLF+P+  AA   ++ AA+   +TR+  Y  FA   N  +
Sbjct: 527 FADLLENTAYGLEILTPVFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAE 586

Query: 284 VTAKGECVGNIADLLGTGLSIMISKRNPSLIT----TFALLSCGYVFSSYQEVKSVVLHT 339
           V AKGE  G ++  LG  L I ++ R  S ++    +FA ++  +++ + +  +S+ L T
Sbjct: 587 VIAKGEAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRT 646

Query: 340 LNRARFSVAVESFLKTGHVPSLREGNLQENIF--------------SFPWLRD------K 379
           LN  R S+    +L +G VPS++E N +E +F                P  +D      +
Sbjct: 647 LNPYRASLVFSEYLLSGQVPSVKEVNDEEPLFLNLSVGTSRKESKILSPQAKDAAEIICR 706

Query: 380 PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFH 439
            + LGS+  +  ++ +   A+   ++ E+Y++T    KGK   +LK+ +  +D+LK+ FH
Sbjct: 707 RLQLGSKLSEIIENKEDACALFDLYKNEQYLLT--EYKGKFCVILKEGSSPEDMLKSLFH 764

Query: 440 AYVL 443
              L
Sbjct: 765 VNYL 768



 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 64/170 (37%), Positives = 95/170 (55%), Gaps = 9/170 (5%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++PDG+PDSV+  Y+ Y  WR ++       GV +TQ LL +VG+ +  A P A A+NW+
Sbjct: 321 LLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 379

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K++ ++ G+ FD + K  R   DLL     G+E+ T   PHLF+P+  AA 
Sbjct: 380 LKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPHLFVPIGAAAG 439

Query: 255 VAKNVAAVTSTSTRTPIYKAF--AKGENIGDVTAKGECVGNIADLLGTGL 302
             ++ AA+  T     I+  F      ++  V AKGE  G +A L   GL
Sbjct: 440 AGRSAAALIQT-----IFAHFMLENANHMRKVIAKGEAQG-MALLYAVGL 483


>gi|410984724|ref|XP_003998676.1| PREDICTED: UPF0420 protein C16orf58 homolog [Felis catus]
          Length = 468

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 94/326 (28%), Positives = 167/326 (51%), Gaps = 17/326 (5%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           +S + +  +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV   +A+  
Sbjct: 70  LSSLIAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129

Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A    W++KD  G +G++IFA  +G K D + KQ R   D+L ++   +E+     P  F
Sbjct: 130 AATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFF 189

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
               C +N+AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L 
Sbjct: 190 TITICTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249

Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
           + +    P   +  F  L+  +++++Y+ V+++V+ TLN  R  + ++ FL+ G V    
Sbjct: 250 LPLVSDCPGFSLGCFFFLTALHIYANYRAVRALVIETLNEGRLRLVLKHFLQRGEVLGPT 309

Query: 363 EGNLQENIFSFPWLRDKPVVLG---SRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSK 417
             N  E +++  W   + + LG    R   +  D      ++   E  +E Y++ ++ S+
Sbjct: 310 SANQMEPLWTGFW-PSQSLSLGVPLHRLTSSVFD------LQQLVEGHQEPYLLRWDLSQ 362

Query: 418 GKVYALLKDQAKSDDILKAAFHAYVL 443
            +V  +L   A  + IL+AA H  VL
Sbjct: 363 NQVQVVLSQMAGPETILRAATHGLVL 388


>gi|334332835|ref|XP_001370394.2| PREDICTED: UPF0420 protein C16orf58 homolog [Monodelphis domestica]
          Length = 442

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 95/345 (27%), Positives = 172/345 (49%), Gaps = 27/345 (7%)

Query: 114 AISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTL 173
           ++S ++   P + L + + +  +P GFPDSV+P Y+PY  W  ++ F     G   TQ +
Sbjct: 31  SLSWEAGGGPGQGLRAALTAVFLPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAV 90

Query: 174 LNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELG 232
           L  VGV    A+  A  I W+LKDG G +G++ FA   G + D + K+ R   D+L +  
Sbjct: 91  LQGVGVGDAAASVSAATITWLLKDGTGMLGRIGFAWLMGSRLDCEAKKWRLFADVLNDAA 150

Query: 233 AGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---E 289
             +E+     P  F  + C +++AK +  V   +TR  +    A+ +N+ DV+AK    E
Sbjct: 151 MFLEIVAPGFPSCFTLIVCVSSLAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQE 210

Query: 290 CVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
            + N+A LL + L + +    PSL +  F L +  +++++++ V++V++ T N AR  + 
Sbjct: 211 TLVNLAGLLVSLLLLPLVSNRPSLSLAAFFLFTILHLYANFRAVRAVIMETFNEARLYLV 270

Query: 349 VESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG--SRFKDAFQDPDAYLAVEPFFE- 405
           ++ FL+ G V +    N           R +P+  G  S    +   P   L +   FE 
Sbjct: 271 LKHFLQWGKVLTPPSAN-----------RMEPLWTGCHSALSLSLGVPLHSL-ISSVFEL 318

Query: 406 -------KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                  ++ Y++ ++  +G++   L  +  +  IL+AA H  +L
Sbjct: 319 QQVSEGHQQPYLLRWDRCRGQIQVTLGQEVSTGVILRAATHGLLL 363


>gi|291410995|ref|XP_002721793.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
          Length = 468

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 94/333 (28%), Positives = 170/333 (51%), Gaps = 23/333 (6%)

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
           A   ++ +++  +P GFP+SV+P Y+PY  W +++ F     G   TQ +L  +GV   +
Sbjct: 66  ASRTLAGLQAVFLPQGFPESVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAK 125

Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           AT  A    W++KD  G +G++IFA  +G K D + KQ R   D+L ++   +E+     
Sbjct: 126 ATVSAATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDIAMFLEIMAPVF 185

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLG 299
           P  F      +N+AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL 
Sbjct: 186 PVCFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLV 245

Query: 300 TGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
           + L + +    PS+ +  F LL+  +++++Y+ V+++V+ TLN +R  + ++ FL+ G V
Sbjct: 246 SLLMLPLVSGCPSVSLGCFFLLTALHIYANYRAVRALVMETLNESRLRLVLKHFLQQGEV 305

Query: 359 PSLREGNLQENIFSFPWLRDKPVV--------LGSRFKDAFQDPDAYLAVEPFFEKERYV 410
                 N  E +++       P +        L S   +  Q  D +        +E Y+
Sbjct: 306 LDPTPANQMEPLWTG--FSPSPALSLGVPLHCLVSSMSELQQLIDGH--------QEPYL 355

Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           + ++ S+ +V  +L   A  + +L+AA H  VL
Sbjct: 356 LCWDQSRNQVQVVLSQMAGPETVLRAATHGLVL 388


>gi|426254633|ref|XP_004020981.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ovis aries]
          Length = 462

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 166/323 (51%), Gaps = 11/323 (3%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           +S + +  +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV   +A+  
Sbjct: 64  LSRLLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVS 123

Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A    W++KD  G +G+++FA   G K D + KQ R   D+L ++   +E+    +P  F
Sbjct: 124 AATATWLVKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCF 183

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
               C +N+AK +  V   +TR  +    A+  N+ DV+AK    E + N+A LL + L 
Sbjct: 184 TVTVCISNLAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLM 243

Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
           + +   +PSL +  F  L+  +++++Y+ V+++V+ TLN  R  + +  FL+ G V    
Sbjct: 244 LPLVSDSPSLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQWGEVLGPT 303

Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKV 420
             N  E +++  W     + LG+          +   ++   E  +E Y++ ++ S+ +V
Sbjct: 304 SANRMEPLWTGFW-SSVSLSLGAPLHCV---TSSVSELQHLVEGHQEPYLLHWDQSRNQV 359

Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
             +L   A  + +L+AA H  VL
Sbjct: 360 QVVLSPMAGPEAVLRAATHGLVL 382


>gi|301780856|ref|XP_002925848.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ailuropoda
           melanoleuca]
 gi|281353232|gb|EFB28816.1| hypothetical protein PANDA_015415 [Ailuropoda melanoleuca]
          Length = 468

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 93/323 (28%), Positives = 165/323 (51%), Gaps = 11/323 (3%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           +S + +  +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV   +A+  
Sbjct: 70  LSGLMAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGNAKASVS 129

Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A    W++KD  G +G+++FA  +G K D + KQ R   D+L ++   +E+     P  F
Sbjct: 130 AATATWLVKDSTGMLGRILFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFF 189

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
               C +N+AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L 
Sbjct: 190 TMTVCTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249

Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
           + +    PS  +  F  L+  +++++Y+ V+++VL TLN  R  + ++ FL+ G V    
Sbjct: 250 LPLVSACPSFSLGCFFFLTALHIYANYRAVRALVLETLNEGRLRLVLKHFLQRGEVLGPT 309

Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKV 420
             N  E +++  W     + LG           +   ++   E  +E Y++ ++ S+ +V
Sbjct: 310 SANQMEPLWTGFW-PSLSLSLGVPLHCMI---SSVFELQQLVEGHQEPYLLRWDKSQRQV 365

Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
             +L   A  + IL+AA H  VL
Sbjct: 366 QVVLSQMAGPETILRAATHGLVL 388


>gi|355756739|gb|EHH60347.1| hypothetical protein EGM_11687 [Macaca fascicularis]
          Length = 468

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +P+A A   +S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGGDAGEVGAPRA-ASPPLSGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+S+V+ TLN  R  + ++ +L+ G V      N  E +++  W    P   
Sbjct: 271 HIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    +L+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTVLRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 IL 388


>gi|195035521|ref|XP_001989226.1| GH11606 [Drosophila grimshawi]
 gi|193905226|gb|EDW04093.1| GH11606 [Drosophila grimshawi]
          Length = 417

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 90/325 (27%), Positives = 155/325 (47%), Gaps = 14/325 (4%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   +++  +P G+PDSV+  YV Y  W  L+ F     G   T  +L  +GV       
Sbjct: 45  LFRVLQNIFLPKGYPDSVSDDYVVYQIWDTLQAFCSTICGTLCTHAILKGIGVGSENINA 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
            +  + WILK+G+G VG+++FA  QG + D D K+ R   D L +L  G+E+      PH
Sbjct: 105 YSATVTWILKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
               + C + + K +  V   +TR+ + +  A   N+ DV +K      CV  +A  +G 
Sbjct: 165 FSTQILCGSTMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            L  MI K    L T F ++   +++++ + V+S+ L T N +R+ +A+E F ++ H+ +
Sbjct: 225 YLLTMI-KSQAMLYTIFYIVVTLHLYANLKAVRSICLRTFNESRYLIALEEFFRSSHMLT 283

Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
            ++ N  E +     +     + LG   K    +  +   +E     F   ER+++  + 
Sbjct: 284 PQQVNKMERVTVGQTVSVSLNIRLGLSVKHLIDEYKSSSVIENIVSSFDPHERFIIAESK 343

Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
               VY  L    +  D+LKA F A
Sbjct: 344 KHLGVY--LHFDTRPQDVLKAYFFA 366


>gi|148905902|gb|ABR16113.1| unknown [Picea sitchensis]
          Length = 446

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 103/417 (24%), Positives = 195/417 (46%), Gaps = 47/417 (11%)

Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
           D+SG F  +  R   L        E+   + +  +P+GFP+SVT  Y+P+  W +L+   
Sbjct: 34  DKSGGFTIHRERGRML--------EVWGKLSAAFLPEGFPNSVTSDYLPFQIWDSLQGLS 85

Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQ 220
                + +TQ LL+++GV    AT       W L+D  G +G ++F   QG   D + K 
Sbjct: 86  TYIRCMLSTQALLSAIGVGETTATVVGATFQWFLRDFTGMLGGILFTLYQGSNLDSNAKM 145

Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
            R   DL+ +LG  ++L +   P  F+ + C  ++A++   V S +TR  + + FA   N
Sbjct: 146 WRLIADLMNDLGMLMDLVSPLYPASFVIILCLGSIARSFTGVASGATRAALTQHFALQHN 205

Query: 281 IGDVTAKGECVGNIADLLGTGLSIMISKRNP----SLITTFALLSCGYVFSSYQEVKSVV 336
             D++AK      +A ++G  L ++++        ++ ++F LL+  ++F++YQ V+ + 
Sbjct: 206 AADISAKEGSQETMATMIGMLLGMLLAHVTAGHPGAMWSSFLLLTFFHMFANYQAVRCLC 265

Query: 337 LHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQ---- 392
           L++LN  R SV ++ F+  G V + ++ +  E +    W++ + +  G  F   F+    
Sbjct: 266 LNSLNEGRTSVLLKCFITNGQVLNPKDVSSMEYLLP-NWMQLRMLTKGRPFNATFKLGAK 324

Query: 393 -------DPDAYLAVE-PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLL 444
                  D    L +    + K +Y++      G +  +L   + S DIL++  H  VL 
Sbjct: 325 ISSLSQSDMSKLLKLSASCYGKGKYLLLCR--NGTIEVILNKDSVSADILQSYIHGLVLS 382

Query: 445 HCINSS-------NGHRSLALKQHE------------YDNSNLMLTTADLEAHIGES 482
           + + SS       N    L ++++             ++   L++ + D  AH  ES
Sbjct: 383 YHMGSSSSVNDANNADSCLWMEKYYSMFISKQLQASGWNTGRLLVPSVDWRAHWVES 439


>gi|302564091|ref|NP_001181525.1| UPF0420 protein C16orf58 [Macaca mulatta]
 gi|380815644|gb|AFE79696.1| hypothetical protein LOC64755 [Macaca mulatta]
          Length = 468

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +P+A A   +S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPRA-ASPPLSGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+S+V+ TLN  R  + ++ +L+ G V      N  E +++  W    P   
Sbjct: 271 HIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    +L+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTVLRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|355710161|gb|EHH31625.1| hypothetical protein EGK_12728 [Macaca mulatta]
          Length = 468

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +P+A A   +S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPRA-ASPPLSGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+S+V+ TLN  R  + ++ +L+ G V      N  E +++  W    P   
Sbjct: 271 HIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    +L+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTVLRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|242036507|ref|XP_002465648.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
 gi|241919502|gb|EER92646.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
          Length = 440

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 90/323 (27%), Positives = 159/323 (49%), Gaps = 17/323 (5%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W  L+        + +TQ LL ++GV    AT       W L
Sbjct: 64  VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATFQWFL 123

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G +G ++F   QG   D + K  R   D + +LG  ++L +   P   + + C  +
Sbjct: 124 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 183

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK----RN 310
           ++++   V S +TR  + + FA  +N  D++AK      +A ++G GL ++++       
Sbjct: 184 LSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMIGMGLGMLLAHITRGHA 243

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
            S+ T+F  L+  +++++Y+ V+S+ L TLN  R S+ ++ F + G V + ++ + QE+I
Sbjct: 244 LSVWTSFLSLTVFHMYANYKAVQSLSLTTLNYERASILLQYFKECGDVLAPQKVSQQEHI 303

Query: 371 FSF--PW-------LRDKPVVLGSRFKDAFQDPDAYLA-VEPFFEKERYVVTYNPSKGKV 420
             F   W       L  + V LG++           +A     +E   Y++     +G V
Sbjct: 304 LPFWSTWRKLNKIKLPHERVHLGAKASMLTHSDMLVIAKTRSHYENTNYLLL--NKQGSV 361

Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
           Y  +   A   D+L++  H  VL
Sbjct: 362 YIFIHKLATPADVLRSFVHGLVL 384


>gi|195996945|ref|XP_002108341.1| hypothetical protein TRIADDRAFT_19061 [Trichoplax adhaerens]
 gi|190589117|gb|EDV29139.1| hypothetical protein TRIADDRAFT_19061, partial [Trichoplax
           adhaerens]
          Length = 272

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 84/270 (31%), Positives = 142/270 (52%), Gaps = 13/270 (4%)

Query: 133 SYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAIN 192
           S  +P G+P+SV+  Y+ Y  W  L+ F     G   TQ +L  VGV   +AT  A A+ 
Sbjct: 1   SVFLPQGYPESVSGDYLTYQLWDTLQAFCSSITGTLATQAILKGVGVGDEKATVAAAAMT 60

Query: 193 WILKDGAGRVGKMIFA-RQGKKFDYDLKQL------RFAGDLLMELGAGVELATAAVPHL 245
           W+LKDG G +G+++FA  +G   D D K++      R   D L +    +E+     PHL
Sbjct: 61  WMLKDGTGMIGRILFAWLRGTSLDCDAKKMLIFISYRLFADFLNDGALFIEMIAPNFPHL 120

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGL 302
           F  L C A+VAK++  V   +TR  +    A+  N+ DV+AK    E + N++ L+  GL
Sbjct: 121 FTLLLCVASVAKSIVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLSALI-VGL 179

Query: 303 SIM--ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            ++  ++     + T F + +  +++++Y+ V SVV+ TLNR R  +   +F + G + S
Sbjct: 180 FLLPAVTGNARIIWTLFFMFTMLHLYANYKAVSSVVMETLNRERLHILAYNFFRYGSILS 239

Query: 361 LREGNLQENIFSFPWLRDKPVVLGSRFKDA 390
             E   +E++          ++LG+ FK++
Sbjct: 240 PAEVCQRESVIIMKDQYKLRMILGAEFKES 269


>gi|332262940|ref|XP_003280516.1| PREDICTED: UPF0420 protein C16orf58 homolog [Nomascus leucogenys]
          Length = 468

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +P A +  L S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPGASSPPL-SGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+++V+ TLN  R  + ++ +L+ G V      N  E +++  W    P   
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A   +IL+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKNILRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|308807048|ref|XP_003080835.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116059296|emb|CAL55003.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
           tauri]
          Length = 1023

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 77/233 (33%), Positives = 123/233 (52%), Gaps = 13/233 (5%)

Query: 141 PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAG 200
           P SVT  Y  Y  +R  +      + VF TQ +L +VG+  +R  P A A+NW+LKDG G
Sbjct: 587 PTSVTSDYWEYARYRFAQRIASSCISVFATQQMLTAVGLGASRTLPAAAAVNWVLKDGLG 646

Query: 201 RVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNV 259
           R+GK+ + A  G+ FD D+K+ RF   ++ +  + +E+ T   P  FLPLA  AN+ K+V
Sbjct: 647 RLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYFPQHFLPLATVANIGKSV 706

Query: 260 AAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITT--- 316
              T+   R PI + F   EN+ +V AK      +AD +G  L++  +K    + T    
Sbjct: 707 GITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLALAVGAAKGMSKVATVRPE 766

Query: 317 ---------FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
                    F  L+   +   Y+E+K+V L T+N+ R  +  E+++K   +P+
Sbjct: 767 IRRALPLFAFGPLAVLDLICIYKELKAVQLRTINKERAEIIAEAYVKEREIPT 819


>gi|14424560|gb|AAH09308.1| C16orf58 protein [Homo sapiens]
          Length = 468

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 181/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +  AP+  L S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDAGEVGASGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVCPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+++V+ TLN  R  + ++ +L+ G V      N  E +++  W    P   
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    IL+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|351711359|gb|EHB14278.1| hypothetical protein GW7_05117 [Heterocephalus glaber]
          Length = 469

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 87/323 (26%), Positives = 167/323 (51%), Gaps = 11/323 (3%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           +S +++  +P GFP+SV+P Y+PY  W +++ F         T +++  +GV    A+  
Sbjct: 71  LSGLKAVFLPQGFPESVSPDYLPYQQWDSVQAFASSLCSALATHSVMLGLGVGNAEASLA 130

Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A    W+++D  G +G++IFA  +G K D + KQ R   D+L +    +EL    +   F
Sbjct: 131 AATATWLVRDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDAAMLLELVAPVLSVSF 190

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
             +   +N+AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL + + 
Sbjct: 191 TMIVSISNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLIM 250

Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
           + +    P L +  F LL+  +++++Y+ V+++V+ TLN +R  + ++ FL++G V    
Sbjct: 251 LPLMSNCPRLSLGCFFLLTALHIYANYRAVRTLVMETLNESRLRLVLKHFLQSGEVLDTT 310

Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKV 420
             N  E +++  W     + LG           +   ++   E  +E Y++ +N S+ +V
Sbjct: 311 SANQMEPLWTGIW-PSPSLSLGVPLHCVV---SSVSELQQLVEGHQEPYLLCWNQSENQV 366

Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
             +L   A  + +L+AA H  +L
Sbjct: 367 QVVLSQMAGPEAVLRAATHGLLL 389


>gi|66818028|ref|XP_642707.1| DUF647 family protein [Dictyostelium discoideum AX4]
 gi|74861464|sp|Q86K80.1|U420_DICDI RecName: Full=UPF0420 protein
 gi|60470803|gb|EAL68775.1| DUF647 family protein [Dictyostelium discoideum AX4]
          Length = 527

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 85/298 (28%), Positives = 149/298 (50%), Gaps = 26/298 (8%)

Query: 96  NYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRS-------------YVVPDGFPD 142
           N + E+++S       I    + SP    ++ ++F+ +               +P+G+PD
Sbjct: 15  NCIIEKNQSNSI----INKFQVASPYDDYDDRLNFLETIEKNSGFYNVLCELFLPNGYPD 70

Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRV 202
           SVT  Y  Y  W +++       G   T+ +L   GV  + AT  +    W+++DG G +
Sbjct: 71  SVTTDYFGYQFWDSIQALCSTITGTLATRAILKGYGVGDSSATVASATTQWLIRDGMGMI 130

Query: 203 GKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAANVAKNVA 260
           G+++FA R+G   D + K+ R+  D+L  +G   E+ +      LFLPL+C   +AK++ 
Sbjct: 131 GRIVFAWRKGTDLDCNSKKWRYTADILNNIGMAFEMISPLFSSQLFLPLSCIGLIAKSIC 190

Query: 261 AVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS---KRNPSLITT- 316
            V    T+  + + FAK +N+ DV+AK        +L+G  LS+++S     N SLI T 
Sbjct: 191 GVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAVNLVGMLLSVIVSSFINDNTSLIVTW 250

Query: 317 --FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT-GHVPSLREGNLQENIF 371
             F   +  ++F +Y+ V +V L ++NR R  +  + F+   G +PS  E +  ENI 
Sbjct: 251 LVFLFFTSLHLFCNYRAVSAVQLKSINRYRAYLIYDYFIHNQGSIPSPSEISKLENIL 308


>gi|255081626|ref|XP_002508035.1| predicted protein [Micromonas sp. RCC299]
 gi|226523311|gb|ACO69293.1| predicted protein [Micromonas sp. RCC299]
          Length = 665

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 84/245 (34%), Positives = 131/245 (53%), Gaps = 13/245 (5%)

Query: 140 FP--DSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
           FP  +SVT  Y  Y  +R L+      + VF TQ +L ++G+  +R  P A AINW+LKD
Sbjct: 199 FPARESVTEDYWEYAKFRFLQRMASSCITVFATQQMLAAIGMGASRRLPAAAAINWVLKD 258

Query: 198 GAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA 256
           G GR+GK+ + A  G++FD D+K+ RF   ++ +  + VE+ T   P  FL LA  AN+ 
Sbjct: 259 GLGRLGKLGVAANFGREFDSDVKRFRFTSSVVYDASSLVEMITPFFPKKFLALATLANIG 318

Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG-------TGLSIMISKR 309
           K+V   T+   R PI ++F   EN+ +V AK      +AD LG       TGL+  +   
Sbjct: 319 KSVGITTANVVRAPIQRSFVLEENLAEVAAKTSAQQVVADNLGLAVAVLATGLTGKVVND 378

Query: 310 NPSLITTFAL---LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
              LI  F     L+   ++  Y+E+K+V L T+N+ R  +  + ++  G VP  +  + 
Sbjct: 379 RARLIIPFVAFIPLATMDLYCIYRELKAVQLTTINKERAEIICDHWINVGTVPGTKYVSD 438

Query: 367 QENIF 371
            E +F
Sbjct: 439 AERLF 443


>gi|257900512|ref|NP_073581.2| UPF0420 protein C16orf58 [Homo sapiens]
 gi|143955305|sp|Q96GQ5.2|CP058_HUMAN RecName: Full=UPF0420 protein C16orf58
 gi|62897385|dbj|BAD96633.1| hypothetical protein FLJ13868 variant [Homo sapiens]
 gi|119572504|gb|EAW52119.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
 gi|119572505|gb|EAW52120.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
 gi|119572506|gb|EAW52121.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
          Length = 468

 Score =  138 bits (347), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 181/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +  AP+  L S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDAGEVGASGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+++V+ TLN  R  + ++ +L+ G V      N  E +++  W    P   
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    IL+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|10436021|dbj|BAB14730.1| unnamed protein product [Homo sapiens]
          Length = 468

 Score =  138 bits (347), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 181/362 (50%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +  AP+  L S +++  +P GFPDSV+P Y
Sbjct: 35  EVGGWRWWGLS---RAFTVKPEGRDAGEVGASGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+++V+ TLN  R  + ++ +L+ G V      N  E +++  W    P   
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    IL+AA H  
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|297844278|ref|XP_002890020.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335862|gb|EFH66279.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 440

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 179/372 (48%), Gaps = 28/372 (7%)

Query: 107 FKRNSIRAISLQSPQAPAEELMSFIRSYV---VPDGFPDSVTPSYVPYMTWRALKHFFGG 163
           FK  +I A +  S Q  A       R  +   VP+GFP SVTP YV +  W  L+     
Sbjct: 26  FKTATITASASLSIQRSANRFNHVWRRVLQAFVPEGFPGSVTPDYVGFQFWDTLQGLSTY 85

Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLR 222
              + +TQ LL+++GV    AT       W L+D  G +G ++F   QG   D + K  R
Sbjct: 86  IKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWR 145

Query: 223 FAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
              DL+ ++G  ++L +   P  F+ + C  +++++   V S +TR  + + FA  +N  
Sbjct: 146 LVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAA 205

Query: 283 DVTAKGECVGNIADLLGTGLSIMISK---RNPSLIT-TFALLSCGYVFSSYQEVKSVVLH 338
           D++AK      +A ++G  L +++++    NP  I  +F  L+  +++++Y+ V+ +VL+
Sbjct: 206 DISAKEGSQETMATMMGMSLGMLLARFTSGNPMAIWFSFLSLTVFHMYANYRAVRCLVLN 265

Query: 339 TLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPW----------LRDKPVVLGSRFK 388
           +LN  R S+ +  F++TG V S  + +  E +    W          L  K V LG R  
Sbjct: 266 SLNFERSSILLTHFMQTGQVLSPEQVSSMEGVLPV-WATSLRSTNSKLLHKRVHLGVRVS 324

Query: 389 -----DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                +  Q  +   A    ++  +Y++ +   KG V  +L   + + D+LK+  HA VL
Sbjct: 325 SLPRLEMMQLLNGVGASS--YKNAKYLLAHR--KGNVSVILHKDSSAADVLKSYIHAIVL 380

Query: 444 LHCINSSNGHRS 455
            + +  S    S
Sbjct: 381 ANLMEKSTSFYS 392


>gi|328869935|gb|EGG18310.1| DUF647 family protein [Dictyostelium fasciculatum]
          Length = 407

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/347 (27%), Positives = 167/347 (48%), Gaps = 28/347 (8%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
            +  +R   +P G+P+SVT  Y  Y TW  L+       GV  T+ +L   GV    A+ 
Sbjct: 41  FIEVLRDMFLPKGYPNSVTSDYTGYQTWDTLQALCSSLTGVLATRAILKGYGVGNLNASV 100

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PH 244
            + +I WI +D AG +G++IFA + G   D + K  RFA DLL + G  +E+ +    P 
Sbjct: 101 ASASIQWITRDAAGMLGRIIFAWKWGTDLDCNSKTWRFAADLLNDFGMTLEMISPLFGPS 160

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGT 300
            FLP++C   ++K++  V    T+  + + FAK +N+ DV+AK       +  +  LLG 
Sbjct: 161 YFLPISCLGLISKSICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAIALVGMLLGM 220

Query: 301 GLSIMISKRNPSLITTFA------LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
            ++  I   + S+   F       + +  ++  +Y+ V  V L ++NR R ++  + F +
Sbjct: 221 VMAASIPDDDDSIQVYFKTWLFFIIFTVIHLLCNYRAVSLVQLTSINRYRANLIYQHFNE 280

Query: 355 TGH--VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVT 412
             +  +P+ ++ +L+ENI       D  + LG  F D   D      V    + ++Y++ 
Sbjct: 281 NNNNQIPTPKQISLKENILFRQ--SDLSIELGVSFTDINGDLK---RVSGSDKDDKYMIW 335

Query: 413 YNPSKGKVYAL---LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSL 456
            N  + K + +   +   A S DI++  F +  L+H      GH  L
Sbjct: 336 SNNKQKKNHLIKIAISSTATSSDIIEGYFESITLVH------GHSKL 376


>gi|427789495|gb|JAA60199.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 464

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 102/340 (30%), Positives = 167/340 (49%), Gaps = 32/340 (9%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L +F     +P G+PDSV+  Y  Y  W +++ F     G   TQ++L+ VGV    A+ 
Sbjct: 41  LRAFFSEIFLPVGYPDSVSRDYTEYQIWDSIQAFASSITGTLATQSVLSGVGVGDQGASV 100

Query: 187 GAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
            A    WIL+DG G  G+++FA  QG   DYD K+ R   D+L ++   +EL    +   
Sbjct: 101 LAATTTWILRDGTGMTGRILFAWFQGSNLDYDSKKWRLFADILNDVAIFLELMCQYMKGY 160

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGL 302
              + C ++VAK++  V   +TR  + +  A+  N+ DV+AK    E + N+A  L + L
Sbjct: 161 VTAVLCVSSVAKSIVGVAGGATRAALTQHQARNNNMADVSAKDGSQETLVNLAAFLFSLL 220

Query: 303 SIMISKRNPSLI----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT-GH 357
            + I   NP L+     +F LL   ++F++Y+ V  VV+ T NR+R+++ V  FL+T G 
Sbjct: 221 LLRIVAGNPWLLYGIFISFTLL---HIFANYRAVSCVVMETFNRSRYAIVVRRFLETAGD 277

Query: 358 VPSLREGNLQENI------------FSFPWLRDKPV--VLGSRFKDAFQDPDAYLAVEPF 403
           +  +   N +E++            F  P     P    L  R +DA  D      VEP 
Sbjct: 278 IAPVTWVNARESVWIQCGKPFVSINFGVPLFSIAPTNKELQCRVEDAAGDRK----VEPA 333

Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           +     VV  +     +  +L  ++   D ++A FHA++L
Sbjct: 334 YLLA--VVRDSSRHYNINIVLSSESTVKDHMEAMFHAFLL 371


>gi|256077542|ref|XP_002575062.1| hypothetical protein [Schistosoma mansoni]
          Length = 410

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/230 (32%), Positives = 124/230 (53%), Gaps = 7/230 (3%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           ++ + +P G+P SV+  Y+ Y  W  ++ F     G   +Q +L  VGV  + AT  + +
Sbjct: 41  LKKFFLPVGYPSSVSDDYLEYQIWDTIQAFASSITGALASQAVLIGVGVGDSSATILSAS 100

Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           + W+ KDG+G +G++IFA   G K D D K  RF  D+L +    +E+ +    +LF PL
Sbjct: 101 LTWMFKDGSGMIGRIIFAGYHGIKLDCDCKFWRFVADILNDCALFLEIISPLFNYLFTPL 160

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMI 306
            C ANV K++  V  ++TR  I +  A   N+ DV+AK    E + N+   L   + + +
Sbjct: 161 LCLANVLKSLVGVAGSATRAAIVQHQAINNNLADVSAKDGSQETLSNLMAWLLNFILLYM 220

Query: 307 SKRNPSLITTFALLSCG--YVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
              N  LI  F  + C   +++S+Y+ VK + L T NR RF +A++ + K
Sbjct: 221 VTGNQFLI-WFCFICCTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQWFK 269


>gi|449017769|dbj|BAM81171.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 827

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 103/357 (28%), Positives = 166/357 (46%), Gaps = 49/357 (13%)

Query: 135 VVPDGF------PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGA 188
           ++P+GF      P  V+  Y+PY     +++    A  V  TQ++L ++GV++      +
Sbjct: 338 LLPEGFGDKTRFPPVVSDDYLPYTVLHFMQNLVHAAASVIATQSMLIAIGVAKQATIGVS 397

Query: 189 VAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
            A+ W+LKDG GR+  +IF    G ++D D K+ R  GD+L  LG G E+AT  VP  FL
Sbjct: 398 AAVQWVLKDGLGRIAAIIFGSLLGNRYDADPKRWRMRGDMLYTLGVGTEIATRLVPQYFL 457

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT----GLS 303
            +   AN  K+V+ +    T   I + F+  EN GD++AK      ++ LLGT    GLS
Sbjct: 458 LVGTMANAIKSVSYMMRLPTAAAIRRHFSLRENFGDLSAKANSQEVLSMLLGTFVGIGLS 517

Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP--SL 361
            MI     +L   + +    ++  +Y  ++ + + TLN  R  + +  +  +G V   S+
Sbjct: 518 YMIDNSLKALFAFYIIYVQMFILFNYWSLRVLRMRTLNLQRTLIVMRRYWNSGGVETCSV 577

Query: 362 REGNLQENIFSFPWLRDKP---VVLGSRFKDAFQ-------------------------- 392
              N  EN F  P  R  P   V  G R  +AF+                          
Sbjct: 578 AMANQAEN-FLLP-SRLGPARRVRFGCRLGEAFRTVGEFEEAIARQRLWRGQYLQQTDVS 635

Query: 393 ----DPDA-YLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLL 444
               + DA + +V      ERY++  +  +G+V+ +L   AK  D L+A  HA  L+
Sbjct: 636 KRNGNADADFTSVPDDVLDERYLLGVDAKRGRVFVVLHRDAKVADELQALMHATYLV 692


>gi|195387984|ref|XP_002052672.1| GJ17681 [Drosophila virilis]
 gi|194149129|gb|EDW64827.1| GJ17681 [Drosophila virilis]
          Length = 417

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 89/325 (27%), Positives = 154/325 (47%), Gaps = 14/325 (4%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   +++  +P G+PDSV+  Y  Y  W  ++ F     G   T  +L  +GV       
Sbjct: 45  LFRVLQNIFLPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNA 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATA-AVPH 244
            +  + WILK+G+G +G+++FA  QG + D D K+ R   D L +L  G+E+      PH
Sbjct: 105 YSATVTWILKEGSGHLGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
           L   + C + + K +  V   +TR  + +  A   N+ DV +K      CV  IA  +G 
Sbjct: 165 LSTQILCGSTMLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            L  MI K    L T F ++   +++++ + V++V L T N +R+ +A+E F ++ H+ +
Sbjct: 225 YLLTMI-KSQAMLYTVFYIVVTLHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSHMLT 283

Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
            ++ N  E +     +     + LG   K    +  +   +E     F   ER+++  + 
Sbjct: 284 PQQVNKLERVTVGQMVSVSLNIRLGLSVKHLIDEYKSSNVIENIVSSFDPHERFIIAESK 343

Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
               VY  L    +  D+LKA F A
Sbjct: 344 KYLGVY--LHFDTRPQDVLKAYFFA 366


>gi|260815171|ref|XP_002602347.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
 gi|229287656|gb|EEN58359.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
          Length = 244

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 81/244 (33%), Positives = 128/244 (52%), Gaps = 5/244 (2%)

Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
           +S  +P G+P+SV+  Y+ Y  W  ++ F     G   TQ +L  VGV    AT     I
Sbjct: 1   QSVFLPQGYPESVSDDYLSYQIWDTVQAFCSSITGTLATQAVLKGVGVGDETATAAGATI 60

Query: 192 NWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA 250
            WILKDG G +G+++FA  QG   D D K+ R   D++ +    +EL   A P  F  + 
Sbjct: 61  TWILKDGTGMLGRILFAWLQGTSLDCDAKRWRLFADIVNDFSIFLELVAPAFPLFFTGIV 120

Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMIS 307
           C A V K +  V+  +TR  +    A+ +N+ DV+AK    E + N+  LL   +   + 
Sbjct: 121 CVAGVGKAIVGVSGGATRAALTMHQARRDNMADVSAKDGSQETLVNLMALLAGLVVTPMV 180

Query: 308 KRNPSLIT-TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
             NP L    F + +C ++F++Y  V  VV+ TLN++R  + V  +L+TG V +  + N+
Sbjct: 181 ADNPLLTWFLFFVFTCLHLFANYNAVTCVVMETLNQSRLHILVSHYLRTGVVLNPAQANV 240

Query: 367 QENI 370
           QE +
Sbjct: 241 QEPV 244


>gi|156379270|ref|XP_001631381.1| predicted protein [Nematostella vectensis]
 gi|156218420|gb|EDO39318.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 87/258 (33%), Positives = 134/258 (51%), Gaps = 10/258 (3%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           ++S  +P G+P+SV+  Y+ Y  W  ++ F     G   TQ +L   GV    AT  A  
Sbjct: 1   LQSAFLPQGYPESVSKDYLTYQFWDTVQAFCSSITGTLATQAMLKGYGVGNESATALAAT 60

Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           + WIL+ G G VG ++FA  QG   D + K+ R   D++ +L   VEL +      F  L
Sbjct: 61  MTWILRSGTGMVGSILFAWMQGSNLDCNAKKWRLFADIMNDLAILVELISPYFKAYFTLL 120

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMI 306
           AC ++VAK++  V   STR  + +  A+ +N+ DV AK    E + N+  LL  GL I  
Sbjct: 121 ACLSSVAKSIVGVAGGSTRAALTQHQARRDNMADVAAKDGSQETLVNLMALLA-GLVITP 179

Query: 307 SKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE- 363
                +L+T   F + +  +++++Y  V SVV+ TLN  R  +    FL+TGHV S  E 
Sbjct: 180 LVTGHTLLTWVLFIIFTFLHLYANYCAVSSVVMETLNTTRLQIVTNEFLRTGHVLSPDEV 239

Query: 364 GNLQENIF--SFPWLRDK 379
           G ++  IF     W+ +K
Sbjct: 240 GKMEPVIFGKCCHWVYEK 257


>gi|47217805|emb|CAG07219.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 412

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 83/267 (31%), Positives = 142/267 (53%), Gaps = 6/267 (2%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
            +S  +P G+PDSV+  Y+ Y  W  L+ F     G   T+  L  VGV   +AT  A  
Sbjct: 75  FQSVFLPQGYPDSVSSDYLQYQFWDTLQAFSSSLSGTLATRASLRGVGVGNQQATVAAAT 134

Query: 191 INWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
             W+LKDG G  G+++FA  +G K D + K+ R   D+L ++   +++     P  F  +
Sbjct: 135 TTWLLKDGTGMCGRILFAWLKGTKLDSEAKKWRLFADILNDIAMFMDILAPYFPAFFTLI 194

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMI 306
            C + V K++  V   +TR  +    A+ +N+ D++AK    E + N+A LL +   +  
Sbjct: 195 MCVSGVFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLALVSF 254

Query: 307 SKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
              N +L ++ F L +  ++F++Y+ V+SVV+ TLN AR S+ ++ +LK G + S  E N
Sbjct: 255 VADNAALTLSLFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLQEYLKDGRMLSPLEAN 314

Query: 366 LQENIFSFPWLRDKPVVLGSRFKDAFQ 392
            +E +    + +  P+ LG R ++  Q
Sbjct: 315 QREPVL-MEFKKTVPIKLGVRLQEIVQ 340


>gi|270013094|gb|EFA09542.1| hypothetical protein TcasGA2_TC011650 [Tribolium castaneum]
          Length = 450

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 160/344 (46%), Gaps = 14/344 (4%)

Query: 113 RAISLQS-PQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQ 171
           R IS +S   +  + ++ F R  ++P G+PDSV+  Y  Y  W  ++ F     G+FTTQ
Sbjct: 33  REISQESVADSVLQSVLGFFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTTQ 92

Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLME 230
            +L  VGV    A+  +  I WI+KDG G VG++IFA  +GK  D + K+ R   DLL +
Sbjct: 93  AILKGVGVGSAEASALSATITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLND 152

Query: 231 LGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG-- 288
               +EL+   V HL   + C + + K +  V   +TR  I    A   N+ ++ AK   
Sbjct: 153 GAMILELSIPHVTHLSTEILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDGS 212

Query: 289 --ECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFS 346
               V  IA   G  L    +      I TF L+   +++S+YQ V +++   LN  R +
Sbjct: 213 QETLVNLIASFTGIFLINYFTDATTQWIVTFLLIFL-HLYSNYQAVTALIFKNLNNLRMT 271

Query: 347 VAVESFLKTGHVPSLREGNLQENIF-----SFPWLRDKPVVLGSRFKDAFQ--DPDAYLA 399
           + ++S+L    V      N  E++F     S        +VLG       +  + D   +
Sbjct: 272 LILKSYLAVNAVLGPECINKTESVFLGFGTSVEKFCGFKIVLGQALGKTLRRYNSDELKS 331

Query: 400 VEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           +   ++ ++Y++  +     +Y   +      +++   +HA +L
Sbjct: 332 LTEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGFYHAVLL 375


>gi|297283910|ref|XP_002802508.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Macaca
           mulatta]
          Length = 429

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 89/315 (28%), Positives = 160/315 (50%), Gaps = 11/315 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFPDSV+P Y+PY  W +++ F     G   TQ +L  +GV   +AT  A    W++
Sbjct: 39  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 98

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KD  G +G+++FA  +G K D + KQ R   D+L ++   +E+     P  F      +N
Sbjct: 99  KDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 158

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNP 311
           +AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L + +    P
Sbjct: 159 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 218

Query: 312 SL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
              +  F  L+  +++++Y+ V+S+V+ TLN  R  + ++ +L+ G V      N  E +
Sbjct: 219 GFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 278

Query: 371 FSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQA 428
           ++  W    P    S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A
Sbjct: 279 WTGFW----PAPSLSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKA 334

Query: 429 KSDDILKAAFHAYVL 443
               +L+AA H  +L
Sbjct: 335 GPKTVLRAATHGLML 349


>gi|189241392|ref|XP_968556.2| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Tribolium
           castaneum]
          Length = 472

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 160/344 (46%), Gaps = 14/344 (4%)

Query: 113 RAISLQS-PQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQ 171
           R IS +S   +  + ++ F R  ++P G+PDSV+  Y  Y  W  ++ F     G+FTTQ
Sbjct: 33  REISQESVADSVLQSVLGFFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTTQ 92

Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLME 230
            +L  VGV    A+  +  I WI+KDG G VG++IFA  +GK  D + K+ R   DLL +
Sbjct: 93  AILKGVGVGSAEASALSATITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLND 152

Query: 231 LGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG-- 288
               +EL+   V HL   + C + + K +  V   +TR  I    A   N+ ++ AK   
Sbjct: 153 GAMILELSIPHVTHLSTEILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDGS 212

Query: 289 --ECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFS 346
               V  IA   G  L    +      I TF L+   +++S+YQ V +++   LN  R +
Sbjct: 213 QETLVNLIASFTGIFLINYFTDATTQWIVTFLLIFL-HLYSNYQAVTALIFKNLNNLRMT 271

Query: 347 VAVESFLKTGHVPSLREGNLQENIF-----SFPWLRDKPVVLGSRFKDAFQ--DPDAYLA 399
           + ++S+L    V      N  E++F     S        +VLG       +  + D   +
Sbjct: 272 LILKSYLAVNAVLGPECINKTESVFLGFGTSVEKFCGFKIVLGQALGKTLRRYNSDELKS 331

Query: 400 VEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           +   ++ ++Y++  +     +Y   +      +++   +HA +L
Sbjct: 332 LTEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGFYHAVLL 375


>gi|7339490|emb|CAB82813.1| putative protein [Arabidopsis thaliana]
          Length = 631

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 172/378 (45%), Gaps = 58/378 (15%)

Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
           +P+    +  + +  +++P+GFP+SVT  Y+ Y  WR ++       GV  TQ+LL +VG
Sbjct: 182 TPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVG 241

Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
           + +  A P A AINW+LKDG G + K++ ++ G+ FD   K  R   DLL     G+E+ 
Sbjct: 242 LGKG-AIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEML 300

Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV-------------- 284
           T   P  F+ +  AA   ++ AA+   +TR+     FA   N  +V              
Sbjct: 301 TPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVYNNFYMALVLITYQ 360

Query: 285 -------------TAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQE 331
                         A+G    ++  LLG  ++  I       +  F +++  +++++ + 
Sbjct: 361 QLFVFLNYSGSLGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKS 420

Query: 332 VKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR-------------- 377
            + + L TLN  R S+    +L +G  P ++E N +E +  FP +R              
Sbjct: 421 YQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPL--FPTVRFSNMKSPEKLQDFV 478

Query: 378 ------------DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLK 425
                       ++ + LGS+  D   + +  +A+   +  E Y++T +  KG+   +LK
Sbjct: 479 LSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEH--KGRFCVMLK 536

Query: 426 DQAKSDDILKAAFHAYVL 443
           + +   D+L++ F    L
Sbjct: 537 ESSTPQDMLRSLFQVNYL 554


>gi|170037702|ref|XP_001846695.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167880979|gb|EDS44362.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 417

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 77/250 (30%), Positives = 131/250 (52%), Gaps = 7/250 (2%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L  F ++  +P G+PDSV+  Y+ Y  W  ++ F     G  TT  +L  VGV  + A P
Sbjct: 45  LKRFFQNLFLPAGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANP 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PH 244
            +  I W+LKDG G  G+++FA  +G + D D K+ R   D+L ++  GV+L    + P 
Sbjct: 105 LSATITWVLKDGMGHFGRILFAWWKGTELDIDSKKWRIRADILNDIAMGVDLFVLPLYPK 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
           +   + CA +  K +  V   +TR+ + +  A   N+ DV +K      CV  IA  +G 
Sbjct: 165 VSTYILCATSTMKAIVGVAGGATRSALTQHHALRGNLADVASKDSAQETCVNLIASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
           GL   +  +   L   F  ++  +++++ + VK+V L T N AR+ +A+E + K+G + S
Sbjct: 225 GLLTHVQNQRV-LYGLFLFVTLLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGTMLS 283

Query: 361 LREGNLQENI 370
            ++ N  E +
Sbjct: 284 PQQVNKLERV 293


>gi|296233362|ref|XP_002761979.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Callithrix
           jacchus]
          Length = 468

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 94/362 (25%), Positives = 175/362 (48%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +P A A   +S +++  +P GFPDSV+P Y
Sbjct: 35  EVRGWRWWGLS---RAFTVKPEGRDAGEVGAPGA-ASPPLSGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   T  +L  +GV   +AT  A    W++KD  G +G++IFA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIIFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKGECVGNIADLLG----TGLSIMISKRNPSLITTFALLSCG 323
           R  +    A+  N+ DV+AK      + +L G      +  ++S         F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGFPGFSFGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+++V+ TLN  R  + ++ +L+ G V      N  E +++  W     + L
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTTANRMEPLWTGFW-PTPSLSL 329

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
           G           +   ++   E  +E Y++ ++ S+ +V  +L  +A    +L+AA H  
Sbjct: 330 GVPLHRLV---SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKADPKTVLRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|297825247|ref|XP_002880506.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326345|gb|EFH56765.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 519

 Score =  135 bits (339), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 90/294 (30%), Positives = 148/294 (50%), Gaps = 27/294 (9%)

Query: 103 RSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGF----------------PDSVTP 146
           +SGK  R  I+  SL+      EE  S      + DGF                P  V+ 
Sbjct: 70  KSGKVSRYFIKGDSLELLCVDEEEDDSTSFCLGLDDGFWKLIRLSSSAAKDFFLPKQVSD 129

Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMI 206
           +Y+ Y+ W+ L   F  A+ V  TQ +  ++G+ ++R+   + A NWILKDG GR+ + I
Sbjct: 130 NYISYVKWKFLHRVFSSALQVLATQVMFRAIGIGQSRSLASSAAFNWILKDGLGRLSRCI 189

Query: 207 F-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTST 265
           + A     FD +LK++RF+  +L  L  GVEL T   P  FL LA  AN+AK ++     
Sbjct: 190 YTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQYFLLLASIANIAKQISLSCYL 249

Query: 266 STRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKRNPSL-----ITT 316
           +T + ++++FA  +N+G+V+AK +    C  N+  LL   L+++  + N  L        
Sbjct: 250 ATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAVLLNMLF-QNNQRLQASLPFVL 308

Query: 317 FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           + + S   +   YQ +K + L TL + R  + +E +++   VPS  E + +E I
Sbjct: 309 YPIFSTFDLLGIYQGLKHINLQTLTKDRLEIILERWIEFRQVPSPAEVSEEEGI 362


>gi|79561141|ref|NP_179928.2| uncharacterized protein [Arabidopsis thaliana]
 gi|51970902|dbj|BAD44143.1| unnamed protein product [Arabidopsis thaliana]
 gi|330252365|gb|AEC07459.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 520

 Score =  135 bits (339), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 90/294 (30%), Positives = 148/294 (50%), Gaps = 27/294 (9%)

Query: 103 RSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGF----------------PDSVTP 146
           +SGK  R  I+  SL+      EE  S      + DGF                P  V+ 
Sbjct: 71  KSGKVSRYFIKGDSLELLCVDEEEDDSTSFCLGLDDGFWKLIRLTSSAAKDFFLPKQVSD 130

Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMI 206
           +Y+ Y+ W+ L   F  A+ V  TQ +  ++G+ ++R+   + A NWILKDG GR+ + I
Sbjct: 131 NYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSLASSAAFNWILKDGLGRLSRCI 190

Query: 207 F-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTST 265
           + A     FD +LK++RF+  +L  L  GVEL T   P  FL LA  AN+AK ++     
Sbjct: 191 YTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQYFLLLASIANIAKQISLSCYL 250

Query: 266 STRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKRNPSL-----ITT 316
           +T + ++++FA  +N+G+V+AK +    C  N+  LL   L+++  + N  L        
Sbjct: 251 ATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAVLLNMLF-QHNQRLQACLPFVL 309

Query: 317 FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           + + S   +   YQ +K + L TL + R  + +E +++   VPS  E + +E I
Sbjct: 310 YPIFSTFDLLGIYQGLKHINLQTLTKDRLEIILERWIEFRQVPSPAEVSEEEGI 363


>gi|224094527|ref|XP_002310175.1| predicted protein [Populus trichocarpa]
 gi|222853078|gb|EEE90625.1| predicted protein [Populus trichocarpa]
          Length = 333

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 85/276 (30%), Positives = 142/276 (51%), Gaps = 32/276 (11%)

Query: 151 YMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIF-AR 209
           Y+ W+ L   F  A+ V  TQ +  ++G+  +R+ P A A+NW+LKDG GR+ + I+ A 
Sbjct: 3   YVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPSAAALNWVLKDGLGRLSRCIYTAS 62

Query: 210 QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRT 269
               FD +LK++RF   +L  L  G+EL T   P  FL LA  AN+AK ++     +T +
Sbjct: 63  LASAFDTNLKRVRFTTSVLFSLSIGIELLTPTFPQYFLLLATLANIAKQISLACYLATGS 122

Query: 270 PIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS---KRNPSLITTFALLSCGY-- 324
            ++++FA  +NIG+V+AK +      D LG  L+ +++   K N     +F  +S  +  
Sbjct: 123 AVHRSFAIADNIGEVSAKAQIQTVSFDNLGLMLAALLNMLFKNNQRFSLSFPTMSFCFCL 182

Query: 325 ------------VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS---------LRE 363
                       +F  YQ ++ V L TL + R  + + S++  GHVPS         LRE
Sbjct: 183 PFFVYPIFSAIDLFGIYQGLQHVHLQTLTKDRLEIILNSWIDFGHVPSPAERGILLCLRE 242

Query: 364 G----NLQENIFSFPWLRDKPVVLGSRFKDAFQDPD 395
           G    ++   +    ++R K ++  S ++ A +D D
Sbjct: 243 GSGTADVIMGLLQACYIR-KALLFSSMWETAVEDKD 277


>gi|403276814|ref|XP_003930078.1| PREDICTED: UPF0420 protein C16orf58 homolog [Saimiri boliviensis
           boliviensis]
          Length = 468

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 94/362 (25%), Positives = 177/362 (48%), Gaps = 15/362 (4%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           ++ G RW  ++   R+   K     A  + +P A A   +S +++  +P GFPDSV+P Y
Sbjct: 35  EVRGWRWWGLS---RAFTVKPEGRDAGEVGAPGA-ASPPVSGLQAVFLPQGFPDSVSPDY 90

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD  G +G+++FA
Sbjct: 91  LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
             +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + +V   +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210

Query: 268 RTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL----ITTFALLSCG 323
           R  +    A+  N+ DV+AK      + +L G  +S+++              F  L+  
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGFPGFSFGCFFFLTAL 270

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
           +++++Y+ V+++V+ TLN  R  + ++ +L+ G V      N  E +++  W     + L
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTTANRMEPLWTGFW-PTPSLSL 329

Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
           G           +   ++   E  +E Y++ ++ S+ +V  +L  +A    +L+AA H  
Sbjct: 330 GVPLHRLV---SSVSELQQLVEGHQEPYLLCWDQSQNQVQVVLNQKADPKTVLRAATHGL 386

Query: 442 VL 443
           +L
Sbjct: 387 ML 388


>gi|8920582|gb|AAF81304.1|AC027656_21 Contains similarity to CG10338 gene product from Drosophila
           melanogaster gb|AE003660 [Arabidopsis thaliana]
          Length = 438

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 95/337 (28%), Positives = 166/337 (49%), Gaps = 25/337 (7%)

Query: 139 GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDG 198
           GFP SVTP YV +  W  L+        + +TQ LL+++GV    AT       W L+D 
Sbjct: 59  GFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDF 118

Query: 199 AGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAK 257
            G +G ++F   QG   D + K  R   DL+ ++G  ++L +   P  F+ + C  ++++
Sbjct: 119 TGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSR 178

Query: 258 NVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK---RNPSLI 314
           +   V S +TR  + + FA  +N  D++AK      +A ++G  L +++++    NP  I
Sbjct: 179 SFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLARFTSGNPMAI 238

Query: 315 -TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF 373
             +F  L+  +++++Y+ V+ +VL++LN  R S+ +  F++TG V S  + +  E +   
Sbjct: 239 WLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEGVLPL 298

Query: 374 PW---LRD-------KPVVLGSRFK-----DAFQDPDAYLAVEPFFEKERYVVTYNPSKG 418
            W   LR        K V LG R       D  Q  +   A    ++  +Y++ +   KG
Sbjct: 299 -WATSLRSTNSKPLHKRVQLGVRVSSLPRLDMLQLLNGVGASS--YKNAKYLLAH--IKG 353

Query: 419 KVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRS 455
            V  +L   +K  D+LK+  HA VL + +  S    S
Sbjct: 354 NVSVILHKDSKPADVLKSYIHAIVLANLMEKSTSFYS 390


>gi|255542032|ref|XP_002512080.1| conserved hypothetical protein [Ricinus communis]
 gi|223549260|gb|EEF50749.1| conserved hypothetical protein [Ricinus communis]
          Length = 380

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 88/329 (26%), Positives = 164/329 (49%), Gaps = 17/329 (5%)

Query: 138 DGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
            GFP SVTP YVP+  W  L+        + +TQ LL+++GV    AT       W L+D
Sbjct: 2   QGFPSSVTPDYVPFQVWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRD 61

Query: 198 GAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA 256
             G +G ++F   QG   D + K  R   DL+ +LG  ++L +   P  F+ + C  +++
Sbjct: 62  LTGMLGGILFTFYQGSDLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVVVVCLGSLS 121

Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK---RNP-S 312
           ++   V S +TR  + + FA   N  D++AK      +A + G  L +++++    +P +
Sbjct: 122 RSFTGVASGATRAALTQHFALQSNAADISAKEGSQETVATMTGMALGMLLARITIGHPLA 181

Query: 313 LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI-- 370
           +  +F  L+  +++++Y+ V+ +VL +LN  R S+ ++ F++TG V S  + +  E+I  
Sbjct: 182 IWFSFLSLTMFHMYANYRAVRCLVLTSLNMQRSSILLQHFMETGQVLSPEQVSGMEHILP 241

Query: 371 -FSFPW------LRDKPVVLGSRFKDA-FQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
            ++  W      L    + LG R      Q+    L      +K +Y++     KG +  
Sbjct: 242 RWTNFWRTKHVKLLHSRMCLGVRVSSLDHQELMELLHSAGSHKKAKYLLM--ERKGTINI 299

Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSN 451
           L+   + + D+L++  HA V+   +  +N
Sbjct: 300 LIHKNSTTSDVLQSFIHALVMAKLMEENN 328


>gi|403368796|gb|EJY84239.1| hypothetical protein OXYTRI_18022 [Oxytricha trifallax]
          Length = 504

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 79/247 (31%), Positives = 128/247 (51%), Gaps = 9/247 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P+SV   Y+ +  +  L      AM   + Q+L  ++G +  +A   A A  W+L
Sbjct: 98  LPVGYPNSVAEGYLKFSIFNNLSALSITAMSFLSAQSLFVAIGSTMTQANLAAAAYTWVL 157

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KDG G++G ++FA R G+ FD D+K+ RF   + + +   +E+ T   P+ FL LA  AN
Sbjct: 158 KDGIGQLGAILFASRYGRNFDEDIKKWRFMAMIALNISIYIEILTLRFPNHFLALASIAN 217

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK-----R 309
           + KN+  + S ++R  I   FAK  NIGD++ K       + L G G+ +M+SK      
Sbjct: 218 IGKNICFLLSAASRASINVQFAKRNNIGDISGKSVSQFTASTLCGVGIGLMLSKLIDISS 277

Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG--HVPSLREGNLQ 367
              L   F +L+   + +SYQ  K +    LN  R  +    + K+   +  ++ EGN+Q
Sbjct: 278 ISQLYPVFMILTILNIATSYQASKVIDEVYLNNQRAFIIFNEYFKSNKKNFLTVEEGNIQ 337

Query: 368 ENIFSFP 374
           E IF  P
Sbjct: 338 E-IFYMP 343


>gi|194388018|dbj|BAG65393.1| unnamed protein product [Homo sapiens]
          Length = 422

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 88/315 (27%), Positives = 159/315 (50%), Gaps = 11/315 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFPDSV+P Y+PY  W +++ F     G   TQ +L  +GV   +AT  A    W++
Sbjct: 32  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 91

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KD    +G+++FA  +G K D + KQ R   D+L ++   +E+     P  F      +N
Sbjct: 92  KDSTDMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 151

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNP 311
           +AK + +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L + +    P
Sbjct: 152 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 211

Query: 312 SL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
              +  F  L+  +++++Y+ V+++V+ TLN  R  + ++ +L+ G V      N  E +
Sbjct: 212 GFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 271

Query: 371 FSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQA 428
           ++  W    P    S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A
Sbjct: 272 WTGFW----PAPSLSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKA 327

Query: 429 KSDDILKAAFHAYVL 443
               IL+AA H  +L
Sbjct: 328 GPKTILRAATHGLML 342


>gi|321455034|gb|EFX66180.1| hypothetical protein DAPPUDRAFT_302922 [Daphnia pulex]
          Length = 315

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 88/268 (32%), Positives = 142/268 (52%), Gaps = 25/268 (9%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
            + S  +P G+P SV+  Y+ Y  W  ++ F     G  +T+ +L  VGV  + +TP A 
Sbjct: 51  LLSSVFLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAA 110

Query: 190 AINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVEL----ATAAVP- 243
            + W++KDG G VG+++FA   G K D D K+ RF  DLL +    +EL    A AAV  
Sbjct: 111 TLMWLIKDGTGMVGRILFAWCNGTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGG 170

Query: 244 --HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLL 298
                  + C A VAK++  V   +TR  + +  A+  N+ DV+AK    E + N+A L+
Sbjct: 171 SNGTMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALI 230

Query: 299 GTGLSIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
            T L ++      + +T   F L +  +VF++ + VK+V + TLNRAR+ + ++ F  T 
Sbjct: 231 -TSLWLLPILDGATGLTWLMFILFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTK 289

Query: 357 HVPSLREGNLQENIFSFPWLRDKPVVLG 384
           +V S++E N           R +PV++G
Sbjct: 290 YVASVKEVN-----------RLEPVIIG 306


>gi|449017626|dbj|BAM81028.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 518

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 134/281 (47%), Gaps = 10/281 (3%)

Query: 123 PAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
           P       +++ ++P  +P +V+  Y  +  W  L+H    A  V  TQ++L ++GV   
Sbjct: 87  PENWPQRLVQTLLLPADYPSAVSSDYDRFWRWNILRHTLLEAAEVLGTQSMLLALGVGGA 146

Query: 183 RATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAA 241
            A P A A  WILKDG G   K+  A Q   + D D K+ RF GD +M LG   E+ T A
Sbjct: 147 SALPLAAAWKWILKDGLGYFAKVALATQLAPRVDNDPKRFRFYGDFVMALGTLFEMLTLA 206

Query: 242 VPHLFLPLACAANVAKNVAAV-TSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
            P  FL LA   N+ +  A V T  + R  +Y  F+   N GDV++K E    +  L G 
Sbjct: 207 FPPWFLALASTGNLLRKAADVATGPAYRVFLYH-FSIRNNSGDVSSKSESQVVVGRLSGI 265

Query: 301 GLSIMI----SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
           G  I +    S    +L+  + LLS G+++++YQ V  + L TLNR R    +  +  + 
Sbjct: 266 GAGIAVSAVTSHDASALVMAYGLLSAGHLYATYQSVIRLELRTLNRTRLEYVLNVYTASS 325

Query: 357 --HVPSLREGNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDP 394
              VPS+ E N  E   F  PW     +   +   +   DP
Sbjct: 326 GKRVPSIAEANAAEQKSFLRPWSHVYRIRFVNSLNELLSDP 366


>gi|414586414|tpg|DAA36985.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
          Length = 288

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 74/204 (36%), Positives = 114/204 (55%), Gaps = 5/204 (2%)

Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
           D +G   R SI    +Q  +   ++L+    + + P G+P SV   Y+ Y  +R L+HF 
Sbjct: 40  DAAGSGSR-SISVKVIQDSRPIHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFS 98

Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
              + V +TQ+LL + G+   R TP  A A++WILKDG    GK+I +  G + D + K 
Sbjct: 99  SAMLHVLSTQSLLFAAGL---RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKS 155

Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
            R   D+L +LG  +E+ +   P LFL +A   N AK +A V + +TR PIY +FAK  N
Sbjct: 156 WRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 215

Query: 281 IGDVTAKGECVGNIADLLGTGLSI 304
           + D+ AKGE +  + +++G G  I
Sbjct: 216 LSDLFAKGEAISTLFNVMGIGAGI 239


>gi|449452843|ref|XP_004144168.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
           [Cucumis sativus]
          Length = 437

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 165/332 (49%), Gaps = 29/332 (8%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W +L+        + +TQ LL+++GV    AT       W L
Sbjct: 56  VPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVIGATFQWFL 115

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G +G ++F   QG   D + K  R   D + +LG  ++L +   P  F+ + C  +
Sbjct: 116 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLVSPLFPSAFMLIVCLGS 175

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNP--- 311
           ++++   V S +TR  + + FA   N  D++AK      +A ++G  + +++++      
Sbjct: 176 LSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGMLLARITMGCP 235

Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
            ++  +F  L+  +++++Y+ V  + L +LN  R S+A++ +++T  V S ++ ++ E++
Sbjct: 236 LAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLSPQQVSMMEHV 295

Query: 371 ---FSFPW-LRDKPVV-----LGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVY 421
              ++  W LR   ++     LG+R             +  FF +    V  +  KG  Y
Sbjct: 296 LPAWASSWSLRSIKLIHTRVRLGARISSLHHTE-----MXAFFNQLMNFVKSSGYKGAKY 350

Query: 422 ALLKDQ----------AKSDDILKAAFHAYVL 443
            L++ +          A ++DI ++  HA V+
Sbjct: 351 LLVERKETISVIVQKDATANDIFQSFVHALVM 382


>gi|321459639|gb|EFX70690.1| hypothetical protein DAPPUDRAFT_60988 [Daphnia pulex]
          Length = 264

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 88/267 (32%), Positives = 142/267 (53%), Gaps = 25/267 (9%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           + S  +P G+P SV+  Y+ Y  W  ++ F     G  +T+ +L  VGV  + +TP A  
Sbjct: 1   LSSVFLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAAT 60

Query: 191 INWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVEL----ATAAVP-- 243
           + W++KDG G VG+++FA   G K D D K+ RF  DLL +    +EL    A AAV   
Sbjct: 61  LMWLIKDGTGMVGRILFAWCNGTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGGS 120

Query: 244 -HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLG 299
                 + C A VAK++  V   +TR  + +  A+  N+ DV+AK    E + N+A L+ 
Sbjct: 121 NGTMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALI- 179

Query: 300 TGLSIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
           T L ++      + +T   F L +  +VF++ + VK+V + TLNRAR+ + ++ F  T +
Sbjct: 180 TSLWLLPILDGATGLTWLMFFLFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTKY 239

Query: 358 VPSLREGNLQENIFSFPWLRDKPVVLG 384
           V S++E N           R +PV++G
Sbjct: 240 VASVKEVN-----------RLEPVIMG 255


>gi|195115010|ref|XP_002002060.1| GI17175 [Drosophila mojavensis]
 gi|193912635|gb|EDW11502.1| GI17175 [Drosophila mojavensis]
          Length = 417

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 76/272 (27%), Positives = 135/272 (49%), Gaps = 11/272 (4%)

Query: 108 KRNSIRAISLQSPQAPAEE---LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGA 164
           +RN +R + L+  +   +E   L   +++  +P G+PDSV+  Y  Y  W  ++ F    
Sbjct: 24  QRNIVR-VPLRGDKLVIQEKFLLFRVLQNIFLPKGYPDSVSDDYAAYQIWDTVQAFCSTI 82

Query: 165 MGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRF 223
            G   T  +L  +GV        +  + WILK+G+G +G++IFA  QG + D D K+ R 
Sbjct: 83  CGTLCTHAILKGIGVGSENVNAYSATVTWILKEGSGHMGRIIFAWWQGSQLDVDSKKWRL 142

Query: 224 AGDLLMELGAGVEL-ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
             D L +L  G+E+      PH    + C + + K +  V   +TR  + +  A   N+ 
Sbjct: 143 RADFLNDLAMGIEIYVLPKYPHFSTQILCGSTLLKAIVGVAGGATRAALTQHHAVRGNLA 202

Query: 283 DVTAKGE----CVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLH 338
           DV +K      CV  IA  +G  L  MI K    L T F ++   +++++ + V++V L 
Sbjct: 203 DVASKDSSQETCVNLIASFMGLYLLTMI-KSQAMLYTVFYIVVTLHLYANLKAVRAVCLR 261

Query: 339 TLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           T N +R+ + +E F ++  + + ++ N  E +
Sbjct: 262 TFNESRYLITLEEFFRSSQMLTPQQVNKLERV 293


>gi|449018513|dbj|BAM81915.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 600

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 104/327 (31%), Positives = 156/327 (47%), Gaps = 40/327 (12%)

Query: 56  AATLLQSESDETTAF---RLVEDQFVDSSLRLICSE-DIDGRRWNYVAERDRSGKFKRNS 111
           AA+ L   ++ T A    R++  Q+  +S R I +  D +G+      +R  SG   R+ 
Sbjct: 86  AASGLSERTENTDALPEKRVLVQQWSGTSRRRIETYCDYEGK---LSVQRQISGPQVRHG 142

Query: 112 IRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQ 171
           +        Q   E L   + S  +P  +P SVT SY PY  W  L +       V  T 
Sbjct: 143 L--------QFSTEPLQRLLASVFLPTNYPHSVTASYAPYAFWHFLHNTLTACNSVLGTT 194

Query: 172 TLLNSVGVSRNRATPGAVAI-------------------NWILKDGAGRVGKMIF-ARQG 211
            LL S GV  + +   + A+                   +WILKDG G + ++ F +R  
Sbjct: 195 ALLVSAGVVADPSRAASTALTRDPALEHLPTAASASAVTSWILKDGVGHLVRLAFGSRYA 254

Query: 212 KKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPI 271
            +FD DLK+LR   D+L  LG G+EL + + P  F+ LA  A   K +A V   STR+ I
Sbjct: 255 HRFDGDLKRLRVLADVLWHLGTGLELVSRSFPDSFIGLASGAYALKGMAHVVFNSTRSTI 314

Query: 272 YKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPSLITTFALLS---CGYVF 326
           Y+  A   NIGDVTAKG+     A+LLG   GLS+       +L  T+A+      G ++
Sbjct: 315 YRTVATRSNIGDVTAKGDAQSIAAELLGMSIGLSLYRGFAENNLFWTWAIFGIVVAGQMY 374

Query: 327 SSYQEVKSVVLHTLNRARFSVAVESFL 353
           S  Q ++ +VL  LN  R  + V++++
Sbjct: 375 SRIQSMRVLVLPLLNLQRAVLLVDAYV 401


>gi|195437813|ref|XP_002066834.1| GK24688 [Drosophila willistoni]
 gi|194162919|gb|EDW77820.1| GK24688 [Drosophila willistoni]
          Length = 417

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 90/328 (27%), Positives = 160/328 (48%), Gaps = 14/328 (4%)

Query: 125 EELMSF--IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
           E+ M F  ++   +P G+PDSV+  Y  Y  +  ++ F     G   T  +L S+GV   
Sbjct: 41  EKFMLFRVLQKIFLPKGYPDSVSDDYAAYQIYDTIQAFCSTICGTLCTHAILKSIGVGSE 100

Query: 183 RATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATA 240
                +  + WILK+G+G VG+++FA  QG + D D K+ R   D L +L  G+E+    
Sbjct: 101 NINAYSATVTWILKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLP 160

Query: 241 AVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
             PHL   + C + + K V +V   +TR+ + +  A   N+ DV +K        +LL +
Sbjct: 161 KYPHLSTQILCGSTMLKAVVSVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLLAS 220

Query: 301 --GLSIMISKRNPS-LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
             GL ++   ++P+ L T F L+   +++++ + V++V L T N +R+ +A+E F ++  
Sbjct: 221 CVGLYVLTLIKSPAVLYTVFYLVVTLHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSR 280

Query: 358 VPSLREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQD----PDAYLAVEPFFEKERYVVT 412
           + + ++ N  E +     +     + LG   K+   +     D    V  F   ER+++ 
Sbjct: 281 MLTPQQVNGMERVTLGQTVSVSLNIRLGLSVKNLVDEYKSSSDIENIVSSFDTHERFIIA 340

Query: 413 YNPSKGKVYALLKDQAKSDDILKAAFHA 440
              SK  +   L    +  D+LKA F A
Sbjct: 341 --ESKKYLGVYLHFDTRPQDVLKAYFFA 366


>gi|357506015|ref|XP_003623296.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
 gi|355498311|gb|AES79514.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
          Length = 492

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 114/409 (27%), Positives = 177/409 (43%), Gaps = 36/409 (8%)

Query: 51  GGGGAAATLLQSESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRN 110
           GG G +  +L       TA R +     DS LR I            + E DRS +    
Sbjct: 46  GGEGLSRVILVERYSNGTAKRYIIGD--DSRLRTI------------LIEEDRSTQN--- 88

Query: 111 SIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT 170
             R   L SP      L   ++++++P GFP SV+  Y+ YM  +   +  G       T
Sbjct: 89  --RFGVLHSPDKRLSWLPDTVKAFILPAGFPGSVSDDYLQYMLLQFPTNVTGWICHTIVT 146

Query: 171 QTLLNSVGVSRNRATPGAVA---INWILKDGAGRVGKM-IFARQGKKFDYDLKQLRFAGD 226
            +LL +VGV     T  A +   I W+ KDG G VG++ I  R G  FD D KQ R   D
Sbjct: 147 SSLLKAVGVGSFSGTTAAASAAAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYAD 206

Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
            +   G+  +L T   P  FLPLA   N+ K +A      +   I   FA   N+G++ A
Sbjct: 207 FIGSAGSIFDLTTPLYPGYFLPLASLGNLTKAIARGLKDPSSRVIQSHFAISANLGEIAA 266

Query: 287 KGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGY-------VFSSYQEVKSVVLHT 339
           K E    +A L+G GL I+I    P L+ ++ +LS  +       ++  Y+ +  +  +T
Sbjct: 267 KEEVWEVVAQLVGLGLGILILD-TPGLVKSYTVLSSTWLIVRLLHLWLRYESLSVLQFNT 325

Query: 340 LNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLA 399
           +N  R    V+S +    VP   + N +ENI  +       ++ GS          ++  
Sbjct: 326 INLKRARTLVKSHVLHSTVPGCMDCNKEENILVWSQFMKPIIIFGSPLDKMDGMERSHFM 385

Query: 400 VE---PFFEKERYVVTYNPSKG--KVYALLKDQAKSDDILKAAFHAYVL 443
           VE     +  E+Y++  N      K Y   K  A S  +L++ +  + L
Sbjct: 386 VEALLKLYANEKYILVVNQQLDDLKFYVSFKVGATSVSVLRSIWQTFWL 434


>gi|330802843|ref|XP_003289422.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
 gi|325080505|gb|EGC34058.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
          Length = 552

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 81/297 (27%), Positives = 148/297 (49%), Gaps = 20/297 (6%)

Query: 93  RRWNYVAERDRSGK----------FKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPD 142
             +NY+ E+++S            +     R I L++ +  +  L + +    +P G+PD
Sbjct: 9   NNYNYIIEKNQSNSVINKFQVVNPYDDYDDRLILLETKRTKSS-LQNILYDLFLPSGYPD 67

Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRV 202
           SVT  Y  Y  W +++       G   T+ +L   GV  + AT  +    W+++   G +
Sbjct: 68  SVTSDYFRYQFWDSIQALCSSITGTLATRAILKGYGVGNSSATVASATTQWVIQSATGMI 127

Query: 203 GKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVA 260
           G++IFA   G   D + K+ RF  D+L ++G   E+ +      LFLPL+C    AK++ 
Sbjct: 128 GRIIFAWANGTDLDCNSKKWRFMADILNDIGMTFEMISPLFNEALFLPLSCIGLFAKSIC 187

Query: 261 AVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS---KRNPSLITT- 316
            V    T+  + + FAK +N+ DV+AK         LLG  LS+ +S     + S++ T 
Sbjct: 188 GVAGGCTKASLTQHFAKNDNLADVSAKDGSQETAVSLLGMILSVFVSSVLNDDTSILVTW 247

Query: 317 --FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT-GHVPSLREGNLQENI 370
             F + +  +++ +Y+ V +V L+++N+ R  +  E +++  G +PS +E +  ENI
Sbjct: 248 FIFFIFTGFHLYCNYKAVSAVQLNSINKYRAHLIYEYYIRNQGSIPSPKEISNFENI 304


>gi|340372549|ref|XP_003384806.1| PREDICTED: UPF0420 protein C16orf58-like [Amphimedon queenslandica]
          Length = 432

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 92/353 (26%), Positives = 164/353 (46%), Gaps = 17/353 (4%)

Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
           + +A    L   ++   +P G+P+SV+  Y+ Y  W  L+ F     G   T  +L  VG
Sbjct: 29  TSKARNTRLADTLKEVFLPQGYPNSVSEDYLQYQLWDTLQAFCSYITGTLATHAMLKGVG 88

Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
           V  + A+P A  + WILKDG G  G+++FA  +G   D + K+ R   D+L ++   ++L
Sbjct: 89  VGDSSASPVAATLTWILKDGLGMTGRILFAWLRGYDLDCNAKKWRLIADILNDIAICMQL 148

Query: 238 ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL 297
            +   P  FL +AC A++ ++V  V   +TR  + +  A+ +N+ DV+AK      + +L
Sbjct: 149 VSPFFPSCFLLIACLASITQSVVGVAGGATRAALVQHQARRDNMADVSAKDGSQETLVNL 208

Query: 298 LGTGLSIMISKRNP-------SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVE 350
            G  + ++I+           SL  +F LL   +++S+Y+ V  V + TLN  R  + + 
Sbjct: 209 CGLLIGLIITPLIAGQTVFVWSLFFSFTLL---HLYSNYKAVSVVSMETLNCNRLHLLMR 265

Query: 351 SFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF-QDPDAYLAVEPFFEKERY 409
           +    G +      N +E +  F   R   V  GS          D   ++  F + +++
Sbjct: 266 NLFLNGTISEPNIVNREEPLL-FRQERFFTVEYGSSLSSVLIHSSDYSRSILQFNKGQKF 324

Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHA----YVLLHCINSSNGHRSLAL 458
           ++  + +K ++       + S D LKA        +V   C   SN  +  AL
Sbjct: 325 IIKLSKTKRQIRVAFHCGSSSSDQLKAGLTVELIEFVCGGCYGDSNYLKDYAL 377


>gi|58392188|ref|XP_319167.2| AGAP010020-PA [Anopheles gambiae str. PEST]
 gi|55236239|gb|EAA13864.2| AGAP010020-PA [Anopheles gambiae str. PEST]
          Length = 417

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/250 (30%), Positives = 127/250 (50%), Gaps = 7/250 (2%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L  F +   +P G+PDSV+  Y+ Y  W  L+ F     G  TT  +L  VGV  + A P
Sbjct: 45  LKRFFQHLFLPVGYPDSVSSDYLSYQKWDTLQAFCSTISGTLTTHAILKGVGVGSDAANP 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
            +  I W+LKDG G +G+++FA  +G + D D K+ R   D+L ++   ++L      P 
Sbjct: 105 LSATITWVLKDGTGHLGRILFAWWKGSELDIDSKKWRIRADILNDVAMAIDLFVLPYYPK 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
               + CA    K +  V   +TR+ + +  A   N+ DV +K      CV  IA  +G 
Sbjct: 165 ASTYILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            L   + +    L   FA ++  +++++ + VK+V L T N AR+ +A+E + K+G + S
Sbjct: 225 FLLTYL-QNQKVLYGLFAFVTLIHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLS 283

Query: 361 LREGNLQENI 370
            ++ N  E +
Sbjct: 284 PQQVNKLERV 293


>gi|307111229|gb|EFN59464.1| hypothetical protein CHLNCDRAFT_138052 [Chlorella variabilis]
          Length = 657

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/213 (33%), Positives = 120/213 (56%), Gaps = 10/213 (4%)

Query: 171 QTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLM 229
           ++LL +VGV   R+ P A  INW+LKDG GR+G++  A R G+ FD DLK+ R+A  ++ 
Sbjct: 78  ESLLQAVGVGARRSLPAAATINWVLKDGLGRLGRLTVATRFGESFDSDLKRFRYATSIIY 137

Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
            +   +E  T   P  FL +A  ANV K++   T  +T+   +++F   EN+ D++AK +
Sbjct: 138 AVSLSLEFLTPLAPQHFLVMASLANVGKSIGLTTFIATQPAFHRSFCLRENLADISAKTQ 197

Query: 290 CVGNIADLLG----TGLSIMI----SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLN 341
               + D +G     GL+ +     + R    +  F LL+ G +++ Y E++S+ L TLN
Sbjct: 198 AQQMVMDNIGLAAAVGLTYLCRHTEAARRALPLVMFPLLAAGDLWAIYSELRSIHLRTLN 257

Query: 342 RARFSVAVESFLKTGHVPSLREGNLQENIFSFP 374
           + R  +  + +L+ G VPS R+ + +E  F  P
Sbjct: 258 KERAEIIAQHWLREGRVPSPRQVS-EEERFVLP 289


>gi|358335880|dbj|GAA28290.2| UPF0420 protein C16orf58 [Clonorchis sinensis]
          Length = 428

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 75/237 (31%), Positives = 126/237 (53%), Gaps = 7/237 (2%)

Query: 134 YVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINW 193
           Y  P G+P+SV+P Y+ Y  W  ++       G    Q++L  VGV   +AT    +++W
Sbjct: 18  YGEPSGYPNSVSPDYLEYQIWDTVQALASSVTGALAAQSVLIGVGVGDVKATILGASLSW 77

Query: 194 ILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA 252
           +LKDG+G +GK++F   QG   D D K+ R   D++ +    +ELA++  P +F  + C 
Sbjct: 78  MLKDGSGMIGKIVFVGWQGSNLDCDCKRWRLLADIINDCALFLELASSFFPSVFTFIICL 137

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKR 309
           AN+ K+V +V   +TR  +    A   N+ DV AK    E + N+  L      +     
Sbjct: 138 ANLCKSVVSVAGGATRAAVTLHQALDNNVADVAAKDGSQETLSNLLALFFNLSVVYFVTG 197

Query: 310 NPSLITT-FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG--HVPSLRE 363
           N S+I T F +L+  +V+++Y+ V+ + L   NR RF +AV+ + +    H PS+ +
Sbjct: 198 NWSIIWTGFVILTFIHVYANYRAVQCLRLSNFNRNRFHIAVQQWFRQRLFHDPSVSD 254


>gi|449461779|ref|XP_004148619.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
 gi|449518467|ref|XP_004166263.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 495

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 93/332 (28%), Positives = 154/332 (46%), Gaps = 16/332 (4%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGV---SRNR 183
           L   I+ +++P GFP+SV+  Y+ YM  +   +  G       T +LL +VG+   S   
Sbjct: 110 LPGLIKDFILPTGFPESVSDDYLQYMIRQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTT 169

Query: 184 ATPGAVAINWILKDGAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
               AVAI W+ KDG G VG++ I  R G  FD D KQ R   D +   G+  +LAT   
Sbjct: 170 TAASAVAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLATPLY 229

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
           P  FLPLA   N+ K VA      +   I   FA   N+G++ AK E    +A LLG  +
Sbjct: 230 PSYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEIAAKEEVWEVVAQLLGLAI 289

Query: 303 SIMISKRNPSLITTFALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
            I+I    P L+ ++++LS  +       ++  YQ +  +  +T+N  R  +   + +  
Sbjct: 290 GILILD-TPGLVNSYSVLSATWLSMRLLHLWLRYQSLAVLHFNTINLKRARILARAHVLH 348

Query: 356 GHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF--QDPDAYLAVEPFFEKERYVVTY 413
             VP   E N +E+I  +       ++ G   ++    +     + +   +  E+Y++  
Sbjct: 349 NKVPGTVECNSEESILVWERFTRPSIIFGVPLEEMMGSERSSTVMKLLKLYANEKYILML 408

Query: 414 NPSKG--KVYALLKDQAKSDDILKAAFHAYVL 443
           +      KV+   K  A S  +L++ +  Y L
Sbjct: 409 DSQDKDLKVFVSFKVGASSMTVLRSIWQTYWL 440


>gi|387019749|gb|AFJ51992.1| UPF0420 protein C16orf58-like [Crotalus adamanteus]
          Length = 437

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 96/323 (29%), Positives = 150/323 (46%), Gaps = 38/323 (11%)

Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
           +S  +P G+P+SV+  Y+ Y  W  ++ F     G   TQ +L  VGV    +T  A A+
Sbjct: 50  KSIFLPQGYPESVSTDYLAYQFWDTIQAFASSITGTLATQAVLKGVGVGDETSTVAAAAV 109

Query: 192 NWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 251
            WILKDG G +G++ FA     F           D+L ++   +E+   A P  F  + C
Sbjct: 110 TWILKDGTGMLGRIAFAWSKGLF----------ADVLNDVAIFMEIMAPAFPACFTLIVC 159

Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISK 308
            +   K +  V   +TR  +    A+ +N+ DV+AK    E + N+A LL +   I +  
Sbjct: 160 TSGFFKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLFSLFLIPLVV 219

Query: 309 RNPSLITTF-ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
            N  L   F AL +  +++++YQ V++V + TLNRAR  + ++ +LK   V      N Q
Sbjct: 220 DNLPLTYAFYALFTILHLYANYQAVRAVCMETLNRARLHLVLQHYLKREEVLGPAVINPQ 279

Query: 368 ENIFSFPWLRDKPVVLGSR--FKDAFQDP---------DAYLAVEPFFEKERYVVTYNPS 416
           E           P++LG R   K     P         D   A+E       Y++ +N  
Sbjct: 280 E-----------PLLLGFRQQLKITLGTPLHTVTSSVADFQKALEG--NASNYLIFFNRP 326

Query: 417 KGKVYALLKDQAKSDDILKAAFH 439
            G +  LL  QA S D++KA  H
Sbjct: 327 AGVISILLHRQADSVDVIKAYTH 349


>gi|397615116|gb|EJK63231.1| hypothetical protein THAOC_16128 [Thalassiosira oceanica]
          Length = 621

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 93/330 (28%), Positives = 149/330 (45%), Gaps = 59/330 (17%)

Query: 141 PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV-GVSRNRATPGAVA--------- 190
           P+ VTPSY  +M WR L+ F    + V  TQ+LL  + G+ R+     A           
Sbjct: 157 PEGVTPSYYRFMRWRILQRFMNANVHVIGTQSLLMGLRGMQRSTVVSAAAKGGAALGAVA 216

Query: 191 -INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
             NW+LKD  G++ +M +A + G+KFD D K+ R+    +  LG G+E++T   P  FL 
Sbjct: 217 ATNWVLKDTLGKLVRMAWASKMGRKFDSDAKRWRYRACFIYALGNGLEVSTYLHPRFFLV 276

Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKG----------------------------EN 280
           LA  AN  K V+ +TS++TR  +Y +F +                             EN
Sbjct: 277 LAMLANSCKQVSMLTSSATRNALYNSFRRMDDDVDTVGTGSNSTLKSSINGGRRRGVVEN 336

Query: 281 IGDVTAKGECVGNIADLLGTGLSIMISK----RNPSLITTFALLSCGYVFSSYQEVKSVV 336
           IGD+TAKGE    + DLLG    I +S+       +++T +++L  G ++  Y E++S+V
Sbjct: 337 IGDITAKGEAQIAVVDLLGIASGICLSRAIGVSVQNVLTVWSVLQLGEIYCMYHEIRSIV 396

Query: 337 LHTLNRARFSVAVESFLKT--------------GHVPSLREGNLQENIFSFP-WLRDKPV 381
             TLN  R    +   L                  +P+  +   +E +F  P  L  K +
Sbjct: 397 YQTLNFERMYTVLGRLLPVDDESGSANGESCCCDRIPTPDQIAAEEKVFLPPDHLGRKEI 456

Query: 382 VLGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
             GS  +      +  + +   F  ER+ +
Sbjct: 457 AFGSAGRSVLSPGELDVLIRDVFRGERFFL 486


>gi|392564112|gb|EIW57290.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 516

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 113/387 (29%), Positives = 167/387 (43%), Gaps = 27/387 (6%)

Query: 79  DSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQ-----APAEELMSFIRS 133
           D   R +  E + GR  +     +      R   R I+   P+     AP   L   I S
Sbjct: 50  DLKSRFVAIERVGGRECHISWSEEAGSSGVRKEWRDIAAVDPEDGKIAAPTNSLTGRISS 109

Query: 134 Y----VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           +    V+P  +P SV  SY+P+   +  +        V   Q LL SVGVS   +  GAV
Sbjct: 110 WFGQMVLPTNYPHSVHRSYLPFHILQFFETTVATITSVLCNQALLTSVGVSAEGSVFGAV 169

Query: 190 AINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELAT---AAVPHL 245
           A+ WI+KDGAG V K+ F R+    FD   K   F G+L+  LG+G+++AT   A  P  
Sbjct: 170 AVQWIIKDGAGEVAKLFFIRRFSPYFDSHPKTFTFFGELMGCLGSGLQIATLLIAPSPGN 229

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLS 303
           FL  A   N+ K V      +T     + F++  N GDV AK E   +IA L+G  +G+S
Sbjct: 230 FLLCAAGGNIFKIVGNAIWFTTHIKFMRYFSEQGNTGDVAAKDESQSSIAQLVGYASGIS 289

Query: 304 IM-ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
           ++ +S   P L   FA     ++  +   ++      L   R S   + ++  G V SL 
Sbjct: 290 LLAVSHTAPYLYAIFAFAVPVHLAMTAYMMRVATFELLTLPRVSCLAQEYVTRGQVSSLG 349

Query: 363 EGNLQENIFSF-PWLRDKP---VVLGSRFKDAFQDPDAYLAVE-----PFFEKERYVVTY 413
           E +       F  + RDK    + L  R  D          V        FE +RY++  
Sbjct: 350 ELDAAHRTGLFGEFYRDKSDRWLTLAPRVGDVLDSGTELERVRWQICAGVFEDDRYLLF- 408

Query: 414 NPSKGKVYALLKDQAKSDDILKAAFHA 440
            P    +       A +DD+L A +HA
Sbjct: 409 -PRGRNISVFFHPDAANDDMLCAIYHA 434


>gi|345316269|ref|XP_001510147.2| PREDICTED: UPF0420 protein C16orf58 homolog, partial
           [Ornithorhynchus anatinus]
          Length = 353

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 90/315 (28%), Positives = 152/315 (48%), Gaps = 13/315 (4%)

Query: 133 SYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAIN 192
           S  +P GFPDSV+P Y+ Y  W  ++ F     G   TQ +L  VGV     T  +    
Sbjct: 1   SVFLPRGFPDSVSPDYLSYQLWDTVQAFASSLTGSLATQAVLRGVGVGDGTMTVDSPTGA 60

Query: 193 WILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 251
            ++K G G +G++ FA ++G K D D K+ R   D+L +    +E+   A P  F    C
Sbjct: 61  LLIKHGTGMLGRIGFAWKKGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPAYFTLTVC 120

Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG----TGLSIMIS 307
            +++AK +  V   +TR  +    A+  N+ DV+AK      + +L G      L  ++S
Sbjct: 121 VSSLAKCIVGVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLMLLPLVS 180

Query: 308 KRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
            R    +  F L +  +++++++ V++VV+ TLN +R  + +E FL+ G V SL   N +
Sbjct: 181 DRPSLCLPLFLLFTSLHLYANFRAVRAVVMETLNESRLRLVLEHFLREGQVLSLASANEK 240

Query: 368 ENIFSFPWLRDKP---VVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
           E +    W    P   + LG        D  + L +     ++ Y++ ++ ++G V   L
Sbjct: 241 EPL----WTDVGPALTLALGVPLHSLVSDV-SDLQLMKAGHQQAYLLHWDRARGHVQVAL 295

Query: 425 KDQAKSDDILKAAFH 439
              A    IL+AA H
Sbjct: 296 AQTAGPRAILRAATH 310


>gi|194758785|ref|XP_001961639.1| GF15067 [Drosophila ananassae]
 gi|190615336|gb|EDV30860.1| GF15067 [Drosophila ananassae]
          Length = 417

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/250 (29%), Positives = 123/250 (49%), Gaps = 7/250 (2%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   ++   +P G+PDSV+  Y  Y  W   + F     G   T  +L  +GV       
Sbjct: 45  LFRVLQKIFLPKGYPDSVSDDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
            +  + WILK+G+G VG++ FA  QG + D D K+ R   D L +L  G+E+      PH
Sbjct: 105 YSATVTWILKEGSGHVGRIFFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
           L   + C + + K +  V   +TR  + +  A   N+ DV +K      CV  +A  +G 
Sbjct: 165 LSTQILCGSTLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLVASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            L  +I K    L T F ++   +++S+ + V++V L T N +R+ +A+E F ++  + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYSNLKAVRAVCLRTFNESRYLIALEEFFRSSRMLS 283

Query: 361 LREGNLQENI 370
            ++ N  E +
Sbjct: 284 PQQVNAMERV 293


>gi|348675633|gb|EGZ15451.1| hypothetical protein PHYSODRAFT_546234 [Phytophthora sojae]
          Length = 408

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 86/258 (33%), Positives = 131/258 (50%), Gaps = 14/258 (5%)

Query: 95  WNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTW 154
           W + AE  RS           SL + +     L   +    +P     S+T  Y PY  W
Sbjct: 83  WTHKAEGLRSAP--------TSLAAARRTGSWLYDVLWDLFLPKDARASLTKDYFPYAKW 134

Query: 155 RALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKK 213
             +      A  V + Q+LL ++G+    A P A A+NW+LKDG G+ G ++FA     +
Sbjct: 135 SFVGSVASCAASVLSMQSLLYAIGLGAG-AIPTAAAVNWVLKDGLGQFGGVLFASIVNNR 193

Query: 214 FDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYK 273
           +D D K+ R A  + +++    E+ T   P  FL +A  ANVAKNVA +++++TR   + 
Sbjct: 194 YDADPKRWRVASSVALDVSVLGEILTPLAPGSFLAIASLANVAKNVAWLSASATRAGFHN 253

Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS----LITTFALLSCGYVFSSY 329
           +FA  EN+ DVTAK       + + GTGL I+IS+   S    +   FA+LS  ++ S+Y
Sbjct: 254 SFAIRENLADVTAKAGSQAIASSIFGTGLGILISQFTGSSTLNVAAAFAVLSTVHLISTY 313

Query: 330 QEVKSVVLHTLNRARFSV 347
           + V  V L TLN  R  +
Sbjct: 314 KSVDCVQLRTLNCQRLHI 331


>gi|449542984|gb|EMD33961.1| hypothetical protein CERSUDRAFT_117486 [Ceriporiopsis subvermispora
           B]
          Length = 505

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 105/369 (28%), Positives = 166/369 (44%), Gaps = 34/369 (9%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L ++ R   +P  +P SV  SY+P+   + ++   G  + V   Q LL SVGVS   +  
Sbjct: 110 LSAWFRQMFLPTNYPQSVHRSYMPFHVLQFVEGTLGTLVSVLCNQALLTSVGVSAEGSIF 169

Query: 187 GAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV--- 242
           GAVA+ WI+KDGAG V K+ F R+    FD   K    +G++L+ LG+G+++AT  V   
Sbjct: 170 GAVAVQWIIKDGAGEVAKLFFIRKFSPYFDSHPKTFTLSGEVLVALGSGLQMATLLVNPT 229

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--T 300
           P  FL  A   N  K +      +T     + F++  N GDV AK E   ++A L G   
Sbjct: 230 PLSFLICAAGGNAFKLIGYAIWFTTHIKFIRYFSQQGNTGDVAAKDESQASVAQLAGYAA 289

Query: 301 GLSIMISKRNPS-LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
           G+ ++    +PS L T FAL    ++ ++   ++      L   R S+  E++ + G V 
Sbjct: 290 GIGLLTLSHDPSYLYTIFALAVPTHLVATTLMMRCATFELLTIPRLSLLAETYSRKGDVM 349

Query: 360 SLR--EGNLQENIFS--FPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNP 415
            L   E      +F   +   + K V L  R  +  +D +         E+ R+ +    
Sbjct: 350 PLSRLEAKRATGMFGEFYGKGQGKLVSLAPRVAEVVKDEEG--------ERARWALCTEV 401

Query: 416 SKGKVYAL--------------LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ- 460
            + + Y L              L+  A SDD+L+A  HA  L   +         ALK  
Sbjct: 402 CRDERYLLYPPAAPQANPASVFLRPDASSDDMLRAVLHAARLRSLLADDIVTLREALKDS 461

Query: 461 HEYDNSNLM 469
           HE+   + M
Sbjct: 462 HEWTQEHFM 470


>gi|125986275|ref|XP_001356901.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
 gi|195148693|ref|XP_002015302.1| GL19630 [Drosophila persimilis]
 gi|54645227|gb|EAL33967.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
 gi|194107255|gb|EDW29298.1| GL19630 [Drosophila persimilis]
          Length = 417

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 85/325 (26%), Positives = 153/325 (47%), Gaps = 14/325 (4%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   ++   +P G+PDSV+  Y  Y  W  ++ F     G   T  +L  +GV       
Sbjct: 45  LFRVLQKIFLPKGYPDSVSEDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENINA 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATA-AVPH 244
            +  + WI+K+G G VG+++FA  +G + D D K+ R   D L +L  G+E+      PH
Sbjct: 105 YSATVTWIMKEGCGHVGRILFAWWKGCQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
           L   + C + + K +  V   +TR+ + +  A   N+ DV +K      CV  +A  +G 
Sbjct: 165 LSTQILCGSTMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            L  +I K    L T F ++   +++++ + V+SV L T N +R+ +A+E + ++  + +
Sbjct: 225 YLLTLI-KSQAVLYTVFYVVVALHLYANLKAVRSVCLRTFNESRYLIALEEYFRSSRMLT 283

Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
            ++ N  E +     +     V LG   K+   +     ++E     F   E +++  + 
Sbjct: 284 PQQVNAMERVTLGQTVSVSLNVRLGLSVKNLIDEYKTSSSIENIVSSFDPHEHFIIAESK 343

Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
               VY  L  + +  D+LKA F A
Sbjct: 344 KYLGVY--LHFETRQQDVLKAYFFA 366


>gi|320170693|gb|EFW47592.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 710

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 76/214 (35%), Positives = 123/214 (57%), Gaps = 6/214 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SVTP+Y+ Y  W+A++   G   GV +TQ LL +VG+    A P A A+NWI+
Sbjct: 162 LPKGYPHSVTPNYMGYSRWQAVQSVTGTMTGVLSTQALLYAVGLGAG-AIPLAGALNWIV 220

Query: 196 KDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KDG G++G ++++     KFD D K+ RF  +  ++    +E+ T   P +FL LA  +N
Sbjct: 221 KDGLGQLGGVVYSTFISSKFDSDPKRHRFWSNAALQASTLLEILTPLAPGMFLFLASVSN 280

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK---RNP 311
           + KN++ + ++STR  ++ +    +N+GDVT K         L+GTGL I I+     +P
Sbjct: 281 IGKNISWLAASSTRAQMHNSLTLRDNLGDVTGKAGSQAIATSLIGTGLGIAIAPFVGTDP 340

Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
            +++  F  LS   + S+Y+    V + TLN  R
Sbjct: 341 IAVLLAFLPLSIVNMVSNYRSNTIVHMRTLNVQR 374


>gi|312082381|ref|XP_003143421.1| hypothetical protein LOAG_07840 [Loa loa]
 gi|307761414|gb|EFO20648.1| hypothetical protein LOAG_07840 [Loa loa]
          Length = 393

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 86/321 (26%), Positives = 153/321 (47%), Gaps = 11/321 (3%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGA 188
           S  R   +P G+P SV+P Y+ Y  W  ++ F         T+ +L  VGV    A+  A
Sbjct: 41  SLFRDIFMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALATEAILRGVGVGNETASTMA 100

Query: 189 VAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
            A+ W+L+DG G + +++FA       D D KQ R   D   +L   ++L T   P+LF+
Sbjct: 101 AAVAWLLRDGIGMLTRILFAWFYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPNLFM 160

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSI 304
           P+ C +++ + V  V   +TRT +    A  +N+GDV AK    E + NI  L  + L +
Sbjct: 161 PIVCLSSMVRGVVGVAGGATRTAVVNHQAILDNVGDVAAKDGSQETLINIFALFCSLLLL 220

Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
            I   N  +   F + +  +++ +Y+ VKS+   TLN++   + V+++ +T  + ++ + 
Sbjct: 221 PIVNGNICVWLLFCIFTFIHLYGNYRAVKSLQFRTLNQSLLQIIVKNYAETKKIDTVSKV 280

Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
           N +E++    W   +    G R  D     D     E  F   ++ V  +      Y  +
Sbjct: 281 NNKESML-LHWNSSRRYY-GCRLSDVVTSSD-----EFSFVCSKFTVICDLRSNYGYVSM 333

Query: 425 KDQAKSDDILKAAFHAYVLLH 445
              + + D L+AA    ++L+
Sbjct: 334 ASISDTSDQLRAALCLELMLN 354


>gi|157113953|ref|XP_001652155.1| hypothetical protein AaeL_AAEL006646 [Aedes aegypti]
 gi|108877519|gb|EAT41744.1| AAEL006646-PA [Aedes aegypti]
          Length = 417

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 76/247 (30%), Positives = 124/247 (50%), Gaps = 7/247 (2%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           F ++  +P G+PDSV+  Y+ Y  W  ++ F     G  TT  +L  VGV  + A P + 
Sbjct: 48  FFQNLFLPAGYPDSVSGDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPLSA 107

Query: 190 AINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPHLFL 247
            I W+LKDG G  G++IFA  +G + D D K+ R   D+L +L   ++L      P    
Sbjct: 108 TITWVLKDGMGHFGRIIFAWWKGTELDIDSKKWRIRADVLNDLAMSIDLFVLPYYPKAST 167

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLS 303
            + CA    K +  V   +TR+ + +  A   N+ DV +K      CV  IA  +G  L 
Sbjct: 168 FILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLFLL 227

Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
             +  +   L   F  ++  +++++ + VK+V L T N AR+ +A+E + KTG + S  +
Sbjct: 228 THVQDQR-VLYGLFLFVTLMHIYANIKAVKAVCLRTFNEARYLIALEEYFKTGTMLSPEQ 286

Query: 364 GNLQENI 370
            N  E +
Sbjct: 287 VNKLERV 293


>gi|350646547|emb|CCD58759.1| hypothetical protein Smp_142110 [Schistosoma mansoni]
          Length = 423

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 76/243 (31%), Positives = 128/243 (52%), Gaps = 20/243 (8%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKH--FFGGAMGVFT-----------TQTLLNSV 177
           ++ + +P G+P SV+  Y+ Y  W  ++   FF   + +F+           +Q +L  V
Sbjct: 41  LKKFFLPVGYPSSVSDDYLEYQIWDTIQAGIFFITKISLFSAFASSITGALASQAVLIGV 100

Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVE 236
           GV  + AT  + ++ W+ KDG+G +G++IFA   G K D D K  RF  D+L +    +E
Sbjct: 101 GVGDSSATILSASLTWMFKDGSGMIGRIIFAGYHGIKLDCDCKFWRFVADILNDCALFLE 160

Query: 237 LATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGN 293
           + +    +LF PL C ANV K++  V  ++TR  I +  A   N+ DV+AK    E + N
Sbjct: 161 IISPLFNYLFTPLLCLANVLKSLVGVAGSATRAAIVQHQAINNNLADVSAKDGSQETLSN 220

Query: 294 IADLLGTGLSIMISKRNPSLITTFALLSCG--YVFSSYQEVKSVVLHTLNRARFSVAVES 351
           +   L   + + +   N  LI  F  + C   +++S+Y+ VK + L T NR RF +A++ 
Sbjct: 221 LMAWLLNFILLYMVTGNQFLI-WFCFICCTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQ 279

Query: 352 FLK 354
           + K
Sbjct: 280 WFK 282


>gi|356523306|ref|XP_003530281.1| PREDICTED: UPF0420 protein-like [Glycine max]
          Length = 480

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 102/374 (27%), Positives = 166/374 (44%), Gaps = 22/374 (5%)

Query: 97  YVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRA 156
           ++ E DRS        R   L S       L   I+ +V+P GFP SV+  Y+ YM  + 
Sbjct: 65  FLVEEDRS-----TPNRFQDLHSSDESLSWLPEIIKDFVLPAGFPGSVSDDYLDYMLLQF 119

Query: 157 LKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA---INWILKDGAGRVGKM-IFARQGK 212
             +  G       T +LL +VG+     T  A +   I W+ KDG G VG++ I  R G 
Sbjct: 120 PTNVTGWICHTLVTSSLLKAVGIGSFTGTTAAASAAAIRWVSKDGIGAVGRLFIGGRFGS 179

Query: 213 KFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIY 272
            FD D KQ R   D +   G+  +L T   P  FLPLA   N+ K VA      +   I 
Sbjct: 180 LFDDDPKQWRMYADFIGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVIQ 239

Query: 273 KAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGY-------V 325
             FA   N+G+V AK E    +A L+G  L I+I    P L+ ++ ++S  +       +
Sbjct: 240 NHFAISGNLGEVAAKEEVWEVVAQLVGLSLGILILD-TPGLVKSYGVISLTWLSMRLLHL 298

Query: 326 FSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGS 385
           +  Y+ +  +  +T+N  R  + V+S +    VP   + N +ENI ++       ++ G 
Sbjct: 299 WLRYESLSVLQFNTINIKRARILVKSHVLHSTVPGCTDCNREENILAWSQFMKPKIIFGL 358

Query: 386 RFKDAFQDPDAYLAVE---PFFEKERYVVTYNPSKG--KVYALLKDQAKSDDILKAAFHA 440
             +       +Y  VE     +  E+Y++  N      + Y   K  A +  +L++ + +
Sbjct: 359 PLEKMDGVERSYFMVEALIKLYASEKYILMVNQQTEDLRFYVSFKVGATNVSVLRSVWQS 418

Query: 441 YVLLHCINSSNGHR 454
           + L    +S +  R
Sbjct: 419 FWLSENWDSDDNVR 432


>gi|417401281|gb|JAA47531.1| Hypothetical protein [Desmodus rotundus]
          Length = 457

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 91/322 (28%), Positives = 159/322 (49%), Gaps = 20/322 (6%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           +S + +  +P GFPDSV+P Y+ Y  W +++ F     G   T  +L  +GV   +A+  
Sbjct: 70  LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A    WI+KD  G +G++IFA     F           D+L ++   +E+     P  F 
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGLF----------ADILNDVAMFLEIMAPIYPMCFT 179

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSI 304
              C +N+AK V +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L +
Sbjct: 180 VTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLML 239

Query: 305 MISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
            +    PS  +  F LL+  +V+++Y+ V+++V+ TLN  R  + ++ FL+ G V     
Sbjct: 240 PLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTS 299

Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVY 421
            N  E +++  W     + LG+          +   ++   E  +E Y++ ++ S+ +V 
Sbjct: 300 ANQMEPLWTGFW-PSLSLSLGAPLHRLI---SSVFELQQLVEGHREPYLLHWDQSQNRVQ 355

Query: 422 ALLKDQAKSDDILKAAFHAYVL 443
            +L   A  + IL+AA H  VL
Sbjct: 356 VVLSQMAGPETILRAATHGLVL 377


>gi|195344882|ref|XP_002039005.1| GM17287 [Drosophila sechellia]
 gi|194134135|gb|EDW55651.1| GM17287 [Drosophila sechellia]
          Length = 417

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 86/325 (26%), Positives = 151/325 (46%), Gaps = 14/325 (4%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   ++   +P G+PDSV+  Y  Y  W   + F     G   T  +L  +GV       
Sbjct: 45  LFRVLQKVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
            +    WILK+G+G VG+++FA  QG + D D K+ R   D L +L  G+E+      PH
Sbjct: 105 FSATATWILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
           L   + C + + K +  V   +TR+ + +  A   N+ DV +K      CV  +A  +G 
Sbjct: 165 LSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            L  +I K    L T F ++   +++++ + V++V L + N +R+ +A+E F ++  + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLS 283

Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
            ++ N  E +     +     + LG   K+   +  +   +E     F   E +++    
Sbjct: 284 PQQVNAMERVTVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTK 343

Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
               VY  L  + +  D+LKA F A
Sbjct: 344 KCLGVY--LHFETRPQDVLKAYFFA 366


>gi|417401214|gb|JAA47499.1| Hypothetical protein [Desmodus rotundus]
          Length = 453

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 91/322 (28%), Positives = 159/322 (49%), Gaps = 20/322 (6%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           +S + +  +P GFPDSV+P Y+ Y  W +++ F     G   T  +L  +GV   +A+  
Sbjct: 70  LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129

Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
           A    WI+KD  G +G++IFA     F           D+L ++   +E+     P  F 
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGLF----------ADILNDVAMFLEIMAPIYPMCFT 179

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSI 304
              C +N+AK V +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L +
Sbjct: 180 VTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLML 239

Query: 305 MISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
            +    PS  +  F LL+  +V+++Y+ V+++V+ TLN  R  + ++ FL+ G V     
Sbjct: 240 PLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTS 299

Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVY 421
            N  E +++  W     + LG+          +   ++   E  +E Y++ ++ S+ +V 
Sbjct: 300 ANQMEPLWTGFW-PSLSLSLGAPLHRLI---SSVFELQQLVEGHREPYLLHWDQSQNRVQ 355

Query: 422 ALLKDQAKSDDILKAAFHAYVL 443
            +L   A  + IL+AA H  VL
Sbjct: 356 VVLSQMAGPETILRAATHGLVL 377


>gi|328703155|ref|XP_001949028.2| PREDICTED: UPF0420 protein C16orf58 homolog [Acyrthosiphon pisum]
          Length = 457

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 105/415 (25%), Positives = 197/415 (47%), Gaps = 45/415 (10%)

Query: 107 FKRNSIRAISLQSPQAPAEELMSFIRSYV-------VPDGFPDSVTPSYVPYMTWRALKH 159
           +KR+  R + L+S +   ++   F RS V       +P GFP+SV+  Y  Y  W  ++ 
Sbjct: 23  YKRHDQRQLILESERR--KQSTGFTRSSVNILKEVFLPRGFPNSVSKDYTEYQIWDTIQA 80

Query: 160 FFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDL 218
           F      +  T  +L  VGV   +AT  A +I +ILKDGAG +G++ FA  +G + D   
Sbjct: 81  FCSTISNILATNAILKGVGVGDAQATALAASITYILKDGAGMIGRITFAWCKGSELDTQC 140

Query: 219 KQLRFAGDLLMELGAGVELATAA--VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFA 276
           K+ R   D L +L   +EL  +   +    L +   ++ AK++ +V   +TR  + +  A
Sbjct: 141 KKWRLRADALNDLAIFIELLLSIPWIRQFSLIILSFSSCAKSIVSVAGGATRAALTQHQA 200

Query: 277 KGENIGDVTAKG----ECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEV 332
             +N+GDV+AK      C+  IA L+G  +  ++  R   +   + +++  ++F++Y+ V
Sbjct: 201 IRDNMGDVSAKDGSQETCINLIAFLIGLIMLPIVENRILLIWMIYLVVTSLHLFANYKAV 260

Query: 333 KSVVLHTLNRARFSVAVESFL----------KTGHVPSLREGNL--QENIFSFPWLRDKP 380
           KS+ ++  N AR+ + ++ +L          K  ++   RE      E + SF       
Sbjct: 261 KSLNINVFNSARYDLTLKYYLSNDTQNHDVQKPDYINK-REACFLGDEKLCSF------K 313

Query: 381 VVLGSR-----FKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILK 435
           + LG+      + +     D    +E  ++   Y++  +  +  +  +L     +++ILK
Sbjct: 314 IQLGTSIHELLYTNTLTTWDIIDHIE-LYKDYLYILIVDTHRDIIRVVLDKNINTENILK 372

Query: 436 AAFHAYVLLHCINSSNGHRSL---ALKQHEYDNSNLMLTTADLEAHIGESCKVVS 487
           A FHA +L H I   N    +   +L+  +Y ++N        E ++  SC+ V+
Sbjct: 373 AYFHANILGHLICPKNKFSLIKLNSLRSMKYTSTNTQFCCTHSE-YVQLSCEFVN 426


>gi|224002080|ref|XP_002290712.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974134|gb|EED92464.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 656

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 163/363 (44%), Gaps = 67/363 (18%)

Query: 110 NSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFT 169
           N   A+S  S     ++   F+ S  VP+G    VTPSY  +M WR L+ F    + V  
Sbjct: 128 NDTDALSTNSGDTLVQKWTQFLHSAFVPEG----VTPSYYRFMRWRILQRFINANVHVIG 183

Query: 170 TQTLLNSV-GVSRNRATPGAVAI--------------NWILKDGAGRVGKMIFA-RQGKK 213
           TQ+LL  + G++R+ +   A +               NW+LKD  G++ +M++A + G+K
Sbjct: 184 TQSLLMGLRGMTRSGSVGEAASFAAKEGAALGAAAATNWVLKDTLGKIVRMVWASKMGRK 243

Query: 214 FDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYK 273
           FD D K+ RF   L+  LG G+E++T   P  FL LA  AN  K ++ +TS++TR  +Y 
Sbjct: 244 FDPDAKRWRFRSALIYALGNGLEVSTYLHPQYFLLLAMLANSCKQMSMLTSSATRNALYN 303

Query: 274 AF---------------AKGEN-------------IGDVTAKGECVGNIADLLGTGLSIM 305
           +F               A G +             IGD+TAKGE    + DLLG    I 
Sbjct: 304 SFKRVGTTSSTVSTSNNASGSSAIVRRGGNGGIENIGDITAKGEAQIAVVDLLGIASGIC 363

Query: 306 ISK----RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL-------K 354
           +S+       ++   + +L  G +F  Y E++S+V  T N  R  + +   +       K
Sbjct: 364 LSRAVGVSVQNVFAVWVVLQIGEIFCMYHEMRSIVYRTFNFERMYMVLGKLMLDDAADNK 423

Query: 355 TGH-------VPSLREGNLQENIFSFP-WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEK 406
            G        +PS  +    E +F  P  L  + +  GS  +      +  + V   F  
Sbjct: 424 GGDSSSTKNLIPSPEQIAANEKVFLPPDHLARRAIAFGSPGRSTLDPAELDVLVNDIFRG 483

Query: 407 ERY 409
           E+Y
Sbjct: 484 EKY 486


>gi|281211285|gb|EFA85450.1| DUF647 family protein [Polysphondylium pallidum PN500]
          Length = 427

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 62/165 (37%), Positives = 97/165 (58%), Gaps = 1/165 (0%)

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
           +E+L    ++  +P G+P+SV+  Y  Y  W  +++  G    + +T  LL SVGV  + 
Sbjct: 99  SEKLYETFKTLFLPSGYPESVSQDYASYQRWIFVQNTLGSVTYMLSTHALLTSVGVGLSA 158

Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           + P + AI+W+LKDG G    ++FA +     D+DLK+ +F GD L   G  +E  T  +
Sbjct: 159 SLPFSAAISWVLKDGLGASALVLFASKYSTSLDFDLKRFKFRGDFLHNFGVFLETCTPFL 218

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 287
           P  FL +A  +N+AK +A +   STR  ++K+FA  ENIGD+TAK
Sbjct: 219 PGYFLLMASISNLAKGIAGLIYGSTRASLHKSFALKENIGDITAK 263


>gi|409048331|gb|EKM57809.1| hypothetical protein PHACADRAFT_251666 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 532

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 110/405 (27%), Positives = 173/405 (42%), Gaps = 49/405 (12%)

Query: 108 KRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGV 167
           + +  +A++  S  +P   L +++R   +P  +P SV PSY+ +   + ++   G  +GV
Sbjct: 87  RNDDTKAVTASSRFSPLGRLTAWLRQMFLPTNYPQSVHPSYLKFHLLQFMECIVGTVVGV 146

Query: 168 FTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKM-IFARQGKKFDYDLKQLRFAGD 226
              Q LL SVGVS   +  GAVA+ W++KDGAG V K+ +  R    FD   K   F G 
Sbjct: 147 LCNQALLVSVGVSAEGSILGAVAVQWVIKDGAGEVAKLWVIRRFSPYFDSHPKTFTFTGA 206

Query: 227 LLMELGAGVELATAAVPHL---FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGD 283
            +  LG+G+++A   VP     FL  A   N+ K V      +T     + FA+  N+GD
Sbjct: 207 TMGLLGSGLQIAALVVPPTTLNFLLCAAGGNIFKCVGGAVWLTTHIKFVRFFARQGNMGD 266

Query: 284 VTAKGECVGNIADL----LGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHT 339
           V AK E  G++  L     G GL + +S   P L T FAL    ++  +   +++     
Sbjct: 267 VAAKSESQGSVGILSGYAAGVGL-LTVSHSAPYLYTIFALAIPIHLLITQWMLRAATFEL 325

Query: 340 LNRARFSVAVESFLKTG-------HVPSLREGNLQENIFSF-PWLRDK------------ 379
           L   R S     F            +P+L E   Q  I  F  + + K            
Sbjct: 326 LTLPRLSFLAREFASRASRPEAERELPTLGELERQRAIGVFGEYFKTKEDQYVQIAPDLD 385

Query: 380 ----PVVLGSRFK-----DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKS 430
               P  LG+R +     +AF D       +P+     Y      S+ K+  LL   A  
Sbjct: 386 AVVSPTDLGARTRWDACVEAFND-------QPYL---LYPYLMQQSECKIAVLLHPSATV 435

Query: 431 DDILKAAFH-AYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTAD 474
           DD +++  H A++  H  N      + +  Q   DN  ++    D
Sbjct: 436 DDRIQSVLHAAFLQRHLHNVMQIDPAESTPQELLDNDKVLKAALD 480


>gi|290999547|ref|XP_002682341.1| predicted protein [Naegleria gruberi]
 gi|284095968|gb|EFC49597.1| predicted protein [Naegleria gruberi]
          Length = 261

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 79/240 (32%), Positives = 125/240 (52%), Gaps = 18/240 (7%)

Query: 133 SYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG----- 187
           S  +P  +P SV+ +Y+ Y  W+AL++  G   GV T Q+LL++ G++   A  G     
Sbjct: 3   SVFLPKNYPLSVSSNYINYCKWQALQYTAGSFSGVLTMQSLLHAAGITSQLALGGNEYFL 62

Query: 188 ---AVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV- 242
                A+ W++KDG G++G ++FA +    FD D K  R  G+  +   A +E+ T    
Sbjct: 63  AFLGGALAWVIKDGLGQLGGILFASKVNTNFDADPKFWRMGGEYALVASALLEVTTPITG 122

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
           P  F+  A  AN+ KNV+   +++TR  + ++FAK +N+ DVTAK         L+GT L
Sbjct: 123 PSWFIAQASLANIGKNVSCFAASATRAAMNQSFAKSDNLADVTAKATSQALACSLIGTTL 182

Query: 303 SIMISKR--------NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
            I +S             +   F +LS   +FS Y+ V +V L  LN+ RF +A   +LK
Sbjct: 183 GIFVSSALSLGTQFVFAKVFPIFCVLSVIQLFSLYKAVSNVKLKVLNKQRFLIACSQYLK 242


>gi|24585018|ref|NP_609897.2| CG10338 [Drosophila melanogaster]
 gi|22946770|gb|AAF53692.2| CG10338 [Drosophila melanogaster]
 gi|384475974|gb|AFH89820.1| FI20169p1 [Drosophila melanogaster]
          Length = 417

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 85/325 (26%), Positives = 150/325 (46%), Gaps = 14/325 (4%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   ++   +P G+PDSV+  Y  Y  W   + F     G   T  +L  +GV       
Sbjct: 45  LFRVLQKVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
            +    WILK+G+G VG+++FA  QG + D D K+ R   D L +L  G+E+      PH
Sbjct: 105 FSATATWILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
               + C + + K +  V   +TR+ + +  A   N+ DV +K      CV  +A  +G 
Sbjct: 165 FSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            L  +I K    L T F ++   +++++ + V++V L + N +R+ +A+E F ++  + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLS 283

Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
            ++ N  E +     +     + LG   K+   +  +   +E     F   E +++    
Sbjct: 284 PQQVNAMERVTVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTK 343

Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
               VY  L  + +  D+LKA F A
Sbjct: 344 KCLGVY--LHFETRPQDVLKAYFFA 366


>gi|195579948|ref|XP_002079818.1| GD24150 [Drosophila simulans]
 gi|194191827|gb|EDX05403.1| GD24150 [Drosophila simulans]
          Length = 417

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 85/325 (26%), Positives = 150/325 (46%), Gaps = 14/325 (4%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   ++   +P G+PDSV+  Y  Y  W   + F     G   T  +L  +GV       
Sbjct: 45  LFRVLQKVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
            +    WILK+G+G VG+++FA  QG + D D K+ R   D L +L  G+E+      PH
Sbjct: 105 FSATATWILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
               + C + + K +  V   +TR+ + +  A   N+ DV +K      CV  +A  +G 
Sbjct: 165 FSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            L  +I K    L T F ++   +++++ + V++V L + N +R+ +A+E F ++  + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLS 283

Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
            ++ N  E +     +     + LG   K+   +  +   +E     F   E +++    
Sbjct: 284 PQQVNAMERVTVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTK 343

Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
               VY  L  + +  D+LKA F A
Sbjct: 344 KCLGVY--LHFETRPQDVLKAYFFA 366


>gi|195484210|ref|XP_002090597.1| GE13200 [Drosophila yakuba]
 gi|194176698|gb|EDW90309.1| GE13200 [Drosophila yakuba]
          Length = 417

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 70/250 (28%), Positives = 123/250 (49%), Gaps = 7/250 (2%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   ++   +P G+PDSV+  Y  Y  W   + F     G   T  +L  +GV       
Sbjct: 45  LFRVLQKVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
            +    WILK+G+G VG+++FA  QG + D D K+ R   D L +L  G+E+      PH
Sbjct: 105 FSATATWILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
               + C + + K +  V   +TR+ + +  A   N+ DV +K      CV  +A  +G 
Sbjct: 165 FSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            L  +I K    L T F ++   +++++ + V++V L + N +R+ +A+E F ++  + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLS 283

Query: 361 LREGNLQENI 370
            ++ N  E +
Sbjct: 284 PQQVNAMERV 293


>gi|194880075|ref|XP_001974360.1| GG21126 [Drosophila erecta]
 gi|190657547|gb|EDV54760.1| GG21126 [Drosophila erecta]
          Length = 417

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 70/250 (28%), Positives = 123/250 (49%), Gaps = 7/250 (2%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   ++   +P G+PDSV+  Y  Y  W   + F     G   T  +L  +GV       
Sbjct: 45  LFRVLQKVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
            +    WILK+G+G VG+++FA  QG + D D K+ R   D L +L  G+E+      PH
Sbjct: 105 FSATATWILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
               + C + + K +  V   +TR+ + +  A   N+ DV +K      CV  +A  +G 
Sbjct: 165 FSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGL 224

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
            L  +I K    L T F ++   +++++ + V++V L + N +R+ +A+E F ++  + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLS 283

Query: 361 LREGNLQENI 370
            ++ N  E +
Sbjct: 284 PQQVNAMERV 293


>gi|218192328|gb|EEC74755.1| hypothetical protein OsI_10514 [Oryza sativa Indica Group]
          Length = 479

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 163/363 (44%), Gaps = 48/363 (13%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W  L+        + +TQ LL ++GV    AT       W L
Sbjct: 71  VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 130

Query: 196 KDGAGRVGKMIFA------------RQ-----------------GKKFDYDLKQLRFAGD 226
           +D  G +G ++F             RQ                 G   D + K  R   D
Sbjct: 131 RDLTGMLGGILFTFYQVSNSLANSNRQCHRIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 190

Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
            + +LG  ++L +   P   + + C  +++++   V S +TR  + + FA   N  D++A
Sbjct: 191 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 250

Query: 287 K---GECVGNIADLLGTGLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
           K    E +  +  +    L   +++ +   +  +F  L+  +++++Y+ V+S+ L TLN 
Sbjct: 251 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 310

Query: 343 ARFSVAVESFLKTGHVPSLREGNLQENIFSF--PW-------LRDKPVVLGSRFKDAFQD 393
            R S+ ++ F+  G V + ++ + QE+I  F   W       L  + V LG++       
Sbjct: 311 ERSSIMLQYFMDNGEVLTPQQVSKQEHILPFWSSWRKFLRIKLPHEHVHLGAK-ASRLTH 369

Query: 394 PD--AYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCIN-SS 450
           PD        P++  E Y +     +G V+  +  QA + DIL +  H  VL H +  S 
Sbjct: 370 PDLLQIAKTRPYYRNENYFLL--DKEGSVHIFIHKQAAATDILMSFIHGLVLAHLMQKSK 427

Query: 451 NGH 453
           +GH
Sbjct: 428 SGH 430


>gi|302766892|ref|XP_002966866.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
 gi|300164857|gb|EFJ31465.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
          Length = 265

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 79/268 (29%), Positives = 135/268 (50%), Gaps = 15/268 (5%)

Query: 107 FKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMG 166
           F+R   R I      AP     ++I    +P+G+P SV+  Y+ + TW  L+        
Sbjct: 5   FRREKTRRI------AP----WTWIVLAFLPEGYPSSVSSDYLAFQTWDTLQGLSTYIRS 54

Query: 167 VFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAG 225
           + +TQ LL+ +GV    AT       W L+D  G VG ++F   QG   D   KQ R A 
Sbjct: 55  MLSTQALLSGIGVGAATATAIGATFQWFLRDLTGMVGGIVFTLYQGSNLDNRAKQWRLAA 114

Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
           D + ++G  ++L +   P  F+   C  + A+ V  V   +TR  + + FA+ +N  DV+
Sbjct: 115 DFMNDIGMLMDLVSPLFPRAFMFFLCLGSAARAVTGVAGGATRAALTQHFARKQNAADVS 174

Query: 286 AKGECVGNIADLLGTGLSIMISKRNPSLI----TTFALLSCGYVFSSYQEVKSVVLHTLN 341
           AK       A L+G  + + +++   + I    T+F  L+  +V+++Y+ V ++ L ++N
Sbjct: 175 AKEGSQETAATLVGMIVGMFLARLTANNIVLMWTSFMSLTAFHVYANYKAVCALCLTSIN 234

Query: 342 RARFSVAVESFLKTGHVPSLREGNLQEN 369
             R ++ ++SFLK G  PS  E + QE+
Sbjct: 235 AERMAIVLQSFLKDGKAPSPEEASAQES 262


>gi|449489204|ref|XP_004158245.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 435

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 82/327 (25%), Positives = 165/327 (50%), Gaps = 21/327 (6%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W +L+        + +TQ LL+++GV    AT       W L
Sbjct: 56  VPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVIGATFQWFL 115

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAA 253
           +D  G +G ++F   QG   D + K  R   D + +LG  V+   + + P  F+ + C  
Sbjct: 116 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMYVDGPVSPLFPSAFMLIVCLG 175

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNP-- 311
           +++++   V S +TR  + + FA   N  D++AK      +A ++G  + +++++     
Sbjct: 176 SLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGMLLARITMGC 235

Query: 312 --SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
             ++  +F  L+  +++++Y+ V  + L +LN  R S+A++ +++T  V S ++ ++ E+
Sbjct: 236 PLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLSPQQVSMMEH 295

Query: 370 I---FSFPW-LRDKPVV-----LGSRFKDAFQDPDAYL----AVEPFFEKERYVVTYNPS 416
           +   ++  W LR   ++     LG+R           L          ++E+Y++     
Sbjct: 296 VLPAWASSWSLRSIKLIHTRVRLGARISSLHHTEIKELLHLAGANNNDKQEKYLLV--ER 353

Query: 417 KGKVYALLKDQAKSDDILKAAFHAYVL 443
           K  +  +++  A ++DI ++  HA V+
Sbjct: 354 KETISVIVQKDATANDIFQSFVHALVM 380


>gi|326427989|gb|EGD73559.1| hypothetical protein PTSG_05267 [Salpingoeca sp. ATCC 50818]
          Length = 404

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 80/237 (33%), Positives = 125/237 (52%), Gaps = 6/237 (2%)

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
           ++ L + +    +P  +  SV+P ++ Y  W+ +      A GV + Q+LL +VGV    
Sbjct: 109 SQPLRTRLEHVFLPRDYRASVSPKFLDYCKWQFVHMTTMTASGVLSMQSLLYAVGVGAG- 167

Query: 184 ATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           + P A AINWILKDG G++G M++    G +FD D K+ RF   L +++   V++ T   
Sbjct: 168 SVPLAAAINWILKDGLGQLGGMLYGSIFGTRFDDDPKRQRFNAVLSLQVSGIVDIITPLF 227

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
           PH FL LA  +N  KNV+ + S++TR  +  +F +  N+GDVTAK       A + GTGL
Sbjct: 228 PHHFLLLASVSNFGKNVSYLASSATRAQMNLSFTRTSNLGDVTAKMTSQSIAASVFGTGL 287

Query: 303 SIMISKRNPS----LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
            I++SK   +    L+  +  LS   +  +Y   K V L T N  R  +    +  T
Sbjct: 288 GILVSKVTGTEAALLMAAYVPLSLVSIGGNYYSSKFVSLKTFNVQRAEILARHYFTT 344


>gi|357127303|ref|XP_003565322.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
          Length = 486

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 95/329 (28%), Positives = 150/329 (45%), Gaps = 16/329 (4%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           I  +V+P GFP SV+  Y+ YM W+   +  G       T +LL +VGV     T  A +
Sbjct: 95  ITDFVLPAGFPGSVSDDYLQYMLWQFPTNVTGWICHTLVTSSLLKAVGVGSFTGTSAAAS 154

Query: 191 ---INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
              I W+ KDG G  G+++   R G  FD D K+ R   D +  +G+  EL T   P  F
Sbjct: 155 AAAIRWVSKDGIGAFGRLLIGGRFGTLFDNDPKKWRMYADFIGSVGSIFELTTPLYPGYF 214

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI 306
           LPLA   N+AK V       +   I   FAK  N+G++ AK E  G  A LLG  + ++I
Sbjct: 215 LPLASLGNLAKAVGRGFRDPSNRVIQNHFAKSGNLGEIAAKEEVWGVGAQLLGLSIGVLI 274

Query: 307 ------SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
                     P+L  T+  +   +++  YQ +  +   T+N  R  + V S +    VP 
Sbjct: 275 LDTSGVQSSYPTLTLTWLGVRLLHLWFRYQSLIVLKFRTVNLKRARILVRSHVAHHTVPG 334

Query: 361 LREGNLQENIFSFPWLRDKPVVLG---SRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSK 417
               N +ENI ++       +  G    R     +  D    +   ++ E Y++  + S+
Sbjct: 335 YVACNEEENILTWERFLQPQISFGVPMERMLGGGEPSDMVSRLLKLYKNEMYILFVDQSE 394

Query: 418 GKVYALL---KDQAKSDDILKAAFHAYVL 443
            K  A +   K+ A S  +L++ + A+ L
Sbjct: 395 SKEPAFVVTFKEAATSMSVLRSLWQAHWL 423


>gi|212274801|ref|NP_001130544.1| uncharacterized protein LOC100191643 [Zea mays]
 gi|194689438|gb|ACF78803.1| unknown [Zea mays]
 gi|223974155|gb|ACN31265.1| unknown [Zea mays]
 gi|413956565|gb|AFW89214.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
          Length = 442

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 87/323 (26%), Positives = 152/323 (47%), Gaps = 17/323 (5%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W  L+        + +TQ LL ++GV    AT       W L
Sbjct: 62  VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATFQWFL 121

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G +G ++F   QG   D + K  R   D + +LG  ++L +   P   + + C  +
Sbjct: 122 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 181

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGTGLSIMISKRNP 311
           ++++   V S +TR  + + FA   N  D++AK    E +  +  +    L   I++ + 
Sbjct: 182 LSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLGMLLAQITRGHA 241

Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
            S+  +F  L+  +++++Y+ V+S+ L TLN  R S+ ++ F + G V   R+ + QE+I
Sbjct: 242 LSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVLVPRKVSQQEHI 301

Query: 371 FSFPW----------LRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKV 420
               W          L  + V LG++        D  +  +  +  E         +G V
Sbjct: 302 LP-SWSNWRKLNRIKLPHERVHLGAK-ASMLTHSDMLVIAKTRYHYENANYFLLDKQGIV 359

Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
           Y  +  +A   D+L++  H  VL
Sbjct: 360 YVFIHKEATPADVLRSFVHGLVL 382


>gi|222624448|gb|EEE58580.1| hypothetical protein OsJ_09902 [Oryza sativa Japonica Group]
          Length = 479

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 163/363 (44%), Gaps = 48/363 (13%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W  L+        + +TQ LL ++GV    AT       W L
Sbjct: 71  VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 130

Query: 196 KDGAGRVGKMIFA------------RQ-----------------GKKFDYDLKQLRFAGD 226
           +D  G +G ++F             RQ                 G   D + K  R   D
Sbjct: 131 RDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 190

Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
            + +LG  ++L +   P   + + C  +++++   V S +TR  + + FA   N  D++A
Sbjct: 191 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 250

Query: 287 K---GECVGNIADLLGTGLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
           K    E +  +  +    L   +++ +   +  +F  L+  +++++Y+ V+S+ L TLN 
Sbjct: 251 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 310

Query: 343 ARFSVAVESFLKTGHVPSLREGNLQENIFSF--PW-------LRDKPVVLGSRFKDAFQD 393
            R S+ ++ F+  G V + ++ + QE+I  +   W       L  + V LG++       
Sbjct: 311 ERSSILLQYFMDNGEVLTPQQVSKQEHILPYWSSWRKFLRIKLPHEHVHLGAK-ASRLTH 369

Query: 394 PD--AYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCIN-SS 450
           PD        P++  E Y +     +G V+  +  QA + DIL +  H  VL H +  S 
Sbjct: 370 PDLLQIAKTRPYYRNENYFLL--DKEGSVHIFIHKQAAATDILMSFIHGLVLAHLMQKSK 427

Query: 451 NGH 453
           +GH
Sbjct: 428 SGH 430


>gi|170582654|ref|XP_001896227.1| cDNA sequence BC017158 [Brugia malayi]
 gi|158596613|gb|EDP34929.1| cDNA sequence BC017158, putative [Brugia malayi]
          Length = 389

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 75/247 (30%), Positives = 132/247 (53%), Gaps = 5/247 (2%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
            +R   +P G+P SV+P Y+ Y  W  ++ F        +T+ +L  VGV    A+  A 
Sbjct: 37  LLRDIFMPRGYPQSVSPDYMNYQMWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAA 96

Query: 190 AINWILKDGAGRVGKMIFARQGKKF-DYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
           A+ W+LKDG G + +++FA     + D D KQ R   D   +L   ++L T   P LF+P
Sbjct: 97  AVAWLLKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPSLFMP 156

Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNI-ADLLGTGLSI 304
           + C +++ + V  V  ++TRT +    A  +N+GDV AK    E + N+ A L    L  
Sbjct: 157 IICLSSMIRAVVGVAGSATRTTVTNHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLP 216

Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
           ++S+   S+   F+L +  +++ +Y+ VKS+   TLN++   + V+ +++T  + ++ E 
Sbjct: 217 VVSRNIVSVWLLFSLFTFIHLYGNYRAVKSLQFRTLNQSLLRIVVKDYIETRKISTVSEM 276

Query: 365 NLQENIF 371
           N +E I 
Sbjct: 277 NSKEPIL 283


>gi|320164887|gb|EFW41786.1| GTP-binding protein [Capsaspora owczarzaki ATCC 30864]
          Length = 436

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 80/254 (31%), Positives = 129/254 (50%), Gaps = 14/254 (5%)

Query: 110 NSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFT 169
           N +R +  + P+A        +++  +PDG+PDSV+  Y  Y  W  L+ F     G   
Sbjct: 29  NLLRLVDEKHPEA--SSFAQLLKAVFLPDGYPDSVSADYASYQRWDTLQAFCSSINGSLA 86

Query: 170 TQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLL 228
           T  +L  VGV    AT  A A++W+L+DG G +G++ FA Q G   D + K+ R   D+ 
Sbjct: 87  TLAVLKGVGVGDETATATAAAVSWMLRDGVGMLGRIFFAWQKGPALDANAKRWRLVADIF 146

Query: 229 MELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG 288
            +L   VEL +  V   FL  AC  +V +++  V   +TR  I +  A+  N GDV+AK 
Sbjct: 147 NDLAMFVELLSPLVGEWFLAFACLGSVLRSIVGVAGGATRAAITQHQARRNNHGDVSAKD 206

Query: 289 ---ECVGNIADLL-GTGLSIMISKRNPSLITTFALLSCGY----VFSSYQEVKSVVLHTL 340
              E + N+A LL    L   +   +P+L+    LL C +    +F++Y+ V++ V  T 
Sbjct: 207 GSQETLVNLAALLVSLWLLPALGSDHPTLVW---LLFCSFTSLHLFANYRGVRATVFDTF 263

Query: 341 NRARFSVAVESFLK 354
           NR R  +    +++
Sbjct: 264 NRERLFIVASDYMQ 277


>gi|167517110|ref|XP_001742896.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163779520|gb|EDQ93134.1| predicted protein [Monosiga brevicollis MX1]
          Length = 272

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 74/198 (37%), Positives = 116/198 (58%), Gaps = 10/198 (5%)

Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLR 222
           A GV + Q LL + G+    + P A AINW+LKDG G++G +++    G +FD D K+ R
Sbjct: 3   ASGVLSMQALLYAAGLGAG-SVPLAAAINWVLKDGLGQLGGVLYGTLFGPRFDRDPKRQR 61

Query: 223 FAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
           F   + ++     EL    VPHLFLPLA AAN+AKN++ + S++TR  ++ +F +  N+G
Sbjct: 62  FWSLVALQGANLTELLIPLVPHLFLPLASAANIAKNISFLASSATRAQMHASFVRRANLG 121

Query: 283 DVTAK--GECVGNIADLLGTGLSIMISKRNPS----LITTFALLSCGYVFSSYQEVKSVV 336
           D+TAK   + +G  A ++GT L I +S    S    +I  F  L+   ++ ++   K VV
Sbjct: 122 DITAKIGAQSIG--ASVVGTALGIFVSTHTGSDVTNIIPAFVPLALACLYFNHLSSKHVV 179

Query: 337 LHTLNRARFSVAVESFLK 354
           L TLN  R  +A+  +L+
Sbjct: 180 LSTLNEQRLELAMHPWLQ 197


>gi|413956564|gb|AFW89213.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
          Length = 485

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 89/334 (26%), Positives = 156/334 (46%), Gaps = 17/334 (5%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           E + S +    VP+GFP SVTP YVP+  W  L+        + +TQ LL ++GV    A
Sbjct: 51  ERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSA 110

Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           T       W L+D  G +G ++F   QG   D + K  R   D + +LG  ++L +   P
Sbjct: 111 TVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFP 170

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGT 300
              + + C  +++++   V S +TR  + + FA   N  D++AK    E +  +  +   
Sbjct: 171 SSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLG 230

Query: 301 GLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
            L   I++ +  S+  +F  L+  +++++Y+ V+S+ L TLN  R S+ ++ F + G V 
Sbjct: 231 MLLAQITRGHALSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVL 290

Query: 360 SLREGNLQENIFSFPW----------LRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERY 409
             R+ + QE+I    W          L  + V LG++        D  +  +  +  E  
Sbjct: 291 VPRKVSQQEHILP-SWSNWRKLNRIKLPHERVHLGAK-ASMLTHSDMLVIAKTRYHYENA 348

Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                  +G VY  +  +A   D+L++  H  VL
Sbjct: 349 NYFLLDKQGIVYVFIHKEATPADVLRSFVHGLVL 382


>gi|312374874|gb|EFR22346.1| hypothetical protein AND_15402 [Anopheles darlingi]
          Length = 417

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 72/247 (29%), Positives = 124/247 (50%), Gaps = 7/247 (2%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           F +   +P G+PDSV+  Y+ Y  W  ++ F     G  TT  +L  VGV  +  T  + 
Sbjct: 48  FFQHLFLPVGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDSVTALSA 107

Query: 190 AINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPHLFL 247
            + W+LKDG G +G+++FA  +G + D D K+ R   D+L ++   ++L      P    
Sbjct: 108 TVTWVLKDGMGHLGRILFAWWKGTELDIDSKKWRIRADVLNDIAMAIDLFVLPYYPKAST 167

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLS 303
            + C     K +  V   +TR+ + +  A   N+ DV +K      CV  IA  +G  L 
Sbjct: 168 YILCGTTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLLLL 227

Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
             +  +   L   FA ++  +++++ + VK+V L T N AR+ +A+E + K+G + S R+
Sbjct: 228 TYLQNQRI-LYGLFAFVTFLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSPRQ 286

Query: 364 GNLQENI 370
            N  E +
Sbjct: 287 VNKLERV 293


>gi|302847291|ref|XP_002955180.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
           nagariensis]
 gi|300259472|gb|EFJ43699.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
           nagariensis]
          Length = 1055

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 86/292 (29%), Positives = 143/292 (48%), Gaps = 13/292 (4%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           ++   +PD  P  VTP Y  ++ WR  + FF   M  F+   LL + G+   +A   + A
Sbjct: 286 LKEAFLPD--PRDVTPDYWEWLRWRLTQRFFSSTMQNFSFSALLMATGLGAKKAFAASAA 343

Query: 191 INWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           INW+LKDG  R+ +M  +   G+ FD DLK++RF   L+       E AT   P  F+ L
Sbjct: 344 INWLLKDGVSRIVRMSVSTSFGQTFDADLKRMRFITSLIFTACMAGEFATPFYPQHFVAL 403

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS-- 307
           A  ++V + V      +T+     A A G N+ D+T+K +    + D+L  G+S  ++  
Sbjct: 404 ASISSVGRAVGLSAFVATQPAFQAALATGGNLADLTSKNQAQHMVMDMLALGVSAGLTWL 463

Query: 308 ----KRNPSLI--TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
                R   L+    + L + G +   + E+K+V L TLNR R  + +E +++ G VP  
Sbjct: 464 CRGMPRGGLLLPAVMYPLCAAGDLTCIWHELKAVQLRTLNRERAEMLIERWMRRGAVPDA 523

Query: 362 REGNLQENIF--SFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
            E +  EN+   S  W    P+ +G   + A  +P+    +   ++ E YV+
Sbjct: 524 AEISAAENLILPSDVWRGLLPLRIGPLDRIAPANPEQLRKLLREYDGEEYVL 575


>gi|242019564|ref|XP_002430230.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212515330|gb|EEB17492.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 527

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 92/366 (25%), Positives = 157/366 (42%), Gaps = 35/366 (9%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           + + S I+   +P G+P SV+  Y+ Y  W   + F     G  T + ++  +GV  N A
Sbjct: 34  QHIKSLIKDIFLPKGYPYSVSSDYLEYQIWDFCQAFCSSICGTLTVRAVMKGIGVGSNVA 93

Query: 185 TPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           TP + A+ WI+KDG G +G ++FA  +G K D D K+ R   D+L ++   +EL+   + 
Sbjct: 94  TPMSAALTWIIKDGTGMIGSIMFAWLKGSKLDSDCKKWRLFADILNDIAMFIELSIPIIM 153

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
              + + CAA + K +     +STR  I    A   N  DV+AK        +LLG    
Sbjct: 154 DYSMYMLCAATIFKALVGTAGSSTRAAIITHQAVKGNTADVSAKDGSQELFTNLLGFIFG 213

Query: 304 IMISKRNPSLITTFALLSCGYVFS---SYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
           I + K        + ++    +F    +Y+ V+ + ++  N  R  + + S+L T  VP+
Sbjct: 214 IFLLKNVNESNQVWIVMCTATIFHLIFNYKAVRCLKINVFNGERLRLTLLSYLPTSCVPT 273

Query: 361 LREGNLQENIFSFPWLRDKPVVLGSR--------FKDAF------------QDPDAYLAV 400
           + + N           R +PV+LG          FK  F                    +
Sbjct: 274 IEQVN-----------RREPVILGCGTSDYQLCGFKIKFGVSVNKAVKSEKMTNQQLSTL 322

Query: 401 EPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ 460
              F+ ++Y +     K  +    +      D+LKA FHA  +   I+  N +  L + +
Sbjct: 323 ANVFQSKKYFIIIQLKKKIINVAFRKHFSDIDVLKAYFHAIKMGIAISIINSNFELQIIK 382

Query: 461 HEYDNS 466
               NS
Sbjct: 383 RSRKNS 388


>gi|357612137|gb|EHJ67828.1| hypothetical protein KGM_02553 [Danaus plexippus]
          Length = 525

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 151/335 (45%), Gaps = 13/335 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P+SV+  Y+ Y  W   + F     G+  TQ +   VGV    A+P A  + W+ 
Sbjct: 52  LPQGYPNSVSKDYIFYQIWDTAQAFCSTITGILATQEVFRGVGVGDTNASPLAATVTWVF 111

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KDG G +GK++FA   G   D   K+ R   D L +    +E+A     +      C + 
Sbjct: 112 KDGCGHIGKILFAYTHGTYLDAYSKKWRLYADTLNDAAMCIEIALPLFKNYITFALCVST 171

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI 314
             K +  V   +TR  + +  A   N+ DV+AK        +L+ +  ++ +     + +
Sbjct: 172 CMKAIVGVAGGATRVAMTQHHALRGNLADVSAKDSAQETAVNLIASFAALFLISLIGNSV 231

Query: 315 TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF- 373
           T F +L   ++  +Y  V++V L TLN  RF   ++++L+   + +  E N  E I  + 
Sbjct: 232 TIFIILLIMHIVFNYMAVRAVCLRTLNEPRFLQVIDTYLRKEVIANPCEINRNEPIIFYQ 291

Query: 374 --PWLRDKP-----VVLGSRFKDAF--QDPDAYL-AVEPFFEKERYVVTYNPSKGKVYAL 423
             P L D       +++G         +    Y+  V+  +  ++Y++  +     +Y  
Sbjct: 292 LGPNLLDLKICGFHIIIGDSISKILNPRTNAVYINKVKDIYNDKKYIIHPDTGNRVMYVF 351

Query: 424 LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLAL 458
            K+ A  DD+L A F + VLL  I  +     LA+
Sbjct: 352 PKEDASVDDMLCAYFQS-VLLAIITCAINDHQLAI 385


>gi|42567566|ref|NP_195771.2| uncharacterized protein [Arabidopsis thaliana]
 gi|209863158|gb|ACI88737.1| At5g01510 [Arabidopsis thaliana]
 gi|332002971|gb|AED90354.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 509

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/332 (30%), Positives = 156/332 (46%), Gaps = 19/332 (5%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS-------RNR 183
           +R +V P GFP SV+  Y+ YM W+   +  G    V  T +LL +VGV           
Sbjct: 115 VRDFVFPSGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVGVGSFSGTSAAAT 174

Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           A   A AI W+ KDG G +G+++   R G  FD D KQ R   D +   G+  +LAT   
Sbjct: 175 AAASAAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLY 234

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
           P  FL LA   N+AK VA      +   I   FA   N+G+V AK E     A L+G G 
Sbjct: 235 PSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGF 294

Query: 303 SIMI------SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
            I+I       K  P ++ T+  +   +++  YQ +  +  +T+N  R  + VES +   
Sbjct: 295 GILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHS 354

Query: 357 HVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF---QDPDAYLAVEPFFEKERYVVTY 413
            VP   + N +ENI  +       ++ G   ++     +      A+   + KE+Y++T 
Sbjct: 355 VVPGYVDCNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVSKVKALLKMYTKEKYILTL 414

Query: 414 NP-SKGKVYAL-LKDQAKSDDILKAAFHAYVL 443
           N  +K   +++  K  A S D+L+  + AY L
Sbjct: 415 NKLNKDTEFSVSFKVNATSRDVLRCLWQAYWL 446


>gi|403362828|gb|EJY81150.1| DUF647 family protein [Oxytricha trifallax]
          Length = 507

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 70/228 (30%), Positives = 115/228 (50%), Gaps = 6/228 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P +V+ SY+ +  +  +  F   AMG  + Q+L  ++G +  +A   A A  W+L
Sbjct: 89  LPAGYPHTVSSSYLKFTIYSNISAFMYTAMGFLSAQSLFVAIGSTMTQANLAAAAYTWVL 148

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KDG G++G ++FA R G+ FD D+K+ RF     + +   +E+ T + P  FL LA  AN
Sbjct: 149 KDGLGQLGGILFASRYGRNFDVDIKKWRFMAIFALNISMYIEILTLSFPQYFLALASLAN 208

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK-----R 309
           V KN+  + ++++R  I    AK  NIGD++ K       + L+G    + ISK      
Sbjct: 209 VGKNICFLLASASRASINLQLAKNNNIGDISGKSVSQFTASTLIGGACGLGISKIIDITS 268

Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
              L   F +L+   + +SY   K +    LN  R  +    F K+  
Sbjct: 269 IYQLYPVFFVLTVMNIAASYYSAKVIDELYLNNQRAFLLFNEFFKSNE 316


>gi|195643336|gb|ACG41136.1| hypothetical protein [Zea mays]
          Length = 442

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 88/325 (27%), Positives = 154/325 (47%), Gaps = 21/325 (6%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W  L+        + +TQ LL ++GV    AT       W L
Sbjct: 62  VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATFQWFL 121

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G +G ++F   QG   D + K  R   D + +LG  ++L +   P   + + C  +
Sbjct: 122 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 181

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGTGLSIMISKRNP 311
           ++++   V S +TR  + + FA   N  D++AK    E +  +  +    L   I++ + 
Sbjct: 182 LSRSFTGVASGATRAALTQHFALANNATDISAKEGSQETLATMLGMGLGMLLAQITRGHA 241

Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
            S+  +F  L+  +++++Y+ V+S+ L TLN  R S+ ++ F + G V   R+ + QE+I
Sbjct: 242 LSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVLVPRKVSQQEHI 301

Query: 371 FSFPW----------LRDKPVVLGSRFKDAFQDPDAYLAVEP--FFEKERYVVTYNPSKG 418
               W          L  + V LG++        D  +  +    +E   Y +     +G
Sbjct: 302 LP-SWSNWRKLNRIKLPHERVHLGAK-ASMLTHSDMLVIAKTRCHYENANYFLL--DKQG 357

Query: 419 KVYALLKDQAKSDDILKAAFHAYVL 443
            VY  +  +A   D+L++  H  VL
Sbjct: 358 IVYVFIHKEATPADVLRSFVHGLVL 382


>gi|413949257|gb|AFW81906.1| hypothetical protein ZEAMMB73_483620, partial [Zea mays]
          Length = 116

 Score =  122 bits (307), Expect = 3e-25,   Method: Composition-based stats.
 Identities = 60/116 (51%), Positives = 84/116 (72%), Gaps = 10/116 (8%)

Query: 385 SRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLL 444
           SRF +AFQ+P +++A++P FE ERY+VTYNP+K KVYALLKDQAK DDILKAAFHA+VLL
Sbjct: 1   SRFGEAFQEPASFIAIKPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVLL 60

Query: 445 HCINSSNGHRSLALK-----QHEYDNSNLMLTTADLEAHIGESCKVVSTALWGFQE 495
           H IN+++ +R+   +      H+Y+  N+     D   HI ESCK+V ++   F++
Sbjct: 61  HFINATHANRNARKRANSNLSHQYNPLNM-----DFLPHIEESCKIVMSSYGVFKK 111


>gi|403412621|emb|CCL99321.1| predicted protein [Fibroporia radiculosa]
          Length = 4586

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 98/344 (28%), Positives = 160/344 (46%), Gaps = 24/344 (6%)

Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
           +  S++R   +P  +P SV PSY+ +   +  +  FG  + V   Q LL SVGVS   + 
Sbjct: 108 KFTSWLRQMFLPTNYPQSVHPSYLRFHALQFFETTFGTVVSVLCNQALLTSVGVSAEGSI 167

Query: 186 PGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV-- 242
            GAVA+ WI+KDGAG V K+ F R+    FD   K     G+ ++ +G+ +++AT  +  
Sbjct: 168 FGAVAVQWIIKDGAGEVAKLFFIRRFSPYFDSHPKSFTLFGEGIVAIGSCLQMATLLITP 227

Query: 243 -PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG-- 299
            P  FL  A   N+ K V      +T     + F+   N+GDV AK E   +IA L G  
Sbjct: 228 TPGNFLLCAAGGNIFKLVGYAVWFTTHIKWVRYFSLQGNVGDVAAKDESQTSIAQLCGYA 287

Query: 300 TGLSIMISKRNPS-LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK---- 354
            G+ ++    +P+ L + F  L+  ++ ++   ++      L   R S+    + +    
Sbjct: 288 AGIGLLTVSHSPAYLYSLFFALTPLHLTATMFMMRDAKFEALTLPRLSLLAREYAREDGE 347

Query: 355 --TGHVPSLR--EGNLQENIFS--FPWLRDKPVVLGSRFK-----DAFQDPDAYLAVEPF 403
             TG V +LR  E + Q   F   F    D+ V L  R       D  ++P  +      
Sbjct: 348 AATGKVNTLREIEESRQTGAFGEFFKRKNDRFVELAPRVTEVIGGDVDREPVLWEMCTSE 407

Query: 404 FEKERYVV--TYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLH 445
           F+ E+Y++  + +PS+  +       A SDD L++  HA    H
Sbjct: 408 FQTEKYLLYPSASPSEKPITVFYHPDATSDDTLRSILHAARFRH 451


>gi|324514354|gb|ADY45840.1| Unknown [Ascaris suum]
          Length = 394

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 85/320 (26%), Positives = 154/320 (48%), Gaps = 22/320 (6%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGA 188
           +F     +P G+P SV+  Y+ Y  W  ++ F         T+ +L   GV    A+  A
Sbjct: 42  AFFVDVFLPRGYPHSVSADYMSYQIWDTVQAFASSMTSALATEAVLRGAGVGDENASALA 101

Query: 189 VAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
             + W+++DG G + ++ FA  +  + D D K+ R   D L +L   +EL  AA+P LF 
Sbjct: 102 ATLTWLIRDGLGMLSRIGFAWLKSAQLDADCKRWRLVADFLNDLAFSLELLAAALPSLFT 161

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNI-ADLLGTGLS 303
            L C +++A+++ AV   +TRT + +  A+  N+ DV+AK    E + N+ A L    L 
Sbjct: 162 LLVCLSSLARSIVAVAGGATRTTVVQHQARCNNVADVSAKDGSQETLVNVTALLCSLILL 221

Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
            +IS R   +   + L +  ++F++Y  V ++   TLN+    +    F++TG V S++ 
Sbjct: 222 PLISGRTVLIWMFYILFTGVHLFANYAAVTALHFETLNQKLLRIITNHFVETGDVFSVKV 281

Query: 364 GNLQENIFSFPWLRDKPVVLG--SRFKDA--FQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
            N  E           P++ G  SR+     +    A+ A+    +++ ++V   P    
Sbjct: 282 ANELE-----------PLLFGCASRYYGCRLYDVLSAHCALT--LQRDSFIVGCEPKLEY 328

Query: 420 VYALLKDQAKSDDILKAAFH 439
            Y  +    +  + +KAAF+
Sbjct: 329 AYIAMASGTQQREHIKAAFY 348


>gi|300176775|emb|CBK25344.2| unnamed protein product [Blastocystis hominis]
          Length = 409

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 86/281 (30%), Positives = 137/281 (48%), Gaps = 20/281 (7%)

Query: 74  EDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRS 133
           ED   +++  L  SE   G+ +    +++ S + K+ S  ++   +   P      FI++
Sbjct: 43  EDDKDNAATSLGISEYYKGKLYTLTFDKE-SNRIKKESTTSLRNHTLTLPV-----FIKT 96

Query: 134 -------YVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
                    +P  +P SV   Y+ Y +W A       A GV +TQ++L ++G+    A P
Sbjct: 97  TKLKLTDIFLPRNYPSSVKGKYLNYCSWAAFGMGLSAAGGVLSTQSMLFAIGMGAG-AVP 155

Query: 187 GAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
            A A+NW+LKDG G++G M F A    + D D +  R     L+E+   +E+ T   PH 
Sbjct: 156 MAAALNWVLKDGLGQLGGMAFTALVNSRLDADSRGWRILSAWLLEISTWLEVMTPLFPHS 215

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
           FL LA  ANV KN++ +  ++TR  I   F    N+GD+TAK         + GT L I+
Sbjct: 216 FLLLATLANVGKNISWLAGSATRAGIRYGFVNAHNMGDITAKEGSQTVAITVFGTFLGIV 275

Query: 306 ISK-----RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLN 341
           IS          ++ +   +S   +FS YQ ++ V L TLN
Sbjct: 276 ISNLIGHGHMEYVLMSSMCISAVSLFSIYQSLRCVSLPTLN 316


>gi|297810247|ref|XP_002873007.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318844|gb|EFH49266.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 510

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 102/340 (30%), Positives = 154/340 (45%), Gaps = 21/340 (6%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS-------RNR 183
           ++ +V P GFP SV+  Y+ YM W+   +  G    V  T +LL +VGV           
Sbjct: 116 VKDFVFPTGFPASVSDDYLDYMLWQFPTNVTGWICNVLVTSSLLKAVGVGSFSGTSAAAT 175

Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           A   A AI W+ KDG G +G+++   R G  FD D KQ R   D +   G+  +LAT   
Sbjct: 176 AAASAAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLY 235

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
           P  FL LA   N+AK VA      +   I   FA   N+G+V AK E     A L+G G 
Sbjct: 236 PSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGF 295

Query: 303 SIMISKRNPSLITTFALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
            I+I    P L+ +F  +S  +       ++  YQ +  +  +T+N  R  + VES +  
Sbjct: 296 GILIID-TPGLVKSFPFVSLTWTSVRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVH 354

Query: 356 GHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF---QDPDAYLAVEPFFEKERYVVT 412
             VP   + N +ENI          ++ G   ++     +      A+   + KE+Y++T
Sbjct: 355 SVVPGYVDCNKRENILLGQRFMKPRIIFGVSLEEVSGLEKSVSKVKALLKMYTKEKYILT 414

Query: 413 YNP--SKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSS 450
            N   S  +     K  A S D+L+  + AY L   +  S
Sbjct: 415 LNKLNSDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEES 454


>gi|241595406|ref|XP_002404468.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215502351|gb|EEC11845.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 472

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 160/340 (47%), Gaps = 36/340 (10%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFPD+V+  Y  Y  W +++ F     G   TQ +L  VGV    A+  A    WIL
Sbjct: 50  LPSGFPDTVSRDYKEYQIWDSIQAFASSITGTLATQAVLTGVGVGDQTASVLAATTTWIL 109

Query: 196 KDGAGRVGKMIFARQGKKFDY-------------DLKQLRFA-------GDLLMELGAGV 235
           KDG G +G+++FA   +                 D   L  A        D+L +    +
Sbjct: 110 KDGTGMIGRILFALVSRYLGMCCATHNKYCAHSNDSTTLNHADNYCSLFADILNDAAILL 169

Query: 236 ELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVG 292
           EL +  +      + C ++VAK+V  V   +TR  + +  A+  N+ DV+AK    E + 
Sbjct: 170 ELMSQHLKGYITLVLCISSVAKSVVGVAGGATRAALTQHQARSNNMADVSAKDGSQETLV 229

Query: 293 NIADLLGTGLSIMISKRNPSLIT-TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
           N+A  L + L + +   +  L+   F + +  ++F++++ V  VV+ T NR+R+S+ V+ 
Sbjct: 230 NLAAFLCSLLLLRLLAGSQILVYGIFTIFTLLHIFANFRAVSCVVMETFNRSRYSLLVKH 289

Query: 352 FL-KTGHVPSLREGNLQENIFSFPWLRDK----PVVLGSRFKDAFQDPDAYLAVEPFFEK 406
           FL + G + S+ + N QE++    WLR       V LGS         D   +  P  ++
Sbjct: 290 FLEEAGDIASVEKVNKQESV----WLRCGRHFVNVNLGSPLSSVAATHDDLCSRAPKEKQ 345

Query: 407 ERYVVTYNPSKGKVY---ALLKDQAKSDDILKAAFHAYVL 443
            RY++       + Y    +L +Q    D ++A FH+++L
Sbjct: 346 ARYLLKVVRDSSRHYDINVVLHEQCTPRDQMEAMFHSFLL 385


>gi|395332415|gb|EJF64794.1| hypothetical protein DICSQDRAFT_80493 [Dichomitus squalens LYAD-421
           SS1]
          Length = 506

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 93/334 (27%), Positives = 151/334 (45%), Gaps = 20/334 (5%)

Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
            + S++R   +P  +P SV  SY  + T +AL+        V   Q LL SVG+S   + 
Sbjct: 111 RISSWLRQMFLPTNYPQSVHSSYASFHTLQALETTMATITSVLCNQALLTSVGMSAEGSV 170

Query: 186 PGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV-- 242
            GAVA+ WI+KDGAG V K+ F R     FD   K     G++L  LG+G+++AT  +  
Sbjct: 171 FGAVAVQWIIKDGAGEVAKLFFIRHFSTYFDSHPKTFTLFGEVLGCLGSGLQIATVLIAP 230

Query: 243 -PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG-- 299
            P  FL  A   N+ K V     ++T     + F+   N GDV AK E   ++A L G  
Sbjct: 231 SPLNFLLCAAGGNIFKLVGNAIWSTTHIKFIRYFSMQGNDGDVAAKDESQASVAQLAGYA 290

Query: 300 TGLSIMISKRNPSLITT--FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
            G+S++     P+ +    FA +      ++Y  ++      L   R S   ++++  G 
Sbjct: 291 AGISLLTFSHAPAYLYAIFFAAVPLHLTMTAYM-MRVATFELLTLPRISHLAQTYVNDGV 349

Query: 358 VPSLREGNLQENIFSFPWLR----DKPVVLGSRFKDAFQ-----DPDAYLAVEPFFEKER 408
           V S ++ + +     F        D+ + L  R  DA       D   +      F+ +R
Sbjct: 350 VGSQKDLDREHATGLFGEFYKNNGDRWLTLAPRVGDALNTSSDVDRSTWQVCSQVFQDDR 409

Query: 409 YVV--TYNPSKGKVYALLKDQAKSDDILKAAFHA 440
           Y++    +P    +  L    A +DD+L++  HA
Sbjct: 410 YLLLPCDSPRGPLISVLFHPDATTDDMLQSILHA 443


>gi|397563898|gb|EJK43986.1| hypothetical protein THAOC_37516 [Thalassiosira oceanica]
          Length = 542

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 112/412 (27%), Positives = 185/412 (44%), Gaps = 70/412 (16%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEE--LMSFIRSYVVPDGFPDSVTP 146
           D D RR  Y +E    GK+       + ++SP A  +   L SF+  ++ P  +P+SV  
Sbjct: 77  DDDSRRTLYRSEG--PGKW-------LKVESPPASRQSMSLASFVHHFL-PAKYPESVCS 126

Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMI 206
           SY  + ++    +  G A  V +TQ LL +VGV    A P A A+NW+LKD AG++G ++
Sbjct: 127 SYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAAALNWLLKDFAGQLGGVL 186

Query: 207 FARQ---------------------------GKKFDY-------DLKQLRFAGDLLMELG 232
           F  Q                            KK +Y       + K+ R    L ++L 
Sbjct: 187 FVSQLGIDHEFWKRKINRLFRENNQGNNTGRPKKGNYQRGTADTNPKRWRMVAALALDLS 246

Query: 233 AGVELATAAV-PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG--------D 283
             +E+ T  + P  FLP A  ANV KN+  + ++++R  I ++ + G +          D
Sbjct: 247 TLLEICTPLLGPQYFLPCASVANVGKNIGWLAASASRAAINQSLSAGGSYSSSSGSNLGD 306

Query: 284 VTAKGECVGNIADLLGTGLSIMISKR------NPSLITTFALLSCGYVFSSYQEVKSVVL 337
           VTAK      +A L+GT L I  SK          +++ F +LS  +  ++Y+ + +V L
Sbjct: 307 VTAKSGSQAIVASLVGTALGIFFSKTLYADHGTVGVMSGFIVLSAVHQVATYKALMAVPL 366

Query: 338 HTLNRARFSVAVESFLKTGHVPSLREGNLQENIF-------SFPWLRDKPVVLGSRFKDA 390
            +L+R R  + +  F +T  VPS  +   +E  F       S  WL     +L    K  
Sbjct: 367 RSLDRHRLHIVLSEFSRTSSVPSPSQVATKEKFFPLITPDHSCNWLFVGSPLLEICPKGI 426

Query: 391 FQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYV 442
            +     L  +   + E+Y++   P+  ++  L  D A   D+L+   HAY+
Sbjct: 427 SELQGLILWRDDAMQHEQYILKVGPASIQLTFL--DDANEVDLLRGQLHAYL 476


>gi|224013726|ref|XP_002296527.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968879|gb|EED87223.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 719

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 110/459 (23%), Positives = 182/459 (39%), Gaps = 122/459 (26%)

Query: 121 QAPAEELMSFIRS----------------YVVPDGFPDSVTPSYVPYMTWRALKHFFGGA 164
           Q+P   + SFI S                + +P  +P SV PSY  Y ++  L    G +
Sbjct: 213 QSPLSTIRSFISSCRTKDMINIQNLPLIHHFLPANYPQSVCPSYATYASYCFLGSIAGSS 272

Query: 165 MGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ----GKKFDY---- 216
             V +TQ LL +VGV    A P A A+NW++KDG G++G ++FA Q    G   DY    
Sbjct: 273 AMVLSTQALLVAVGVGTQSAAPMAAALNWVMKDGVGQLGGVVFASQLGKGGVDVDYWKVK 332

Query: 217 ----------------------------DLKQLRFAGDLLMELGAGVELATAAV-PHLFL 247
                                       + K+ R    L ++L   +E+ T  + P  FL
Sbjct: 333 LNKLMGSSSSKSTETSARGNFQRGTADSNPKRWRMVAALALDLSTLLEICTPWMGPQWFL 392

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKG----------------------------- 278
           P A  AN+ KNV  + ++++R  ++++   G                             
Sbjct: 393 PCASIANIGKNVGFLAASASRAAVHQSLCTGGSFLPPVPDNTTEKDESYDNPKKGKNVTS 452

Query: 279 --ENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS------LITTFALLSCGYVFSSYQ 330
              N+GDVTAK      +A LLGT L I +S+   S      ++  F +LS  +   +Y+
Sbjct: 453 SSNNLGDVTAKSGSQAIVASLLGTALGIWLSRTFCSDYGTAGILAGFVILSAVHQVCTYR 512

Query: 331 EVKSVVLHTLNRARFSVAVESFL----KTGHVPSLREGNLQENIFSFPWLRDKPVVLGSR 386
            +K+V L +++R R  + + +++    ++     L    + E     P +     V    
Sbjct: 513 ALKAVPLRSVDRHRLHIILNAYITADGRSVQSMVLTPSQVAEEESFLPMMAPDNSVHWLS 572

Query: 387 FKDAFQD--PDAYLAVEPFF-------EKERYVV---------TYNPSKGKVYALLKDQA 428
             D+  D  P     +E          + E+Y++         T     G V     D A
Sbjct: 573 VGDSLIDICPSGLDELEALLIRRDDISQYEKYIMKVHLADTNATTAAVDGIVQLTFLDGA 632

Query: 429 KSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
             +D+L+  FHAY+           R+L +    +D+SN
Sbjct: 633 TDNDLLQGMFHAYIA----------RALMISHQCHDDSN 661


>gi|293336742|ref|NP_001169416.1| hypothetical protein [Zea mays]
 gi|224029221|gb|ACN33686.1| unknown [Zea mays]
 gi|414876408|tpg|DAA53539.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
          Length = 481

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 170/376 (45%), Gaps = 30/376 (7%)

Query: 98  VAERDRSGKFKR-----NSIRAISLQSPQAP--------AEELM-SFIRSYVVPDGFPDS 143
           +AER R G  KR     +S   + L+  ++P        A+ L+   I  +V+P GFP S
Sbjct: 44  LAERYRDGVVKRYISDGSSKLQLRLEKHESPVNGVEDENADSLIPQAILDFVLPAGFPGS 103

Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA---INWILKDGAG 200
           V+  Y+ YM  +   +  G    V  T +LL +VGV     T  A +   I W+ KDG G
Sbjct: 104 VSADYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAASAAAIRWVSKDGIG 163

Query: 201 RVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNV 259
             G+++   R G  FD D K+ R   D +   G+  EL+T   P  FLPLA   N+AK +
Sbjct: 164 AFGRLLIGGRFGALFDDDPKKWRMYADFIGSAGSIFELSTTLYPGYFLPLASLGNLAKAI 223

Query: 260 AAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNP------SL 313
           A      +   I   FA+  N+G+V AK E     A LLG  + ++I  +        +L
Sbjct: 224 ARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVLIMDKTGVKSSYLTL 283

Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF 373
           + T+  +   +++  YQ +  +   T+N  R  + V+S +    VP     N +ENI ++
Sbjct: 284 VLTWLSVRLLHLWLRYQSISVLKFRTINLKRGRILVKSHVAQHTVPGYVACNEEENILTW 343

Query: 374 PWLRDKPVVLG---SRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL---KDQ 427
                  V  G    R     +  D    +   ++ E+Y++ +     +    L   K+ 
Sbjct: 344 ERYLRPQVSFGVPMERMLGGDKSSDMVNRLLKLYKNEKYILFFEQFGSREPTFLVTFKES 403

Query: 428 AKSDDILKAAFHAYVL 443
           A S  +L++ + A+ L
Sbjct: 404 ATSMSVLRSLWQAHWL 419


>gi|299115992|emb|CBN75993.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 479

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 89/329 (27%), Positives = 163/329 (49%), Gaps = 25/329 (7%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           V+P GFP SV P Y+ Y  W+        A GV +TQ LL ++G+    A P A  +NW+
Sbjct: 5   VLPSGFPASVHPPYMAYAGWQFTGMAASAAAGVMSTQALLYAMGLGAG-ALPLAATLNWV 63

Query: 195 LKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAA 253
           +KDG G++G + F+     +FD + K  R    + ++    +EL +   P  FLP+A  A
Sbjct: 64  IKDGLGQLGGVAFSSLVSTRFDANPKLWRVVAAVSLDASMVLELLSPLAPAYFLPIASVA 123

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS- 312
           N+ KNV+ + ++++R  I+   +   N+ DVTAK      +A + GTG  + +S    S 
Sbjct: 124 NIGKNVSFLAASASRAAIHNVLSLKGNLADVTAKSGAQTILACMAGTGAGVTLSASLGSE 183

Query: 313 ---LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL---KTGHVPSLREGNL 366
              ++     LS  ++ +++  ++ + + T +  R  + V       ++  +PS ++   
Sbjct: 184 FHAIVPACLCLSVVHLGANHMSMRRLGIPTFDHQRLELVVNGMADGDRSRGMPSPQDIAA 243

Query: 367 QENIFSFPWLRDK-------PVVLGSRFKDAFQD----PDAYLAVE--PFFEKERYVVTY 413
           QE++ ++  +R+K        VV+GSR K+A +      +     E  PF    R  + +
Sbjct: 244 QEDVVAYA-VREKIGLDLSPTVVVGSRLKEAVKGWGELEELCRVCEGLPFLLSVRGDLAH 302

Query: 414 NPSKGKVYALLKDQAKSDDILKAAFHAYV 442
               G+V+ LL +   + ++L  ++ A V
Sbjct: 303 R--TGEVHVLLLEGCGAREVLAGSYVALV 329


>gi|356566668|ref|XP_003551552.1| PREDICTED: uncharacterized protein LOC100787355 [Glycine max]
          Length = 1041

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 101/338 (29%), Positives = 151/338 (44%), Gaps = 30/338 (8%)

Query: 51  GGGGAAATLLQSES-DETTAFRLVED--QFVDSSLRLI--CSEDIDGRRWNYVAERDRSG 105
           GGG  ++ ++Q E     TA R + D  +   S++ L+  C    D +   ++ E D S 
Sbjct: 571 GGGQVSSRVIQVERYSNGTAKRFLLDYTKIWASTIPLVSRCVLGDDLQLQTFLVEEDTS- 629

Query: 106 KFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAM 165
                  R     SP      L   I+ +++P GFP SV+  Y+ YM  +   +  G   
Sbjct: 630 ----TPKRFQDSYSPDESLSWLPDTIKDFILPAGFPGSVSDDYLDYMLLQFPTNVTGWIC 685

Query: 166 GVFTTQTLLNSVGV---SRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQL 221
               T +LL +VG+   S   AT  A AI W+ KDG G VG++    R G  FD D KQ 
Sbjct: 686 HTLVTSSLLKAVGIGSFSGTSATASASAIRWVSKDGIGAVGRLCLGGRFGSLFDDDPKQW 745

Query: 222 RFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENI 281
           R   D +   G+   L T   P  FLPLA   N+ K VA      +   I   FA   N+
Sbjct: 746 RMYADFIGSAGSIFYLTTQVYPDYFLPLASLGNLTKAVARGLKDPSFCVIQNHFAISGNL 805

Query: 282 GDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGY-------VFSSYQEVKS 334
           G+V AK E    +A L+G  L I+I    PSL+ ++ +LS  +       ++  Y+ +  
Sbjct: 806 GEVAAKEEIWEVVAQLIGLALGILILD-TPSLVKSYGVLSLPWLGMQFLHLWLRYKSLSV 864

Query: 335 VVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS 372
           +  +T+N  R        +    VP     N +ENI +
Sbjct: 865 LQFNTINLKR--------VLHSTVPGCTYCNREENILT 894


>gi|339250912|ref|XP_003374441.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316969248|gb|EFV53376.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 340

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 60/224 (26%), Positives = 114/224 (50%), Gaps = 5/224 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+PDSVT  Y  Y  W +L+ F       F    +  +VGV    A+  A  + W+L
Sbjct: 56  LPQGYPDSVTEDYFDYQCWDSLQAFCSSVTNAFALHAVFKAVGVGNESASLLAATVVWLL 115

Query: 196 KDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           ++G G + +++FA       D D K  RF  D+L +    ++L        F+ +AC AN
Sbjct: 116 RNGVGMMSQIMFAWFCAGHLDSDCKSWRFVADVLNDCSIFLDLIAPLSESTFVYVACLAN 175

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI----MISKRN 310
           V +++  +   +TR  I +  ++  N+ DV AK      + +L+  G ++    +++   
Sbjct: 176 VCRSIVGIAGGATRAAITQHQSRRHNMADVAAKDSSQETMVNLIALGFNLALLPLVAHHV 235

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
             + T F + +  +++++Y+ ++++   TLN  RF + V+SF+K
Sbjct: 236 LLVWTLFLMFTFMHIYANYRAIRALKFETLNEKRFRMLVQSFIK 279


>gi|326428268|gb|EGD73838.1| hypothetical protein PTSG_05532 [Salpingoeca sp. ATCC 50818]
          Length = 376

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 83/313 (26%), Positives = 145/313 (46%), Gaps = 7/313 (2%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           + + V+P+G+P SV+  Y  Y  W   + F    MG  +T  +L  VGV   +AT  A  
Sbjct: 46  LSTLVLPEGYPASVSSDYASYQIWDTAQAFSSSVMGTISTYAVLKGVGVGDAQATVAAAT 105

Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
              +L+D  G +  + FA       D + KQ R   D++ ++   VEL +  +P  FLPL
Sbjct: 106 YTNLLRDATGMLAGIWFAWYSSTDMDRNAKQWRLVADVVNDMAMLVELVSPLIPAAFLPL 165

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK- 308
            C A+  + +  V   +TR  + +  A+  N+ DV+AK      + +L    +S+ I+  
Sbjct: 166 MCMASFLRAIVGVAGGATRAALVRHQARLNNMADVSAKDGSQETLVNLAALVVSLAITPL 225

Query: 309 RNPSLITTFAL---LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
              S +  +A+   L   +++ +Y+ V+++ +   N  R   A   F++T  VPS+   N
Sbjct: 226 AAESRVMAWAMFLPLVFMHLYCNYRAVRALEMPAFNAHRLHHACAVFMRTNRVPSVCAAN 285

Query: 366 LQENIF-SFPWLRDKP-VVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
             E +F   P +     V LGS +        A +  +     E + V    + G V+ L
Sbjct: 286 QAEPLFPRAPAIEGHAKVCLGSSWSHIKDGVGAGVQADVVRALETWRVAAVFTDGTVHVL 345

Query: 424 LKDQAKSDDILKA 436
           L    + +DI++A
Sbjct: 346 LSKHCRHEDIVRA 358


>gi|414879010|tpg|DAA56141.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
          Length = 368

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 60/158 (37%), Positives = 93/158 (58%), Gaps = 1/158 (0%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++PDG+P SV+  Y+ Y  WR ++       GV +TQ LL +VG+ +  A P A A+NW+
Sbjct: 197 LLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 255

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG G + K++ ++ G+ FD + K  R   D L  +  G+E+ T   PHLF+P+  AA 
Sbjct: 256 LKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPHLFVPIGAAAG 315

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVG 292
             ++ A++  ++TR+  Y  FA   N  +V AKGE  G
Sbjct: 316 AGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQG 353


>gi|186478458|ref|NP_001117280.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332190946|gb|AEE29067.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 353

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 84/309 (27%), Positives = 153/309 (49%), Gaps = 25/309 (8%)

Query: 167 VFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAG 225
           + +TQ LL+++GV    AT       W L+D  G +G ++F   QG   D + K  R   
Sbjct: 2   MLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVA 61

Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
           DL+ ++G  ++L +   P  F+ + C  +++++   V S +TR  + + FA  +N  D++
Sbjct: 62  DLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADIS 121

Query: 286 AKGECVGNIADLLGTGLSIMISK---RNPSLI-TTFALLSCGYVFSSYQEVKSVVLHTLN 341
           AK      +A ++G  L +++++    NP  I  +F  L+  +++++Y+ V+ +VL++LN
Sbjct: 122 AKEGSQETMATMMGMSLGMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLN 181

Query: 342 RARFSVAVESFLKTGHVPSLREGNLQENIFSFPW---LRD-------KPVVLGSRFK--- 388
             R S+ +  F++TG V S  + +  E +    W   LR        K V LG R     
Sbjct: 182 FERSSILLTHFIQTGQVLSPEQVSSMEGVLPL-WATSLRSTNSKPLHKRVQLGVRVSSLP 240

Query: 389 --DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHC 446
             D  Q  +   A    ++  +Y++ +   KG V  +L   +K  D+LK+  HA VL + 
Sbjct: 241 RLDMLQLLNGVGASS--YKNAKYLLAH--IKGNVSVILHKDSKPADVLKSYIHAIVLANL 296

Query: 447 INSSNGHRS 455
           +  S    S
Sbjct: 297 MEKSTSFYS 305


>gi|242056489|ref|XP_002457390.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
 gi|241929365|gb|EES02510.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
          Length = 624

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 109/376 (28%), Positives = 169/376 (44%), Gaps = 30/376 (7%)

Query: 98  VAERDRSGKFKR-----NSIRAISLQSPQAP--------AEELM-SFIRSYVVPDGFPDS 143
           +AER R G  KR     NS   + L+   +P        A+ L+   I  +V+P GFP S
Sbjct: 188 LAERYRDGVVKRYISDGNSKLQVHLEKHVSPVNAVEDENADSLIPQAIWDFVLPAGFPGS 247

Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA---INWILKDGAG 200
           V+  Y+ YM  +   +  G    V  T +LL +VGV     T  A +   I W+ KDG G
Sbjct: 248 VSVDYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAASAAAIRWVSKDGIG 307

Query: 201 RVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNV 259
             G+++   R G  FD D K+ R   D++   G+  EL T   P  FLPLA   N+AK V
Sbjct: 308 AFGRLLIGGRFGALFDDDPKKWRMYADIIGSAGSIFELMTPLYPGYFLPLASLGNLAKAV 367

Query: 260 AAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPSLITTF 317
           A      +   I   FA+  N+G+V AK E     A LLG   G+ IM +    S   T 
Sbjct: 368 ARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSAGVLIMDTAGVKSSYLTL 427

Query: 318 AL----LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF 373
           AL    +   +++  YQ +  +   T+N  R  + V S +    VP     N +ENI ++
Sbjct: 428 ALTWLSVRLLHLWLRYQSLSVLKFRTINLKRGRILVRSHVAQHTVPGYVACNEEENILTW 487

Query: 374 PWLRDKPVVLG---SRFKDAFQDPDAYLAVEPFFEKERYVVTYNP--SKGKVYAL-LKDQ 427
                  +  G    R     +  D    +   ++ E+Y++ +    S+   + +  K+ 
Sbjct: 488 ERFLRPQISFGVPMERMLGGDESSDMVDRLLKLYKNEKYILFFEQFGSRDPTFLVTFKES 547

Query: 428 AKSDDILKAAFHAYVL 443
           A S  +L++ + A+ L
Sbjct: 548 ATSMSVLRSLWQAHWL 563


>gi|147807901|emb|CAN62157.1| hypothetical protein VITISV_025492 [Vitis vinifera]
          Length = 582

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 95/336 (28%), Positives = 147/336 (43%), Gaps = 60/336 (17%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           F+R  ++P+GFP SVT  Y+ Y  WR ++       GV  TQ LL +VG+ +  A P A 
Sbjct: 216 FLR-LMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKG-AIPTAA 273

Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           A+NW+LKDG G + K++ ++ G+ FD   K  R   DLL     G+E+ T A PH FL +
Sbjct: 274 AVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLI 333

Query: 250 ACAANVAKNVAAVTSTSTRT---------------PIYKAFAKGENIGDVTAKGECVG-- 292
              A   ++ AA+      T               P+    A G +   +   GE  G  
Sbjct: 334 GAVAGAGRSAAALIQLLFHTHNHLSSFALLLGSYFPLNNP-AIGNSKRKMPTHGEGDGGE 392

Query: 293 NIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
           N+   LG G  ++        ++T ALL         Q   + V+      R  +    +
Sbjct: 393 NVTAFLG-GPRLLXGA-----VSTLALL--------LQRNFAEVIAKGEAPRNGLVFSEY 438

Query: 353 LKTGHVPSLREGNLQENIFS-FPWLRDKP-----------------------VVLGSRFK 388
           L +G VPS++E N +E +F   P L  KP                       + LGS+  
Sbjct: 439 LLSGQVPSIKEVNEEEPLFPVVPLLNAKPXYKAQSAVLSTEAKDAAAEIERRLQLGSKLS 498

Query: 389 DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
           +     +  LA+   +  E Y++T +  KG+ +A L
Sbjct: 499 EVVSSKEDVLALFDLYRNEAYILTEH--KGRFFASL 532


>gi|295659580|ref|XP_002790348.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226281800|gb|EEH37366.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 485

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 87/332 (26%), Positives = 155/332 (46%), Gaps = 37/332 (11%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  Y+ Y  + +L+ F     G+  ++ +L  VGV    A+P    ++ +L
Sbjct: 73  LPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 132

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAA 253
           ++  GR+  ++FA R G   + + K  R A D+L +    VE  + A P HL + L   +
Sbjct: 133 QESMGRIATILFAHRLGTSLEPECKLYRLAADVLNDSAMVVECLSPAFPKHLRVVLLTFS 192

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE-CVGNIADLLGTGLSIMISKRNPS 312
           ++ + +  V + S++  +   FAK  N+G++ A G   V +I+  L TGL+++       
Sbjct: 193 SILRALCGVAAGSSKASLSSHFAKWGNLGELNACGSLVVSHISTPLTTGLTLI------- 245

Query: 313 LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS 372
               F LL   ++ ++Y  V++V + TLNR R ++   +  + G   + ++ +  E IF 
Sbjct: 246 ----FLLLI--HLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTPKQTSKCERIFE 299

Query: 373 ----FPWLRDKPVVLGSRFKDAFQD-----------------PDAYLAVEPFFEKERYVV 411
                 W      +   R   AFQ+                 P     +   FEKE Y++
Sbjct: 300 RDGILRWKASSTTLGFCRIGIAFQELLRSSNVSCRANSIRDIPIDIPRLLRLFEKEEYIL 359

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
            +NP   K   +LK+  K    LKA  HA ++
Sbjct: 360 WFNPVSKKGTIVLKNNVKPVSQLKAWSHALLV 391


>gi|443714847|gb|ELU07084.1| hypothetical protein CAPTEDRAFT_141059, partial [Capitella teleta]
          Length = 266

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 68/249 (27%), Positives = 121/249 (48%), Gaps = 7/249 (2%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           ++   +P GFP SV+  YV Y  W  ++ F         TQ +L  VGV   +AT  A  
Sbjct: 2   LQDVFLPKGFPASVSDDYVQYQIWDTVQAFASSITNTLATQAVLKGVGVGDEQATVLAAT 61

Query: 191 INWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           + W++KDG G +G+++F   QG   D D K+ R   D+L ++   +++        F  +
Sbjct: 62  LTWLMKDGTGMLGRILFTWIQGSYLDCDCKRWRLFADILNDVSILMDICAPFFRVYFTLI 121

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTGLSIM 305
            C A V +++  V   +TR  + +  A   N+ DV+AK       V   A L    L  M
Sbjct: 122 VCVAGVCRSIVGVAGGATRAAVTQHQACRSNMADVSAKDGSQETLVNLAALLCNLALVPM 181

Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL--KTGHVPSLRE 363
           +S R   +   F   +  +++++++ V+S+ + T N+ R  +A + F+   TG+ P ++ 
Sbjct: 182 VSDRQWLVWILFTCFTTLHIYANFKAVRSLDMPTFNQVRLHLAAQEFISDNTGNFPGVKT 241

Query: 364 GNLQENIFS 372
            N +E +  
Sbjct: 242 VNQREPVLC 250


>gi|302753580|ref|XP_002960214.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
 gi|300171153|gb|EFJ37753.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
          Length = 381

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 96/341 (28%), Positives = 157/341 (46%), Gaps = 32/341 (9%)

Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA-----INWILKD 197
           SV+  Y+ YM W+   +  G       T +LL +VG+S     PG+ A     I W+ KD
Sbjct: 2   SVSQDYLQYMLWQLPTNITGWICRALITSSLLQAVGLSTE---PGSAAAASAAIKWVTKD 58

Query: 198 GAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA 256
           G G VG++ I  R G  FD D KQ R   +++   G   ELAT   P  FL LA   ++ 
Sbjct: 59  GLGAVGRLFIGGRFGGVFDEDPKQWRLYAEVIGSFGGLFELATPLAPDYFLLLASLGHLT 118

Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITT 316
           K VA      +   I   FA  EN+GDV AK E       L+G  L +++    P + T+
Sbjct: 119 KAVARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVVLLS-TPVVATS 177

Query: 317 FALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
           ++ L+  +       ++  YQ + ++ L T+N  R +V +   ++   +  + E N +EN
Sbjct: 178 YSQLALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHIQGRPLSGVGECNKKEN 237

Query: 370 IFSFPWLRDKPVV-LGSRFKDAFQDPDAYLAVEPFFE---KERYVVTY------NPSKGK 419
           +   P+    P + LG   ++        L +E  FE    E+YV+        + S   
Sbjct: 238 LI-LPYTMLMPQIRLGCSLREVITLGKTPLEIEEAFELYNSEKYVLIVLSQFLSSCSCNI 296

Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ 460
            + + +D  ++  +L++ + A+VL H    S    +  LKQ
Sbjct: 297 KFRVCQDGCENTTLLRSIYQAHVLFH----SKSFDTFTLKQ 333


>gi|384252100|gb|EIE25577.1| DUF647-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 286

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 69/271 (25%), Positives = 132/271 (48%), Gaps = 11/271 (4%)

Query: 85  ICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSV 144
           I  E+ DG R        RS    + +    SL    +    ++ +++   +PDG+PD+V
Sbjct: 5   IVGEEYDGNR------LVRSWVLDQKAGAVESLGVDASNESSVLLWLKKGFLPDGYPDAV 58

Query: 145 TPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGK 204
           T  YV +  W +++       G+  +  +L  +GV +   +P      + ++D AG  G 
Sbjct: 59  TSDYVGFQIWDSIQALCSYVRGMICSNAILTGIGVGKQATSPLVAVFQFFVRDLAGMCGG 118

Query: 205 MIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVT 263
           ++FA  QG + D   KQ R   D+   +G  ++LA+   P  FL LAC  ++A+ V  V 
Sbjct: 119 VLFAFTQGSQLDAYAKQWRLFADIFNNVGYALDLASPLFPDQFLLLACLGSLARAVTGVA 178

Query: 264 STSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI----SKRNPSLITTFAL 319
             +TR  + + F++ +N GD++AK         L+G  L +      S    ++   F L
Sbjct: 179 GGATRAALTQHFSRADNAGDISAKEGSQETATTLIGMILGMAFLQAASDSPVAVWAVFLL 238

Query: 320 LSCGYVFSSYQEVKSVVLHTLNRARFSVAVE 350
           L+  +++++ + V+++ L +LN +R  + ++
Sbjct: 239 LTFLHIYANVRAVRALQLTSLNCSRLDILLK 269


>gi|225440682|ref|XP_002274737.1| PREDICTED: UPF0420 protein-like [Vitis vinifera]
          Length = 503

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/332 (28%), Positives = 151/332 (45%), Gaps = 19/332 (5%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS-------RNR 183
           ++ +++P GFP SV+  Y+ YM  +   +          T +LL +VGV           
Sbjct: 114 VKDFILPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKAVGVGSFSATTAAAS 173

Query: 184 ATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           A   A AI W+ KDG G VG++    Q G  FD D KQ R   DL+   G+  +L+T   
Sbjct: 174 AAASAAAIRWVSKDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLY 233

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
           P  FL LA   N+AK VA      +   I   FA   N+G+V AK E     A LLG  L
Sbjct: 234 PAYFLQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLAL 293

Query: 303 SIM------ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
            IM      ++   P L  T+  +   +++  YQ +  +   ++N  R  + V+S +   
Sbjct: 294 GIMVLTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHS 353

Query: 357 HVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF--QDPDAYL-AVEPFFEKERY--VV 411
            VP   + N +E I S+       ++ G   ++    + P + +  +   + +E+Y  VV
Sbjct: 354 IVPGYVDCNRKEKILSWQRFLKPRIIFGVSLEELIGGERPVSKVRTLLKLYSQEKYILVV 413

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
               +  +V+   K  A S  +L++ + +Y L
Sbjct: 414 KQQGTDFEVFVSFKVGATSLSVLRSIWQSYWL 445


>gi|226292974|gb|EEH48394.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
           Pb18]
          Length = 495

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 88/346 (25%), Positives = 166/346 (47%), Gaps = 40/346 (11%)

Query: 125 EELMSFIRSYVV---PDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSR 181
           +E  S++ S ++   P G+P SV+  Y+ Y  + +L+ F     G+  ++ +L  VGV  
Sbjct: 59  KESPSYLTSLLIVFLPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGD 118

Query: 182 NRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATA 240
             A+P    ++ +L++  GR+  ++FA R G   + + K  RFA D+L +    V+  + 
Sbjct: 119 ASASPTVALLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLSP 178

Query: 241 AVP-HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE-CVGNIADLL 298
           A P HL + L   +++ + +  V + S++  +   FAK  N+G++ A G   V +I+  L
Sbjct: 179 AFPKHLRVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGSLVVSHISTPL 238

Query: 299 GTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
            TGL+++           F LL   ++ ++Y  V++V + TLNR R ++   +  + G  
Sbjct: 239 TTGLALI-----------FLLLI--HLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRA 285

Query: 359 PSLREGNLQENIFS----FPWLRDKPVVLGSRFKDAFQD--------------PDAYLAV 400
            + ++ +  E IF       W      +   R  ++FQ+               DA + +
Sbjct: 286 LTPKQTSKCERIFERDGILRWKASSTTLGFCRIGNSFQELLCGSNGSYRANSIRDAPIDI 345

Query: 401 E---PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                 FEKE Y++ +NP   +   +LK+  +    LKA  HA ++
Sbjct: 346 PRLLRLFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAWSHALLV 391


>gi|346467075|gb|AEO33382.1| hypothetical protein [Amblyomma maculatum]
          Length = 388

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 69/196 (35%), Positives = 107/196 (54%), Gaps = 5/196 (2%)

Query: 166 GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFA 224
           G    Q++L  VGV    A+  A    WIL+DGAG  G+++FA  QG   DYD K+ R  
Sbjct: 2   GTLAIQSVLTGVGVGDQGASVLAATTTWILRDGAGMTGRILFAWFQGSNLDYDSKKWRLF 61

Query: 225 GDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV 284
            D+L +    +EL          P+ C ++VAK+V  V   +TR  + +  A+  N+ DV
Sbjct: 62  ADILNDAAIFLELMCQYFKGYVTPILCISSVAKSVVGVAGGATRAALTQHQARSNNMADV 121

Query: 285 TAKG---ECVGNIADLLGTGLSIMISKRNPSLITT-FALLSCGYVFSSYQEVKSVVLHTL 340
           +AK    E + N+A  L + L + +   N  L++T F L +  ++F++Y+ V  VV+ T 
Sbjct: 122 SAKDGSQETLVNLAAFLFSLLLLQLVAGNTWLVSTIFILFTLLHIFANYRAVSCVVMETF 181

Query: 341 NRARFSVAVESFLKTG 356
           NR+R+SV V  FL+ G
Sbjct: 182 NRSRYSVVVRHFLQKG 197


>gi|414586420|tpg|DAA36991.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 306

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 79/278 (28%), Positives = 133/278 (47%), Gaps = 13/278 (4%)

Query: 203 GKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAV 262
           GK+I +  G + D + K  R   D+L +LG  +E+ +   P LFL +A   N AK +A V
Sbjct: 5   GKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVV 64

Query: 263 TSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI-----MISKRNPSLITTF 317
            + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I     + S     LI   
Sbjct: 65  AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG- 123

Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP--- 374
            LLS  +++   QE+++  ++TLN  R ++ V  F+K+G V S  E   +E++  FP   
Sbjct: 124 PLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLL-FPNRV 182

Query: 375 WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDIL 434
                 V +G   +  F  P     +   F  E+++++   +  + Y +L+  A  +D L
Sbjct: 183 IEEAGSVKIGQPLRKVF-SPRLVEQLRATFPGEKFLLSQKSN--RTYMVLERSASGEDAL 239

Query: 435 KAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTT 472
           +    A        S  G R  A+    Y+    +  T
Sbjct: 240 RGWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPT 277


>gi|159467675|ref|XP_001692017.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278744|gb|EDP04507.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 216

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 62/181 (34%), Positives = 93/181 (51%), Gaps = 3/181 (1%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           +R   +P G+P +VTP Y+ Y  W    H  G       T +LL++VGVS   A   A++
Sbjct: 1   LRELFLPPGYPQTVTPDYLSYQLWSVPTHITGHLSHALVTSSLLSAVGVSAGPAATVALS 60

Query: 191 --INWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
             I WI+KDG G +G++I  +R   +FD D ++ R   +LL   G G+E+AT   PH FL
Sbjct: 61  ASIKWIVKDGVGALGRLIVGSRFSAEFDEDPRRWRMVAELLSTCGMGLEVATVLYPHYFL 120

Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
            LAC+   A+ +           I   FA+ +N+G V AK E     A + G   S+ + 
Sbjct: 121 LLACSGKFAQALGKGMGKPVFRVIQTHFARAQNVGAVAAKEEVWEVTAQMAGLAASVAVL 180

Query: 308 K 308
           +
Sbjct: 181 R 181


>gi|355736030|gb|AES11869.1| hypothetical protein [Mustela putorius furo]
          Length = 218

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 55/165 (33%), Positives = 88/165 (53%), Gaps = 1/165 (0%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV    A+  A    W++
Sbjct: 53  LPQGFPDSVSPDYLPYQLWDSVQAFASSLAGSLATHAVLLGIGVGNAEASVSAATATWLV 112

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KD AG +G++IFA  +G K D + KQ R   D+L ++   +E+     P  F  + C +N
Sbjct: 113 KDSAGMLGRIIFAWWKGTKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMFFTMIVCTSN 172

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG 299
           +AK + +V   +TR  +    A+  N+ DV+AK      + +L G
Sbjct: 173 LAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAG 217


>gi|302768148|ref|XP_002967494.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
 gi|300165485|gb|EFJ32093.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
          Length = 412

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 100/354 (28%), Positives = 157/354 (44%), Gaps = 46/354 (12%)

Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA-----INWILKD 197
           SV+  Y+ YM W+   +  G       T +LL +VG+S     PG+ A     I W+ KD
Sbjct: 21  SVSQDYLQYMVWQLPTNITGWICRALITSSLLQAVGLSTE---PGSAAAASAAIKWVTKD 77

Query: 198 GAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA 256
           G G VG++ I  R G  FD D KQ R   +++   G   ELAT   P  FL LA   ++ 
Sbjct: 78  GLGAVGRLFIGGRFGGVFDQDPKQWRMYAEVIGSFGGLFELATPLAPDYFLLLASLGHLT 137

Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITT 316
           K VA      +   I   FA  EN+GDV AK E       L+G  L + +    P + T+
Sbjct: 138 KAVARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVALLS-TPVVATS 196

Query: 317 FALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
           ++ L+  +       ++  YQ + ++ L T+N  R +V +   ++   +  + E N +EN
Sbjct: 197 YSQLALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHIQGRPLSGVSECNKKEN 256

Query: 370 IFSFPWLRDKPVV-LGSRFKDAFQDPDAYLAVEPFFE---KERYVVTYNPSKGKVYALLK 425
           +   P+    P + LG   ++        L +E  FE    E+YV+    SK  +  LLK
Sbjct: 257 LI-LPYTMLMPQIRLGCSLREVITLGKTPLEIEEAFELYNSEKYVLIVF-SKSNMGILLK 314

Query: 426 -------------------DQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ 460
                              D  ++  +L++ + A+VL H    S    +  LKQ
Sbjct: 315 VLSQFLSSCSCNIKCRVCQDGCENTTLLRSIYQAHVLFH----SKSFDTFTLKQ 364


>gi|302849057|ref|XP_002956059.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
           nagariensis]
 gi|300258564|gb|EFJ42799.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
           nagariensis]
          Length = 208

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 95/188 (50%), Gaps = 3/188 (1%)

Query: 122 APAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSR 181
           +P   +   IR   +P GFPDSVTP Y+PY  W    H  G       T +LL +VGVS 
Sbjct: 12  SPLSSIPDSIRDLFLPPGFPDSVTPDYLPYQLWSLPTHVTGHLSHSLATSSLLTAVGVST 71

Query: 182 NRATPGAV--AINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
           + A   A+  +I WI+KDG G +G+ I  +R   +FD D ++ R   +LL   G G+E+A
Sbjct: 72  SPAATVALSASIKWIIKDGVGALGRFIVGSRFSAEFDEDPRRWRLVAELLSTAGLGLEVA 131

Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
           T+  P  F+ LA A    + +           I   FA+ +N+G V AK E     A ++
Sbjct: 132 TSLYPQSFMLLASAGKFGQALGKGMGKPVFRVIQTHFARSQNVGAVAAKEEVWEVAAQMV 191

Query: 299 GTGLSIMI 306
           G   S+ +
Sbjct: 192 GLMASVAV 199


>gi|307106356|gb|EFN54602.1| hypothetical protein CHLNCDRAFT_135084 [Chlorella variabilis]
          Length = 486

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 64/227 (28%), Positives = 114/227 (50%), Gaps = 19/227 (8%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P+GFP SV+  Y+PY  W +++       G+ ++Q +L  VG                L
Sbjct: 70  LPEGFPHSVSSDYLPYQLWDSIQGLSSYLRGMLSSQAMLAGVGF--------------FL 115

Query: 196 KDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KD  G +G ++FA  QG   D   KQ R   D++ ++G  +ELA+  +P  FL LAC  +
Sbjct: 116 KDFTGMLGGILFASTQGSGLDCYAKQWRLLADVMNDIGLALELASPLLPGAFLLLACLGS 175

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR----N 310
           +A+ V  V  ++TR  + + FA   N  D+ AK         L+G  L +   +      
Sbjct: 176 LARAVTGVAGSATRMALTQHFALQRNAADIAAKEGSQETAVTLVGMVLGMAFIRAAHGLE 235

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
           P + + F  L+  +V+++ + ++ + + +LN+AR  + +  +L+TG 
Sbjct: 236 PLIWSAFWALTALHVWANVRAMRCLRIASLNQARLGLLLRHYLRTGE 282


>gi|432112420|gb|ELK35212.1| hypothetical protein MDA_GLEAN10003470, partial [Myotis davidii]
          Length = 638

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 141/283 (49%), Gaps = 7/283 (2%)

Query: 166 GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFA 224
           G   T  +L  +GV    A+  A    W+LKD  G +G++IFA  +G K D + KQ R  
Sbjct: 278 GSLATHAVLLGIGVGNASASVSAATATWLLKDSTGMLGRIIFAWWKGSKLDCNAKQWRLF 337

Query: 225 GDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV 284
            D+L +    +E+    +P  F    C +N+AK + +V   +TR  +    A+  N+ DV
Sbjct: 338 ADILNDTAMFLEIMAPLLPICFTMTICISNLAKCIVSVAGGATRAALTMHQARRNNMADV 397

Query: 285 TAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTL 340
           +AK    E + N+A LL + L + +    PS  +  F LL+  +V+++Y+ V+++V+ TL
Sbjct: 398 SAKDGSQETLVNLAGLLVSLLMLPLVSDCPSFSLGCFFLLTALHVYANYRAVRALVIETL 457

Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAV 400
           N  R  + ++ FL+ G V      N  E +++  W     + LG           A L  
Sbjct: 458 NEGRLWLVLKHFLQRGEVLDPTSANQMEPLWTGFW-PSLSLSLGVPLHHLVSS-VAELQQ 515

Query: 401 EPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                +E Y++ ++ S+ +V  +L  +A  + +L+AA H  VL
Sbjct: 516 LAEGHQEPYLLRWDQSQNQVQVVLSQEAGPETVLRAATHGLVL 558


>gi|302755442|ref|XP_002961145.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
 gi|300172084|gb|EFJ38684.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
          Length = 343

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 68/263 (25%), Positives = 121/263 (46%), Gaps = 33/263 (12%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P+G+P SV+  Y+ + TW  L+        + +TQ LL+ +GV    AT       W L
Sbjct: 55  LPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWFL 114

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G VG ++F   QG   D   KQ R A D +                          
Sbjct: 115 RDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFM-------------------------- 148

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI 314
              ++A V   +TR  + + FA+ +N  DV+AK       A L+G  + + +++   + I
Sbjct: 149 --NDIAGVAGGATRAALTQHFARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANNI 206

Query: 315 ----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
               T+F  L+  +++++Y+ V ++ L ++N  R ++ ++SFLK G  PS  E + QE+ 
Sbjct: 207 FLMWTSFMSLTAFHMYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQESA 266

Query: 371 FSFPWLRDKPVVLGSRFKDAFQD 393
              P+     ++ G++      D
Sbjct: 267 LPRPFTSRSGIIFGAKISSLCGD 289


>gi|159490370|ref|XP_001703152.1| hypothetical protein CHLREDRAFT_195109 [Chlamydomonas reinhardtii]
 gi|158270782|gb|EDO96617.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 280

 Score =  105 bits (262), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 63/82 (76%)

Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMI 206
           SY PYM WR +++FFGGA+ VFTTQ+LL ++GV+   +   A AINW++KDGAGR+G+++
Sbjct: 10  SYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRYSGEAAAAINWVIKDGAGRLGRLL 69

Query: 207 FARQGKKFDYDLKQLRFAGDLL 228
           FAR G++ D +LKQ R  G L+
Sbjct: 70  FARWGRELDCELKQFRLMGYLV 91



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 46/93 (49%), Gaps = 4/93 (4%)

Query: 323 GYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR--DKP 380
           GY+ +S +EV SV L  +NRAR + A   +L  G VP + E N  E +   PW R     
Sbjct: 88  GYLVASRKEVDSVELPYMNRARLAYAARQYLTGGQVPGVAEANHHEPL--LPWGRYNQGR 145

Query: 381 VVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTY 413
           +VLG+  + A   P   L     F +  Y+ ++
Sbjct: 146 LVLGASVEAACAAPADLLTAAAAFREGPYLASF 178


>gi|224090990|ref|XP_002309136.1| predicted protein [Populus trichocarpa]
 gi|222855112|gb|EEE92659.1| predicted protein [Populus trichocarpa]
          Length = 500

 Score =  105 bits (262), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 93/353 (26%), Positives = 151/353 (42%), Gaps = 26/353 (7%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   ++ +++P GFP SV+  Y+ YM  +   +  G       T +LL +VG      T 
Sbjct: 98  LPDILKDFILPAGFPGSVSDDYLQYMVLQFPTNITGWICHTLVTSSLLKAVGAGSFTGTD 157

Query: 187 GAVA---INWILKDGAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
            A +   I W+ KDG G +G++ I  R G  FD D KQ R   D +   G+  +L T   
Sbjct: 158 AAASAAAIRWVSKDGIGALGRLFIGGRFGDLFDDDPKQWRMYADFIGSAGSIFDLTTQVY 217

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
           P  FLPLA   N+ K VA      +   I   FA   N+G+V AK E V  +   L    
Sbjct: 218 PAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEVAAKEE-VWEVGAQLLGLA 276

Query: 303 SIMISKRNPSLITTFALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
             ++    P L+ +F LL+  +       ++  YQ +  +   T+N  R  + ++S +  
Sbjct: 277 LGILILDTPDLVKSFPLLTSTWMSMRLLHLWLRYQSLSVLRFDTINLKRARILIKSHILQ 336

Query: 356 GHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVE-----------PFF 404
             V    + N  ENI S+       ++ G   ++      +   ++             +
Sbjct: 337 SRVLGCVDCNQDENILSWETFTRPRIIFGVPLEEMIGGERSVAKIKHGVVIKIKMLLKLY 396

Query: 405 EKERYVVTYNPSKGKVYALL--KDQAKSDDILKAAFHAYVLLHCINSSNGHRS 455
            KE++ +  N  K     L+  K  A S  +L++ +  Y  LH    S G+RS
Sbjct: 397 AKEKHFLVVNQQKTDFEVLVSFKVGATSMSVLRSVWQTY-WLHENWKSFGNRS 448


>gi|108706828|gb|ABF94623.1| expressed protein [Oryza sativa Japonica Group]
          Length = 497

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 85/352 (24%), Positives = 144/352 (40%), Gaps = 85/352 (24%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W  L+        + +TQ LL ++GV    AT       W L
Sbjct: 148 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 207

Query: 196 KDGAGRVGKMIFA------------RQ-----------------GKKFDYDLKQLRFAGD 226
           +D  G +G ++F             RQ                 G   D + K  R   D
Sbjct: 208 RDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 267

Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
            + +LG  ++L +   P   + + C  +++++   V S +TR  + + FA   N  D++A
Sbjct: 268 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 327

Query: 287 K---GECVGNIADLLGTGLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
           K    E +  +  +    L   +++ +   +  +F  L+  +++++Y+ V+S+ L TLN 
Sbjct: 328 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 387

Query: 343 ARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEP 402
            R S+ ++ F+  G          +EN F    L DK                       
Sbjct: 388 ERSSILLQYFMDNG----------EENYF----LLDK----------------------- 410

Query: 403 FFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCIN-SSNGH 453
                         +G V+  +  QA + DIL +  H  VL H +  S +GH
Sbjct: 411 --------------EGSVHIFIHKQAAATDILMSFIHGLVLAHLMQKSKSGH 448


>gi|323455525|gb|EGB11393.1| hypothetical protein AURANDRAFT_11851, partial [Aureococcus
           anophagefferens]
          Length = 248

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 80/245 (32%), Positives = 127/245 (51%), Gaps = 9/245 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFP +VTP Y  Y   +   +    A GV +T+ LL +VGV    A P A A NW +
Sbjct: 2   LPAGFPTTVTPGYGTYAALQFAAYACSSACGVLSTRVLLAAVGVGDGAAAPLAAAANWAI 61

Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELAT-AAVPHLFLPLACAA 253
           KDG G +G + FA       D   +Q R      ++L + VELA   A P  F+P+A  A
Sbjct: 62  KDGLGMLGGVAFAATWSNSLDARPRQWRLRSSAALDLASLVELAALPAFPQYFVPIAGLA 121

Query: 254 NVAKNVAAVTSTSTRTPIYKAF---AKGENIGDVTAKGECVGNIADLLGTGLSIMISK-- 308
           NVAKNV+ + ++++R  I+++    +   N+ D+TAK      +A L G  L +  S   
Sbjct: 122 NVAKNVSYLAASASRAAIHQSLSARSDASNLADLTAKTGSQTIVASLCGLALGVAASNAC 181

Query: 309 --RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
                 +   +A LS  ++  +Y  +K V   TL+ AR +  V++F +TG  P+  + + 
Sbjct: 182 GAEASDVWPAWAALSACHLGCTYASLKYVNTTTLDDARLAALVDAFRETGACPTPADVSE 241

Query: 367 QENIF 371
           +E++F
Sbjct: 242 RESLF 246


>gi|328773288|gb|EGF83325.1| hypothetical protein BATDEDRAFT_85860 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 434

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 88/336 (26%), Positives = 158/336 (47%), Gaps = 31/336 (9%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSR--NRATPG 187
           + R+  +P G+P SV   Y     W   ++  G A+ V T Q +L S+G S   +     
Sbjct: 35  WFRNMFLPVGYPSSVHACYKKVHMWLFAENVAGSAISVLTAQAMLTSIGASTAAHETAAL 94

Query: 188 AVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHL 245
           A+A++W+LKDG G +GKM+   R   +FD   K+ +  G+     GA ++LAT    PH 
Sbjct: 95  AIAVDWVLKDGIGELGKMLMIQRFAHQFDTHPKRWKLYGEACSISGALMQLATCITNPHH 154

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFA--KGENIGDVTAKGECVGNIADLLG--TG 301
           FL  A      +++      +T T   +  A   G N+GD+ AK +   ++A LLG  +G
Sbjct: 155 FLFFASIGVGLRSMHYSIWAATHTTFTRNMATHNGVNVGDIVAKADSQLSLAHLLGMVSG 214

Query: 302 LSIM-ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH--- 357
           + ++ +S +  +L   F++LS   +  +   ++      L++ R  +    F+   +   
Sbjct: 215 IGMLAVSFQPAALFAWFSVLSAIQIVCTGLLLREARFEVLDQYRLVLIPREFMSEKNALR 274

Query: 358 VPSLREGNLQENIFSFPWLRD--------KPVVLGSRFKDAFQDPDAYLAVEPFFEKERY 409
           +P+L +    EN     WL +          + +G    DAF+      A++  F+  +Y
Sbjct: 275 IPTLDQVVRFEN-----WLNEGLSKGQVVTDLSIGVTVDDAFKHSRLSHALQ-IFKNGKY 328

Query: 410 VVTY-----NPSKGKVYALLKDQAKSDDILKAAFHA 440
           ++ +       S+ KVY +L    +   +++AAFHA
Sbjct: 329 LLGFKLADSTHSQIKVYVVLHQSIQPVHVIQAAFHA 364


>gi|400593658|gb|EJP61584.1| DUF647 domain protein [Beauveria bassiana ARSEF 2860]
          Length = 467

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 85/305 (27%), Positives = 150/305 (49%), Gaps = 29/305 (9%)

Query: 90  IDGRRWNYVAERDRSGKFKRNSI-----RAISLQSPQAPAEELMS--------FIRSYVV 136
           I G+    V+E D+SG  ++  +     R I ++ PQ    +L S         +    +
Sbjct: 18  ISGKNDVEVSELDKSGHVQKRWLHEPSGRIIHIR-PQGHTNDLFSRSLREWKKILSDSFL 76

Query: 137 PDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILK 196
           P GFP SV+  Y+ Y T+ +L+ FF     +   + LL  +GV    ++     +  IL+
Sbjct: 77  PVGFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFALLLTILR 136

Query: 197 DGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA----- 250
           D   RV  ++FA R G + + D K+ RF  DL  +    +EL +   P+L  PLA     
Sbjct: 137 DATSRVATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYS---PYLS-PLAKVVVL 192

Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMI 306
                 + +  V++ +++  +   FAK +N+ ++ AK       VG I  L+GT + + +
Sbjct: 193 SVGEALRALCGVSAGASKAALSMHFAKHDNLAELNAKEASQETAVGLIGLLVGT-IVVKV 251

Query: 307 SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
            + + S+++   LL  G+++ +Y  V+SV + TLNR R ++  + +L+TG V S  E   
Sbjct: 252 VEDHRSVVSLMILLVFGHLWMNYLGVRSVQMATLNRQRATILFQEYLRTGKVLSPAEVAK 311

Query: 367 QENIF 371
           +E I 
Sbjct: 312 REYII 316


>gi|255074355|ref|XP_002500852.1| predicted protein [Micromonas sp. RCC299]
 gi|226516115|gb|ACO62110.1| predicted protein [Micromonas sp. RCC299]
          Length = 561

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 145/344 (42%), Gaps = 59/344 (17%)

Query: 93  RRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMS--------------------FIR 132
           RRW     R  +G F R        Q P+ P EEL                       +R
Sbjct: 76  RRWG----RTPTGGFDR--------QPPRRPEEELAGQGEATSGRSGFARWDPRLSPRLR 123

Query: 133 SYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV--- 189
           +  +P G+PD+V+P Y P++ W      F   + V T Q+LL ++G+    +TPGA+   
Sbjct: 124 AVFLPAGYPDTVSPDYAPFIRWHLGSLMFRNVLEVITAQSLLVALGMG---STPGALPLT 180

Query: 190 -AINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
            A  W+LKDG G +  +      G+K D D K+     +   ++   +EL T A P LFL
Sbjct: 181 AATKWVLKDGIGSLATLAAGSLGGQKCDEDPKRWWMVSNAFEDVARVIELVTPAAPALFL 240

Query: 248 PLACAANVAKNVAAVTSTS-TRTPIYKAFAKGENIGDVTAKGECVGN----IADLLGTGL 302
           PLA  A   +  A     S     + + FA+ EN  DV A+ E  G     +A  +G G+
Sbjct: 241 PLAATATFVRTAALTGRGSLVNGSLMQHFARNENTADVRARLEVQGRWLALVALPVGIGI 300

Query: 303 SIMISK----------RNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
              +S+          R+ ++ +  +  + C ++   +   K +   TLNRAR      +
Sbjct: 301 FRTVSQSFASDPEEPARDVAIAVALYGGVVCAHLVCIWNAAKVLRFETLNRARLVRQARA 360

Query: 352 FLKTGHVPSLREGNL--QENIFSFPWLRDKPVVLGSRFKDAFQD 393
           F   G            +E +F   +    P VLG+   DA +D
Sbjct: 361 FRDGGAAAIADYNAAGDEEGVFRARFSPGDP-VLGASLPDAARD 403


>gi|401421841|ref|XP_003875409.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322491646|emb|CBZ26919.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 675

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 65/228 (28%), Positives = 114/228 (50%), Gaps = 13/228 (5%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +PDG+P SVT   +P+  W  ++          +T+ +L  VGV  ++A   +  ++W++
Sbjct: 73  LPDGYPSSVTEDLLPFAFWAHVQVLAKNVTATLSTRAVLLGVGVGESKADLTSSTLSWMM 132

Query: 196 KDGAGRVGKMIFAR---QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP---HLFLPL 249
           +DGA  +G ++FA    QG   +   K  +   D + +    +EL +  +P    +F  +
Sbjct: 133 QDGARMIGSVVFASVIPQG--LECRAKTWQLVTDFIHDSAHLLELCSPWLPGGQAIFRVV 190

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR 309
             AA+V K +A+V    TR P  + FA   N  D+ AK    G++  L+G  L I+I+  
Sbjct: 191 LVAASVIKALASVCGNGTRAPFTQHFALRNNGADIAAKAATRGHVGGLIGLALGIVIAYI 250

Query: 310 NPSL-----ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
            P+      +  FAL S  ++F++Y+ V+ V L  LN  R    +E +
Sbjct: 251 VPATSRRLNLAVFALCSAVHIFANYRGVRGVQLRHLNAPRLEWCLEQY 298


>gi|298706870|emb|CBJ25834.1| Unknown protein [Ectocarpus siliculosus]
          Length = 673

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 69/232 (29%), Positives = 112/232 (48%), Gaps = 15/232 (6%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV P YV Y  W  L+       GV  TQ +L  +GV R  +TP A  + W+ 
Sbjct: 205 LPAGYPGSVRPEYVRYRCWDILQDLSSHLRGVLATQAVLEGMGVGRAGSTPLAATLQWMA 264

Query: 196 KDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL-----FLPL 249
           +DGA  +G ++F       F  ++K  R   D   ++G  +E+  A +P       FL L
Sbjct: 265 RDGASMIGGLLFTSVASANFGVNIKTWRLFADTSNDIGLALEM-LAPLPAFREGGRFLRL 323

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK- 308
            C A+V K    V   +T   I + +A   NI D+ AK      +A LLG GLS+  ++ 
Sbjct: 324 ICIASVFKAACGVAGGATGAAITEHWAIDNNIADIGAKNGAQHTVASLLGLGLSVWFARA 383

Query: 309 -------RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
                      +   +A+L+  ++ ++Y  ++++ L +LN  R S+ V  +L
Sbjct: 384 VSVGGSASGSRVWGWYAVLTLVHLLANYAAMRTLALRSLNPTRASLLVTEYL 435


>gi|444725757|gb|ELW66311.1| hypothetical protein TREES_T100000484 [Tupaia chinensis]
          Length = 406

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 83/304 (27%), Positives = 149/304 (49%), Gaps = 27/304 (8%)

Query: 151 YMTWRALKHF--FGGAM-GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIF 207
           +  WR  + F  F  ++ G   TQ +L  +GV   RA+  A    W++KD  G +G+++F
Sbjct: 39  WRWWRFTRPFAAFASSLSGSLATQAVLLGIGVGNARASVSAATATWLVKDSTGMLGRIVF 98

Query: 208 ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
           A     F           D+L ++   +E+     P  F    C +N+AK V +V   +T
Sbjct: 99  AWWKGLF----------ADILNDVAMFLEIMAPIYPIFFTMTICTSNLAKCVVSVAGGAT 148

Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
           R  +    A+  N+ DV AK    E + N+A LL + L + +    PS+ +  F LL+  
Sbjct: 149 RAALTMHQARRNNMADVAAKDGSQETLVNLAGLLVSLLMLPLVSDFPSVSLGCFFLLTAL 208

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS--FPWLRDKPV 381
           +++++Y+ V+++V+ TLN  R  + ++ FL+TG+V      N  E +++   P L     
Sbjct: 209 HIYANYRAVRALVMETLNEGRLRLVLKHFLRTGNVLDPTSANQMEPLWTGFCPSLSLSLG 268

Query: 382 VLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFH 439
           V   R   +  D      ++   E  +E Y++ ++ S+ +V  +L  +A  + +L+AA H
Sbjct: 269 VPLHRLISSVSD------LQQLVEGHQEPYLLCWDQSRNQVQVVLSQKAGPETVLRAATH 322

Query: 440 AYVL 443
             VL
Sbjct: 323 GLVL 326


>gi|336273098|ref|XP_003351304.1| hypothetical protein SMAC_03608 [Sordaria macrospora k-hell]
 gi|380092824|emb|CCC09577.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 498

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 74/275 (26%), Positives = 135/275 (49%), Gaps = 11/275 (4%)

Query: 105 GKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGA 164
           G+ +  ++RA   ++P+     L + + +++ P G+P SVTP Y+PY T+ +L+ FF   
Sbjct: 21  GESQDTTLRASRTKNPR-----LQALLHAFL-PAGYPHSVTPDYLPYQTYDSLQAFFSSI 74

Query: 165 MGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRF 223
             +   + +L  +GV  + ++P    I  I  D   R+  ++FA R G   + + K  RF
Sbjct: 75  TSLLANRAVLEGLGVGDSSSSPTGALILKIAGDTISRIATILFAHRMGTAIEPECKFYRF 134

Query: 224 AGDLLMELGAGVELATAAVPHL-FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
             DLL +    ++L T A+P+   L +  +A V +++  V + +++  +   FA   N+ 
Sbjct: 135 LADLLNDSAQLLDLLTPALPYFPKLGVIVSAGVLRSLCGVAANASKASLSAHFALRGNLA 194

Query: 283 DVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFAL---LSCGYVFSSYQEVKSVVLHT 339
           ++ AK      +  LLG  +  ++ K        +AL   L   ++  +Y  V++V   T
Sbjct: 195 ELNAKEASQETVVSLLGILVGSLVVKVVQDRGVVWALMVGLLGVHLLMNYWAVRAVRCRT 254

Query: 340 LNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP 374
           LNR R +V V  +L+ G V    E    E+I   P
Sbjct: 255 LNRQRATVVVRGWLEWGRVLGPEEVARMESILQLP 289


>gi|125568971|gb|EAZ10486.1| hypothetical protein OsJ_00318 [Oryza sativa Japonica Group]
          Length = 459

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 76/232 (32%), Positives = 109/232 (46%), Gaps = 20/232 (8%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT---PG 187
           I+ +V+P GFP SV+  Y+ YM W+   +  G       T TLL +VGV     T     
Sbjct: 120 IKDFVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAAS 179

Query: 188 AVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A AI W+ KDG G  G+++   R G  FD D K+ R   D +   G+  EL T   P  F
Sbjct: 180 AAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYF 239

Query: 247 LPLACAANVAKNV------AAVTSTSTRTPIYKA----FAKGENIGDVTAKGECVGNIAD 296
           LPLA   N+AK +              R P ++     FA+  N+G+V AK E     A 
Sbjct: 240 LPLASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQ 299

Query: 297 LLG--TGLSIM----ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
           LLG   G+ IM    I    P+L  T+  +   +++  YQ +  +   T++R
Sbjct: 300 LLGLSIGVFIMDTSGIKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTVDR 351


>gi|327350378|gb|EGE79235.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 497

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 149/332 (44%), Gaps = 24/332 (7%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  Y+ Y  + +L+ F     G+  ++ +L  VGV    A+P    ++ +L
Sbjct: 74  LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 133

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAA 253
            D  GR+  ++FA R G   + + K  R A D+L +     +  + A P HL + +   +
Sbjct: 134 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 193

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL 313
           +V + +  V + S++  +   FA+  N+G++ AK      +  LLG     ++  R  + 
Sbjct: 194 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRVSTP 253

Query: 314 ITTFALLSCGYVF---SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
             T+A L    +    ++Y  V+SV + TLNR R ++   +  + G V + ++ +  E I
Sbjct: 254 FATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCERI 313

Query: 371 FS----FPWLRDKPVVLGSRFKDAFQD------------PDAYLAVE---PFFEKERYVV 411
           F       W      +   R   +FQ+             D  + +      F +E Y++
Sbjct: 314 FERDGVLRWKASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRRLLHLFREEEYIL 373

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
            +N S      +LK+       LKA  HA ++
Sbjct: 374 WFNSSDKGGTIVLKNSVTPLSQLKAWSHALMV 405


>gi|261206350|ref|XP_002627912.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239592971|gb|EEQ75552.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 497

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 149/332 (44%), Gaps = 24/332 (7%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  Y+ Y  + +L+ F     G+  ++ +L  VGV    A+P    ++ +L
Sbjct: 74  LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 133

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAA 253
            D  GR+  ++FA R G   + + K  R A D+L +     +  + A P HL + +   +
Sbjct: 134 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 193

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL 313
           +V + +  V + S++  +   FA+  N+G++ AK      +  LLG     ++  R  + 
Sbjct: 194 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRVSTP 253

Query: 314 ITTFALLSCGYVF---SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
             T+A L    +    ++Y  V+SV + TLNR R ++   +  + G V + ++ +  E I
Sbjct: 254 FATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCERI 313

Query: 371 FS----FPWLRDKPVVLGSRFKDAFQD------------PDAYLAVE---PFFEKERYVV 411
           F       W      +   R   +FQ+             D  + +      F +E Y++
Sbjct: 314 FERDGVLRWKASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRRLLHLFREEEYIL 373

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
            +N S      +LK+       LKA  HA ++
Sbjct: 374 WFNSSDKGGTIVLKNSVTPLSQLKAWSHALMV 405


>gi|239610850|gb|EEQ87837.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 498

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 149/332 (44%), Gaps = 24/332 (7%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  Y+ Y  + +L+ F     G+  ++ +L  VGV    A+P    ++ +L
Sbjct: 75  LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 134

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAA 253
            D  GR+  ++FA R G   + + K  R A D+L +     +  + A P HL + +   +
Sbjct: 135 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 194

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL 313
           +V + +  V + S++  +   FA+  N+G++ AK      +  LLG     ++  R  + 
Sbjct: 195 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRVSTP 254

Query: 314 ITTFALLSCGYVF---SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
             T+A L    +    ++Y  V+SV + TLNR R ++   +  + G V + ++ +  E I
Sbjct: 255 FATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCERI 314

Query: 371 FS----FPWLRDKPVVLGSRFKDAFQD------------PDAYLAVE---PFFEKERYVV 411
           F       W      +   R   +FQ+             D  + +      F +E Y++
Sbjct: 315 FERDGVLRWKASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRRLLHLFREEEYIL 374

Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
            +N S      +LK+       LKA  HA ++
Sbjct: 375 WFNSSDKGGTIVLKNSVTPLSQLKAWSHALMV 406


>gi|255574377|ref|XP_002528102.1| conserved hypothetical protein [Ricinus communis]
 gi|223532491|gb|EEF34281.1| conserved hypothetical protein [Ricinus communis]
          Length = 485

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 94/338 (27%), Positives = 145/338 (42%), Gaps = 19/338 (5%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGV-----SRNRAT 185
           I+ +++P GFP SV+  Y  YM  +   +  G       T +LL +VGV     S   A 
Sbjct: 98  IKDFILPAGFPGSVSDDYFQYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFTGSTAAAA 157

Query: 186 PGAVAINWILKDGAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
             A AI W+ KDG G +G++ I  R G  FD D KQ R   D +   G+  +L T   P 
Sbjct: 158 ASAAAIRWVSKDGIGALGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLITQVYPA 217

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
            FLPLA   N+ K VA      +   I   FA   N+G+V AK E V  +   L      
Sbjct: 218 YFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEE-VWEVGAQLLGLALG 276

Query: 305 MISKRNPSLITTFALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
           ++    P L+ +F  L+  +       ++  Y  +  +   T+N  R  + V+S +    
Sbjct: 277 ILILDTPGLVKSFPALASTWLSMRLLHLWLRYLSLSVLQFDTINLKRARLLVKSHILHSV 336

Query: 358 VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVE---PFFEKERYVVTYN 414
           V    + N +ENI  +       +  G   ++      +   V+     + KE Y++  N
Sbjct: 337 VLGCVDCNREENILLWETFTKPQITFGVPLQEMVGGERSAAQVKTLLKLYTKENYLLVVN 396

Query: 415 PSKG--KVYALLKDQAKSDDILKAAFHAYVLLHCINSS 450
             KG  +V+   K  A S   L++ +  Y L     SS
Sbjct: 397 QQKGDFEVFVSFKVGATSMSALRSVWQTYWLHENWKSS 434


>gi|308804794|ref|XP_003079709.1| unnamed protein product [Ostreococcus tauri]
 gi|116058166|emb|CAL53355.1| unnamed protein product [Ostreococcus tauri]
          Length = 499

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 115/452 (25%), Positives = 193/452 (42%), Gaps = 66/452 (14%)

Query: 62  SESDETTAFRLVEDQFVDSSLRLICSE---DIDGRRWNYVAERDR-SGKFKRNSIRAISL 117
           S  DE    RL  +    S+  L C+E   D  G  W     RD   G F R   R I  
Sbjct: 51  SRDDEMHDSRLSSNA-TASTRTLTCAEVIGDASGAIWM----RDSVGGAFTRT--RPIDA 103

Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
           ++ +  A  +   +R+  +P GFP +V+  Y  ++ W      F   + + + Q+LL ++
Sbjct: 104 KTGRERAVGVPE-LRAVFLPSGFPQTVSADYADWLRWHLTSLLFRDVIEIMSAQSLLVAL 162

Query: 178 GVSRNRATPGAV----AINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELG 232
           GV    +TPGA+    A  W+LKDG G    ++  A  G+ +D D K+     + L ++ 
Sbjct: 163 GVG---STPGALPLTAAAKWVLKDGVGSFATLLAGAFGGRWYDEDPKRWWAVSNALEDVA 219

Query: 233 AGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIY-KAFAKGENIGDVTAKGECV 291
             +EL T A P LFLPLA +A   +  A     S     + + F +  N+GD+ AK E  
Sbjct: 220 RAIELVTPAAPGLFLPLAASATFVRAAALTGRGSLMNGTFMQHFGRNNNLGDIRAKLEVQ 279

Query: 292 GNIADLLGTGLSI----MISKRNPSL-------------ITTFALLSCGYVFSSYQEVKS 334
           G    L+   + I    +++ +  +L                + L+   ++FS +Q  +S
Sbjct: 280 GRWLALVALPVGIQVFQLVNAQASALHAEGDELGAAAAAFGGYGLVIGAHIFSCWQAARS 339

Query: 335 VVLHTLNRARFSVAVESFLKTG---HVPSLREGNLQENIF------SFPWLRDKPVVLGS 385
           +   +LN  R     E+++ +     +P+  E    E ++      + P     P  +GS
Sbjct: 340 LKFDSLNGFRLLRLAEAYVASDGKIELPNCVEIGEIEGVYKPRRTLTTPTFGASPNEIGS 399

Query: 386 RFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKV-YALLKDQAKSDDILKAAFHAYVLL 444
            +       DA++          YV+ ++ S+G V  AL+ + A   D+L AA     L 
Sbjct: 400 TW-------DAFIEATNASNGHPYVLGFDSSRGDVASALILESATPRDMLAAALACEKL- 451

Query: 445 HCINSSNGHRSLALKQHEYDNSNLMLTTADLE 476
                    RS  +  ++Y +S     TAD E
Sbjct: 452 ------RIDRSSVVSAYDYADSQ----TADFE 473


>gi|225683629|gb|EEH21913.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
           Pb03]
          Length = 509

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 84/343 (24%), Positives = 159/343 (46%), Gaps = 35/343 (10%)

Query: 136 VPDGFPDSVTPSYVPYMTW---------RALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           +P G+P SV+  Y+  +            +L+ F     G+  ++ +L  VGV    A+P
Sbjct: 73  LPAGYPHSVSDDYMDLLFLALENDRKYKDSLQAFSSSIAGLLASRAVLQGVGVGDASASP 132

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-H 244
               ++ +L++  GR+  ++FA R G   + + K  RFA D+L +    V+  + A P H
Sbjct: 133 TVALLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLSPAFPKH 192

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL-S 303
           L + L   +++ + +  V + S++  +   FAK  N+G++ AK      +  L+G    S
Sbjct: 193 LRVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNAKDSSQETVISLMGMLCGS 252

Query: 304 IMISKRNPSLITTFAL--LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
           +++S  +  L T  AL  L   ++ ++Y  V++V + TLNR R ++   +  + G   + 
Sbjct: 253 LVVSHISTPLTTGLALIFLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTP 312

Query: 362 REGNLQENIFS----FPWLRDKPVVLGSRFKDAFQD--------------PDAYLAVE-- 401
           ++ +  E IF       W      +   R  ++FQ+               DA + +   
Sbjct: 313 KQTSKCERIFERDGILRWKASSTTLGFCRIGNSFQELLCGSNGSYRANSIRDAPIDIPRL 372

Query: 402 -PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
              FEKE Y++ +NP   +   +LK+  +    LKA  HA ++
Sbjct: 373 LRLFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAWSHALLV 415


>gi|115434452|ref|NP_001041984.1| Os01g0141600 [Oryza sativa Japonica Group]
 gi|54290177|dbj|BAD61065.1| unknown protein [Oryza sativa Japonica Group]
 gi|113531515|dbj|BAF03898.1| Os01g0141600 [Oryza sativa Japonica Group]
          Length = 323

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 74/220 (33%), Positives = 105/220 (47%), Gaps = 10/220 (4%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           I+ +V+P GFP SV+  Y+ YM W+   +  G       T TLL +VGV     T  A +
Sbjct: 103 IKDFVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAAS 162

Query: 191 ---INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
              I W+ KDG G  G+++   R G  FD D K+ R   D +   G+  EL T   P  F
Sbjct: 163 AAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYF 222

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSI 304
           LPLA   N+AK V       +   I   FA+  N+G+V AK E     A LLG   G+ I
Sbjct: 223 LPLASLGNLAKAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVFI 282

Query: 305 M----ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
           M    I    P+L  T+  +   +++  YQ +  +   T+
Sbjct: 283 MDTSGIKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTV 322


>gi|449301217|gb|EMC97228.1| hypothetical protein BAUCODRAFT_147370 [Baudoinia compniacensis
           UAMH 10762]
          Length = 454

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 74/256 (28%), Positives = 131/256 (51%), Gaps = 10/256 (3%)

Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
           +AP + L+       +P GFP SVT  YV Y  + +L+ F    +G+ +++ +L SVGV 
Sbjct: 42  KAPWQRLLDIF----LPAGFPHSVTVDYVEYQIYDSLQAFSSNIVGLLSSRAVLLSVGVG 97

Query: 181 RNRATPGAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELAT 239
              A+P    +  I ++  GR+G + F A  G   + + K  R   D+  +    ++  +
Sbjct: 98  DADASPTYALLLHIAQESLGRIGTIAFVALYGTALEPECKMYRLLADVFNDFAFVLDCLS 157

Query: 240 AAVP-HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
            A+P +L +     +++ K +  V + S +  +   FA+  N+G++ AK      +  LL
Sbjct: 158 PAIPKYLRVVTLSFSSLLKALCGVAAGSAKASLSAHFAQWGNLGELNAKDSSQETVISLL 217

Query: 299 G--TGLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
           G   G  ++   R P +  TT  LL   ++ ++ + V++VV+ TLNR R ++ V   L+ 
Sbjct: 218 GMWVGSLVLSWVRTPQATWTTLILLLAIHLETNRRAVRAVVMRTLNRQRATL-VFHHLQR 276

Query: 356 GHVPSLREGNLQENIF 371
           GHVP+  E + QE IF
Sbjct: 277 GHVPTPEEVSKQERIF 292


>gi|380480425|emb|CCF42443.1| hypothetical protein CH063_12442 [Colletotrichum higginsianum]
          Length = 484

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 95/397 (23%), Positives = 178/397 (44%), Gaps = 56/397 (14%)

Query: 98  VAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIR--------SYVVPDGFPDSVTPSYV 149
           +AE D++G  K   + + S   P +  + L+ F R           +PDG+P SVT  Y 
Sbjct: 10  IAEYDKAGNLKAKYVESASRIGP-SRVDVLLPFKRKKPWLRLLEIFLPDGYPHSVTEDYA 68

Query: 150 PYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR 209
            Y  + +++ F G   G+ +++ +   +GV  + A+P    +  ++++  GR   + FA 
Sbjct: 69  AYQIYDSVQAFAGSIAGMISSRAVWEGLGVGDSLASPTGAMLVQVIRESTGRFATIAFAH 128

Query: 210 Q-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH--LFLPLACAANVAKNVAAVTSTS 266
             G   + + K  R A D+L ++   ++  +   PH   FL L C +++  + + V   +
Sbjct: 129 LFGTSIEAECKAWRLAADVLCDVAMVLDCLSPFFPHGVRFLVL-CFSSLLYSASGVAGNA 187

Query: 267 TRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTGLSIMISKRNPSLITTFALLSC 322
           +++ +   FAK  N+G++ AK       +  I  + GT +  +++ +  +  T   LL+ 
Sbjct: 188 SKSSLSGHFAKWNNLGELNAKDASQETAISLIGMISGTFVVSLLTDQRATWATLIFLLAL 247

Query: 323 GYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS------FPWL 376
            ++F +++ V++V   +LNR R ++A  +    G V +      +E IF       F W 
Sbjct: 248 -HLFLNWKGVRAVKSRSLNRQRANIAFSALFSKGQVLTPHHVAERERIFERRGGEVFRW- 305

Query: 377 RDKPVVLG-SRFKDAFQDPDAYLA--------------------VEPFFEKERYVV---- 411
            +   VLG  ++  + Q     LA                    +   FEKE YV+    
Sbjct: 306 -NSGAVLGHCKYGVSLQSLLEALASGGGGGRERCFRLATVDLSTLLRRFEKEEYVLWCVV 364

Query: 412 ----TYN-PSKGKVYALLKDQAKSDDILKAAFHAYVL 443
               +Y+  S+ +V  +LKD       LKA FH  +L
Sbjct: 365 SERTSYDAASRVRVMVVLKDGVSPKSQLKAWFHGLLL 401


>gi|452005321|gb|EMD97777.1| hypothetical protein COCHEDRAFT_1165025 [Cochliobolus
           heterostrophus C5]
          Length = 421

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 76/261 (29%), Positives = 127/261 (48%), Gaps = 17/261 (6%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFP SVT  YV Y  + +L+ F     G+  ++ +L  VGV  + ATP A  +  +L
Sbjct: 6   LPSGFPQSVTEDYVEYQIFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTAALLLSVL 65

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA-- 252
           ++  GR+  + FA R G   + + K  R   D+L + G  ++  +   P    PL  A  
Sbjct: 66  QESVGRMATIAFADRFGTALEPECKMYRLTADILNDSGMILDCLSPGFPK---PLRVAVL 122

Query: 253 --ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT-GLSIMISKR 309
             A+  +++  V + S++  +   FA   N+G+V AK      +  LLG    S ++S  
Sbjct: 123 SFASCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFVVSHI 182

Query: 310 NPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
              + T  T  LL   ++ ++Y  V++V + +LNR R ++     L+ G V S +E + +
Sbjct: 183 TSDIATWCTLILLLVIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLSPKEVSAR 242

Query: 368 ENIFS----FPWLRDKPVVLG 384
           E IF       W  D+  VLG
Sbjct: 243 ERIFERGGVLRWADDE--VLG 261


>gi|342881518|gb|EGU82408.1| hypothetical protein FOXB_07090 [Fusarium oxysporum Fo5176]
          Length = 448

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 74/293 (25%), Positives = 142/293 (48%), Gaps = 14/293 (4%)

Query: 98  VAERDRSGKFKRNSIRAIS--LQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWR 155
           + E D+SG   R     +   + +   P+  L+  +R   +P G+P SV+  Y+ Y  + 
Sbjct: 16  IVELDKSGNVSRTWNHTVDNHVVAKSKPSIPLIPSLREAFMPVGYPHSVSSDYLNYQFFD 75

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKF 214
           +L+ FF     +   + LL  +GV    ++     +  +LKD   R+  ++FA Q G + 
Sbjct: 76  SLQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAMSRIATIVFAHQFGLRI 135

Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAANVAKNVAAVTSTSTRTPIYK 273
           + D K+ RF  DL  +    +EL +    P+  +    +    + +  V + +++  +  
Sbjct: 136 EPDAKRFRFLADLFNDTAFFLELYSPYFGPYGKILALTSGETLRALCGVAAGASKAALSV 195

Query: 274 AFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSY 329
            FAK +N+ ++ AK       VG I  L+GT + +   + + S++    +L   +++ +Y
Sbjct: 196 HFAKHDNLSELNAKEASQETAVGLIGLLVGT-IVVKYVEDHHSVLFLMTVLVFAHLWMNY 254

Query: 330 QEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVV 382
             V++V +  LN+ R ++  E +LKTG+V S  E   +ENI  +     +PVV
Sbjct: 255 LGVRAVCMDNLNQQRATILFEEYLKTGNVLSPEEVAHRENILLW-----RPVV 302


>gi|170118219|ref|XP_001890291.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164634725|gb|EDQ99048.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 471

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 93/360 (25%), Positives = 166/360 (46%), Gaps = 36/360 (10%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
             ++  I    +P G+PDSV+P Y+ Y    AL+ F    +G+ +++ +L   GV    A
Sbjct: 43  RRVLDLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLVGLLSSRAILEGFGVGDPSA 102

Query: 185 TPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           T     +  +L+D   R+  ++ A   G     D+K+ R   D+L +    ++  +  + 
Sbjct: 103 TATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDVKKYRLLADMLNDAAVILDTISPRLD 162

Query: 244 HLFLP-----LACAANVAKNVAAVTSTSTRTPIYKAFA---KGE-NIGDVTAKGECVGNI 294
            LF P       C + + +++ A+++  ++  I   FA   KG  ++GD+ AK      +
Sbjct: 163 TLFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKETV 222

Query: 295 ADLLGTGLSIMISKR--NP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
             LLG  L  +I      P ++ TT  +L   ++  +Y  V+ +VL TLN  R  +A  +
Sbjct: 223 LALLGMLLGTLIVPHLTTPWTIYTTLLVLVGLHLTINYIGVRGLVLRTLNGQRTWLAWSA 282

Query: 352 FLKT--GHVPSLREGNLQENIFSFPWLRDKP--------VVLGSRFKDAFQDPDAYLAVE 401
           +++T  G  P+  E    E IF  P     P          +GS F +    P     +E
Sbjct: 283 YMRTQPGRAPTPDEVASLERIFERPGAFRDPRSGVLMGRCTIGSSFSEILSGPIPPQLLE 342

Query: 402 PFFEKERYVVTYN---------PS-KG--KVYALLKDQAKSDDILKAAFHAYVLLHCINS 449
             F++ERY++ ++         PS +G  +++  LK+   ++D LKA  HA  L   I+S
Sbjct: 343 -LFKEERYLLWFDRRCLLRPESPSVEGFLRLHICLKEGYNTEDQLKAWIHAAELCRAISS 401


>gi|255082448|ref|XP_002504210.1| predicted protein [Micromonas sp. RCC299]
 gi|226519478|gb|ACO65468.1| predicted protein [Micromonas sp. RCC299]
          Length = 226

 Score = 99.0 bits (245), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 64/216 (29%), Positives = 106/216 (49%), Gaps = 7/216 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P+ +P+SV+  Y  +  W   +       G  TT+ LL  VGV    AT  + A  +I 
Sbjct: 3   LPEDYPNSVSDDYAAFQAWDTAQGLSSYVRGSLTTKALLEGVGVGATGATAASAAAQFIA 62

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELAT--AAVPHLFLPLACA 252
           +D  G VG ++F   +G+  D + KQ R   DL   +G  +ELA   A     FL LACA
Sbjct: 63  RDMCGMVGGLVFTLAKGRHLDAEAKQWRLFADLANNVGMAMELAAPLANDRWWFLALACA 122

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK---- 308
            +VA+++    + +TR  + + FA+  N  D+ AK         L+G  L + ++     
Sbjct: 123 GSVARSLCGCAAGATRAALTQHFARARNAADIAAKEGSQETAVTLVGMSLGVAVTTSTDG 182

Query: 309 RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
           +       F  L+  +V+ + + V+S+V+ +LNR R
Sbjct: 183 KTAWQWAVFVALTVAHVYCNVRAVRSLVIDSLNRER 218


>gi|413956563|gb|AFW89212.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
          Length = 263

 Score = 98.6 bits (244), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 51/164 (31%), Positives = 82/164 (50%), Gaps = 1/164 (0%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           E + S +    VP+GFP SVTP YVP+  W  L+        + +TQ LL ++GV    A
Sbjct: 51  ERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSA 110

Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           T       W L+D  G +G ++F   QG   D + K  R   D + +LG  ++L +   P
Sbjct: 111 TVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFP 170

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 287
              + + C  +++++   V S +TR  + + FA   N  D++AK
Sbjct: 171 SSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAK 214


>gi|451846811|gb|EMD60120.1| hypothetical protein COCSADRAFT_151456 [Cochliobolus sativus
           ND90Pr]
          Length = 482

 Score = 98.2 bits (243), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 79/296 (26%), Positives = 142/296 (47%), Gaps = 16/296 (5%)

Query: 98  VAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRAL 157
           V E  + G+ KR     + +  P+   + ++ +I    +P GFP SVT  Y+ Y  + +L
Sbjct: 33  VGEHGKHGEGKR-----VDVIVPKQAHKSVLQYIIDVFLPSGFPQSVTEDYIEYQVFDSL 87

Query: 158 KHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDY 216
           + F     G+  ++ +L  VGV  + ATP A  +  +L++  GR+  + FA R G   + 
Sbjct: 88  QAFSSSIAGMLASRAVLQGVGVGDSNATPTAALLLSVLQESVGRMATIAFADRFGTALEP 147

Query: 217 DLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAF 275
           + K  R   D+L +    ++  +   P  L + +   A+  +++  V + S++  +   F
Sbjct: 148 ECKMYRLTADILNDSAMILDCLSPGFPKPLRVAVLSFASCLRSLCGVCAGSSKASLSAHF 207

Query: 276 AKGENIGDVTAKGECVGNIADLLGT-GLSIMISKRNPSLIT--TFALLSCGYVFSSYQEV 332
           A   N+G+V AK      +  LLG    S ++S     L T  T  LL   ++ ++Y  V
Sbjct: 208 ATAGNLGEVNAKDSSQETVISLLGMLAGSFVVSHITSDLATWCTLILLLAIHLATNYAAV 267

Query: 333 KSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG 384
           ++V + +LNR R ++     L+ G V + +E + +E IF       W  D+  VLG
Sbjct: 268 RAVSMQSLNRQRANILFSHILQHGIVLNPKEVSARERIFERGGVLRWADDE--VLG 321


>gi|223974077|gb|ACN31226.1| unknown [Zea mays]
          Length = 207

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 78/153 (50%), Gaps = 1/153 (0%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           VP+GFP SVTP YVP+  W  L+        + +TQ LL ++GV    AT       W L
Sbjct: 6   VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATFQWFL 65

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  G +G ++F   QG   D + K  R   D + +LG  ++L +   P   + + C  +
Sbjct: 66  RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 125

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 287
           ++++   V S +TR  + + FA   N  D++AK
Sbjct: 126 LSRSFTGVASGATRAALTQHFALANNAADISAK 158


>gi|297740212|emb|CBI30394.3| unnamed protein product [Vitis vinifera]
          Length = 462

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 86/329 (26%), Positives = 144/329 (43%), Gaps = 16/329 (4%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS----RNRATP 186
           ++ +++P GFP SV+  Y+ YM  +   +          T +LL +   S     N    
Sbjct: 76  VKDFILPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKASCWSWFFFSNHCCC 135

Query: 187 GAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
                    +DG G VG++    Q G  FD D KQ R   DL+   G+  +L+T   P  
Sbjct: 136 FCCCFCCCHQDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLYPAY 195

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
           FL LA   N+AK VA      +   I   FA   N+G+V AK E     A LLG  L IM
Sbjct: 196 FLQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIM 255

Query: 306 ------ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
                 ++   P L  T+  +   +++  YQ +  +   ++N  R  + V+S +    VP
Sbjct: 256 VLTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVP 315

Query: 360 SLREGNLQENIFSFPWLRDKPVVLGSRFKDAF--QDPDAYL-AVEPFFEKERY--VVTYN 414
              + N +E I S+       ++ G   ++    + P + +  +   + +E+Y  VV   
Sbjct: 316 GYVDCNRKEKILSWQRFLKPRIIFGVSLEELIGGERPVSKVRTLLKLYSQEKYILVVKQQ 375

Query: 415 PSKGKVYALLKDQAKSDDILKAAFHAYVL 443
            +  +V+   K  A S  +L++ + +Y L
Sbjct: 376 GTDFEVFVSFKVGATSLSVLRSIWQSYWL 404


>gi|358397603|gb|EHK46971.1| hypothetical protein TRIATDRAFT_291240 [Trichoderma atroviride IMI
           206040]
          Length = 468

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 86/326 (26%), Positives = 155/326 (47%), Gaps = 17/326 (5%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           I    +P GFP SV+  Y+ Y T+ +L+ FF     +   + LL  +GV    ++     
Sbjct: 57  IMDSFLPVGFPHSVSKDYLAYQTFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFAL 116

Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           +  I+KD   R+  ++FA R G + + D K+ RF  D+  +    +EL +  +      +
Sbjct: 117 LLTIMKDAMSRIATIVFAHRFGLRIEPDAKRYRFLADVFNDSAFFLELYSPYLGSWGKVI 176

Query: 250 ACAANVA-KNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSI 304
           A +   A + +  V + +++  +   FAK +N+ ++ AK       VG I  L+GT +  
Sbjct: 177 ALSTGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVGTLVVK 236

Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
           MI   + S++    LL   ++ ++Y  V+SV + TLNR R ++  E FL++G V +  + 
Sbjct: 237 MIEDHS-SVVILMILLVIAHLGTNYLGVRSVCMATLNRQRATIVFEEFLRSGKVLTPEQV 295

Query: 365 NLQEN-IFSFPWLRD--KPVVLGSRFKDAF------QDPDAYLAVEPFFEKERYVVTYNP 415
             +EN IF  P +R+     V+   F + +       DP   L +       R++ +  P
Sbjct: 296 AERENIIFWNPIMRNLHGKKVVHVEFAEGYSHAMSCSDPHDKLIIVDGINHTRFICSRVP 355

Query: 416 SK-GKVYALLKDQAKSDDILKAAFHA 440
            +   +  LL   A+  D ++A F A
Sbjct: 356 GRVTPIKLLLWVGAQPIDAIRAWFAA 381


>gi|307111734|gb|EFN59968.1| hypothetical protein CHLNCDRAFT_133075 [Chlorella variabilis]
          Length = 434

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/347 (24%), Positives = 142/347 (40%), Gaps = 54/347 (15%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           ++ +R++ +P G+P SVTP Y+ Y       H  G       T +++ ++GV    AT  
Sbjct: 32  LAAVRAFFLPTGWPHSVTPDYLRYQLASVPAHITGWMSHSLATSSMITALGVDAGPATIA 91

Query: 188 AV--AINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP- 243
           A   A+ W+ KDG G  G++I   R    FD D K+ R   + +  +G  +E+AT   P 
Sbjct: 92  ATSAAVKWLTKDGLGAAGRLIVGGRLSSVFDEDPKRWRMVAEGVTTVGLAMEIATQIYPG 151

Query: 244 -------------HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGEC 290
                          F+ LA    + K++A    T     I   F+   N+GDV AK E 
Sbjct: 152 GLRSKFRVVSVVWSYFIALAGTGTLCKSMAKGMGTPCFRIIQTHFSATNNVGDVAAKEEV 211

Query: 291 VGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVE 350
               A LLG   S+         +   AL + G               T+   R ++   
Sbjct: 212 WEVAAQLLGLAASV---------VALQALEAAGSP------------DTVVPKRAALMAS 250

Query: 351 SFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEP-------- 402
           S ++TG VPS+ + NL E +   P +    V  G     A                    
Sbjct: 251 SHVRTGLVPSVEQVNLSEQVLLPPTMCQPSVHYGCTLDQALGSGGGGFGGGAASGGRGQQ 310

Query: 403 ------FFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                  ++ E+Y++T++  +G  + L+ + A+  D+L+A + A  L
Sbjct: 311 LAELVRLYDGEQYMLTWH--RGAAWVLVLEGAQRADLLRAMWQAAWL 355


>gi|223974535|gb|ACN31455.1| unknown [Zea mays]
          Length = 115

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 51/89 (57%), Positives = 66/89 (74%), Gaps = 4/89 (4%)

Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYD---NS 466
           +VTYNP+K KVYALLKDQAKSDDI+KAAFHA+VLLH IN+S+  R L LKQ + +   + 
Sbjct: 1   MVTYNPTKDKVYALLKDQAKSDDIIKAAFHAHVLLHFINASHA-RRLKLKQKQANPDQSE 59

Query: 467 NLMLTTADLEAHIGESCKVVSTALWGFQE 495
           +L     D  AHI ESCK+VS++   F++
Sbjct: 60  HLYSRNVDFLAHIAESCKIVSSSYGTFKK 88


>gi|148685696|gb|EDL17643.1| cDNA sequence BC017158, isoform CRA_a [Mus musculus]
          Length = 226

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 61/194 (31%), Positives = 94/194 (48%), Gaps = 14/194 (7%)

Query: 72  LVEDQFVDSSLRLICSEDIDG-------RRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
           L  +QF   + R  CS   DG       RRW +++         +    A     P A A
Sbjct: 11  LCTEQFGSGAPR-GCSAAADGSLQWDGARRWGWLSR----APIAKPGQHAGGGGGPWA-A 64

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
              +S +RS ++P GFPDSV+P Y+PY  W +++ F     G   TQ +L  +GV   +A
Sbjct: 65  LTALSGLRSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKA 124

Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           +  A    W++KD  G +G++IFA  +G K D + KQ R   D+L ++   +E+     P
Sbjct: 125 SVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYP 184

Query: 244 HLFLPLACAANVAK 257
             F      +N+AK
Sbjct: 185 IFFTMTVSTSNLAK 198


>gi|302837684|ref|XP_002950401.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
           nagariensis]
 gi|300264406|gb|EFJ48602.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
           nagariensis]
          Length = 500

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 124/243 (51%), Gaps = 18/243 (7%)

Query: 151 YMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR- 209
           + TW  ++       GV T+Q +L  VGV +  +TP A    + ++D AG +G ++F+  
Sbjct: 4   FQTWDTIQALSSYVRGVLTSQAILRGVGVGQEASTPLAAVFTFFVRDLAGMLGGILFSYL 63

Query: 210 QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTR 268
           +G  FD   KQ R   D+  +LG  VELA+   P  +FLP+AC  +VA+++  V   +TR
Sbjct: 64  EGSSFDACAKQWRLFADITNDLGMTVELASPLFPRVMFLPIACLGSVARSLTGVAGGATR 123

Query: 269 TPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSI---MISKRNPSLITT------- 316
             + + FA+  N  DV+AK +       +LG  TG+++   + +   P+   T       
Sbjct: 124 AALTQHFARRGNAADVSAKEQSQETATTILGMVTGMAVTRLLAAGEGPAGGPTGAAGFVI 183

Query: 317 ----FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS 372
               F  L+  +V ++   ++ ++L +LN  R  + V+ +L+ G V S R  +  E++  
Sbjct: 184 AWLVFLGLTAVHVVANVAAMRVLLLTSLNTPRLELLVDRYLQDGRVLSPRAVSQLEDLTP 243

Query: 373 FPW 375
            P+
Sbjct: 244 PPF 246


>gi|145347876|ref|XP_001418386.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578615|gb|ABO96679.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 362

 Score = 96.3 bits (238), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 72/259 (27%), Positives = 113/259 (43%), Gaps = 46/259 (17%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           M  +R   +P GFP + +  YV ++ W+ L   F   + + + Q+LL ++G+     TPG
Sbjct: 1   MPELRRIFLPSGFPATTSADYVEWLRWQLLSLLFRDVLEIMSAQSLLVALGMGN---TPG 57

Query: 188 AV----AINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           A+    A  W+LKDG G    +I     G+++D D K+     + L ++   +EL T A 
Sbjct: 58  ALPLTAAAKWVLKDGVGSFATLIAGSFGGQRYDEDPKRWWGLSNTLEDVARAIELVTPAA 117

Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIY-KAFAKGENIGDVTAKGECVGNIADLLGTG 301
           P LFLPLA +A   +  A     S     + + F +  N+GD+ AK E  G    L+   
Sbjct: 118 PGLFLPLAASATFVRAAALTGRGSLLNGTFMQHFGRNNNLGDIRAKLEVQGRWLALIAL- 176

Query: 302 LSIMISKRNPSLITTFALLSCG---------------------------YVFSSYQEVKS 334
                    P+ I  F L+S                             +VF+ +Q  KS
Sbjct: 177 ---------PAGIQVFQLVSAAAADLAADGDEFGAFVAAFGAYGGVIGVHVFACWQAAKS 227

Query: 335 VVLHTLNRARFSVAVESFL 353
           +   TLNR R     E+F+
Sbjct: 228 LKFDTLNRFRLLKLAEAFV 246


>gi|414586419|tpg|DAA36990.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 311

 Score = 96.3 bits (238), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 72/255 (28%), Positives = 122/255 (47%), Gaps = 13/255 (5%)

Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
           D+L +LG  +E+ +   P LFL +A   N AK +A V + +TR PIY +FAK  N+ D+ 
Sbjct: 33  DVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLF 92

Query: 286 AKGECVGNIADLLGTGLSI-----MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
           AKGE +  + +++G G  I     + S     LI    LLS  +++   QE+++  ++TL
Sbjct: 93  AKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG-PLLSAVHIYGVIQEMRATPVNTL 151

Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFP---WLRDKPVVLGSRFKDAFQDPDAY 397
           N  R ++ V  F+K+G V S  E   +E++  FP         V +G   +  F  P   
Sbjct: 152 NPQRTAMIVADFIKSGKVSSPAELRYKEDLL-FPNRVIEEAGSVKIGQPLRKVF-SPRLV 209

Query: 398 LAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLA 457
             +   F  E+++++   +  + Y +L+  A  +D L+    A        S  G R  A
Sbjct: 210 EQLRATFPGEKFLLSQKSN--RTYMVLERSASGEDALRGWLVAAFASEMERSGAGSRDDA 267

Query: 458 LKQHEYDNSNLMLTT 472
           +    Y+    +  T
Sbjct: 268 VLSEAYEKMERVFPT 282


>gi|322700393|gb|EFY92148.1| DUF647 domain-containing protein [Metarhizium acridum CQMa 102]
          Length = 466

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 82/312 (26%), Positives = 147/312 (47%), Gaps = 29/312 (9%)

Query: 98  VAERDRSGKFKR-------NSIRAISLQSPQAPAEELMSFIRSY-------VVPDGFPDS 143
           ++E D+SG  +R       N+I    L+S   P+  + S +R +        +P GFP S
Sbjct: 30  ISELDKSGNVQRRWLHRSDNTIAV--LESETEPSSIIHSSVREWKRFLSDAFLPIGFPHS 87

Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVG 203
           V+  Y+ Y T+ +L+ FF     +   + LL  +GV    ++     +  IL D   R+ 
Sbjct: 88  VSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSSATFAMLITILTDATSRIA 147

Query: 204 KMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA-KNVAA 261
            ++FA R G + + D K+ RF  D+  +    +EL +         LA +   A + +  
Sbjct: 148 TIVFAHRFGLRIEPDAKRFRFLADIFNDTAFFLELYSPYFGDYGKILALSVGQALRALCG 207

Query: 262 VTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKRNPSLITTF 317
           V + +++  +   FAK +N+ ++ AK       VG I  L+GT L + + + + S++   
Sbjct: 208 VAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVGT-LVVKLVQDHQSVVYLM 266

Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR 377
             L   +++ +Y  V+SV + T NR R ++    ++ TG V +      +ENI  +    
Sbjct: 267 VTLVLAHLWMNYLGVRSVCMTTFNRQRATILFREYMTTGTVLTPEAVAARENIVLWA--- 323

Query: 378 DKPVVLGSRFKD 389
             PVV GSR  +
Sbjct: 324 --PVV-GSRLAE 332


>gi|317037751|ref|XP_001399065.2| hypothetical protein ANI_1_1452164 [Aspergillus niger CBS 513.88]
          Length = 525

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 78/303 (25%), Positives = 140/303 (46%), Gaps = 17/303 (5%)

Query: 65  DETTAFRLVE-DQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRA-ISLQSPQA 122
           +++TA    E D+    +   I SE   G      A R+   K   N  R  ++  S  A
Sbjct: 5   EDSTAITFTEVDEVNHPTATYIYSEPAAGP-----AHREHGKKHGENWGRVDVAHASSTA 59

Query: 123 PA----EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
           PA    + +  F+    +P G+P SVT  Y PY  + +L+ F     G+ +++ +L  VG
Sbjct: 60  PAAWSSKSVQDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVG 119

Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
           V    A+P A  +  IL+D +GR+  ++FA R G   + + K  R A D+  +L   ++ 
Sbjct: 120 VGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMILDC 179

Query: 238 ATAAVP--HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 295
            +  +P     + +   A V + +  V   S++  +   F++  N+ +V AK      I 
Sbjct: 180 LSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETII 239

Query: 296 DLLGTGLSIMISKRNPSLITTFA---LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
            L+G  +   +  R  +  TT+    +L   ++  +Y  V+SV + +LNR R ++   + 
Sbjct: 240 SLIGMLVGSFVVSRVTNFSTTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTL 299

Query: 353 LKT 355
           L T
Sbjct: 300 LNT 302


>gi|452823886|gb|EME30893.1| hypothetical protein Gasu_19060 [Galdieria sulphuraria]
          Length = 346

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 62/236 (26%), Positives = 111/236 (47%), Gaps = 9/236 (3%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L + +R   +P G+P+SVT  Y  +  W +L+        V  T++ L+ +GV  N A P
Sbjct: 60  LKNVLRRLFLPSGYPESVTDDYANFEKWDSLQGLCSYLRQVMVTRSTLSGLGVGNNFAHP 119

Query: 187 GAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH- 244
            A A+  + ++G G VG ++F   +  +FD D K  R   D++  +   +E      P  
Sbjct: 120 TAAALMLLWREGFGVVGGLLFTWGKSSRFDKDSKSWRLFADVINNVALTMEWTAPLFPTP 179

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
           +F  L  +AN+  ++  +   +T T     FAK  NIGD+ +K   +G    L+G  LS 
Sbjct: 180 VFFALHSSANIFHSLCGIAGGATCTAFQAHFAKRNNIGDICSKHGNIGRAVSLVGLLLST 239

Query: 305 -------MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
                  ++    P     F  L+  +++ +   ++S+ L  LN+ R  +  E ++
Sbjct: 240 LCFWSSSLLENSLPFSFGVFCFLTVFHIYFNILALRSLHLAVLNKERGYIVAEHYI 295


>gi|384249834|gb|EIE23315.1| hypothetical protein COCSUDRAFT_15701 [Coccomyxa subellipsoidea
           C-169]
          Length = 265

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 65/182 (35%), Positives = 91/182 (50%), Gaps = 3/182 (1%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGV--SRNRATPG 187
           F   + +P G+P++VTP Y+ +  W    H  G       T +LL +VGV  S   AT  
Sbjct: 4   FCTGFFLPRGYPETVTPDYLRFQLWAIPAHITGWMSTSLATSSLLKAVGVGNSAVGATAA 63

Query: 188 AVAINWILKDGAGRVGK-MIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A AI WI KDG G  G+ ++  R G  FD D K+ R   +  + +G  +E+ATA  P  F
Sbjct: 64  AAAIKWITKDGIGAAGRVLVGGRLGNVFDEDPKRWRMVAEAFLTVGLALEIATAFSPANF 123

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI 306
           + LA A N+++ V    +      I   FA   N+GDV AK E     A LLG   S+ +
Sbjct: 124 IVLAGAGNLSRAVGRGMTNPCFRIIQTHFAAASNVGDVAAKEEVWEVSAQLLGLAGSVGL 183

Query: 307 SK 308
            K
Sbjct: 184 LK 185


>gi|303280291|ref|XP_003059438.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459274|gb|EEH56570.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 568

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 80/296 (27%), Positives = 135/296 (45%), Gaps = 37/296 (12%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV- 189
           +R+  +P G+PD+V+  Y  ++ W      F   + V T+Q+LL ++G+    +TPGA+ 
Sbjct: 146 LRAVFLPAGYPDTVSSDYATFIRWHLASLMFRNVLEVLTSQSLLVALGMG---STPGALP 202

Query: 190 ---AINWILKDGAGRVGKM---IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
              A  W+LKDG G    +   +F   G+K+D D K+     + L ++   +EL T A P
Sbjct: 203 LAAATKWVLKDGVGSFATLLAGVFG--GQKYDEDPKRWWAVTNSLEDVARVLELVTPAYP 260

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIY-KAFAKGENIGDVTAKGECVGNIADLLG--- 299
            LFLPLA +A   +  A     S     + +   + +N GDV AK E +G IA   G   
Sbjct: 261 ALFLPLAASATFVRAGALTGRGSLINGTFMQHLGRNQNFGDVRAKMEALG-IAIFRGVSG 319

Query: 300 ---------------TGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
                          T +S +   R   +   +  +   ++F  ++    +    LNR R
Sbjct: 320 AFGADAAAMAAEGGETAMSELT--RYGGVFGAYGAVLSSHLFCCWKAADVLRFDALNRHR 377

Query: 345 FSVAVESFLKTGH--VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYL 398
            + A E +++ G   V S+ +   +E ++        P  LG+  +DA +D D  L
Sbjct: 378 LTRAAEEYVRGGREAVLSVEDAGEREGVYRARVTASTP-TLGASVEDAARDWDDLL 432


>gi|322707810|gb|EFY99388.1| DUF647 domain-containing protein [Metarhizium anisopliae ARSEF 23]
          Length = 462

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 84/307 (27%), Positives = 147/307 (47%), Gaps = 28/307 (9%)

Query: 98  VAERDRSGKFKR-------NSIRAISLQSPQAPAEELMSFIRSY-------VVPDGFPDS 143
           ++E D+SG  +R       N+I AI L+S   P   + S +R +        +P GFP S
Sbjct: 30  ISELDKSGNVQRRWLHRSDNTI-AI-LESKTEPNSIIHSSVREWKRFLSDAFLPIGFPHS 87

Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVG 203
           V+  Y+ Y T+ +L+ FF     +   + LL  +GV    ++     +  IL D   R+ 
Sbjct: 88  VSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSSATFAMLITILTDATSRIA 147

Query: 204 KMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA-KNVAA 261
            ++FA R G + + D K+ RF  D+  +    +EL +         LA +   A + +  
Sbjct: 148 TIVFAHRFGLRIEPDAKRFRFLADIFNDSAFFLELYSPYFGDYGKILALSVGQALRALCG 207

Query: 262 VTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKRNPSLITTF 317
           V + +++  +   FAK +N+ ++ AK       VG I  ++GT L + + + + S++   
Sbjct: 208 VAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLVVGT-LVVKLVQDHQSVVYLM 266

Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR 377
             L   +++ +Y  V+SV + TLNR R ++    ++ TG V +      +ENI    WL 
Sbjct: 267 VALVFAHLWMNYLGVRSVCMTTLNRQRATILFREYMTTGRVLTPEAVAARENI--VLWL- 323

Query: 378 DKPVVLG 384
             PVV G
Sbjct: 324 --PVVGG 328


>gi|7327810|emb|CAB82267.1| putative protein [Arabidopsis thaliana]
          Length = 447

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 79/272 (29%), Positives = 127/272 (46%), Gaps = 12/272 (4%)

Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
           I W+ KDG G +G+++   R G  FD D KQ R   D +   G+  +LAT   P  FL L
Sbjct: 120 IRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFLLL 179

Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI--- 306
           A   N+AK VA      +   I   FA   N+G+V AK E     A L+G G  I+I   
Sbjct: 180 ASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGILIIDT 239

Query: 307 ---SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
               K  P ++ T+  +   +++  YQ +  +  +T+N  R  + VES +    VP   +
Sbjct: 240 PGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSVVPGYVD 299

Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAF---QDPDAYLAVEPFFEKERYVVTYNP-SKGK 419
            N +ENI  +       ++ G   ++     +      A+   + KE+Y++T N  +K  
Sbjct: 300 CNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVSKVKALLKMYTKEKYILTLNKLNKDT 359

Query: 420 VYAL-LKDQAKSDDILKAAFHAYVLLHCINSS 450
            +++  K  A S D+L+  + AY L   +  S
Sbjct: 360 EFSVSFKVNATSRDVLRCLWQAYWLEENMEES 391


>gi|429856893|gb|ELA31783.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 500

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/369 (25%), Positives = 173/369 (46%), Gaps = 47/369 (12%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP-GAVAINWI 194
           +PDG+P SVT  Y  Y  + + + F G   G+ +++ +   +GV  + A+P GA+ I  +
Sbjct: 82  LPDGYPHSVTDDYAAYQIYDSFQAFAGTIAGMISSRAVWEGMGVGDSMASPIGAMMIQ-V 140

Query: 195 LKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH--LFLPLAC 251
           +++  GR+G + FA   G   + + K  R A D+L  +   ++  +   P    FL L C
Sbjct: 141 IRESMGRLGTIAFAHLMGTSIEAECKAYRMASDVLCSIAFILDCLSPLFPRSIRFLILCC 200

Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIM--IS 307
           ++ +    + V   ++++ +   FAK  N+G++ AK         L+G  TG  ++  ++
Sbjct: 201 SS-ILFAASGVAGNASKSSLSGHFAKWNNLGELNAKDASQETAISLMGMLTGTLVVSCLT 259

Query: 308 KRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
               +  T F L++  Y+F +++ V++V   +LNR R ++A  + L    V +  + + +
Sbjct: 260 TPTTTWTTLFILIAI-YLFLNWKGVRAVKSRSLNRQRANIAFSALLSKDQVLTPGKISEK 318

Query: 368 ENIFS------FPWLRD----------------KPVVLGSRFKDAFQDPDAYLA-VEPFF 404
           E IF       F W  D                + +  G +   +FQ P   L+ +   F
Sbjct: 319 ELIFERRGGEVFRWNADTILGHCKFGVPLESMLQALANGQQNMRSFQLPTVDLSNLMELF 378

Query: 405 EKERYVV--------TYN-PSKGKVYALLKDQAKSDDILKAAFHAYVL---LHCINSSNG 452
           E ++Y++         Y+  S+ KV  +LK+   +   LKA FH  +L   LH + +   
Sbjct: 379 ENQQYIMWCQISEATNYDAASRIKVLIVLKNGVSAKSQLKAWFHGLLLAERLHGLKAPEL 438

Query: 453 HRSLALKQH 461
            R+  L QH
Sbjct: 439 RRA-QLGQH 446


>gi|452842814|gb|EME44750.1| hypothetical protein DOTSEDRAFT_130464 [Dothistroma septosporum
           NZE10]
          Length = 447

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 101/409 (24%), Positives = 187/409 (45%), Gaps = 53/409 (12%)

Query: 113 RAISLQSPQAPAE----ELMSFIRS---YVVPDGFPDSVTPSYVPYMTWRALKHFFGGAM 165
           +A  +QSP+   E    +++S  +      +P GFP+SVT  Y+ Y  + +L+ F     
Sbjct: 18  KATYVQSPEGSTERSRVDVISLWQQAFDVFMPVGFPNSVTHDYLEYQIYDSLQAFSSSIA 77

Query: 166 GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFA 224
            +  ++ +L+SVGV    A+P A  +  I +  AGR+  ++FA R G     + K  R  
Sbjct: 78  SLLASRAVLSSVGVGDADASPTAAILLSIFQTSAGRLATILFAHRLGTSLGPECKMYRLL 137

Query: 225 GDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGD 283
            D+L + G  +++ + A P  + + +   +++ +++  V + S +  +   FA+  N+ +
Sbjct: 138 ADILNDTGFLLDMISVAFPKPISVIILSFSSILRSLCGVAAGSAKASLSAHFARWGNLAE 197

Query: 284 VTAKGECVGNIADLLGTGL-SIMIS--KRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
           + AK      +  LLG    S++IS      +  TT  +L   ++ ++ + V++V + TL
Sbjct: 198 LNAKDSSQETVISLLGMWAGSVVISYITEPTAAWTTLIVLLAIHLETNRRAVRAVTMQTL 257

Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRD---------------------- 378
           NR R ++     L+ GHVP+  E   QE IF     RD                      
Sbjct: 258 NRQRATLMFH-HLQRGHVPTPVEIASQERIFE----RDGVLRGNDDRILGHCAVGASTER 312

Query: 379 --KPVVLGSRFKDAF--QDPDAYLAVEPFFEKERYVVTYN----PSKGKVYALLKDQAKS 430
             K V        AF  Q P A  ++   +EK  Y++  +    P +  ++ +LK  A  
Sbjct: 313 LLKAVSGAQTSTKAFPSQGP-AMQSILTVYEKSPYILWQDRRSLPGRSSLFIVLKKGATP 371

Query: 431 DDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHI 479
            ++L A + A   L   +++N  +S   +  +Y    LM +T D  A++
Sbjct: 372 PEMLTAWWQA---LAVASANNIKKS--TQPDQYAELELMRSTMDQAANL 415


>gi|219129806|ref|XP_002185071.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403566|gb|EEC43518.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 500

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 60/237 (25%), Positives = 112/237 (47%), Gaps = 7/237 (2%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           + S +RS  +P G+P    P Y+ Y  W  ++        V  TQ +L  VGV R  AT 
Sbjct: 77  VQSGLRSTFLPSGYPARTPPGYLRYSVWSWVQDLSTQLRSVLATQRVLEGVGVGREGATA 136

Query: 187 GAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PH 244
            +  +N++++DG G    + F +    +F  D+K+ R   D+++++G  +E+A +A  P 
Sbjct: 137 LSALLNFLVRDGCGMAASLAFTSAAASRFQSDVKRWRIFADVMVDIGITLEVAASACPPT 196

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
           LFLP+    N  K +  V + +    I   +AKG +I D+ AK      +   LG   + 
Sbjct: 197 LFLPMISLGNACKAMCGVAAGACGGAINLHWAKGSDISDIQAKFGAQHTVTGSLGLVFAA 256

Query: 305 MISKRNPS-----LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
             +K   +     L   +A L+  +V ++ + ++ +     N  R  + + +  ++G
Sbjct: 257 FFAKSVSTVPGGRLWILYAALTALHVMANLRCMRIIHFDVFNTTRLRLVLRALFESG 313


>gi|413942078|gb|AFW74727.1| hypothetical protein ZEAMMB73_469739, partial [Zea mays]
          Length = 199

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 83/159 (52%), Gaps = 7/159 (4%)

Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELM--SFIRSYVVPDGFPDSVTPSYVPYMTWRALKH 159
           +R  K    S+  + +     P  + +  SF+  +  P G+P SV   Y+ Y  +R L+H
Sbjct: 45  ERFSKLHDKSVTTVKVIQDSRPIHDKLVDSFLNKFF-PSGYPYSVNEGYLTYTKFRVLQH 103

Query: 160 FFGGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDL 218
           F    + V +TQ+LL +VG+   R TP  A A++WILKDG     K+I +  G + D + 
Sbjct: 104 FSSAMLHVLSTQSLLFAVGL---RPTPTQATAVSWILKDGMQHTRKLICSSMGARMDSEP 160

Query: 219 KQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAK 257
           K  R   D+L +LG  +E+ +   P LFL +A   N AK
Sbjct: 161 KSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAK 199


>gi|298708275|emb|CBJ48338.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 284

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 107/220 (48%), Gaps = 5/220 (2%)

Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
            + +F+R   +P G+P SV   Y+ Y TW  ++       GV     +L++ GV    A+
Sbjct: 62  SMSAFVRRTFLPTGYPSSVRKEYMEYQTWDTVQALCSYLRGVLCLTAMLHASGVGDENAS 121

Query: 186 PGAVAINWILKDGAGRVGKMIFARQG-KKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
             A A+ W+L+DG G +G ++F+       D ++K+ R   D+  ++G  ++L       
Sbjct: 122 AIAAAMVWVLRDGFGMIGSLLFSYAASSHMDSNIKEWRLFADIANDVGLTLDLVAPVFRG 181

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIAD-LLGT 300
            F  ++  A V K +  V +  T+T I   FA  +N+ D++AK    E   N+   LLG 
Sbjct: 182 NFAVVSSLATVFKALCGVAAGCTKTSITSHFALKQNMADISAKENAQETAVNVCGMLLGV 241

Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
            ++  +S      +  F   S  +V+++Y+ + S+ L  L
Sbjct: 242 KVASYLSDAPTGRVLIFVFFSLLHVYANYRYMPSIHLLPL 281


>gi|149067629|gb|EDM17181.1| similar to cDNA sequence BC017158, isoform CRA_a [Rattus
           norvegicus]
          Length = 246

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 98/201 (48%), Gaps = 14/201 (6%)

Query: 66  ETTAFR--LVEDQFVDSSLRLICSEDIDGR-RWNYVAERDRSGKFKRNSIRAISL----- 117
           +T + R  L  +QF   + R  CS   DG  +W+      R G F R SI          
Sbjct: 3   DTASLRAPLCTEQFGSGAPR-SCSAAADGSLQWD---RAQRWGWFSRASITKPGQHEGGG 58

Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
           + P A A   +S +RS ++P GFPDSV+P Y+ Y  W +++ F     G   TQ +L  +
Sbjct: 59  RGPWA-ALTTLSGLRSVLLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGL 117

Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVE 236
           GV   +A+  A    W++KD  G +G++IFA  +G K D + KQ R   D+L +    +E
Sbjct: 118 GVGNAKASVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLE 177

Query: 237 LATAAVPHLFLPLACAANVAK 257
           +     P  F      +N+AK
Sbjct: 178 IMAPMYPIFFTMTVSTSNLAK 198


>gi|393217163|gb|EJD02652.1| DUF647-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 455

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 87/334 (26%), Positives = 152/334 (45%), Gaps = 31/334 (9%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFPDSVTP Y+ Y  + +L+ F     G+ +++ +L   GV    A+     +  +L
Sbjct: 51  LPSGFPDSVTPDYIGYQIFDSLQAFSSSIAGLLSSRAVLEGFGVGNENASSTHAVLLTVL 110

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAA 253
           +D  GR+  ++FA + G     + K  R A D+  +    ++  + A+P  L +   C A
Sbjct: 111 QDATGRLATILFAWKFGPALAPEAKMYRLAADIFNDSAFVLDCLSPALPTQLRIAALCLA 170

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTGLSIMISKR 309
              + +  V     +  +   FA+  N+G++ AK       +G    L G+ +   I+ R
Sbjct: 171 GALRAICGVCGGGAKAALSVHFARTGNVGELNAKDASQETVIGLFGMLCGSFVMAHITSR 230

Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL-KTGHVPSLREGNLQE 368
             + I    LL+  ++ ++Y  V++V L TL+R R ++A   +  ++ +V +  E   +E
Sbjct: 231 RTTWIALIFLLT-AHLVTNYLAVRAVTLTTLSRQRTNIAYGVYCEQSNNVLTPSEVARRE 289

Query: 369 NIFSFPW-LRD--KPVVLG--------------SRFKDAFQDPDAYLAVEPFFEKERYVV 411
            IF  P  +RD  +  +LG              SR        +A +A    FE E YV+
Sbjct: 290 RIFEVPGAIRDSLQGFLLGTCTICTSPEELLRRSRQGSMASWKNALVA----FEHEAYVL 345

Query: 412 TYNPSK--GKVYALLKDQAKSDDILKAAFHAYVL 443
               +    K+    K  AK +D LKA  +A  L
Sbjct: 346 LGIRADFPCKILICFKHGAKPEDYLKAWVNAVEL 379


>gi|302881823|ref|XP_003039822.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
           77-13-4]
 gi|256720689|gb|EEU34109.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
           77-13-4]
          Length = 444

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 73/294 (24%), Positives = 140/294 (47%), Gaps = 13/294 (4%)

Query: 98  VAERDRSGKFKRNSI-----RAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYM 152
           V + D+SG   R  +     R I    P      +   +    +P G+P SV+  Y+ Y 
Sbjct: 16  VIDLDKSGNVARRWVHMPSNRIIRKAKPSLFKRSMRKILMDAFLPVGYPHSVSDDYLAYQ 75

Query: 153 TWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-G 211
            + +L+ FF     +   + LL  +GV    ++     +  +LKD   RV  +IFA+Q G
Sbjct: 76  FFDSLQAFFSTITALLANRALLQGLGVGDANSSATFAMLLTVLKDAMSRVATIIFAQQFG 135

Query: 212 KKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAANVAKNVAAVTSTSTRTP 270
            + + D K+ RF  DL  +    +EL +    P+  +    +    + +  V   +++  
Sbjct: 136 LRIEPDAKRYRFLADLFNDTAFFMELYSPYFGPYGKILALTSGEALRALCGVAGGASKAA 195

Query: 271 IYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVF 326
           +   FAK +N+ ++ AK       VG +  L+GT   +   + + S++    +L   +++
Sbjct: 196 LSVHFAKHDNLAELNAKEASQETAVGLVGLLVGT-FVVKHVEDHTSVVYLMTVLVLAHLW 254

Query: 327 SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI-FSFPWLRDK 379
            +Y  V+SV ++ LNR R ++  + +LK+G+V +  E   +E+I F  P +R++
Sbjct: 255 MNYLGVRSVCMNVLNRQRATILYKEYLKSGNVLTPNEVARRESILFWSPVVRNQ 308


>gi|414586421|tpg|DAA36992.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 164

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/161 (33%), Positives = 89/161 (55%), Gaps = 6/161 (3%)

Query: 203 GKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAV 262
           GK+I +  G + D + K  R   D+L +LG  +E+ +   P LFL +A   N AK +A V
Sbjct: 5   GKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVV 64

Query: 263 TSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI-----MISKRNPSLITTF 317
            + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I     + S     LI   
Sbjct: 65  AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG- 123

Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
            LLS  +++   QE+++  ++TLN  R ++ V  F+K  ++
Sbjct: 124 PLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKVRYL 164


>gi|414879330|tpg|DAA56461.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
          Length = 115

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 50/88 (56%), Positives = 63/88 (71%), Gaps = 2/88 (2%)

Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHR-SLALKQHEYDNSNL 468
           +VTYNP+K KVYALLKDQAKSDDI+KAAFHA+VLLH IN+S+  R     KQ   D S  
Sbjct: 1   MVTYNPTKDKVYALLKDQAKSDDIIKAAFHAHVLLHFINASHARRLKQKQKQANPDRSEY 60

Query: 469 MLT-TADLEAHIGESCKVVSTALWGFQE 495
           + +   D  AHI ESCK+VS++   F++
Sbjct: 61  LYSRNMDFLAHIAESCKIVSSSYGTFKK 88


>gi|398388772|ref|XP_003847847.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
 gi|339467721|gb|EGP82823.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
          Length = 457

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 62/243 (25%), Positives = 127/243 (52%), Gaps = 5/243 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFP SVT  Y  Y  + +L+ F      + +++ +L+S+GV   +A+P A  +  +L
Sbjct: 58  LPAGFPHSVTEDYTEYQIYDSLQAFSSSIAAMLSSRAVLSSIGVGDTKASPTAALLLSVL 117

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAA 253
           ++  GR+  ++FA R G   + + K  R   D+L +    +++ +   P  + + +   +
Sbjct: 118 QESVGRLATILFAHRFGTSLEPECKMYRLMADVLNDSAFILDVLSPIFPKPIRIVILSFS 177

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT-GLSIMISKRNPS 312
           ++ +++  V + S++  +   FA+  N+G++ AK      +  L+G    S++IS     
Sbjct: 178 SILRSLCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLMGMLAGSLVISWITTP 237

Query: 313 LITTFAL--LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           L T  AL  L   ++ ++ + V++V + TLNR R ++      +   VPS++E +  E+I
Sbjct: 238 LATWTALIGLLSIHLETNRRAVRAVKMRTLNRQRATLVFHHLQRQQTVPSIKEISSVEHI 297

Query: 371 FSF 373
           F +
Sbjct: 298 FEW 300


>gi|358373469|dbj|GAA90067.1| DUF647 domain protein [Aspergillus kawachii IFO 4308]
          Length = 527

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 62/238 (26%), Positives = 117/238 (49%), Gaps = 6/238 (2%)

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
           ++ ++ F+    +P G+P SVT  Y PY  + +L+ F     G+ +++ +L  VGV    
Sbjct: 65  SKSVLDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVGVGNAD 124

Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           A+P A  +  IL+D +GR+  ++FA R G   + + K  R A D+  +L   ++  +  +
Sbjct: 125 ASPTAALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMI 184

Query: 243 P--HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
           P     + +   A V + +  V   S++  +   F++  N+ +V AK      I  L+G 
Sbjct: 185 PAGAPRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGM 244

Query: 301 GLSIMISKRNPSLITTF---ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
            +   +  R  S  TT+    +L   ++  +Y  V+SV + +LNR R ++   + L T
Sbjct: 245 LVGSFVVSRVTSFSTTWICLLMLLALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 302


>gi|167526040|ref|XP_001747354.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774189|gb|EDQ87821.1| predicted protein [Monosiga brevicollis MX1]
          Length = 283

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 56/186 (30%), Positives = 98/186 (52%), Gaps = 5/186 (2%)

Query: 194 ILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA 252
           IL+D  G +G + FA  QG   D + KQ R   D+L +    +EL +  +P  FL L C 
Sbjct: 4   ILRDATGMIGSISFAWIQGSDLDNNAKQWRLVADVLNDTAMLLELISPLLPAYFLFLVCV 63

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKR 309
           A++A+ +  V   +TR  + +  A+  N+GDV AK    E + N+  LL   + + +  +
Sbjct: 64  ASLARAIVGVAGGATRAALIQHQARNNNMGDVAAKDGSQETLVNLVALLVNLILVPLVSQ 123

Query: 310 NPSLI-TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQE 368
           N  ++ + F L +C ++F +Y+ V+S+ +  LN  R  + VE + +   + S +  N +E
Sbjct: 124 NAQVVWSLFGLSTCCHIFFNYKAVRSLCMTRLNGRRAHLIVEHYTRHQQILSPKIANQRE 183

Query: 369 NIFSFP 374
            +F  P
Sbjct: 184 PLFFDP 189


>gi|408393145|gb|EKJ72412.1| hypothetical protein FPSE_07436 [Fusarium pseudograminearum CS3096]
          Length = 427

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 147/311 (47%), Gaps = 12/311 (3%)

Query: 87  SEDIDGRRWNYVAERDRSGKFKRNSIRA----ISLQSPQAPAEELMSFIRSYVVPDGFPD 142
           SE+I       + E D+SG F R+        + LQS +A    L S   +++ P G+P 
Sbjct: 19  SENIGHGEPTEIIEYDKSGNFVRSWTHGPDNIVVLQS-KARTSWLPSMHEAFM-PVGYPH 76

Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRV 202
           SV+  Y+ Y  + +++ FF     +   + LL  +GV    ++     +  +LKD   RV
Sbjct: 77  SVSSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAISRV 136

Query: 203 GKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAANVAKNVA 260
             + FA+Q G + + D K+ RF  DL  +    +EL +  + P+  +         + + 
Sbjct: 137 ATIAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTGEALRALC 196

Query: 261 AVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR---NPSLITTF 317
            V + +++  +   FAK +N+ ++ AK         L+G  +  ++ K    + +++   
Sbjct: 197 GVAAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTIVVKYVEDHNTVVFLM 256

Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR 377
            +L   +++ +Y  V+ V +  LNR R ++  + +LKTG + +  E   +E+I  + W  
Sbjct: 257 IVLVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGQIMTPEEVAKRESILLW-WSS 315

Query: 378 DKPVVLGSRFK 388
            +  + GS +K
Sbjct: 316 TRIEMAGSYYK 326


>gi|440794597|gb|ELR15757.1| hypothetical protein ACA1_379390, partial [Acanthamoeba castellanii
           str. Neff]
          Length = 449

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 57/176 (32%), Positives = 88/176 (50%), Gaps = 27/176 (15%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
            ++S  +P G+P SV   Y+ +  W   K       G+ ++   +  +            
Sbjct: 72  MLKSLFLPTGYPASVRQDYIHFQKWEVAK-------GIVSSAGFVGHL------------ 112

Query: 190 AINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
                   G G +G M+ A   GK FD + K++R+A D +   G  +ELAT   P  FLP
Sbjct: 113 -------VGLGCLGMMLVAGAFGKFFDTETKRIRWAADCIHVAGVALELATPLFPAYFLP 165

Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
           LA  AN AK +A +T+ +T+  I + FA  +N+GD+TAKG   G +A L+G G+ I
Sbjct: 166 LASLANSAKGIAGLTTGATKAAINQGFALRDNLGDITAKGHSQGIVAYLMGMGIGI 221


>gi|378729487|gb|EHY55946.1| hypothetical protein HMPREF1120_04055 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 485

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 152/334 (45%), Gaps = 27/334 (8%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P +VTP Y PY  + +L+ F     G+  ++ +   +GV    A+     + +I 
Sbjct: 59  LPAGYPSTVTPDYTPYQIYDSLQAFASTIAGLLASRAVFVGMGVGSEEASVLTTMLLYIA 118

Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA-CAA 253
           ++  GRV  ++FA Q  ++ + ++K  RF  D++ ++   ++  + A+P L   +  C +
Sbjct: 119 QETVGRVATILFAHQFSQRIEAEVKFYRFFADIVNDVAFVLDCMSPAMPILGRVVTLCLS 178

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL-SIMISKRNPS 312
           N  + V  V   S++  +   FAK  NIG++ AK      +  LLG  +  +++SK + +
Sbjct: 179 NACRAVCGVAGGSSKAILSSHFAKAGNIGELNAKDGSQETVISLLGMWVGGVVVSKVHGT 238

Query: 313 LITTFAL--LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           L T   L  L   +++++++ V+SV L++ N  R  V   + L+ G    + +   +E +
Sbjct: 239 LATWSWLVPLLALHLWANWKAVRSVRLNSFNTNRALVFFRAALE-GRTLDMEQVAQEETV 297

Query: 371 FSFP-------WLRDKPVVLGSRFKDAF-----------QDPDAYLAVEPFFEKERYVVT 412
             +P        L      +GS  +D             Q       +   F  E Y++ 
Sbjct: 298 LGWPAASRGIYGLEPCYFQVGSSVQDLVHRVYAGADQPSQPASTVGHLVQLFCDEDYLLW 357

Query: 413 YNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHC 446
           ++   G+   LLKD A      K   H   +L C
Sbjct: 358 FDIETGEALVLLKDTATGITQAKGLCH---MLRC 388


>gi|218187487|gb|EEC69914.1| hypothetical protein OsI_00335 [Oryza sativa Indica Group]
          Length = 386

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 70/226 (30%), Positives = 102/226 (45%), Gaps = 28/226 (12%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT---PG 187
           I+ +V+P GFP SV+  Y+ YM W+   +  G       T TLL +VGV     T     
Sbjct: 67  IKDFVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAAS 126

Query: 188 AVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
           A AI W+ KDG G  G+++   R G  FD D K+ R   D +   G+  EL T   P  F
Sbjct: 127 AAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYF 186

Query: 247 LPLACAANVAKNV------AAVTSTSTRTPIYKA----FAKGENIGDVTAKGECVGNIAD 296
           LPLA   N+AK +              R P ++     FA+  N+G+V AK        D
Sbjct: 187 LPLASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAK--------D 238

Query: 297 LLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
             G      I    P+L  T+  +   +++  YQ +  +   T++R
Sbjct: 239 TSG------IKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTVDR 278


>gi|259484769|tpe|CBF81274.1| TPA: DUF647 domain protein (AFU_orthologue; AFUA_6G06850)
           [Aspergillus nidulans FGSC A4]
          Length = 564

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 111/230 (48%), Gaps = 14/230 (6%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  Y  Y  + +L+ F     G+  ++ +L  VGV    A+P +  +  IL
Sbjct: 74  LPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPTSALLLHIL 133

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
           +D +GR+  ++FA R G   + + K  RFA D+  +L   ++  +  VP  F  + +  A
Sbjct: 134 QDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGFNRVTVLSA 193

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS 312
           A V + +  V   S++  +   FA+  N+ +V AK      +  L+G    +++     S
Sbjct: 194 AGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAKDSSQETVISLIG----MLVGSFVVS 249

Query: 313 LITTFALLSCGYVF-------SSYQEVKSVVLHTLNRARFSVAVESFLKT 355
            IT+F       VF        +Y  V+SV + TLNR R ++   + L +
Sbjct: 250 HITSFTATWLTLVFLLSMHLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 299


>gi|402582701|gb|EJW76646.1| hypothetical protein WUBG_12446, partial [Wuchereria bancrofti]
          Length = 184

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 77/143 (53%), Gaps = 1/143 (0%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
            +R   +P G+P SV+P Y+ Y  W  ++ F        +T+ +L  VGV    A+  A 
Sbjct: 38  LLRDIFMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAA 97

Query: 190 AINWILKDGAGRVGKMIFARQGKKF-DYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
           A+ W+LKDG G + +++FA     + D D KQ R   D   +L   ++L T   P+LF+P
Sbjct: 98  AVAWLLKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAYCLDLITPIFPNLFMP 157

Query: 249 LACAANVAKNVAAVTSTSTRTPI 271
           + C +++ + V  V  ++TRT +
Sbjct: 158 IICLSSMIRAVVGVAGSATRTTV 180


>gi|317143185|ref|XP_001819302.2| hypothetical protein AOR_1_114154 [Aspergillus oryzae RIB40]
          Length = 510

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 59/226 (26%), Positives = 111/226 (49%), Gaps = 6/226 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  YVPY  + +L+ F     G+ +++ +L  VGV    A+P A  +  IL
Sbjct: 76  LPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHIL 135

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
           +D +GR+  ++FA R G   + + K  R A D+  +    ++  +  +P  F  + +   
Sbjct: 136 QDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLST 195

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS 312
           A V + +  V   S++  +   F++  N+ +V AK      I  L+G  +   +  R  S
Sbjct: 196 AGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVGSFVVSRVTS 255

Query: 313 LITTF---ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
              T+    +L   ++  +Y  V+SV + +LNR R ++   + L +
Sbjct: 256 YTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNS 301


>gi|67539168|ref|XP_663358.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
 gi|40743657|gb|EAA62847.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
          Length = 559

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 62/223 (27%), Positives = 114/223 (51%), Gaps = 5/223 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  Y  Y  + +L+ F     G+  ++ +L  VGV    A+P +  +  IL
Sbjct: 74  LPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPTSALLLHIL 133

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
           +D +GR+  ++FA R G   + + K  RFA D+  +L   ++  +  VP  F  + +  A
Sbjct: 134 QDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGFNRVTVLSA 193

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS 312
           A V + +  V   S++  +   FA+  N+ +V A+   +  I  L+G+ +   I+    +
Sbjct: 194 AGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAE-TVISLIGMLVGSFVVSHITSFTAT 252

Query: 313 LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
            +T   LLS  ++  +Y  V+SV + TLNR R ++   + L +
Sbjct: 253 WLTLVFLLSM-HLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 294


>gi|303278700|ref|XP_003058643.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459803|gb|EEH57098.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 679

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 65/224 (29%), Positives = 107/224 (47%), Gaps = 11/224 (4%)

Query: 207 FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTS 266
           FA   +  D D+K+ RF   ++ +  + +E+ T   P  FL LA  ANV K+V   T+  
Sbjct: 306 FACMKRPSDSDVKRFRFTSSVVYDCASLIEMITPFYPSKFLLLATIANVGKSVGITTANV 365

Query: 267 TRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK-----RNPSL-----ITT 316
            R PI  +FA  EN+ ++ AK      +AD +G  L+++ +       NP L     +  
Sbjct: 366 VRAPIQMSFALEENLAEIAAKTSAQQVLADNIGLALAVVATGWTGKIANPRLRLAVPLLA 425

Query: 317 FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWL 376
           FA L+   ++  Y+E+KSV L T+N+ R  +    F++TG +PS       E +F    +
Sbjct: 426 FAPLASVDLYCIYRELKSVRLRTINKERGEIIAAGFVETGTIPSAVSVAAAERLFIPARM 485

Query: 377 RDKPVVLG-SRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
            +  + L  +   +A   P+A            Y +TY P +GK
Sbjct: 486 DESSLPLKITHLAEACPTPEALRTALGDDPSRPYCLTYLPPRGK 529



 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 48/84 (57%), Gaps = 3/84 (3%)

Query: 142 DSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGR 201
           ++VTP Y  Y  +R  +      + VF TQ +L +VG+   R  P A A+NW+LKDG GR
Sbjct: 179 NAVTPDYWAYAKFRFAQRVASSCITVFATQQMLQAVGLGATRRLPAAAAVNWVLKDGLGR 238

Query: 202 VGKMIFARQ-GKKFD--YDLKQLR 222
           +GK+  A   G++F   Y L  +R
Sbjct: 239 LGKLSVATNFGREFGAFYTLVPIR 262


>gi|320038116|gb|EFW20052.1| hypothetical protein CPSG_03227 [Coccidioides posadasii str.
           Silveira]
          Length = 478

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 83/340 (24%), Positives = 156/340 (45%), Gaps = 26/340 (7%)

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKF 214
           +L+ F     G+ +++ +L  VGV    A+P +  +  IL+D  GR+  ++FA Q G   
Sbjct: 92  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 151

Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
           + + K  R A D+  +    ++  +   P  + + L   ++V + +  V + S++  +  
Sbjct: 152 EPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 211

Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGL-SIMISKRNPSLITTFALLS--CGYVFSSYQ 330
            FAK  N+ ++ AK      +  L+G  + S+++S     L T  AL+S    ++ ++Y 
Sbjct: 212 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLIVHITTNYL 271

Query: 331 EVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVV---- 382
            V++V + +LNR R ++   +      V + ++ + QE +F       W      +    
Sbjct: 272 AVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGFCQ 331

Query: 383 LGSRFKD----------AFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDD 432
           +G  FKD          A +     +A+   FE+E Y++ ++P++ +    LK  A    
Sbjct: 332 IGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYILWFDPARKRGVIGLKTNASPTS 391

Query: 433 ILKAAFHAYVLLHC--INSSNGHRSLALKQHEYDNSNLML 470
            LKA  HA +  +   I  SN    L  +Q   D S LML
Sbjct: 392 QLKAWSHALLAANSFKIKESNTREPLVREQVGTD-SVLML 430


>gi|46137815|ref|XP_390599.1| hypothetical protein FG10423.1 [Gibberella zeae PH-1]
          Length = 425

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 72/310 (23%), Positives = 144/310 (46%), Gaps = 10/310 (3%)

Query: 87  SEDIDGRRWNYVAERDRSGKFKRNSIRA---ISLQSPQAPAEELMSFIRSYVVPDGFPDS 143
           SE+I       + E D+SG F R+       I +   +     L S   +++ P G+P S
Sbjct: 19  SENIGHGERTEIIEYDKSGNFVRSWTHGPDNIVVLKSKVCTSWLPSMHEAFM-PVGYPHS 77

Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVG 203
           V+  Y+ Y  + +++ FF     +   + LL  +GV    ++     +  +LKD   RV 
Sbjct: 78  VSSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAISRVA 137

Query: 204 KMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAANVAKNVAA 261
            + FA+Q G + + D K+ RF  DL  +    +EL +  + P+  +         + +  
Sbjct: 138 TIAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTGEALRALCG 197

Query: 262 VTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR---NPSLITTFA 318
           V + +++  +   FAK +N+ ++ AK         L+G  +  ++ K    + +++    
Sbjct: 198 VAAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTLVVKYVEDHNTVVFLMI 257

Query: 319 LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRD 378
           +L   +++ +Y  V+ V +  LNR R ++  + +LKTG + +  E   +E+I  + W   
Sbjct: 258 VLVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGKIMTPEEVATRESILIW-WSST 316

Query: 379 KPVVLGSRFK 388
           +  + GS +K
Sbjct: 317 RIEMAGSYYK 326


>gi|149392593|gb|ABR26099.1| unknown [Oryza sativa Indica Group]
          Length = 282

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 70/249 (28%), Positives = 125/249 (50%), Gaps = 17/249 (6%)

Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
           D+L + G  +E+ +   P LFL +A   N AK +A V + +TR PIY +FAK  N+ D+ 
Sbjct: 1   DVLYDFGTALEVISPLCPQLFLEVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLF 60

Query: 286 AKGECVGNIADLLGTGLSI-----MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
           AKGE +  + +++G G  I     + S     LI    LLS  +++   QE+++  ++TL
Sbjct: 61  AKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG-PLLSVVHIYGVVQEMRATPVNTL 119

Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFP---WLRDKPVVLGSRFKDAFQDPDAY 397
           N  R ++ V  F+K+G V S  E   +E++  FP         V +G   +     P   
Sbjct: 120 NPQRTAMIVADFIKSGKVSSPAELRYREDLL-FPNRLIEEAGSVKIGQPVRRVL-SPQRI 177

Query: 398 LAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILK----AAFHAYVLLHCINSSNGH 453
             ++  F KE+++++   +    Y +L+  A  +D L+    AAF + +    + S +  
Sbjct: 178 EQLKATFSKEKFLLSRKDNSA--YMVLEQSATGEDALRGWLVAAFASEMERSGVGSGDTV 235

Query: 454 RSLALKQHE 462
            ++A ++ E
Sbjct: 236 LNVAYERME 244


>gi|157869108|ref|XP_001683106.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68223989|emb|CAJ04889.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 678

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 67/268 (25%), Positives = 117/268 (43%), Gaps = 22/268 (8%)

Query: 98  VAERDRSGKFKRNSIRAISLQSPQAPA---------EELMSFIRSYVVPDGFPDSVTPSY 148
           VAE D    F        S  +P  PA           +++ +   ++PDG+P SVT   
Sbjct: 26  VAEVDAQDFFTEPGGLVTSPTTPSTPAGSTSARQARRRVVTTLERILLPDGYPSSVTEDL 85

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           +P+  W   +          +T+ +L  VGV  ++A   +  ++W+++DG   +G ++FA
Sbjct: 86  LPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADLTSATLSWMMQDGTRMIGSVVFA 145

Query: 209 ---RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP---HLFLPLACAANVAKNVAAV 262
               QG   +   K  +   D   ++   +EL    +P     F  +  AA+V K + +V
Sbjct: 146 PLIPQG--LECRAKTWQLVADFTTDIANLLELCAPWLPGGQTTFRVVLVAASVVKALGSV 203

Query: 263 TSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL-----ITTF 317
               TR    + FA   N  D+ AK     ++   +G  L + ++   P+      +  F
Sbjct: 204 CRNCTRASFTRHFALRNNAADIAAKASTRSSVGSFIGLVLGVAVAYIVPATSRSLNLVVF 263

Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARF 345
           AL S  ++F+SY+ V+ V L  LN  R 
Sbjct: 264 ALGSVLHIFASYRSVRGVQLKHLNAPRL 291


>gi|303316636|ref|XP_003068320.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240108001|gb|EER26175.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 478

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 83/340 (24%), Positives = 156/340 (45%), Gaps = 26/340 (7%)

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKF 214
           +L+ F     G+ +++ +L  VGV    A+P +  +  IL+D  GR+  ++FA Q G   
Sbjct: 92  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 151

Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
           + + K  R A D+  +    ++  +   P  + + L   ++V + +  V + S++  +  
Sbjct: 152 EPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 211

Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGL-SIMISKRNPSLITTFALLS--CGYVFSSYQ 330
            FAK  N+ ++ AK      +  L+G  + S+++S     L T  AL+S    ++ ++Y 
Sbjct: 212 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLIVHITTNYL 271

Query: 331 EVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVV---- 382
            V++V + +LNR R ++   +      V + ++ + QE +F       W      +    
Sbjct: 272 AVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGFCQ 331

Query: 383 LGSRFKD----------AFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDD 432
           +G  FKD          A +     +A+   FE+E Y++ ++P++ +    LK  A    
Sbjct: 332 IGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYILWFDPARKRGVIGLKTNASPTS 391

Query: 433 ILKAAFHAYVLLHC--INSSNGHRSLALKQHEYDNSNLML 470
            LKA  HA +  +   I  SN    L  +Q   D S LML
Sbjct: 392 QLKAWSHALLAANSLKIKESNTREPLVREQVGTD-SVLML 430


>gi|171678219|ref|XP_001904059.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937179|emb|CAP61836.1| unnamed protein product [Podospora anserina S mat+]
          Length = 448

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 59/241 (24%), Positives = 123/241 (51%), Gaps = 5/241 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P +VT  Y+PY T+ +L+ F      +  ++ +L  +GV  + A+P    I  I 
Sbjct: 47  LPAGYPHTVTTDYLPYQTYDSLQAFASSITSLLASRAVLEGLGVGSSEASPTGALILKIT 106

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL-FLPLACAA 253
            D   R+  ++FA R G+  + + K  RF  D+  +    ++L T A+P+L  L +  +A
Sbjct: 107 GDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDAAQFLDLLTPALPYLPKLGVIVSA 166

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT---GLSIMISKRN 310
            V +++  V + +++  +   FA   N+ ++ AK      +  LLG     L + + +  
Sbjct: 167 GVLRSLCGVAANASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLVGSLVVRLVEDK 226

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
             +     +L+  ++  +Y+ V++V + +LNR R ++    +L++G V +  + + +E+I
Sbjct: 227 HKVWGLMIILAGCHLAMNYRAVRAVRMTSLNRQRATIVFREWLESGTVLNPAQVSQRESI 286

Query: 371 F 371
            
Sbjct: 287 L 287


>gi|356510851|ref|XP_003524147.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
          Length = 158

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 68/113 (60%)

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LKDG   VGK+I +  G   D + K+ R   D L ++G G+E+ +   PHLFL +A   N
Sbjct: 1   LKDGMQHVGKLICSNWGGTMDSEPKRWRLLADALYDIGTGLEVLSPRCPHLFLEMAGLGN 60

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
            AK ++ V + +TR PIY +FAK  N+ D+ AKGE    + +++G G+ I ++
Sbjct: 61  FAKGMSVVAARATRLPIYSSFAKEGNLSDLLAKGEAFSTLFNVIGIGVGIQLA 113


>gi|393227224|gb|EJD34913.1| DUF647-domain-containing protein [Auricularia delicata TFB-10046
           SS5]
          Length = 377

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 143/317 (45%), Gaps = 33/317 (10%)

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
           +L+ FF    G+   + +L  +GV    AT     +  I++D  GRV  ++ A + G   
Sbjct: 4   SLQAFFSEIAGLLAARAVLTGLGVGDAGATATGAVLLKIIQDTCGRVATILSAWKLGPML 63

Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
           + + KQ R A DL  + G  +E  +  +PH + + L C A V +++  V++   +  + +
Sbjct: 64  EPECKQWRLAADLFNDAGIILECLSPTLPHTMKISLLCVARVVRSLCGVSAGGAKAALSQ 123

Query: 274 AF-AKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL---ITTFALLSCGYVFSSY 329
            F   G +I D+ AK      +  L+G  +  MI  R  S     T    L   ++++++
Sbjct: 124 HFVGPGGSIADINAKESSQETVVALIGMLVGSMIVPRITSFEGTWTALLFLIGVHLYANW 183

Query: 330 QEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL------ 383
           + V++V + TLNR R ++ +   ++ G V S  +   +E++F     R   +VL      
Sbjct: 184 RGVRAVAMLTLNRQRATIVLGELIQRGRVLSPEQVRQRESVFC----RSSRLVLDGVTLG 239

Query: 384 ----GSRFKDAF------QDPD----AYLAVEPFFEKERYVV-TYNPSKGKVYAL--LKD 426
               G   +           PD    A   +   F  ERY++      KGKV  +  LK 
Sbjct: 240 QCHIGVSLQHLLAQIARVHGPDRSGSALAGLMDRFASERYILHCCCRRKGKVSIIICLKR 299

Query: 427 QAKSDDILKAAFHAYVL 443
             +S D+L+A  HA VL
Sbjct: 300 GVESRDMLRAWTHALVL 316


>gi|219127360|ref|XP_002183905.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404628|gb|EEC44574.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 891

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 87/387 (22%), Positives = 161/387 (41%), Gaps = 71/387 (18%)

Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
           + F     +P G+P SV   Y+ Y  + +++        V  +  +L + GV    AT  
Sbjct: 431 IRFATELFMPVGYPYSVREGYLSYQIYDSIQGLSSYLRAVLCSAQVLQAAGVGDANATAI 490

Query: 188 AVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL- 245
             A+ W LKDG G +G +IF+       D  +K+ R   D++ ++G  +++    VP   
Sbjct: 491 MAALTWALKDGLGMIGGLIFSYGVASHLDAYVKEFRLFADVINDVGMLLDMLAPYVPRAD 550

Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
            + ++ AA + K +  + + +T++ I + FA   N+ D+ AK      + +L+G    + 
Sbjct: 551 LIWISSAATLCKVMCGIAAGATKSSITQHFALQGNMADLNAKEATQETLVNLIGMVCGVT 610

Query: 306 ISKRNPSL---------------------------ITTFALLSCGYVFSSYQEVKSVVLH 338
           +++    L                              F +L+  +++S+++ V  + L 
Sbjct: 611 LARYLSQLEAGNEEGDTDRIGYRSWIPCISHDSIQWMVFTILTLLHIWSNWKGVSLLRLE 670

Query: 339 TLNRARFSVAV-----------------------------ESFLKT-----GHVPSLREG 364
           TLNR R  V +                             E  L+T      ++P  R  
Sbjct: 671 TLNRERTKVVLCAAGLWRALVPDCEALRSEKYAELGLLRQEMLLETLQKCIENIP--RPN 728

Query: 365 NLQENIFSFPW--LRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
           +++E++ S  W  L    +VL +R KD       + A + F   E+YVV  +  +G+   
Sbjct: 729 DMRESLASSTWDLLMPTRLVLHARLKDMIA--IDWTATKDFV-NEKYVVVLS-DRGRALV 784

Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINS 449
            L   A   D LKA  H  ++L C++S
Sbjct: 785 CLLTGAAPMDELKAFMHGLLILKCLDS 811


>gi|356498113|ref|XP_003517898.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
          Length = 171

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 69/113 (61%)

Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           LK+G   VGK+I +  G   D++ K+ R   D L ++G G+E+ +   PHLFL +A   N
Sbjct: 1   LKNGMQHVGKLIGSNWGGTMDFEPKRWRLLADALYDIGTGLEVLSPWCPHLFLEMAGLGN 60

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
            AK +A V + +TR PIY +FAK  N+ D+ AKGE    + +++G G+ I ++
Sbjct: 61  FAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLA 113


>gi|156059886|ref|XP_001595866.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980]
 gi|154701742|gb|EDO01481.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 477

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 113/449 (25%), Positives = 201/449 (44%), Gaps = 77/449 (17%)

Query: 71  RLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSF 130
           +L E + V+ S +LI +         Y+     S    +   R I +  P+ P+  + S 
Sbjct: 3   KLKEIKEVNQSGKLIAT---------YIQSHAHSTSLDQKDSR-IDIHIPKGPSTWVSSL 52

Query: 131 IRSYVVPDGFPDSVTPSY---VP------YMTWRALKHFFGGAMGVFTTQTLLNSVGVSR 181
             +++ P G+PDSVT  Y   VP      Y+    L  F     G+  ++ +L  +GV  
Sbjct: 53  TGAFL-PVGYPDSVTEDYTACVPCPIHLLYLLCEKLA-FASTIAGLLASRAVLQGLGVGD 110

Query: 182 NRATP-GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELAT 239
           + A+  GAV +N +L++ AGR+  ++FA R G   + + K+ R   D+  +    ++  +
Sbjct: 111 STASATGAVLLN-VLQESAGRIATILFAHRLGSALEPECKKYRLMADIFNDGAMILDCIS 169

Query: 240 AA---VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 296
            A   VP +FL    A++V K++  V + S++  +   FAK  N+ ++ AK      +  
Sbjct: 170 PAFPKVPRVFL--LSASSVCKSLCGVAAGSSKASLSAHFAKQGNLAELNAKDASQETLIS 227

Query: 297 LL----GTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
           LL    GT +   IS +  + I   +LLS  ++ ++Y  V+SV + TLNR R ++ +  F
Sbjct: 228 LLGMLVGTFVVSRISSQTATWIALISLLSI-HLGTNYLAVRSVTMRTLNRQRANLVISDF 286

Query: 353 L------KTG-HVPSLREGNLQENIFSFPWLRD------KPVVLG--------------- 384
           +      KT  ++P+ ++ + +E IF     RD      + VVLG               
Sbjct: 287 VSNINAEKTRFNLPTPKDISRKERIFE----RDGAIRHIQGVVLGYCKVGVSLREILSSI 342

Query: 385 ---SRFKDAFQDPDAYLAVEPF--FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFH 439
                   ++++  + +    F  ++ + YV+ Y+  +     +LK+       L A FH
Sbjct: 343 SSSPTVSGSYKEETSSIITSLFKIYKNQGYVMWYSRRQNTFLVILKENTAPSVQLDAWFH 402

Query: 440 AYVLLHCI------NSSNGHRSLALKQHE 462
           A   L         NS+NG     LK  E
Sbjct: 403 AVWALSVSSSPAVGNSTNGSDQDILKWIE 431


>gi|154292218|ref|XP_001546685.1| hypothetical protein BC1G_14192 [Botryotinia fuckeliana B05.10]
          Length = 449

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 97/427 (22%), Positives = 189/427 (44%), Gaps = 84/427 (19%)

Query: 94  RWNYVAERDRSGKFKRNSIRA------------ISLQSPQAPAEELMSFIRSYVVPDGFP 141
           R   + E ++SGK     IR+            I +  P+ P+  +MS + +++      
Sbjct: 3   RSEEIKEINQSGKLIATYIRSHTTDTSGTKFDRIDVDIPKGPSTWVMSLVGAFM------ 56

Query: 142 DSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGR 201
               P++   +             G+  ++ +L  +GV  + A+     +  IL++ AGR
Sbjct: 57  ----PAFASTIA------------GLLASRAVLQGLGVGDSTASATHAVLLSILQESAGR 100

Query: 202 VGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAA---VPHLFLPLACAANVAK 257
           +  ++FA R G   + + K+ R   D+  +    ++  + A   +P +FL    A++V K
Sbjct: 101 ISTILFAHRLGSALEPECKKYRLMADIFNDAAMILDCISPAFPKIPRVFL--LSASSVCK 158

Query: 258 NVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL----GTGLSIMISKRNPSL 313
           ++  V + S++  +   FAK  N+ ++ AK      +  LL    GT +   IS +  + 
Sbjct: 159 SLCGVAAGSSKASLSAHFAKMGNLAELNAKDASQETLISLLGMLVGTFVVSKISSQVATW 218

Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH-------VPSLREGNL 366
           I   ALLS  ++ ++Y  V+SV + TLNR R ++ + S L +         +PS R+ +L
Sbjct: 219 IALLALLSI-HLGTNYMAVRSVTMRTLNRQRANLVISSLLSSPEHEKNKISLPSPRDISL 277

Query: 367 QENIFSFPWLRDKPV-----------VLGSRFKD------------AFQDPDAYL-AVEP 402
           QE IF     RD  +            +G   +D            ++ +P++ L ++  
Sbjct: 278 QERIFE----RDGAIRNTHGTILAYCKVGVSLQDLLSSIATPTTAGSYAEPESILTSLLS 333

Query: 403 FFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHE 462
            ++ + Y++ Y+ S+     +LK++      L A FHA   +  I S N  +    K + 
Sbjct: 334 IYKTQGYLMWYSRSRNTFLVILKEETAPKVQLDAWFHA---IFAITSGN-QKIQKSKTNH 389

Query: 463 YDNSNLM 469
            DN +++
Sbjct: 390 VDNQSIL 396


>gi|392871321|gb|EAS33216.2| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
          Length = 407

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 82/340 (24%), Positives = 155/340 (45%), Gaps = 26/340 (7%)

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKF 214
           +L+ F     G+ +++ +L  VGV    A+P +  +  IL+D  GR+  ++FA Q G   
Sbjct: 21  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 80

Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
           + + K  R A D+  +    ++  +   P  + + L   ++V + +  V + S++  +  
Sbjct: 81  EPECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 140

Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGL-SIMISKRNPSLITTFALLS--CGYVFSSYQ 330
            FAK  N+ ++ AK      +  L+G  + S+++S     L T  AL+S    ++ ++Y 
Sbjct: 141 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLIVHITTNYL 200

Query: 331 EVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVV---- 382
            V++V + +LNR R ++   +      V + ++ + QE +F       W      +    
Sbjct: 201 AVRAVSMTSLNRQRANIVFSTLFDENRVLTAKQASKQERVFERDGVLRWKASSDTLGFCQ 260

Query: 383 LGSRFKDAFQDPDA----------YLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDD 432
           +G  FKD      +           +A+   FE+E Y++ ++P++ +    LK  A    
Sbjct: 261 IGVPFKDVIDHVSSSSPAERTEATIVAILRVFEQEEYILWFDPARKRGVIGLKTNASPTS 320

Query: 433 ILKAAFHAYVLLHC--INSSNGHRSLALKQHEYDNSNLML 470
            LKA  HA +  +   I  SN    L  +Q   D S LML
Sbjct: 321 QLKAWSHALLAANSLKIKESNTRGPLVREQVGTD-SVLML 359


>gi|327294507|ref|XP_003231949.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
 gi|326465894|gb|EGD91347.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
          Length = 483

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 88/384 (22%), Positives = 169/384 (44%), Gaps = 41/384 (10%)

Query: 120 PQAPAE-ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
           P+ P+    ++ I +  +P G+P SVT  Y+ Y  + +L+ F     G+ +++ +L  VG
Sbjct: 46  PKLPSSTSYLTSILNVFLPAGYPHSVTDDYIEYQIYDSLQAFSSSIAGLLSSRAVLQGVG 105

Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
           V    A+P A  +  +L++  GR+  ++FA R G   + + K  R A D+L +    ++ 
Sbjct: 106 VGDATASPTAALLLNVLQESMGRIATILFAHRLGTSLEPECKLYRLAADILNDSAMVLDC 165

Query: 238 ATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 296
            +   P  + + L   ++V + +  V + S++  +   FA+  N+G++ AK      +  
Sbjct: 166 LSPIFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVIS 225

Query: 297 LLGTGL-SIMISKRNPSLITTFAL--LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
           L G  + S+++S     L T  AL  L   ++ +++  V++V + TLNR R ++      
Sbjct: 226 LAGMLVGSVVVSYITSPLETWIALIILLIVHLGTNHAAVRAVKMTTLNRQRANIVFSYLF 285

Query: 354 KTGHVPSLREGNLQENIFS----FPWLRDKPVVLGS-----------RFKDAFQDPDAYL 398
           +   V +  E + +E IF       W  +   +LG+                  D    L
Sbjct: 286 EDDRVLTPAEASEEERIFERDGVLRWNAESSGILGTCQIGTSLEQLLLLLPEQADRGGAL 345

Query: 399 -----------------AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
                            A+   F +E Y++ ++P   +   +LK  A +   LKA  HA 
Sbjct: 346 ETGQTSSTTMNTSTNLTALLELFREEEYILYFDPLSRRGAIVLKAGATAKAQLKAWSHAL 405

Query: 442 VLLHCINSSNGHRSLALKQHEYDN 465
           ++   +   N   S A  +HE  +
Sbjct: 406 LVSRHLAGKN---STATCRHEKQD 426


>gi|402074762|gb|EJT70271.1| hypothetical protein GGTG_12444 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 438

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 28/302 (9%)

Query: 78  VDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVP 137
           VD +  ++C+ D  G     V E D S K ++  +       P A A      I    +P
Sbjct: 8   VDQTGNVLCTYDQSGA----VGEDDSSLKTEKTHV-------PLAKA------IFYAFLP 50

Query: 138 DGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
            GFP SVT  Y+ Y T+ +L+ F      +   + +L  +GV     +P A  +  IL+D
Sbjct: 51  AGFPHSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGNASQSPTAALVLQILQD 110

Query: 198 GAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP---LACAA 253
              R+  ++FA R G   + + K  RF  DL  +  A   L        FLP   +  +A
Sbjct: 111 TFSRLATILFAHRMGTAIEPECKTYRFMADLFND--ASQLLDLLVPVLPFLPKVSVMVSA 168

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKR 309
            + +++  V + +++  +   FAK  N+ ++ AK       V  +  L+GT +  M+  +
Sbjct: 169 AILRSLCGVAAGASKASLSAHFAKMGNLAELNAKEASQETVVSLVGMLVGTAVVHMVEDK 228

Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
                    L++  ++  +Y+ V+ V + TLNR R ++    FL+TG V +  +   +E+
Sbjct: 229 RAVWCWMIVLIAI-HLMMNYRAVRCVKMLTLNRQRATIVFREFLETGKVLTPGQVAARES 287

Query: 370 IF 371
           I 
Sbjct: 288 IL 289


>gi|164423593|ref|XP_001728067.1| hypothetical protein NCU11380 [Neurospora crassa OR74A]
 gi|157070159|gb|EDO64976.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 357

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 69/265 (26%), Positives = 126/265 (47%), Gaps = 12/265 (4%)

Query: 116 SLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLN 175
           SL++ Q     L + + +++ P G+P SVTP Y+PY T+ +L+ FF     +   + +L 
Sbjct: 25  SLKAHQTKVAPLHALLNAFL-PAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLE 83

Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDL------L 228
            +GV    ++P A  I  I  D   R   ++FA R G   + + K  RF  D+      L
Sbjct: 84  GLGVGDASSSPTAALILKITSDTISRFATILFAHRLGTAIEPECKFFRFLADVLNDAAQL 143

Query: 229 MELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFA-KGENIGDVTAK 287
           ++L   V +       L L +   + V ++   V + +++  +   F  +  ++G++ AK
Sbjct: 144 LDLLMPVMMMGETRYGLKLGVVVGSGVLRSWCGVAAGASKASLSAHFVTQRSSLGELNAK 203

Query: 288 GECVGNIADLLGTGLSIMISKRNPSLITTFAL---LSCGYVFSSYQEVKSVVLHTLNRAR 344
                 +  LLG  +  ++ K     +  + L   L   ++  +Y+ V+ V L TLNR R
Sbjct: 204 EASQETVVSLLGMLVGSLVVKVVQDRLVVWVLMVGLLGVHLLMNYRGVRCVRLRTLNRQR 263

Query: 345 FSVAVESFLKTGHVPSLREGNLQEN 369
            ++ V S+L+TG V    E  L+ +
Sbjct: 264 ATMVVRSWLETGRVLGPEEDGLESH 288


>gi|346318934|gb|EGX88536.1| DUF647 domain-containing protein [Cordyceps militaris CM01]
          Length = 467

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 143/295 (48%), Gaps = 27/295 (9%)

Query: 98  VAERDRSGKFKRNSI-----RAISLQSPQAPAEELMS--------FIRSYVVPDGFPDSV 144
           V+E D+SG  ++  +     R + ++ P+    +L S         +    +P GFP SV
Sbjct: 26  VSELDKSGHVQQKWLYEPSGRIVHIR-PKGHTNDLFSRSLREWKKILSDSFLPVGFPHSV 84

Query: 145 TPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGK 204
           +  Y+ Y T+ +L+ FF     +   + LL  +GV    ++     +  IL+D   RV  
Sbjct: 85  SSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFALLLTILRDATSRVAT 144

Query: 205 MIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA-----KN 258
           + FA R G + + D K+ RF  DL  +    +EL +   P+L  P    A ++     + 
Sbjct: 145 IAFAHRFGLRIEPDAKRYRFLADLFNDSAFFLELYS---PYLS-PWGKVAALSIGEALRA 200

Query: 259 VAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGTGLSIMISKRNPSLIT 315
           +  V++ +++  +   FAK +N+ ++ AK    E    +  LL   + + + + + S++ 
Sbjct: 201 LCGVSAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVGTVVVKVVEDHRSVVF 260

Query: 316 TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
              LL  G+++ +Y  V SV + TLNR R ++  + +L+TG V S  E + +E I
Sbjct: 261 LMILLVFGHLWMNYLGVCSVEMTTLNRQRATILFQEYLQTGKVLSPAEVSKREYI 315


>gi|310789554|gb|EFQ25087.1| hypothetical protein GLRG_00231 [Glomerella graminicola M1.001]
          Length = 484

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 104/438 (23%), Positives = 191/438 (43%), Gaps = 66/438 (15%)

Query: 98  VAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIR---------SYVVPDGFPDSVTPSY 148
           +AE D++G  K   + +I +     P  +++  +R            +PDG+P SV+  Y
Sbjct: 25  IAEYDKAGSLKAKYVESIPIIGH--PRVDVLLPVRRKKIWLQLLEIFLPDGYPHSVSDDY 82

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
             Y  + +++ F G   G+ +++ +   +GV  + A+P    +  ++++  GR   + FA
Sbjct: 83  AAYQIYDSIQAFAGSIAGMISSRAVWEGLGVGDSLASPTGAMLIQVIRESTGRFATITFA 142

Query: 209 RQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH--LFLPLACAANVAKNVAAVTST 265
              G   + + K  R A D+L ++   ++  +   P    FL L C +++  + + V   
Sbjct: 143 HLFGTSIEAECKAYRLASDILCDVAMVLDCLSPFFPRSIRFLVL-CFSSLLYSASGVAGN 201

Query: 266 STRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPSLITTFALLS-- 321
           ++++ +   FAK  N+G++ AK         L+G  TG + M+S       T  AL+S  
Sbjct: 202 ASKSSLSGHFAKWNNLGELNAKDASQETAISLMGMITG-TFMVSLLTGPKATWVALISLL 260

Query: 322 CGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV-----PSLREGNLQENIFS-FPW 375
             ++F +++ V++V   +LNR R ++A  +      V      S RE   +E   S F W
Sbjct: 261 ALHLFLNWKGVRAVKSRSLNRQRANIAFSALFSKDQVLTPVIVSEREFIFEERWGSVFRW 320

Query: 376 LRDKPVVLG-SRFKDAFQDPDAYLA-----VEPF-------------FEKERYVVTYNPS 416
              K  VLG   F  + Q+    LA     ++ F             +++E YV+    S
Sbjct: 321 NSGK--VLGHCEFGVSLQEILQSLASGQERMKTFHLSTLELANLLNIYDEEEYVLWCEAS 378

Query: 417 KG---------KVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
           +          KV  +LKD       LKA FH  +L   +   +G +   L +   D   
Sbjct: 379 ETVYCDAASRVKVMVVLKDGVTPKSQLKAWFHGLLLTRRL---SGQKESELLRAGMDQQA 435

Query: 468 LMLTTADLEAHIGESCKV 485
           ++       +HI ES +V
Sbjct: 436 ML-------SHIQESLRV 446


>gi|449690512|ref|XP_002160836.2| PREDICTED: UPF0420 protein-like, partial [Hydra magnipapillata]
          Length = 172

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 70/123 (56%), Gaps = 1/123 (0%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFP+SV+  Y+ Y  W  L+ F     G+  TQ +L + GV  N AT  A  I W+L
Sbjct: 50  LPQGFPESVSEDYLRYQLWDTLQAFCSSITGMLATQAMLKAYGVGDNTATVTAATITWML 109

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KDGAG +G+++FA ++G + D D K+ R   D+L      ++L        F+ +AC ++
Sbjct: 110 KDGAGMIGRILFAWKKGTQLDCDAKRWRLVADVLNNCAIMLDLVAPLFKQYFVLIACLSS 169

Query: 255 VAK 257
           VA+
Sbjct: 170 VAR 172


>gi|238488016|ref|XP_002375246.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
 gi|220700125|gb|EED56464.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
          Length = 513

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 113/229 (49%), Gaps = 9/229 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  YVPY  + +L+ F     G+ +++ +L  VGV    A+P A  +  IL
Sbjct: 76  LPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHIL 135

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
           +D +GR+  ++FA R G   + + K  R A D+  +    ++  +  +P  F  + +   
Sbjct: 136 QDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLST 195

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL----SIMISK 308
           A V + +  V   S++  +   F++  N+ +V AK      I  L+G  +    S ++S+
Sbjct: 196 AGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVRFVGSFVVSR 255

Query: 309 RNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
                 T  +  +L   ++  +Y  V+SV + +LNR R ++   + L +
Sbjct: 256 VTSYTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNS 304


>gi|412993086|emb|CCO16619.1| predicted protein [Bathycoccus prasinos]
          Length = 454

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 72/255 (28%), Positives = 121/255 (47%), Gaps = 27/255 (10%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR---ATPGAVAIN 192
           +PDGFP++V+  Y+P++ W+ L  FF   + V + Q+LL +VG+  N+   A P A A  
Sbjct: 41  LPDGFPNTVSRDYLPWLKWQMLSLFFRDVLEVLSAQSLLVAVGMDVNQANAAAPVAGAAK 100

Query: 193 WILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 251
           W+LKDG G +  +   A   KKFD + K      + L ++   +EL T   P  FL +A 
Sbjct: 101 WVLKDGTGGLATLALGALNTKKFDENPKFFWSTANSLEDVSRAMELLTPLFPSHFLLIAG 160

Query: 252 AANVAKNVAAVTSTSTRTPIYKA-FAKGENIGDVTAKGECVGNIADL--LGTGLSIMISK 308
            A   ++ A    +S     +   F++ EN GD+  K E  G    +  L  G+++  + 
Sbjct: 161 TATFVRSAALTGRSSLINGTFMTHFSRNENAGDIRTKLEVQGRWLQIAALPVGIALFRAV 220

Query: 309 RN--------PSL----------ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVE 350
            N         SL          + T+  +   + F  Y   KS+   TL+R R  +  +
Sbjct: 221 TNGLELDFATNSLGQEMKGVTVGVATYGSMLFAHSFCCYMAAKSLNFETLSRKRALIQAK 280

Query: 351 SFLKTG--HVPSLRE 363
           +F+++   +V S+ E
Sbjct: 281 TFVESNGKNVLSVEE 295


>gi|296473307|tpg|DAA15422.1| TPA: hypothetical protein LOC767862 [Bos taurus]
          Length = 232

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 71/135 (52%), Gaps = 1/135 (0%)

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
           A    S + +  +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV   +
Sbjct: 60  ASPPFSRLLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAK 119

Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           A+  A    W++KD  G +G+++FA   G K D + KQ R   D+L ++   +E+    +
Sbjct: 120 ASVSAATATWLVKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPIL 179

Query: 243 PHLFLPLACAANVAK 257
           P  F    C +N+AK
Sbjct: 180 PFCFTITVCISNLAK 194


>gi|428168470|gb|EKX37414.1| hypothetical protein GUITHDRAFT_89666 [Guillardia theta CCMP2712]
          Length = 331

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/163 (28%), Positives = 83/163 (50%), Gaps = 1/163 (0%)

Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
           +L   + +  +P G+P S+   Y  Y  +  L+       G+ +TQ +L  +GV +   T
Sbjct: 121 KLKRIMYATFLPSGYPASLPDEYTKYQAFNLLQDLSSNLRGILSTQKILEGMGVGKAGVT 180

Query: 186 PGAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
             A  + W+ +DG+  +G ++F A     F  D+K+ R   DL+ ++   +++   A P 
Sbjct: 181 SLAATMQWMARDGSSMLGGLLFTAFASSSFGIDIKRWRLFADLINDVALLLDMLAPAYPD 240

Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 287
           LFLP+ C ++V K +  V + +  T I + +A   NI DV AK
Sbjct: 241 LFLPIICLSSVFKAMCGVAAGACNTAIVQHWAARGNIADVGAK 283


>gi|116004169|ref|NP_001070441.1| UPF0420 protein C16orf58 homolog [Bos taurus]
 gi|61554841|gb|AAX46623.1| hypothetical protein FLJ13868 [Bos taurus]
          Length = 232

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 67/123 (54%), Gaps = 1/123 (0%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV   +A+  A    W++
Sbjct: 72  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 131

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KD  G +G+++FA   G K D + KQ R   D+L ++   +E+    +P  F    C +N
Sbjct: 132 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCISN 191

Query: 255 VAK 257
           +AK
Sbjct: 192 LAK 194


>gi|116205700|ref|XP_001228659.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
 gi|88182740|gb|EAQ90208.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
          Length = 448

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 58/241 (24%), Positives = 120/241 (49%), Gaps = 5/241 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SVT  Y+PY T+ +L+ F      +   + +L  +GV  + ++P    I  I 
Sbjct: 45  LPAGYPHSVTDDYLPYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 104

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL-FLPLACAA 253
            D   R+  ++FA R G+  + + K  RF  D+  +    ++L T A+P+   L +  +A
Sbjct: 105 GDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKLGVIVSA 164

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK--RNP 311
            V +++  V + +++  +   FA   N+ ++ AK      +  LLG  +  ++ K   + 
Sbjct: 165 GVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVGSLVVKMVEDK 224

Query: 312 SLITTFALLSCG-YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
            ++    ++  G ++  +Y  V++V + +LNR R ++    +L  G V +  + + +E+I
Sbjct: 225 QVVWMLMVVLAGVHLAMNYHAVRAVKMRSLNRQRATLVFREWLDRGTVLTPEQVSQRESI 284

Query: 371 F 371
            
Sbjct: 285 L 285


>gi|340516831|gb|EGR47078.1| predicted protein [Trichoderma reesei QM6a]
          Length = 448

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 84/321 (26%), Positives = 152/321 (47%), Gaps = 17/321 (5%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFP SV+  Y+ Y  + +L+ FF     +   + LL  +GV    ++     +  IL
Sbjct: 75  LPVGFPHSVSKDYLSYQFFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTIL 134

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           KD   R+  ++FA R G + + D K+ RF  DL  +    +EL +  +      +A +  
Sbjct: 135 KDAMSRIATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLGSWGKVVALSTG 194

Query: 255 VA-KNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKR 309
            A + +  V + +++  +   FA+ +N+ ++ AK       VG I  L+GT L +   + 
Sbjct: 195 EALRALCGVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVGT-LVVKTVEN 253

Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE-GNLQE 368
           + +++    +L   ++  +Y  V+SV ++T NR R ++  E +LK+G V S  +  N + 
Sbjct: 254 HSAVVLLMIVLVLAHLGMNYLGVRSVCMNTFNRQRATILFEEYLKSGKVLSPEQVANRES 313

Query: 369 NIFSFPWLRD---KPVV---LGSRFKDAFQDPDAY--LAVEPFFEKERYVVTYNPSK-GK 419
            IF  P +R+   K VV       +  A    DA+  L +       R++ +  P +  +
Sbjct: 314 IIFWSPAIRNAQGKEVVHLEFAESYSHAMSCSDAHEKLIIVDGAMHTRFICSRVPGRLTR 373

Query: 420 VYALLKDQAKSDDILKAAFHA 440
           +  LL D A   D + A F A
Sbjct: 374 IKLLLWDGASPMDAVLAWFAA 394


>gi|453080329|gb|EMF08380.1| DUF647-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 464

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 65/241 (26%), Positives = 119/241 (49%), Gaps = 6/241 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFP SVT  Y+ Y  + +L+ F      + +++ +L  VGV    A      +  IL
Sbjct: 59  LPVGFPQSVTDDYLEYQIYDSLQAFSSSIAAMLSSRAVLEGVGVGDATANATTALLLSIL 118

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAA 253
           +D  GR+  ++FA R G   + + K  R   D+L + G  ++  +   P    + +   +
Sbjct: 119 QDSVGRITTIMFAHRFGMSLEPECKMYRLLADILNDTGFVLDCLSPIFPKPTRVVILSFS 178

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRN 310
           ++ +++  V + S +  +   FAK  N+G++ AK    E V ++A +L     I   K  
Sbjct: 179 SILRSLCGVAAGSAKASLSAHFAKQGNLGELNAKDSSQETVISLAGMLFGSWVITWVKTP 238

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
            +  T   LL   ++ ++ + V++V + TLNR R ++ +   L+ GH+P+  E   QE I
Sbjct: 239 VATWTALILLLSIHLETNRRAVRAVKMQTLNRQRATL-LYHHLRKGHLPTPIEVADQELI 297

Query: 371 F 371
           F
Sbjct: 298 F 298


>gi|119187997|ref|XP_001244605.1| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
          Length = 472

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 88/382 (23%), Positives = 163/382 (42%), Gaps = 62/382 (16%)

Query: 136 VPDGFPDSVTPSYVPY----------MTWR-------------ALKHFFGGAMGVFTTQT 172
           +P G+P SV+  Y+ Y          + W              +L+ F     G+ +++ 
Sbjct: 58  LPVGYPHSVSGDYLEYQVYVRPIFLKLHWSFTDVANNRLFVQDSLQAFSSSIAGLLSSRA 117

Query: 173 LLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMEL 231
           +L  VGV    A+P +  +  IL+D  GR+  ++FA Q G   + + K  R A D+  + 
Sbjct: 118 VLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSLEPECKMYRLAADIFNDS 177

Query: 232 GAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGEC 290
              ++  +   P  + + L   ++V + +  V + S++  +   FAK  N+ ++ A G  
Sbjct: 178 SMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSAHFAKWGNLAELNAVG-- 235

Query: 291 VGNIADLLGTGLSIMISKRNPSLITTFALLS--CGYVFSSYQEVKSVVLHTLNRARFSVA 348
                       S+++S     L T  AL+S    ++ ++Y  V++V + +LNR R ++ 
Sbjct: 236 ------------SLVVSYITSPLATWVALISLLIVHITTNYLAVRAVSMTSLNRQRANIV 283

Query: 349 VESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVV----LGSRFKDAFQDPDA---- 396
             +      V + ++ + QE +F       W      +    +G  FKD      +    
Sbjct: 284 FSTLFDENRVLTAKQASKQERVFERDGVLRWKASSDTLGFCQIGVPFKDVIDHVSSSSPA 343

Query: 397 ------YLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHC--IN 448
                  +A+   FE+E Y++ ++P++ +    LK  A     LKA  HA +  +   I 
Sbjct: 344 ERTEATIVAILRVFEQEEYILWFDPARKRGVIGLKTNASPTSQLKAWSHALLAANSLKIK 403

Query: 449 SSNGHRSLALKQHEYDNSNLML 470
            SN    L  +Q   D S LML
Sbjct: 404 ESNTRGPLVREQVGTD-SVLML 424


>gi|346978906|gb|EGY22358.1| hypothetical protein VDAG_03796 [Verticillium dahliae VdLs.17]
          Length = 455

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 63/258 (24%), Positives = 120/258 (46%), Gaps = 8/258 (3%)

Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGA 188
            F+R   +P  +PDSV+P Y+ Y  + + + FF    G+   + +L  +GV     +   
Sbjct: 75  QFLRDAFLPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNH 134

Query: 189 VAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF- 246
             +   LKDG  R+  + FA R G   + + K+ RF  DL  +    +ELA         
Sbjct: 135 ALLLTCLKDGVSRLATITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELANPFFAGWAK 194

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGL 302
           + +  +A   + +  + + +++  +   FA+  N+ ++  K       +G I  L GT +
Sbjct: 195 ILILVSAESLRALCGIAAGASKAALSTHFARRNNLAELNTKEASQETAIGLIGLLAGTAI 254

Query: 303 SIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
              +  R      T  L+   ++  +Y  V ++ L T+NR R ++ ++ +L+T  V +  
Sbjct: 255 IHYVQNRGAVFFLTIFLVFI-HLGMNYLGVCALQLDTVNRQRATIVMQHYLRTQEVLTPA 313

Query: 363 EGNLQEN-IFSFPWLRDK 379
           E + +EN IF  P + D+
Sbjct: 314 EASKKENIIFWRPHIADE 331


>gi|358380832|gb|EHK18509.1| hypothetical protein TRIVIDRAFT_14438, partial [Trichoderma virens
           Gv29-8]
          Length = 451

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 83/332 (25%), Positives = 154/332 (46%), Gaps = 19/332 (5%)

Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
           A E    +    +P GFP SV+  Y+ Y  + +L+ FF     +   + LL  +GV    
Sbjct: 63  AREWRKVLFDSFLPVGFPHSVSKDYLAYQMFDSLQAFFSTISSLLANRALLQGLGVGDAS 122

Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
           ++     +  ILKD   R+  ++FA R G + + D K+ RF  D+  +    +EL +  +
Sbjct: 123 SSATFALLLTILKDAMSRIATIVFAHRFGLRIEPDAKRYRFLADIFNDSAFFLELYSPYL 182

Query: 243 PHLFLPLACAANVA-KNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADL 297
                 +A +   A + +  V + +++  +   FA+ +N+ ++ AK       VG I  L
Sbjct: 183 GSWGKVIALSTGEALRALCGVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLL 242

Query: 298 LGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
           +GT L +   + + S++    LL   ++  +Y  V+SV + TLNR R ++    +LK   
Sbjct: 243 VGT-LVVKTIEDHSSVVILMILLVFAHLGMNYLGVRSVCMTTLNRQRATILFSEYLKNRV 301

Query: 358 VPSLREGNLQEN-IFSFPWLRD---KPVVLGSRFKDAF------QDPDAYLAVEPFFEKE 407
           V +  +   +E+ IF  P +R+   K VV    F +++       DP   L +    +  
Sbjct: 302 VLTPEQVAERESIIFWSPIMRNIRGKKVVY-LEFAESYGHAMSCSDPHDQLIIVDGIKHT 360

Query: 408 RYVVTYNPSK-GKVYALLKDQAKSDDILKAAF 438
           R++ +  P     +  LL + A+  D ++A F
Sbjct: 361 RFICSRVPKYLTPIKMLLWEGAQPIDAIRAWF 392


>gi|347842141|emb|CCD56713.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 376

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 83/333 (24%), Positives = 156/333 (46%), Gaps = 50/333 (15%)

Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAG 234
           S+GV  + A+     +  IL++ AGR+  ++FA R G   + + K+ R   D+  +    
Sbjct: 2   SLGVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPECKKYRLMADIFNDAAMI 61

Query: 235 VELATAA---VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECV 291
           ++  + A   +P +FL    A++V K++  V + S++  +   FAK  N+ ++ AK    
Sbjct: 62  LDCISPAFPKIPRVFL--LSASSVCKSLCGVAAGSSKASLSAHFAKMGNLAELNAKDASQ 119

Query: 292 GNIADLL----GTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSV 347
             +  LL    GT +   IS +  + I   ALLS  ++ ++Y  V+SV + TLNR R ++
Sbjct: 120 ETLISLLGMLVGTFVVSKISSQVATWIALLALLSI-HLGTNYMAVRSVTMRTLNRQRANL 178

Query: 348 AVESFLKTGH-------VPSLREGNLQENIFSFPWLRDKPV-----------VLGSRFKD 389
            + S L +         +PS R+ +LQE IF     RD  +            +G   +D
Sbjct: 179 VISSLLSSPEHEKNKISLPSPRDISLQERIFE----RDGAIRNTHGTILAYCKVGVSLQD 234

Query: 390 ------------AFQDPDAYL-AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKA 436
                       ++ +P++ L ++   ++ + Y++ Y+ S+     +LK++      L A
Sbjct: 235 LLSSIATPTTAGSYAEPESILTSLLSIYKTQGYLMWYSRSRNTFLVILKEETAPKFQLDA 294

Query: 437 AFHAYVLLHCINSSNGHRSLALKQHEYDNSNLM 469
            FHA   +  I S N  +    K +  DN +++
Sbjct: 295 WFHA---IFAITSGN-QKIQKSKTNHVDNQSIL 323


>gi|422292654|gb|EKU19956.1| hypothetical protein NGA_2057300, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 274

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 76/156 (48%), Gaps = 5/156 (3%)

Query: 108 KRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGV 167
           K++ +R+I   + +     L + + +  +P GFP SV P Y+ Y  W  ++       G+
Sbjct: 119 KQSEVRSIVRNAKR----RLKTLLHTVFLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGI 174

Query: 168 FTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGD 226
             TQ +L  VGV R  ATP A  + WI +DGA  +  ++F       F  + K  R   D
Sbjct: 175 LATQAILEGVGVGREGATPLAATLQWITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFAD 234

Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAV 262
            ++++G  +++     P  FL   C A+V K++  +
Sbjct: 235 FMVDIGITLDMLAPLFPKNFLLCICLASVCKSLCGI 270


>gi|387198702|gb|AFJ68865.1| hypothetical protein NGATSA_2057300, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 277

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 76/156 (48%), Gaps = 5/156 (3%)

Query: 108 KRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGV 167
           K++ +R+I   + +     L + + +  +P GFP SV P Y+ Y  W  ++       G+
Sbjct: 122 KQSEVRSIVRNAKR----RLKTLLHTVFLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGI 177

Query: 168 FTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGD 226
             TQ +L  VGV R  ATP A  + WI +DGA  +  ++F       F  + K  R   D
Sbjct: 178 LATQAILEGVGVGREGATPLAATLQWITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFAD 237

Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAV 262
            ++++G  +++     P  FL   C A+V K++  +
Sbjct: 238 FMVDIGITLDMLAPLFPKNFLLCICLASVCKSLCGI 273


>gi|397627073|gb|EJK68340.1| hypothetical protein THAOC_10484 [Thalassiosira oceanica]
          Length = 478

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/220 (25%), Positives = 103/220 (46%), Gaps = 14/220 (6%)

Query: 148 YVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIF 207
           Y+ Y+    L+        V  TQ +L  VGV R  AT  A  ++++++DG G +  ++ 
Sbjct: 93  YLEYVCMDNLQDLTTALRSVLATQRVLEGVGVGREGATALAATLDFLIRDGCGMIASLVL 152

Query: 208 ARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAANVAKNVAAVTST 265
                  F  D+K+ +   D  ++ G  +E+   ++P   FLP+ C ANV K +  V + 
Sbjct: 153 TSGFSSTFRRDIKRWKLFADAAVDAGITLEVVAPSLPSRWFLPVLCGANVCKALCGVAAG 212

Query: 266 STRTPIYKAFAKGENIGD-----VTAKGECVGNIADLLGTGLSIMISK-------RNPSL 313
           +   P+   +A    +GD     V AK      I    G  LS + ++         P  
Sbjct: 213 AVCGPVQLHWAARLGLGDEGLSEVAAKAGAQRTIVGATGLVLSGLFARWMGGRGANRPVW 272

Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
           +  +A+L+  ++ ++Y+ ++ V L+ LNR R  + +E FL
Sbjct: 273 VGMYAVLTALHLLANYRSMRIVRLNWLNRERMDLILEDFL 312


>gi|367028787|ref|XP_003663677.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
           42464]
 gi|347010947|gb|AEO58432.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
           42464]
          Length = 448

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 60/250 (24%), Positives = 121/250 (48%), Gaps = 5/250 (2%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L+  I    +P G+P +VT  Y+ Y T+ +L+ F      +   + +L  +GV  + ++P
Sbjct: 36  LLKAILYAFLPAGYPHTVTGDYLAYQTFDSLQAFASSITSLLANRAVLEGLGVGDSSSSP 95

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
               I  I  D   R+  ++FA R G+  + + K  RF  D+  +    ++L T A+P+L
Sbjct: 96  TGALILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYL 155

Query: 246 -FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT---G 301
             L +  +A V +++  V + +++  +   FA   N+ ++ AK      +  LLG     
Sbjct: 156 PKLGVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVGS 215

Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
           L + + +    +     +L+  ++  +Y+ V+SV + +LNR R ++    +L  G V + 
Sbjct: 216 LVVRMVEDKQVVWMLMVILAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGVVLTP 275

Query: 362 REGNLQENIF 371
            +   +E+I 
Sbjct: 276 EQVAQRESIL 285


>gi|389639940|ref|XP_003717603.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
 gi|351643422|gb|EHA51284.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
          Length = 479

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 64/288 (22%), Positives = 133/288 (46%), Gaps = 9/288 (3%)

Query: 89  DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
           D+D +  N +   D   +   NS  ++ ++    P   L+  I    +P GFP +VT  Y
Sbjct: 7   DVD-QTGNVLCSYDHGDRIDDNSDGSMKIEKTHVP---LIKAIFYAFLPAGFPHTVTEDY 62

Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
           + Y  + +L+ F      +   + +L  +GV     +P A  +  I++D   R+  ++FA
Sbjct: 63  LHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLKIIQDTISRLATIMFA 122

Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGA-GVELATAAVPHLFLPLACAANVAKNVAAVTSTS 266
            R G   + + K  RF  D+  +       L         + +   +   +++  V + +
Sbjct: 123 HRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLPKVSVLVTSGCLRSLCGVAAGA 182

Query: 267 TRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPSLITTF-ALLSCG 323
           ++  +   FAK  N+ ++ AK      +  L+G  TG +++    + + +  +  +L C 
Sbjct: 183 SKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTAVVHMVEDKTAVWCWMVVLICM 242

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
           ++  +Y+ V++V + TLNR R ++    FL+TG V +  + + +E+I 
Sbjct: 243 HLMMNYRAVRNVRMLTLNRQRATIVFREFLETGKVLTPAQCSARESIL 290


>gi|320593995|gb|EFX06398.1| duf647 domain containing protein [Grosmannia clavigera kw1407]
          Length = 458

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 123/243 (50%), Gaps = 9/243 (3%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SVT  Y+ Y T+ +L+ F      +   + +L  +GV     +P A  +  I+
Sbjct: 48  LPAGYPQSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLKII 107

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC--- 251
           +D   R+  ++FA R G+  + + K  RF  DL  +    ++L T  +P    P  C   
Sbjct: 108 QDTFSRMATILFAHRMGQAIEPECKTYRFMADLFNDSALMLDLLTPVLP--LYPKVCIMV 165

Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIM-ISK 308
           A+++ +++  + +++++  +   FA+  N+ ++ AK      +  LLG   G +++ + +
Sbjct: 166 ASSILRSLCGIAASASKASLSAHFARTGNLAELNAKEASQETVISLLGMLAGTAVVHVVQ 225

Query: 309 RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQE 368
              +++    +L   ++  +Y+ V+ V + TLNR R ++    +L+TG + S  E   +E
Sbjct: 226 DKTAVLYWMVMLLMVHLVMNYRGVRCVKMLTLNRQRATIVFREYLETGKILSPGEVAERE 285

Query: 369 NIF 371
           ++ 
Sbjct: 286 SVL 288


>gi|340967040|gb|EGS22547.1| diphosphomevalonate decarboxylase-like protein [Chaetomium
           thermophilum var. thermophilum DSM 1495]
          Length = 425

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 56/228 (24%), Positives = 112/228 (49%), Gaps = 5/228 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFP +VT  Y+ Y  + +L+ F      +   + +L  +GV  + ++P    I  I 
Sbjct: 45  LPAGFPHTVTEDYLSYQLYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 104

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL-FLPLACAA 253
            D   R+  ++FA R G+  + + K  RF  D+  +    ++L T A+P+   L +  +A
Sbjct: 105 GDTISRIATILFAHRMGRAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKLSVIVSA 164

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGTGLSIMISKRN 310
            V +++  V + +++  +   FA   N+ ++ AK    E V ++  +L   L + + +  
Sbjct: 165 GVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLAGSLVVRLVEDK 224

Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
             +     LL+  ++  +Y+ V+SV + +LNR R ++    +L  G V
Sbjct: 225 QVVWFLMILLAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGTV 272


>gi|428170130|gb|EKX39058.1| hypothetical protein GUITHDRAFT_114933 [Guillardia theta CCMP2712]
          Length = 405

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 106/245 (43%), Gaps = 57/245 (23%)

Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
            P  V+  Y  +  WR L+      + VF TQ+LL ++GV  N+    A A  W+  +  
Sbjct: 99  LPRGVSEDYYSFTIWRILQRIVSSTVSVFGTQSLLLALGVKTNKVGVAA-ASGWVFANAF 157

Query: 200 GRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
           G+ GKMIF AR G+ FD D K+ R                                    
Sbjct: 158 GKFGKMIFAARWGRDFDSDAKRWR------------------------------------ 181

Query: 259 VAAVTSTSTRTPIYKAFA-KGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTF 317
                 ++TR   YK FA + EN+GD+TAKG+    +ADL+G  + I +++    L  T 
Sbjct: 182 ------SATRGTFYKQFAGRNENLGDITAKGDAQVAVADLVGLVVGIELAR----LAGTN 231

Query: 318 ALLSCG-YVFSS-------YQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
           AL   G YV  S        + + ++V   LN  R ++   +F+  G + S  + +  E 
Sbjct: 232 ALCVWGSYVVLSALDFIFILKSLDTIVFRFLNLERATMLASAFVSEGIIKSPADISRSEP 291

Query: 370 IFSFP 374
           + S P
Sbjct: 292 VLSRP 296


>gi|336467636|gb|EGO55800.1| hypothetical protein NEUTE1DRAFT_103211 [Neurospora tetrasperma
           FGSC 2508]
 gi|350287710|gb|EGZ68946.1| hypothetical protein NEUTE2DRAFT_72230 [Neurospora tetrasperma FGSC
           2509]
          Length = 361

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 123/269 (45%), Gaps = 16/269 (5%)

Query: 116 SLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLN 175
           SL++ Q     L + + +++ P G+P SVTP Y+PY T+ +L+ FF     +   + +L 
Sbjct: 25  SLRAHQTKIPPLTALLHAFL-PAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLE 83

Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAG 234
            +GV    ++P    I  I  D   RV  ++FA R G   + + K  RF  D L +    
Sbjct: 84  GLGVGDASSSPTGALILKIASDTISRVATILFAHRLGTAIEPECKFFRFLADCLNDAAQL 143

Query: 235 VELATAAVPH---LFLPLACAANVAKNVAAVTSTSTRTPIYKAF--------AKGENIGD 283
           ++L   A+     L L +   A V ++   V +++++  +   F          G N+ +
Sbjct: 144 LDLLVPAMMERYGLKLGVIVGAGVLRSWCGVAASASKASLSAHFIVVASQQKGGGNNLAE 203

Query: 284 VTAKGECVGNIADLLGTGLSIMISKRNPSLITTFAL---LSCGYVFSSYQEVKSVVLHTL 340
           + AK      +  LLG  +  ++ +     +  + L   L   ++  +Y  V+ V   TL
Sbjct: 204 LNAKEASQETVVSLLGMLVGSLVVQVVQDRLVVWVLMVGLLGVHLLMNYWGVRCVRCRTL 263

Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQEN 369
           NR R ++ V  +L+TG V    E  L+ +
Sbjct: 264 NRQRATMVVREWLETGRVLGPEEDGLESH 292


>gi|367049005|ref|XP_003654882.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
 gi|347002145|gb|AEO68546.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
          Length = 451

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 114/237 (48%), Gaps = 5/237 (2%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L+  I    +P G+P +VT  Y+ Y T+ +L+ F      +   + +L  +GV  + ++P
Sbjct: 37  LLKAILYAFLPAGYPHTVTDDYLAYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSP 96

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
               I  I  D   R+  ++FA R G+  + + K  RF  DL  +    ++L T A+P+ 
Sbjct: 97  TGALILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADLFNDSAQFLDLLTPALPYF 156

Query: 246 -FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGTG 301
             L +  +A V +++  V + +++  +   FA   N+ ++ AK    E V ++  +L   
Sbjct: 157 PKLAVIVSAGVLRSLCGVAAGASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLAGS 216

Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
           L + + +    +      L+  ++  +Y+ V+ V + +LNR R ++    +L  G V
Sbjct: 217 LVVRLVEDKQLVWLLMVALAAVHLIMNYRAVRCVRMRSLNRQRATIVFREWLDHGAV 273


>gi|380471002|emb|CCF47486.1| hypothetical protein CH063_04176 [Colletotrichum higginsianum]
          Length = 478

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 60/241 (24%), Positives = 117/241 (48%), Gaps = 7/241 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+PDSVTP Y+ Y  + +L+ FF    G+ + + +L  +GV    ++     +  IL
Sbjct: 69  LPIGYPDSVTPDYIGYQAYDSLQAFFSTITGLLSNRAILQGLGVGDPNSSATYALLLTIL 128

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAA 253
           KDG  RV  + FA R G   + + K+ RF  D+  +    ++L +    P   +     A
Sbjct: 129 KDGISRVATIAFAYRFGLVIEPECKRYRFLADIFNDSAFFLDLFSPYFDPWTKVAAIVLA 188

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKR 309
              + +  V + +++  + K FA+  N+ ++ AK       +G +  L+G+ +   +  R
Sbjct: 189 EALRAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAIGLVGLLVGSIVVRFVEGR 248

Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
             ++      L   ++  +Y  V+ V L TLN+ R ++  + + +T  V + ++   +EN
Sbjct: 249 E-AVFVLMVFLVFVHLAMNYFGVRCVQLDTLNQQRATILFDHYAQTKRVLTPKQVAWREN 307

Query: 370 I 370
           I
Sbjct: 308 I 308


>gi|440638734|gb|ELR08653.1| hypothetical protein GMDG_03339 [Geomyces destructans 20631-21]
          Length = 714

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 110/224 (49%), Gaps = 8/224 (3%)

Query: 129 SFIRSYV---VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
           S+ +S++   +P G+P+SVT  Y+ Y  + +L+ F     G+  ++ +L  VGV    A+
Sbjct: 83  SWAKSFMEIFLPAGYPNSVTEDYMEYQIYDSLQAFSSSIAGLIASRAVLEGVGVGDATAS 142

Query: 186 PGAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
           P    +  +L+D  GR+  + F AR G   + + K+ R   D+  +    ++  + A P 
Sbjct: 143 PTTALLLSVLQDSTGRIATIFFAARLGLSLEPECKRWRLVADIFNDAAMILDCLSPAFPK 202

Query: 245 -LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TG 301
            L + L  A+   + +  V + S +  +   FAK  N+G++ AK      I  LLG   G
Sbjct: 203 PLRVSLLSASAALRALCGVAAGSAKASLSAHFAKSGNLGELNAKDSSQETIISLLGMLVG 262

Query: 302 LSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
            +++     P    T   LL   ++ ++Y  V++V + TLNR R
Sbjct: 263 TALIPYLTTPLQTWTALLLLLSIHLTTNYFAVRAVSMRTLNRQR 306


>gi|413942077|gb|AFW74726.1| hypothetical protein ZEAMMB73_469739 [Zea mays]
          Length = 425

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 68/280 (24%), Positives = 106/280 (37%), Gaps = 78/280 (27%)

Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELM--SFIRSYVVPDGFPDSVTPSYVPYMTWRALKH 159
           +R  K    S+  + +     P  + +  SF+  +  P G+P SV   Y+ Y  +R L+H
Sbjct: 45  ERFSKLHDKSVTTVKVIQDSRPIHDKLVDSFLNKFF-PSGYPYSVNEGYLTYTKFRVLQH 103

Query: 160 FFGGAMGVFTTQTLLNS-----------------------VGV-------SRNRATPGA- 188
           F    + V +TQ +  +                       VG         R    P A 
Sbjct: 104 FSSAMLHVLSTQLVFQNRLLSVLSSLTSAHFNYARQLTAQVGCGSVLAVCCRFATYPDAS 163

Query: 189 -------------VAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAG---------- 225
                        +    ILKDG     K+I +  G + D + K  R  G          
Sbjct: 164 NCCKLGKHMMTEMLKSKQILKDGMQHTRKLICSSMGARMDSEPKSWRILGLTAYVLYGCH 223

Query: 226 ---------------------DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTS 264
                                D+L +LG  +E+ +   P LFL +A   N AK +  V +
Sbjct: 224 CLGSHFTIFPIICLYCSILTADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMVVVAA 283

Query: 265 TSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
            +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I
Sbjct: 284 RATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGI 323


>gi|67972176|dbj|BAE02430.1| unnamed protein product [Macaca fascicularis]
          Length = 326

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 63/249 (25%), Positives = 123/249 (49%), Gaps = 11/249 (4%)

Query: 202 VGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVA 260
           +G+++FA  +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + 
Sbjct: 2   LGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIV 61

Query: 261 AVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITT 316
           +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  
Sbjct: 62  SVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGC 121

Query: 317 FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWL 376
           F  L+  +++++Y+ V+S+V+ TLN  R  + ++ +L+ G V      N  E +++  W 
Sbjct: 122 FFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW- 180

Query: 377 RDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDIL 434
              P    S      +   +   ++   E  +E Y++ ++ S+ +V  +L  +A    +L
Sbjct: 181 ---PAPSLSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTVL 237

Query: 435 KAAFHAYVL 443
           +AA H  +L
Sbjct: 238 RAATHGLML 246


>gi|414586416|tpg|DAA36987.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
          Length = 155

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 60/102 (58%)

Query: 203 GKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAV 262
           GK+I +  G + D + K  R   D+L +LG  +E+ +   P LFL +A   N AK +A V
Sbjct: 5   GKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVV 64

Query: 263 TSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
            + +TR PIY +FAK  N+ D+ AKGE +  + +++G G  I
Sbjct: 65  AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGI 106


>gi|310800465|gb|EFQ35358.1| hypothetical protein GLRG_10502 [Glomerella graminicola M1.001]
          Length = 416

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 84/325 (25%), Positives = 142/325 (43%), Gaps = 35/325 (10%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+PDSVTP Y+ Y T+ +L+ FF    G+ + + +L  +GV    ++     +  IL
Sbjct: 10  LPIGYPDSVTPDYLGYQTFDSLQAFFSTITGLLSNRAILQGLGVGDPNSSATYALLLTIL 69

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELAT------AAVPHLFLP 248
           KDG  RV  ++FA R G   + + K+ RF  D+  +    + L +        V  L L 
Sbjct: 70  KDGISRVATIVFAYRFGLVIEPECKRYRFLADIFNDSAFFLNLFSPYFDPWTKVGALVLA 129

Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK 308
            A  A     +  V + +++  + K FA+  N+ ++ AK E     A  L   L   +  
Sbjct: 130 EALGA-----MCGVAAGASKAALSKHFARRNNLSELNAK-EASQETAVGLVGLLVGSVVV 183

Query: 309 R----NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
           R      ++      L   ++  +Y  V+ V L TLN+ R ++  E + +T  V +  + 
Sbjct: 184 RFVEGREAVFALMTFLVFVHLGMNYLGVRCVQLATLNQQRATILFEHYARTKQVLTPAQV 243

Query: 365 NLQENI-FSFPWLRDK--PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVY 421
             +ENI F  P +++    VV   RF  ++ D                +     S G  Y
Sbjct: 244 ASRENILFWSPVIKNHRGKVVANIRFAKSYADA---------------MTADVKSAGIRY 288

Query: 422 ALLKDQAKSDDILKAAFHAYVLLHC 446
             LK + +S    K A  ++VL  C
Sbjct: 289 ITLKGEGQSKKEAKTAAASFVLCLC 313


>gi|296411034|ref|XP_002835240.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295628015|emb|CAZ79361.1| unnamed protein product [Tuber melanosporum]
          Length = 413

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 69/318 (21%), Positives = 138/318 (43%), Gaps = 34/318 (10%)

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
           +L+ F     G+ +++ +L   GV    A+     +  I +D   R+  ++FA R G   
Sbjct: 6   SLQAFSSSIAGLLSSRAVLEGFGVGDANASATGALLLTIAQDCVSRLATILFAHRFGPAL 65

Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
           + + K+ R A D+  +    ++  + A P  + + + C +   +    V    ++  +  
Sbjct: 66  EAETKRYRLAADVFNDSAMILDCLSPAFPRSVRILMLCCSGSLRAACGVAGGGSKASLSV 125

Query: 274 AFAKGENIGDVTAKGECVGNIADLLGT-GLSIMISKRNPSLIT--TFALLSCGYVFSSYQ 330
            FA+  N+G++ AK      +  L+G    S ++S  + +  T  +  LL   ++ ++++
Sbjct: 126 HFARSGNVGELNAKDSSQETVIGLMGMLAGSFVVSHISSTWATWISLTLLQAVHLGTNFK 185

Query: 331 EVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPW----------- 375
            V +V L TLNR R ++   +F  TG + + R+    E IF       W           
Sbjct: 186 AVTAVALRTLNRQRANIVCSAFFDTGEILTPRQAARHERIFEKDGVLRWGGSGVLGNAEI 245

Query: 376 ----------LRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLK 425
                     LR  P     R K+   +  + + +   F+ E+Y++ Y+  +  V   LK
Sbjct: 246 GVKLTALIRSLRQSP----ERKKEGEGEGVSPVELASVFQNEKYLLWYDVQRRLVTICLK 301

Query: 426 DQAKSDDILKAAFHAYVL 443
           ++   ++ LK+  HA VL
Sbjct: 302 EKVSPEEQLKSWVHALVL 319


>gi|452981822|gb|EME81581.1| hypothetical protein MYCFIDRAFT_100197, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 272

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 64/222 (28%), Positives = 116/222 (52%), Gaps = 8/222 (3%)

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
           +++ F     G+ +++ +L SVGV  + A+P +  +  +L+    R+ K+ FA R GK  
Sbjct: 34  SIQAFVSSIAGLLSSRAVLLSVGVGDSSASPTSALLLSMLQQSLSRIAKVAFAYRLGKAI 93

Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP-LACAANVAKNVAAVTSTSTRTPIYK 273
           + + K  RF  DLL +    +   +  +P  F P +   +++   V  V ++ST+  +  
Sbjct: 94  EPECKMYRFCADLLHDSSFVLNCLSPMLPKAFRPPILALSSICFAVCDVVASSTKASLSA 153

Query: 274 AFAKGENIGDVTAKGECVGNIADLLG--TGLSIMIS--KRNPSLITTFALLSCGYVFSSY 329
            FAK EN+G++ AK      I  LLG  TG S++++    + +  T   L+   ++ ++ 
Sbjct: 154 HFAKWENLGELNAKDSSQETIISLLGMLTG-SLLVNWISGHWATWTALILMLSVHLEANR 212

Query: 330 QEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
           + V+++V+ TLNR R +V V   L  GH+P   E +  E IF
Sbjct: 213 RAVRAIVMQTLNRQRATV-VYHRLCVGHIPRPAEVSRVERIF 253


>gi|398015008|ref|XP_003860694.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322498916|emb|CBZ33989.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 675

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 107/237 (45%), Gaps = 13/237 (5%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           +++ +   ++PDG+P SVT   +P+  W   +          +T+ +L  VGV  ++A  
Sbjct: 64  VVTALERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADL 123

Query: 187 GAVAINWILKDGAGRVGKMIFAR---QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
            +  ++W+++DG   +G ++FA    QG   +   K  +   D   ++   +EL    +P
Sbjct: 124 TSSTLSWMMQDGTRMIGSIVFASLIPQG--LECRAKTWQLVADFTTDIANLLELCAPWLP 181

Query: 244 ---HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
               +F  +  AA+  K +  V  + TR    + FA   N  D+ AK     ++   +G 
Sbjct: 182 GCQTIFRVVLVAASAIKALGGVCDSGTRASFTQHFALRNNAADIAAKAATRSSVGSFIGL 241

Query: 301 GLSIMISKRNPSL-----ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
            L   ++   P+      +  FAL S   +F+ Y+ V+ V L  LN  R    +E +
Sbjct: 242 ALGTAVAYSVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPRLEWCLEQY 298


>gi|409076043|gb|EKM76417.1| hypothetical protein AGABI1DRAFT_131245 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 466

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 91/370 (24%), Positives = 152/370 (41%), Gaps = 39/370 (10%)

Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
           +S     +EL++ I    +P G+P++VTP Y+ Y    A + F      +  ++  L  +
Sbjct: 41  RSKHQDTKELLTKI---FLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGI 97

Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVE 236
           GV    AT     +  +L+D  GR+  +I A   G +   + K+ RF  D+L ++   V+
Sbjct: 98  GVGDPSATSTGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVD 157

Query: 237 LATAAVPHLFLPLA-----CAANVAKNVAAVTSTSTRTPIYKAFAKGE----NIGDVTAK 287
           +          P A     C +   + V  V +   +  I   FA       ++GD+ AK
Sbjct: 158 VFNPVFSAFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAK 217

Query: 288 GECVGNIADLLGTGL-SIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
                 +  LLG  L S+MI       +T     LL   ++  +Y  V+ + + + NR R
Sbjct: 218 DSSKETVLALLGMLLGSVMIPHITSPWVTYPVLLLLISVHLVINYIAVRGLAIRSPNRHR 277

Query: 345 FSVAVESFL-KTGHVPSLREGNLQENIFSFPW-LRDKPVVL-------GSRFKDAFQDPD 395
            ++A   +  K    PS +    +E IF  P  +RD    L       GS   D  + P 
Sbjct: 278 ATIAWLWYRSKDECAPSPQAVASRELIFERPENIRDPTTSLVLGKCHVGSAPLDVMRGPA 337

Query: 396 AYLAVEPFFEKERYVVTYNPSKGK-----VYALLKDQAKSDDILKAAFHAYVLLHCINSS 450
                   F  ++Y+V +NPS        ++   K+     D LK+  HA  +       
Sbjct: 338 LSQEYLDIFHADKYIVCFNPSHETGHLPIIHICFKEGFTRADELKSWIHAVEIC------ 391

Query: 451 NGHRSLALKQ 460
              RS+AL Q
Sbjct: 392 ---RSIALNQ 398


>gi|339898153|ref|XP_001465441.2| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|321399425|emb|CAM67862.2| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 675

 Score = 78.6 bits (192), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 57/237 (24%), Positives = 107/237 (45%), Gaps = 13/237 (5%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           +++ +   ++PDG+P SVT   +P+  W   +          +T+ +L  VGV  ++A  
Sbjct: 64  VVTALERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADL 123

Query: 187 GAVAINWILKDGAGRVGKMIFAR---QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
            +  ++W+++DG   +G ++FA    QG   +   K  +   D   ++   +EL    +P
Sbjct: 124 TSSTLSWMMQDGTRMIGSIVFASLIPQG--LECRAKTWQLVADFTTDIANLLELCAPWLP 181

Query: 244 ---HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
               +F  +  AA+  K +  V  + TR    + FA   N  D+ AK     ++   +G 
Sbjct: 182 GCQTIFRVVLVAASAIKALGGVCDSGTRASFAQHFALRNNAADIAAKAATRSSVGSFIGL 241

Query: 301 GLSIMISKRNPSL-----ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
            L   ++   P+      +  FAL S   +F+ Y+ V+ V L  LN  R    +E +
Sbjct: 242 ALGTAVAYSVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPRLEWCLEQY 298


>gi|315056527|ref|XP_003177638.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311339484|gb|EFQ98686.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 495

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 88/392 (22%), Positives = 168/392 (42%), Gaps = 58/392 (14%)

Query: 120 PQAPAEE-LMSFIRSYVVPDGFPDSVTPSYVPYMTW------RALKHFFGGAMGVFT--- 169
           PQ P+    ++ I +  +P G+P SVT  Y+   +       + L     G         
Sbjct: 46  PQKPSSSSYLTSILNVFLPVGYPHSVTDDYIESSSLFRDYEKKELTSLISGLTSSILKLN 105

Query: 170 --TQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGD 226
             +  + +SVGV    A+P A  +  +L++  GR+  ++FA R G   + + K  R A D
Sbjct: 106 CWSPVIQSSVGVGDATASPTAALLLNVLQESMGRIATILFAHRLGTSLEPECKMYRLAAD 165

Query: 227 LLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
           LL +    ++  + A P  + + L   ++V + +  V + S++  +   FA+  N+G++ 
Sbjct: 166 LLNDSAIVLDCMSPAFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELN 225

Query: 286 AKGECVGNIADLLGTGL-SIMISKRNPSLITTFA--LLSCGYVFSSYQEVKSVVLHTLNR 342
           AK      +  L G  + S+++S  +  L T  A  LL   ++ +++  V++V + TLNR
Sbjct: 226 AKDSSQETVISLFGMLVGSVVVSYISSPLATWIALILLLIVHLGTNHAAVRAVKMTTLNR 285

Query: 343 ARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG-------------- 384
            R ++      +   V +  + + +E IF       W       LG              
Sbjct: 286 QRANIVFSYLFEDDRVLTPAQASEEERIFETDGVLRWKAKATGTLGICQIGVSLEQLLLL 345

Query: 385 ------SRFKDAFQDPDAYL--------AVEPFFEKERYVVTYNPSKGKVYALLKDQAKS 430
                 +  + + Q PD  +        A+   F++E Y++ ++P+  +   +LK  A  
Sbjct: 346 LPEQVNTNAQGSSQKPDTTMVGASTGLSALLELFKEEEYILYFSPAARRGAIVLKAGATP 405

Query: 431 DDILKAAFHAYVLLHCINSSNGHRSLALKQHE 462
              LKA  HA ++          R LA ++H+
Sbjct: 406 QAQLKAWSHALLV---------SRHLAREEHD 428


>gi|159124134|gb|EDP49252.1| DUF647 domain protein [Aspergillus fumigatus A1163]
          Length = 542

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 63/249 (25%), Positives = 113/249 (45%), Gaps = 37/249 (14%)

Query: 119 SPQAPAE--ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNS 176
           +P++P     L + +    +P G+P+SVT  Y+P                      +L  
Sbjct: 56  TPRSPWSLMTLFNLLVDVFLPSGYPNSVTDDYLP---------------------AVLQG 94

Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
           VGV    A+P +  +  IL+D +GR+  ++FA R G   + + K  R A D+  +L   +
Sbjct: 95  VGVGNANASPTSALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVL 154

Query: 236 ELATAAVPHLF--LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN 293
           +  +  +P     + +  AA V + +  V   S++  +   FA+G N+ +V AK      
Sbjct: 155 DCLSPMIPAGVGRVGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQET 214

Query: 294 IADLLGTGLSIMISKRNPSLITTFALLSCGYVF-------SSYQEVKSVVLHTLNRARFS 346
           I  LLG    +++     S +T+FA      VF        +Y  V+SV + +LNR R +
Sbjct: 215 IISLLG----MLVGSFIVSHVTSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRAN 270

Query: 347 VAVESFLKT 355
           +   + L++
Sbjct: 271 IVFSTLLES 279


>gi|70991459|ref|XP_750578.1| DUF647 domain protein [Aspergillus fumigatus Af293]
 gi|66848211|gb|EAL88540.1| DUF647 domain protein [Aspergillus fumigatus Af293]
          Length = 542

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 63/249 (25%), Positives = 113/249 (45%), Gaps = 37/249 (14%)

Query: 119 SPQAPAE--ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNS 176
           +P++P     L + +    +P G+P+SVT  Y+P                      +L  
Sbjct: 56  TPRSPWSLMTLFNLLVDVFLPSGYPNSVTDDYLP---------------------AVLQG 94

Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
           VGV    A+P +  +  IL+D +GR+  ++FA R G   + + K  R A D+  +L   +
Sbjct: 95  VGVGNANASPTSALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVL 154

Query: 236 ELATAAVPHLF--LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN 293
           +  +  +P     + +  AA V + +  V   S++  +   FA+G N+ +V AK      
Sbjct: 155 DCLSPMIPAGVGRVGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQET 214

Query: 294 IADLLGTGLSIMISKRNPSLITTFALLSCGYVF-------SSYQEVKSVVLHTLNRARFS 346
           I  LLG    +++     S +T+FA      VF        +Y  V+SV + +LNR R +
Sbjct: 215 IISLLG----MLVGSFIVSHVTSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRAN 270

Query: 347 VAVESFLKT 355
           +   + L++
Sbjct: 271 IVFSTLLES 279


>gi|440465939|gb|ELQ35234.1| hypothetical protein OOU_Y34scaffold00720g28 [Magnaporthe oryzae
           Y34]
 gi|440485195|gb|ELQ65173.1| hypothetical protein OOW_P131scaffold00517g6 [Magnaporthe oryzae
           P131]
          Length = 431

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 55/241 (22%), Positives = 115/241 (47%), Gaps = 5/241 (2%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFP +VT  Y+ Y  + +L+ F      +   + +L  +GV     +P A  +  I+
Sbjct: 41  LPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLKII 100

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGA-GVELATAAVPHLFLPLACAA 253
           +D   R+  ++FA R G   + + K  RF  D+  +       L         + +   +
Sbjct: 101 QDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLPKVSVLVTS 160

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNP 311
              +++  V + +++  +   FAK  N+ ++ AK      +  L+G  TG +++    + 
Sbjct: 161 GCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTAVVHMVEDK 220

Query: 312 SLITTF-ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
           + +  +  +L C ++  +Y+ V++V + TLNR R ++    FL+TG V +  + + +E+I
Sbjct: 221 TAVWCWMVVLICMHLMMNYRAVRNVRMLTLNRQRATIVFREFLETGKVLTPAQCSARESI 280

Query: 371 F 371
            
Sbjct: 281 L 281


>gi|396461797|ref|XP_003835510.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
 gi|312212061|emb|CBX92145.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
          Length = 543

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 76/298 (25%), Positives = 131/298 (43%), Gaps = 30/298 (10%)

Query: 174 LNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELG 232
           L  VGV    A+P A  +  +L+D  GR   ++FA R G  F+ + K  RF  D+L + G
Sbjct: 176 LAGVGVGDANASPTAALLLSVLQDSMGRTATILFADRLGTAFEPECKMYRFTADILNDTG 235

Query: 233 AGVELATAAVPH--LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGEC 290
             +E  + A P       L+C+ +V +++  V + S +  +   FA+  N+G+V AK   
Sbjct: 236 MVLECLSPAFPKPIRVCVLSCS-SVLRSLCGVCAGSAKASLSAHFARKGNLGEVNAKDSS 294

Query: 291 VGNIADLLGT-GLSIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSV 347
              +  LLG    S+++S  +  + T  T   L   ++ ++Y  V++V +  LNR R ++
Sbjct: 295 QETVISLLGMLAGSVVVSWISSPMATWSTLIGLLAIHLATNYAAVRAVSMRCLNRQRANI 354

Query: 348 AVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLGSRFKDAFQDPDAYLAVE-- 401
                L+ G + S  E + +E +F       W  D  +    R   + Q   +++ +   
Sbjct: 355 LFSRLLEHGQLLSPLEVSRRERVFEHDGVLRWSDDMAIGY-CRIGVSLQTLLSHMGLRNA 413

Query: 402 ----------------PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
                             F  E Y++    S G    +LK+     D LKA  HA +L
Sbjct: 414 RTGSLHLRGAEISELLHVFVDEAYILWPASSAGHFLIVLKEGCTPMDQLKAWAHALLL 471


>gi|426194386|gb|EKV44317.1| hypothetical protein AGABI2DRAFT_120451 [Agaricus bisporus var.
           bisporus H97]
          Length = 462

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 97/409 (23%), Positives = 165/409 (40%), Gaps = 41/409 (10%)

Query: 79  DSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPD 138
           + +L L+  +D    R  Y +        ++ S R    +S     +EL++ I    +P 
Sbjct: 4   NPALLLVERDDKKQMRELYYSPEKTLHIVEKKSHR--EYRSKHQDTKELLTKI---FLPA 58

Query: 139 GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDG 198
           G+P++VTP Y+ Y    A + F      +  ++  L  +GV    AT     +  +L+D 
Sbjct: 59  GYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPSATSTGAMLISVLQDI 118

Query: 199 AGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA-----CA 252
            GR+  +I A   G +   + K+ RF  D+L ++   V++          P A     C 
Sbjct: 119 FGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVFSAFMFPGARVIGLCT 178

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGE----NIGDVTAKGECVGNIADLLGTGL-SIMIS 307
           +   + V  V +   +  I   FA       ++GD+ AK      +  LLG  L S++I 
Sbjct: 179 SAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKETVLALLGMLLGSVLIP 238

Query: 308 KRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL-KTGHVPSLREG 364
                 +T     LL   ++  +Y  V+ + + + NR R ++A   +  K    PS +  
Sbjct: 239 HITSPWVTYPVLLLLISVHLVINYIAVRGLAIRSPNRHRATIAWLWYRSKDECAPSPQAV 298

Query: 365 NLQENIFSFPW-LRDKPVVL-------GSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPS 416
              E IF  P  +RD    L       GS   D  + P         F  ++Y+V +NPS
Sbjct: 299 ASHELIFERPENIRDPTTSLVLGKCHVGSAPLDVMRGPALSKEYLDIFHADKYIVCFNPS 358

Query: 417 KGK-----VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ 460
                   ++   K+     D LK+  HA  +          RS+AL Q
Sbjct: 359 HETGHLPIIHICFKEGFTRADELKSWIHAVEIC---------RSIALNQ 398


>gi|350630832|gb|EHA19204.1| hypothetical protein ASPNIDRAFT_186948 [Aspergillus niger ATCC
           1015]
          Length = 504

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 74/303 (24%), Positives = 129/303 (42%), Gaps = 38/303 (12%)

Query: 65  DETTAFRLVE-DQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRA-ISLQSPQA 122
           +++TA    E D+    +   I SE   G      A R+   K   N  R  ++  S  A
Sbjct: 5   EDSTAITFTEVDEVNHPTATYIYSEPAAGP-----AHREHGKKHGENWGRVDVAHASSTA 59

Query: 123 PA----EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
           PA    + +  F+    +P G+P SVT  Y P                      +L  VG
Sbjct: 60  PAAWSSKSVQDFLVEVFLPAGYPHSVTDDYAP---------------------AVLQGVG 98

Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
           V    A+P A  +  IL+D +GR+  ++FA R G   + + K  R A D+  +L   ++ 
Sbjct: 99  VGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMTLDC 158

Query: 238 ATAAVP--HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 295
            +  +P     + +   A V + +  V   S++  +   F++  N+ +V AK      I 
Sbjct: 159 LSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETII 218

Query: 296 DLLGTGLSIMISKRNPSLITTFA---LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
            L+G  +   +  R  +  TT+    +L   ++  +Y  V+SV + +LNR R ++   + 
Sbjct: 219 SLIGMLVGSFVVSRVTNFSTTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTL 278

Query: 353 LKT 355
           L T
Sbjct: 279 LNT 281


>gi|384494915|gb|EIE85406.1| hypothetical protein RO3G_10116 [Rhizopus delemar RA 99-880]
          Length = 188

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 54/89 (60%), Gaps = 1/89 (1%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P+SV   Y  + +W  L+ + G A+GV  +Q +L S+G+    AT GAVAI W+L
Sbjct: 59  LPVGYPESVHSCYKKFHSWLFLETYVGSAVGVLCSQAMLASLGLGTVEATGGAVAIQWVL 118

Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRF 223
           KDG G +GK+ F ++    FD   K  +F
Sbjct: 119 KDGIGEIGKLFFIKKYASTFDSHPKTWKF 147


>gi|258576451|ref|XP_002542407.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902673|gb|EEP77074.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 442

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 78/354 (22%), Positives = 155/354 (43%), Gaps = 52/354 (14%)

Query: 133 SYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAIN 192
           +Y++      + TP   P     +L+ F     G+ +++ +L  VGV    A+P A  + 
Sbjct: 20  TYLISSPEASAGTP---PVTIVDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAALLL 76

Query: 193 WILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLAC 251
            +L++  GR+  ++FA +      + K  R A D+  +    ++  +   P  + + +  
Sbjct: 77  NVLQESMGRIATILFAHR-----LECKMYRLAADIFNDSAIVMDCLSPMFPKPMRVGVLS 131

Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG------------ 299
            ++V + +  V + S++  +   FA+  N+ ++ AK      +  L+G            
Sbjct: 132 LSSVLRAMCGVAAGSSKASLSAHFARWGNLAELNAKDSSQETVISLMGMLVRPILGFLES 191

Query: 300 -TGL-------SIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAV 349
            +GL       S+++S+   +L T     LL   ++ ++Y  V++V + TLNR R ++  
Sbjct: 192 ISGLILTQQVGSLVVSRVTSTLATWAALILLLTIHITTNYFAVRAVNMTTLNRQRANIVF 251

Query: 350 ESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLGS-RFKDAFQ------------ 392
            +        +  +   QE IF       W +  P  LGS R   +FQ            
Sbjct: 252 STIFDENRTLTPSQAAHQERIFERDGVLRW-KASPASLGSCRIGVSFQYMVQLIASSESC 310

Query: 393 --DPDAYL-AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
             D  + + A+   FE+E Y++ +NP++ +   +LK  A +   LKA  HA ++
Sbjct: 311 TSDGQSIVAALLHLFEREEYILWFNPTRKRGAIVLKTNATAASQLKAWSHALIV 364


>gi|170102118|ref|XP_001882275.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642647|gb|EDR06902.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 285

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 74/281 (26%), Positives = 128/281 (45%), Gaps = 24/281 (8%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
             ++  I    +P G+PDSV+P Y+ Y    AL+ F     G+ +++ +L   GV    A
Sbjct: 2   RRVLDLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLAGLLSSRAILEGFGVGDPSA 61

Query: 185 TPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           T     +  +L+D   R+  ++ A   G     D K+ R   D+L +    ++  +  + 
Sbjct: 62  TATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDAKKYRLLADMLNDAAVILDTISPRLD 121

Query: 244 HLFLP-----LACAANVAKNVAAVTSTSTRTPIYKAFA---KGE-NIGDVTAKGECVGNI 294
            LF P       C + + +++ A+++  ++  I   FA   KG  ++GD+ AK      +
Sbjct: 122 TLFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKETV 181

Query: 295 ADLLGTGLSIMISKRNPSLIT------TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
             LLG  L  +I    P L T      T  +L   ++  +Y  V+ +VL TLN  R  +A
Sbjct: 182 LALLGMLLGTLIV---PHLTTPWTIYATLLVLVGLHLTINYIGVRGLVLRTLNGQRTWIA 238

Query: 349 VESFLKTG--HVPSLREGNLQENIFSFP-WLRD--KPVVLG 384
             ++++T     P+  E    E IF  P   RD  + V+LG
Sbjct: 239 WSAYMRTQPRRAPTPVEVASLERIFERPGGFRDPRRGVILG 279


>gi|407259493|gb|AFT91377.1| hypothetical protein [Emericella rugulosa]
          Length = 549

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 57/226 (25%), Positives = 101/226 (44%), Gaps = 22/226 (9%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  Y P +   A+                   VGV    A+P +  +  IL
Sbjct: 74  LPAGYPHSVSDDYTPALAQPAIH----------------AGVGVGNANASPTSALLLHIL 117

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
           +D +GR+  ++FA R G   + + K  RFA D+  +L   ++  +  +P  F  + +  A
Sbjct: 118 QDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCLSPMIPAGFNRVTVLSA 177

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS 312
           A V + +  V   S++  +   FA+  N+ DV AK      +  L+G  +   +     S
Sbjct: 178 AGVLRALCGVAGGSSKASLSAHFARWGNLADVNAKDSSQETVISLIGMLVGSFVVSHITS 237

Query: 313 LITTF---ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
              T+     L   ++  +Y  V+SV + TLNR R ++   + L +
Sbjct: 238 FTATWLTLVFLVSMHLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 283


>gi|392585079|gb|EIW74420.1| DUF647-domain-containing protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 515

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 125/291 (42%), Gaps = 44/291 (15%)

Query: 98  VAERDRSGKFKRNSIRAISLQSPQAPAEEL----------MSFIRSYVV----PDGFPDS 143
           V ERD SG  +R +  +   Q   +    L          +  IR  +V    P G+P S
Sbjct: 12  VLERDESGHLRRTTFGSDGKQYKASGRYNLRDELGLRVPSLDRIRRVLVDLFLPAGYPAS 71

Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVG 203
           VTP Y+PY  + AL+ FF    G+  ++  L  VGV    AT  +  +  +L+D  GR+ 
Sbjct: 72  VTPDYLPYHAYNALQAFFSSLAGLIASRGALQGVGVGDASATATSALLLTVLQDVFGRIT 131

Query: 204 KMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELAT------AAVPHLFLPLA---CAA 253
            +  A   G     + K  R   DL+ +  A + L T       +  H FL LA   CA+
Sbjct: 132 TISAAYVFGTSLYPETKTFRLVADLVND--ASIVLDTLCPRLSLSFLHTFL-LAISLCAS 188

Query: 254 NVAKNVAAVTSTSTRTPIYKAF---------AKGE-----NIGDVTAKGECVGNIADLLG 299
              + +  + +  ++  +   F         A GE     ++G++ AK      +  L G
Sbjct: 189 AALRALCGLVAGGSKAALTNHFACTVPASTAAGGEAVVAGDVGELNAKDASRETVVGLFG 248

Query: 300 TGLSIMISKRNPSLITTF---ALLSCGYVFSSYQEVKSVVLHTLNRARFSV 347
             L  ++  R  S+  T+     L  G++ ++Y  V+SV L TLNR R  +
Sbjct: 249 MLLGTLLVPRLTSVRATWLTLVFLIAGHLLANYLAVRSVSLRTLNRQRAGI 299


>gi|390341859|ref|XP_795645.3| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2
           [Strongylocentrotus purpuratus]
          Length = 311

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/220 (25%), Positives = 111/220 (50%), Gaps = 13/220 (5%)

Query: 235 VELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECV 291
           +EL +   P  F+ +AC +++ K++  V   +TR  +    A+  N+ DV+AK    E +
Sbjct: 8   IELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAKDGSQETL 67

Query: 292 GNIADLLGTGLSIM-ISKRNPSLI-TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAV 349
            N+A L   GL I  I+ +N  L  + F L +C +++++Y+ V  VV+ TLN+ RF + V
Sbjct: 68  VNLAALF-VGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQNRFHILV 126

Query: 350 ESFLKTGH--VPSLREGNLQENIFSFPWLRDKPVV--LGSRFKDAFQDPDAYLAVEPFFE 405
           + +L +    +      N +E +    W   +P+   LG  F+   Q             
Sbjct: 127 QDYLHSSQFSMSGPDAVNSREPVI---WRLSRPLKLNLGVPFRTVAQSMRDLEMCLQSNG 183

Query: 406 KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLH 445
           +E+++++++  KG +   +   + S+D +KA   A ++++
Sbjct: 184 QEKFLLSFDVKKGCIDIAIHHDSTSEDFIKACLQAEIIMY 223


>gi|119467992|ref|XP_001257802.1| hypothetical protein NFIA_052500 [Neosartorya fischeri NRRL 181]
 gi|119405954|gb|EAW15905.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 515

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 51/206 (24%), Positives = 100/206 (48%), Gaps = 6/206 (2%)

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
           +L+ F     G+ +++ +L  VGV    A+P +  +  IL+D +GR+  ++FA R G   
Sbjct: 86  SLQAFCSSIAGLLSSRAVLQGVGVGNANASPTSALLLHILQDSSGRIATILFAHRVGTAL 145

Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACAANVAKNVAAVTSTSTRTPIY 272
           + + K  R A D+  ++   ++  +  +P     + +  AA V + +  V   S++  + 
Sbjct: 146 EPECKMYRLAADVFNDVAMVLDCLSPMIPAGVGRVGVLSAAGVLRALCGVAGGSSKASLS 205

Query: 273 KAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFA---LLSCGYVFSSY 329
             FA+G N+ +V AK      I  L+G  +   I         T+A    L   ++  +Y
Sbjct: 206 AHFARGGNLAEVNAKDSSQETIISLIGMLVGSFIVSHVTCFAATWASLLFLLTVHLGMNY 265

Query: 330 QEVKSVVLHTLNRARFSVAVESFLKT 355
             V+SV + +LNR R ++ + + L++
Sbjct: 266 AAVRSVQMTSLNRQRANIVLSTLLES 291


>gi|414586415|tpg|DAA36986.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
          Length = 163

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 43/126 (34%), Positives = 67/126 (53%), Gaps = 5/126 (3%)

Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
           D +G   R SI    +Q  +   ++L+    + + P G+P SV   Y+ Y  +R L+HF 
Sbjct: 40  DAAGSGSR-SISVKVIQDSRPIHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFS 98

Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
              + V +TQ+LL + G+   R TP  A A++WILKDG    GK+I +  G + D + K 
Sbjct: 99  SAMLHVLSTQSLLFAAGL---RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKS 155

Query: 221 LRFAGD 226
            R  G+
Sbjct: 156 WRILGE 161


>gi|390341857|ref|XP_003725545.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1
           [Strongylocentrotus purpuratus]
          Length = 311

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 56/221 (25%), Positives = 114/221 (51%), Gaps = 15/221 (6%)

Query: 235 VELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECV 291
           +EL +   P  F+ +AC +++ K++  V   +TR  +    A+  N+ DV+AK    E +
Sbjct: 8   IELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAKDGSQETL 67

Query: 292 GNIADLLGTGLSIM-ISKRNPSLI-TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAV 349
            N+A L   GL I  I+ +N  L  + F L +C +++++Y+ V  VV+ TLN+ RF + V
Sbjct: 68  VNLAALF-VGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQNRFHILV 126

Query: 350 ESFLKTGHVPSLREGNLQENIFSFPWLRDKPVV--LGSRFK---DAFQDPDAYLAVEPFF 404
           + +L +    S+   +   +     W   +P+   LG  F+    + +D +  L      
Sbjct: 127 QDYLHSSQF-SMSGPDAVNSREPVIWRLSRPLKLNLGVPFRTVAKSMRDIEMCLQSNG-- 183

Query: 405 EKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLH 445
            +E+++++ +  KG +  ++   + S+D +KA   A ++++
Sbjct: 184 -QEKFLLSLDVKKGCIDIVIHHDSTSEDFIKACLQAEIIMY 223


>gi|391863517|gb|EIT72825.1| hypothetical protein Ao3042_00909 [Aspergillus oryzae 3.042]
          Length = 236

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 42/154 (27%), Positives = 79/154 (51%), Gaps = 3/154 (1%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  YVPY  + +L+ F     G+ +++ +L  VGV    A+P A  +  IL
Sbjct: 76  LPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHIL 135

Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
           +D +GR+  ++FA R G   + + K  R A D+  +    ++  +  +P  F  + +   
Sbjct: 136 QDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLST 195

Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
           A V + +  V   S++  +   F++  N+ +V A
Sbjct: 196 AGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNA 229


>gi|440799746|gb|ELR20790.1| protein C16orf58 family protein [Acanthamoeba castellanii str.
           Neff]
          Length = 101

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 32/74 (43%), Positives = 46/74 (62%)

Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
           ++P+G+P+SV+  Y  Y TW  ++       G+  T+ LL  VGV    ATPGA A+ W 
Sbjct: 21  LLPEGYPESVSGDYATYQTWDTVQALCSSITGLLCTRALLQGVGVGDVGATPGAAAVQWA 80

Query: 195 LKDGAGRVGKMIFA 208
           L+DGAG VG++ FA
Sbjct: 81  LRDGAGMVGRIGFA 94


>gi|407034154|gb|EKE37096.1| hypothetical protein ENU1_209440 [Entamoeba nuttalli P19]
          Length = 400

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 73/338 (21%), Positives = 144/338 (42%), Gaps = 39/338 (11%)

Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
           Q P ++  + ++  +VP G+P SV   Y  ++ +  +       M V +   +L      
Sbjct: 35  QQPFQKSQNHLQELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL------ 88

Query: 181 RNRATPGAVAINWIL--KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
             R  PG  A  +++  +   G +  +I  R G    +  KQ R   +L+ EL   +E+A
Sbjct: 89  --RINPGHSAAFFVVFREVMTGMMHLIITERWGTSIVFFAKQWRMRIELISELLRIIEIA 146

Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN----I 294
             + P +F  ++   +V + V   +    +T I   +AKG N+ ++T K + +      +
Sbjct: 147 FISTP-IFYSISVTCSVIEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVV 205

Query: 295 ADLLGTGLSIMISKRNPSLITTF--ALLS---CGYVFSSYQEVKSVVLHTLNRARFSVAV 349
           + L+G  +  ++     S I  F  AL+S   C ++ S+      +V  TLN  R ++ +
Sbjct: 206 SLLIGWVILHVLGDFTESGIIVFICALISHCICNFLLSN-----VIVFKTLNYERLTILM 260

Query: 350 ESFLKTGH------VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPF 403
             F+   +      V S  E  ++ N           + +G       +  D Y+ +   
Sbjct: 261 SYFMNIKNEILSPVVVSRIESKIKVN-------EMVNIRMGISLNSIPKSSDLYIEISQS 313

Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            +K +Y + Y  S  +++ +L D    D+I ++   AY
Sbjct: 314 IKKYQYYL-YKTSTNQIWVILGDHITKDEIFESLLCAY 350


>gi|194383144|dbj|BAG59128.1| unnamed protein product [Homo sapiens]
          Length = 156

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 42/130 (32%), Positives = 69/130 (53%), Gaps = 8/130 (6%)

Query: 79  DSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPD 138
           D SL+     ++ G RW  ++   R+   K     A  + +  AP+  L S +++  +P 
Sbjct: 29  DGSLQW----EVGGWRWWGLS---RAFTVKPEGRDAGEVGASGAPSPPL-SGLQAVFLPQ 80

Query: 139 GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDG 198
           GFPDSV+P Y+PY  W +++ F     G   TQ +L  +GV   +AT  A    W++KD 
Sbjct: 81  GFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDS 140

Query: 199 AGRVGKMIFA 208
            G +G+++FA
Sbjct: 141 TGMLGRIVFA 150


>gi|67467345|ref|XP_649789.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56466294|gb|EAL44403.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449707066|gb|EMD46790.1| Hypothetical protein EHI5A_044670 [Entamoeba histolytica KU27]
          Length = 400

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 73/338 (21%), Positives = 144/338 (42%), Gaps = 39/338 (11%)

Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
           Q P ++  + ++  +VP G+P SV   Y  ++ +  +       M V +   +L      
Sbjct: 35  QQPFQKSQNHLQELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL------ 88

Query: 181 RNRATPGAVAINWIL--KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
             R  PG  A  +++  +   G +  +I  R G    +  KQ R   +L+ EL   +E+A
Sbjct: 89  --RINPGHSAAFFVVFREVMTGMMHLIITERWGTSIVFFAKQWRMRIELISELLRIIEIA 146

Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN----I 294
             + P +F  ++   +V + V   +    +T I   +AKG N+ ++T K + +      +
Sbjct: 147 FISTP-IFYSISVTCSVIEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVV 205

Query: 295 ADLLGTGLSIMISKRNPSLITTF--ALLS---CGYVFSSYQEVKSVVLHTLNRARFSVAV 349
           + L+G  +  ++     S I  F  AL+S   C ++ S+      +V  TLN  R ++ +
Sbjct: 206 SLLIGWLILHVLGDFTESGIIVFICALISHCICNFLLSN-----VIVFKTLNYERLTILM 260

Query: 350 ESFLKTGH------VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPF 403
             F+   +      V S  E  ++ N           + +G       +  D Y+ +   
Sbjct: 261 SYFMNIKNEILSPVVVSRIESKIKVN-------EMVNIRMGISLNSIPKSSDLYIDISQS 313

Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            +K +Y + Y  S  +++ +L D    D+I ++   AY
Sbjct: 314 IKKYQYYL-YKTSTNQIWVILGDHITKDEIFESLLCAY 350


>gi|240280131|gb|EER43635.1| DUF647 domain-containing protein [Ajellomyces capsulatus H143]
          Length = 521

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 91/393 (23%), Positives = 157/393 (39%), Gaps = 71/393 (18%)

Query: 109 RNSIRAISLQSP-QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGV 167
           R+  R + +  P +  +   ++ + +  +P G+P SV+  Y+  +         G A GV
Sbjct: 43  RHERRRVDIVQPSRKDSSSYLASLLNVFLPVGYPHSVSDDYLDCL---------GFASGV 93

Query: 168 FT----TQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLR 222
                 T  +   VGV    A+P    ++ +L+D  GR+  ++FA R G   + + K  R
Sbjct: 94  LKFNCRTAGVPGWVGVGDASASPTVALLHSVLQDSMGRIATILFAHRLGTSLEPECKMYR 153

Query: 223 FAGDLLMELGAGVELATAAVPHLFLPLACA-ANVAKNVAAVTSTSTRTPIYKAFAKGENI 281
            A D+L +     +  + A+P     +  A ++V + +  V + S++  +   FAK  N+
Sbjct: 154 LAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVLRALCGVAAGSSKASLSSHFAKCGNL 213

Query: 282 GDVTA---KGECVGNIADLLGTGLSIMIS------KRNPSLITTFALLS--CGYVFSS-- 328
           G++ A       V  I+     G   + S      +++ S  T  +LL   CG V  S  
Sbjct: 214 GELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKLQKDSSQETVISLLGILCGSVVVSHI 273

Query: 329 ---------------------YQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
                                Y  V+SV + TLNR R ++   +  + G V +  E +  
Sbjct: 274 STPFATWTTLLLLLLVHLSTNYAAVRSVNMTTLNRQRTNIVFSTLFEKGSVLTPTEASKC 333

Query: 368 ENIFS----FPWLRDKPVVLGSRFKDAFQD----------------PDAYLAVEPFFEKE 407
           E IF       W      +   +   +FQ+                 D Y  +   FEKE
Sbjct: 334 ERIFERDGILRWKASSATLGYCQIGGSFQELLRGSIHGANSIRDVGIDIYKLLR-LFEKE 392

Query: 408 RYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
            YV+ +NP   K   +LK+       L+A  HA
Sbjct: 393 EYVLWFNPIHKKGTIVLKNNVTPISQLQAWSHA 425


>gi|159474944|ref|XP_001695583.1| hypothetical protein CHLREDRAFT_191958 [Chlamydomonas reinhardtii]
 gi|158275594|gb|EDP01370.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 487

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/126 (37%), Positives = 72/126 (57%), Gaps = 2/126 (1%)

Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLR 222
           A  V  +  +L +VG+    A P A A+NW+LKDG G++G ++F +     FD   K   
Sbjct: 150 ANSVLASTFMLYAVGLGAG-AIPTAGALNWVLKDGMGQLGTLVFGKTIAHNFDVHSKTWF 208

Query: 223 FAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
           F   +L++  A +E+ T  VP  FL +   AN+ K +A + + STR+  + +FA+  NI 
Sbjct: 209 FLSAVLLQAAAALEMLTVLVPGHFLLMGSLANMLKGLAWMAAGSTRSVFHLSFARDNNIA 268

Query: 283 DVTAKG 288
           DVTAKG
Sbjct: 269 DVTAKG 274


>gi|147803314|emb|CAN66551.1| hypothetical protein VITISV_039658 [Vitis vinifera]
          Length = 470

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 81/328 (24%), Positives = 140/328 (42%), Gaps = 43/328 (13%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           ++ +++P GFP  +   +  YM  +  K F    +G F+  T       +   A   A A
Sbjct: 113 VKDFILPAGFPVFL---FYAYMELKNQK-FVAVGVGSFSATT-------AAASAAASAAA 161

Query: 191 INWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAG----VELATAAVPHLF 246
           I W+ KDG G VG++                 F G  +++   G     +L+T   P  F
Sbjct: 162 IRWVSKDGIGAVGRL-----------------FIGLSVIKRAKGYKSIFDLSTQLYPAYF 204

Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM- 305
           L LA   N+AK VA      +   I   FA   N+G+V AK E     A LLG  L IM 
Sbjct: 205 LQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMV 264

Query: 306 -----ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
                ++   P L  T+  +   +++  YQ +  +   ++N  R  + V+S +    VP 
Sbjct: 265 LTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPG 324

Query: 361 LREGNLQENIFSFPWLRDKPVVLGSRFKDAF--QDPDAYL-AVEPFFEKERY--VVTYNP 415
             + N +E I S+       ++ G   ++    + P + +  +   + +E+Y  VV    
Sbjct: 325 YVDCNRKEKILSWQRFLKPRIIFGVSLEELIGGERPVSKVRTLLKLYSQEKYILVVKQQG 384

Query: 416 SKGKVYALLKDQAKSDDILKAAFHAYVL 443
           +  +V+   K  A S  +L++ + +Y L
Sbjct: 385 TDFEVFVSFKVGATSLSVLRSIWQSYWL 412


>gi|429848918|gb|ELA24353.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 440

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 83/354 (23%), Positives = 140/354 (39%), Gaps = 51/354 (14%)

Query: 98  VAERDRSGKFKRNSIRAISLQSPQAPAEE--------LMSFIRSYV----VPDGFPDSVT 145
           + ERD SG  +R  +     +  + P E         L   IR        P G+PDSV+
Sbjct: 12  IDERDSSGLIRRRWLHDHDGRLTEMPTENSPTTISRLLTMNIRQVFSDAFFPIGYPDSVS 71

Query: 146 PSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKM 205
             Y+ Y  + +L+ FF     +   + +L  +GV    ++     +  ILKDG  R+  +
Sbjct: 72  KDYLGYQLYDSLQAFFSTITSLLANRAILQGLGVGDADSSATYALLLTILKDGISRLATI 131

Query: 206 IFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL---------ACAANV 255
            FA R G   + + K+ RF  D+  +        +A    LF PL            A  
Sbjct: 132 GFAYRFGLVIEPECKKYRFLADIFND--------SAFFLDLFSPLFGSWTKVAALVVAEA 183

Query: 256 AKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLIT 315
            + +  V + +++  + K FA   N+ ++ AK         L+G  +   + +   S   
Sbjct: 184 LRAMCGVAAGASKAALSKHFALRGNLSELNAKESSQETAVGLIGLIVGSFVVRHVESREA 243

Query: 316 TFAL---LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS 372
            FAL   L   ++  +Y  V+ V L+  N+ R ++  E ++KT        GN       
Sbjct: 244 VFALMIILVFVHLGMNYLGVRCVQLNNFNQQRATIFFEEYMKT-------MGN------- 289

Query: 373 FPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKD 426
              +R  P  +  R    F  P  Y    P F K  +  +Y  +    Y +LK+
Sbjct: 290 ---VRLTPAEVAKRENIIFWKPVIYHDGRP-FAKIVFAKSYVDAMAPGYVVLKE 339


>gi|121709968|ref|XP_001272600.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
 gi|119400750|gb|EAW11174.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
          Length = 512

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 61/241 (25%), Positives = 110/241 (45%), Gaps = 23/241 (9%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           + L+  +    +P G+P SVT  Y+P       + +F  A    T   L   VGV    A
Sbjct: 63  KTLVDLLVDVFLPSGYPHSVTDDYLP-------RVYFHLATPQPTNNPL--GVGVGNADA 113

Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
           TP +  +  IL+D +GR+  ++FA R G   + + K  R A D+  +L   ++  +  +P
Sbjct: 114 TPTSALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMIP 173

Query: 244 HLF--LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTG 301
                + +   A V + +  V   S++  +   FA+G N+ +V AK      I  L+G  
Sbjct: 174 AGVSRITVLSTAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLVG-- 231

Query: 302 LSIMISKRNPSLITTFA-------LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
             +++     S +T FA        L   ++  +Y  V++V + +LNR R ++   + L 
Sbjct: 232 --MLVGSLVVSHVTGFAATWLSLLFLLAAHLAMNYAAVRAVQMTSLNRQRANIVFSTLLD 289

Query: 355 T 355
           +
Sbjct: 290 S 290


>gi|406860777|gb|EKD13834.1| DUF647 domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 501

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 62/237 (26%), Positives = 118/237 (49%), Gaps = 18/237 (7%)

Query: 136 VPDGFPDSVTPSYVPYMTW-RALKHFFGGAM----GVFTTQTLLNS--------VGVSRN 182
           +P G+P+SVT  Y+ Y  + RA    +  +     G+ T+   L+S        +GV  +
Sbjct: 58  LPAGYPNSVTEDYLEYQIYVRAANKPWTQSELISEGLSTSVLKLHSWDAVLESRIGVGDS 117

Query: 183 RATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAA 241
            A+P A  +  +L+   GR+  ++FA R G   + + K  R A D+  +    ++  + A
Sbjct: 118 HASPTAALLLSVLQQSMGRIATIVFAHRLGTALEPECKMYRLAADIFNDAAMILDCLSPA 177

Query: 242 VPHLF-LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
           +P    + L   ++V +++  VT+ S++  +   FA   N+G++ AK      +  LLG 
Sbjct: 178 LPKASRVALLSLSSVLRSMCGVTAGSSKASLSAHFATQGNLGELNAKDSSQETVISLLGM 237

Query: 301 -GLSIMISKRNPSLITTFA--LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
              S+++S      IT  A  LL   ++ ++Y  V++V + TLNR R ++ + + L+
Sbjct: 238 LAGSVVVSYITSKTITWIALILLLATHLGTNYLAVRAVCMQTLNRQRANLVLSNILQ 294


>gi|167378255|ref|XP_001734737.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165903647|gb|EDR29104.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 400

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 72/338 (21%), Positives = 143/338 (42%), Gaps = 39/338 (11%)

Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
           Q P ++  + ++  +VP G+P SV   Y  ++ +  +       M V +   +L      
Sbjct: 35  QQPFQKSQNHLQELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL------ 88

Query: 181 RNRATPGAVAINWIL--KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
             R   G  A  +++  +   G +  +I  R G    +  KQ R   DL+ EL   +E+A
Sbjct: 89  --RFNSGHSAAFFVVFREVMTGMMHLIITERWGTSIVFFAKQWRMRIDLISELLRIIEIA 146

Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN----I 294
             + P +F  ++   +V + V   +    +T I   +AKG N+ ++T K + +      +
Sbjct: 147 FISTP-IFYSMSVTCSVIEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVV 205

Query: 295 ADLLGTGLSIMISKRNPSLITTF--ALLS---CGYVFSSYQEVKSVVLHTLNRARFSVAV 349
           + L+G  +  ++     S I  F  AL+S   C ++ S+      +V  TLN  R ++ +
Sbjct: 206 SLLIGWLILHVLGDFTESGIIVFICALISHCVCNFLLSN-----VIVFKTLNYERLTILM 260

Query: 350 ESFLKTGH------VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPF 403
             F+   +      V S  E  ++ N           + +G       +  + Y+ +   
Sbjct: 261 SYFMNIKNEILSPVVVSRMESKIKVN-------EMVNIRMGISLNSIPKSSNLYIEISQS 313

Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
            +K +Y + Y  S  +++ +L D    D+I ++   AY
Sbjct: 314 IKKYQYYL-YKTSTKQIWVILGDHITKDEIFESLLCAY 350


>gi|169604674|ref|XP_001795758.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
 gi|111066623|gb|EAT87743.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
          Length = 473

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 75/293 (25%), Positives = 132/293 (45%), Gaps = 30/293 (10%)

Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
           VGV  + A+P A  +  +L++  GR+  ++FA R G   + + K  R A D+  +    +
Sbjct: 113 VGVGDSSASPTAALLLSVLQESMGRIATILFAHRLGTALEPECKMYRLAADVFNDTAMIL 172

Query: 236 ELATAAVPHLFLPLACA-ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNI 294
           +  + A P  F  L  + ++V +++  V + S +  +   FA+  N+G+V A    +   
Sbjct: 173 DCLSPAFPKPFRVLVLSFSSVLRSLCGVCAGSAKASLSAHFARKGNLGEVNAHITHMNPN 232

Query: 295 ADLLGTGLSI---MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
            D    GL      I+  N  LI   ALLS  ++ ++Y  VK+V +  LNR R ++   +
Sbjct: 233 VDFDIEGLEPGNGHIATWNAILI---ALLSI-HLATNYAAVKAVSMRCLNRQRANIVFGN 288

Query: 352 FLKTGHVPSLREGNLQENIFS----FPWLRD-----------KPVVLGS-----RFKDAF 391
            +  G V +  E + +E IF       W+ D             V+L S     R   + 
Sbjct: 289 LMHHGSVLTPTEVSQRERIFERDGVLRWVDDCVIGHCTIGVPLGVLLASMSQRYRRTGSL 348

Query: 392 QDPDAYLA-VEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
           +  D  L  +   F  ++YV+ +  S+ +   +LK+     D LKA  HA + 
Sbjct: 349 ELSDINLPDLMDIFADQKYVLWFTDSEYEAIVVLKEGCTPIDQLKAWAHALLF 401


>gi|194382298|dbj|BAG58904.1| unnamed protein product [Homo sapiens]
          Length = 194

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 95/189 (50%), Gaps = 9/189 (4%)

Query: 202 VGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVA 260
           +G+++FA  +G K D + KQ R   D+L ++   +E+     P  F      +N+AK + 
Sbjct: 2   LGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIV 61

Query: 261 AVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITT 316
           +V   +TR  +    A+  N+ DV+AK    E + N+A LL + L + +    P   +  
Sbjct: 62  SVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGC 121

Query: 317 FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWL 376
           F  L+  +++++Y+ V+++V+ TLN  R  + ++ +L+ G V      N  E +    W 
Sbjct: 122 FFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL----WT 177

Query: 377 RDKPVVLGS 385
            D    LGS
Sbjct: 178 GDPAPGLGS 186


>gi|345563243|gb|EGX46246.1| hypothetical protein AOL_s00110g70 [Arthrobotrys oligospora ATCC
           24927]
          Length = 445

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 72/303 (23%), Positives = 135/303 (44%), Gaps = 54/303 (17%)

Query: 98  VAERDRSG----KFKRNS----------IRAISLQSPQAPAEELMSFIRSYVVPDGFPDS 143
           + ERD+SG    ++++            +++ SLQ PQA +   +  I +  +P G+P S
Sbjct: 13  IVERDQSGIITAEYRQQGNQNIVQVHQVVQSNSLQ-PQAISH--IQNIINIFLPAGYPKS 69

Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVG 203
           VT  Y+             G +          +VG   + A+  A  +  I++D  GRV 
Sbjct: 70  VTDDYLD------------GDL----------AVGFEDSNASTTAAVLLKIVQDSIGRVA 107

Query: 204 KMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAA 261
            ++FA R G   + + K+ RFA D+  +    V+  +   P  L + L   A   + +  
Sbjct: 108 TILFAHRLGSSLEPECKRFRFAADIFNDSALIVDCLSPVFPKPLRVVLLSTAGTLRALCG 167

Query: 262 VTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL-----------SIMISKRN 310
           V    ++  +   F+K  N+ ++ AK      +  L+G  +           S++IS   
Sbjct: 168 VAGGGSKAQLSVHFSKWGNVAELNAKDSSQETVISLMGMLMRADQTLRVKAGSLVISSVT 227

Query: 311 PSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQE 368
               T  +   L   +++++Y+ VK+V + TLNR R +  + +F +TG + + ++   QE
Sbjct: 228 SKTATWASMLFLIAVHLWTNYRAVKAVTMVTLNRQRANSVMSTFAETGVMLTPKDVARQE 287

Query: 369 NIF 371
            IF
Sbjct: 288 RIF 290


>gi|397575094|gb|EJK49531.1| hypothetical protein THAOC_31586, partial [Thalassiosira oceanica]
          Length = 194

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 79/153 (51%), Gaps = 12/153 (7%)

Query: 59  LLQSESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ 118
           ++ ++ DE      V +Q + SS       D D RR  Y +E    GK+       + ++
Sbjct: 52  IVSTDKDENKV--QVWEQTISSSNANKNGGDDDSRRTLYRSEG--PGKW-------LKVE 100

Query: 119 SPQAPAEEL-MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
           SP A  + + ++ I  + +P  +P+SV  SY  + ++    +  G A  V +TQ LL +V
Sbjct: 101 SPPASRQSMSLTSIVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAV 160

Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFARQ 210
           GV    A P A A+NW+LKD AG++G ++F  Q
Sbjct: 161 GVGSQSAAPMAAALNWLLKDFAGQLGGVLFVSQ 193


>gi|62320358|dbj|BAD94740.1| putative protein [Arabidopsis thaliana]
          Length = 295

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 105/230 (45%), Gaps = 11/230 (4%)

Query: 225 GDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV 284
            D +   G+  +LAT   P  FL LA   N+AK VA      +   I   FA   N+G+V
Sbjct: 3   ADFIGSAGSFFDLATQLYPSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEV 62

Query: 285 TAKGECVGNIADLLGTGLSIMI------SKRNPSLITTFALLSCGYVFSSYQEVKSVVLH 338
            AK E     A L+G G  I+I       K  P ++ T+  +   +++  YQ +  +  +
Sbjct: 63  AAKEEVWEVAAQLIGLGFGILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFN 122

Query: 339 TLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF---QDPD 395
           T+N  R  + VES +    VP   + N +ENI  +       ++ G   ++     +   
Sbjct: 123 TVNLKRARIIVESHVVHSVVPGYVDCNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVS 182

Query: 396 AYLAVEPFFEKERYVVTYNP-SKGKVYAL-LKDQAKSDDILKAAFHAYVL 443
              A+   + KE+Y++T N  +K   +++  K  A S D+L+  + AY L
Sbjct: 183 KVKALLKMYTKEKYILTLNKLNKDTEFSVSFKVNATSRDVLRCLWQAYWL 232


>gi|296826902|ref|XP_002851048.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
 gi|238838602|gb|EEQ28264.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
          Length = 479

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 86/406 (21%), Positives = 159/406 (39%), Gaps = 88/406 (21%)

Query: 115 ISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTW-------------------- 154
           I+L   Q  +   ++ I +  +P G+P SVT  Y+ Y  +                    
Sbjct: 41  ITLPPRQPSSTSYLTSILNVFLPSGYPHSVTDDYLEYQIYVSQQPFPSSDRKEKKLTVSY 100

Query: 155 -RALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKK 213
             +L+ F     G+ +++ +L  VGV    A+P A  +  +L++  GR+  ++FA +   
Sbjct: 101 QDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAALLLNVLQESMGRIATILFAHR--- 157

Query: 214 FDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA----ANVAKNVAAVTSTSTRT 269
                            LG  +E      P    P+       ++V + +  V + S++ 
Sbjct: 158 -----------------LGTSLEPEYCLSPVFPKPVRVGFLSLSSVLRALCGVAAGSSKA 200

Query: 270 PIYKAFAKGENIGDVTAKGECVGNIADLLGTGL-SIMISKRNPSLITTFAL--LSCGYVF 326
            +   FA+  N+G++ AK      +  L+G  + S++++  +  L T  AL  L   ++ 
Sbjct: 201 SLSAHFARWGNLGELNAKDSSQETVISLVGMLVGSVVVAYVSSPLATWVALLFLLAIHLA 260

Query: 327 SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVV 382
           +++  V++V + TLNR R ++   S      V +  + + +E IF       W  +   V
Sbjct: 261 TNHAAVRAVNMTTLNRQRANIVFSSLFDDDRVLTPAQVSKEERIFERDGVLRWKAEATGV 320

Query: 383 LG--------------------------SRFKDAFQDPDAYLAV-EPFFEKERYVVTYNP 415
           LG                          SR  +A  D    L V    F++E Y++ ++P
Sbjct: 321 LGTCRVGVSLEQLLLLLPHSQTNDASKTSRRSEAGADFGVSLPVLVDLFKEEEYLLYFSP 380

Query: 416 SKGKVYALLKDQAKSDDILKAAFHAYVLLH---------CINSSNG 452
                  +LK    +   LKA  HA ++           CIN   G
Sbjct: 381 RDHAAVIILKAGVTAQAQLKAWSHALLVSRRLAGRSTSACINEKQG 426


>gi|414876406|tpg|DAA53537.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
          Length = 295

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/230 (24%), Positives = 101/230 (43%), Gaps = 12/230 (5%)

Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
           D +   G+  EL+T   P  FLPLA   N+AK +A      +   I   FA+  N+G+V 
Sbjct: 4   DFIGSAGSIFELSTTLYPGYFLPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLGEVA 63

Query: 286 AKGECVGNIADLLGTGLSIMISKRNP------SLITTFALLSCGYVFSSYQEVKSVVLHT 339
           AK E     A LLG  + ++I  +        +L+ T+  +   +++  YQ +  +   T
Sbjct: 64  AKEEVWEVGAQLLGLSIGVLIMDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLKFRT 123

Query: 340 LNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG---SRFKDAFQDPDA 396
           +N  R  + V+S +    VP     N +ENI ++       V  G    R     +  D 
Sbjct: 124 INLKRGRILVKSHVAQHTVPGYVACNEEENILTWERYLRPQVSFGVPMERMLGGDKSSDM 183

Query: 397 YLAVEPFFEKERYVVTYNPSKGKVYALL---KDQAKSDDILKAAFHAYVL 443
              +   ++ E+Y++ +     +    L   K+ A S  +L++ + A+ L
Sbjct: 184 VNRLLKLYKNEKYILFFEQFGSREPTFLVTFKESATSMSVLRSLWQAHWL 233


>gi|189196610|ref|XP_001934643.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187980522|gb|EDU47148.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 333

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 56/238 (23%), Positives = 110/238 (46%), Gaps = 11/238 (4%)

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
           +L+ F     G+  ++ +L  VGV  + ATP    +  +++   G++  ++FA R G   
Sbjct: 42  SLQAFSSSIAGMLASRAILEGVGVGDSTATPTTALLLSVIQTSVGQIATILFAHRLGTAL 101

Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
           + + K  R A D+  +    ++  + A P  + + +   ++  + +  V + S++  +  
Sbjct: 102 EPECKMYRLAADVFNDTAMVLDCLSPAFPKSIRVAVLSFSSCLRALCGVCAGSSKASLSA 161

Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVF---SSYQ 330
            FA   N+G+V AK      +  LLG     ++  +  S + T+  L         ++Y 
Sbjct: 162 HFAHKGNLGEVNAKDSSQETVISLLGMLAGSLVVSQITSPLATWGTLILLLTIHLATNYA 221

Query: 331 EVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG 384
            V++V +H LNR R ++   +  + G V S R+ + +E +F       W  DK  VLG
Sbjct: 222 AVRAVNMHCLNRQRANILFSNMFQHGLVLSPRDVSQRERVFERGGVLRWSDDK--VLG 277


>gi|303288586|ref|XP_003063581.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454649|gb|EEH51954.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 290

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 56/248 (22%), Positives = 93/248 (37%), Gaps = 56/248 (22%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGV-SRNRATPGAVAINWI 194
           +P G+PD+V+  Y  ++ W    +FF   + V T+Q+LL ++G+ S   A P A A  W+
Sbjct: 3   LPAGYPDTVSSDYARWLRWHLFSNFFRNVLEVLTSQSLLVALGMGSTPGALPLAAATKWV 62

Query: 195 LKDGAGRVGKMIFARQG-KKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAA 253
           LKDG G    ++    G +++D D                              P  CA 
Sbjct: 63  LKDGVGSFATLLSGVVGAQRYDED------------------------------PKRCAT 92

Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK----- 308
            V        ++       + F + +N GD+ AK E  G    L      I I +     
Sbjct: 93  FVRAGALTGRTSLVNGTFMQHFGRNQNAGDIRAKLEVQGKWLALCALPTGIFIFRGVSGA 152

Query: 309 -------------------RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAV 349
                              R+  +I+ +  +  G+ F S++    + +  LNR R   A 
Sbjct: 153 FEPLPPETLETMTLLDHLTRSLPVISAYGAVVGGHSFLSWKAASVLRVDVLNRFRLLKAA 212

Query: 350 ESFLKTGH 357
             F+  G 
Sbjct: 213 REFVDGGE 220


>gi|325088852|gb|EGC42162.1| DUF647 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 507

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 77/320 (24%), Positives = 128/320 (40%), Gaps = 57/320 (17%)

Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
           VGV    A+P    ++ +L+D  GR+  ++FA R G   + + K  R A D+L +     
Sbjct: 31  VGVGDASASPTVALLHSVLQDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIF 90

Query: 236 ELATAAVPHLFLPLACA-ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA---KGECV 291
           +  + A+P     +  A ++V + +  V + S++  +   FAK  N+G++ A       V
Sbjct: 91  DCLSPALPKRLRVVVLAFSSVLRALCGVAAGSSKASLSSHFAKCGNLGELNAVRFPNVLV 150

Query: 292 GNIADLLGTGLSIMIS------KRNPSLITTFALLS--CGYVFSS--------------- 328
             I+     G   + S      +++ S  T  +LL   CG V  S               
Sbjct: 151 IAISGAAQRGFPFLFSDADQKLQKDSSQETVISLLGILCGSVVVSHISTPFATWTTLLLL 210

Query: 329 --------YQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWL 376
                   Y  V+SV + TLNR R ++   +  + G V +  E +  E IF       W 
Sbjct: 211 LLVHLSTNYAAVRSVNMTTLNRQRTNIVFSTLFEKGSVLTPTEASKCERIFERDGILRWK 270

Query: 377 RDKPVVLGSRFKDAFQD----------------PDAYLAVEPFFEKERYVVTYNPSKGKV 420
                +   +   +FQ+                 D Y  +   FEKE YV+ +NP   K 
Sbjct: 271 ASSATLGYCQIGGSFQELLRGSIHGANSIRDVGIDIYKLLR-LFEKEEYVLWFNPIHKKG 329

Query: 421 YALLKDQAKSDDILKAAFHA 440
             +LK+       L+A  HA
Sbjct: 330 TIVLKNNVTPISQLQAWSHA 349


>gi|330932017|ref|XP_003303615.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
 gi|311320239|gb|EFQ88270.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
          Length = 391

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 59/217 (27%), Positives = 105/217 (48%), Gaps = 11/217 (5%)

Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
           VGV  + ATP    +  ++    GR+  ++FA R G   + + K  RFA D+  +    +
Sbjct: 26  VGVGDSTATPTTALLLSVIHSSTGRIATILFAHRLGTALEPECKMYRFAADIFNDTAMVL 85

Query: 236 ELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNI 294
           +  + A P  + + +   ++    +  V + S++  +   FA+  N+G+V AK      +
Sbjct: 86  DCLSPAFPKPIRVAVLSFSSCLSALCGVCAGSSKASLSAHFARKGNLGEVNAKDSSQETV 145

Query: 295 ADLLGT-GLSIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
             LLG    S++IS     L T  T  LL   ++ ++Y  V++V++H LNR R ++   +
Sbjct: 146 ISLLGMLAGSVVISYITSPLATWGTLILLLTIHLATNYAAVRAVIMHCLNRQRANILFSN 205

Query: 352 FLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG 384
             + G V S R+ + +E +F       W  DK  VLG
Sbjct: 206 MFQHGLVLSPRDVSQRERVFERGGVLRWSDDK--VLG 240


>gi|299740308|ref|XP_001838896.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
 gi|298404184|gb|EAU82827.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
          Length = 768

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 57/234 (24%), Positives = 102/234 (43%), Gaps = 13/234 (5%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           ++L  F ++  +P G+P+SVTP Y+ Y    AL+ F      + +++  L   GV    A
Sbjct: 321 QKLRDFGKAVFLPAGYPNSVTPDYLTYQVLNALQAFCSSLASLISSRATLQGFGVGSASA 380

Query: 185 TPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVE-----LA 238
           +     +  IL D   R+  ++ A   G     + K+ RF  D+L ++   ++       
Sbjct: 381 SANDALLLTILLDIYSRLTTIVSAHLLGSSLATEAKKYRFVADILNDVAVVLDTLSPVFK 440

Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFA---KGE-NIGDVTAKGECVGNI 294
               P L +   C +   + +  + +  ++  I   FA   KG+ ++GD+ AK      +
Sbjct: 441 AQGYPQLRVVALCLSASCRALCGIAAGGSKAAISVHFATPVKGKGDLGDLNAKDSSKETV 500

Query: 295 ADLLGTGLSIMISKRNP---SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARF 345
             L G  L  +I  R     S  TT  +L   ++  +Y  V  +VL T+N  R 
Sbjct: 501 LALFGMLLGSLIVPRLQTAWSTYTTLFILLSLHIAINYVAVTGLVLRTMNWQRM 554


>gi|390599626|gb|EIN09022.1| DUF647-domain-containing protein [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 461

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/342 (22%), Positives = 136/342 (39%), Gaps = 31/342 (9%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   +    +P G+P SV+P Y+ Y T+ A + F      +  ++ +L   GV    A+ 
Sbjct: 29  LKHLLTDVFLPAGYPSSVSPDYLRYQTYNAFQAFCSSLANLIASRAVLEGFGVGNASASA 88

Query: 187 GAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
               +  +L+D  GR+  +      G     + K  R   D+  +  A + L      + 
Sbjct: 89  THAMLLTVLQDVFGRLTTVAGGYYLGTALSPEAKTYRLLADVFND--ASIVLDVFQPFYA 146

Query: 246 FLPLA-------CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNI 294
            +P+A       C++   + +  V +  ++  +   FA   ++GD+ AK       +G +
Sbjct: 147 SVPVAGARVLALCSSGCLRALCGVCAGGSKAALAVHFASAGDLGDLNAKDGSRETVIGLV 206

Query: 295 ADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
             L GT L   I     +     ALL   ++  +Y  V++V + TL R R S+A   +++
Sbjct: 207 GLLSGTILLSYIGTPGATYAALCALLGA-HLALNYLAVRAVAMRTLTRQRASIAWRVWME 265

Query: 355 --TGHVPSLREGNLQENIFSFPWL-------RDKPVVLGSRFKDAF----QDPDAYLAVE 401
              G  P+  E   +E + + P L       R      G      F        A ++  
Sbjct: 266 RGAGAAPTPEEVAEREGLLADPTLIADGTGRRLGRCSFGGARSQGFIATTGKDSAMVSTS 325

Query: 402 PFFEKERYVVTYNPSKGKV---YALLKDQAKSDDILKAAFHA 440
              E  RY +  +   G +     LLK     +D+LKA  HA
Sbjct: 326 RLIEAARYWLLIDSKPGSIPTATILLKRVYGPEDLLKAWVHA 367


>gi|151555803|gb|AAI49327.1| FLJ13868 protein [Bos taurus]
          Length = 158

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 43/73 (58%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFPDSV+P Y+PY  W +++ F     G   T  +L  +GV   +A+  A    W++
Sbjct: 82  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 141

Query: 196 KDGAGRVGKMIFA 208
           KD  G +G+++FA
Sbjct: 142 KDSTGMLGRIVFA 154


>gi|3242710|gb|AAC23762.1| hypothetical protein [Arabidopsis thaliana]
          Length = 405

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 16/116 (13%)

Query: 103 RSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGF----------------PDSVTP 146
           +SGK  R  I+  SL+      EE  S      + DGF                P  V+ 
Sbjct: 71  KSGKVSRYFIKGDSLELLCVDEEEDDSTSFCLGLDDGFWKLIRLTSSAAKDFFLPKQVSD 130

Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRV 202
           +Y+ Y+ W+ L   F  A+ V  TQ +  ++G+ ++R+   + A NWILKDG GR+
Sbjct: 131 NYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSLASSAAFNWILKDGLGRL 186


>gi|302417830|ref|XP_003006746.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261354348|gb|EEY16776.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 228

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 54/110 (49%), Gaps = 1/110 (0%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
           F+R   +P  +PDSV+P Y+ Y  + + + FF    G+   + +L  +GV     +    
Sbjct: 76  FLRDAFLPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHA 135

Query: 190 AINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
            +   LKDG  R+  + FA R G   + + K+ RF  DL  +    +ELA
Sbjct: 136 LLLTCLKDGVSRLAMITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELA 185


>gi|115389162|ref|XP_001212086.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114194482|gb|EAU36182.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 411

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 60/118 (50%), Gaps = 1/118 (0%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L + +    +P G+P SVT  Y  Y  + +L+ F     G+ +++ +L  VGV    A+P
Sbjct: 53  LKTLLIEVFLPAGYPHSVTDDYTAYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNPNASP 112

Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
            +  +  IL+D +GR+  ++FA R G     + K  R A DL  +L   ++  + A P
Sbjct: 113 TSALLLHILQDTSGRLATILFAHRVGTALQPECKSYRLAADLFNDLAMILDCLSPAAP 170


>gi|452984686|gb|EME84443.1| hypothetical protein MYCFIDRAFT_187449 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 435

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 102/213 (47%), Gaps = 21/213 (9%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFP SV+  Y+ Y  + +L+ F      + +++ +L+SVGV  + A+P A  I  IL
Sbjct: 48  LPVGFPHSVSEDYLEYQIYDSLQAFSSSIASLLSSRAVLSSVGVGDSTASPTAAIILSIL 107

Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
           +D  GR+  ++FA   G   + + K  R   D                P + + +   ++
Sbjct: 108 QDSVGRLATILFAHSFGTSLEPECKMYRLLAD----------------P-IRVVVLSLSS 150

Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNP 311
           V +++  V + S++  +   FAK  N+G++ AK    E V ++  ++   L I       
Sbjct: 151 VLRSLCGVAAGSSKASLSAHFAKQGNLGELNAKDSSQETVISLVGMMAGSLVISWITTPI 210

Query: 312 SLITTFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
           +  T   LL   ++ ++ + V++V + TL R R
Sbjct: 211 ATWTALILLLSIHLETNRRAVRAVKMRTLTRQR 243


>gi|399218598|emb|CCF75485.1| unnamed protein product [Babesia microti strain RI]
          Length = 635

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 67/282 (23%), Positives = 117/282 (41%), Gaps = 40/282 (14%)

Query: 107 FKRNSIRAISLQSPQAPAE--------ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALK 158
           ++ NS+    L+SP+A  +          ++ I+   +   +P  VT +Y  Y  WR L+
Sbjct: 75  YRNNSLYNKRLKSPEAEKDPTFVTKLSNSLAKIKKRFLNIVYPSGVTSNYYKYANWRLLE 134

Query: 159 HFFGG-----AMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQ-- 210
            F        +      + LLN    S N      A  I++I  +   ++ K I +R   
Sbjct: 135 RFSHSMSQSISNNPLAAKILLNQSTYSTNDGVINIAYKIDFIATNLMTQIFKEIVSRIVK 194

Query: 211 -------GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA-CAANVAKNVAAV 262
                  G  FD + +   F G L+  +    +  +      ++ +      + + +  +
Sbjct: 195 TMWVGNIGIGFDMNPRAFCFWGSLICTITNTADFLSTLFNFKYISITRHITTLFRQIGML 254

Query: 263 TSTSTRTPIYKAFAKGEN---IGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFAL 319
           T++++  PIY +F    N   IG++TAK E V  + DLLG+    +IS     LI  +  
Sbjct: 255 TTSASIGPIYGSFKSQNNVSNIGEITAKMEAVTPLCDLLGSATGSVIS----YLIAPYQY 310

Query: 320 LSCGYV---------FSSYQEVKSVVLHTLNRARFSVAVESF 352
              G           +SSY  V+ VV  +LN +R  V +E F
Sbjct: 311 YYRGIAFAIAALIANYSSYCAVQMVVFRSLNISRCFVVLEDF 352


>gi|403418386|emb|CCM05086.1| predicted protein [Fibroporia radiculosa]
          Length = 497

 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 95/420 (22%), Positives = 169/420 (40%), Gaps = 86/420 (20%)

Query: 98  VAERDRSGKF-----------KRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTP 146
           +AERD SG+             R+   + S++  +     L+S +    +P G+P SV+P
Sbjct: 6   IAERDDSGRLLATYLCAAGEHTRSDTPSFSIKQKRTDWRRLLSNV---FLPAGYPASVSP 62

Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMI 206
            Y+ Y  + AL+ F      +  ++ +L   GV    A+        +L+D   R+  ++
Sbjct: 63  DYLQYQIYNALQAFCSSLASLIASRAVLEGHGVGNADASATHAIFLTVLQDIFSRLTTIV 122

Query: 207 FARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL---FLPL------------- 249
                G     + K  R   D+  +  A + L T + PHL    L L             
Sbjct: 123 SGYYLGTSLFPEAKTYRLLADIFND--AAIVLDTLS-PHLDGVVLSLRYPFVLSGSGSSL 179

Query: 250 ----ACAANVAKNVAAVTSTSTRTPIYKAFAK-GE---NIGDVTAKGECVGNIADLLG-- 299
                C +  ++ +  + +  ++  +   FA  G+   ++GD+ AK      +  LLG  
Sbjct: 180 RVSALCLSGASRALCGMVAGGSKAALTVHFASPGQGTGDVGDLNAKDGSKETVLALLGML 239

Query: 300 TGLSIM--ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
           +G ++M  +     + +  F +L+  ++F +Y  V+ VVL +LNR R ++   ++    H
Sbjct: 240 SGTAVMHYVHSARATYLVLF-VLTFLHLFGNYLAVRVVVLRSLNRQRANIVWMAY--RAH 296

Query: 358 VPSLREGN-----------LQENIFSFP-WLRD-----KPVV----LGSRFKD------- 389
           +     G             +E IF+ P  L D      P +    +GS F         
Sbjct: 297 IKGPMGGKRPLVLTPEAVAARERIFANPSALHDTSSPAHPFLGICKVGSSFSSLASRQQS 356

Query: 390 ------AFQDPDAYLAVEPFFEKERYVVTYNPSKG---KVYALLKDQAKSDDILKAAFHA 440
                 A   P     +   F +E YV+ + P+ G   +++  LKD  K  D LKA  HA
Sbjct: 357 APRGALAAPSPPQIHTILEIFSEEPYVLWFGPTSGPSCRLHVFLKDGHKPRDHLKAWTHA 416


>gi|392558453|gb|EIW51641.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 450

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 79/363 (21%), Positives = 146/363 (40%), Gaps = 47/363 (12%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P+G+P +V+P  V Y  W A++ F     G+F ++ +L   GV    A+        IL
Sbjct: 29  LPEGYPFTVSPQCVRYQIWNAVQAFCSSLAGLFASRAVLQGHGVGNASASATDAIFLTIL 88

Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC--- 251
           +D   R+  ++     G     + K  R   D+L +     +  +  + H    L+    
Sbjct: 89  QDVFSRLTTIVSGYYLGTSLYPEAKTYRLLADILNDAAMVTDTLSPHLAHFSFSLSYPFV 148

Query: 252 --AANVAKNVAAVTSTSTRTPIYKAFAKGE----------------NIGDVTAKGECVGN 293
             A+N +  V A+  +     +  A A G                 ++GD++AK      
Sbjct: 149 VPASNSSLRVVALCMSGAFRALCGAVAGGSKAALTVHFATAGERPGDVGDLSAKDGSKET 208

Query: 294 IADLLG--TGLSIM--ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAV 349
           +  LLG  +G  ++  +     + +  F L+ C ++ +++  V+ + +   NR R  +A 
Sbjct: 209 VLALLGMLSGSVVLRYVHTAQATYVVLFGLIFC-HLAANFIAVRVIAMRAFNRQRAGIAW 267

Query: 350 ESFLKTGH----VPSLREGNLQENIFS----FPWLRD-----KPVVLGSRF-----KDAF 391
             F ++      VP       QE IF+     P  +          LG  F     ++ F
Sbjct: 268 CEFRRSFDVVTVVPDYLHVARQERIFTDSARIPISQHARGTYAHCALGVSFLSLSGREPF 327

Query: 392 QDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLH-CINSS 450
            D      +E  F  E+Y++    +  ++  +LKD     D LKA  HA+ + + C   S
Sbjct: 328 SDEQIATLLE-VFAHEKYILWPASNPPQIAVVLKDGHGPHDHLKAWAHAHEIANICDRES 386

Query: 451 NGH 453
           + H
Sbjct: 387 HAH 389


>gi|326469821|gb|EGD93830.1| hypothetical protein TESG_01362 [Trichophyton tonsurans CBS 112818]
          Length = 413

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 72/311 (23%), Positives = 132/311 (42%), Gaps = 40/311 (12%)

Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
           VGV    A+P A  +  +L++  GR+  ++FA R G   + + K  R A D+L +    +
Sbjct: 40  VGVGDATASPTAALLLNVLQESMGRIATILFAHRLGTSLEPECKMYRLAADILNDSAMVL 99

Query: 236 ELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNI 294
           +  +   P  + + L   ++V + +  V + S++  +   FA+  N+G++ A G    NI
Sbjct: 100 DCLSPIFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVGL---NI 156

Query: 295 --ADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
             + L G  LS + S    + I    LL   ++ +++  V++V + TLNR R ++     
Sbjct: 157 PPSQLTGVVLSYITSPLE-TWIALIVLLIV-HLGTNHAAVRAVKMTTLNRQRANIVFSHL 214

Query: 353 LKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG------------------------ 384
            +   + +  E + +E IF       W  +   +LG                        
Sbjct: 215 FEDNLILTPAETSKEERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRGAL 274

Query: 385 --SRFKDAFQDPDAYL-AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
             SR      D    L A+   F +E Y++ + P   +   +LK  A +   LKA  HA 
Sbjct: 275 ETSRTTSTTMDASTNLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKAWSHAL 334

Query: 442 VLLHCINSSNG 452
           ++   +   NG
Sbjct: 335 LVSRHLARKNG 345


>gi|321459665|gb|EFX70716.1| hypothetical protein DAPPUDRAFT_327911 [Daphnia pulex]
          Length = 269

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 64/136 (47%), Gaps = 21/136 (15%)

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGR-VGKMIFAR-QGKK 213
           A++ F G   G  +T+ +L  VGV  + +TP A  + W++KDG+ R VG+++FA   G K
Sbjct: 11  AIQAFVGSLSGSLSTRAILEGVGVGSSTSTPLAATLMWLIKDGSTRMVGRILFAWCNGTK 70

Query: 214 FDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYK 273
            D D                    A          + C A VAK++  V   +TR  + +
Sbjct: 71  LDAD-------------------TAVGGSNGTMTSILCVAGVAKSIVGVAGGATRAALTQ 111

Query: 274 AFAKGENIGDVTAKGE 289
             A+  N+ DV+AK E
Sbjct: 112 HQARQGNLADVSAKVE 127


>gi|224010239|ref|XP_002294077.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970094|gb|EED88432.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 354

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 45/170 (26%), Positives = 79/170 (46%), Gaps = 19/170 (11%)

Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
           +P +V P Y+ Y  + +L+       GV +T  +L++ GV    AT  + A+ W ++DG 
Sbjct: 19  YPHTVAPGYLEYQFYDSLQGLSSYLRGVVSTTAVLSAAGVGDAAATAMSAAMTWAIRDGL 78

Query: 200 GRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
           G VG ++F+      FD  +K+ R   D+L ++G  +++A        LP+  +   A  
Sbjct: 79  GMVGGLLFSYVASSHFDAHVKEFRLVADVLNDVGLTLDMA--------LPVVLSWTRA-- 128

Query: 259 VAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK 308
                   T+  I   F+   N  DV+AK      +  L+G  L + +S 
Sbjct: 129 --------TKGNITDHFSISGNRADVSAKESTQETLVSLIGMALGVWLSN 170


>gi|56752623|gb|AAW24525.1| SJCHGC08554 protein [Schistosoma japonicum]
          Length = 127

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 42/73 (57%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  Y+ Y  W  ++ F     G   +Q +L  VGV  + AT    ++ W+ 
Sbjct: 46  LPVGYPSSVSDDYLEYQIWDTIQAFASSITGALASQAVLIGVGVGNSSATILGASLTWMF 105

Query: 196 KDGAGRVGKMIFA 208
           KDG+G +G+++FA
Sbjct: 106 KDGSGMIGRILFA 118


>gi|332372572|gb|AEE61428.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 101/205 (49%), Gaps = 17/205 (8%)

Query: 253 ANVAKNVAAVTST-STRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI----- 306
           A+  +N+  V    +TR  I    A  +N+ +++AK      I +L+G+  SI +     
Sbjct: 28  ASALRNIDYVCYCGATRASITHHQAIRDNMAEISAKDGTQETIVNLIGSFGSIFLLNYFS 87

Query: 307 -SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
            S+    LI +  +L   ++++++  VK+++ +T NR R ++ ++S+   G V +  + N
Sbjct: 88  SSRAEWILIISLMVL---HLYTNFLAVKALIFNTFNRERLALVLKSYFTIGSVLNPAKVN 144

Query: 366 LQENIFSFPWLRDK-----PVVLGSRFKDAFQDPDA--YLAVEPFFEKERYVVTYNPSKG 418
           + E++     L  K      +VLG   K +  D  A     ++  ++ ++Y++  N +K 
Sbjct: 145 VNESVLLGFGLNVKRMCGFDIVLGESIKKSLNDYAAADIKLLQSVYKDKKYMLLVNGTKK 204

Query: 419 KVYALLKDQAKSDDILKAAFHAYVL 443
           K+Y  L+    + D++   FHA  L
Sbjct: 205 KIYVCLEKGETAADVIAGYFHAVCL 229


>gi|429329557|gb|AFZ81316.1| hypothetical protein BEWA_007250 [Babesia equi]
          Length = 743

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 57/240 (23%), Positives = 90/240 (37%), Gaps = 30/240 (12%)

Query: 141 PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI--------- 191
           P   T  Y  Y  WR  + F    +    +        + +N+     + +         
Sbjct: 176 PSVATKQYYKYAHWRMFERFSFSVLQTMASNIKSAPTNLQQNKVENTNIILMLKNKISES 235

Query: 192 -------NWILKDGAGRVGKMI-FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV- 242
                    I KD   R+   + F+  G  FD + K  R  G +L      ++       
Sbjct: 236 KMKSILATMIFKDAFSRIFHFLWFSEIGVGFDNNPKAFRLLGSILCSTATLIDFVCNVFT 295

Query: 243 --PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFA---KGENIGDVTAKGECVGNIADL 297
             P + +  AC  N  + +  +T ++++ P Y +F       NIG++TAK E    I D 
Sbjct: 296 FGPKMIMG-ACT-NAVRQIGLLTMSASQGPFYNSFHVKNTATNIGEITAKLEAQNPICDF 353

Query: 298 LGTGLSIMIS---KRNPSLI--TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
            G  L I ++      P +I  TT  L S     ++Y  VKSV    LN  R  V +E F
Sbjct: 354 SGIALGIYLTNLLNEQPFVIQATTVILTSLISNVATYMCVKSVSFKNLNPQRCFVVLEDF 413


>gi|225560567|gb|EEH08848.1| DUF647 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 472

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 79/326 (24%), Positives = 129/326 (39%), Gaps = 67/326 (20%)

Query: 200 GRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA-ANVAK 257
           GR+  ++FA R G   + + K  R A D+L +     +  + A+P     +  A ++V +
Sbjct: 2   GRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVLR 61

Query: 258 NVAAVTSTSTRTPIYKAFAKGENIGDVTA---KGECVGNIADLLGTGLSIMIS------K 308
            +  V + S++  +   FAK  N+G++ A       V  I+     G   + S      +
Sbjct: 62  ALCGVAAGSSKASLSSHFAKWGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKLQ 121

Query: 309 RNPSLITTFALLS--CGYVFSS-----------------------YQEVKSVVLHTLNRA 343
           ++ S  T  +LL   CG V  S                       Y  V+SV + TLNR 
Sbjct: 122 KDSSQETVISLLGMLCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNMTTLNRQ 181

Query: 344 RFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG-SRFKDAFQD----- 393
           R ++   +  + G V +  E +  E IF       W +  P  LG  +   +FQ+     
Sbjct: 182 RANIVFSTLFEKGSVLTPTEASKCERIFERDGILRW-KASPAALGYCQIGGSFQELLRGS 240

Query: 394 -----------PDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYV 442
                       D Y  +   FEKE YV+ +NP   K   +LK+       LKA  HA +
Sbjct: 241 THGANSIRDVGIDIYKLLR-LFEKEEYVLWFNPIHKKGTIVLKNNVTPISQLKAWSHALI 299

Query: 443 LLHCINSSNGHRSLALKQHEYDNSNL 468
           +          R   L++ E D  N+
Sbjct: 300 V--------AKRLTDLRREEADRINV 317


>gi|302678497|ref|XP_003028931.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
 gi|300102620|gb|EFI94028.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
          Length = 278

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 65/252 (25%), Positives = 106/252 (42%), Gaps = 13/252 (5%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P GFP+SV+P Y+ Y  + A++ F     G+ +++ LL   GV    AT     +  + 
Sbjct: 9   LPAGFPNSVSPDYLRYQIYNAIQAFCSSLAGLLSSRALLEGFGVGDANATATNAVLLTVA 68

Query: 196 KDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATA---AVPHLFLPLA- 250
           +D   R+  +I A   G     + K  RF  D+  +     + +T    A  H  L +A 
Sbjct: 69  QDVFSRLTTIIAAYLLGPSLFPEAKTFRFLADVFNDTAIIFDASTPILRASGHANLRVAA 128

Query: 251 -CAANVAKNVAAVTSTSTRTPIYKAFA---KGE-NIGDVTAKG---ECVGNIADLLGTGL 302
            C A  ++ +        +  +   FA   +G  +IGDV AK    E +  +  +L    
Sbjct: 129 LCLAGASRAICGTICGGAKAALTLHFATPLRGTGDIGDVNAKDASKETLLALMGMLAGSF 188

Query: 303 SIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
            +       +       L   ++  +Y  V+ V L TLNR R S+A  SF + G  PS  
Sbjct: 189 VVHHLSSTQATHALLLALLAAHLTVNYLGVRGVCLRTLNRQRASLAWTSFREYGSAPSPT 248

Query: 363 EGNLQENIFSFP 374
                E I + P
Sbjct: 249 IVAGMEGILARP 260


>gi|255945167|ref|XP_002563351.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211588086|emb|CAP86157.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 504

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/246 (22%), Positives = 106/246 (43%), Gaps = 15/246 (6%)

Query: 125 EELMSFIRSYVVPDGFPDSVT-----PSYVPYMTWRALKH----FFGGAMGVFTTQTLLN 175
           + ++S +    +P G+P SV+     P  +P  + R++ H              T T   
Sbjct: 59  KPIISSLSDVFLPSGYPQSVSDDYLPPPLLPSGSSRSISHPAQLHRPTQRAKQLTPTQHT 118

Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAG 234
            VGV    A+P +  +  IL+D +GR+  + FA R G   + + K  R A D+  ++   
Sbjct: 119 GVGVGNANASPTSALLLHILQDISGRIATICFAHRIGTALEPECKTYRLAADVFNDIAMI 178

Query: 235 VELATAAVPH--LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVG 292
           ++  +  VP     + +   A V + +  V   S++  +   FAK  N+ ++ AK     
Sbjct: 179 LDCLSPGVPAGPARVAVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQE 238

Query: 293 NIADLLGTGLSIMISKRNPSLITTFALLSCGYVFS---SYQEVKSVVLHTLNRARFSVAV 349
            +  L G  +  ++     S  TT+ +L          +Y  V++V + +LNR R ++  
Sbjct: 239 TVISLFGMLVGSVVISHITSFSTTWLILLILLALHLSMNYAAVRAVQMTSLNRQRANIVF 298

Query: 350 ESFLKT 355
            +   +
Sbjct: 299 STLFDS 304


>gi|17945727|gb|AAL48912.1| RE31721p [Drosophila melanogaster]
          Length = 266

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/217 (23%), Positives = 98/217 (45%), Gaps = 13/217 (5%)

Query: 234 GVELATA-AVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE--- 289
           G+E+      PH    + C + + K +  V   +TR+ + +  A   N+ DV +K     
Sbjct: 2   GIEIYVLPKYPHFSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQE 61

Query: 290 -CVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
            CV  +A  +G  L  +I K    L T F ++   +++++ + V++V L + N +R+ +A
Sbjct: 62  TCVNLVASFVGLYLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIA 120

Query: 349 VESFLKTGHVPSLREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PF 403
           +E F ++  + S ++ N  E +     +     + LG   K+   +  +   +E     F
Sbjct: 121 LEEFFRSSRMLSPQQVNAMERVTVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSF 180

Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
              E +++        VY  L  + +  D+LKA F A
Sbjct: 181 DPHEHFIIAQTKKCLGVY--LHFETRPQDVLKAYFFA 215


>gi|440299036|gb|ELP91648.1| hypothetical protein EIN_206450 [Entamoeba invadens IP1]
          Length = 360

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 70/347 (20%), Positives = 137/347 (39%), Gaps = 36/347 (10%)

Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
           Q P     + ++  +VP G+P +V   Y  ++        F  AM       L+ S+  S
Sbjct: 35  QPPFLTTRNHLQDIIVPVGYPTTVERGYFRFVA-------FSVAMESCNMMLLVMSISAS 87

Query: 181 RNRATPGAVAINWILKD-GAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELAT 239
               +  + A   I ++  AG +  +I  R G    +  KQ R   +++ EL   +++A 
Sbjct: 88  LRIDSSHSAAFFVIFREVMAGTMHLIITERWGSSIVFYAKQWRMRIEIISELLRLIQIAF 147

Query: 240 AAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD--- 296
            A P +F  +    +  ++V   +    +T I   +AK  N+ ++T K + +  +A    
Sbjct: 148 IASP-IFYSINVTCSTIESVLNASRQVIKTKILSNYAKENNVAELTEKAQNIETLARVIS 206

Query: 297 -LLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
            L+G  +  ++       I  F L    +V  ++   + +V   +N  R  + ++ F   
Sbjct: 207 LLIGFIVLHLLGDFTEHGIIVFMLALIAHVVCNFLMGRVIVFKAINYERLMILMKYFTTV 266

Query: 356 GH------VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERY 409
            +      V S  E  ++ N         K + +G       +  D + A+    +  +Y
Sbjct: 267 KNEILSPLVVSAIESKIRVN-------EMKNIRMGVSLNSIPKSTDLFFAISQSIKTCQY 319

Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSN--GHR 454
            + Y  +   +  LL D+   + I +A       L C +S    GHR
Sbjct: 320 FL-YKNTNNTISVLLGDKITKEGIFEA-------LLCAHSQERVGHR 358


>gi|156088815|ref|XP_001611814.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154799068|gb|EDO08246.1| hypothetical protein BBOV_III006850 [Babesia bovis]
          Length = 540

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 75/170 (44%), Gaps = 12/170 (7%)

Query: 194 ILKDGAGRVGKMI-FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL--FLPLA 250
           +LKD   R+   + F + G  FD + K  R  G LL    AGV      +  L   + L 
Sbjct: 58  VLKDIFSRILNFVWFGQIGAGFDDNPKAFRLLGGLLCS-AAGVACFIGNMMQLEPKVVLT 116

Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFA---KGENIGDVTAKGECVGNIADLL----GTGLS 303
              NV K+V AVT  +++   +  F       N+G++TAK E    I D L    GT L 
Sbjct: 117 VLVNVIKHVGAVTMVASQATFHSTFCVKNAATNVGEITAKLETQSPICDFLGMAAGTSLG 176

Query: 304 IMISKRNPSLITT-FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
            + +    S+ T  F  +      ++Y   KSV   TLN  R  V +E+F
Sbjct: 177 ALTANMAASMQTVLFTGMCLTSNIATYMCAKSVCFRTLNPQRCFVLLENF 226


>gi|300175334|emb|CBK20645.2| unnamed protein product [Blastocystis hominis]
          Length = 290

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 60/239 (25%), Positives = 98/239 (41%), Gaps = 34/239 (14%)

Query: 237 LATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 296
           + T   P+ FL LA  +N+ K+++ + + +T   I   FA   NIGDVTAK     +   
Sbjct: 1   MITPLFPNCFLALATISNIGKSISFLANGATGAGIRYGFANAMNIGDVTAKEGSQTSAVY 60

Query: 297 LLGTGLSIMISKR--NPSLITTFA---LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
           LLG    + +S    + S+ T FA    +SC  +++  + ++ V L TLN  R  +A   
Sbjct: 61  LLGMLFGVSVSSLLGDQSMSTIFASVICMSCLSLYAMNKSLRCVTLPTLNTQRSELACVD 120

Query: 352 FLKTGHVPSLREGNLQENIFSFPWLR----DKPVVLGSRFKDAFQDPDAYLAVEP----- 402
           +   G +    E   +   +  P+L        + LG           AY ++E      
Sbjct: 121 YCSQGIIYQ-PEAVAKLERYYLPYLHVADCGTDIRLGMSLHSVL---SAYRSLEEADDVI 176

Query: 403 -FFEKERYVVTYNPSKGKVY---------------ALLKDQAKSDDILKAAFHAYVLLH 445
             F    Y+++Y   K  +Y                 L   A + D L+A  HA +L H
Sbjct: 177 QLFGDCNYMLSYCDLKQGMYRSIINRTEKTKCTIAISLNSSASTRDQLQAYLHALLLRH 235


>gi|301123209|ref|XP_002909331.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262100093|gb|EEY58145.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 122

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 57/117 (48%), Gaps = 20/117 (17%)

Query: 171 QTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLME 230
           Q+LL   G+    A P A A+N +L+DG G                   Q R    + ++
Sbjct: 2   QSLLYDAGLVAG-AIPTAAAVNRVLRDGLG-------------------QWRITSSVALD 41

Query: 231 LGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 287
           +    E+ T   P  FL +A   NVAKNVA +++++TR   + +F   EN+ DVTAK
Sbjct: 42  VAVLGEILTPLAPGSFLVIASLTNVAKNVAWLSASATRAGFHNSFVIRENLADVTAK 98


>gi|326479045|gb|EGE03055.1| DUF647 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 494

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 63/297 (21%), Positives = 120/297 (40%), Gaps = 49/297 (16%)

Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLP 248
           I + + +  GR+  ++FA R G   + + K  R A D+L +    ++  +   P  + + 
Sbjct: 144 IEYQIYESMGRIATILFAHRLGTSLEPECKMYRLAADILNDSAMVLDCLSPIFPKPVRVG 203

Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK 308
           L   ++V + +  V + S++  +   FA+  N+G++ A G              S+++S 
Sbjct: 204 LLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVG--------------SVVVSY 249

Query: 309 RNPSLITTFALLSCGYVF--SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
               L T  AL+    V   +++  V++V + TLNR R ++      +   + +  E + 
Sbjct: 250 ITSPLETWIALIVLLIVHLGTNHAAVRAVKMTTLNRQRANIVFSHLFEDNLILTPAETSK 309

Query: 367 QENIFS----FPWLRDKPVVLG--------------------------SRFKDAFQDPDA 396
           +E IF       W  +   +LG                          SR      D   
Sbjct: 310 EERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRGALETSRTTSTTMDAST 369

Query: 397 YL-AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNG 452
            L A+   F +E Y++ + P   +   +LK  A +   LKA  HA ++   +   NG
Sbjct: 370 NLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKAWSHALLVSRHLARKNG 426


>gi|134084659|emb|CAK43337.1| unnamed protein product [Aspergillus niger]
          Length = 484

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 84/196 (42%), Gaps = 22/196 (11%)

Query: 65  DETTAFRLVE-DQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRA-ISLQSPQA 122
           +++TA    E D+    +   I SE   G      A R+   K   N  R  ++  S  A
Sbjct: 5   EDSTAITFTEVDEVNHPTATYIYSEPAAG-----PAHREHGKKHGENWGRVDVAHASSTA 59

Query: 123 PA----EELMSFIRSYVVPDGFPDSVTPSYVP-YMTWRALKHFFGGAMGVFTT-QTLLNS 176
           PA    + +  F+    +P G+P SVT  Y P Y T R +      ++        +L S
Sbjct: 60  PAAWSSKSVQDFLVEVFLPAGYPHSVTDDYAPPYQTRRLIHEGLSPSLQQLDCWPPILES 119

Query: 177 --------VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDL 227
                   VGV    A+P A  +  IL+D +GR+  ++FA R G   + + K  R A D+
Sbjct: 120 SAARLPLGVGVGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADI 179

Query: 228 LMELGAGVELATAAVP 243
             +L   ++  +  +P
Sbjct: 180 FNDLAMILDCLSPMIP 195


>gi|425769885|gb|EKV08364.1| hypothetical protein PDIP_68690 [Penicillium digitatum Pd1]
 gi|425771531|gb|EKV09972.1| hypothetical protein PDIG_59240 [Penicillium digitatum PHI26]
          Length = 406

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 44/188 (23%), Positives = 86/188 (45%), Gaps = 6/188 (3%)

Query: 174 LNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELG 232
           ++ VGV    A+P +  +  IL+D +GR+  + FA R G   + + K  R A D+  ++ 
Sbjct: 3   ISRVGVGNASASPTSALLLQILQDISGRIATICFAHRVGTALEPECKTYRLAADVFNDIA 62

Query: 233 AGVELATAAVPH--LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGEC 290
             ++  +  VP     + +   A V + +  V   S++  +   FAK  N+ ++ AK   
Sbjct: 63  MILDCLSPGVPAGPARVIVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSS 122

Query: 291 VGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFS---SYQEVKSVVLHTLNRARFSV 347
              +  L G  +  ++     S  TT+ +L          +Y  V++V + +LNR R ++
Sbjct: 123 QETVISLFGMLVGSVVISHITSFSTTWLVLLFLLALHLSMNYAAVRAVRMTSLNRQRANI 182

Query: 348 AVESFLKT 355
              + L +
Sbjct: 183 VFSALLAS 190


>gi|215259607|gb|ACJ64295.1| conserved hypothetical protein [Culex tarsalis]
          Length = 234

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 58/109 (53%), Gaps = 5/109 (4%)

Query: 266 STRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKRNPSLITTFALLS 321
           +TR+ + +  A   N+ DV +K      CV  IA  +G GL   +  +   L   F  ++
Sbjct: 3   ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHVQNQRV-LYGLFLFVT 61

Query: 322 CGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
             +++++ + VK+V L T N AR+ +A+E + K+G + S ++ N  E +
Sbjct: 62  LLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGSMLSPQQVNKLERV 110


>gi|384494916|gb|EIE85407.1| hypothetical protein RO3G_10117 [Rhizopus delemar RA 99-880]
          Length = 247

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/181 (23%), Positives = 81/181 (44%), Gaps = 35/181 (19%)

Query: 273 KAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPS-LITTFALLSCGYVFSSY 329
           K F+   NIGD+ AK +   + A LLG  +G+ ++    +P  L   FA+LS   ++S+ 
Sbjct: 19  KHFSPNGNIGDIVAKDDAQMSTAHLLGMLSGVGLISISHSPLFLFGAFAVLSPINIWSTV 78

Query: 330 QEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKD 389
           + + +     LN+A+ ++    ++  GHV +  +            LRD+ +  G   K 
Sbjct: 79  KMLHAAEFEILNQAKLTLLAREYIDQGHVATYEK------------LRDREIGFGEWIKP 126

Query: 390 AFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA----------LLKDQAKSDDILKAAFH 439
           +++     ++++         +   PS  + Y            LKD A S D+LK+  H
Sbjct: 127 SYRKKRGAISIK---------IKMGPSAEQAYGSSHEVQEVVHTLKD-AGSKDVLKSVLH 176

Query: 440 A 440
           +
Sbjct: 177 S 177


>gi|303281718|ref|XP_003060151.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458806|gb|EEH56103.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 814

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 55/110 (50%), Gaps = 1/110 (0%)

Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
           +R   +P+G+P SV+  Y+ +    A +     A G+  +  LL   GV  + AT  +  
Sbjct: 233 LRRAFLPEGYPRSVSSDYLAFQLCDAAQGVCSYARGILCSTALLRGAGVGSSTATAASAT 292

Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELAT 239
             ++ +D    +G + FA  +G+  D D K+ R   D + +LG  +ELA+
Sbjct: 293 AQFVARDLTSMLGGVAFAATRGRAMDADAKRWRLFADAMNDLGMAIELAS 342



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/130 (25%), Positives = 65/130 (50%), Gaps = 21/130 (16%)

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIAD--- 296
           + FL  AC   +A+++  VTS +TR  + + FA G++ G    +G       G++AD   
Sbjct: 408 YYFLVAACLGGLARSLCGVTSRATRAALTQHFAAGDDAGRSFTRGRTGGSTTGSVADVDA 467

Query: 297 ----------LLGTGLSIMISK----RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
                     L+G G  +++++       ++   FALL+  +VF + + ++S+ L+T+NR
Sbjct: 468 KEGTQETAATLIGMGAGVLVTRLAADDAAAIWFWFALLTALHVFFNVRAMRSLRLNTMNR 527

Query: 343 ARFSVAVESF 352
               + +  F
Sbjct: 528 VVVRILLRRF 537


>gi|342186523|emb|CCC96010.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 596

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 61/276 (22%), Positives = 112/276 (40%), Gaps = 35/276 (12%)

Query: 104 SGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGG 163
           S K +   IR +S +  ++P + L    R + +P G+PDS    +  Y        F   
Sbjct: 115 STKARPGEIRLVS-RRKESPLQRL----RLFGMPTGYPDSCAEGFRRYFLLSLCSTFVSS 169

Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD------GAGRVGKMIFARQGKKFDYD 217
                  Q++LN   +    ++P      W+LKD       A    +++      KF + 
Sbjct: 170 FASSIAYQSILNGFLLV---SSPQL----WMLKDLAPALAAAYLANRVVSYENRPKFWFV 222

Query: 218 LKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAK 277
           +  +     +++E+     +  + VPH  L  A   +  +  A++    TR    + FA 
Sbjct: 223 VSVVLHNTSIVLEM-----IIPSVVPHNLLTAAVLTSCVRQSASLMFLVTRASALQHFAV 277

Query: 278 GENIGDVTAK----GECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGY-VFSSYQEV 332
             N+ ++T K    G  +  ++  LG   +  ++     L+T   L  CG  +  SY  +
Sbjct: 278 SGNLAELTKKFNSFGIVIYTVSTALGIAYTYYVASLTAQLVT--VLFCCGANLVLSYMSM 335

Query: 333 KSVVLHTLNRARFSVAVESFLKT-----GHVPSLRE 363
            ++    LN    SV + +++K      GHV S RE
Sbjct: 336 CNIAFRVLNGTTISVILRAYVKEGCLGPGHVLSPRE 371


>gi|83767160|dbj|BAE57299.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 165

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/134 (24%), Positives = 66/134 (49%), Gaps = 3/134 (2%)

Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
           +L+ F     G+ +++ +L  VGV    A+P A  +  IL+D +GR+  ++FA R G   
Sbjct: 25  SLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHILQDTSGRISTILFAHRVGTAL 84

Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACAANVAKNVAAVTSTSTRTPIY 272
           + + K  R A D+  +    ++  +  +P  F  + +   A V + +  V   S++  + 
Sbjct: 85  EPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLSTAGVLRALCGVAGGSSKASLS 144

Query: 273 KAFAKGENIGDVTA 286
             F++  N+ +V A
Sbjct: 145 AHFSRWGNLAEVNA 158


>gi|215259609|gb|ACJ64296.1| conserved hypothetical protein [Culex tarsalis]
          Length = 234

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 58/109 (53%), Gaps = 5/109 (4%)

Query: 266 STRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKRNPSLITTFALLS 321
           +TR+ + +  A   N+ DV +K      CV  IA  +G GL   + + +  L   F  L+
Sbjct: 3   ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHV-QNHRVLYGLFLFLT 61

Query: 322 CGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
             +++ + + V++V L T N AR+ +A+E + K+G + S ++ N  E +
Sbjct: 62  LLHIYPNIKAVQAVCLRTFNEARYLIALEEYFKSGSMLSPQQVNKLERV 110


>gi|335308911|ref|XP_003361419.1| PREDICTED: UPF0420 protein C16orf58 homolog, partial [Sus scrofa]
          Length = 210

 Score = 48.5 bits (114), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 31/132 (23%), Positives = 66/132 (50%), Gaps = 6/132 (4%)

Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF 373
           +  F  L+  +++++Y+ V+++V+ TLN  R  + ++ FL+T  V      N  E +++ 
Sbjct: 3   LCCFFFLTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQTQEVLGPTAANRMEPLWTG 62

Query: 374 PWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSD 431
            W    P +  S      +   + L ++   E  +E Y++ ++  + +V  +L   A  +
Sbjct: 63  FW----PSLFLSLGPPLHRLTSSVLELQQLVEGHREPYLLRWDQPRNQVQVVLSPMAGPE 118

Query: 432 DILKAAFHAYVL 443
            IL+AA H  +L
Sbjct: 119 TILRAATHGLLL 130


>gi|159467581|ref|XP_001691970.1| hypothetical protein CHLREDRAFT_170864 [Chlamydomonas reinhardtii]
 gi|158278697|gb|EDP04460.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 131

 Score = 47.4 bits (111), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 12/95 (12%)

Query: 199 AGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVA 256
           AG  G ++FA  +G  FD   KQ R   D+  +LG  VELA+  +P  LFLP AC  ++A
Sbjct: 2   AGMAGGILFAYAEGSSFDACAKQWRLFADIANDLGMTVELASPLLPRALFLPCACFGSIA 61

Query: 257 KNVAAVT-------STSTRTPIYKAFAKGENIGDV 284
           ++V             +   P+Y    +G++ G V
Sbjct: 62  RSVTVFALILPRALQANAHVPVYL---RGQDGGGV 93


>gi|449543750|gb|EMD34725.1| hypothetical protein CERSUDRAFT_116920 [Ceriporiopsis subvermispora
           B]
          Length = 507

 Score = 46.6 bits (109), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 56/281 (19%), Positives = 112/281 (39%), Gaps = 37/281 (13%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P++V+P Y+ Y  +  L+        +  ++ +L   GV    A+     +  I+
Sbjct: 58  LPAGYPNTVSPDYLHYQIFNGLQAGCSSIASLLASRAVLEGHGVGEASASATNAMLVNII 117

Query: 196 KDGAGR---VGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA-- 250
           +D   R   +G   +   G     + K+ R   D+  +    +++ +    H  + L   
Sbjct: 118 QDVVSRLITIGSGYYL--GTSLFPEAKKYRLLADVYNDAAVVLDVISPLFTHWSISLVYP 175

Query: 251 ---------------CAANVAKNVAAVTSTSTRTPIYKAFAKG----ENIGDVTAKGECV 291
                          C +   + +  + +  ++  +   FA       +IGD+ AK    
Sbjct: 176 FIRREQGFYLRVLALCLSGSFRALCGMVAGGSKAALTMHFATAGPVPGDIGDLNAKDGSK 235

Query: 292 GNIADLLGTGLSIMISKRNPSLITTFALLS---CGYVFSSYQEVKSVVLHTLNRARFSVA 348
             +  L+G  L   + K   S   T+A+LS     ++  ++  V+ VV+ T NR R + A
Sbjct: 236 ETVLALIGMLLGTFVIKHVDSGFATYAVLSMLILTHLAINFVAVRGVVMRTFNRHRAAAA 295

Query: 349 VESFLKTGHVPSLREGNL--------QENIFSFPWLRDKPV 381
             S+ +     +  EG +        +E IF+ P L   P+
Sbjct: 296 WASYRRYEQGIAQYEGAILTPELMAVKERIFADPTLLMHPL 336


>gi|449280556|gb|EMC87826.1| UPF0420 protein [Columba livia]
          Length = 111

 Score = 45.8 bits (107), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 36/67 (53%)

Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
            ++  ++P GFP+SV+  Y+ Y  W AL+       G    +++L +VGV    A+    
Sbjct: 18  LVQRVLLPQGFPESVSSDYLGYQLWDALQALCSSLAGAVAARSVLQAVGVGDTNASVTGA 77

Query: 190 AINWILK 196
           A+ W+L+
Sbjct: 78  ALGWLLR 84


>gi|154278545|ref|XP_001540086.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150413671|gb|EDN09054.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 425

 Score = 45.4 bits (106), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 44/166 (26%), Positives = 70/166 (42%), Gaps = 31/166 (18%)

Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDK 379
           ++ ++Y  V+SV + TLNR R ++   +  + G V +  E +  E IF       W +  
Sbjct: 189 HLSTNYAAVRSVNMTTLNRQRANIVFSTLFEKGSVLTPTEASKCERIFERDGILRW-KAS 247

Query: 380 PVVLG-SRFKDAFQD----------------PDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
           P  LG  +   +FQ+                 D Y  +   FEKE YV+ +NP   K   
Sbjct: 248 PAALGYCQIGGSFQELLRGSIHGANSIRDVGIDIYKLLR-LFEKEEYVLWFNPIHKKGTI 306

Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNL 468
           +LK+       LKA  HA ++          R   L++ E D  N+
Sbjct: 307 VLKNNVTPISQLKAWSHALIV--------AKRLTDLRREEADMVNV 344


>gi|449682081|ref|XP_002168725.2| PREDICTED: UPF0420 protein C16orf58 homolog [Hydra magnipapillata]
          Length = 237

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 40/189 (21%), Positives = 84/189 (44%), Gaps = 12/189 (6%)

Query: 259 VAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPSLIT- 315
           +  V   +TR  +    A+  N+ DV+AK      + +L+   TG  I     +   +T 
Sbjct: 7   IVGVAGGATRAALTLHQARRNNMADVSAKDGSQETLVNLMALFTGFLITPLVVDNLFLTW 66

Query: 316 -TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP 374
             F + +  +++++YQ V +VV+ T+N  R  + V+ +   G V + +E N  + +   P
Sbjct: 67  YFFLVFTFLHLYANYQAVSTVVMETINLPRLHILVKHYFIYGKVLTPKEVNKVDPVLRRP 126

Query: 375 WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDIL 434
             +   V LG ++ D        L  +   +   + + YN     ++  L       ++L
Sbjct: 127 GFKCS-VHLGCQYADV-------LKCDEILDLPGFSLGYNSKTSTIHVSLYKDTSEHEVL 178

Query: 435 KAAFHAYVL 443
           +++F A ++
Sbjct: 179 QSSFLAMLV 187


>gi|409046959|gb|EKM56438.1| hypothetical protein PHACADRAFT_253558 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 464

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 62/276 (22%), Positives = 109/276 (39%), Gaps = 37/276 (13%)

Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
           +P G+P SV+  Y  Y    AL+ FF    G+   + +L   GV    A+     +  IL
Sbjct: 3   LPAGYPASVSGDYTRYQVLNALQGFFSSLGGLVAARAVLEGHGVGSADASATNALLLTIL 62

Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMEL------------GAGVELATAAV 242
           +D   RV  ++     G     + K  RF  D+  ++               ++    A 
Sbjct: 63  QDIFSRVTAILSGYYLGTALFPEAKTYRFLADIWNDIPLVLDTLSPLLSHVSLDFVWLAF 122

Query: 243 PHLFLPL------------ACAANVAKNVAAVTSTSTRTPIYKAFAKGE------NIGDV 284
           P   LP              C + + +++ AV +  ++  +   F++ +      +I D+
Sbjct: 123 PPFLLPTLVHRPGPWRGIALCFSGIFRSMCAVAAAGSKAALTLHFSQPDHGAASGDISDL 182

Query: 285 TAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFS---SYQEVKSVVLHTLN 341
           +AK      +  L+G     +I     +   T  LL+   VF    +Y  V+SVV  + N
Sbjct: 183 SAKDGSKETVLALVGMLCGSIIMPLIQTQTQTCGLLAVIIVFHIYINYYAVRSVVFKSYN 242

Query: 342 RARFSVAVESFLKT-GHVPSL--REGNLQENIFSFP 374
           R R  +   +F ++ G    L  RE    E++F  P
Sbjct: 243 RQRTCILWSAFRRSEGEHRCLVPREVARHEHLFHNP 278


>gi|402580829|gb|EJW74778.1| hypothetical protein WUBG_14315 [Wuchereria bancrofti]
          Length = 191

 Score = 43.5 bits (101), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 37/137 (27%), Positives = 67/137 (48%), Gaps = 6/137 (4%)

Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNI-ADLLGTGLSIMISKRNPS 312
           + V  V  ++TRT +    A  +N+GDV AK    E + N+ A L    L  ++SK    
Sbjct: 3   RTVVGVAGSATRTTVINHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLPVVSKNAVF 62

Query: 313 LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS 372
           +   F L +  +++ +Y+ VK +   TLN++   + V+ +++T  + ++ E N +E I  
Sbjct: 63  VWLLFCLFTFIHLYGNYRAVKMLQFRTLNQSLLRIVVKDYIQTRKIGTVNEMNNKEPIL- 121

Query: 373 FPWLRDKPVVLGSRFKD 389
             W   +    G R  D
Sbjct: 122 LHWSSSRH-YYGCRLSD 137


>gi|291410213|ref|XP_002721396.1| PREDICTED: HEAT repeat containing 3 [Oryctolagus cuniculus]
          Length = 756

 Score = 43.1 bits (100), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 67/296 (22%), Positives = 118/296 (39%), Gaps = 54/296 (18%)

Query: 63  ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAI--SLQSP 120
           E  ET A    +   ++  + + C+ED     W  ++  D S  F  +S       L SP
Sbjct: 428 ELRETIALLTAQQTALEIIVNMCCTEDPSDDEWEELSSSDESDAFMESSFSECGGQLLSP 487

Query: 121 QAPAEELMSFIRSYVVPD------GFPDSV-------TPSYVPYM------TWRALKHFF 161
              + E+ + + +Y++P        FP+SV       +PS+ P +       WRAL    
Sbjct: 488 LCLSHEVHTALTNYLIPKKVFEKTAFPNSVAVDICSRSPSWKPLIRKMNTVQWRAL---- 543

Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLK 219
                    Q+LL+ + V      P        L+  A  + ++IFA+    K  D+ L+
Sbjct: 544 ------VCLQSLLSLLDVDHLGGAPA-------LQTLAQHLSQLIFAQPDFAKHADF-LE 589

Query: 220 QLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGE 279
            +  A   L++  A   ++    P   + L C A V       ++T  R  +        
Sbjct: 590 AVSSALRALLQTMASKNISQCMTPEQLMTL-CRAGVHS-----SNTGVRVNVVSILGI-- 641

Query: 280 NIGDVTAKGECVGNIADLLGTGLSIM-ISKRNPSLITTFALLSCGY-VFSSYQEVK 333
             G V AK +  G +  L   G  ++ ++ ++PSL+     L   + VF+  QE +
Sbjct: 642 -TGSVLAKED--GTLETLKTIGCFLLEVATKDPSLVVAGEALDALFDVFADGQEAE 694


>gi|397621626|gb|EJK66381.1| hypothetical protein THAOC_12704 [Thalassiosira oceanica]
          Length = 480

 Score = 42.7 bits (99), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 57/126 (45%), Gaps = 1/126 (0%)

Query: 114 AISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTL 173
           ++S  SP++    L        +P G P +    Y+ Y     ++       GV +T   
Sbjct: 40  SLSSLSPRSVPSALSGLFGDLFLPAGHPSTTAGGYLHYQVCDGVQGLCSYLRGVVSTGAA 99

Query: 174 LNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELG 232
           L + GV  + AT    A+ W ++DG G +G + F+     KFD  +K+ R   D+  + G
Sbjct: 100 LAAAGVGDSDATAMGAAVTWAVRDGLGMLGGLAFSYYASSKFDSRVKEYRLLADVANDAG 159

Query: 233 AGVELA 238
             +++A
Sbjct: 160 MILDMA 165


>gi|326927265|ref|XP_003209813.1| PREDICTED: HEAT repeat-containing protein 3-like [Meleagris
           gallopavo]
          Length = 667

 Score = 42.0 bits (97), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 64/269 (23%), Positives = 109/269 (40%), Gaps = 32/269 (11%)

Query: 14  NSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGA-AATLLQS---ESDETTA 69
           N    AA +  ++  + V +   IS +      P G G G    + LL S   E  E TA
Sbjct: 287 NRLKPAAEAETEENVIDVTDNCAISEDDGMEEIPKGKGSGEHDISDLLPSDKWELKEVTA 346

Query: 70  FRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAIS--LQSPQAPAEEL 127
             + +   ++  + + CSED     W  ++  D S  F  NS    S  L SP   + E+
Sbjct: 347 LLMAQQTALEIIVNMCCSEDPSDDEWEELSSSDESDLFMENSYSEGSGLLMSPLCLSAEV 406

Query: 128 MSFIRSYVVPD------GFPDSV-------TPSYVPYMTWRALKHFFGGAMGVFTTQTLL 174
            S   + ++P        FP+SV        PS+ P +  + L      A+    +  L+
Sbjct: 407 HSAFLNNLIPKKILEKTAFPNSVALDICMHNPSWKPLI--KKLNTVQCRALTCLHSILLV 464

Query: 175 NSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYD-LKQLRFAGDLLMELGA 233
           + V         GA A    L+  A  + ++IF++     D + L+ +  A   L++  A
Sbjct: 465 SDVD-----CLGGASA----LQSLAQHLSQLIFSKAELPEDTEFLEAITSALRALLQTMA 515

Query: 234 GVELATAAVPHLFLPLACAANVAKNVAAV 262
              ++    P   L L C A++  + A+V
Sbjct: 516 SKNISQCMTPEQLLAL-CEASIHSSNASV 543


>gi|313238188|emb|CBY13282.1| unnamed protein product [Oikopleura dioica]
          Length = 93

 Score = 41.2 bits (95), Expect = 1.4,   Method: Composition-based stats.
 Identities = 19/75 (25%), Positives = 33/75 (44%)

Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
           L   I    +P  +P+SV+  Y  Y  W + +       G     ++   +GV  + A+ 
Sbjct: 18  LKGLICKVFLPKDYPNSVSEDYTTYQVWDSAQALCSSVTGSLAINSVFKGIGVGDSSASS 77

Query: 187 GAVAINWILKDGAGR 201
            +   NWI++DG  R
Sbjct: 78  LSATSNWIIRDGIDR 92


>gi|261335583|emb|CBH18577.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 595

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 52/238 (21%), Positives = 93/238 (39%), Gaps = 12/238 (5%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           E L+  +  + +P G+PDS    +  Y        F          Q++LN   ++   +
Sbjct: 155 ESLLQRLHLFGMPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSILNGFLLA---S 211

Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELAT-AAVP 243
           +P      W+LKD A  +     A +   ++   K        L  L    E+     VP
Sbjct: 212 SPQL----WMLKDLAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVP 267

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
           +  L  A   +  +  A++    TR    + FA   N+ ++T K    G +   + T L 
Sbjct: 268 NHLLAAAILTSCVRQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALG 327

Query: 304 IMISKRNPSLITTFA--LLSCGYVFS-SYQEVKSVVLHTLNRARFSVAVESFLK-TGH 357
           I  +   PS+    A  L+ CG   + S+  + ++    LN    SV +  +++  GH
Sbjct: 328 IAYTSLVPSVTAQLATVLMCCGANLAISHLSMCNIAFRILNETTLSVILRFYMREEGH 385


>gi|74026006|ref|XP_829569.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70834955|gb|EAN80457.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 595

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 52/238 (21%), Positives = 93/238 (39%), Gaps = 12/238 (5%)

Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
           E L+  +  + +P G+PDS    +  Y        F          Q++LN   ++   +
Sbjct: 155 ESLLQRLHLFGMPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSILNGFLLA---S 211

Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELAT-AAVP 243
           +P      W+LKD A  +     A +   ++   K        L  L    E+     VP
Sbjct: 212 SPQL----WMLKDLAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVP 267

Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
           +  L  A   +  +  A++    TR    + FA   N+ ++T K    G +   + T L 
Sbjct: 268 NHLLAAAILTSCVRQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALG 327

Query: 304 IMISKRNPSLITTFA--LLSCGYVFS-SYQEVKSVVLHTLNRARFSVAVESFLK-TGH 357
           I  +   PS+    A  L+ CG   + S+  + ++    LN    SV +  +++  GH
Sbjct: 328 IAYTSLVPSVTAQLATVLMCCGANLAISHLSMCNIAFRILNETTLSVILRFYMREEGH 385


>gi|449473073|ref|XP_002193970.2| PREDICTED: HEAT repeat-containing protein 3 [Taeniopygia guttata]
          Length = 638

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 66/292 (22%), Positives = 114/292 (39%), Gaps = 36/292 (12%)

Query: 63  ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAIS--LQSP 120
           E  E TA    +   ++  + + CSED     W  ++  D S  F  NS    S  L SP
Sbjct: 311 ELKEVTALLTAQQTALEIIVNMCCSEDPSDDEWEELSSSDESDLFMENSYNEGSGLLLSP 370

Query: 121 QAPAEELMSFIRSYVVPD------GFPDSV-------TPSYVPYMTWRALKHFFGGAMGV 167
              ++E+ S   + ++P        FP+SV        PS+ P +  + L      A+  
Sbjct: 371 LCLSDEVNSAFLNNLIPKKILEKTAFPNSVALDICMHNPSWKPLI--KKLNAVQCRALTC 428

Query: 168 FTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYD-LKQLRFAGD 226
             +  L++ V         GA A    L+  A  + ++IF++     D + L+ +  A  
Sbjct: 429 LQSILLVSDVD-----CLGGASA----LQSLAQHLSQLIFSKTELPTDTEFLEAVTSALR 479

Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
            L++  A   +     P   L L C A +       ++TS R  +          G V A
Sbjct: 480 ALLQTMASKNIPQCMTPEQLLAL-CEAGIQS-----SNTSVRVNMVSILGIS---GSVLA 530

Query: 287 KGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLH 338
           KG+       ++G  L  +  K +  ++   AL +   VF+  +E +   L 
Sbjct: 531 KGQGAAETLKMIGKFLLEVAMKESSLVVAGEALDALFDVFADGEEAERAALQ 582


>gi|449528331|ref|XP_004171158.1| PREDICTED: uncharacterized protein LOC101231720 [Cucumis sativus]
          Length = 157

 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 2/66 (3%)

Query: 378 DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAA 437
           +K + LGS+  D     +  L +   F KE Y+++ +  +GK   +LK+ A   D+LKA 
Sbjct: 27  EKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEH--RGKYCVMLKESASPVDMLKAV 84

Query: 438 FHAYVL 443
           FH   L
Sbjct: 85  FHVNYL 90


>gi|397582228|gb|EJK52211.1| hypothetical protein THAOC_28545 [Thalassiosira oceanica]
          Length = 263

 Score = 39.7 bits (91), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 33/140 (23%), Positives = 61/140 (43%), Gaps = 13/140 (9%)

Query: 173 LLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMEL 231
           +   +GV R  AT     +++ L+D  G    + F A   +     +KQ +F   L+++L
Sbjct: 123 IFEGLGVGRASATALVATMSFALRDFFGICSSLAFSAVVSRNLARHVKQWKFFSALMLDL 182

Query: 232 GAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKG-----ENIGDVTA 286
           G  +++        FLPL C  ++ + +  VT++   + I   +A       + I ++ +
Sbjct: 183 GLAMKVTANLKRQWFLPLLCGGSICEAMYGVTTSPCTSIINLHWANTVLGSEDGIAEIIS 242

Query: 287 K-------GECVGNIADLLG 299
           K        E VG   D LG
Sbjct: 243 KRRAQRTLNELVGGRTDGLG 262


>gi|335056143|dbj|BAK39267.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056145|dbj|BAK39268.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056147|dbj|BAK39269.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056473|dbj|BAK39241.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056475|dbj|BAK39242.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056477|dbj|BAK39243.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056479|dbj|BAK39244.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056481|dbj|BAK39245.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056483|dbj|BAK39246.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056485|dbj|BAK39247.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056487|dbj|BAK39248.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056489|dbj|BAK39249.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056491|dbj|BAK39250.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056493|dbj|BAK39251.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056495|dbj|BAK39252.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056497|dbj|BAK39253.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056499|dbj|BAK39254.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056501|dbj|BAK39255.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056503|dbj|BAK39256.1| similar to EMB1879 [Oryza rufipogon]
 gi|335056505|dbj|BAK39257.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056507|dbj|BAK39258.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056509|dbj|BAK39259.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056511|dbj|BAK39260.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056513|dbj|BAK39261.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056515|dbj|BAK39262.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056517|dbj|BAK39263.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056519|dbj|BAK39264.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056521|dbj|BAK39265.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056777|dbj|BAK39266.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056779|dbj|BAK39270.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056781|dbj|BAK39271.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056783|dbj|BAK39272.1| similar to EMB1879 [Oryza sativa Japonica Group]
 gi|335056785|dbj|BAK39273.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056787|dbj|BAK39274.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056789|dbj|BAK39275.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056791|dbj|BAK39276.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056793|dbj|BAK39277.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056795|dbj|BAK39278.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056797|dbj|BAK39279.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056799|dbj|BAK39280.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056801|dbj|BAK39281.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056803|dbj|BAK39282.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056805|dbj|BAK39283.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056807|dbj|BAK39284.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056809|dbj|BAK39285.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056811|dbj|BAK39286.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056813|dbj|BAK39287.1| similar to EMB1879 [Oryza sativa Indica Group]
 gi|335056815|dbj|BAK39288.1| similar to EMB1879 [Oryza sativa Japonica Group]
          Length = 21

 Score = 39.7 bits (91), Expect = 3.4,   Method: Composition-based stats.
 Identities = 19/21 (90%), Positives = 19/21 (90%)

Query: 288 GECVGNIADLLGTGLSIMISK 308
           GE VGNIADLLGTGLSI ISK
Sbjct: 1   GESVGNIADLLGTGLSIFISK 21


>gi|7020991|dbj|BAA91342.1| unnamed protein product [Homo sapiens]
          Length = 594

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)

Query: 63  ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
           E  ET A    +   ++  + + C+ED     W  ++  D S  F  NS      Q  SP
Sbjct: 266 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 325

Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
              + E+ + + +Y++P        FP+S+         TW+ L  K        +F  Q
Sbjct: 326 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 385

Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
           +L++ + V       GA A+  +    A  + +++F++    K  D+ L+ +  A   L+
Sbjct: 386 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 437

Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
           +  A   ++    P   + L  A   + NV    +  +   I          G V AK +
Sbjct: 438 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 488

Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
             G +  L   G  ++ ++ ++PSL+
Sbjct: 489 --GTLETLKNIGCFLLEVTTKDPSLV 512


>gi|119603150|gb|EAW82744.1| hypothetical protein FLJ20718, isoform CRA_b [Homo sapiens]
          Length = 594

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)

Query: 63  ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
           E  ET A    +   ++  + + C+ED     W  ++  D S  F  NS      Q  SP
Sbjct: 266 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 325

Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
              + E+ + + +Y++P        FP+S+         TW+ L  K        +F  Q
Sbjct: 326 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 385

Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
           +L++ + V       GA A+  +    A  + +++F++    K  D+ L+ +  A   L+
Sbjct: 386 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 437

Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
           +  A   ++    P   + L  A   + NV    +  +   I          G V AK +
Sbjct: 438 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 488

Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
             G +  L   G  ++ ++ ++PSL+
Sbjct: 489 --GTLETLKNIGCFLLEVTTKDPSLV 512


>gi|332845876|ref|XP_520630.3| PREDICTED: HEAT repeat-containing protein 3 isoform 2 [Pan
           troglodytes]
 gi|410210380|gb|JAA02409.1| HEAT repeat containing 3 [Pan troglodytes]
 gi|410268164|gb|JAA22048.1| HEAT repeat containing 3 [Pan troglodytes]
 gi|410288786|gb|JAA22993.1| HEAT repeat containing 3 [Pan troglodytes]
          Length = 683

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)

Query: 63  ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
           E  ET A    +   ++  + + C+ED     W  ++  D S  F  NS      Q  SP
Sbjct: 355 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 414

Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
              + E+ + + +Y++P        FP+S+         TW+ L  K        +F  Q
Sbjct: 415 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 474

Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
           +L++ + V       GA A+  +    A  + +++F++    K  D+ L+ +  A   L+
Sbjct: 475 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 526

Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
           +  A   ++    P   + L  A   + NV    +  +   I          G V AK +
Sbjct: 527 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 577

Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
             G +  L   G  ++ ++ ++PSL+
Sbjct: 578 --GTLETLKNIGCFLLEVTTKDPSLV 601


>gi|41281996|ref|NP_891552.1| HEAT repeat-containing protein 3 [Homo sapiens]
 gi|45477280|sp|Q7Z4Q2.2|HEAT3_HUMAN RecName: Full=HEAT repeat-containing protein 3
 gi|21619962|gb|AAH33077.1| HEAT repeat containing 3 [Homo sapiens]
 gi|54311359|gb|AAH18730.2| HEAT repeat containing 3 [Homo sapiens]
 gi|119603149|gb|EAW82743.1| hypothetical protein FLJ20718, isoform CRA_a [Homo sapiens]
 gi|158260921|dbj|BAF82638.1| unnamed protein product [Homo sapiens]
          Length = 680

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)

Query: 63  ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
           E  ET A    +   ++  + + C+ED     W  ++  D S  F  NS      Q  SP
Sbjct: 352 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 411

Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
              + E+ + + +Y++P        FP+S+         TW+ L  K        +F  Q
Sbjct: 412 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 471

Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
           +L++ + V       GA A+  +    A  + +++F++    K  D+ L+ +  A   L+
Sbjct: 472 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 523

Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
           +  A   ++    P   + L  A   + NV    +  +   I          G V AK +
Sbjct: 524 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 574

Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
             G +  L   G  ++ ++ ++PSL+
Sbjct: 575 --GTLETLKNIGCFLLEVTTKDPSLV 598


>gi|15680144|gb|AAH14415.1| HEATR3 protein [Homo sapiens]
          Length = 726

 Score = 39.3 bits (90), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)

Query: 63  ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
           E  ET A    +   ++  + + C+ED     W  ++  D S  F  NS      Q  SP
Sbjct: 398 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 457

Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
              + E+ + + +Y++P        FP+S+         TW+ L  K        +F  Q
Sbjct: 458 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 517

Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
           +L++ + V       GA A+  +    A  + +++F++    K  D+ L+ +  A   L+
Sbjct: 518 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 569

Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
           +  A   ++    P   + L  A   + NV    +  +   I          G V AK +
Sbjct: 570 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 620

Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
             G +  L   G  ++ ++ ++PSL+
Sbjct: 621 --GTLETLKNIGCFLLEVTTKDPSLV 644


>gi|397498211|ref|XP_003819878.1| PREDICTED: HEAT repeat-containing protein 3 [Pan paniscus]
          Length = 766

 Score = 38.9 bits (89), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)

Query: 63  ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
           E  ET A    +   ++  + + C+ED     W  ++  D S  F  NS      Q  SP
Sbjct: 438 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 497

Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
              + E+ + + +Y++P        FP+S+         TW+ L  K        +F  Q
Sbjct: 498 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 557

Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
           +L++ + V       GA A+  +    A  + +++F++    K  D+ L+ +  A   L+
Sbjct: 558 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 609

Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
           +  A   ++    P   + L  A   + NV    +  +   I          G V AK +
Sbjct: 610 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 660

Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
             G +  L   G  ++ ++ ++PSL+
Sbjct: 661 --GTLETLKNIGCFLLEVTTKDPSLV 684


>gi|71895555|ref|NP_001025740.1| HEAT repeat-containing protein 3 [Gallus gallus]
 gi|53134845|emb|CAG32369.1| hypothetical protein RCJMB04_23o18 [Gallus gallus]
          Length = 675

 Score = 38.5 bits (88), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 61/269 (22%), Positives = 109/269 (40%), Gaps = 32/269 (11%)

Query: 14  NSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGA-AATLLQS---ESDETTA 69
           N    AA +  +++ + V +   +S +        G G G    + LL S   E  E TA
Sbjct: 295 NRLKPAAGAETEESEIDVTDNCALSEDDGMEEVSKGKGSGEHDISDLLPSDKWELKEVTA 354

Query: 70  FRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAIS--LQSPQAPAEEL 127
             + +   ++  + + CSED     W  ++  D S  F  NS    S  L SP   + E+
Sbjct: 355 LLMAQQTALEIIVNMCCSEDPSDDEWEELSSSDESDLFMENSYSEGSGLLMSPLCLSAEV 414

Query: 128 MSFIRSYVVPD------GFPDSV-------TPSYVPYMTWRALKHFFGGAMGVFTTQTLL 174
            S   + ++P        FP+SV        PS+ P +  + +      A+    +  L+
Sbjct: 415 HSAFLNNLIPKKILEKTAFPNSVALDICVHNPSWKPLI--KKMNTVQCRALTCLHSILLV 472

Query: 175 NSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYD-LKQLRFAGDLLMELGA 233
           + V         GA A    L+  A  + ++IF++     D + L+ +  A   L++  A
Sbjct: 473 SDVD-----CLGGASA----LQSLAQHLSQLIFSKTELPEDTEFLEAITSALRALLQTMA 523

Query: 234 GVELATAAVPHLFLPLACAANVAKNVAAV 262
              ++    P   L L C A++  + A+V
Sbjct: 524 SKNISQCMTPEQLLAL-CEASIHSSNASV 551


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.132    0.384 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,471,718,190
Number of Sequences: 23463169
Number of extensions: 310087588
Number of successful extensions: 820440
Number of sequences better than 100.0: 506
Number of HSP's better than 100.0 without gapping: 425
Number of HSP's successfully gapped in prelim test: 81
Number of HSP's that attempted gapping in prelim test: 819024
Number of HSP's gapped (non-prelim): 643
length of query: 495
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 348
effective length of database: 8,910,109,524
effective search space: 3100718114352
effective search space used: 3100718114352
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)