BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 048416
(495 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255569810|ref|XP_002525869.1| conserved hypothetical protein [Ricinus communis]
gi|223534874|gb|EEF36563.1| conserved hypothetical protein [Ricinus communis]
Length = 520
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/492 (78%), Positives = 435/492 (88%), Gaps = 2/492 (0%)
Query: 5 PPIKLKQSANSTAAAASS--AAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQS 62
PPIKLKQS N TA +S ++QDARLLVRET+RISANLA++ + + S
Sbjct: 3 PPIKLKQSPNPTAQTLTSTTSSQDARLLVRETLRISANLASAPPAESLALLDNNISNDSS 62
Query: 63 ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQA 122
+++ VE++FVDSSLRLIC E+IDGR+W YVAE D G+FK+NS R++SLQ+PQA
Sbjct: 63 NKNKSRKLGFVEEEFVDSSLRLICCEEIDGRKWKYVAENDGFGRFKKNSFRSVSLQNPQA 122
Query: 123 PAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
P EELMSFIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+SVGVS++
Sbjct: 123 PVEELMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTKTLLSSVGVSKS 182
Query: 183 RATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
A PGAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV
Sbjct: 183 TAIPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN+ADLLGTGL
Sbjct: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVADLLGTGL 302
Query: 303 SIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
SIMISKRNPSL+TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG VPSL+
Sbjct: 303 SIMISKRNPSLVTTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGRVPSLQ 362
Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
EGN +ENIFSFPWL+++P+V+GSRFKDAFQDP A+LA+EP FEKERY+VTYNP+K KVYA
Sbjct: 363 EGNTRENIFSFPWLKERPIVIGSRFKDAFQDPSAFLAIEPLFEKERYIVTYNPTKDKVYA 422
Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGES 482
LLKDQAK+DDILKAAFHA+VLLH INSS+ RS +LKQ ++D+SN +L+ LEAHI ES
Sbjct: 423 LLKDQAKADDILKAAFHAHVLLHFINSSSKSRSSSLKQQQHDHSNYILSANYLEAHIAES 482
Query: 483 CKVVSTALWGFQ 494
CK+VS+ F+
Sbjct: 483 CKMVSSLYGSFK 494
>gi|359486217|ref|XP_003633414.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
gi|297739449|emb|CBI29631.3| unnamed protein product [Vitis vinifera]
Length = 504
Score = 759 bits (1961), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/491 (77%), Positives = 427/491 (86%), Gaps = 15/491 (3%)
Query: 5 PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
PP+KLKQS + A S QDARLL RET+RISA+LA+S +P V + S
Sbjct: 2 PPMKLKQSPSPQTLATS---QDARLLARETLRISASLASSPAPSAV-----------ASS 47
Query: 65 DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDR-SGKFKRNSIRAISLQSPQAP 123
E +V++QFVD+SLRLIC E+I+GRRWNYV + + +G+ + S+RA+ Q+PQAP
Sbjct: 48 PEARNLGIVDEQFVDASLRLICCEEINGRRWNYVVDNEGGNGQGRNGSLRAVCSQTPQAP 107
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
+ELMSFIRSYVVP+GFPDSVTPSYVPYM+WRALKHFFGGAMGVFTTQTLLNSVGVSRNR
Sbjct: 108 VDELMSFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 167
Query: 184 ATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ PGAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLR AGDLLMELGAGVELATAAVP
Sbjct: 168 SAPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVP 227
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS
Sbjct: 228 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 287
Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
IMISKRNPSL+TTFALLSCGYV SSYQEVKSVVLHTLNRARFSVA++SFLKTG VPSL+E
Sbjct: 288 IMISKRNPSLVTTFALLSCGYVLSSYQEVKSVVLHTLNRARFSVAIDSFLKTGRVPSLQE 347
Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
GN++ENIFSFPWL DKP+VLGSRFKDAFQD +YL++EP FEKERY+VTYNPSKGKVYAL
Sbjct: 348 GNMKENIFSFPWLEDKPIVLGSRFKDAFQDAGSYLSIEPLFEKERYLVTYNPSKGKVYAL 407
Query: 424 LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESC 483
LKDQAKSDDILKAAFHA++LLH I SSNG S KQ +YD+++LM +TADL AHI ESC
Sbjct: 408 LKDQAKSDDILKAAFHAHMLLHFIVSSNGSESFPTKQQDYDSTHLMPSTADLAAHIAESC 467
Query: 484 KVVSTALWGFQ 494
K+VST+ F+
Sbjct: 468 KMVSTSYGLFK 478
>gi|449431896|ref|XP_004133736.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
gi|449478104|ref|XP_004155224.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 507
Score = 732 bits (1890), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/489 (72%), Positives = 417/489 (85%), Gaps = 9/489 (1%)
Query: 6 PIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSESD 65
PI LKQS NS A + +S+ + AR+LVRET+R++ANLA S P+ + +
Sbjct: 3 PINLKQSPNSAAKSVASSTE-ARILVRETLRVTANLA--SPPLD-----SLPPTTSPAAP 54
Query: 66 ETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAE 125
T ++++ F+DS+ RLIC E+IDGRRWNYVA+ SGK K SIRA+ LQ+PQAP +
Sbjct: 55 HATNLPILDNHFLDSTSRLICCEEIDGRRWNYVADIQPSGKLKNASIRALCLQTPQAPID 114
Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
E+MSFIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGV+RN+AT
Sbjct: 115 EMMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVARNKAT 174
Query: 186 PGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
PGA+AINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAA PHL
Sbjct: 175 PGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAAPHL 234
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
FLPLACAANVAKNV+AVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM
Sbjct: 235 FLPLACAANVAKNVSAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 294
Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
ISKRNPSL+TTF LLSCGY+FSSYQEV+SVVLHTLNRARF+VAVE+FLKTG VPSL++GN
Sbjct: 295 ISKRNPSLVTTFGLLSCGYIFSSYQEVRSVVLHTLNRARFNVAVETFLKTGRVPSLQKGN 354
Query: 366 LQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLK 425
+ E I SFPWL++ P+VLG RFKDAFQD ++LA+EP F++E+Y+VTYN +KGKVYALLK
Sbjct: 355 MNERILSFPWLKENPIVLGPRFKDAFQDAGSFLAIEPLFDREKYLVTYNQTKGKVYALLK 414
Query: 426 DQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESCKV 485
DQAKSDDI+KAAFHA+VLLH I SS+G ++ KQ + +S++ TT +LEA I SCK+
Sbjct: 415 DQAKSDDIIKAAFHAHVLLHFIRSSSGAQNSNQKQVDAFSSSVP-TTTNLEAEIAASCKM 473
Query: 486 VSTALWGFQ 494
VS + F+
Sbjct: 474 VSNSYEIFK 482
>gi|357474239|ref|XP_003607404.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
gi|355508459|gb|AES89601.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
Length = 490
Score = 694 bits (1792), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/491 (71%), Positives = 403/491 (82%), Gaps = 33/491 (6%)
Query: 6 PIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSESD 65
P+ +K+ T +S ++QD +L+RET+RISA LA+S+ P+ D
Sbjct: 6 PLNVKKQ---TPNISSPSSQD--ILIRETLRISAQLASSTPPL---------------LD 45
Query: 66 ETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAE 125
TT S+LR IC E+IDGRRWNYVAE + SG+FK NS R++SLQ+P+ P +
Sbjct: 46 TTTP----------STLRFICCEEIDGRRWNYVAETNASGQFKNNSFRSLSLQTPKPPLD 95
Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
E+MSFI SYVVP+GFP +VTPSYVPYMTWRALKHFFGGAMGVFTTQTLL+SVGVSRNRAT
Sbjct: 96 EMMSFITSYVVPEGFPHTVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRAT 155
Query: 186 PGAVA--INWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
P A A INWILKDGAGRVGKM+FARQGKKFDYDLKQLRF GDLLMELGAGVELATAA+P
Sbjct: 156 PAAAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDLLMELGAGVELATAAMP 215
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
HLFLPLACAANV KNV AVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTG S
Sbjct: 216 HLFLPLACAANVLKNVGAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGFS 275
Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
IMI+K NPSLITTF+LLSCGY+ SSY+EVKSVVLHTLN ARFSVAV+SFLKTG VP+LRE
Sbjct: 276 IMIAKSNPSLITTFSLLSCGYILSSYKEVKSVVLHTLNSARFSVAVDSFLKTGQVPTLRE 335
Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
GNL E+IFSFPW +D+PVVLGSR KDAFQ P AY+A+EP F+KERY+VTYNPSK KV+A+
Sbjct: 336 GNLNEDIFSFPW-KDRPVVLGSRIKDAFQSPSAYVAIEPLFDKERYIVTYNPSKSKVHAV 394
Query: 424 LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESC 483
LKDQAKSDDILKAAFHA+VL + I S N + + KQ + NS++MLT ADLEA I +SC
Sbjct: 395 LKDQAKSDDILKAAFHAHVLSNFIKSLNESKGSSWKQGDDLNSSVMLTAADLEACIADSC 454
Query: 484 KVVSTALWGFQ 494
KVV+ A W F+
Sbjct: 455 KVVTNAYWLFK 465
>gi|356563749|ref|XP_003550122.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 497
Score = 677 bits (1746), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/496 (69%), Positives = 400/496 (80%), Gaps = 38/496 (7%)
Query: 4 PPPIKL--KQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQ 61
P P+K+ KQSANST ++ + +LVRETMRISANLA++ LL+
Sbjct: 3 PNPMKMNMKQSANST----TTTIANKEILVRETMRISANLASTP-------------LLE 45
Query: 62 SESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISL--QS 119
+ + + +IC E++DGRRW YVAE D G FK+NS R + L +
Sbjct: 46 APPPPSPI------------VGIICCEELDGRRWKYVAESDARGGFKKNSFRPVRLNFHN 93
Query: 120 PQ-APAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
P+ P E+++F+ SYVVP+GFPDSV PSYVPYMTWRALKHFFGGAMGVFTTQTLL+SVG
Sbjct: 94 PRDHPLHEVLAFVTSYVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVG 153
Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
VSRNRA PGAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELA
Sbjct: 154 VSRNRAAPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 213
Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
TAAVPHLFLPLACAANV KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL
Sbjct: 214 TAAVPHLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 273
Query: 299 GTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
GTGLSI+I+KRNPSL+TTF+LLSCGY+ SSY+EVKSVVLHTLN RFSVAVESFL+TG V
Sbjct: 274 GTGLSILIAKRNPSLVTTFSLLSCGYILSSYREVKSVVLHTLNCGRFSVAVESFLRTGQV 333
Query: 359 PSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKG 418
P+L+EGN+ ENIFSFPW +D+PVVLGSR K+AFQDP AY A+EP F++ERY+VTYNPSK
Sbjct: 334 PTLQEGNMNENIFSFPW-KDRPVVLGSRIKEAFQDPSAYFAIEPLFDRERYIVTYNPSKH 392
Query: 419 KVYALLKDQAKSDDILKAAFHAYVLL-HCINSSNGHRSLALKQHEYDNSNLMLTTADLEA 477
KVYA+LKDQAKSDDILKAAFHA+VLL + I S N + +LKQ E D SN+ T AD+EA
Sbjct: 393 KVYAVLKDQAKSDDILKAAFHAHVLLFNLIKSWNESNASSLKQRE-DLSNMTHTVADIEA 451
Query: 478 HIGESCKVVSTALWGF 493
+ +CK V+ + +GF
Sbjct: 452 CMAGTCKTVADS-YGF 466
>gi|18423070|ref|NP_568713.1| uncharacterized protein [Arabidopsis thaliana]
gi|16604643|gb|AAL24114.1| unknown protein [Arabidopsis thaliana]
gi|23296703|gb|AAN13151.1| unknown protein [Arabidopsis thaliana]
gi|332008476|gb|AED95859.1| uncharacterized protein [Arabidopsis thaliana]
Length = 497
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/490 (69%), Positives = 396/490 (80%), Gaps = 23/490 (4%)
Query: 5 PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
P +KL + A+ S RLL RET+RISA+LA S P+ +
Sbjct: 2 PSVKLTHHSPPDTIASDSV----RLLSRETLRISASLA--SPPV--------------DD 41
Query: 65 DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
+ + QF+ S+LRLIC E+IDGRR+ YVAE D SG+FK+NS+RAISL+SPQ P
Sbjct: 42 LPPHSPPPPDSQFLHSTLRLICCEEIDGRRFKYVAESDGSGRFKKNSVRAISLESPQTPF 101
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+E+ SF+RSYVVP+GFP SV SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG SRN +
Sbjct: 102 DEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSS 161
Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
AVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH
Sbjct: 162 ASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 221
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
LFLPLACAANV KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL+GTG SI
Sbjct: 222 LFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI 281
Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
+ISKRNPSL+TTF LLSCGY+ SSYQEV+SVVLHTLNRARF+VAVESFLKTG VPSL+EG
Sbjct: 282 LISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQEG 341
Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
N+QE IF+FPW+ D+PV+LG+RFKDAFQDP Y+AV+PFF+KERY+VTY+P+KGKVYALL
Sbjct: 342 NIQEKIFTFPWVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALL 401
Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESCK 484
K QA SDDILKAAFHA+VLLH +N S +++Q + T +LE+ I ESC+
Sbjct: 402 KHQANSDDILKAAFHAHVLLHFMNQSKDGNPRSVEQL---DPAFAPTEYELESRIAESCE 458
Query: 485 VVSTALWGFQ 494
+VST+ F+
Sbjct: 459 MVSTSYGVFK 468
>gi|356522111|ref|XP_003529693.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 488
Score = 672 bits (1734), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/492 (68%), Positives = 388/492 (78%), Gaps = 37/492 (7%)
Query: 6 PIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSESD 65
P +KQSANST ++ + +LVRETMRISANLA+
Sbjct: 3 PNPMKQSANST----TTTIANKEILVRETMRISANLASPPQ------------------- 39
Query: 66 ETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQS--PQAP 123
+ +IC E++DGRRW Y+AE D G FK+NS +SL S P+ P
Sbjct: 40 ---------PPPPSPIVGIICCEEMDGRRWKYLAESDGFGGFKKNSFLPVSLNSNHPRDP 90
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
E++SF+ SYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLL+SVGV RNR
Sbjct: 91 LHEVLSFVTSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVCRNR 150
Query: 184 ATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
A PGAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF GDLLMELGAGVELATAAVP
Sbjct: 151 AAPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDLLMELGAGVELATAAVP 210
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
HLFLPLACAANV KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS
Sbjct: 211 HLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 270
Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
I+I+KRNPSL+TTF+LLSCGY+ SSY+EVKSVVLHTLN RFSVAVE FL TG VP+L+E
Sbjct: 271 ILIAKRNPSLVTTFSLLSCGYILSSYREVKSVVLHTLNCGRFSVAVEHFLMTGQVPTLQE 330
Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
GN+ ENIFSFPW +D+PVVLGSR K+AFQDP AY+A+EP F++ERY+VTYNPSK KVYA+
Sbjct: 331 GNMNENIFSFPW-KDRPVVLGSRIKEAFQDPSAYVAIEPLFDRERYIVTYNPSKHKVYAV 389
Query: 424 LKDQAKSDDILKAAFHAYVLLHCINSS-NGHRSLALKQHEYDNSNLMLTTADLEAHIGES 482
LKDQAKSDDILKAAFHA+VL + S N +++ +LKQ E D SN+ T AD+EA I +
Sbjct: 390 LKDQAKSDDILKAAFHAHVLFFSLMKSLNENKASSLKQRE-DLSNMTHTVADIEARIAGT 448
Query: 483 CKVVSTALWGFQ 494
CK V+ + F+
Sbjct: 449 CKTVADSYGCFK 460
>gi|147768659|emb|CAN60615.1| hypothetical protein VITISV_003254 [Vitis vinifera]
Length = 470
Score = 672 bits (1734), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/491 (70%), Positives = 393/491 (80%), Gaps = 49/491 (9%)
Query: 5 PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
PP+KLKQS + A S QDARLL RET+RISA+LA+S +P V + S
Sbjct: 2 PPMKLKQSPSPQTLATS---QDARLLARETLRISASLASSPAPSAV-----------ASS 47
Query: 65 DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDR-SGKFKRNSIRAISLQSPQAP 123
E +V++QFVD+SLRLIC E+I+GRRWNYV + + +G+ + S+RA+ Q+PQAP
Sbjct: 48 PEARNLGIVDEQFVDASLRLICCEEINGRRWNYVVDNEGGNGQGRNGSLRAVCSQTPQAP 107
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
+ELMSFIRSYVVP+GFPDSVTPSYVPYM+WRALKHFFGGAMGVFTTQTLLNSVGVSRNR
Sbjct: 108 VDELMSFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 167
Query: 184 ATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ PGAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLR AGDLLMELGAGVELATAAVP
Sbjct: 168 SAPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVP 227
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL
Sbjct: 228 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL------ 281
Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
VKSVVLHTLNRAR+SVA++ FLKTG VPSL+E
Sbjct: 282 ----------------------------VKSVVLHTLNRARYSVAIDCFLKTGRVPSLQE 313
Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
GN++ENIFSFPWL DKP+VLGSRFKDAFQD +YL++EP FEKERY+VTYNPSKGKVYAL
Sbjct: 314 GNMKENIFSFPWLEDKPIVLGSRFKDAFQDAGSYLSIEPLFEKERYLVTYNPSKGKVYAL 373
Query: 424 LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESC 483
LKDQAKSDDILKAAFHA++LLH I SSNG S KQ +YD+++LM +TADL AHI ESC
Sbjct: 374 LKDQAKSDDILKAAFHAHMLLHFIVSSNGSESFPTKQQDYDSTHLMPSTADLAAHIAESC 433
Query: 484 KVVSTALWGFQ 494
K+VST+ F+
Sbjct: 434 KMVSTSYGLFK 444
>gi|297795749|ref|XP_002865759.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
gi|297311594|gb|EFH42018.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 671 bits (1732), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/490 (70%), Positives = 398/490 (81%), Gaps = 20/490 (4%)
Query: 5 PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
P +KLK + ++SS+ RLL RET+RISA+LA S P+ +
Sbjct: 2 PSVKLKHHSPPDTISSSSSDS-VRLLSRETLRISASLA--SPPV--------------DD 44
Query: 65 DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
+ + QF+ S+LRLIC E+IDGRR+ YVAE D SG+FK+NS+RAISL+SPQ P
Sbjct: 45 LPPHSPPSPDSQFLHSTLRLICCEEIDGRRFKYVAESDGSGRFKKNSVRAISLESPQTPF 104
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+E+ SF+RSYVVP+GFP SV SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG SRN +
Sbjct: 105 DEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSS 164
Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
AVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH
Sbjct: 165 ASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 224
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
LFLPLACAANV KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL+GTG SI
Sbjct: 225 LFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI 284
Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
+ISKRNPSL+TTF LLSCGY+ SSYQEV+SVVLHTLNRARF+VAVESFLKTG VPSL+EG
Sbjct: 285 LISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQEG 344
Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
N+QE IF+FPW+ D+PV+LG+RFKDAFQDP Y+AV+PFF+KERY+VTY+P+KGKVYALL
Sbjct: 345 NIQEKIFTFPWVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALL 404
Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESCK 484
K QA SDDILKAAFHA+VLLH +N S ++Q + T +LE+ I ESC+
Sbjct: 405 KHQANSDDILKAAFHAHVLLHFMNQSKDGTPRLVEQL---DPAFAPTEYELESRIAESCE 461
Query: 485 VVSTALWGFQ 494
+VST+ F+
Sbjct: 462 MVSTSYAIFK 471
>gi|414879331|tpg|DAA56462.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
gi|414879332|tpg|DAA56463.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
Length = 503
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 315/500 (63%), Positives = 381/500 (76%), Gaps = 35/500 (7%)
Query: 5 PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
P + LK+SA T + D RL VR+ MR + + +P A+T Q+ +
Sbjct: 3 PAVGLKRSATQTI---TLPPPDTRLAVRDVMRSTIPSQPAVAP--------ASTERQAPA 51
Query: 65 DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAE------RDRSGKFKRNS-IRAISL 117
F +C E++DGRRW+YV + R R G + +RA+ L
Sbjct: 52 AALQGF--------------LCLEEVDGRRWSYVVDGGAAKGRSRGGAVPAGAAVRAVPL 97
Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
QSP PAEE+M+FIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+SV
Sbjct: 98 QSPLPPAEEIMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSV 157
Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
GVS+++ TPGA+AINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+ DLL+E+GAG+EL
Sbjct: 158 GVSQSKVTPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIEL 217
Query: 238 ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL 297
TAA P FLP+AC ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIADL
Sbjct: 218 TTAAFPQFFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADL 277
Query: 298 LGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
LGTGLSI ISKRNPSL+T+FA LSCGY+ SSYQEV+SVVL+TLNRARF+VAV+SF+KTGH
Sbjct: 278 LGTGLSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGH 337
Query: 358 VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSK 417
+PSL+EGN QE IF+ PW R +PV +GSRF +AFQ+P ++LA+ P FE ERY+VTYNP+K
Sbjct: 338 IPSLKEGNSQETIFNPPW-RHEPVAIGSRFGEAFQEPASFLAIRPLFEDERYMVTYNPTK 396
Query: 418 GKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHR-SLALKQHEYDNSNLMLT-TADL 475
KVYALLKDQAKSDDI+KAAFHA+VLLH IN+S+ R KQ D S + + D
Sbjct: 397 DKVYALLKDQAKSDDIIKAAFHAHVLLHFINASHARRLKQKQKQANPDRSEYLYSRNMDF 456
Query: 476 EAHIGESCKVVSTALWGFQE 495
AHI ESCK+VS++ F++
Sbjct: 457 LAHIAESCKIVSSSYGTFKK 476
>gi|224129512|ref|XP_002328735.1| predicted protein [Populus trichocarpa]
gi|222839033|gb|EEE77384.1| predicted protein [Populus trichocarpa]
Length = 398
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 309/364 (84%), Positives = 338/364 (92%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+ELM FIRSYVVP+GFPDSV PSYVPYMTWRALKHFFGGAMGVFTT+TLLNSVGVS+++A
Sbjct: 3 QELMGFIRSYVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTKTLLNSVGVSKSQA 62
Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
PGAVAINWILKDGAGRVGKM++ARQGKKFDYDLKQLRFAGDLLMELGA VELATAAVPH
Sbjct: 63 IPGAVAINWILKDGAGRVGKMLYARQGKKFDYDLKQLRFAGDLLMELGAAVELATAAVPH 122
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN+ADLLGTGLSI
Sbjct: 123 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVADLLGTGLSI 182
Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
MISKRNPSL+TTFALLSCGYV SSYQEVKSVVLHTLN ARFSVAVESFLKTG VPSL EG
Sbjct: 183 MISKRNPSLVTTFALLSCGYVLSSYQEVKSVVLHTLNGARFSVAVESFLKTGQVPSLHEG 242
Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
N++ENIF+FPWL+D+ + LG RFKDAFQDP AYLA+EP FE+ERY+VTYNPSK KVYALL
Sbjct: 243 NVKENIFNFPWLKDRSISLGPRFKDAFQDPRAYLAIEPLFERERYIVTYNPSKDKVYALL 302
Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESCK 484
KDQAK DDI+KAAFHA+VL+H I+SSN + S KQ EY +SN +L+ ADLE+HI ESCK
Sbjct: 303 KDQAKPDDIVKAAFHAHVLMHFIHSSNNNCSSPSKQQEYGHSNFILSAADLESHIAESCK 362
Query: 485 VVST 488
+VST
Sbjct: 363 MVST 366
>gi|226502346|ref|NP_001144348.1| uncharacterized protein LOC100277253 [Zea mays]
gi|195640526|gb|ACG39731.1| hypothetical protein [Zea mays]
Length = 503
Score = 623 bits (1607), Expect = e-176, Method: Compositional matrix adjust.
Identities = 313/500 (62%), Positives = 379/500 (75%), Gaps = 35/500 (7%)
Query: 5 PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
P + LK+SA T + D RL VR+ MR + + +P A+T Q+ +
Sbjct: 3 PAVGLKRSATQTI---TLPPPDTRLAVRDVMRSTIPSQPAVAP--------ASTERQAPA 51
Query: 65 DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAE------RDRSGKFKRNS-IRAISL 117
F +C ++DG RW+YV + R R G + +RA+ L
Sbjct: 52 AALQGF--------------LCLXEVDGXRWSYVVDGGAAKGRSRGGAVPAGAAVRAVPL 97
Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
QSP PAEE+M+FIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+SV
Sbjct: 98 QSPLPPAEEIMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSV 157
Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
GVS+++ TPGA+AINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+ DLL+E+GAG+EL
Sbjct: 158 GVSQSKVTPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIEL 217
Query: 238 ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL 297
TAA P FLP+AC ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIADL
Sbjct: 218 TTAAFPQFFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADL 277
Query: 298 LGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
LGTGLSI ISKRNPSL+T+FA LSCGY+ SSYQEV+SVVL+TLNRARF+VAV+SF+KTGH
Sbjct: 278 LGTGLSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGH 337
Query: 358 VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSK 417
+PSL+EGN QE IF+ PW R +PV +GSRF +AFQ+P ++LA+ P FE ERY+VTYNP+K
Sbjct: 338 IPSLKEGNSQETIFNPPW-RHEPVAIGSRFGEAFQEPASFLAIRPLFEDERYMVTYNPTK 396
Query: 418 GKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHR-SLALKQHEYDNSNLMLT-TADL 475
KVYALLKDQAKSDDI+KAAFHA+VLLH IN+S+ R KQ D S + + D
Sbjct: 397 DKVYALLKDQAKSDDIIKAAFHAHVLLHFINASHARRLKQKQKQANPDRSEYLYSRNMDF 456
Query: 476 EAHIGESCKVVSTALWGFQE 495
AHI ESCK+VS++ F++
Sbjct: 457 LAHIAESCKIVSSSYGTFKK 476
>gi|357133637|ref|XP_003568430.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Brachypodium
distachyon]
Length = 521
Score = 616 bits (1588), Expect = e-174, Method: Compositional matrix adjust.
Identities = 314/493 (63%), Positives = 370/493 (75%), Gaps = 50/493 (10%)
Query: 26 DARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSESDETTAFRLVEDQFVDSSLRLI 85
DAR+ VRE +R++ A V A A + ++
Sbjct: 27 DARVAVREAVRVAVREAEQPPTARVSAVQAPAAAVDG---------------------VL 65
Query: 86 CSEDIDGRRWNYVAERDRS--GKFKRNS--------------IRAISLQSPQAPAEELMS 129
C E++DGRRW+YV E S GK R S RA+ LQSP P EE+MS
Sbjct: 66 CLEEVDGRRWSYVVEGAASSAGKPGRVSARGRGGSAAPVGATFRAVPLQSPLPPVEEIMS 125
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
FIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAM VFTT+ LLNSVGVS++RAT GAV
Sbjct: 126 FIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRATSGAV 185
Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
AINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA PHLFLPL
Sbjct: 186 AINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHLFLPL 245
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR 309
AC ANV KNVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTG+SI+ISKR
Sbjct: 246 ACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISKR 305
Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
NPSL+ +FA LSCGY+ SSY+EV+SVVL+TLN ARF+VAV+SF+KTGHVPSL+EGNL+E
Sbjct: 306 NPSLVASFAFLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEGNLEET 365
Query: 370 IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAK 429
IF+ PW R +PV +GSRF +AFQ+P ++++ P FE ERY+VTYNP+K KVYALLKDQAK
Sbjct: 366 IFNPPW-RHQPVAIGSRFGEAFQEPASFVSTSPLFEDERYIVTYNPTKDKVYALLKDQAK 424
Query: 430 SDDILKAAFHAYVLLHCINSSNGHRSLALKQH--------EYDNSNLMLTTADLEAHIGE 481
DDILKAAFHA+VLLH IN+S H +L ++H +Y N N + D HI E
Sbjct: 425 PDDILKAAFHAHVLLHFINAS--HANLNARKHMNSNRGSYQYVNPNPL--NMDFLPHIEE 480
Query: 482 SCKVVSTALWGFQ 494
SCK+V+++ F+
Sbjct: 481 SCKIVTSSYGIFK 493
>gi|222619643|gb|EEE55775.1| hypothetical protein OsJ_04341 [Oryza sativa Japonica Group]
Length = 508
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 293/425 (68%), Positives = 354/425 (83%), Gaps = 18/425 (4%)
Query: 84 LICSEDIDGRRWNYVAERDRSGKFKRN------------SIRAISLQSPQAPAEELMSFI 131
+C E++DGRRW+YV + + R S+RA+ LQSP PAEE+M+FI
Sbjct: 63 FLCLEEVDGRRWSYVVDGGQGKGKGRGRGRSGAAVPMGASVRAVPLQSPLPPAEEVMAFI 122
Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
RSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+SVGVS+++ TPGA+AI
Sbjct: 123 RSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPGAIAI 182
Query: 192 NWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 251
NWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+ DLL+E+GAG+ELATAA P FLP+AC
Sbjct: 183 NWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQFFLPMAC 242
Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNP 311
ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIADLLGTGLSI ISKRNP
Sbjct: 243 VANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNP 302
Query: 312 SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
SL+T+FA LSCGY++SSY EV+SVVL+TLNRARF+VAV+SF+KTGH+PSL+EGN QE IF
Sbjct: 303 SLVTSFAFLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 362
Query: 372 SFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSD 431
+ PW R +PV +GSRF +AFQ+P +++A+ P FE ERY+VTYNP+K KVYALLKDQAKSD
Sbjct: 363 NPPW-RHEPVAIGSRFGEAFQEPASFVAIRPLFEDERYMVTYNPAKDKVYALLKDQAKSD 421
Query: 432 DILKAAFHAYVLLHCINSSNGHRSLALKQHE-YDNSNLMLTTADLEAHIGESCKVVSTAL 490
DI+KAAFHA+VLLH IN+S+ + + ++ + Y N M D AHI ESCK+VS++
Sbjct: 422 DIIKAAFHAHVLLHFINASHARKQMNSRRSDPYGNPCNM----DFMAHIAESCKIVSSSY 477
Query: 491 WGFQE 495
F++
Sbjct: 478 GTFKK 482
>gi|56784316|dbj|BAD82242.1| unknown protein [Oryza sativa Japonica Group]
gi|56785239|dbj|BAD82127.1| unknown protein [Oryza sativa Japonica Group]
Length = 508
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 293/425 (68%), Positives = 354/425 (83%), Gaps = 18/425 (4%)
Query: 84 LICSEDIDGRRWNYVAERDRSGKFKRN------------SIRAISLQSPQAPAEELMSFI 131
+C E++DGRRW+YV + + R S+RA+ LQSP PAEE+M+FI
Sbjct: 63 FLCLEEVDGRRWSYVVDGGQGKGKGRGRGRSGAAVPMGASVRAVPLQSPLPPAEEVMAFI 122
Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
RSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+SVGVS+++ TPGA+AI
Sbjct: 123 RSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKVTPGAIAI 182
Query: 192 NWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 251
NWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+ DLL+E+GAG+ELATAA P FLP+AC
Sbjct: 183 NWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQFFLPMAC 242
Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNP 311
ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIADLLGTGLSI ISKRNP
Sbjct: 243 VANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIADLLGTGLSIFISKRNP 302
Query: 312 SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
SL+T+FA LSCGY++SSY EV+SVVL+TLNRARF+VAV+SF+KTGH+PSL+EGN QE IF
Sbjct: 303 SLVTSFAFLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPSLKEGNSQETIF 362
Query: 372 SFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSD 431
+ PW R +PV +GSRF +AFQ+P +++A+ P FE ERY+VTYNP+K KVYALLKDQAKSD
Sbjct: 363 NPPW-RHEPVAIGSRFGEAFQEPASFVAIRPLFEDERYMVTYNPAKDKVYALLKDQAKSD 421
Query: 432 DILKAAFHAYVLLHCINSSNGHRSLALKQHE-YDNSNLMLTTADLEAHIGESCKVVSTAL 490
DI+KAAFHA+VLLH IN+S+ + + ++ + Y N M D AHI ESCK+VS++
Sbjct: 422 DIIKAAFHAHVLLHFINASHARKQMNSRRSDPYGNPCNM----DFMAHIAESCKIVSSSY 477
Query: 491 WGFQE 495
F++
Sbjct: 478 GTFKK 482
>gi|326490149|dbj|BAJ94148.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 519
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 299/432 (69%), Positives = 352/432 (81%), Gaps = 21/432 (4%)
Query: 84 LICSEDIDGRRWNYVAERDRS--GKFKRNS--------------IRAISLQSPQAPAEEL 127
++C E+++GRRW+YV E S GK R S RA+ LQSP PAEE+
Sbjct: 62 VLCFEEVEGRRWSYVVEGAASSAGKTGRVSARARGGSATPVGATFRAVPLQSPLPPAEEI 121
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
MSFIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAM VFTT+ LLNSVGVS++RAT G
Sbjct: 122 MSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRATSG 181
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
AVAINWIL+DGAGRVGKM+FARQGKKFDYDLKQLRF+GDL+MELGAG+ELATAA P LFL
Sbjct: 182 AVAINWILRDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLMMELGAGIELATAAFPQLFL 241
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
PLAC ANV KNVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTG+SI++S
Sbjct: 242 PLACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILMS 301
Query: 308 KRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
KRNPSL+ +FA+LSCGY+ SSYQEV+SVVL+TLN ARF+VAV+SF+KTGHVPSL+EGNL+
Sbjct: 302 KRNPSLVASFAVLSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEGNLE 361
Query: 368 ENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQ 427
E IF+ PW R +PV +GSRF +AFQ+P +++A P FE ERY+VTYNP+K KVYALLKDQ
Sbjct: 362 ETIFNPPW-RHQPVAIGSRFGEAFQEPASFVATRPLFEDERYIVTYNPAKDKVYALLKDQ 420
Query: 428 AKSDDILKAAFHAYVLLHCINSSNGH----RSLALKQHEYDNSNLMLTTADLEAHIGESC 483
AK DDILKAAFHA+VLLH IN+S+ + + + Q Y N D AHI ESC
Sbjct: 421 AKQDDILKAAFHAHVLLHFINASHANLKARKRMNSDQGSYQYVNPNPLNMDFLAHIEESC 480
Query: 484 KVVSTALWGFQE 495
K+V+++ F+
Sbjct: 481 KIVTSSYGVFKR 492
>gi|357126248|ref|XP_003564800.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 506
Score = 611 bits (1576), Expect = e-172, Method: Compositional matrix adjust.
Identities = 305/500 (61%), Positives = 371/500 (74%), Gaps = 32/500 (6%)
Query: 5 PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
P + LK+S + A QDAR VRE +R + + G A A L+
Sbjct: 3 PTVGLKRSGTTQTIAVPPPPQDARFAVREAVRSTIATPPPAEAPGAPARVAPAPALEG-- 60
Query: 65 DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRN---------SIRAI 115
+C E++DGRRW YV + S+RA+
Sbjct: 61 -------------------FLCLEEVDGRRWKYVVDEAPGKGKGGRGGSAVPLGASVRAV 101
Query: 116 SLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLN 175
LQSP PAEE+M+F+RSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLL+
Sbjct: 102 PLQSPLPPAEEIMAFVRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLS 161
Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGV 235
SVGVS+++ TPGA+AINWILKDGAGRVGKM+FARQGKKFD DLKQLRF+ DLL+E+GAG+
Sbjct: 162 SVGVSQSKVTPGAIAINWILKDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGI 221
Query: 236 ELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 295
ELATAA P FLP+AC ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIA
Sbjct: 222 ELATAAFPQFFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIA 281
Query: 296 DLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
DLLGTGLSI I+KRNPSL+T+FALLSCGY+ SSYQEV+SVVL+TLNRARF+VAV+SF+KT
Sbjct: 282 DLLGTGLSIYITKRNPSLVTSFALLSCGYLMSSYQEVRSVVLNTLNRARFTVAVDSFIKT 341
Query: 356 GHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNP 415
G+VPSL+EGN QE +F PW R +PV +GSRF +AFQ+P +++A P FE ERY+VTYNP
Sbjct: 342 GYVPSLKEGNSQETLFIHPW-RHEPVAIGSRFGEAFQEPVSFVATRPLFEDERYMVTYNP 400
Query: 416 SKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADL 475
+K KVYALLKDQAK DD+LKAAFHA+VLLH IN+S+ + + + ++ +NL D
Sbjct: 401 TKDKVYALLKDQAKPDDVLKAAFHAHVLLHFINASHARKRMNTNRSDH-YANLHPRNMDF 459
Query: 476 EAHIGESCKVVSTALWGFQE 495
AHI ESCK+VS + F++
Sbjct: 460 LAHIAESCKLVSASYGTFKK 479
>gi|226492898|ref|NP_001145431.1| uncharacterized protein LOC100278802 [Zea mays]
gi|195656119|gb|ACG47527.1| hypothetical protein [Zea mays]
gi|224035549|gb|ACN36850.1| unknown [Zea mays]
gi|413945393|gb|AFW78042.1| hypothetical protein ZEAMMB73_132907 [Zea mays]
Length = 511
Score = 609 bits (1571), Expect = e-172, Method: Compositional matrix adjust.
Identities = 296/423 (69%), Positives = 354/423 (83%), Gaps = 17/423 (4%)
Query: 84 LICSEDIDGRRWNYVAERDRSGKFK--RNSI----RAISLQSPQAPAEELMSFIRSYVVP 137
++C E++DGRRW+YV + + R S+ +A+ LQSP P EE+MSFIRSYVVP
Sbjct: 68 VLCLEEVDGRRWSYVVDAAGAAVKAKGRASVGGAFKAVPLQSPMPPVEEIMSFIRSYVVP 127
Query: 138 DGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLLNSVGVS++RA GAVAINWILKD
Sbjct: 128 EGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRAASGAVAINWILKD 187
Query: 198 GAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAK 257
GAGRVGKM+FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA P LFLP+AC ANV K
Sbjct: 188 GAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLPMACIANVVK 247
Query: 258 NVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTF 317
NVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTG+SI+ISK NPSL+T+F
Sbjct: 248 NVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISKSNPSLVTSF 307
Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR 377
A LSCGY+ SSY EV+SVVL+TLNRARF+VAV+SF++TG+VPSL++GN QE IF+ PW R
Sbjct: 308 AFLSCGYLLSSYHEVRSVVLNTLNRARFTVAVDSFIRTGYVPSLKDGNSQETIFNPPW-R 366
Query: 378 DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAA 437
+PV +GSRF DAFQ+P ++LA++P FE ERY+VTYNP+K KVYALLKDQAK DDILKAA
Sbjct: 367 HEPVAIGSRFGDAFQEPASFLAIKPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAA 426
Query: 438 FHAYVLLHCINSSNGH-----RSLALKQHEYDNSNLMLTTADLEAHIGESCKVVSTALWG 492
FHA+VLLH IN+S+ + R + + ++YD N+ D HI ESCK+V ++
Sbjct: 427 FHAHVLLHFINASHANLNARKRMNSNRSYQYDPLNM-----DFVPHIEESCKIVMSSYGV 481
Query: 493 FQE 495
F++
Sbjct: 482 FKK 484
>gi|242090561|ref|XP_002441113.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
gi|241946398|gb|EES19543.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
Length = 517
Score = 600 bits (1547), Expect = e-169, Method: Compositional matrix adjust.
Identities = 292/423 (69%), Positives = 355/423 (83%), Gaps = 17/423 (4%)
Query: 84 LICSEDIDGRRWNYVAERDRSGKFK--RNSI----RAISLQSPQAPAEELMSFIRSYVVP 137
++C E++DGRRW+YV + + R S+ +A+ LQSP P EE+MSFIRSYVVP
Sbjct: 74 VLCLEEVDGRRWSYVVDAAGAAVKAKGRASVGGAFKAVPLQSPLPPVEEIMSFIRSYVVP 133
Query: 138 DGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
+GFP+SVTPSYVPYMTWRA+KHFFGGAMGVFTT+TLLNSVGVS++RA GAVAINWILKD
Sbjct: 134 EGFPESVTPSYVPYMTWRAMKHFFGGAMGVFTTRTLLNSVGVSQSRAASGAVAINWILKD 193
Query: 198 GAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAK 257
GAGRVGKM+FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA P LFLP+AC ANV K
Sbjct: 194 GAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQLFLPMACIANVVK 253
Query: 258 NVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTF 317
NVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTG+SI+ISK NPSL+T+F
Sbjct: 254 NVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISKSNPSLVTSF 313
Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR 377
A+LSCGY+ SSY EV+SVVL+TLNRARF+VAV+SF++TG+VPSL++GN QE IF+ PW R
Sbjct: 314 AVLSCGYLLSSYYEVRSVVLNTLNRARFTVAVDSFIRTGYVPSLKDGNSQETIFNPPW-R 372
Query: 378 DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAA 437
+PV +GSRF +AFQ+P +++A++P FE ERY+VTYNP+K KVYALLKDQAK DDILKAA
Sbjct: 373 HEPVAIGSRFGEAFQEPASFIAIKPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAA 432
Query: 438 FHAYVLLHCINSSNGH-----RSLALKQHEYDNSNLMLTTADLEAHIGESCKVVSTALWG 492
FHA+VLLH IN+S+ + R A + ++Y+ N+ D HI ESCK+V ++
Sbjct: 433 FHAHVLLHFINASHANLNARKRMNANRSYQYNPLNM-----DFVPHIEESCKIVMSSYGV 487
Query: 493 FQE 495
F++
Sbjct: 488 FKK 490
>gi|297604521|ref|NP_001055571.2| Os05g0419200 [Oryza sativa Japonica Group]
gi|53982666|gb|AAV25645.1| unknown protein [Oryza sativa Japonica Group]
gi|222631625|gb|EEE63757.1| hypothetical protein OsJ_18576 [Oryza sativa Japonica Group]
gi|255676375|dbj|BAF17485.2| Os05g0419200 [Oryza sativa Japonica Group]
Length = 415
Score = 597 bits (1540), Expect = e-168, Method: Compositional matrix adjust.
Identities = 286/388 (73%), Positives = 334/388 (86%), Gaps = 6/388 (1%)
Query: 111 SIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT 170
+ +A+ LQSP P EE+MSFIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT
Sbjct: 4 TFKAVPLQSPLPPVEEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT 63
Query: 171 QTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLME 230
+TLLNSVGV+++RAT GAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+GDLLME
Sbjct: 64 RTLLNSVGVAQSRATSGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLME 123
Query: 231 LGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGEC 290
LGAG+ELATAA P LFLP+AC ANV KNVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE
Sbjct: 124 LGAGIELATAAFPQLFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGES 183
Query: 291 VGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVE 350
VGNIADLLGTGLSI+ISKRNPSL+T+FA LSCGY+ SSY EV+SVVL+TLN ARF+VAV+
Sbjct: 184 VGNIADLLGTGLSILISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVD 243
Query: 351 SFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYV 410
SF+K+GHVPSL+EGN QE IF+ PW R +PV +GSRF +AFQ+P +++A+ P FE ERY+
Sbjct: 244 SFIKSGHVPSLKEGNSQETIFNPPW-RHQPVAIGSRFGEAFQEPASFVAIRPLFEDERYI 302
Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGH---RSLALKQHEYDNSN 467
VTYNP+K KVYALLKDQAK DDILKAAFHA+VLLH IN+S+ + R Y N+N
Sbjct: 303 VTYNPTKDKVYALLKDQAKPDDILKAAFHAHVLLHFINASHANLNARKRMNSNRSYQNAN 362
Query: 468 LMLTTADLEAHIGESCKVVSTALWGFQE 495
+ D HI ESCK+V+++ F++
Sbjct: 363 PL--NMDFIPHIAESCKIVTSSYGVFKK 388
>gi|8978258|dbj|BAA98149.1| unnamed protein product [Arabidopsis thaliana]
Length = 407
Score = 588 bits (1517), Expect = e-165, Method: Compositional matrix adjust.
Identities = 289/370 (78%), Positives = 326/370 (88%), Gaps = 3/370 (0%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+E+ SF+RSYVVP+GFP SV SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG SRN +
Sbjct: 12 QEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGASRNSS 71
Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
AVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH
Sbjct: 72 ASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 131
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
LFLPLACAANV KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL+GTG SI
Sbjct: 132 LFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLMGTGFSI 191
Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
+ISKRNPSL+TTF LLSCGY+ SSYQEV+SVVLHTLNRARF+VAVESFLKTG VPSL+EG
Sbjct: 192 LISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVESFLKTGRVPSLQEG 251
Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
N+QE IF+FPW+ D+PV+LG+RFKDAFQDP Y+AV+PFF+KERY+VTY+P+KGKVYALL
Sbjct: 252 NIQEKIFTFPWVDDRPVMLGARFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALL 311
Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGESCK 484
K QA SDDILKAAFHA+VLLH +N S +++Q + T +LE+ I ESC+
Sbjct: 312 KHQANSDDILKAAFHAHVLLHFMNQSKDGNPRSVEQLD---PAFAPTEYELESRIAESCE 368
Query: 485 VVSTALWGFQ 494
+VST+ F+
Sbjct: 369 MVSTSYGVFK 378
>gi|226505832|ref|NP_001140509.1| uncharacterized protein LOC100272571 [Zea mays]
gi|194699768|gb|ACF83968.1| unknown [Zea mays]
Length = 491
Score = 585 bits (1508), Expect = e-164, Method: Compositional matrix adjust.
Identities = 302/503 (60%), Positives = 368/503 (73%), Gaps = 53/503 (10%)
Query: 5 PPIKLKQSANSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSES 64
P ++LK+S+ T + DARL VR+ MR + + +P AA L
Sbjct: 3 PAVRLKRSSTQTI---TLPPPDARLAVRDVMRSTIPSRPAEAPASTERPAPAAALQG--- 56
Query: 65 DETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRN---------SIRAI 115
+C E++DGRRW+YV + S K + + S+RA+
Sbjct: 57 -------------------FLCLEEVDGRRWSYVVDGG-SAKGRGSGGVAVPAGASVRAV 96
Query: 116 SLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLN 175
LQSP PAEE+MSFIRSYVVP+GFPDSVTPSYVPYM WRALKHFFGGAMGVFTT+TLL+
Sbjct: 97 PLQSPLPPAEEIMSFIRSYVVPEGFPDSVTPSYVPYMNWRALKHFFGGAMGVFTTRTLLS 156
Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGV 235
SVGVS++ KDGAGRVGKM+FARQGKKFD DLKQLRF+ DLL+E+GAG+
Sbjct: 157 SVGVSQS-------------KDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGI 203
Query: 236 ELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 295
EL TAA P FLP+AC ANV KNVAAVTSTSTRTPIYKA+A+GENIGDVTAKGE VGNIA
Sbjct: 204 ELTTAAFPQFFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIA 263
Query: 296 DLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
DLLGTGLSI ISKRNPSL+T+FA LSCGY+ SSYQEV+SVVL+TLNRARF+VAV+SF+KT
Sbjct: 264 DLLGTGLSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKT 323
Query: 356 GHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNP 415
GH+PSL+EGN QE IF+ PW R + V +GSRF +AFQ+P ++LA+ P FE ERY+VTYNP
Sbjct: 324 GHIPSLKEGNSQETIFNPPW-RHETVAIGSRFGEAFQEPASFLAIRPLFEDERYMVTYNP 382
Query: 416 SKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYD---NSNLMLTT 472
+K KVYALLKDQAKSDDI+KAAFHA+VLLH IN+S+ R L LKQ + + + +L
Sbjct: 383 TKDKVYALLKDQAKSDDIIKAAFHAHVLLHFINASHARR-LKLKQKQANPDQSEHLYSRN 441
Query: 473 ADLEAHIGESCKVVSTALWGFQE 495
D AHI +SCK+VS++ F++
Sbjct: 442 VDFLAHIAKSCKIVSSSYGTFKK 464
>gi|218196816|gb|EEC79243.1| hypothetical protein OsI_19997 [Oryza sativa Indica Group]
Length = 401
Score = 583 bits (1503), Expect = e-164, Method: Compositional matrix adjust.
Identities = 279/374 (74%), Positives = 325/374 (86%), Gaps = 6/374 (1%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+E+MSFIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLLNSVGV+++RA
Sbjct: 4 QEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRA 63
Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
T GAVAINWILKDGAGRVGKM+FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA P
Sbjct: 64 TSGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 123
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
LFLP+AC ANV KNVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTGLSI
Sbjct: 124 LFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGLSI 183
Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
+ISKRNPSL+T+FA LSCGY+ SSY EV+SVVL+TLN ARF+VAV+SF+K+GHVPSL+EG
Sbjct: 184 LISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPSLKEG 243
Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
N QE IF+ PW R +PV +GSRF +AFQ+P +++A+ P FE ERY+VTYNP+K KVYALL
Sbjct: 244 NSQETIFNPPW-RHQPVAIGSRFGEAFQEPASFVAIRPLFEDERYIVTYNPTKDKVYALL 302
Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGH---RSLALKQHEYDNSNLMLTTADLEAHIGE 481
KDQAK DDILKAAFHA+VLLH IN+S+ + R Y N+N + D HI E
Sbjct: 303 KDQAKPDDILKAAFHAHVLLHFINASHANLNARKRMNSNRSYQNANPL--NMDFIPHIAE 360
Query: 482 SCKVVSTALWGFQE 495
SCK+V+++ F++
Sbjct: 361 SCKIVTSSYGVFKK 374
>gi|357133639|ref|XP_003568431.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Brachypodium
distachyon]
Length = 508
Score = 583 bits (1503), Expect = e-164, Method: Compositional matrix adjust.
Identities = 302/493 (61%), Positives = 358/493 (72%), Gaps = 63/493 (12%)
Query: 26 DARLLVRETMRISANLATSSSPIGVGGGGAAATLLQSESDETTAFRLVEDQFVDSSLRLI 85
DAR+ VRE +R++ A V A A + ++
Sbjct: 27 DARVAVREAVRVAVREAEQPPTARVSAVQAPAAAVDG---------------------VL 65
Query: 86 CSEDIDGRRWNYVAERDRS--GKFKRNS--------------IRAISLQSPQAPAEELMS 129
C E++DGRRW+YV E S GK R S RA+ LQSP P EE+MS
Sbjct: 66 CLEEVDGRRWSYVVEGAASSAGKPGRVSARGRGGSAAPVGATFRAVPLQSPLPPVEEIMS 125
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
FIRSYVVP+GFPDSVTPSYVPYMTWRALKHFFGGAM VFTT+ LLNSVGVS++R
Sbjct: 126 FIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSR------ 179
Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
DGAGRVGKM+FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA PHLFLPL
Sbjct: 180 -------DGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHLFLPL 232
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR 309
AC ANV KNVAAVTSTSTRTPIYKA+AKGENIGDVTAKGE VGNIADLLGTG+SI+ISKR
Sbjct: 233 ACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSILISKR 292
Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
NPSL+ +FA LSCGY+ SSY+EV+SVVL+TLN ARF+VAV+SF+KTGHVPSL+EGNL+E
Sbjct: 293 NPSLVASFAFLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSLKEGNLEET 352
Query: 370 IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAK 429
IF+ PW R +PV +GSRF +AFQ+P ++++ P FE ERY+VTYNP+K KVYALLKDQAK
Sbjct: 353 IFNPPW-RHQPVAIGSRFGEAFQEPASFVSTSPLFEDERYIVTYNPTKDKVYALLKDQAK 411
Query: 430 SDDILKAAFHAYVLLHCINSSNGHRSLALKQH--------EYDNSNLMLTTADLEAHIGE 481
DDILKAAFHA+VLLH IN+S H +L ++H +Y N N + D HI E
Sbjct: 412 PDDILKAAFHAHVLLHFINAS--HANLNARKHMNSNRGSYQYVNPNPL--NMDFLPHIEE 467
Query: 482 SCKVVSTALWGFQ 494
SCK+V+++ F+
Sbjct: 468 SCKIVTSSYGIFK 480
>gi|168024902|ref|XP_001764974.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683783|gb|EDQ70190.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 215/359 (59%), Positives = 284/359 (79%), Gaps = 4/359 (1%)
Query: 94 RWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMT 153
+W YV+ RSG+ N+++ + Q + P EL+SF+++Y+VP+G+PDSV PSY PYM
Sbjct: 2 QWKYVSNLGRSGE---NTMQLMYAQQHRDPLAELVSFVKAYIVPEGYPDSVAPSYTPYMQ 58
Query: 154 WRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKK 213
WRA+++FFGGAM VFTT++L++++GVSR + AVA+NW++KDGAGR+GKM+FAR GKK
Sbjct: 59 WRAVQYFFGGAMSVFTTRSLMHALGVSRGTSASSAVAVNWVIKDGAGRIGKMLFARHGKK 118
Query: 214 FDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYK 273
FD DLKQLRF G +LM+LGAGVEL T AVPHLFLPLACAANVAKNVAAVTS+STR PIYK
Sbjct: 119 FDVDLKQLRFKGAMLMQLGAGVELTTMAVPHLFLPLACAANVAKNVAAVTSSSTRAPIYK 178
Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVK 333
AFA+ ENIGDVTAKGEC+ NIADLLGTGL I ISKR+PSL+ TF +LSCGY+ SS+ E+K
Sbjct: 179 AFARRENIGDVTAKGECISNIADLLGTGLGIFISKRSPSLVATFCVLSCGYMISSFNEIK 238
Query: 334 SVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQD 393
SV L TLNRARF V V+SFL+TG VPS+ + N +E++ + PW ++KP+ LGSR AF +
Sbjct: 239 SVRLATLNRARFGVVVQSFLETGQVPSIEDANRRESVITLPW-QEKPLELGSRVGQAFSN 297
Query: 394 PDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNG 452
P ++ + F+KE Y+VTY P+K + Y ++K+ A SDD+++A F A+V+LH + G
Sbjct: 298 PREFIRTQEQFQKENYLVTYRPNKKRAYVVIKESASSDDVVRATFQAHVMLHMLRGLEG 356
>gi|168019221|ref|XP_001762143.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686547|gb|EDQ72935.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 390
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 216/384 (56%), Positives = 287/384 (74%), Gaps = 17/384 (4%)
Query: 112 IRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQ 171
++ + Q + P EL+SF++SY+VP+ +PDSV PSY PYM WRA+++ FGGAM VFTT+
Sbjct: 1 MQVMHAQQHRDPVAELVSFVKSYIVPENYPDSVAPSYTPYMQWRAVQYLFGGAMSVFTTR 60
Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMEL 231
+L++++GVSR + AVA+NW++KDGAGR+GKMIFAR GKKFD DLKQLRF G +LM+L
Sbjct: 61 SLMHALGVSRGASASSAVAVNWVIKDGAGRIGKMIFARHGKKFDVDLKQLRFKGAMLMQL 120
Query: 232 GAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECV 291
GAGVELAT AVPHLFLPLACAANVAKNVAAVTS+STR PIYKAFA+ ENIGDVTAKGEC+
Sbjct: 121 GAGVELATMAVPHLFLPLACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECI 180
Query: 292 GNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
NIADL+GTGL I ISK++PSL+ TF +LSCGY+ SS+ E+KSV L TLNRARF V V+S
Sbjct: 181 SNIADLMGTGLGIFISKKSPSLVATFCVLSCGYMISSFNEIKSVRLATLNRARFGVVVQS 240
Query: 352 FLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
FL+TG VPS+ + N +E++ + PW ++KP+ LGSR A+ +P +L F+KE+Y+V
Sbjct: 241 FLETGKVPSVEDANRRESVITLPW-QEKPLELGSRVAQAYSNPKEFLRAREQFQKEKYLV 299
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLT 471
TY+ K + Y ++K+ A SDD+++A F A+V+LH + G RS L
Sbjct: 300 TYSLKKKRAYVVIKESASSDDVVRATFQAHVMLHIL---RGLRS-------------TLI 343
Query: 472 TADLEAHIGESCKVVSTALWGFQE 495
T+++ A + ES K+ F+E
Sbjct: 344 TSNVGAIVEESTKLTHDLYDSFKE 367
>gi|302764074|ref|XP_002965458.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
gi|300166272|gb|EFJ32878.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
Length = 430
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 210/403 (52%), Positives = 282/403 (69%), Gaps = 20/403 (4%)
Query: 49 GVGGGGAAATLLQSESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFK 108
G+GGG Q SD TA R+ + + S+L C E I +W YVA K
Sbjct: 3 GLGGG-------QCASDGGTAARIADARATRSTL---CLERIGDVQWKYVA--------K 44
Query: 109 RNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVF 168
+++ L Q+ E+M F+RS+VVP+GFPDSV P Y PYM WR L++FFGGAMGVF
Sbjct: 45 GQVLQSEKLFPHQSALAEIMDFVRSHVVPEGFPDSVLPCYTPYMQWRCLQYFFGGAMGVF 104
Query: 169 TTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLL 228
TT++LL+++GV RN A+ A+A+NW++KDGAGR+GKMIFAR GKKFD LKQ+RFAGD+L
Sbjct: 105 TTRSLLHALGV-RNGASSSAIAVNWVVKDGAGRIGKMIFARHGKKFDCHLKQIRFAGDVL 163
Query: 229 MELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG 288
M+L A +ELAT+A P FLPLAC AN+ KNVAAV STSTR PIYKAFA+ ENIGD+TAKG
Sbjct: 164 MQLAAALELATSATPQFFLPLACIANIGKNVAAVASTSTRAPIYKAFARRENIGDITAKG 223
Query: 289 ECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
EC+GNIADLLGTG+ I++SK P + F+ L+ GYV+SS++EVK++ L TLNR RF +A
Sbjct: 224 ECIGNIADLLGTGMGILMSKNFP-VFAPFSALAFGYVYSSFREVKAIQLPTLNRHRFGIA 282
Query: 349 VESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKER 408
V++FL+TG VP L E N +E I P + + LG+R DAF P Y ++ F+ E
Sbjct: 283 VDTFLETGKVPGLVEANERERIIVGPQWSLEKLELGARVSDAFSAPQDYQYIDSLFKGEN 342
Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSN 451
Y+V+YNP + + Y +LK++A DD+L+AAFH + ++ + +
Sbjct: 343 YLVSYNPKRQRTYVVLKERANGDDVLRAAFHGQLFVNIFDKCS 385
>gi|302835848|ref|XP_002949485.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
nagariensis]
gi|300265312|gb|EFJ49504.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
nagariensis]
Length = 495
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 159/334 (47%), Positives = 219/334 (65%), Gaps = 6/334 (1%)
Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
Q + + I +Y++P+GFPDSV P Y PYM WR +++FFGGA+ VFTTQ+LL ++GV+
Sbjct: 99 QTHEDHFRALIHAYLLPNGFPDSVGPQYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVA 158
Query: 181 RNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATA 240
A AINW++KDGAGR+G+++FAR G++ D +LKQ R AGDLLME GA +ELAT
Sbjct: 159 GRFQGEAAAAINWVIKDGAGRLGRLLFARWGRELDCELKQFRLAGDLLMEAGAALELATV 218
Query: 241 AVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
P FLPLAC AN++KN+AAV ++STR PIY+ FA N+ D+TAKGE V N+AD+LGT
Sbjct: 219 YAPPAFLPLACTANLSKNLAAVAASSTRAPIYRTFALQNNLADITAKGESVANLADILGT 278
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
I +S+ + TF +LS GY+ SS +EV SV L +NRAR + A + +L G +P
Sbjct: 279 VAGIALSRMKLPRMPTFCVLSAGYLLSSRKEVDSVELPYMNRARLAYATQRYLSDGFIPG 338
Query: 361 LREGNLQENIFSFPWLRDKP--VVLGSRFKDAFQDPDAYL-AVEPFFEKE-RYVVTYNPS 416
+ E N E + PW R +VLG+ + A P AV+ F E E RY+VTY P
Sbjct: 339 VAEANHNEPL--LPWGRHNQNRLVLGASVESACAGPSELAHAVQRFPEPEYRYLVTYRPD 396
Query: 417 KGKVYALLKDQAKSDDILKAAFHAYVLLHCINSS 450
K Y LL++ A S D L+A+F +V LH ++ +
Sbjct: 397 TKKAYVLLREGATSMDCLQASFFGHVFLHLLDGA 430
>gi|307105811|gb|EFN54059.1| hypothetical protein CHLNCDRAFT_8919, partial [Chlorella
variabilis]
Length = 325
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 210/324 (64%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+ L + Y++P FP SV P Y YM R++++FFGGAM VFTT++LL S+GV+ +
Sbjct: 1 QRLGDLLADYLLPQDFPHSVAPQYSDYMRARSVQYFFGGAMSVFTTRSLLASLGVANKHS 60
Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
+ A AINW++KDGAGR+G+ +FAR G++ D +LKQ R GD+LME GA +EL+T +P
Sbjct: 61 SEAAAAINWVVKDGAGRLGRFLFARWGRELDCELKQFRLMGDVLMETGAALELSTVLMPR 120
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
+FLPLAC AN+AKN+AAVT++STR PIY+ FAK N+ DVTAKGE V N+AD++GT I
Sbjct: 121 MFLPLACTANLAKNLAAVTASSTRAPIYRTFAKQNNLADVTAKGESVANLADVVGTAFGI 180
Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
++K N ++ TFA LS GY+ +S +EV SVVL LNRAR S +F TG VP EG
Sbjct: 181 ALAKANLPVLPTFAALSVGYLIASRREVDSVVLPYLNRARLSYTTRAFYSTGRVPETLEG 240
Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
N +E + + + VVLG+ ++A P F +Y +TY P K YALL
Sbjct: 241 NYREPLMPWSDPHNGRVVLGATVEEACAGPQQLHDALAAFSGRQYALTYRPDTRKCYALL 300
Query: 425 KDQAKSDDILKAAFHAYVLLHCIN 448
K A + +AA A+ LL ++
Sbjct: 301 KQGASPRSVQQAAMDAHALLWMLD 324
>gi|413949258|gb|AFW81907.1| hypothetical protein ZEAMMB73_025152 [Zea mays]
Length = 253
Score = 277 bits (708), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 131/178 (73%), Positives = 149/178 (83%), Gaps = 6/178 (3%)
Query: 84 LICSEDIDGRRWNYVAERDRSGKFKRN------SIRAISLQSPQAPAEELMSFIRSYVVP 137
++C E++DGRRW+YV + + + S + + LQSP P EE+MSFIRSYVVP
Sbjct: 69 VLCLEEVDGRRWSYVVDAAGAAVKAKGRASVGASFKVVPLQSPLPPVEEIMSFIRSYVVP 128
Query: 138 DGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
+GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT+TLLNSVGVS++RA GAVAINWILKD
Sbjct: 129 EGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRAASGAVAINWILKD 188
Query: 198 GAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANV 255
GAGRVGKM FARQGKKFDYDLKQLRF+GDLLMELGAG+ELATAA P FLP+AC ANV
Sbjct: 189 GAGRVGKMFFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQFFLPMACIANV 246
>gi|388520739|gb|AFK48431.1| unknown [Medicago truncatula]
Length = 236
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 142/207 (68%), Positives = 160/207 (77%), Gaps = 25/207 (12%)
Query: 29 LLVRETMRISANLATSSSPIGVGGGGAAATLLQSESDETTAFRLVEDQFVDSSLRLICSE 88
+L+RET+RISA LA+SS+P LL S++ F+ SS R IC E
Sbjct: 33 ILIRETLRISAELASSSTP-----------LLPSDT-----------TFIGSS-RFICCE 69
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
IDGRRWNYVA+ D SGKF NS R++SLQ+P+ P +E +SF+RSYVVP+GFPDSVTPSY
Sbjct: 70 QIDGRRWNYVADTDASGKFNNNSFRSLSLQTPKPPLDEFISFVRSYVVPEGFPDSVTPSY 129
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR--ATPGAVAINWILKDGAGRVGKMI 206
VPYMTWRALKHFFGGAMGVFTTQTLL SVGVSRNR AVAINWIL DGAGRVGKM+
Sbjct: 130 VPYMTWRALKHFFGGAMGVFTTQTLLTSVGVSRNRATPAAAAVAINWILNDGAGRVGKML 189
Query: 207 FARQGKKFDYDLKQLRFAGDLLMELGA 233
FARQGKKFDYDLKQLRFAGDLLMELG
Sbjct: 190 FARQGKKFDYDLKQLRFAGDLLMELGC 216
>gi|449016609|dbj|BAM80011.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 496
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/347 (38%), Positives = 199/347 (57%), Gaps = 12/347 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+PDG+PDSV P Y Y WR FFGG +GVF+TQ+LL +VGV R A P A A+ W++
Sbjct: 125 LPDGYPDSVGPDYAEYTKWRGRAFFFGGMVGVFSTQSLLLAVGVGRPSAAPIAAALQWVI 184
Query: 196 KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANV 255
+DG GR G+M+F++ G FD + KQ R A ++ + +E T PHLFLPLA AN+
Sbjct: 185 RDGLGRAGRMLFSQVGTGFDAETKQYRLAAAFVLNVSCALEALTPLFPHLFLPLASLANM 244
Query: 256 AKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS--- 312
AK + V + STR IY++F + EN+GD+TAK E VG DLLGT + +++S+ +
Sbjct: 245 AKGASTVAAASTRGAIYRSFMRRENLGDITAKQETVGVAGDLLGTAVGVILSRASSHSHR 304
Query: 313 -LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK-TGHVPSLREGNLQENI 370
+ F S G++ S+Y EVKSV L TLNR R + + ++L+ V +R+ N +E +
Sbjct: 305 LSVIAFIGASLGHLISAYYEVKSVELRTLNRQRAHMLIWNYLEDQDRVLPIRDVNRRERL 364
Query: 371 FSFPW---LRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQ 427
PW L V +G+R DA D DA + + E+Y++ Y + G+V +L++
Sbjct: 365 VYRPWLDSLHAPNVEIGARLSDAAPDADALKFLLRLYRDEKYMMNY--AGGRVRIVLRND 422
Query: 428 AKSDDILKAAFH--AYVLLHCINSSNGHRSLALKQHEYDNSNLMLTT 472
A D LKA F A+ L+ + + AL + Y +N + +
Sbjct: 423 ATHQDQLKAYFQSRAFWRLYMTSGERFSPNRALIEDSYRATNQLFPS 469
>gi|452824810|gb|EME31810.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 636
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 117/345 (33%), Positives = 190/345 (55%), Gaps = 10/345 (2%)
Query: 106 KFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAM 165
+F + I+ + + E+L+ ++ +PDG+P+SVT Y+ Y WRA+++ M
Sbjct: 250 QFSDDMIQEGRFKDDRPIFEKLIDWLSRVYLPDGYPNSVTKDYLSYSIWRAIQNLAASVM 309
Query: 166 GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFA 224
GVF T+ LL +G+ R + AI+W+LKDG G +GK+ + G +FD D K R A
Sbjct: 310 GVFATEALLFGLGLGRKSTPATSAAISWVLKDGLGYIGKVFYGSIAGNQFDVDPKSWRLA 369
Query: 225 GDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV 284
D + + G +E+ T P FL LA AN K VAA+T T+TR IYK+ A EN GD+
Sbjct: 370 ADAVEDAGGVLEIITPLFPGHFLLLASVANAMKGVAAMTGTATRHAIYKSLALHENQGDI 429
Query: 285 TAKGECVGNIADLLGTGLSIMISKRNP----SLITTFALLSCGYVFSSYQEVKSVVLHTL 340
KGE G ++G GL I+ S + +L++ +AL ++F++++ + V TL
Sbjct: 430 ATKGESQGVTCKMVGLGLGIVASSKMGQNYFALLSAYALGCFVHLFANWKSLSCVQFATL 489
Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFPW--LRDKPVVLGSRFKDAFQDPDAYL 398
NR R S+A++ FL+TG+VPS E + +E + PW D+ +V+G+ K+ FQ+
Sbjct: 490 NRQRCSLALQEFLQTGNVPSPYEVSRREKVVVPPWKGYLDR-IVVGANLKECFQNGRELR 548
Query: 399 AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ RY++T + K+ +L + + DI +A A+ L
Sbjct: 549 EAVALYRDCRYILTQRDN--KILIVLHESCRPQDIFQAFMQAHKL 591
>gi|452823212|gb|EME30224.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 849
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 113/303 (37%), Positives = 176/303 (58%), Gaps = 10/303 (3%)
Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
FP+ V+ Y + WR + +GVF TQ LL ++G+ R A A +W+LKDG
Sbjct: 423 FPEGVSQDYYSFTMWRVFQRLVSSTVGVFGTQALLLALGIKAGRIGRAA-ATSWVLKDGL 481
Query: 200 GRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
GR+GKM++A Q G+ FD D K+ RF LL LG G+E+ T P FL A AN K
Sbjct: 482 GRLGKMVWASQMGRDFDADPKRWRFRSALLYALGNGLEIVTQIFPASFLLFATLANSMKQ 541
Query: 259 VAAVTSTSTRTPIYKAFA-KGENIGDVTAKGECVGNIADLLGTGLSIMISK----RNPSL 313
++ +T+++TR +YK FA + ENI D+TAKGE +ADLLG L I +SK P++
Sbjct: 542 ISMLTASATRNAMYKNFAGRSENIADITAKGEAQIVVADLLGMALGIQLSKLIGTSRPNV 601
Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF 373
+T ++ LS +++ Y+E+++V TLN R S+ V+ F++ G+VP E + +ENIF
Sbjct: 602 LTAYSFLSVLDIYAFYKELRAVQFRTLNYERSSMIVDYFVRRGYVPRPDEVSQRENIFLG 661
Query: 374 PWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDI 433
P R + + DA P ++ F+ E+++VT + G+ +L++ A++ DI
Sbjct: 662 P--RYDVRSMFASLSDAVSCPQELNSLIKIFKGEQFMVTRGWN-GQYRIVLREHARNGDI 718
Query: 434 LKA 436
L+A
Sbjct: 719 LRA 721
>gi|225462223|ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
Length = 713
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 111/342 (32%), Positives = 172/342 (50%), Gaps = 32/342 (9%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
F+R ++P+GFP SVT Y+ Y WR ++ GV TQ LL +VG+ + A P A
Sbjct: 285 FLR-LMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKG-AIPTAA 342
Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
A+NW+LKDG G + K++ ++ G+ FD K R DLL G+E+ T A PH FL +
Sbjct: 343 AVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLI 402
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVG----NIADLLGTGLSIM 305
A ++ AA+ STR+ Y FA N +V AKGE G +I +LG L+
Sbjct: 403 GAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANC 462
Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
I P +F +++ ++F + + +S+ L TLN R S+ +L +G VPS++E N
Sbjct: 463 IGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVN 522
Query: 366 LQENIFS-FPWLRDKP-----------------------VVLGSRFKDAFQDPDAYLAVE 401
+E +F P L KP + LGS+ + + LA+
Sbjct: 523 EEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALF 582
Query: 402 PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ E Y++T + KG+ + +LK+ D+LK+ FH L
Sbjct: 583 DLYRNEAYILTEH--KGRFFVILKESCSPQDMLKSVFHVNYL 622
>gi|296082804|emb|CBI21809.3| unnamed protein product [Vitis vinifera]
Length = 537
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 111/342 (32%), Positives = 172/342 (50%), Gaps = 32/342 (9%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
F+R ++P+GFP SVT Y+ Y WR ++ GV TQ LL +VG+ + A P A
Sbjct: 83 FLR-LMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKG-AIPTAA 140
Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
A+NW+LKDG G + K++ ++ G+ FD K R DLL G+E+ T A PH FL +
Sbjct: 141 AVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLI 200
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVG----NIADLLGTGLSIM 305
A ++ AA+ STR+ Y FA N +V AKGE G +I +LG L+
Sbjct: 201 GAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANC 260
Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
I P +F +++ ++F + + +S+ L TLN R S+ +L +G VPS++E N
Sbjct: 261 IGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVN 320
Query: 366 LQENIFS-FPWLRDKP-----------------------VVLGSRFKDAFQDPDAYLAVE 401
+E +F P L KP + LGS+ + + LA+
Sbjct: 321 EEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALF 380
Query: 402 PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ E Y++T + KG+ + +LK+ D+LK+ FH L
Sbjct: 381 DLYRNEAYILTEH--KGRFFVILKESCSPQDMLKSVFHVNYL 420
>gi|302816607|ref|XP_002989982.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
gi|300142293|gb|EFJ08995.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
Length = 440
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 121/395 (30%), Positives = 193/395 (48%), Gaps = 37/395 (9%)
Query: 100 ERDRSGKFKRNSIRAISL---QSPQAPAEELMSF----IRSYVVPDGFPDSVTPSYVPYM 152
E++ GK ++ +A ++ +S + +E++S+ +++ ++P G+P +V+ Y Y
Sbjct: 11 EKNSKGKSSPSTDKAGAVSTGRSRKLGVDEIVSWCLDALKTTMLPIGYPKTVSDDYTEYT 70
Query: 153 TWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGK 212
WR + GV TTQ LL +VG+ + A P A AINW+LKDG G + K++ ++ G+
Sbjct: 71 LWRMGQIIASQISGVLTTQALLYAVGLGKG-AIPTAAAINWVLKDGIGYLSKIVLSKYGR 129
Query: 213 KFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIY 272
FD K R DL+ + G+EL T PHLF+PL A K+ A + +TR+ Y
Sbjct: 130 HFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFVPLTAVAGAGKSAAGLIQAATRSCFY 189
Query: 273 KAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR----NPSLITTFALLSCGYVFSS 328
FA N +V AKGE G ++ LG L I IS + P L+ +F ++ +V +
Sbjct: 190 AGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGISSKVGSSGPLLVLSFGTVTAVHVLCN 249
Query: 329 YQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS---FPWLRDKPVV--- 382
+ ++V LHTLN R + + +L T +PS+R+ N E IF+ F + PV
Sbjct: 250 LKSYQAVQLHTLNPYRTGLIMAEYLTTQRIPSVRDVNSVEPIFTVADFFSSKKAPVSTHL 309
Query: 383 ------------------LGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
LG F +A A A+ + E Y++TY+ S +L
Sbjct: 310 SISCESKCAAADIEKNLKLGVSFGEAVASRGAAQALLDLYSGEEYLLTYDDSHC-FKVVL 368
Query: 425 KDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALK 459
K A D+L+A A L H N + + LK
Sbjct: 369 KKSASPHDMLRAVVQASYLRHIHNQQSLSKDELLK 403
>gi|302771033|ref|XP_002968935.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
gi|300163440|gb|EFJ30051.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
Length = 389
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 112/353 (31%), Positives = 172/353 (48%), Gaps = 26/353 (7%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
+++ ++P G+P +V+ Y Y WR + GV TTQ LL +VG+ + A P A A
Sbjct: 8 LKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISGVLTTQALLYAVGLGKG-AIPTAAA 66
Query: 191 INWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA 250
INW+LKDG G + K++ ++ G+ FD K R DL+ + G+EL T PHLF+PL
Sbjct: 67 INWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFVPLT 126
Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR- 309
A K+ A + +TR+ Y FA N +V AKGE G ++ LG L I IS +
Sbjct: 127 AVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGISSKV 186
Query: 310 ---NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
P L+ +F ++ +V + + ++V LHTLN R + + +L T +PS+R+ N
Sbjct: 187 GSSGPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPYRTGLIMAEYLTTQRIPSVRDVNS 246
Query: 367 QENIFSFPWL--------------------RDKPVVLGSRFKDAFQDPDAYLAVEPFFEK 406
E IF+ +K + LG F +A A A+ +
Sbjct: 247 VEPIFTVADFFSSKKAPLSISCESKCAAADIEKNLKLGVSFGEAVASRGAAQALLDLYSG 306
Query: 407 ERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALK 459
E Y++TY+ S +LK A D+L+A A L H N + + LK
Sbjct: 307 EEYLLTYDDSHC-FKVVLKKSASPHDMLRAVVQASYLRHIHNQQSLSKDELLK 358
>gi|334184613|ref|NP_001189649.1| uncharacterized protein [Arabidopsis thaliana]
gi|330253412|gb|AEC08506.1| uncharacterized protein [Arabidopsis thaliana]
Length = 432
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 189/356 (53%), Gaps = 23/356 (6%)
Query: 123 PAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
P + + SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+
Sbjct: 59 PQKMVESFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL--- 114
Query: 183 RATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAA 241
R TP A ++WILKDG VGK+I + G + D + K+ R D+L +LG G+EL +
Sbjct: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPL 174
Query: 242 VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTG 301
PHLFL +A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + ++ G G
Sbjct: 175 CPHLFLEMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIG 234
Query: 302 LSIM----ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
I I + ++LS +V+S ++++ V ++TLN R ++ V +FLKTG
Sbjct: 235 AGIQLASTICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGK 294
Query: 358 VPSLREGNLQENIFSFPWLRDKP------VVLGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
VPS + QE++ FP ++P V +G A + P ++ F +E++++
Sbjct: 295 VPSPPDLRFQEDLM-FP---ERPIQDAGNVKVGRALHKAVK-PSEVQRLKQVFVEEKFLL 349
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
++ S + +L+ A +D L+ A + N + L Q YD N
Sbjct: 350 SHGKSWTDM--VLEHDATGEDALRGWLVAAYVKSMTKIYNDPDDIIL-QDAYDKMN 402
>gi|18402585|ref|NP_565718.1| uncharacterized protein [Arabidopsis thaliana]
gi|20197897|gb|AAD20664.2| expressed protein [Arabidopsis thaliana]
gi|21553562|gb|AAM62655.1| unknown [Arabidopsis thaliana]
gi|109134157|gb|ABG25076.1| At2g31190 [Arabidopsis thaliana]
gi|330253411|gb|AEC08505.1| uncharacterized protein [Arabidopsis thaliana]
Length = 433
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 189/356 (53%), Gaps = 23/356 (6%)
Query: 123 PAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
P + + SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+
Sbjct: 60 PQKMVESFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL--- 115
Query: 183 RATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAA 241
R TP A ++WILKDG VGK+I + G + D + K+ R D+L +LG G+EL +
Sbjct: 116 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPL 175
Query: 242 VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTG 301
PHLFL +A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + ++ G G
Sbjct: 176 CPHLFLEMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIG 235
Query: 302 LSIM----ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
I I + ++LS +V+S ++++ V ++TLN R ++ V +FLKTG
Sbjct: 236 AGIQLASTICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGK 295
Query: 358 VPSLREGNLQENIFSFPWLRDKP------VVLGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
VPS + QE++ FP ++P V +G A + P ++ F +E++++
Sbjct: 296 VPSPPDLRFQEDLM-FP---ERPIQDAGNVKVGRALHKAVK-PSEVQRLKQVFVEEKFLL 350
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
++ S + +L+ A +D L+ A + N + L Q YD N
Sbjct: 351 SHGKSWTDM--VLEHDATGEDALRGWLVAAYVKSMTKIYNDPDDIIL-QDAYDKMN 403
>gi|449018940|dbj|BAM82342.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 738
Score = 174 bits (442), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 93/237 (39%), Positives = 140/237 (59%), Gaps = 6/237 (2%)
Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
P+ VT Y + WR + +GVF TQ LL ++G+ A I+W+LKDG
Sbjct: 379 IPEDVTADYYEFAKWRVFQRLMSSTVGVFGTQALLLALGIKSGNKISQAATISWVLKDGL 438
Query: 200 GRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
RVGKM++A Q G+ D D K+ RF LL LG G+E+AT VP FL +A AN AK
Sbjct: 439 SRVGKMLWASQLGRDMDADPKRFRFGSALLYSLGNGLEIATRIVPQSFLLVATLANTAKL 498
Query: 259 VAAVTSTSTRTPIYKAFA-KGENIGDVTAKGECVGNIADLLGTGLSIMISK----RNPSL 313
+ +T+++TR +Y++FA + ENI D+TAKGE IADL G L I +SK P +
Sbjct: 499 CSMLTASATRNAMYRSFADRNENIADITAKGEAQITIADLGGMALGIQLSKIIGTSRPKV 558
Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ T+ +LS +++ ++E++ + TLN R S+ +E +LKTG VP+ ++ + E +
Sbjct: 559 VATYLILSTIDLYAIWKELRVIEFRTLNLQRSSLVIEHYLKTGCVPTPKQVSAAERV 615
>gi|38344695|emb|CAE02373.2| OSJNBb0096E05.17 [Oryza sativa Japonica Group]
Length = 593
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 107/333 (32%), Positives = 180/333 (54%), Gaps = 27/333 (8%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++PDG+PDSV+ Y+ Y WR ++ GV +TQ LL +VG+ + A P A A+NW+
Sbjct: 196 LLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 254
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K++ ++ G+ FD + K R DLL G+E+ T PHLF+P+ AA
Sbjct: 255 LKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPHLFVPIGAAAG 314
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI 314
++ AA+ +TR+ Y FA N +V AKGE G ++ LG L I ++ R S +
Sbjct: 315 AGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGIMLGIALANRIGSSV 374
Query: 315 T----TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ +FA ++ +++ + + +S+ L TLN R S+ +L +G VPS++E N +E +
Sbjct: 375 SLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSVKEVNDEEPL 434
Query: 371 F--------------SFPWLRD------KPVVLGSRFKDAFQDPDAYLAVEPFFEKERYV 410
F P +D + + LGS+ + ++ + A+ ++ E+Y+
Sbjct: 435 FLNLSIGTSRKESKILSPQAKDAAEIICRRLQLGSKLSEIIENKEDACALFDLYKNEQYL 494
Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+T KGK +LK+ + +D+LK+ FH L
Sbjct: 495 LTEY--KGKFCVILKEGSSPEDMLKSLFHVNYL 525
>gi|297744143|emb|CBI37113.3| unnamed protein product [Vitis vinifera]
Length = 424
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 113/352 (32%), Positives = 188/352 (53%), Gaps = 17/352 (4%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ R TP
Sbjct: 57 SFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL---RPTPAQ 112
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++W+LKDG VGK+I + G + D + K R D L +LG G+E+ + PHLFL
Sbjct: 113 ATAVSWVLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFL 172
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I ++
Sbjct: 173 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLA 232
Query: 308 KRNPS----LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
S + LLS +V+S +E+++ ++TLN R ++ V F+KTG + S +
Sbjct: 233 STICSSMQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPAD 292
Query: 364 GNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
QE+ IF+ + D V R P ++ F +E++++ + GK +
Sbjct: 293 LRYQEDLIFTGRLIEDAGNVKVGRDLHKVVKPSKLCKLKGVFPEEKFLLCH----GKKWT 348
Query: 423 --LLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTT 472
+L+ A +D L+ A + + + ++ S+ + Q YD N + +T
Sbjct: 349 DMVLEHDATGEDALRGWLVAAYVAN-LEKTDHEPSMTVLQEAYDKMNSVFST 399
>gi|225438149|ref|XP_002273202.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
Length = 420
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 112/350 (32%), Positives = 186/350 (53%), Gaps = 13/350 (3%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ R TP
Sbjct: 53 SFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL---RPTPAQ 108
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++W+LKDG VGK+I + G + D + K R D L +LG G+E+ + PHLFL
Sbjct: 109 ATAVSWVLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFL 168
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I ++
Sbjct: 169 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLA 228
Query: 308 KRNPS----LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
S + LLS +V+S +E+++ ++TLN R ++ V F+KTG + S +
Sbjct: 229 STICSSMQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPAD 288
Query: 364 GNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
QE+ IF+ + D V R P ++ F +E++++ + K
Sbjct: 289 LRYQEDLIFTGRLIEDAGNVKVGRDLHKVVKPSKLCKLKGVFPEEKFLLCHG--KKWTDM 346
Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTT 472
+L+ A +D L+ A + + + ++ S+ + Q YD N + +T
Sbjct: 347 VLEHDATGEDALRGWLVAAYVAN-LEKTDHEPSMTVLQEAYDKMNSVFST 395
>gi|255557849|ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
gi|223541000|gb|EEF42558.1| conserved hypothetical protein [Ricinus communis]
Length = 541
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 109/341 (31%), Positives = 177/341 (51%), Gaps = 31/341 (9%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
F+R ++P+G+P SVT Y+ Y WR ++ GV TQ LL ++G+ + A P A
Sbjct: 132 FVR-LMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 189
Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
AINW+LKDG G + K++ ++ G+ FD + K R DLL G+E+ T A PHLF+ +
Sbjct: 190 AINWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLEILTPAFPHLFVFI 249
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD----LLGTGLSIM 305
AA ++ AA+ +TR+ Y FA N +V AKGE G ++ +LG GL+
Sbjct: 250 GAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIGLANC 309
Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
I P + +F++++ ++F + + +S+ L TLN R S+ +L +G P +++ N
Sbjct: 310 IGSSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPIKDVN 369
Query: 366 LQENIFS--FPWLR--DKPVV-------------------LGSRFKDAFQDPDAYLAVEP 402
+E +F FP + DKP + LGS+ D + LA+
Sbjct: 370 DEEPLFPAVFPHFKSADKPSLVVLSLEARDAATEIERRLQLGSKLSDVVNSKEDVLALFN 429
Query: 403 FFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
++ E Y++T KG+ +LK+ + D+LKA F L
Sbjct: 430 LYKDEGYILT--EYKGRFCVVLKESCSAQDMLKALFQVNYL 468
>gi|449455818|ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 611
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 113/346 (32%), Positives = 177/346 (51%), Gaps = 32/346 (9%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++P+GFPDSVT Y+ Y WR ++ GV TQ LL +VG+ + A P A A+NW+
Sbjct: 211 MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKG-AIPTAAAVNWV 269
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K+ ++ G+ FD K R DLL G+E+ T A P F+ + AA
Sbjct: 270 LKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAG 329
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS-- 312
++ AA+ +TR+ Y FA N +V AKGE G ++ +G L I ++ R S
Sbjct: 330 AGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSST 389
Query: 313 --LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ F++++ ++F + + KS+ L TLN R S+ +L +G VPS+++ N +E +
Sbjct: 390 SLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPL 449
Query: 371 F-SFP--------WLRD---------------KPVVLGSRFKDAFQDPDAYLAVEPFFEK 406
F + P W RD K + LGS+ D + L + F K
Sbjct: 450 FPAVPLLNRKAPDWSRDFLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNK 509
Query: 407 ERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNG 452
E Y+++ + +GK +LK+ A D+LKA FH LH + + G
Sbjct: 510 ENYILSEH--RGKYCVMLKESASPVDMLKAVFHVNY-LHWLERNAG 552
>gi|168005499|ref|XP_001755448.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693576|gb|EDQ79928.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 503
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 113/419 (26%), Positives = 202/419 (48%), Gaps = 38/419 (9%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFI---RSYVVPDGFPDSVT 145
++ G RW Y+ + +F +++ ++ + + F+ + V+PDG+P SVT
Sbjct: 48 EVKGGRWKYLVKHPERDEFVVQTVKNSEEEALAYEEDVITRFVELGKQIVLPDGYPQSVT 107
Query: 146 PSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKM 205
Y Y WR + GV TTQ LL +VG+ + A P A A+NW+L+DG G + K+
Sbjct: 108 DDYFEYTLWRMGQVIASQISGVLTTQALLYAVGLGKG-AIPTAAAVNWVLRDGIGYLSKI 166
Query: 206 IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTST 265
+ ++ G+ FD K R DL+ + G+EL T A PHLF+ LA AA ++ + +
Sbjct: 167 VLSKYGRHFDVHPKGWRLVSDLIENMSYGLELLTPAFPHLFVYLAAAAGAGRSASGLIQA 226
Query: 266 STRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS----KRNPSLITTFALLS 321
+T++ A N ++ AKGE G ++ +G L I +S P L+ TF ++
Sbjct: 227 ATKSCFNAGMAANRNFAEIIAKGEAQGMVSKSIGIALGIAVSAYVGSTGPLLVGTFFGVT 286
Query: 322 CGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR---- 377
++F + + ++V L TLN R S+ + + ++G+V S+RE N +E IF F +
Sbjct: 287 ALHIFCNLKSYQAVQLRTLNPYRASLVLAEYFRSGNVVSVREVNGEEPIFPFSFFNFRLR 346
Query: 378 --------------------DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSK 417
+ + G+ F A + + A+ ++ ERY+++ +
Sbjct: 347 GSKGDGTLLSEESKAVAHSIENHLEFGATFGAAVKTREEADALLDIYKSERYLLSQHDD- 405
Query: 418 GKVYALLKDQAKSDDILKAAFHAYVLLHCINSS-NGHRSLALKQHEYDNSNLMLTTADL 475
++ +LK+ A D+L+A A L + + HR+L H+ N ++ + D
Sbjct: 406 -RLLVVLKEGATPRDMLRAMMQAIYLDYLQGTGFTSHRNL---MHDSGNGGVLRVSHDF 460
>gi|226491780|ref|NP_001140263.1| uncharacterized protein LOC100272306 [Zea mays]
gi|194698746|gb|ACF83457.1| unknown [Zea mays]
Length = 451
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 102/332 (30%), Positives = 175/332 (52%), Gaps = 26/332 (7%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++PDG+P SV+ Y+ Y WR ++ GV +TQ LL +VG+ + A P A A+NW+
Sbjct: 55 LLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 113
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K++ ++ G+ FD + K R D L + G+E+ T PHLF+P+ AA
Sbjct: 114 LKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPHLFVPIGAAAG 173
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI----MISKRN 310
++ A++ ++TR+ Y FA N +V AKGE G ++ +G GL I I
Sbjct: 174 AGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGIALANHIGSSV 233
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
P + +FA ++ +++ + + +S+ L TLN R S+ +L +G VPS++E N +E +
Sbjct: 234 PLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSVKEVNDEEPL 293
Query: 371 F---SFPWLRDKPVV----------------LGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
F S R ++ +GS+ + + + A+ ++ E+Y++
Sbjct: 294 FFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACALFDLYKNEQYLL 353
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
KGK +LK+ + +D+LK+ FH L
Sbjct: 354 M--DYKGKFCIVLKEGSSPEDMLKSLFHVNYL 383
>gi|302775588|ref|XP_002971211.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
gi|300161193|gb|EFJ27809.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
Length = 399
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 105/308 (34%), Positives = 174/308 (56%), Gaps = 18/308 (5%)
Query: 137 PDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWIL 195
P G+P SV Y+ Y +RAL+HF + VF+TQ+L+ + G+ R TP A I+W+L
Sbjct: 49 PVGYPQSVGEGYLTYSQFRALQHFSSAILSVFSTQSLVFAAGL---RPTPAQATVISWVL 105
Query: 196 KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANV 255
KDG +GK++ +R G + D + K+ R D+L + G +E+ + A P FL +A AN+
Sbjct: 106 KDGMQHIGKLVGSRMGARMDSEPKRWRIFADVLYDAGTALEMISPACPQYFLGVAGLANL 165
Query: 256 AKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI-MISKRNPSL- 313
AK VA VT+ +TR P+Y AFAK N+ D+ AKGE + +A++ G G+ I + S SL
Sbjct: 166 AKGVAMVTARATRLPLYTAFAKEGNLSDLYAKGEAMSTLANVAGLGVGIRLASSACTSLQ 225
Query: 314 --ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
+ LLS +++S +E+++V L TLN R ++ V +L+ G VPS E +E +
Sbjct: 226 GKLLVAPLLSVIHLYSVMEEMRAVPLDTLNAQRTAMLVADYLELGRVPSPAEIRYRERLI 285
Query: 372 SFPWLRDKPVVLGSRFK---DAFQDPDAYL-AVEPFFEKERYVVTYNPSKGKVYALLKDQ 427
L KP V + D Q P + ++ F+ ER+++ ++ + ++ +L
Sbjct: 286 ----LPVKPNVHAGSVRIANDILQIPSGKIRELKHRFQSERFLLDFHETSTEM--VLHHS 339
Query: 428 AKSDDILK 435
A +D+++
Sbjct: 340 ASGEDVVR 347
>gi|168013684|ref|XP_001759421.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689351|gb|EDQ75723.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 712
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 162/297 (54%), Gaps = 23/297 (7%)
Query: 92 GRRWNYV--AERDRSGKFKRNSIRAISLQSPQAPAEELMSFI-------RSYVVPDGFPD 142
GR+W YV E+ R+ +F +SLQ+ +A E + + R + PD
Sbjct: 230 GRQWQYVFDGEQIRAVEFD-----GVSLQNGEARNESVSGQVAGSVLKLRQTLQRIFVPD 284
Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRV 202
V P Y+ Y+ W+ + F + TQ +L +VGV R P A A+NW++KDG GR+
Sbjct: 285 QVWPHYLVYLKWKLVHRFLSSVLHFQCTQAMLWAVGVGAKRRLPAAAALNWVMKDGVGRL 344
Query: 203 GKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAA 261
GK++F A G+ FD DLK++RF +L G+E+ T P FL LA AN+AK++A
Sbjct: 345 GKLLFTANFGRTFDSDLKRVRFWTSVLFSSSVGLEILTPLFPQHFLLLATLANIAKSIAY 404
Query: 262 VTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK---RNPSL----- 313
+T + I+++FA G+N+ D++AKG +AD LG ++ +S+ P +
Sbjct: 405 AAYLATSSAIHRSFALGDNLADISAKGMAQTVVADNLGLAAAVCLSQFIHNFPKIEKFLP 464
Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ + +L+ +F+ YQ++++V L TLN+ R + + ++K +PS E + EN+
Sbjct: 465 LAMYPVLASAELFAIYQQLQAVHLQTLNKERLEIIINIWVKDRRIPSFEEVSKIENV 521
>gi|302756337|ref|XP_002961592.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
gi|300170251|gb|EFJ36852.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
Length = 399
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 105/308 (34%), Positives = 174/308 (56%), Gaps = 18/308 (5%)
Query: 137 PDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWIL 195
P G+P SV Y+ Y +RAL+HF + VF+TQ+L+ + G+ R TP A I+W+L
Sbjct: 49 PVGYPQSVGEGYLTYSQFRALQHFSSAILSVFSTQSLVFAAGL---RPTPAQATVISWVL 105
Query: 196 KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANV 255
KDG +GK++ +R G + D + K+ R D+L + G +E+ + A P FL +A AN+
Sbjct: 106 KDGMQHIGKLVGSRMGARMDSEPKRWRIFADVLYDAGTALEIISPACPQYFLGVAGLANL 165
Query: 256 AKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI-MISKRNPSL- 313
AK VA VT+ +TR P+Y AFAK N+ D+ AKGE + +A++ G G+ I + S SL
Sbjct: 166 AKGVAMVTARATRLPLYTAFAKEGNLSDLYAKGEAMSTLANVAGLGVGIRLASSACTSLQ 225
Query: 314 --ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
+ LLS +++S +E+++V L TLN R ++ V +L+ G VPS E +E +
Sbjct: 226 GKLLVAPLLSVIHLYSVMEEMRAVPLDTLNAQRTAMLVADYLELGRVPSPAEIRYRERLI 285
Query: 372 SFPWLRDKPVVLGSRFK---DAFQDPDAYL-AVEPFFEKERYVVTYNPSKGKVYALLKDQ 427
L KP V + D Q P + ++ F+ ER+++ ++ + ++ +L
Sbjct: 286 ----LPVKPNVHAGSVRIANDILQIPSGKIRELKHRFQSERFLLDFHETSTEM--VLHHS 339
Query: 428 AKSDDILK 435
A +D+++
Sbjct: 340 ASGEDVVR 347
>gi|414879009|tpg|DAA56140.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
Length = 593
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 102/332 (30%), Positives = 175/332 (52%), Gaps = 26/332 (7%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++PDG+P SV+ Y+ Y WR ++ GV +TQ LL +VG+ + A P A A+NW+
Sbjct: 197 LLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 255
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K++ ++ G+ FD + K R D L + G+E+ T PHLF+P+ AA
Sbjct: 256 LKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPHLFVPIGAAAG 315
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI----MISKRN 310
++ A++ ++TR+ Y FA N +V AKGE G ++ +G GL I I
Sbjct: 316 AGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGIALANHIGSSV 375
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
P + +FA ++ +++ + + +S+ L TLN R S+ +L +G VPS++E N +E +
Sbjct: 376 PLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSVKEVNDEEPL 435
Query: 371 F---SFPWLRDKPVV----------------LGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
F S R ++ +GS+ + + + A+ ++ E+Y++
Sbjct: 436 FFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACALFDLYKNEQYLL 495
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
KGK +LK+ + +D+LK+ FH L
Sbjct: 496 M--DYKGKFCIVLKEGSSPEDMLKSLFHVNYL 525
>gi|356540900|ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
Length = 592
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 171/337 (50%), Gaps = 31/337 (9%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++P+GFP+SVT Y+ Y WRA++ GV TQ+LL +VG+ + A P A AINW+
Sbjct: 185 MLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG-AIPTAAAINWV 243
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K++ + G+ FD + K R DLL G+E++T A P F+ + A
Sbjct: 244 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMSTPACPQFFVLIGAVAG 303
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN----IADLLGTGLSIMISKRN 310
+++ A++ STR+ + FA N +V AKGE G I +LG GL I
Sbjct: 304 ASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIVLGIGLGNCIGSST 363
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
P ++ +F +L+ +++ + + +S+ L TLN R S+ +L +G P ++E N +E +
Sbjct: 364 PLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 423
Query: 371 F-SFPWLR-----------------------DKPVVLGSRFKDAFQDPDAYLAVEPFFEK 406
F + P L + + LGS+ + + LA+ ++
Sbjct: 424 FPAVPILNATFASKAQSFALSSEAKDAAAEIEHRLQLGSKLSEIVNSKEDVLALFGLYKN 483
Query: 407 ERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
E Y+++ + GK +LK++ D+LKA F L
Sbjct: 484 EGYILSEH--MGKYSVVLKEKCSQLDMLKALFQVNYL 518
>gi|242055389|ref|XP_002456840.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
gi|241928815|gb|EES01960.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
Length = 569
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 176/333 (52%), Gaps = 27/333 (8%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++PDG+P SV+ Y+ Y WR ++ GV +TQ LL +VG+ R A P A A+NW+
Sbjct: 172 LLPDGYPHSVSSDYMNYSLWRGVQGVASQISGVLSTQALLYAVGLGRG-AIPTAAAVNWV 230
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K++ ++ G+ FD + K R D L + G+E+ T PHLF+P+ AA
Sbjct: 231 LKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPHLFVPIGAAAG 290
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI----MISKRN 310
+++ A++ +TR+ Y FA N +V AKGE G ++ +G GL I I
Sbjct: 291 ASRSAASLIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGIGLGIALANQIGSSV 350
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
P + +FA ++ +++ + + +S+ L TLN R S+ +L +G VPS++E N +E +
Sbjct: 351 PLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSVKEVNDEEPL 410
Query: 371 FSFPWL--------------RD------KPVVLGSRFKDAFQDPDAYLAVEPFFEKERYV 410
F L +D + + +GS+ + + + A+ ++ E+Y+
Sbjct: 411 FFNLSLGASCQQSKILSAEAKDAADRICRRLQMGSKLSEIIESKEDACALFDLYKNEQYL 470
Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ KGK +LK+ + +D+LK+ FH L
Sbjct: 471 LM--DYKGKFCIVLKEGSSPEDMLKSLFHVNYL 501
>gi|222628554|gb|EEE60686.1| hypothetical protein OsJ_14164 [Oryza sativa Japonica Group]
Length = 609
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 106/349 (30%), Positives = 179/349 (51%), Gaps = 43/349 (12%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++PDG+PDSV+ Y+ Y WR ++ GV +TQ LL +VG+ + A P A A+NW+
Sbjct: 196 LLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 254
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K++ ++ G+ FD + K R DLL G+E+ T PHLF+P+ AA
Sbjct: 255 LKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPHLFVPIGAAAG 314
Query: 255 ----------------VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
+ +N + +TR+ Y FA N +V AKGE G ++ L
Sbjct: 315 AGRSAAALIQTIFTHFMLENANHMRKAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFL 374
Query: 299 GTGLSIMISKRNPSLIT----TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
G L I ++ R S ++ +FA ++ +++ + + +S+ L TLN R S+ +L
Sbjct: 375 GIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLL 434
Query: 355 TGHVPSLREGNLQENIF--------------SFPWLRD------KPVVLGSRFKDAFQDP 394
+G VPS++E N +E +F P +D + + LGS+ + ++
Sbjct: 435 SGQVPSVKEVNDEEPLFLNLSIGTSRKESKILSPQAKDAAEIICRRLQLGSKLSEIIENK 494
Query: 395 DAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ A+ ++ E+Y++T KGK +LK+ + +D+LK+ FH L
Sbjct: 495 EDACALFDLYKNEQYLLT--EYKGKFCVILKEGSSPEDMLKSLFHVNYL 541
>gi|449016577|dbj|BAM79979.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 719
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 107/339 (31%), Positives = 176/339 (51%), Gaps = 19/339 (5%)
Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
S ++ E L +++ +P G+P +VT Y+ + WR L++ MGVF+T+ LL +G
Sbjct: 326 SERSLVERLRNWLIQVYLPHGYPHTVTQDYLAFTLWRLLQNTAASVMGVFSTEALLLGLG 385
Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVEL 237
+ +N + A A+ WILKDG G GK+ +A GK+FD D + R DLL +L +E+
Sbjct: 386 LGKN-VSGTAQAVQWILKDGFGHAGKIAYAAVAGKQFDMDPRSWRIMSDLLEDLAGVLEI 444
Query: 238 ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL 297
T FL LA A + +AA+T T+TR I+K+ A EN GD+ KGE G L
Sbjct: 445 ITPIFSGQFLVLASIATTMRAIAAMTGTATRHAIFKSMALAENQGDLATKGESQGVTTKL 504
Query: 298 LGTGLSIMISKR----NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
+G GL I +SKR +L+ + + + ++ ++++ ++ V TLNR R ++ +++F+
Sbjct: 505 IGLGLGITLSKRIGQDYTNLLIAYGIAAVIHLAANFKAMQCVEFSTLNRQRMALLLDAFM 564
Query: 354 -----------KTGHVPSLREGNLQENIFSFPWLRDKP-VVLGSRFKDAFQDPDAYLAVE 401
+ +P+ RE +E PW +P +V G+ + + +L
Sbjct: 565 SARAQTSRQASRQLTLPTPREICAKERFMIPPWRDYRPEIVFGASLQRV-KHARHFLHCA 623
Query: 402 PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
+ ERY++ PS V L+D A D LKA +A
Sbjct: 624 ALYRNERYIIYAEPSTKHVLVFLRDGATVKDQLKAFMNA 662
>gi|449502048|ref|XP_004161529.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 428
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 106/340 (31%), Positives = 183/340 (53%), Gaps = 12/340 (3%)
Query: 103 RSGKFKRNSIRAISLQSPQAPAEELM--SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHF 160
R +F+ + ++ + PA + + SF+ ++ P G+P SV+ Y+ Y +RAL+H
Sbjct: 33 RRCQFQPDGHLSVKIIDDSRPAIQRIVDSFLNTFF-PSGYPYSVSEGYLRYTQFRALQHV 91
Query: 161 FGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
A+ V +TQ+LL + G+ A A ++W+LKDG VGK+I + G + D + K+
Sbjct: 92 TSAALSVLSTQSLLFAAGLRPTAAQ--ATVVSWVLKDGMQHVGKLICSNLGARMDSEPKR 149
Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
R D+L +LGAG+E+ + PHLFL +A N AK +A V + +TR PIY +FAK N
Sbjct: 150 WRVIADVLYDLGAGLEVISPLCPHLFLEMAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 209
Query: 281 IGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI----TTFALLSCGYVFSSYQEVKSVV 336
+ D+ AKGE + + +++G G + ++ S I LLS +V+ +++++
Sbjct: 210 LSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLSIVHVYCVVEQMRATP 269
Query: 337 LHTLNRARFSVAVESFLKTGHVPSLREGNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPD 395
++TLN R ++ V F+K G +PS + QE+ +F + + V R P
Sbjct: 270 INTLNPQRTAMIVADFVKAGRIPSPADIRYQEDLVFPGRLIEEAGNVKVGRALHEVIKPS 329
Query: 396 AYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILK 435
+ ++ F E++V+ N SK V +L+ A +D L+
Sbjct: 330 KLVEMKQIFPGEKFVL--NQSKKWVDMVLEHDASGEDALR 367
>gi|449462449|ref|XP_004148953.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 428
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 106/340 (31%), Positives = 183/340 (53%), Gaps = 12/340 (3%)
Query: 103 RSGKFKRNSIRAISLQSPQAPAEELM--SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHF 160
R +F+ + ++ + PA + + SF+ ++ P G+P SV+ Y+ Y +RAL+H
Sbjct: 33 RRCQFQPDGHLSVKIIDDSRPAIQRIVDSFLNTFF-PSGYPYSVSEGYLRYTQFRALQHV 91
Query: 161 FGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
A+ V +TQ+LL + G+ A A ++W+LKDG VGK+I + G + D + K+
Sbjct: 92 TSAALSVLSTQSLLFAAGLRPTAAQ--ATVVSWVLKDGMQHVGKLICSNLGARMDSEPKR 149
Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
R D+L +LGAG+E+ + PHLFL +A N AK +A V + +TR PIY +FAK N
Sbjct: 150 WRVIADVLYDLGAGLEVISPLCPHLFLEMAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 209
Query: 281 IGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI----TTFALLSCGYVFSSYQEVKSVV 336
+ D+ AKGE + + +++G G + ++ S I LLS +V+ +++++
Sbjct: 210 LSDLFAKGEAISTLFNVVGIGAGLQLASTICSSIQGKLVAAPLLSIVHVYCVVEQMRATP 269
Query: 337 LHTLNRARFSVAVESFLKTGHVPSLREGNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPD 395
++TLN R ++ V F+K G +PS + QE+ +F + + V R P
Sbjct: 270 INTLNPQRTAMIVADFVKAGRIPSPADIRYQEDLVFPGRLIEEAGNVKVGRALHEVIKPS 329
Query: 396 AYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILK 435
+ ++ F E++V+ N SK V +L+ A +D L+
Sbjct: 330 KLVEMKQIFPGEKFVL--NQSKKWVDMVLEHDASGEDALR 367
>gi|357484331|ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
gi|355513788|gb|AES95411.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
Length = 630
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 177/360 (49%), Gaps = 35/360 (9%)
Query: 116 SLQSPQAPA---EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQT 172
SL S Q P + F+ ++P+GFP+SVT Y+ Y WR ++ GV TQ
Sbjct: 157 SLNSSQVPTFLYNKCREFVVRLMLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQA 216
Query: 173 LLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELG 232
LL +VG+ + A P A AINW+LKDG G + K++ + G+ FD + K R DLL
Sbjct: 217 LLYAVGLGKG-AIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAA 275
Query: 233 AGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVG 292
G+E+ T A PHLF+P+ A +++ A++ STR+ + FA N +V AKGE G
Sbjct: 276 FGLEMCTPAFPHLFVPIGAFAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQG 335
Query: 293 NIADL----LGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
++ +G GL I P ++ +F +++ +++ + + +S+ L TLN R S+
Sbjct: 336 MVSRFIGIGIGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQSIQLRTLNPHRASLV 395
Query: 349 VESFLKTGHVPSLREGNLQENIFSFPWLRDKPVV-------------------------L 383
+L +G P ++E N +E +F + + P L
Sbjct: 396 FSEYLLSGQAPPVKEVNAEEPLFPAVPILNAPFANKETQSIVLSSEAKDAAVEIESRLQL 455
Query: 384 GSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
GS+ + + + LA+ ++ E Y+++ + GK +LK+ D+LKA F L
Sbjct: 456 GSKLSEIINNKEEVLALFSLYKNEGYILSEH--TGKFCVVLKETCSQLDMLKALFQVNYL 513
>gi|357167308|ref|XP_003581100.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 604
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 176/333 (52%), Gaps = 27/333 (8%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++PDG+P+SV+ Y+ Y WR L+ GV +TQ LL +VG+ + P A AINW+
Sbjct: 206 LLPDGYPNSVSSDYLHYSLWRGLQGVASQISGVLSTQALLYAVGLGKG-VIPTAAAINWV 264
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K++ ++ G+ FD + K+ R DLL G+E+ T PHLF+P+ AA
Sbjct: 265 LKDGLGYLSKILLSKFGRHFDVNPKRWRLFADLLENTAYGLEILTPVFPHLFVPIGAAAG 324
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR----N 310
A++ AA+ +TR+ Y FA N +V AKGE G ++ LG L I ++ +
Sbjct: 325 AARSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGIMLGIALANQIGSSV 384
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
P + +FA ++ +++ + + +S+ L TLN R S+ +L +G V S++E N +E +
Sbjct: 385 PLALISFAGVTVVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVSSVKEVNDEEPL 444
Query: 371 F----SFPWLRDKPVV----------------LGSRFKDAFQDPDAYLAVEPFFEKERYV 410
F L+ ++ LGS+ + + + A+ ++ E+Y+
Sbjct: 445 FLNSSMGAKLKQNKILSAEAKDAADRICRRLQLGSKLSEIIDNEEDACALFDLYKDEQYL 504
Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ KGK +LK+ + +D+LK+ FH L
Sbjct: 505 LLEY--KGKFCVVLKEGSSPEDMLKSVFHVNYL 535
>gi|168027023|ref|XP_001766030.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682673|gb|EDQ69089.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 397
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 178/328 (54%), Gaps = 14/328 (4%)
Query: 117 LQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNS 176
L+ + P + + S P G+P+SV Y+ Y +RA++HF M V +TQ+LL +
Sbjct: 12 LKETRPPVQIAVDSFLSRFFPLGYPNSVAEGYLDYSKYRAVQHFASAVMHVLSTQSLLFA 71
Query: 177 VGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGV 235
G+ R TP A ++W+LKDG VGK++ +R G + D + K+ R D + ++GAG+
Sbjct: 72 AGL---RPTPAQATIVSWVLKDGMQHVGKLLCSRMGARMDSEPKRWRIFADFMYDVGAGL 128
Query: 236 ELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 295
E+ + P FLP+A AN+AK +A V++ +TR P+Y +F K N+ D+ AKGE + ++
Sbjct: 129 EVISPLCPQHFLPVAGMANLAKGMALVSARATRLPVYSSFVKEGNLSDLYAKGEAISTLS 188
Query: 296 DLLGTGLSIMISKRNPSLI----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
++LG G+ I ++ S I +LS +++ QE+++ ++TLN R ++ V
Sbjct: 189 NVLGLGVGIHLASNFASTIQGKLIWAPILSAIHLYCVQQEMRAAPINTLNSQRTAMLVAD 248
Query: 352 FLKTGHVPSLREGNLQENIFSFPWLRDKP---VVLGSRFKDAFQDPDAYLAVEPFFEKER 408
F+KTG V S E +E + P +K V +G+ + P ++ F+ E+
Sbjct: 249 FVKTGSVSSPAELRYRERLI-LPVGLEKDAGNVRVGAPLQSGIGKPSVMAELKEKFQGEQ 307
Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKA 436
+++ + + +L +A +D ++
Sbjct: 308 FLLHFGEQATDM--VLHQKASGEDAVRG 333
>gi|224082810|ref|XP_002306848.1| predicted protein [Populus trichocarpa]
gi|222856297|gb|EEE93844.1| predicted protein [Populus trichocarpa]
Length = 428
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 115/347 (33%), Positives = 183/347 (52%), Gaps = 17/347 (4%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+ R TP
Sbjct: 61 SFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL---RPTPAQ 116
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++WILKDG GK+I + G + D + K+ R D+L +LG G+E+ + PHLFL
Sbjct: 117 ATAVSWILKDGMQHAGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFL 176
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI- 306
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + ++LG G+ I +
Sbjct: 177 EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGVGIQLA 236
Query: 307 SKRNPSLITTFA---LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
S S+ F LLS +V +E+++ ++TLN R ++ V F+KTG + S +
Sbjct: 237 STVCSSMQGKFVAGPLLSIVHVCCVIEEMRATPVNTLNPQRTAMVVADFVKTGKISSPAD 296
Query: 364 GNLQENIFSFPWLRDK---PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKV 420
E++ FP + V +G A + P ++ F E+++++ P
Sbjct: 297 LRYHEDLL-FPGRLIENAGNVKVGQALHRAVR-PSKLRELKEIFPGEKFILS--PGNKWT 352
Query: 421 YALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
+L+ A +D L+A A SS+ S+ L Q Y+ N
Sbjct: 353 DLVLEQNASGEDALRAWLVAAYASSMKKSSHESTSVTL-QDAYEKMN 398
>gi|296087683|emb|CBI34939.3| unnamed protein product [Vitis vinifera]
Length = 559
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 100/329 (30%), Positives = 170/329 (51%), Gaps = 23/329 (6%)
Query: 133 SYVVPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
S V D F P V+ +Y+ Y+ W+ L F A+ V TQ + ++G+ +R+ P A A+
Sbjct: 155 SSAVRDFFIPQKVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPSAAAL 214
Query: 192 NWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA 250
NW+LKDG GR+ + I+ A FD +LK++RF+ +L L GVEL T A P+ FL LA
Sbjct: 215 NWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELLTPAFPNYFLLLA 274
Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMI 306
AN+AK V +T + ++++FA +N+G+V+AK + C N+ +L L+I+
Sbjct: 275 TIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCFDNLGLMLAAVLNIL- 333
Query: 307 SKRNPSLI-----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
K N L+ + + S +F YQ +K V L TL + R + + +++ G+VPS
Sbjct: 334 CKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLEIILNMWIQLGYVPSP 393
Query: 362 REGNLQENIFSFPWLRDK---PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYN---- 414
E + +E I SFPW + + P+ +G + L ++ ++ Y +
Sbjct: 394 AEVSKEEGI-SFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSLSSEDFYFICMEACRR 452
Query: 415 ---PSKGKVYALLKDQAKSDDILKAAFHA 440
P + + L++ A + D++ A
Sbjct: 453 LTRPEQQGILLCLREGASTTDVISGLLQA 481
>gi|225452488|ref|XP_002278851.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
Length = 514
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 100/329 (30%), Positives = 170/329 (51%), Gaps = 23/329 (6%)
Query: 133 SYVVPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
S V D F P V+ +Y+ Y+ W+ L F A+ V TQ + ++G+ +R+ P A A+
Sbjct: 110 SSAVRDFFIPQKVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPSAAAL 169
Query: 192 NWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA 250
NW+LKDG GR+ + I+ A FD +LK++RF+ +L L GVEL T A P+ FL LA
Sbjct: 170 NWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELLTPAFPNYFLLLA 229
Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMI 306
AN+AK V +T + ++++FA +N+G+V+AK + C N+ +L L+I+
Sbjct: 230 TIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCFDNLGLMLAAVLNIL- 288
Query: 307 SKRNPSLI-----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
K N L+ + + S +F YQ +K V L TL + R + + +++ G+VPS
Sbjct: 289 CKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLEIILNMWIQLGYVPSP 348
Query: 362 REGNLQENIFSFPWLRDK---PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYN---- 414
E + +E I SFPW + + P+ +G + L ++ ++ Y +
Sbjct: 349 AEVSKEEGI-SFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSLSSEDFYFICMEACRR 407
Query: 415 ---PSKGKVYALLKDQAKSDDILKAAFHA 440
P + + L++ A + D++ A
Sbjct: 408 LTRPEQQGILLCLREGASTTDVISGLLQA 436
>gi|255567274|ref|XP_002524618.1| conserved hypothetical protein [Ricinus communis]
gi|223536171|gb|EEF37826.1| conserved hypothetical protein [Ricinus communis]
Length = 436
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 103/309 (33%), Positives = 171/309 (55%), Gaps = 17/309 (5%)
Query: 137 PDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWIL 195
P G+P SV Y+ Y +RAL+H A+ V +TQ+LL + G+ R TP A A++W+L
Sbjct: 76 PSGYPYSVNEGYLRYTQFRALQHLSSAALSVLSTQSLLFAAGL---RPTPAQATAVSWVL 132
Query: 196 KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANV 255
KDG VGK+I + G + D + K+ R D+L +LG G+E+ + PHLFL +A N
Sbjct: 133 KDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFLEVAGLGNF 192
Query: 256 AKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI- 314
AK +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G+ I ++ S I
Sbjct: 193 AKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGLGVGIQLASTVCSSIQ 252
Query: 315 ---TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN-I 370
LLS +VFS +E+++ ++TLN R ++ V ++K G + S + +E+ I
Sbjct: 253 GKLVVGPLLSILHVFSVVEEMRAAPVNTLNPQRTAMIVADYVKAGKISSPADLRYREDLI 312
Query: 371 FSFPWLRDK-PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA--LLKDQ 427
F + D V +G AF P ++ F +E++++ ++G + +L+
Sbjct: 313 FPGRLIEDAGNVKVGRALHKAFT-PSKVHELKDIFPEEKFLL----NRGNTWTDMVLEQN 367
Query: 428 AKSDDILKA 436
A +D L+
Sbjct: 368 ASGEDALRG 376
>gi|449018781|dbj|BAM82183.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 631
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 85/234 (36%), Positives = 138/234 (58%), Gaps = 7/234 (2%)
Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
+L+ +RS +P+GFP+SV PSY+ Y WR ++ V +T LL +VG+S +T
Sbjct: 233 QLVLQLRSVFLPEGFPESVGPSYLRYSLWRGAQNVVSAMTAVLSTHALLAAVGLS---ST 289
Query: 186 PGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
P A A +W+LKDG G++GK++ AR+G+ FD D K+ R + DLL + G +E+ T P
Sbjct: 290 PVAAATSWVLKDGIGQLGKLLTARRGRDFDADPKRYRLSSDLLYDAGLSLEILTPLFPQY 349
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
FL LA N K+VA + R + +F N+GD++AK + + +L G GL I+
Sbjct: 350 FLALAALGNFTKSVAITVGMACRNSVLSSFVLRGNLGDISAKNDAQNVVTNLTGMGLGIV 409
Query: 306 ISKR---NPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
++ PS+ + FA L+ Y +YQ +K+V L+ +NR R ++ +SF+++
Sbjct: 410 TARALPPKPSVRLGAFAALTALYSVLNYQSMKAVKLNVVNRQRGAIIADSFVRS 463
>gi|374921923|gb|AFA26139.1| hypothetical protein, partial [Lolium perenne]
Length = 109
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 72/110 (65%), Positives = 95/110 (86%), Gaps = 1/110 (0%)
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
G+SI+ISKRNPSL+ +FA+LSCGY+ SSYQEV+SVVL+TLN ARF+VAV+SF+KTGHVPS
Sbjct: 1 GMSILISKRNPSLVASFAVLSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFVKTGHVPS 60
Query: 361 LREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYV 410
L+EGN +E +F+ PW R +PV +GSRF +AFQ+P +++A P FE ERY+
Sbjct: 61 LKEGNSEETVFNPPW-RHQPVAIGSRFGEAFQEPASFVATRPLFEDERYL 109
>gi|388508448|gb|AFK42290.1| unknown [Lotus japonicus]
Length = 414
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 107/329 (32%), Positives = 170/329 (51%), Gaps = 12/329 (3%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +R+++H A+ V +TQ+LL + G+ R TP
Sbjct: 47 SFMNKFF-PSGYPYSVNEGYLRYTQFRSVQHVTSAALSVLSTQSLLFAAGL---RPTPAQ 102
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++WILKDG VGK+I + G + D + K+ R D+L +LG G+E+ + PHLFL
Sbjct: 103 ATAVSWILKDGMQHVGKLICSNWGARMDSEPKRWRLIADVLYDLGTGLEVLSPLCPHLFL 162
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM-- 305
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + +++G G+ I
Sbjct: 163 QMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLCNVIGIGVGIQLA 222
Query: 306 --ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
I + LLS +++S +E+++ ++TLN R ++ V FLK G V S +
Sbjct: 223 STICASVQGKMVVGPLLSILHIYSVSEEMRATPINTLNPRRTAMIVTDFLKAGVVSSPAD 282
Query: 364 GNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
+E+ +F D V R P L +E E++++ N V
Sbjct: 283 LRYREDLLFRVRLTEDAGNVRVGRALHEVIKPSRLLELEQVLPGEKFLL--NRGGKCVDM 340
Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSN 451
+L+ A +D L+ A NSS+
Sbjct: 341 VLEQDASGEDALRGWLVAAYAAQIENSSH 369
>gi|160333925|ref|NP_001103923.1| uncharacterized protein LOC555936 [Danio rerio]
gi|126631351|gb|AAI34047.1| Zgc:162613 protein [Danio rerio]
Length = 432
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 162/323 (50%), Gaps = 11/323 (3%)
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
A + +S +P G+P+SV+ Y+ Y W ++ F G TQ L VGV
Sbjct: 41 ANSVSGVFKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQE 100
Query: 184 ATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
AT A I W+L+DG G +G+++FA +G K D + K+ R D+L ++ +E+A
Sbjct: 101 ATVAAATITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHF 160
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLG 299
P F + C A + K++ V +TR + A+ N+ D++AK E + N+A LL
Sbjct: 161 PPFFTIIVCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLV 220
Query: 300 TGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
+ I + NP L F L + ++F++Y+ V+SVV+ TLN AR S+ + +L G V
Sbjct: 221 SLALIPLVTDNPLLTFILFYLFTVLHLFANYRAVRSVVMETLNEARLSIVLHQYLLHGQV 280
Query: 359 PSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDP-DAYLAVEPFFEKERYVVTYNPSK 417
S E N +E +F + R P+ LG R D +P D LA++ + Y++
Sbjct: 281 LSPAEANQREPVF-LSFRRTVPIKLGVRLGDLIHEPGDLQLALKN--NSKPYLIGIK--N 335
Query: 418 GKVYALLKDQAKSDDILKAAFHA 440
G V L A DD +KA A
Sbjct: 336 GAVCVCLGSDASVDDEIKAVCQA 358
>gi|356572258|ref|XP_003554286.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 419
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 106/345 (30%), Positives = 181/345 (52%), Gaps = 12/345 (3%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +RAL+H A+ V +TQ+LL + G+ R TP
Sbjct: 51 SFMNKFF-PSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGL---RPTPAQ 106
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++WILKDG VGK+I + G + D + K+ R D+L ++G G+E+ + PHLFL
Sbjct: 107 ATAVSWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADVLYDIGIGLEVLSPLCPHLFL 166
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + +++G G+ I ++
Sbjct: 167 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVVGIGVGIQLA 226
Query: 308 KRNPSLI----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
+ I LLS +++S +E+++ ++TLN R ++ V FLK G V S +
Sbjct: 227 STICASIQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPAD 286
Query: 364 GNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
++N +F+ D V + P L ++ F +E++++ + +
Sbjct: 287 LRYRDNLLFNVQVKEDTGNVRVGKNVHKVIKPSRLLELKQVFPEEKFLLNFG--NKCIDM 344
Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
+L+ A +D L+ A +SS+ + ++ Y+ N
Sbjct: 345 VLEQDASGEDALRGWLVAAYAARTESSSHELSATSVLHEAYEKMN 389
>gi|414586417|tpg|DAA36988.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 439
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 114/380 (30%), Positives = 185/380 (48%), Gaps = 18/380 (4%)
Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
D +G R SI +Q + ++L+ + P G+P SV Y+ Y +R L+HF
Sbjct: 40 DAAGSGSR-SISVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFS 98
Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
+ V +TQ+LL + G+ R TP A A++WILKDG GK+I + G + D + K
Sbjct: 99 SAMLHVLSTQSLLFAAGL---RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKS 155
Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
R D+L +LG +E+ + P LFL +A N AK +A V + +TR PIY +FAK N
Sbjct: 156 WRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 215
Query: 281 IGDVTAKGECVGNIADLLGTGLSI-----MISKRNPSLITTFALLSCGYVFSSYQEVKSV 335
+ D+ AKGE + + +++G G I + S LI LLS +++ QE+++
Sbjct: 216 LSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG-PLLSAVHIYGVIQEMRAT 274
Query: 336 VLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP---WLRDKPVVLGSRFKDAFQ 392
++TLN R ++ V F+K+G V S E +E++ FP V +G + F
Sbjct: 275 PVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLL-FPNRVIEEAGSVKIGQPLRKVF- 332
Query: 393 DPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNG 452
P + F E+++++ + Y +L+ A +D L+ A S G
Sbjct: 333 SPRLVEQLRATFPGEKFLLSQK--SNRTYMVLERSASGEDALRGWLVAAFASEMERSGAG 390
Query: 453 HRSLALKQHEYDNSNLMLTT 472
R A+ Y+ + T
Sbjct: 391 SRDDAVLSEAYEKMERVFPT 410
>gi|297826561|ref|XP_002881163.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
lyrata]
gi|297327002|gb|EFH57422.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 88/238 (36%), Positives = 140/238 (58%), Gaps = 9/238 (3%)
Query: 123 PAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
P + + SF+ + P G+P SV Y+ Y +RAL+HF A+ V +TQ+LL + G+
Sbjct: 59 PQKMVESFLNKFF-PSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL--- 114
Query: 183 RATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAA 241
R TP A ++WILKDG VGK+I + G + D + K+ R D+L +LG G+EL +
Sbjct: 115 RPTPAQATVVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPL 174
Query: 242 VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTG 301
PHLFL +A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + ++ G G
Sbjct: 175 CPHLFLEMAGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIG 234
Query: 302 LSIMISKRNPS----LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
I ++ S + ++LS +V+S ++++ V ++TLN R ++ V ++LK
Sbjct: 235 AGIQLASTICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANYLKV 292
>gi|340711243|ref|XP_003394188.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus terrestris]
Length = 464
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 98/332 (29%), Positives = 163/332 (49%), Gaps = 24/332 (7%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
+S ++ +P G+PDSV P Y Y W ++ F MG TT +++ VGV ATP
Sbjct: 45 FISIVKEVFLPQGYPDSVHPDYTSYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEASATP 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
A AI WILKDG G +G+++FA G D K+ R D+L +L G+EL
Sbjct: 105 LAAAITWILKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDLAMGLELFLPYFSSY 164
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTG 301
L + C + K++ V +TR + + A N+ DV+AK CV IA +G
Sbjct: 165 SLGILCISTAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGI- 223
Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
L + I ++ + L +++++Y VK++ L++LN R ++ V+S++ +P
Sbjct: 224 LILSIFHNGRYIMELYLFLVMIHLYANYMAVKALCLNSLNEDRLALIVKSYISNEVIPKP 283
Query: 362 REGNLQENIFSFPWLRDKPV--VLGSRFK-----------DAFQDPDAYLAVEPFFEKER 408
E N +E++ L KP + G K D D LA++ F ++ +
Sbjct: 284 EEVNKKESVL----LLTKPTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDR-K 338
Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
Y+++ + ++ LK A+ D+L+A FHA
Sbjct: 339 YLISIDVQNKTIFICLKKDAQPYDVLEAYFHA 370
>gi|242076506|ref|XP_002448189.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
gi|241939372|gb|EES12517.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
Length = 439
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 110/353 (31%), Positives = 177/353 (50%), Gaps = 19/353 (5%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +RAL+HF + V +TQ+LL + G+ R TP
Sbjct: 68 SFLNKFF-PSGYPYSVNEGYLTYTRFRALQHFSSAMLHVLSTQSLLFAAGL---RPTPAQ 123
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++WILKDG GK+I + G + D + K R D+L +LG +E+ + P LFL
Sbjct: 124 ATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFL 183
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI--- 304
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I
Sbjct: 184 EVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 243
Query: 305 --MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ S LI LLS +++ QE+++ ++TLN R ++ V F+K+G V S
Sbjct: 244 STVCSTTQGKLIAG-PLLSAVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPA 302
Query: 363 EGNLQENIFSFP---WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
E +E++ FP V +G + P + F E++++T K
Sbjct: 303 ELRYREDLL-FPNRVIEEAGSVKIGQPLRRVL-SPSLVEQLRATFPDEKFLLTQK--SNK 358
Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTT 472
Y +L+ A +D L+ A S R LK+ Y+ + ++ +
Sbjct: 359 TYMVLEQSASGEDALRGWLVAAFASEMERSGIVSRDTVLKE-AYEKTKMVFPS 410
>gi|356505015|ref|XP_003521288.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 415
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 101/314 (32%), Positives = 167/314 (53%), Gaps = 12/314 (3%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +RA++H A+ V +TQ+LL + G+ R TP
Sbjct: 47 SFMNKFF-PSGYPYSVNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGL---RPTPAQ 102
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++WILKDG VGK+I + G + D + K+ R D L ++G G+E+ + PHLFL
Sbjct: 103 ATAVSWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFL 162
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM-- 305
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + +++G G+ I
Sbjct: 163 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLA 222
Query: 306 --ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
I + LLS +++S +E+++ ++TLN R ++ V FLK G V S +
Sbjct: 223 STICASMQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPAD 282
Query: 364 GNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
+EN +F+ D V + P L ++ F +E++++ + +
Sbjct: 283 LRYRENLLFNVHVKEDAGNVRVGKDVHKVIKPSRLLELKQVFPEEKFLLNFG--NKCIDM 340
Query: 423 LLKDQAKSDDILKA 436
+L+ A +D L+
Sbjct: 341 VLEQDASGEDALRG 354
>gi|301622642|ref|XP_002940632.1| PREDICTED: UPF0420 protein C16orf58 [Xenopus (Silurana) tropicalis]
Length = 473
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 111/345 (32%), Positives = 177/345 (51%), Gaps = 27/345 (7%)
Query: 116 SLQSPQAPAEE----LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQ 171
SL + PAE L F S ++P G+PDSV+ Y+ Y W ++ F G TQ
Sbjct: 50 SLVCERHPAEHGYRSLRDFFVSLLLPHGYPDSVSKDYLAYQLWDTIQAFSSSVTGSLATQ 109
Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLME 230
+LL GV + AT A I WIL+DG G VG+++FA +G + D D K+ R DLL +
Sbjct: 110 SLLRGSGVGDSTATVAAATITWILRDGTGMVGRILFAWMKGSRLDCDAKRWRLFADLLND 169
Query: 231 LGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG-- 288
+ +E+ P F CAA V K + V +TR I AK +N+ DV AK
Sbjct: 170 IAIFMEIMAPVFPSCFTLTVCAAGVFKCIVGVAGGATRAAITVHQAKRDNMADVCAKDGS 229
Query: 289 -ECVGNIADLLGTGLSI-MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFS 346
E + N+A LL + + ++S + F LL+ +++++Y+ V+SV++ TLN++R S
Sbjct: 230 QETLVNLAGLLVSLFLVPLVSDSMMATYLLFFLLTILHLYANYRAVRSVIMETLNQSRLS 289
Query: 347 VAVESFLKTGHVPSLREGNLQENIFSFPWLRDK-PVVLGSRFKDAFQDPDAYLAVE-PFF 404
+ + +LK G + NL+E + P L + P+ LG P +YLA P
Sbjct: 290 IVLHHYLKEGKILDPEAANLREPL--LPGLGSQAPISLGV--------PLSYLASSVPEL 339
Query: 405 EKER------YVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
E+ + Y++ + G + +L ++A S D+++A HA +L
Sbjct: 340 EQLKKDNHSLYLLGWREDTGSLAVVLHERASSVDMIRACVHAEML 384
>gi|307205774|gb|EFN84004.1| UPF0420 protein C16orf58 [Harpegnathos saltator]
Length = 466
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/395 (27%), Positives = 194/395 (49%), Gaps = 26/395 (6%)
Query: 107 FKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMG 166
F N +++ S +S Q +SF++ +P G+PDSV Y+ Y W ++ F MG
Sbjct: 24 FSVNEVKSESTES-QDIYFRFVSFVKEVFLPQGYPDSVRADYLAYQIWDTVQAFASTIMG 82
Query: 167 VFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAG 225
TT +++ +GV + ATP A AI WILK G G +G ++FA G + D K+ R
Sbjct: 83 TLTTHSVMQGIGVGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTQLDGQCKKWRLFA 142
Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
D+L ++ G+EL + +P+ C + + K+V V +TR + + A N+ DV+
Sbjct: 143 DILNDIAMGLELMVPYLSPYSVPILCTSTIMKSVVGVAGGATRAALTQHQAIRNNLADVS 202
Query: 286 AKGE----CVGNIADLLGTG-LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
AK CV IA +G LS+ R LI + L +++++Y VK++ L TL
Sbjct: 203 AKDSSQETCVNLIASFVGIFVLSLFHDGR--YLIELYLFLVAVHLYANYSAVKALCLDTL 260
Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKP--------VVLGSRFKDAFQ 392
N R ++ +++++ + ++ N +E++F L P + +G+ F +
Sbjct: 261 NEDRLALIIKNYMMNERISEPQKINKEESVF----LLGNPSKEICGFNIKIGASFASLLK 316
Query: 393 -DPDAYLAVE---PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL-LHCI 447
+ + + +E FFE RY++T + ++ +LK K ++L+ F+A++ L+
Sbjct: 317 RNVISSMEMEFLLKFFEDRRYLITIDIETRTIFIILKKDIKPIEVLEGYFYAFMCGLYIC 376
Query: 448 NSSNGHRSLALKQHEYDNSNLMLTTADLEAHIGES 482
+ +L L+ ++ S +L T L S
Sbjct: 377 MAKQISVNLLLRSETFEISYPLLNTYILHKKFANS 411
>gi|225579041|ref|NP_663565.2| UPF0420 protein C16orf58 homolog [Mus musculus]
gi|74204001|dbj|BAE29003.1| unnamed protein product [Mus musculus]
gi|148685697|gb|EDL17644.1| cDNA sequence BC017158, isoform CRA_b [Mus musculus]
Length = 466
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 114/386 (29%), Positives = 189/386 (48%), Gaps = 24/386 (6%)
Query: 72 LVEDQFVDSSLRLICSEDIDG-------RRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
L +QF + R CS DG RRW +++ + A P A A
Sbjct: 11 LCTEQFGSGAPR-GCSAAADGSLQWDGARRWGWLSR----APIAKPGQHAGGGGGPWA-A 64
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+S +RS ++P GFPDSV+P Y+PY W +++ F G TQ +L +GV +A
Sbjct: 65 LTALSGLRSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKA 124
Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ A W++KD G +G++IFA +G K D + KQ R D+L ++ +E+ P
Sbjct: 125 SVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYP 184
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGT 300
F +N+AK + V +TR + A+ N+ DV+AK E V N+A LL +
Sbjct: 185 IFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVS 244
Query: 301 GLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
L + + PSL + F LL+ +++++Y+ V+++VL TLN +R + +E FL+ G V
Sbjct: 245 LLMLPLVSDCPSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVL 304
Query: 360 SLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSK 417
N E +++ W P + S + ++ E E Y++ +N S+
Sbjct: 305 EPASANQMEPLWTGFW----PSLSLSLGVPLHHLVSSVSELKQLVEGHHEPYLLCWNKSR 360
Query: 418 GKVYALLKDQAKSDDILKAAFHAYVL 443
+V L +A + +L+AA H +L
Sbjct: 361 NQVRVALSQEAGPETVLRAATHGLIL 386
>gi|356495287|ref|XP_003516510.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
Length = 649
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/337 (31%), Positives = 170/337 (50%), Gaps = 31/337 (9%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++P+GFP+SVT Y+ Y WRA++ GV TQ+LL +VG+ + A P A AINW+
Sbjct: 186 MLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG-AIPTAAAINWV 244
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K++ + G+ FD D K R DLL G+E+ T A P F+ + A
Sbjct: 245 LKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAAFGLEMCTPAFPQFFVLIGAVAG 304
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI----MISKRN 310
+++ A++ STR+ + FA N +V AKGE G + +G GL I I
Sbjct: 305 ASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLGIGLGNCIGSST 364
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
P ++ +F +L+ +++ + + +S+ L TLN R S+ +L +G P ++E N +E +
Sbjct: 365 PLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 424
Query: 371 F-SFPWL------RDKPVVLGSRFKDA-----------------FQDPDAYLAVEPFFEK 406
F + P L + + +VL S KDA + LA+ ++
Sbjct: 425 FPAVPILNATFANKAQSIVLSSEAKDAAAEIEHRLQLGSKLSEIVNSKEDVLALFGLYKN 484
Query: 407 ERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
E Y+++ GK +LK+ D+LKA F L
Sbjct: 485 EGYILS--EYMGKFCVVLKENCSQQDMLKALFQVNYL 519
>gi|328872765|gb|EGG21132.1| DUF647 family protein [Dictyostelium fasciculatum]
Length = 534
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 96/291 (32%), Positives = 149/291 (51%), Gaps = 8/291 (2%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
++L + +P G+P SVT Y YM W L++ G V +T LL SVGV + +
Sbjct: 109 QKLTKMFQKLFLPKGYPQSVTHDYSQYMKWMFLQNTLGSTTYVLSTHALLTSVGVGLSTS 168
Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
P A AI+W+LKDG G ++FA + DY+LK +F GD+L +G +E+ T VP
Sbjct: 169 LPFAAAISWVLKDGLGASALVLFASKYSTSLDYNLKSFKFRGDILHNVGVMLEMLTPFVP 228
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
FLPLA +N++K +A + +TR + ++F+ EN+GD+TAK + + L G GL
Sbjct: 229 GYFLPLASVSNLSKGLAGLIYGATRASLNRSFSLQENLGDITAKYQSQSMASYLTGMGLG 288
Query: 304 IMISKRNPSLITTFAL-----LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
I + TT L +S ++ S + +KS+ L TLN R + ++ +L+ +
Sbjct: 289 TTIGLLMSPMTTTANLCAVFIISLLHMSCSVKSLKSICLKTLNSQRSKIIIDHWLREKII 348
Query: 359 PSLREGNLQENIFSFPWLRDKPVV-LGSRFKDAFQDPDAYLAVEPFFEKER 408
S N EN F F D P + GS F+ D +V + + R
Sbjct: 349 LSPEIVNKNEN-FMFSSSTDTPKIHFGSSIDQCFKGQDEIDSVMEYIQNNR 398
>gi|74201602|dbj|BAE28428.1| unnamed protein product [Mus musculus]
Length = 466
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/386 (29%), Positives = 189/386 (48%), Gaps = 24/386 (6%)
Query: 72 LVEDQFVDSSLRLICSEDIDG-------RRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
L +QF + R CS DG RRW +++ + A P A A
Sbjct: 11 LCTEQFGSGAPR-GCSAAADGSLQWDGARRWCWLSR----APIAKPGQHAGGGGGPWA-A 64
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+S +RS ++P GFPDSV+P Y+PY W +++ F G TQ +L +GV +A
Sbjct: 65 LTALSGLRSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKA 124
Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ A W++KD G +G++IFA +G K D + KQ R D+L ++ +E+ P
Sbjct: 125 SVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYP 184
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGT 300
F +N+AK + V +TR + A+ N+ DV+AK E V N+A LL +
Sbjct: 185 IFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVS 244
Query: 301 GLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
L + + PSL + F LL+ +++++Y+ V+++VL TLN +R + +E FL+ G V
Sbjct: 245 LLMLPLVSDCPSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVL 304
Query: 360 SLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSK 417
N E +++ W P + S + ++ E E Y++ +N S+
Sbjct: 305 EPASANQMEPLWTGFW----PSLSLSLGVPLHHLVSSVSELKQLVEGHHEPYLLCWNKSR 360
Query: 418 GKVYALLKDQAKSDDILKAAFHAYVL 443
+V L +A + +L+AA H +L
Sbjct: 361 NQVRVALSQEAGPETVLRAATHGLIL 386
>gi|330805147|ref|XP_003290548.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
gi|325079335|gb|EGC32940.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
Length = 493
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 94/276 (34%), Positives = 154/276 (55%), Gaps = 9/276 (3%)
Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV-GVSRNRATPGAVA 190
+SY +P G+P+SV+P Y+ Y W ++ G V +T LL +V G+S A P A A
Sbjct: 97 KSYFLPKGYPNSVSPDYIHYYKWLFCQNVVGSVTYVLSTHALLTTVVGMSVASALPFAAA 156
Query: 191 INWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
I+W+LKDG G + FA + D+DLK+ +F GD+L LG +E+ T + FLPL
Sbjct: 157 ISWVLKDGLGAFALVFFANKFSTLLDFDLKKYKFRGDILHNLGVLLEMCTPFIQGYFLPL 216
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL----SIM 305
A +N++K +A + STR + K+F+ +NIGD+TAK + A L+G G+ ++
Sbjct: 217 ASISNLSKGLAGLIYGSTRASLNKSFSIKDNIGDITAKYQSQSMAAYLMGMGIGSTFGLL 276
Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR-EG 364
+S S I+ LS ++F Y+ VKS+ L +LN+ R S+ ++++L + + +
Sbjct: 277 LSGSTFSGISIVFSLSFVHMFLGYKAVKSINLKSLNKQRLSIIIDNWLLNSEILNSKIVN 336
Query: 365 NLQENIFSFPWLRDKPVVLG--SRFKDAFQDPDAYL 398
N +E IF + ++ + LG S K+ +P L
Sbjct: 337 NNEEFIFDSKYNKEPHIKLGLSSDDKEFLNNPQEIL 372
>gi|356497988|ref|XP_003517837.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 473
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 94/345 (27%), Positives = 177/345 (51%), Gaps = 18/345 (5%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVT YVP+ W L+ + +TQ LL+++GV AT W L
Sbjct: 93 VPEGFPSSVTADYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFL 152
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G +G ++F QG D + K R DL+ +LG ++L + P F+ + C +
Sbjct: 153 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGS 212
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR---NP 311
++++ V S +TR + + FA +N D++AK +A ++G L +++++ +P
Sbjct: 213 ISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVARLTIGHP 272
Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
++ +F L+ +++++Y+ V+ + L++LN R S+ ++ F++TG V S ++ + QE++
Sbjct: 273 LAIWFSFLSLTVFHMYANYRAVRCLALNSLNPERSSILLQHFMETGQVLSPKQVSSQEHV 332
Query: 371 FSFPWLR---------DKPVVLGSRFK--DAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
++ K V LG R D + + L P++ K +Y++ KG
Sbjct: 333 LPIQFISWSSKKANCLHKKVCLGMRISSFDNMEIKEHLLCAAPYYTKAKYLLV--ERKGI 390
Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYD 464
+ ++ + + D+LK+ FHA VL ++ S S K +Y
Sbjct: 391 IDVIVHKDSNAADVLKSFFHALVLAKNVHKSKSLHSEGQKWIDYQ 435
>gi|226494744|ref|NP_001146435.1| uncharacterized protein LOC100280018 [Zea mays]
gi|219887187|gb|ACL53968.1| unknown [Zea mays]
Length = 439
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 113/380 (29%), Positives = 184/380 (48%), Gaps = 18/380 (4%)
Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
D +G R SI +Q + ++L+ + P G+P SV Y+ Y +R L+HF
Sbjct: 40 DAAGSGSR-SISVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFS 98
Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
+ V +TQ+LL + G+ R TP A A++WILKDG GK+I + G + D + K
Sbjct: 99 SAMLHVLSTQSLLFAAGL---RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKS 155
Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
D+L +LG +E+ + P LFL +A N AK +A V + +TR PIY +FAK N
Sbjct: 156 WTILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 215
Query: 281 IGDVTAKGECVGNIADLLGTGLSI-----MISKRNPSLITTFALLSCGYVFSSYQEVKSV 335
+ D+ AKGE + + +++G G I + S LI LLS +++ QE+++
Sbjct: 216 LSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG-PLLSAVHIYGVIQEMRAT 274
Query: 336 VLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP---WLRDKPVVLGSRFKDAFQ 392
++TLN R ++ V F+K+G V S E +E++ FP V +G + F
Sbjct: 275 PVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLL-FPNRVIEEAGSVKIGQPLRKVF- 332
Query: 393 DPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNG 452
P + F E+++++ + Y +L+ A +D L+ A S G
Sbjct: 333 SPRLVEQLRATFPGEKFLLSQK--SNRTYMVLERSASGEDALRGWLVAAFASEMERSGAG 390
Query: 453 HRSLALKQHEYDNSNLMLTT 472
R A+ Y+ + T
Sbjct: 391 SRDDAVLSEAYEKMERVFPT 410
>gi|81879541|sp|Q91W34.1|CP058_MOUSE RecName: Full=UPF0420 protein C16orf58 homolog
gi|16877856|gb|AAH17158.1| CDNA sequence BC017158 [Mus musculus]
Length = 466
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 188/386 (48%), Gaps = 24/386 (6%)
Query: 72 LVEDQFVDSSLRLICSEDIDG-------RRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
L +QF + R CS DG RRW +++ + A P A A
Sbjct: 11 LCTEQFGSGAPR-GCSAAADGSLQWDGARRWGWLSR----APIAKPGQHAGGGGGPWA-A 64
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+S +RS ++P GFPDSV+P Y+PY W +++ F G TQ +L +GV +A
Sbjct: 65 LTALSGLRSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKA 124
Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ A W++KD G +G++I A +G K D + KQ R D+L ++ +E+ P
Sbjct: 125 SVSAATSTWLVKDSTGMLGRIILAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYP 184
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGT 300
F +N+AK + V +TR + A+ N+ DV+AK E V N+A LL +
Sbjct: 185 IFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVS 244
Query: 301 GLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
L + + PSL + F LL+ +++++Y+ V+++VL TLN +R + +E FL+ G V
Sbjct: 245 LLMLPLVSDCPSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVL 304
Query: 360 SLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSK 417
N E +++ W P + S + ++ E E Y++ +N S+
Sbjct: 305 EPASANQMEPLWTGFW----PSLSLSLGVPLHHLVSSVSELKQLVEGHHEPYLLCWNKSR 360
Query: 418 GKVYALLKDQAKSDDILKAAFHAYVL 443
+V L +A + +L+AA H +L
Sbjct: 361 NQVQVALSQEAGPETVLRAATHGLIL 386
>gi|383859063|ref|XP_003705017.1| PREDICTED: UPF0420 protein C16orf58-like [Megachile rotundata]
Length = 464
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 97/332 (29%), Positives = 164/332 (49%), Gaps = 24/332 (7%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
+S ++ +P G+PDSV P Y Y W ++ F MG TT +++ VGV ATP
Sbjct: 45 FVSILKEVFLPQGYPDSVHPDYTAYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEPAATP 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
A AI WILKDG G +G+++FA G D K+ R D+L ++ G+EL
Sbjct: 105 LAAAITWILKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDVAMGLELLLPYYSSY 164
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTG 301
L + C + K++ V +TR + + A N+ DV+AK CV IA L+G
Sbjct: 165 SLAILCVSTAMKSIVGVAGGATRAAVTQHQALQNNLADVSAKDGSQETCVNLIASLVGI- 223
Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
L + I ++ + L +++++Y VKS+ L++LN R ++ V+S++ +P
Sbjct: 224 LILSIFHNGRYIMELYFFLIIVHLYANYSAVKSLCLNSLNEDRLALIVKSYITNEVIPEP 283
Query: 362 REGNLQENIFSFPWLRDKP------------VVLGSRFK-DAFQDPDAYLAVEPFFEKER 408
E N +E++ L KP V L + K + D L+++ F ++ +
Sbjct: 284 EEVNKKESVL----LLTKPTMSIYGFNIKMGVSLAALIKNNIISTSDTELSLKLFLDR-K 338
Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
Y+++ + ++ K A+ D+L+A FHA
Sbjct: 339 YLISIDVQNKTIFICFKKDAQPCDVLEAYFHA 370
>gi|255639094|gb|ACU19847.1| unknown [Glycine max]
Length = 302
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 90/248 (36%), Positives = 141/248 (56%), Gaps = 9/248 (3%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +RA++H A+ V +TQ+LL + G+ R TP
Sbjct: 47 SFMNKFF-PSGYPYSVNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGL---RPTPAQ 102
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++WILKDG VGK+I + G + D + K+ R D L ++G G+E+ + PHLFL
Sbjct: 103 ATAVSWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFL 162
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM-- 305
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + +++G G+ I
Sbjct: 163 EMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLA 222
Query: 306 --ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
I + LLS +++S +E+++ ++TLN R ++ V FLK G V S +
Sbjct: 223 STICASMQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAGIVSSPAD 282
Query: 364 GNLQENIF 371
+EN+
Sbjct: 283 LRYRENLL 290
>gi|115459438|ref|NP_001053319.1| Os04g0517300 [Oryza sativa Japonica Group]
gi|113564890|dbj|BAF15233.1| Os04g0517300 [Oryza sativa Japonica Group]
gi|116310213|emb|CAH67223.1| OSIGBa0145M07.5 [Oryza sativa Indica Group]
gi|215697660|dbj|BAG91654.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 108/347 (31%), Positives = 180/347 (51%), Gaps = 22/347 (6%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +RAL+HF + V +TQ+LL + G+ R TP
Sbjct: 62 SFLNKFF-PSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGL---RPTPAQ 117
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++WILKDG GK+I + G + D + K R D+L + G +E+ + P LFL
Sbjct: 118 ATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFL 177
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI--- 304
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I
Sbjct: 178 EVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 237
Query: 305 --MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ S LI LLS +++ QE+++ ++TLN R ++ V F+K+G V S
Sbjct: 238 STVCSTTQGKLIAG-PLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPA 296
Query: 363 EGNLQENIFSFP---WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
E +E++ FP V +G + P ++ F KE+++++ +
Sbjct: 297 ELRYREDLL-FPNRLIEEAGSVKIGQPVRRVL-SPQRIEQLKATFSKEKFLLSRKDNSA- 353
Query: 420 VYALLKDQAKSDDILK----AAFHAYVLLHCINSSNGHRSLALKQHE 462
Y +L+ A +D L+ AAF + + + S + ++A ++ E
Sbjct: 354 -YMVLEQSATGEDALRGWLVAAFASEMERSGVGSGDTVLNVAYERME 399
>gi|357164706|ref|XP_003580140.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 437
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 171/339 (50%), Gaps = 14/339 (4%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +RAL+HF + V +TQ+LL + G+ R TP
Sbjct: 66 SFLNKFF-PSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGL---RPTPAQ 121
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++WILKDG GK+I + G + D + K R D+L + G ++ + P LFL
Sbjct: 122 ATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRIFADVLYDFGTALDFISPLCPQLFL 181
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL----GTGLS 303
+A N AK +A V + +TR PIY AFAK N+ D+ AKGE + +++++ G GLS
Sbjct: 182 EVAGLGNFAKGMAVVAARATRLPIYSAFAKEGNLSDLFAKGEAISTLSNVMGIGAGIGLS 241
Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
+ + LLS +++ QE++++ +++LN R ++ V F+K+G V S E
Sbjct: 242 STVCSTTQGKLIIGPLLSAVHLYGVVQEMRAIPINSLNPQRTAMVVADFIKSGKVSSPAE 301
Query: 364 GNLQENIFSFP--WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVY 421
+E++ FP + + V + P + + E+++++ + K Y
Sbjct: 302 LRYREDLL-FPNRLIEEAGSVKVGQPLCRILSPQRVEQLRATYPNEKFLLSLKSN--KTY 358
Query: 422 ALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ 460
+L+ A +D L+ A S G R L +
Sbjct: 359 MVLEQSATGEDALRGWLVAAFASEMEKSGVGSRDSVLNE 397
>gi|38568058|emb|CAE05447.3| OSJNBa0073E02.7 [Oryza sativa Japonica Group]
gi|125549033|gb|EAY94855.1| hypothetical protein OsI_16651 [Oryza sativa Indica Group]
gi|125590999|gb|EAZ31349.1| hypothetical protein OsJ_15471 [Oryza sativa Japonica Group]
Length = 380
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 108/347 (31%), Positives = 180/347 (51%), Gaps = 22/347 (6%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG- 187
SF+ + P G+P SV Y+ Y +RAL+HF + V +TQ+LL + G+ R TP
Sbjct: 4 SFLNKFF-PSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGL---RPTPAQ 59
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A A++WILKDG GK+I + G + D + K R D+L + G +E+ + P LFL
Sbjct: 60 ATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFL 119
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI--- 304
+A N AK +A V + +TR PIY +FAK N+ D+ AKGE + + +++G G I
Sbjct: 120 EVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLA 179
Query: 305 --MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ S LI LLS +++ QE+++ ++TLN R ++ V F+K+G V S
Sbjct: 180 STVCSTTQGKLIAG-PLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPA 238
Query: 363 EGNLQENIFSFP---WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
E +E++ FP V +G + P ++ F KE+++++ +
Sbjct: 239 ELRYREDLL-FPNRLIEEAGSVKIGQPVRRVL-SPQRIEQLKATFSKEKFLLSRKDNSA- 295
Query: 420 VYALLKDQAKSDDILK----AAFHAYVLLHCINSSNGHRSLALKQHE 462
Y +L+ A +D L+ AAF + + + S + ++A ++ E
Sbjct: 296 -YMVLEQSATGEDALRGWLVAAFASEMERSGVGSGDTVLNVAYERME 341
>gi|42565615|ref|NP_190175.2| uncharacterized protein [Arabidopsis thaliana]
gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis thaliana]
gi|30794095|gb|AAP40490.1| unknown protein [Arabidopsis thaliana]
gi|110739240|dbj|BAF01534.1| hypothetical protein [Arabidopsis thaliana]
gi|332644566|gb|AEE78087.1| uncharacterized protein [Arabidopsis thaliana]
Length = 608
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 178/355 (50%), Gaps = 35/355 (9%)
Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
+P+ + + + +++P+GFP+SVT Y+ Y WR ++ GV TQ+LL +VG
Sbjct: 182 TPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVG 241
Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
+ + A P A AINW+LKDG G + K++ ++ G+ FD K R DLL G+E+
Sbjct: 242 LGKG-AIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEML 300
Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
T P F+ + AA ++ AA+ +TR+ FA N +V AKGE G ++ +
Sbjct: 301 TPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSV 360
Query: 299 GTGLSIMISK---RNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
G L I+++ + SL + F +++ +++++ + + + L TLN R S+ +L
Sbjct: 361 GILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLI 420
Query: 355 TGHVPSLREGNLQENIFSFPWLR--------------------------DKPVVLGSRFK 388
+G P ++E N +E + FP +R ++ + LGS+
Sbjct: 421 SGQAPLIKEVNDEEPL--FPTVRFSNMKSPEKLQDFVLSSEAKAAAADIEERLQLGSKLS 478
Query: 389 DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
D + + +A+ + E Y++T + KG+ +LK+ + D+L++ F L
Sbjct: 479 DVIHNKEEAIALFDLYRNEGYILTEH--KGRFCVMLKESSTPQDMLRSLFQVNYL 531
>gi|322787029|gb|EFZ13253.1| hypothetical protein SINV_10096 [Solenopsis invicta]
Length = 468
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 99/344 (28%), Positives = 161/344 (46%), Gaps = 24/344 (6%)
Query: 115 ISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLL 174
+SL A + S + +P G+PDSV P Y Y W ++ F G TT +++
Sbjct: 31 LSLTESLAIRSRITSVFKQIFLPQGYPDSVHPDYTAYQIWDTVQAFASTITGTLTTHSIM 90
Query: 175 NSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGA 233
+GV + ATP A AI WILK G G +G ++FA G K D K+ R D+L ++
Sbjct: 91 KGIGVGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTKLDGQCKKWRLFADILDDIAK 150
Query: 234 GVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----E 289
G+EL + C + K++ V +TRT + A N+ DV+AK
Sbjct: 151 GIELLVPFFSSYSTVILCISTTMKSIVGVAGGATRTALINHQAIQNNVADVSAKDGSQET 210
Query: 290 CVGNIADLLGTG-LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
CV IA G LS R L+ + L +++++Y VK++ L TLN R ++
Sbjct: 211 CVNLIASFFGIFILSFFHDGR--YLLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAII 268
Query: 349 VESFLKTGHVPSLREGNLQENIFSFPWLRDKP--------VVLGSRFKDAFQDPDAYLA- 399
V++++ +P + N +E+IF L P + +G F + L
Sbjct: 269 VKNYMINEQIPKPGKVNKEESIF----LLTNPGRNVHGFNIKIGVSFTSILKRNIISLTE 324
Query: 400 ---VEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
+ FFE +Y++T + K ++ +LK +S +IL+ F+A
Sbjct: 325 MEFLLKFFENRKYIITIDIEKKNIFIILKKDIQSIEILEGYFYA 368
>gi|348510102|ref|XP_003442585.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oreochromis niloticus]
Length = 433
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 174/363 (47%), Gaps = 36/363 (9%)
Query: 95 WNYVA-----ERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYV 149
W YV+ ER R G + + +S +P G+P+SV+ Y+
Sbjct: 20 WKYVSKDGVMERRRDGS------------EGHSGGNSVFGVFKSVFLPQGYPESVSDDYL 67
Query: 150 PYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA- 208
Y W ++ F G TQ L GV AT A + W+ KDG G +G+++FA
Sbjct: 68 QYQFWDTMQAFSSSLSGTLATQASLRGFGVGNQEATVAAATVTWLFKDGTGMLGRILFAW 127
Query: 209 RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTR 268
R+G K D + K+ R D+L ++ +E+ P F + C A + K++ V +TR
Sbjct: 128 RKGSKLDSEAKKWRLFADVLNDIAMFMEILAPYFPACFTFIVCTAGIFKSIVGVAGGATR 187
Query: 269 TPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCGY 324
+ A+ +N+ D++AK E + N+A LL + + I + NP L + F L + +
Sbjct: 188 AALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLILIPLVTDNPVLTLCLFFLFTVLH 247
Query: 325 VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG 384
+F++Y+ V+SVV+ T N AR S+ ++ +LK + S E N +E +F F + + P+ LG
Sbjct: 248 LFANYKAVRSVVMETFNEARLSIVLQQYLKDKRILSPPEANQREPVF-FEFGQTVPMKLG 306
Query: 385 SRFKDAFQDP-DAYLAVE----PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFH 439
R ++ Q P D LA++ P+ R G V L +A + D ++A
Sbjct: 307 VRLQEIAQSPEDLQLALKENSMPYLLGIR--------NGCVCVCLGPEASASDEIRAMCQ 358
Query: 440 AYV 442
A V
Sbjct: 359 AVV 361
>gi|157422883|gb|AAI53381.1| Zgc:162613 protein [Danio rerio]
Length = 432
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 161/323 (49%), Gaps = 11/323 (3%)
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
A + +S +P G+P+SV+ Y+ Y W ++ F G TQ L VGV
Sbjct: 41 ANSVSGVFKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQE 100
Query: 184 ATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
AT A I W+L+DG G +G+++FA +G K D + K+ R D+L ++ +E+A
Sbjct: 101 ATVAAATITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHF 160
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLG 299
P F + C A + K++ V +TR + A+ N+ D++AK E + N+A LL
Sbjct: 161 PPFFTLIVCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLV 220
Query: 300 TGLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
+ I + NP F L + ++F++Y+ V+SVV+ TLN AR S+ + +L G V
Sbjct: 221 SLALIPLVTDNPLLTFILFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLHQYLLHGQV 280
Query: 359 PSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDP-DAYLAVEPFFEKERYVVTYNPSK 417
S E N +E +F + R P+ LG R D +P D LA++ + Y++
Sbjct: 281 LSPAEANQREPVF-LSFRRTVPIKLGVRLGDIIHEPGDLQLALKN--NSKPYLIGIK--N 335
Query: 418 GKVYALLKDQAKSDDILKAAFHA 440
G V L A DD +KA A
Sbjct: 336 GAVCVCLGSDASVDDEIKAVCQA 358
>gi|356545865|ref|XP_003541354.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 431
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 187/376 (49%), Gaps = 26/376 (6%)
Query: 103 RSGKFKRNSIRAISLQSPQAPAEELM--SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHF 160
R +F+ N +++L P + SF+ + P G+P SV Y+ Y +RAL+H
Sbjct: 36 RRFQFEPNGQLSVTLVGDSRPLYHRIVDSFLNKFF-PSGYPYSVNEGYLRYTQFRALQHT 94
Query: 161 FGGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLK 219
A+ V +TQ+LL + G+ R TP A A++W+LKDG +G +I ++ G + D + K
Sbjct: 95 ASAALSVLSTQSLLFAAGL---RPTPAQATAVSWVLKDGMQHLGNIICSKLGARMDSEPK 151
Query: 220 QLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGE 279
+ R D+L +LG G+E+ + P FL +A N+AK ++ V S +TR PIY +FAK
Sbjct: 152 RWRILADVLYDLGTGLEVLSPLCPQFFLEMAGIGNLAKGMSIVASRATRLPIYSSFAKEG 211
Query: 280 NIGDVTAKGECVGNIADLLGTGLSIMISK----RNPSLITTFALLSCGYVFSSYQEVKSV 335
N+ D+ A+GE + + +++G G+ I ++ + LLS +++S +E+++
Sbjct: 212 NLSDLFARGEAISTLFNVVGIGIGIQLASTVCASMQGKLVVGPLLSIIHIYSVSEEMRAT 271
Query: 336 VLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP--WLRDKPVVLGSRFKDAFQD 393
++TLN R ++ V FLK G V S + +E++ FP + D V R
Sbjct: 272 PVNTLNPQRTAMIVADFLKAGSVSSPADLRYREDLL-FPRRLIEDAGNVRVGRDLHKVIK 330
Query: 394 PDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGH 453
P L + F E++++ N + +L+ A D L+ A + SS
Sbjct: 331 PSRLLESKQVFPGEKFIL--NGDNRCIDMVLEQDAIGKDALRGWLVASYAVQIGKSS--- 385
Query: 454 RSLALKQHEYDNSNLM 469
HE S L+
Sbjct: 386 -------HELSTSTLL 394
>gi|380030107|ref|XP_003698699.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Apis florea]
Length = 464
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 101/332 (30%), Positives = 163/332 (49%), Gaps = 24/332 (7%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L+S I+ +P GFPDSV P Y Y W ++ F MG TT +++ VGV ATP
Sbjct: 45 LISIIKEVFLPQGFPDSVHPDYTSYQVWDTVQAFASTIMGTLTTHSIMQGVGVGEAAATP 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
A AI WILKDG G VG+++FA G D K+ R D+L +L G+EL
Sbjct: 105 LAAAITWILKDGTGMVGRIMFAWWNGTDLDGQCKKWRLFADILNDLAMGLELLLPYFSSY 164
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTG 301
L + C + K++ V +TR + A N+ DV+AK CV IA +G
Sbjct: 165 SLGILCISTAMKSIVGVAGGATRAALTNIKALQNNLADVSAKDGSQETCVNLIASFVGI- 223
Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
L + I ++ + L +++++Y VK++ L++LN R ++ V+S++ +P
Sbjct: 224 LILSIFHNGQYIMELYLFLVIIHLYANYSAVKALCLNSLNEDRLALIVKSYISNEVIPKP 283
Query: 362 REGNLQENIFSFPWLRDKPVVLGSRF-------------KDAFQDPDAYLAVEPFFEKER 408
E N +E++ L KP + F KD D L+++ F ++ +
Sbjct: 284 EEVNKKESVL----LLTKPTMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFLDR-K 338
Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
Y+++ + ++ LK A+ D+L+A FHA
Sbjct: 339 YLISIDVQNKTIFICLKKDAQPYDVLEAYFHA 370
>gi|297815746|ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
lyrata]
gi|297321594|gb|EFH52015.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
lyrata]
Length = 613
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 178/355 (50%), Gaps = 35/355 (9%)
Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
+P+ + + + +++P+GFP+SVT Y+ Y WR ++ GV TQ+LL +VG
Sbjct: 188 TPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVLATQSLLYAVG 247
Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
+ + A P A AINW+LKDG G + K++ ++ G+ FD K R DLL G+E+
Sbjct: 248 LGKG-AIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEML 306
Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
T P F+ + AA ++ AA+ +TR+ FA N +V AKGE G ++ +
Sbjct: 307 TPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSM 366
Query: 299 GTGLSIMISK---RNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
G L I+++ + SL + F +++ +++++ + + + L TLN R S+ +L
Sbjct: 367 GILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLI 426
Query: 355 TGHVPSLREGNLQENIFSFPWLR--------------------------DKPVVLGSRFK 388
+G P ++E N +E + FP +R ++ + LGS+
Sbjct: 427 SGQAPLIKEVNDEEPL--FPTVRFLNMKSPEKLQDFVLSSEAKAAAEDIEERLQLGSKLS 484
Query: 389 DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
D + + +A+ + E Y++T + +G+ +LK+ + D+L++ F L
Sbjct: 485 DVIHNKEEAIALFDLYRNEGYILTEH--RGRFCVMLKESSTPQDMLRSLFQVNYL 537
>gi|298709448|emb|CBJ31354.1| putative glutamate/ornithine acetyltransferase (ISS) [Ectocarpus
siliculosus]
Length = 789
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 84/241 (34%), Positives = 131/241 (54%), Gaps = 21/241 (8%)
Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
P+ VTP Y + WR + F G + VF TQ LL ++G+ R A A++W+LKD
Sbjct: 186 IPEDVTPDYYHFTFWRMFQRFVAGTINVFGTQALLLALGIKAKRLG-AAAAMSWVLKDAL 244
Query: 200 GRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
G+ G++++A + G++FD D K+ RF LL G G+E+ T P FL LA AN K
Sbjct: 245 GKFGRILWASKMGRRFDSDAKRWRFRSSLLYAAGNGLEIVTYVFPASFLVLAAMANSFKQ 304
Query: 259 VAAVTSTSTRTPIYKAFAKGENIGDVTAK---------------GECVGNIADLLGT--- 300
++ +TS++TR IY++FA+GENIGD+TAK GE + D+LG
Sbjct: 305 MSMLTSSATRNTIYRSFARGENIGDITAKACPPAAAMPNAKKHEGEAQIAVVDVLGMLSG 364
Query: 301 -GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
GL ++ +++ + +LS + + Y E+++V LN R + V F+ +G P
Sbjct: 365 IGLCTLVGTSRQTIVAAYVILSLVDISAIYNEIRAVCFSVLNHERTHLLVRDFVASGSDP 424
Query: 360 S 360
S
Sbjct: 425 S 425
>gi|332030173|gb|EGI69967.1| UPF0420 protein C16orf58-like protein [Acromyrmex echinatior]
Length = 471
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 92/345 (26%), Positives = 164/345 (47%), Gaps = 22/345 (6%)
Query: 113 RAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQT 172
R L Q + S + +P G+P+SV P Y Y W ++ F G TT +
Sbjct: 35 RKFGLTESQTVHSRITSVFKQIFLPQGYPNSVHPDYTAYQIWDTVQAFASTITGTLTTHS 94
Query: 173 LLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMEL 231
++ +GV + ATP A AI WILK G G +G ++FA G + D K+ R D+L ++
Sbjct: 95 IMKGIGVGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTELDGQCKKWRLFADILDDI 154
Query: 232 GAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG--- 288
G+EL + + C + K++ V +TRT + A N+ DV+AK
Sbjct: 155 AKGIELLVPYFSSYSVIILCISTTMKSIVGVAGGATRTALIHHQAIQNNVADVSAKDGSQ 214
Query: 289 -ECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSV 347
CV IA G + + + ++ + L +++++Y VK++ L TLN R ++
Sbjct: 215 ETCVNLIASFFGIFI-LSLFHDGQYVLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAI 273
Query: 348 AVESFLKTGHVPSLREGNLQENIFSFPWLRDKP--------VVLGSRFKDAFQ-DPDAYL 398
V+ ++ +P + N +E++F L P + +G F + + + ++
Sbjct: 274 IVKKYMMNEQIPEPWKVNKEESVF----LLKNPTRDIHGFDIKIGVSFANILKRNIISFT 329
Query: 399 AVE---PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
+E FFE +Y++ + K ++ +LK +S +IL+A F+A
Sbjct: 330 EMEFLLKFFENRKYIILIDIEKRNIFVILKKDIQSVEILEAYFYA 374
>gi|350411769|ref|XP_003489447.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus impatiens]
Length = 464
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 99/332 (29%), Positives = 164/332 (49%), Gaps = 24/332 (7%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
+S ++ +P G+PDSV P Y PY W ++ F MG TT +++ VGV ATP
Sbjct: 45 FISIVKEVFLPQGYPDSVHPDYTPYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEAAATP 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
A AI WILKDG G +G+++FA G D K+ R D+L +L G+EL
Sbjct: 105 LAAAITWILKDGTGMIGRIVFAWWNGTDLDGQCKKWRLFADILNDLAMGLELFLPYFSSY 164
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTG 301
L + C + K++ V +TR + + A N+ DV+AK CV IA +G
Sbjct: 165 SLGILCISTAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGI- 223
Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
L + I ++ + L +++++Y VK++ L++LN R ++ V+S++ +P
Sbjct: 224 LILSIFHNGRYIMELYLFLVMIHLYANYMAVKALCLNSLNEDRLALIVKSYISNEVIPKP 283
Query: 362 REGNLQENIFSFPWLRDKPV--VLGSRFK-----------DAFQDPDAYLAVEPFFEKER 408
E N +E++ L KP + G K D D LA++ F ++ +
Sbjct: 284 EEVNKKESVL----LLTKPTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDR-K 338
Query: 409 YVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
Y+++ + ++ LK A+ D+L+A FHA
Sbjct: 339 YLISIDVQNKTIFICLKKDAQPYDVLEAYFHA 370
>gi|410903033|ref|XP_003964998.1| PREDICTED: UPF0420 protein C16orf58 homolog [Takifugu rubripes]
Length = 442
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 158/311 (50%), Gaps = 25/311 (8%)
Query: 95 WNYVA-----ERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYV 149
W YVA ER R G R + ++ +S +P G+P+SV+ Y+
Sbjct: 21 WKYVANDGAIERRREGGESRGN--------------SVVGAFKSVFLPQGYPESVSGDYL 66
Query: 150 PYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA- 208
Y W ++ FF G T+ L VGV AT A + W+LKDG G G+++FA
Sbjct: 67 RYQFWDTVQAFFSSLSGTLATRASLRGVGVGNQEATVAAATMTWLLKDGTGMCGRILFAW 126
Query: 209 RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTR 268
R+G K D + K+ R D+L ++ +E+ P F + C + + K++ V +TR
Sbjct: 127 RKGTKLDSEAKKWRLFADILNDVAMFMEILAPYFPAFFTLVMCVSGIFKSLVGVAGGATR 186
Query: 269 TPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCGY 324
+ A+ +N+ D++AK E + N+A LL + + I N +L + F L + +
Sbjct: 187 AALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLVLIRFVSDNAALTLGLFFLFAVLH 246
Query: 325 VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG 384
+F++Y+ V+SVV+ TLN AR SV ++ +L+ G + S E N +E +F + + P+ LG
Sbjct: 247 LFANYKAVRSVVMETLNEARLSVVLQQYLENGQILSPLEANEREPVF-MDFKKVVPIRLG 305
Query: 385 SRFKDAFQDPD 395
R ++ Q +
Sbjct: 306 VRLQEVIQSSE 316
>gi|405974674|gb|EKC39300.1| UPF0420 protein C16orf58-like protein [Crassostrea gigas]
Length = 444
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 172/343 (50%), Gaps = 9/343 (2%)
Query: 106 KFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAM 165
K +++ ++ + L SP+ + L F +P G+P+SV+ Y+ Y W ++ F
Sbjct: 18 KTEKDKLKTVDL-SPKWTS--LSQFFTVVFLPQGYPESVSSDYLQYQIWDTIQAFASSIT 74
Query: 166 GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFA 224
G Q LL VGV AT A + WILKDG G +G+++FA QG D D K+ R
Sbjct: 75 GTLAAQALLKGVGVGDESATVMAATLTWILKDGTGMIGRILFAWMQGTNLDCDAKRWRLF 134
Query: 225 GDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV 284
D+L ++ +E+ F P+ C A V K++ V +TR + + A+ N+ DV
Sbjct: 135 ADILNDMAIFMEILAPYFKAYFTPIVCTAGVCKSIVGVAGGATRAALTQHQARRNNMADV 194
Query: 285 TAKG---ECVGNIADLLGT-GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
+AK E + N+A L+ + L +++ ++ + + F L + ++F++Y V SVV+ TL
Sbjct: 195 SAKDGSQETLVNLAALVCSLTLVPLVTGKDVLIWSLFLLFTLLHLFANYSAVTSVVMETL 254
Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAV 400
N+AR + V +L+TG V SL+E N +E + + R V LG+ F
Sbjct: 255 NQARLHILVHHYLQTGEVLSLQEVNYKEPVL-WKTRRKLNVHLGTSIGKVFSRASILEES 313
Query: 401 EPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
++ Y++ N G V +L + + D L + F A V+
Sbjct: 314 IALYKDSNYLLDINLQTGCVSIILSPLSTTMDHLTSCFQAEVI 356
>gi|452819618|gb|EME26673.1| glutamate N-acetyltransferase isoform 2 [Galdieria sulphuraria]
gi|452819619|gb|EME26674.1| glutamate N-acetyltransferase isoform 1 [Galdieria sulphuraria]
Length = 540
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 165/326 (50%), Gaps = 18/326 (5%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
E++ S + ++ +P+GFP+SV SY+ Y WR +++ V +TQ LL + GV R +
Sbjct: 134 EQITSSLSTFFLPEGFPESVGDSYLQYSIWRGIQNIISNISQVLSTQALLLAAGVGRTHS 193
Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
A A W KDG G+VG++ A G ++D D K+ R DLL ++G +E+ + A
Sbjct: 194 ETLAAATAWAWKDGIGQVGRLFTAVLGNQYDADPKRWRLVSDLLYDIGLSLEILSPAFSK 253
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
FL +A N++K++A S R + K F + EN+G+++AK + + +LGT + +
Sbjct: 254 YFLFMAALGNISKSIAFAIGISCRYSVLKTFVRRENLGEISAKNDAQNVVTGMLGTLIGV 313
Query: 305 MISKRNPSL----ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
+SK P+ + F + Y F +Y+ ++ L T+NR R ++ S+L +VPS
Sbjct: 314 WMSKCLPAAPKVRLGVFLFFTAIYSFFNYKSMQVAELTTINRQRGNILAHSYLMFQNVPS 373
Query: 361 LREGNLQE-------NIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTY 413
+ N E +F +P +R LG + + F KE+Y+++
Sbjct: 374 VSMSNSYERFVPLFPTVFQYPHVR-----LGVGLTELSISIEKIANYTRQFSKEKYILSL 428
Query: 414 NPSKGKVYALLKDQAKSDDILKAAFH 439
S G++ L +S D+L+ H
Sbjct: 429 --SDGEIRVALHRNHESVDLLQLLLH 452
>gi|432867528|ref|XP_004071227.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oryzias latipes]
Length = 423
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 99/329 (30%), Positives = 162/329 (49%), Gaps = 11/329 (3%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L+ +S +P G+P+SV+ Y+ Y W L+ F TQ L VGV AT
Sbjct: 41 LVGAFKSVFLPQGYPESVSEDYLQYQFWDTLQAFASTLSWTLVTQASLKGVGVGNQDATV 100
Query: 187 GAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
A + WILKDG G G+++FA Q G + D + K+ R D+L ++ +E+ P
Sbjct: 101 AAATVTWILKDGTGMFGRILFAWQKGSRLDSEAKKWRLFADVLNDIAMFIEILAPFFPSF 160
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGL 302
F + C A V K + V +TR + A+ +N+ D++AK E + N+A L+ + +
Sbjct: 161 FTLIVCTAGVFKAIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLVVSLI 220
Query: 303 SIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
I + N SL + F L + ++F++Y+ V+SVV+ T N+AR S+ ++ +LK + S
Sbjct: 221 LIPLITDNSSLSLGLFFLFTILHLFANYKAVRSVVMETFNQARLSIVLQQYLKDKRILSP 280
Query: 362 REGNLQENIFSFPWLRDKPVVLGSRFKDAFQDP-DAYLAVEPFFEKERYVVTYNPSKGKV 420
E N +E +F + P+ LG + +D Q P D +LA+ G +
Sbjct: 281 FEANRREPVF-LEFKITTPIKLGVKLQDVVQSPEDFHLAL----RNNSMAYLLGVRNGCI 335
Query: 421 YALLKDQAKSDDILKAAFHAYVLLHCINS 449
L A D ++AA A L + + S
Sbjct: 336 CVCLGSGASVHDEIRAACQALCLSNMLKS 364
>gi|281201954|gb|EFA76161.1| DUF647 family protein [Polysphondylium pallidum PN500]
Length = 430
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 179/381 (46%), Gaps = 33/381 (8%)
Query: 117 LQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNS 176
+QS + M +RS +P G+PDSVT Y+ Y W + GV T+ +L
Sbjct: 33 VQSKISKHSGAMELLRSMFLPAGYPDSVTSDYMSYQVWDTAQALSSSLSGVLATRAMLRG 92
Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
GV A+ G+ + WIL+DGAG +G++ FA + G++ D + K RF D+L ++G +
Sbjct: 93 YGVGNAAASVGSAVLQWILRDGAGMIGRIYFAWKTGRQLDCNAKTWRFMADILNDIGMTL 152
Query: 236 ELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECV 291
E+ + LF+PL+ V+K++ V T+ + FAK +N+GDV+AK V
Sbjct: 153 EMLSPLSNVLFMPLSVCGIVSKSICGVAGGCTKASLTMHFAKNDNMGDVSAKDGSQETAV 212
Query: 292 GNIADLLGTGLSIMISKRNPSLITT-------FALLSCGYVFSSYQEVKSVVLHTLNRAR 344
G + LG LS I + T F +L +++ +Y+ V VVL ++NR R
Sbjct: 213 GLVGMFLGAALSASIPDESEQTTTATVAIWMLFFILIAFHLYCNYRAVTGVVLTSINRYR 272
Query: 345 FSVAVESFLKTGHVPSLREGNLQENIFSFPWLRD--KPVVLGSRFKDAFQDPDAYLAVEP 402
++ + +L+ G +P+ + + +ENI RD +P+ LG+ + +
Sbjct: 273 ATLIYKHYLQYGEIPTPIQVSKRENIL----FRDDKQPIYLGASLLNIINLNKQQQQQQQ 328
Query: 403 FFE--------------KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCIN 448
E + RY+ + + ++Y LK + D+++A ++ +LH
Sbjct: 329 SKEIRESKTNSIIRHKLENRYIDIWQFQQ-EIYISLKVHCTAVDLIQAYLYSVAILHQTK 387
Query: 449 SSNGHRSLALKQHEYDNSNLM 469
N + L + N L+
Sbjct: 388 QQNNRAFIDLVTSKSFNDRLL 408
>gi|345492163|ref|XP_001602603.2| PREDICTED: UPF0420 protein C16orf58 homolog [Nasonia vitripennis]
Length = 466
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 100/332 (30%), Positives = 159/332 (47%), Gaps = 24/332 (7%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGA 188
SF + +P GFPDSV YVPY W L+ F G TT +++ VGV + ATP A
Sbjct: 48 SFFKEVFLPQGFPDSVHKDYVPYQIWDTLQAFASTINGTLTTHSIMRGVGVGESTATPLA 107
Query: 189 VAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
AI WILKDG G +G++IFA QG D K+ R D+L + +E+ +
Sbjct: 108 AAITWILKDGTGMIGRIIFAWWQGNNLDSQCKKWRLFADILNDAAMALEVTVPYISTFSS 167
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT--GLSIM 305
C K + + +TRT I + A +N+ DV+AK G + +L G+ G+ I+
Sbjct: 168 YTLCLTTGMKAIVGIAGGATRTAIMQHHAIKDNMADVSAKEHSQGTLVNLAGSIVGILIL 227
Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
+ I LL ++ S+Y V S+ L+TLN R + +E ++ + ++ + N
Sbjct: 228 LLINEKFFIYVCLLLVIVHIVSNYFAVTSLELNTLNEDRLCLIIEDYILNHTMSNVEDIN 287
Query: 366 LQENIFSFPWLRDKP--------VVLGSRFKDAF-------QDPDAYLAVEPFFEKERYV 410
+E++ L +KP + +G F+ +D LA+ F++ +Y+
Sbjct: 288 SRESVII---LLEKPAKKIFGFDIEIGVSFESVLKMNLINTKDVRFLLAL---FQRRKYL 341
Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYV 442
+ K++ +L A D ILKA FH V
Sbjct: 342 PVMDLKAKKIHIILSKDADDDIILKAYFHCCV 373
>gi|66816549|ref|XP_642284.1| DUF647 family protein [Dictyostelium discoideum AX4]
gi|60470350|gb|EAL68330.1| DUF647 family protein [Dictyostelium discoideum AX4]
Length = 567
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 87/276 (31%), Positives = 151/276 (54%), Gaps = 9/276 (3%)
Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV-GVSRNRA 184
+L I+ +P G+P +VT Y+ Y W +++ G V +T LL SV G+S A
Sbjct: 118 KLFDSIKKLFLPKGYPSTVTSDYIGYQKWMFIQNVIGSTTYVLSTHALLTSVIGISSTSA 177
Query: 185 TPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
P A AI+W+LKDG G ++FA + FDYD+K+ +F GDLL G +E+ T
Sbjct: 178 LPFAAAISWVLKDGLGATALVLFANKYSTLFDYDIKKYKFRGDLLHNFGVLLEMFTPFFQ 237
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL----LG 299
FLPLA +N++K +A + STR + ++F+ ENIGD+TAK + A L +G
Sbjct: 238 QFFLPLASISNLSKGLAGLIYGSTRASLNRSFSLKENIGDITAKYQSQSMAAYLSGMAIG 297
Query: 300 TGLSIMISKRNPSLITTFAL---LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
+ + +++S + ++ ++ LS ++F +KS+ L TLN+ R S+ ++++LK+
Sbjct: 298 SSIGMILSVFPLTSVSNLSMVFGLSFIHIFCGINAIKSIDLKTLNQQRTSILIDNWLKSN 357
Query: 357 HVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQ 392
+ + + N +E + + LG +++F+
Sbjct: 358 LILTSKIVNQEEKFITNDKYLKPNIKLGCTIEESFE 393
>gi|225423472|ref|XP_002274189.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
gi|297738087|emb|CBI27288.3| unnamed protein product [Vitis vinifera]
Length = 437
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 91/347 (26%), Positives = 175/347 (50%), Gaps = 22/347 (6%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP Y P+ W +L+ + +TQ LL+++GV AT W L
Sbjct: 56 VPEGFPSSVTPDYAPFQLWDSLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFL 115
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G +G ++F QG D + K R DL+ +LG ++L + P F+ + C +
Sbjct: 116 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVFIVCLGS 175
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK----RN 310
++++ V S +TR + + FA N D++AK +A ++G +++++ +
Sbjct: 176 LSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMVGMAFGMLLARITMGHS 235
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
++ +F L+ +++++Y+ V+ ++L TLN R S+ +E F++TG V S +E + E++
Sbjct: 236 VAIWFSFLSLTMFHMYANYRAVRCLILTTLNSERSSILIEHFMETGQVLSPQEVSKMEHV 295
Query: 371 ---FSFPWLRDKPVVLGSRFK--------DAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
++ W L +R D + D + E ++K +Y++ KG
Sbjct: 296 LPLWASSWSLKNAKSLHTRIHLGVRVSSLDKLEVTDLLHSAESHYKKAKYLLL--ERKGI 353
Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSN----GHRSLALKQHE 462
+ ++ + + D+L++ HA V+ + ++ S RS KQ+E
Sbjct: 354 ISVIMHKDSTASDVLQSFIHALVMANLVDKSIYIHLESRSWMDKQYE 400
>gi|354505561|ref|XP_003514836.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cricetulus griseus]
gi|344256981|gb|EGW13085.1| UPF0420 protein C16orf58-like [Cricetulus griseus]
Length = 466
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 109/361 (30%), Positives = 182/361 (50%), Gaps = 17/361 (4%)
Query: 91 DG-RRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYV 149
DG RRW +++ S + A P A A +S +R+ ++P GFPDSV+ Y+
Sbjct: 35 DGARRWGWLSGNPASEPGQD----AGGGGGPWA-ALPALSGLRAVLLPQGFPDSVSTDYL 89
Query: 150 PYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA- 208
PY W +++ F G TQ +L +GV +A+ A W+LKD G +G++IFA
Sbjct: 90 PYQLWDSVQAFASSLSGSLATQAVLQGLGVGDAKASVSAATSTWLLKDSTGMLGRIIFAW 149
Query: 209 RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTR 268
+G K D + KQ R D+L ++ +E+ P F +N+AK + V +TR
Sbjct: 150 WKGSKLDCNAKQWRLFADVLNDIAMFLEIMAPMYPIFFTMTVSISNLAKCIVGVAGGATR 209
Query: 269 TPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCGY 324
+ A+ N+ DV+AK E V N+A LL + L + + PSL + F LL+ +
Sbjct: 210 AALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDCPSLSLGCFILLTALH 269
Query: 325 VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG 384
++++Y+ V+++VL TLN +R + ++ FL+ G V + N E +++ W P +
Sbjct: 270 IYANYRAVRALVLETLNESRLQLLLKHFLQRGEVLAPASANQMEPLWTGFW----PSLSL 325
Query: 385 SRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYV 442
S + +E E +E Y++ +N S+ +V L A + +L+AA H V
Sbjct: 326 SLGVPLHHLVSSVFELEQLVEGHQEPYLLCWNQSQNQVQVALSQVAGPEAVLRAATHGLV 385
Query: 443 L 443
L
Sbjct: 386 L 386
>gi|255562731|ref|XP_002522371.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
communis]
gi|223538449|gb|EEF40055.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
communis]
Length = 523
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 90/254 (35%), Positives = 143/254 (56%), Gaps = 13/254 (5%)
Query: 136 VPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
V D F P VT +Y+ Y+ W+ L F A+ V TQ + +VG+ +R+ P A A+NW+
Sbjct: 126 VRDFFIPKQVTENYMDYVKWKLLHRVFSSALQVLATQAMFRAVGIGYSRSLPSAAALNWV 185
Query: 195 LKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAA 253
LKDG GR+ + I+ A FD +LK++RF+ +L LG G+EL T A P FL LA A
Sbjct: 186 LKDGLGRLSRCIYTASLASAFDTNLKRVRFSTSILFSLGIGLELLTPAFPKQFLLLATIA 245
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS---KRN 310
N+AK ++ +T + I+++FA +N+G+V+AK + D LG L+ +++ K N
Sbjct: 246 NIAKQISLACHLATGSAIHRSFAVADNLGEVSAKAQIQTVCFDNLGLMLAALLNMLFKNN 305
Query: 311 PSLI-----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
L+ + + S +F YQ +K V L TL + R + ++S++ G VP+ E +
Sbjct: 306 QRLLAGLPFVVYPIFSAIDLFGIYQGLKHVHLQTLTKDRLEMILDSWIALGRVPTPAEVS 365
Query: 366 LQENIFSFPWLRDK 379
+E F +LR K
Sbjct: 366 KKEG---FEFLRTK 376
>gi|440913323|gb|ELR62787.1| hypothetical protein M91_19503, partial [Bos grunniens mutus]
Length = 437
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 162/315 (51%), Gaps = 11/315 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFPDSV+P Y+PY W +++ F G T +L +GV +A+ A W++
Sbjct: 47 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 106
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KD G +G+++FA G K D + KQ R D+L ++ +E+ +P F C +N
Sbjct: 107 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCISN 166
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNP 311
+AK + V +TR + A+ N+ DV+AK E + N+A LL + L + + +P
Sbjct: 167 LAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPMVSDSP 226
Query: 312 SL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
SL + F L+ +++++Y+ V+++V+ TLN R + + FL+ G V N E +
Sbjct: 227 SLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQRGEVLGPTSANQMEPL 286
Query: 371 FSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQA 428
++ W + LG+ + ++ E +E Y++ ++ S+ +V +L A
Sbjct: 287 WTGFW-SSVSLSLGAPLHHV---TSSVSELQQLVEGHQEPYLLRWDQSRNQVQVVLSQMA 342
Query: 429 KSDDILKAAFHAYVL 443
+ IL+AA H VL
Sbjct: 343 GPEAILRAATHGLVL 357
>gi|328778772|ref|XP_394441.3| PREDICTED: UPF0420 protein C16orf58 homolog [Apis mellifera]
Length = 464
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 97/330 (29%), Positives = 162/330 (49%), Gaps = 20/330 (6%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L+S I+ +P G+PDSV P Y Y W ++ F MG TT +++ VGV ATP
Sbjct: 45 LISIIKEVFLPQGYPDSVHPDYTSYQIWDTVQAFASTIMGTLTTHSIMQGVGVGEAAATP 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
A AI WILKDG G VG+++FA G D K+ R D+L +L G+EL
Sbjct: 105 LAAAITWILKDGTGMVGRIMFAWWNGTDLDGQCKKWRLFADILNDLAMGLELLLPYFSSY 164
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTG 301
L + C + K++ V +TR + + A N+ DV+AK CV IA +G
Sbjct: 165 SLGILCISTAMKSIVGVAGGATRAALTQHQALQNNLADVSAKDGSQETCVNLIASFVGI- 223
Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
L + I ++ + L +++++Y VK++ L++LN R ++ V+S++ +P
Sbjct: 224 LILSIFHNGQYIMELYLFLVIIHLYANYSAVKALCLNSLNEDRLALIVKSYISNEVIPKP 283
Query: 362 REGNLQENIFSFPWLRDKPVVLGSRF-----------KDAFQDPDAYLAVEPFFEKERYV 410
E N +E++ + + G KD D L+++ F ++ +Y+
Sbjct: 284 EEVNKKESVLLLT--KSTMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFLDR-KYL 340
Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
++ + ++ LK A+ D+L+A FHA
Sbjct: 341 ISIDVQNKTIFICLKKDAQPYDVLEAYFHA 370
>gi|219129733|ref|XP_002185036.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403531|gb|EEC43483.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 627
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 121/453 (26%), Positives = 204/453 (45%), Gaps = 46/453 (10%)
Query: 33 ETMRISANLATSSSPIGVGGGGAAATLLQSESDETTAFRLVEDQFVDSSL----RLICSE 88
E+ R + A S+ V G +A L S ++T+ F + + R +
Sbjct: 86 ESFRDDRDSALVSASDLVFNGSTSAALTCSPEEQTSRTSQSSATFSEVDVLYGRRAVLVY 145
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPD----GFPDSV 144
D R+ V+E++R + A+ +A L FI + ++P P V
Sbjct: 146 DPLQERYVKVSEKNRVADSTKQESVAL-----RARRSSLARFITTKILPRLSLAFLPSGV 200
Query: 145 TPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV--AINWILKDGAGRV 202
T Y ++ WR L+ F + VF TQ+LL ++ + + + GA+ A+NW+LKD G++
Sbjct: 201 TNDYYRFVRWRILQRFVNANLNVFGTQSLLLALRIKSSASQLGALSAALNWVLKDALGKI 260
Query: 203 GKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAA 261
+M++A R G++FD D K+ RF + G G+E+ T P LFL A AN K ++
Sbjct: 261 VRMLWASRMGRRFDSDAKRWRFRSSFVFAAGNGLEIITYVFPSLFLLWATLANCCKQISM 320
Query: 262 VTSTSTRTPIYKAFAKG--ENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS----LIT 315
+TS+STRT IY +F G ENIGD+TAKGE I DLLG + +S+ + ++
Sbjct: 321 LTSSSTRTSIYNSFRDGSRENIGDITAKGEAQIAIVDLLGIASGVTLSRTVGTSIRAVLA 380
Query: 316 TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH------------------ 357
+ L + Y ++++V +N R V F +
Sbjct: 381 VYVTLQAIEIVCVYHQLRAVTYRVMNFERMISVVADFCQARQGPKDGLEGLAASCTTPTP 440
Query: 358 --VPSLREGNLQENIFSFP-WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYN 414
+P+ + QE IF P L + + GS A PD + F++ER+++
Sbjct: 441 AGIPTPQTLASQERIFLPPKHLTRRAIAFGS-IGRARLSPDELGTLLEIFKRERFILVVG 499
Query: 415 PSKGKVYALLKDQAKS--DDILKAAFHAYVLLH 445
+ + AK D + + + +++LH
Sbjct: 500 KNVKHPRPFMAKTAKQNEDPVSRIQENCHIVLH 532
>gi|301096265|ref|XP_002897230.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107315|gb|EEY65367.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 474
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 193/391 (49%), Gaps = 39/391 (9%)
Query: 83 RLICSEDIDGRRWNYVAERDR---SGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDG 139
R +C+ + R YV + D+ +G+ + ++ ++L +E+ +P G
Sbjct: 34 RPVCANEAPRR---YVQQGDQLLSTGQGRASNAGCLTLSGLSRELQEMF-------LPAG 83
Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
+PDSV+ Y+ + W L+ GV TQ++L SVGV ++ATP A A+ W+L+DG+
Sbjct: 84 YPDSVSEDYLGFQFWDTLQAMCSYLRGVLATQSVLQSVGVGDDKATPLAAALQWVLRDGS 143
Query: 200 GRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
G +G + FA G KFD ++K+ R D++ ++G +++ P L + C ++V K
Sbjct: 144 GMIGGLTFAYFVGPKFDVNVKRWRLFADVVNDVGLTLDMVAPYFPTLVTEVLCVSSVCKA 203
Query: 259 VAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM----ISKRNPSLI 314
+ V + +TR+ + FAK +N+ D AK L G + ++ ++
Sbjct: 204 MCGVAAGATRSSLMTHFAKRDNMADCAAKEGSQETAVKLFGLMFGMYFANAVNSSPHAVW 263
Query: 315 TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP------------SLR 362
F +L+ +V+++Y V + + T+N +R + V+ FL +G+ P S+R
Sbjct: 264 MAFLVLTLVHVYANYNAVSCLCIPTVNCSRGLILVQRFLMSGNKPGKENVADHQGRYSIR 323
Query: 363 EGNLQENIFSFPWLR-DKPVVLGSRFKDAF---QDPDAYLAVEPFFEKERYVVTYNPSKG 418
N +E IF P L +++GS A D D L + +E E Y++T
Sbjct: 324 SVNQEEPIFKDPVLPVTSHLMMGSELHQAVPTSTDLDRLLQI---YEDETYLLTV--VGD 378
Query: 419 KVYALLKDQAKSDDILKAAFHAYVLLHCINS 449
+V+ + + +AK D L+A F A ++L + S
Sbjct: 379 RVHVMFEAEAKPTDELRAFFQAVLVLQALAS 409
>gi|356502614|ref|XP_003520113.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Glycine max]
Length = 486
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 95/345 (27%), Positives = 172/345 (49%), Gaps = 18/345 (5%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVT YVP+ W L + +TQ LL ++GV AT W L
Sbjct: 106 VPEGFPSSVTADYVPFQIWDLLXGLSTYIRTMLSTQALLIAIGVEEKSATVIGATFQWFL 165
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G +G ++F QG D K R DL+ +LG ++L + P F+ + C +
Sbjct: 166 RDLTGMLGGILFTFYQGSNLDSYAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGS 225
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR---NP 311
++++ V S +TR + + FA +N D++AK +A ++G L +++++ +P
Sbjct: 226 ISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVARLTIGHP 285
Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
++ +F L+ +++++Y+ V+ + L++LN R S+ + F++TG V S + + QE+I
Sbjct: 286 LAIWFSFLSLTVFHMYANYRAVQXLTLNSLNPERSSILFQHFMETGQVLSPEQVSSQEHI 345
Query: 371 FSFPWLR---------DKPVVLGSRFK--DAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
++ K V LG R D + + L+ P++ K +Y++ KG
Sbjct: 346 LPIQFISWSSKKANCLHKKVNLGMRISSFDNMEIKEHLLSAAPYYTKAKYLLV--ERKGI 403
Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYD 464
+ ++ + + D+LK FHA VL ++ S S K +Y
Sbjct: 404 IDVIVHKDSNAADVLKLFFHALVLAKNVHKSKSVHSEGQKWIDYQ 448
>gi|348585092|ref|XP_003478306.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cavia porcellus]
Length = 468
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 174/332 (52%), Gaps = 12/332 (3%)
Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
SP A A +S +++ ++P GFPDSV+P Y+PY W +++ F T ++ +G
Sbjct: 62 SPMA-ASPPVSGLKAVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSSALATHAVMLGLG 120
Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
V A+ A W+++D G +G+++FA +G K D ++KQ R D+L + +E+
Sbjct: 121 VGNAEASLAAATTTWLVRDSTGMLGRIVFAWWKGSKLDCNVKQWRLFADILNDAAMFLEI 180
Query: 238 ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNI 294
A +P F +N+AK + +V +TR + A+ N+ DV+AK E + N+
Sbjct: 181 AAPTLPFCFTVTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNL 240
Query: 295 ADLLGTGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
A LL + + + + PS + F LL+ +++++Y+ V+++V+ TLN +R + ++ FL
Sbjct: 241 AGLLFSLVMLPLVSGCPSFSLGCFFLLTGLHIYANYRAVRALVMETLNESRLQLVLKHFL 300
Query: 354 KTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVV 411
+ G + N E +++ W + LG D + ++ E +E Y++
Sbjct: 301 QRGEILDTTSANQMEPLWTGFW-PSLSLSLGVPLHDLI---SSVSELQQLVEGHQEPYLL 356
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
++N S+ KV +L A + +L+AA H +L
Sbjct: 357 SWNQSRNKVQVVLSHVAGPEAVLRAATHGLLL 388
>gi|357120354|ref|XP_003561892.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
Length = 452
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 104/377 (27%), Positives = 183/377 (48%), Gaps = 27/377 (7%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W L+ + +TQ LL+++GV AT W L
Sbjct: 72 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLSAIGVGEKSATVIGATFQWFL 131
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G +G ++FA QG D + K R D + +LG ++L + P + + C +
Sbjct: 132 RDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 191
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK--RNPS 312
++++ V S +TR + + FA +N D++AK +A + G GL ++++ R
Sbjct: 192 LSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMSGMGLGMLLAHVTRGHD 251
Query: 313 LI--TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
L+ +F L+ +++++Y+ V+S+ L TLN R S+ ++ FL+ G V + ++ + QE+I
Sbjct: 252 LVVWVSFLSLTIFHMYANYKAVQSLSLVTLNYERTSILLQYFLEDGEVRTPQQVSKQEHI 311
Query: 371 FSF--PWLR-------DKPVVLGSRFKDAFQDPDAYL--AVEPFFEKERYVVTYNPSKGK 419
F W R + V LG++ D L ++ Y + +G
Sbjct: 312 LPFWSSWRRLLRTKLPHELVHLGAK-ASMLTHSDMLLISKTRSYYTDANYFLL--DKEGS 368
Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEY--DNSNLMLTTADLEA 477
V+ + QA + D+LK+ H VL S +S + H++ D N+ ++ +E
Sbjct: 369 VHIFIHKQAAATDVLKSFVHGLVLARLTRKS---KSCHAEAHQWMDDKYNIFISKLKMEG 425
Query: 478 HIGE---SCKVVSTALW 491
+ E S +V A W
Sbjct: 426 YSTERLLSHSIVWRAHW 442
>gi|356524511|ref|XP_003530872.1| PREDICTED: uncharacterized protein LOC100783222 [Glycine max]
Length = 513
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 84/253 (33%), Positives = 145/253 (57%), Gaps = 12/253 (4%)
Query: 130 FIRSY--VVPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
++R++ VV D F P VT +YV Y+ W+ L F A+ V TQ + ++GV + + P
Sbjct: 104 WLRAFGSVVWDFFIPRGVTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSNSLP 163
Query: 187 GAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
A A+NW+LKDG GR+ + I+ A FD +LK++RF +L G+EL T A P
Sbjct: 164 SAAALNWVLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAFPRC 223
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
FL LA AN++K ++ +TR+ ++++FA G+N+G+++AK + D+LG L+ +
Sbjct: 224 FLLLATIANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLMLAAL 283
Query: 306 IS------KRNPSLITTFA--LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
++ +R + + F + +F YQ +K+V L TL + R + + ++++ G+
Sbjct: 284 VNLWIENHRRQQAGLHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIECGY 343
Query: 358 VPSLREGNLQENI 370
VPS E + +E I
Sbjct: 344 VPSPAEVSEKEGI 356
>gi|431902902|gb|ELK09111.1| hypothetical protein PAL_GLEAN10000828 [Pteropus alecto]
Length = 468
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 95/327 (29%), Positives = 169/327 (51%), Gaps = 11/327 (3%)
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
A +S + + +P GFPDSV+P Y+PY W +++ F G T +L +GV
Sbjct: 66 ASPFLSRLLTVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAN 125
Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
A+ A W++KD AG +G+++FA +G K D + KQ R D+L ++ +E+
Sbjct: 126 ASVSAATATWLVKDSAGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIY 185
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLG 299
P F + C +N+AK + +V +TR + A+ N+ DV+AK E + N+A LL
Sbjct: 186 PICFTMIVCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLV 245
Query: 300 TGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
+ L + + PS + F LL+ +++++Y+ V+++V+ TLN R + ++ FL++G V
Sbjct: 246 SLLMLPLVSECPSFSLGCFFLLTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQSGEV 305
Query: 359 PSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPS 416
N E +++ W + LG ++ E +E Y++ ++ S
Sbjct: 306 LDPTSANKMEPLWTGFW-PSPSLFLGVPLHRLISR---VFELQQLIEGHQEPYLLHWDQS 361
Query: 417 KGKVYALLKDQAKSDDILKAAFHAYVL 443
+ +V +L A ++ IL+AA H VL
Sbjct: 362 QNQVQVVLSQMAGTETILRAATHGLVL 388
>gi|348671592|gb|EGZ11413.1| hypothetical protein PHYSODRAFT_338124 [Phytophthora sojae]
Length = 478
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 96/338 (28%), Positives = 168/338 (49%), Gaps = 26/338 (7%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
++ +P G+PDSV+ Y+ + W L+ GV TQ++L SVGV ++ATP A A
Sbjct: 79 LQEMFLPAGYPDSVSEDYLSFQFWDTLQAMCSYLRGVLATQSVLQSVGVGDDKATPLAAA 138
Query: 191 INWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
+ W+L+DG+G +G + FA G KFD ++K+ R D++ ++G +++ P L +
Sbjct: 139 LQWVLRDGSGMIGGLTFAYLVGPKFDVNVKRWRLFADVINDVGLTLDMVAPYFPTLVTEV 198
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM---- 305
C ++V K + V + +TR+ + FAK +N+ D AK L G +
Sbjct: 199 LCVSSVCKTMCGVAAGATRSSLMTHFAKKDNMADCAAKEGSQETAVKLFGLMFGMYFANA 258
Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP------ 359
++ ++ F LL+ +V+++Y V + + T+N +R + V FL +G+ P
Sbjct: 259 VNSSPQAVWVAFLLLTVIHVYANYNAVSCLCIPTVNCSRGLILVRRFLASGNKPGKENVA 318
Query: 360 ------SLREGNLQENIFSFPWLR-DKPVVLGSRFKDAF---QDPDAYLAVEPFFEKERY 409
S+R N +E IF P L +V+G+ A D D L + + E Y
Sbjct: 319 DHNGRYSIRSVNQEEPIFKDPVLPVTSHLVMGAELHQAVPSSHDLDRLLHI---YSDEAY 375
Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCI 447
++T +V + + +AK D L+A F A ++L +
Sbjct: 376 LLTV--VGDRVLVMFQAEAKPKDELRAFFQAVLVLEAL 411
>gi|219119716|ref|XP_002180612.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408085|gb|EEC48020.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 408
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/325 (31%), Positives = 164/325 (50%), Gaps = 18/325 (5%)
Query: 134 YVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN--RATPGAVAI 191
+ +P FP SV P Y + + L G A V +TQTLL ++GV N A+ A A+
Sbjct: 29 HFLPAQFPASVAPGYAQFCGFCFLASVSGSAGMVLSTQTLLLAIGVVGNAHSASVMAGAL 88
Query: 192 NWILKDGAGRVGKMIFA-RQG--KKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFL 247
NW+LKDG G++G +IFA R G ++FD + K+ R G L ++ + +++ + P + L
Sbjct: 89 NWVLKDGIGQLGGVIFASRMGETRRFDANPKKWRMMGALCLDSASLLDILSPFAPSAVVL 148
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
PLAC AN KN+ +T ++R +++A + N+GDVTAK A LLGTG+ I +S
Sbjct: 149 PLACFANALKNIGYLTVGASRAALHQALTRAGNLGDVTAKAGSQSIAAGLLGTGVGIGLS 208
Query: 308 KRNPSLITTFALLSCG----YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
F L CG + ++ ++ V LH NR R + ++ +L+ G V S RE
Sbjct: 209 AAMGHDAGNFVLGFCGLSLIHQLCNFAALQHVDLHHFNRHRLYLVLDHYLRHGKVLSPRE 268
Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQ-----DPDAYLAVEPFFEKERYVVTYNPSKG 418
+E+ FP LR+ A P+ + E Y++ + G
Sbjct: 269 IATKESF--FPLLREDDTASWLSIGSALTAISPGGPNELTQLLKACPDESYIINCT-NDG 325
Query: 419 KVYALLKDQAKSDDILKAAFHAYVL 443
K++ + A+ +D++K FHA L
Sbjct: 326 KIHVVFLQDAQGEDLVKGMFHACFL 350
>gi|395514351|ref|XP_003761381.1| PREDICTED: UPF0420 protein C16orf58 homolog [Sarcophilus harrisii]
Length = 455
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 164/320 (51%), Gaps = 13/320 (4%)
Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
++ +P GFPDSV+P Y+PY W ++ F G TQ +L VGV A+ A I
Sbjct: 62 QAVFLPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVLQGVGVGDAAASVSAATI 121
Query: 192 NWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA 250
W+LKDG G +G++ FA G K D D K+ R D+L + +E+ A P F +
Sbjct: 122 TWLLKDGTGMLGRIGFAWLMGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPSCFTLIV 181
Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMIS 307
C +++AK + V +TR + A+ +N+ DV+AK E + N+A LL + L + +
Sbjct: 182 CVSSLAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLVSLLLLPLV 241
Query: 308 KRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
P L + F L + +++++Y+ V++V++ T N R + ++ F++ G V + N
Sbjct: 242 SERPRLSLAAFFLFTALHLYANYRAVRAVIMETFNETRLYLVLKHFIQWGEVLTPPSANR 301
Query: 367 QENIFS--FPWLR-DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
E +++ P L V L S AF+ L ++ Y++ +N +GK+
Sbjct: 302 MEPLWTGCLPALSLSLGVALHSLVSSAFE-----LQQVSEGHQQPYLLRWNQCRGKIQVT 356
Query: 424 LKDQAKSDDILKAAFHAYVL 443
L +A + IL+AA H +L
Sbjct: 357 LGQEASAGVILRAATHGLLL 376
>gi|78486556|ref|NP_001030594.1| UPF0420 protein C16orf58 homolog [Rattus norvegicus]
gi|123781627|sp|Q499P8.1|CP058_RAT RecName: Full=UPF0420 protein C16orf58 homolog
gi|71682994|gb|AAH99813.1| Similar to cDNA sequence BC017158 [Rattus norvegicus]
Length = 466
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/393 (27%), Positives = 188/393 (47%), Gaps = 24/393 (6%)
Query: 66 ETTAFR--LVEDQFVDSSLRLICSEDIDGR-RWNYVAERDRSGKFKRNSIRAISLQS--- 119
+T + R L +QF + R CS DG +W+ R G F R SI
Sbjct: 3 DTASLRAPLCTEQFGSGAPR-SCSAAADGSLQWDRA---QRWGWFSRASITKPGQHEGGG 58
Query: 120 --PQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
P A A +S +RS ++P GFPDSV+P Y+ Y W +++ F G TQ +L +
Sbjct: 59 RGPWA-ALTTLSGLRSVLLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGL 117
Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVE 236
GV +A+ A W++KD G +G++IFA +G K D + KQ R D+L + +E
Sbjct: 118 GVGNAKASVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLE 177
Query: 237 LATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 296
+ P F +N+AK + V +TR + A+ N+ DV+AK + +
Sbjct: 178 IMAPMYPIFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVN 237
Query: 297 LLGTGLSIMISKRNPSLIT----TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
L G +S+++ ++ F LL+ +++++Y+ V+++VL TLN +R + ++ F
Sbjct: 238 LAGLLVSLLMLPLVSDCLSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHF 297
Query: 353 LKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYV 410
L+ G V N E +++ W P + S + ++ E +E Y+
Sbjct: 298 LQRGEVLEPASANQMEPLWTGFW----PSLSLSLGVPLHHLVSSVSELKQLVEGHQEPYL 353
Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ +N S+ +V L A + +L+AA H +L
Sbjct: 354 LCWNQSQNQVQVALSQVAGPETVLRAATHGLIL 386
>gi|356566391|ref|XP_003551415.1| PREDICTED: uncharacterized protein LOC100783455 [Glycine max]
Length = 525
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/246 (33%), Positives = 139/246 (56%), Gaps = 10/246 (4%)
Query: 135 VVPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINW 193
VV D F P VT +YV Y+ W+ L F A+ V TQ + ++GV + + P A A+NW
Sbjct: 122 VVWDFFIPRGVTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSCSLPSAAALNW 181
Query: 194 ILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA 252
+LKDG GR+ + I+ A FD +LK++RF +L G+EL T A P FL LA
Sbjct: 182 VLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAFPRCFLLLATI 241
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS----- 307
AN++K ++ +TR+ ++++FA G+N+G+++AK + D+LG L+ +++
Sbjct: 242 ANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLMLAALVNLWIEN 301
Query: 308 -KRNPSLITTFA--LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
+R + F + +F YQ +K+V L TL + R + + ++++ G+VPS E
Sbjct: 302 HRRQQAGFHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIECGYVPSPAEV 361
Query: 365 NLQENI 370
+ +E I
Sbjct: 362 SEKEGI 367
>gi|149067630|gb|EDM17182.1| similar to cDNA sequence BC017158, isoform CRA_b [Rattus
norvegicus]
Length = 466
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/393 (27%), Positives = 189/393 (48%), Gaps = 24/393 (6%)
Query: 66 ETTAFR--LVEDQFVDSSLRLICSEDIDGR-RWNYVAERDRSGKFKRNSIRAISL----- 117
+T + R L +QF + R CS DG +W+ R G F R SI
Sbjct: 3 DTASLRAPLCTEQFGSGAPR-SCSAAADGSLQWDRA---QRWGWFSRASITKPGQHEGGG 58
Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
+ P A A +S +RS ++P GFPDSV+P Y+ Y W +++ F G TQ +L +
Sbjct: 59 RGPWA-ALTTLSGLRSVLLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGL 117
Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVE 236
GV +A+ A W++KD G +G++IFA +G K D + KQ R D+L + +E
Sbjct: 118 GVGNAKASVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLE 177
Query: 237 LATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 296
+ P F +N+AK + V +TR + A+ N+ DV+AK + +
Sbjct: 178 IMAPMYPIFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVN 237
Query: 297 LLGTGLSIMISKRNPSLIT----TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
L G +S+++ ++ F LL+ +++++Y+ V+++VL TLN +R + ++ F
Sbjct: 238 LAGLLVSLLMLPLVSDCLSLSFGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHF 297
Query: 353 LKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYV 410
L+ G V N E +++ W P + S + ++ E +E Y+
Sbjct: 298 LQRGEVLEPASANQMEPLWTGFW----PSLSLSLGVPLHHLVSSVSELKQLVEGHQEPYL 353
Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ +N S+ +V L A + +L+AA H +L
Sbjct: 354 LCWNQSQNQVQVALSQVAGPETVLRAATHGLIL 386
>gi|412993492|emb|CCO14003.1| predicted protein [Bathycoccus prasinos]
Length = 584
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 161/333 (48%), Gaps = 39/333 (11%)
Query: 140 FPD--SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
FP SVT Y Y +R + M V TQ +L ++G+ +R+ P A A+NW+LKD
Sbjct: 173 FPQKTSVTSDYWDYAKYRFTQRIASSCMTVLATQQMLAAIGLGASRSIPAAAALNWVLKD 232
Query: 198 GAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA 256
G GR+GK+ A G+ FD D+K+LRF+ L+ +E T P FL +A ANV
Sbjct: 233 GLGRLGKLSVATNFGRSFDSDVKRLRFSSSLVYTGAVFIETITPFFPKHFLAIATVANVG 292
Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLIT- 315
K++ T R PI K FA EN+G++ AK +AD +G L + + K ++
Sbjct: 293 KSIGITTGNVVRPPIQKTFALEENLGEIAAKTSAQQVLADNIGLALGVSVMKFQTKFVSL 352
Query: 316 ---------TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
F +L+ + S ++++KSV L T+N+ R + E+F++ + + +
Sbjct: 353 ALQRAVPLALFPILASIDLVSIHKQLKSVQLRTVNKERAEIIAEAFVREKKLATQLQVAE 412
Query: 367 QENIFSFPWLRDKPVV----------------LGSRFKDAFQDPDAYLAVEPFFEKERYV 410
+E + FP D + L S FKD P L+ EP+ +K+R V
Sbjct: 413 KERLL-FPARLDASALPLRVTSLGNACSTLDALASAFKDDVHRP-YVLSYEPYVQKKRTV 470
Query: 411 VT------YNPSKGKVYAL--LKDQAKSDDILK 435
++ P K K AL L +A S D+L+
Sbjct: 471 LSGIFMRDRKPLKYKGRALLALSHRATSKDVLQ 503
>gi|449484579|ref|XP_004156921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 443
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 85/241 (35%), Positives = 134/241 (55%), Gaps = 5/241 (2%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++P+GFPDSVT Y+ Y WR ++ GV TQ LL +VG+ + A P A A+NW+
Sbjct: 181 MLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKG-AIPTAAAVNWV 239
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K+ ++ G+ FD K R DLL G+E+ T A P F+ + AA
Sbjct: 240 LKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAG 299
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS-- 312
++ AA+ +TR+ Y FA N +V AKGE G ++ +G L I ++ R S
Sbjct: 300 AGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSST 359
Query: 313 --LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ F++++ ++F + + KS+ L TLN R S+ +L +G VPS+++ N +E +
Sbjct: 360 SLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPL 419
Query: 371 F 371
F
Sbjct: 420 F 420
>gi|449460628|ref|XP_004148047.1| PREDICTED: UPF0420 protein C16orf58-like [Cucumis sativus]
Length = 523
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 83/246 (33%), Positives = 138/246 (56%), Gaps = 12/246 (4%)
Query: 136 VPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
V D F P +V+ YV Y+ W+ L F A+ V TQ + ++GV +R+ A A+NW+
Sbjct: 120 VKDFFIPKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAIGVGHSRSLASAAALNWV 179
Query: 195 LKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAA 253
LKDG GR+ + +F A FD +LK++RF+ +L L GVEL T A P FL LA A
Sbjct: 180 LKDGLGRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVELLTPAFPQYFLLLASIA 239
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKR 309
N+ K ++ ST + ++++FA +N+G+V+AK + C N+ +L L+ + SK
Sbjct: 240 NIVKQISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFDNLGLVLAAFLNFL-SKN 298
Query: 310 NPSL-----ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
+ L + + + +F +YQ +K V L TL + R + + ++++ G+VP+ E
Sbjct: 299 DRRLQAALPFVVYPIFAAMDLFGTYQGLKHVHLQTLTKDRLEIILSNWIEQGYVPTPAEV 358
Query: 365 NLQENI 370
+ +E I
Sbjct: 359 SEREGI 364
>gi|74215416|dbj|BAE41911.1| unnamed protein product [Mus musculus]
Length = 337
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 158/312 (50%), Gaps = 18/312 (5%)
Query: 72 LVEDQFVDSSLRLICSEDIDG-------RRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
L +QF + R CS DG RRW +++ + A P A A
Sbjct: 11 LCTEQFGSGAPR-GCSAAADGSLQWDGARRWGWLSR----APIAKPGQHAGGGGGPWA-A 64
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+S +RS ++P GFPDSV+P Y+PY W +++ F G TQ +L +GV +A
Sbjct: 65 LTALSGLRSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKA 124
Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ A W++KD G +G++IFA +G K D + KQ R D+L ++ +E+ P
Sbjct: 125 SVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYP 184
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGT 300
F +N+AK + V +TR + A+ N+ DV+AK E V N+A LL +
Sbjct: 185 IFFTMTVSTSNLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVS 244
Query: 301 GLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
L + + PSL + F LL+ +++++Y+ V+++VL TLN +R + +E FL+ G V
Sbjct: 245 LLMLPLVSDCPSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVL 304
Query: 360 SLREGNLQENIF 371
N E ++
Sbjct: 305 EPASANQMEPLW 316
>gi|449509553|ref|XP_004163622.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58-like
[Cucumis sativus]
Length = 523
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 80/240 (33%), Positives = 135/240 (56%), Gaps = 11/240 (4%)
Query: 141 PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAG 200
P +V+ YV Y+ W+ L F A+ V TQ + ++GV +R+ A A+NW+LKDG G
Sbjct: 126 PKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAIGVGHSRSLASAAALNWVLKDGLG 185
Query: 201 RVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNV 259
R+ + +F A FD +LK++RF+ +L L GVEL T A P FL LA AN+ K +
Sbjct: 186 RLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVELLTPAFPQYFLLLASIANIVKQI 245
Query: 260 AAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKRNPSL-- 313
+ ST + ++++FA +N+G+V+AK + C N+ +L L+ + SK + L
Sbjct: 246 SLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFDNLGLVLAAFLNFL-SKNDRRLQA 304
Query: 314 ---ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ + + +F +YQ +K V L TL + R + + ++++ G+VP+ E + +E I
Sbjct: 305 ALPFVVYPIFAAMDLFGTYQGLKHVHLQTLTKDRLEIILSNWIEQGYVPTPAEVSEREGI 364
>gi|149725803|ref|XP_001495510.1| PREDICTED: UPF0420 protein C16orf58 homolog [Equus caballus]
Length = 468
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 189/385 (49%), Gaps = 18/385 (4%)
Query: 71 RLVEDQFVDSSLRLICSEDIDGR-RWNYVAER--DRSGKF--KRNSIRAISLQSPQAPAE 125
RL +Q+ + R C+ +DG +W R SG F K +P A A
Sbjct: 10 RLCSEQYGSGAARGCCAA-VDGSLQWEAWGWRWWGFSGAFTAKPEGPDGGGEGAP-ATAS 67
Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
+S + + +P GFPDSV+P Y+PY W +++ F G T +L +GV +A+
Sbjct: 68 APLSRLLALFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGDAKAS 127
Query: 186 PGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
A W++KD G +G++IFA +G K D + K R D+L ++ +E+ P
Sbjct: 128 VSAATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKHWRLFADILNDVAMFLEIMAPVYPI 187
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTG 301
F + C +N+AK + +V +TR + A+ N+ DV+AK E + N+A LL +
Sbjct: 188 CFTMIVCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSL 247
Query: 302 LSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L + + PS + F L + +++++Y+ V+++V+ TLN R + ++ FL+ G V
Sbjct: 248 LMLPLVSGCPSFSLGCFFLFTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQRGEVLG 307
Query: 361 LREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKG 418
N E +++ W P V S + + ++ E +E Y++ +N S+
Sbjct: 308 PTSANRMEPLWTGFW----PSVSLSLGVPLHRLISSVSELQQLVEGHQEPYLLRWNQSQN 363
Query: 419 KVYALLKDQAKSDDILKAAFHAYVL 443
+V +L A + +L+AA H VL
Sbjct: 364 QVQVVLSQMAGPETVLRAATHGLVL 388
>gi|345801540|ref|XP_547054.3| PREDICTED: UPF0420 protein C16orf58 homolog [Canis lupus
familiaris]
Length = 468
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 94/315 (29%), Positives = 163/315 (51%), Gaps = 11/315 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFPDSV+P Y+PY W +++ F G T +L +GV A+ A W++
Sbjct: 78 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAEASVSAATATWLV 137
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KD G +G++IFA +G K D + KQ R D+L ++ +E+ P F C +N
Sbjct: 138 KDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTMTVCISN 197
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNP 311
+AK + +V +TR + A+ N+ DV+AK E + N+A LL + L + + P
Sbjct: 198 LAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSACP 257
Query: 312 SL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
S ++ F L+ +++++Y+ V+++VL TLN R + ++ FL+ G V N E +
Sbjct: 258 SFSLSCFFFLTALHIYANYRAVRALVLETLNEDRLRLVLKHFLQRGEVLDPTSANQMEPL 317
Query: 371 FSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQA 428
++ W P + S + + ++ E +E Y++ ++ SK +V +L +A
Sbjct: 318 WTGFW----PSLSLSLGVPLHRLISSVFELQQLVEGHQEPYLLHWDKSKSQVQVVLSQKA 373
Query: 429 KSDDILKAAFHAYVL 443
+ IL+AA H VL
Sbjct: 374 GPETILRAATHGLVL 388
>gi|198431163|ref|XP_002119233.1| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Ciona
intestinalis]
Length = 414
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 162/348 (46%), Gaps = 33/348 (9%)
Query: 117 LQSPQAPAEELMSF------IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT 170
L Q+P + F + +P G+P+SV+ Y+ Y W ++ F G T
Sbjct: 29 LVQKQSPDKSFFKFSGFTQIFKDIFLPKGYPESVSEDYLSYQIWDTVQAFCSSITGTLAT 88
Query: 171 QTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLM 229
+L GV A + I W+LKDG G +G++ FA +G D + KQ R D++
Sbjct: 89 HAVLKGSGVGDETANVASATITWLLKDGTGMLGRIAFAYFKGSSLDCNAKQWRLFADVMN 148
Query: 230 ELGAGVELATAAVPHLFLPLA-CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG 288
+ ++L +P + L C A +A ++ V TR + AK N+ DV+AK
Sbjct: 149 DCAIFIQLVAPVLPKAYFTLVMCLAGLAFSLVGVAGGCTRAALTMHQAKCNNMADVSAKD 208
Query: 289 ---ECVGNIADLLGTGLSIM--ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRA 343
E + N+A LL TGL IM ++ P + F + + +VF++Y+ V SVV+ NR
Sbjct: 209 GSQETLVNLAALL-TGLVIMPIVATNIPLMWLLFLIFTFLHVFANYKAVASVVMELFNRN 267
Query: 344 RFSVAVESFLKTGHVPSLREGNLQENIF---SFPWLRDKPVVLGSRFKDAFQDPDAYLAV 400
R S ++ F V + +E N +E IF S+ WL LG + + D L
Sbjct: 268 RLSKVIKVFCDMNKVLTPKEANEKEAIFFPPSYSWLN-----LGCKLE------DLGLTS 316
Query: 401 EPFFEK-----ERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
F +K RY++ N G++ L + DIL+A + +Y+L
Sbjct: 317 SEFHQKVDDGGVRYLIKRNQKTGQIMVSLNHSCTASDILEAFYESYLL 364
>gi|414586418|tpg|DAA36989.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 297
Score = 144 bits (364), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 89/263 (33%), Positives = 142/263 (53%), Gaps = 11/263 (4%)
Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
D +G R SI +Q + ++L+ + P G+P SV Y+ Y +R L+HF
Sbjct: 40 DAAGSGSR-SISVKVIQDSRPIHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFS 98
Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
+ V +TQ+LL + G+ R TP A A++WILKDG GK+I + G + D + K
Sbjct: 99 SAMLHVLSTQSLLFAAGL---RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKS 155
Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
R D+L +LG +E+ + P LFL +A N AK +A V + +TR PIY +FAK N
Sbjct: 156 WRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 215
Query: 281 IGDVTAKGECVGNIADLLGTGLSI-----MISKRNPSLITTFALLSCGYVFSSYQEVKSV 335
+ D+ AKGE + + +++G G I + S LI LLS +++ QE+++
Sbjct: 216 LSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG-PLLSAVHIYGVIQEMRAT 274
Query: 336 VLHTLNRARFSVAVESFLKTGHV 358
++TLN R ++ V F+K ++
Sbjct: 275 PVNTLNPQRTAMIVADFIKVRYL 297
>gi|357487621|ref|XP_003614098.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
gi|355515433|gb|AES97056.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
Length = 749
Score = 144 bits (364), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 171/330 (51%), Gaps = 19/330 (5%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
F++++V P+GFP SVTP YVP+ W L+ + +TQ LL+++GV AT
Sbjct: 363 FLQAFV-PEGFPSSVTPDYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGA 421
Query: 190 AINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
W L+D G +G ++F QG D + K R DL+ +LG ++L + P F+
Sbjct: 422 TFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVF 481
Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK 308
+ C +++++ V S +TR + + FA +N D++AK +A ++G L +++++
Sbjct: 482 IVCLGSISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVAR 541
Query: 309 RNP----SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
++ +F L+ +++++Y+ V+ + L++LN R S+ ++ F+++ V S ++
Sbjct: 542 ITTGHPLAIWFSFLSLTLFHMYANYRAVQCLSLNSLNPERSSILLQRFMESRQVLSPKQV 601
Query: 365 NLQENIFSFPWLR---------DKPVVLGSRFK--DAFQDPDAYLAVEPFFEKERYVVTY 413
+ E++ D V LG+R D + + L++ ++ K +Y++
Sbjct: 602 SPLEHVLPIQLTSMSSKKVNSLDSKVRLGTRISSFDEMEIKEHLLSLASYYAKAKYILV- 660
Query: 414 NPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
KG + ++ + S D+LK+ HA VL
Sbjct: 661 -EKKGIIDVIVHKDSSSADVLKSFIHALVL 689
>gi|197101105|ref|NP_001126133.1| UPF0420 protein C16orf58 homolog [Pongo abelii]
gi|75041556|sp|Q5R8F6.1|CP058_PONAB RecName: Full=UPF0420 protein C16orf58 homolog
gi|55730464|emb|CAH91954.1| hypothetical protein [Pongo abelii]
Length = 468
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 181/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K + + +P AP+ L S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDSGEVGAPGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASGLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKGECVGNIADLLG----TGLSIMISKRNPSLITTFALLSCG 323
R + A+ N+ DV+AK + +L+G + ++S + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+++V+ TLN R + ++ +L+ G V + N E +++ W P
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLNPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ ++ S+ +V +L +A IL+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|395842960|ref|XP_003794274.1| PREDICTED: UPF0420 protein C16orf58 homolog [Otolemur garnettii]
Length = 468
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 97/326 (29%), Positives = 168/326 (51%), Gaps = 17/326 (5%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+S +++ +P GFPDSV+P Y+PY W +++ F G TQ +L +GV +A+
Sbjct: 70 LSGLQAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAKASVS 129
Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A W++KD G +G+++FA +G K D + KQ R D+L ++ +E+ P F
Sbjct: 130 AATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPICF 189
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
+N+AK + +V +TR + A+ N+ DV+AK E + N+A LL + L
Sbjct: 190 TMTVSTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLI 249
Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ + PS + F L+ +++++Y+ V+++V+ TLN+ R + ++ FL+ G V
Sbjct: 250 LPLVSDCPSFSLGCFFFLTALHIYANYRAVRAIVMETLNQGRLRLVLKHFLQRGEVLDPT 309
Query: 363 EGNLQENIFSFPWLRDKP-----VVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSK 417
N E +++ R P V L F D VE +E Y++ ++ S+
Sbjct: 310 SANKMEPLWTG--FRSSPSLSLGVPLHHLISSVF---DLQQLVEG--HQEPYLLYWDQSR 362
Query: 418 GKVYALLKDQAKSDDILKAAFHAYVL 443
+V +L A S+ IL+AA H VL
Sbjct: 363 NQVQVVLSQMAASETILRAATHGLVL 388
>gi|357505287|ref|XP_003622932.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
gi|355497947|gb|AES79150.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
Length = 590
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 132/236 (55%), Gaps = 10/236 (4%)
Query: 135 VVPDGF-PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINW 193
VV D F P V +YV ++ W+ L F A+ V TQ + ++GV + + P A A+NW
Sbjct: 110 VVRDFFIPREVNENYVDFVKWKLLHRVFSSALQVLATQAMFTAIGVGYSSSLPSAAALNW 169
Query: 194 ILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA 252
+LKDG GR+ K I+ A FD +LK++RF +L G+EL T P FL LA
Sbjct: 170 VLKDGLGRLSKCIYTASLASAFDTNLKRVRFTTSVLFVGSIGLELLTPTFPRFFLLLATI 229
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI-----S 307
AN+AK ++ +TR+ ++++FAK +N+G+++AK + D+LG L+ ++ +
Sbjct: 230 ANIAKQISLACYLATRSAVHQSFAKADNLGEISAKAQIQTVCFDILGLMLAALVNMWIGN 289
Query: 308 KRNPSL---ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
+ P + L + +F YQ +K V L TL + R + ++++++ G+VPS
Sbjct: 290 HQRPQAGLHFFVYPLFAFMDLFGIYQGLKHVHLKTLTKDRLEIILKTWIEFGYVPS 345
>gi|344294431|ref|XP_003418921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Loxodonta africana]
Length = 468
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 94/321 (29%), Positives = 167/321 (52%), Gaps = 7/321 (2%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+S + + +P GFPDSV+P Y+PY W +++ F G TQ +L +GV +A+
Sbjct: 70 LSGLLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGALATQAVLLGIGVGDAKASVS 129
Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A W++KD G +G++IFA +G K D + KQ R D+L ++ +E+ +P F
Sbjct: 130 AATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVLPICF 189
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
C +N+AK + +V +TR + A+ N+ DV+AK E + N+A LL + L
Sbjct: 190 TITVCTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ + PS+ + F L+ +++++Y+ V+++V+ TLN R + ++ FL+ G V S
Sbjct: 250 LPLVSDCPSVSLGCFFFLTALHIYANYRAVRALVMETLNEGRLWLILKHFLQKGEVLSPT 309
Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
N E +++ W + LG + L +E Y++ ++ S+ +V
Sbjct: 310 SANQMEPLWTGFW-PSLSLSLGVPLH-CLISTVSELQQLIGGHQEPYLLHWDQSRNQVQV 367
Query: 423 LLKDQAKSDDILKAAFHAYVL 443
+L A + +L+AA H VL
Sbjct: 368 VLSQTAGPETVLRAATHGLVL 388
>gi|417401458|gb|JAA47615.1| Hypothetical protein [Desmodus rotundus]
Length = 468
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 166/323 (51%), Gaps = 11/323 (3%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+S + + +P GFPDSV+P Y+ Y W +++ F G T +L +GV +A+
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A WI+KD G +G++IFA +G K D + KQ R D+L ++ +E+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCF 189
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
C +N+AK V +V +TR + A+ N+ DV+AK E + N+A LL + L
Sbjct: 190 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ + PS + F LL+ +V+++Y+ V+++V+ TLN R + ++ FL+ G V
Sbjct: 250 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 309
Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKV 420
N E +++ W + LG+ + ++ E +E Y++ ++ S+ +V
Sbjct: 310 SANQMEPLWTGFW-PSLSLSLGAPLHRLI---SSVFELQQLVEGHREPYLLHWDQSQNRV 365
Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
+L A + IL+AA H VL
Sbjct: 366 QVVLSQMAGPETILRAATHGLVL 388
>gi|307191039|gb|EFN74793.1| UPF0420 protein C16orf58-like protein [Camponotus floridanus]
Length = 825
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 90/337 (26%), Positives = 163/337 (48%), Gaps = 22/337 (6%)
Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
Q+ ++S ++ +P G+PDSV Y Y W ++ M T +++ +GV
Sbjct: 394 QSICSRIVSIVKEIFLPQGYPDSVHSDYTAYQIWDTIQASASTIMNTLATHSIMIGIGVG 453
Query: 181 RNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELAT 239
+ ATP A AI WILK+GAG VG ++FA G K D K+ R D+L ++ AG+EL
Sbjct: 454 ESNATPLAAAITWILKNGAGMVGSIMFAWWNGTKLDGQCKKWRLFADILNDMAAGMELLV 513
Query: 240 AAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIA 295
+ + + C + + K++ V +TR + + A N+ DV AK CV IA
Sbjct: 514 SYFSSYSVVILCISTIMKSMVGVAGGATRVALTQHQAIRNNLADVAAKDGSQEMCVNLIA 573
Query: 296 DLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
+G + + R LI + L +++++Y VK + L LN R ++ ++ ++
Sbjct: 574 SFIGIFILSLFHDR-QYLIELYLFLVAVHLYANYSAVKVLCLDILNEDRLALIIKYYMIN 632
Query: 356 GHVPSLREGNLQENIFSFPWLRDKP--------VVLGSRFKDAF-QDPDAYLAVE---PF 403
+P + N +E++ L D P + +G F ++ + + +E F
Sbjct: 633 ERIPEPKRLNKEESVS----LLDNPTKSICGFNIKIGVSFASVLRKNVISSMEMEFLLKF 688
Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
FE +Y++ + ++ +LK ++ +IL+A F+A
Sbjct: 689 FEDRKYLIAIDVKNRNIFVILKKDIQATEILEAYFYA 725
>gi|145349487|ref|XP_001419164.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579395|gb|ABO97457.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 269
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 123/233 (52%), Gaps = 13/233 (5%)
Query: 141 PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAG 200
P SVT Y Y +R + + VF TQ +L +VG+ +R P A A+NW+LKDG G
Sbjct: 24 PSSVTNDYWEYAKFRFAQRIASSCISVFATQQMLTAVGLGASRTLPAAAAVNWVLKDGLG 83
Query: 201 RVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNV 259
R+GK+ + A G+ FD D+K+ RF ++ + + +E+ T P FLPLA AN+ K+V
Sbjct: 84 RLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYFPQHFLPLATVANIGKSV 143
Query: 260 AAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK----------- 308
T+ R PI + F EN+ +V AK +AD +G L++ +K
Sbjct: 144 GITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLALAVGAAKAVSKVATVRPE 203
Query: 309 -RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
R + F L+ + Y+E+K+V L T+N+ R + E+F+K +PS
Sbjct: 204 IRRALPVFAFGPLAALDLICIYKELKAVQLRTINKERAEIIAEAFVKEREIPS 256
>gi|417401377|gb|JAA47577.1| Hypothetical protein [Desmodus rotundus]
Length = 464
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 166/323 (51%), Gaps = 11/323 (3%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+S + + +P GFPDSV+P Y+ Y W +++ F G T +L +GV +A+
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A WI+KD G +G++IFA +G K D + KQ R D+L ++ +E+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCF 189
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
C +N+AK V +V +TR + A+ N+ DV+AK E + N+A LL + L
Sbjct: 190 TVTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ + PS + F LL+ +V+++Y+ V+++V+ TLN R + ++ FL+ G V
Sbjct: 250 LPLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPT 309
Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKV 420
N E +++ W + LG+ + ++ E +E Y++ ++ S+ +V
Sbjct: 310 SANQMEPLWTGFW-PSLSLSLGAPLHRLI---SSVFELQQLVEGHREPYLLHWDQSQNRV 365
Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
+L A + IL+AA H VL
Sbjct: 366 QVVLSQMAGPETILRAATHGLVL 388
>gi|168022981|ref|XP_001764017.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684756|gb|EDQ71156.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 437
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 92/332 (27%), Positives = 163/332 (49%), Gaps = 21/332 (6%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
I S +P+G+P+SVTP Y+P+ TW L+ + +TQ LL +GV AT
Sbjct: 49 ILSAFLPEGYPNSVTPDYLPFQTWDTLQGLSTYIRSMLSTQALLGGIGVGETTATVVGAT 108
Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
W ++D G +G ++F QG D KQ R DLL ++G ++L + P F+ +
Sbjct: 109 FLWFMRDFTGMLGSILFTLHQGSNLDSSAKQWRLTADLLNDVGMLMDLVSPLFPGAFVTI 168
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR 309
C ++A++V V S +TR + + FA +N DV+AK A ++G L +++++
Sbjct: 169 LCIGSMARSVTGVASGATRAALTQHFALRKNAADVSAKEGSQETAATMVGMLLGMVLARL 228
Query: 310 NP----SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
+L +F LL+ +++++Y+ V+S+ L +LN R S+ + +F + VP+ RE
Sbjct: 229 TAENVVALWVSFLLLTAFHMYANYRAVRSLCLTSLNAERTSIVLSAFKEGRKVPTPREVA 288
Query: 366 LQENIF----SFPWLRD--------KPVVLGSRFKDAFQDP--DAYLAVEPFFEKERYVV 411
QE++ PW + K + G + D ++L++ + ++ Y++
Sbjct: 289 DQEHLLPRIPRLPWQKQRAGDAWLPKSIRFGVQISSLKLDDGWKSFLSLVERYSRDPYLM 348
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
P V+ +L A D L HA L
Sbjct: 349 V--PKGAIVHVVLHKLATPQDFLHGYVHALCL 378
>gi|452823170|gb|EME30183.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 519
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 95/329 (28%), Positives = 163/329 (49%), Gaps = 16/329 (4%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+++ + S +P+ +V+ Y + T+ L++ +GV +TQ LL +VG+S ++
Sbjct: 160 RKILDLLNSSFLPENCKKTVSKDYFWFTTFNFLQNVVNCCIGVISTQQLLRAVGLSASQG 219
Query: 185 TPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ ++NW+LKDG GR+G ++F G +FD D K+ R GD+L G G E+ + +P
Sbjct: 220 IGKSASLNWVLKDGIGRLGAIVFGSFIGNRFDADPKRWRLWGDILYAFGIGTEIISPLLP 279
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNI-ADLLG 299
FL +A AN+ K + + T I ++FAK EN GD++AK E + N+ LG
Sbjct: 280 RYFLLVASLANMVKATSYMLRLPTTAAIRRSFAKRENFGDISAKANSQEVLSNLFGTFLG 339
Query: 300 TGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
G SI++ L + + + +Y VK + L TLN R + S+ K+ V
Sbjct: 340 IGFSIILGHSWWQLSVAYFVYFIMFFLLNYLGVKGLHLRTLNLQRSLLLGWSYWKSSQVS 399
Query: 360 SLREGNLQEN-----IFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYN 414
++ E N +EN I SF + + G +D A+ + + +Y++T
Sbjct: 400 TVEEVNYRENVILPSIMSF----GRYIRFGVSLRDIHLTGSELAALLDKYRQAKYLLTVR 455
Query: 415 PSKGKVYALLKDQAKSDDILKAAFHAYVL 443
G+VY + +A+ D L+A A L
Sbjct: 456 --HGQVYIVFHMEARVKDELQAILQATFL 482
>gi|326527127|dbj|BAK04505.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 181/377 (48%), Gaps = 27/377 (7%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W L+ + +TQ LL ++GV AT W L
Sbjct: 67 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 126
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G +G ++FA QG D + K R D + +LG ++L + P + + C +
Sbjct: 127 RDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 186
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK--RNPS 312
++++ V S +TR + + FA N D++AK +A + G GL ++++ R
Sbjct: 187 LSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMSGMGLGMLLAHVTRGHD 246
Query: 313 LI--TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
L+ +F L+ +++++Y+ V+S+ L TLN R S+ ++ F++ G V + ++ + QE+I
Sbjct: 247 LVVWVSFLSLTVFHMYANYKAVQSLSLSTLNYERTSILLQYFMEHGEVLTPQQVSKQEHI 306
Query: 371 FSF--PW-------LRDKPVVLGSRFKDAFQDPDAYLAVE--PFFEKERYVVTYNPSKGK 419
F W L + V LG++ D L + ++ Y + G
Sbjct: 307 LPFWSSWRKLLRVKLPHELVNLGAK-ASMLTHSDMLLIAKTRSYYTNANYFLL--DKDGS 363
Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEY--DNSNLMLTTADLEA 477
V + QA + D+LK+ H VL + S +S ++ H++ + N ++ +E
Sbjct: 364 VCIFIHKQAVATDVLKSFIHGLVLARLVPKS---KSCHMEAHQWMDEKYNTFISKLKVEG 420
Query: 478 HIGE---SCKVVSTALW 491
+ E S +V A W
Sbjct: 421 YSTERLLSHSIVWKAHW 437
>gi|42562046|ref|NP_172832.3| uncharacterized protein [Arabidopsis thaliana]
gi|28393739|gb|AAO42280.1| unknown protein [Arabidopsis thaliana]
gi|28973405|gb|AAO64027.1| unknown protein [Arabidopsis thaliana]
gi|332190945|gb|AEE29066.1| uncharacterized protein [Arabidopsis thaliana]
Length = 440
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 180/372 (48%), Gaps = 28/372 (7%)
Query: 107 FKRNSIRAISLQSPQAPAEELMSFIRSYV---VPDGFPDSVTPSYVPYMTWRALKHFFGG 163
FK +I A S S Q A R + VP+GFP SVTP YV + W L+
Sbjct: 26 FKTATITASSSLSIQRSANRFNHVWRRVLQAFVPEGFPGSVTPDYVGFQLWDTLQGLSTY 85
Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLR 222
+ +TQ LL+++GV AT W L+D G +G ++F QG D + K R
Sbjct: 86 TKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWR 145
Query: 223 FAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
DL+ ++G ++L + P F+ + C +++++ V S +TR + + FA +N
Sbjct: 146 LVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAA 205
Query: 283 DVTAKGECVGNIADLLGTGLSIMISK---RNPSLI-TTFALLSCGYVFSSYQEVKSVVLH 338
D++AK +A ++G L +++++ NP I +F L+ +++++Y+ V+ +VL+
Sbjct: 206 DISAKEGSQETMATMMGMSLGMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLN 265
Query: 339 TLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPW---LRD-------KPVVLGSRFK 388
+LN R S+ + F++TG V S + + E + W LR K V LG R
Sbjct: 266 SLNFERSSILLTHFIQTGQVLSPEQVSSMEGVLPL-WATSLRSTNSKPLHKRVQLGVRVS 324
Query: 389 -----DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
D Q + A ++ +Y++ + KG V +L +K D+LK+ HA VL
Sbjct: 325 SLPRLDMLQLLNGVGASS--YKNAKYLLAH--IKGNVSVILHKDSKPADVLKSYIHAIVL 380
Query: 444 LHCINSSNGHRS 455
+ + S S
Sbjct: 381 ANLMEKSTSFYS 392
>gi|114662190|ref|XP_001159530.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan troglodytes]
gi|397472000|ref|XP_003807550.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan paniscus]
gi|410212432|gb|JAA03435.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410264128|gb|JAA20030.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410292910|gb|JAA25055.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410340381|gb|JAA39137.1| chromosome 16 open reading frame 58 [Pan troglodytes]
Length = 468
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + +P AP+ L S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV+AK E + N+A LL + L + + P + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+++V+ TLN R + ++ +L+ G V N E +++ W P
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ ++ S+ +V +L +A IL+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|207080128|ref|NP_001128778.1| DKFZP468D197 protein [Pongo abelii]
gi|55727564|emb|CAH90537.1| hypothetical protein [Pongo abelii]
Length = 466
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 182/361 (50%), Gaps = 15/361 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K + + +P AP+ L S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDSGEVGAPGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR---NPSLITTFALLSCGY 324
R + A+ N+ DV+AK + +L+G +S+++ P L F L+ +
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCPGL-GCFFFLTALH 269
Query: 325 VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG 384
++++Y+ V+++V+ TLN R + ++ +L+ G V + N E +++ W P
Sbjct: 270 IYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLNPTAANRMEPLWTGFW----PAPSL 325
Query: 385 SRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYV 442
S + + ++ E +E Y++ ++ S+ +V +L +A IL+AA H +
Sbjct: 326 SLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGLM 385
Query: 443 L 443
L
Sbjct: 386 L 386
>gi|327286048|ref|XP_003227743.1| PREDICTED: UPF0420 protein C16orf58-like [Anolis carolinensis]
Length = 420
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 99/322 (30%), Positives = 155/322 (48%), Gaps = 41/322 (12%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
F + +P G+P+SV+P Y+ Y W ++ F G TQ +L VGV +T A
Sbjct: 48 FFTAVFLPQGYPESVSPDYLAYQIWDTVQAFASSITGTLATQAVLKGVGVGDETSTVTAA 107
Query: 190 AINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
+ WILKDG G +G+++FA +G K D D KQ R D+L ++ +E+ A P F
Sbjct: 108 TVTWILKDGTGMLGRILFAWSKGSKLDCDAKQWRLFADVLNDVAILMEILAPAFPACFTL 167
Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIM 305
+ C + K + V +TR + A+ +N+ DV+AK E + N+A LL + L I
Sbjct: 168 IVCISGFFKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLFSLLLIP 227
Query: 306 ISKRNPSLI-TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
+ N L + + L + +++++Y+ V++V + TLNRAR + ++SFLK+G VP
Sbjct: 228 LVADNLRLTYSLYGLFTILHLYANYRAVRAVCMETLNRARLCLVLQSFLKSGEVPG---- 283
Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNP---SKGKVY 421
PD EP + + Y P S G V
Sbjct: 284 -----------------------------PDVTNPQEPLLLVDFSITIYVPWSSSTGTVS 314
Query: 422 ALLKDQAKSDDILKAAFHAYVL 443
LL Q +S D++KA HA +L
Sbjct: 315 ILLHQQRRSVDVIKACSHALLL 336
>gi|426381987|ref|XP_004057610.1| PREDICTED: UPF0420 protein C16orf58 homolog [Gorilla gorilla
gorilla]
Length = 468
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + +P AP+ L S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV+AK E + N+A LL + L + + P + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLILPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+++V+ TLN R + ++ +L+ G V N E +++ W P
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ ++ S+ +V +L +A IL+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSRNQVQVVLNQKAGPKTILRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|402908238|ref|XP_003916859.1| PREDICTED: UPF0420 protein C16orf58 homolog [Papio anubis]
Length = 468
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 181/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + +P+A A +S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPRA-ASPPLSGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV+AK E + N+A LL + L + + P + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+S+V+ TLN R + ++ +L+ G V N E +++ W P
Sbjct: 271 HIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ +N S+ +V +L A +L+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWNQSRNQVQVVLNQNAGPKTVLRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|218194530|gb|EEC76957.1| hypothetical protein OsI_15243 [Oryza sativa Indica Group]
Length = 836
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 159/304 (52%), Gaps = 27/304 (8%)
Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRF 223
A G LL +VG+ + A P A A+NW+LKDG G + K++ ++ G+ FD + K R
Sbjct: 468 AKGEAQGMALLYAVGLGKG-AIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRL 526
Query: 224 AGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGD 283
DLL G+E+ T PHLF+P+ AA ++ AA+ +TR+ Y FA N +
Sbjct: 527 FADLLENTAYGLEILTPVFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAE 586
Query: 284 VTAKGECVGNIADLLGTGLSIMISKRNPSLIT----TFALLSCGYVFSSYQEVKSVVLHT 339
V AKGE G ++ LG L I ++ R S ++ +FA ++ +++ + + +S+ L T
Sbjct: 587 VIAKGEAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRT 646
Query: 340 LNRARFSVAVESFLKTGHVPSLREGNLQENIF--------------SFPWLRD------K 379
LN R S+ +L +G VPS++E N +E +F P +D +
Sbjct: 647 LNPYRASLVFSEYLLSGQVPSVKEVNDEEPLFLNLSVGTSRKESKILSPQAKDAAEIICR 706
Query: 380 PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFH 439
+ LGS+ + ++ + A+ ++ E+Y++T KGK +LK+ + +D+LK+ FH
Sbjct: 707 RLQLGSKLSEIIENKEDACALFDLYKNEQYLLT--EYKGKFCVILKEGSSPEDMLKSLFH 764
Query: 440 AYVL 443
L
Sbjct: 765 VNYL 768
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 95/170 (55%), Gaps = 9/170 (5%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++PDG+PDSV+ Y+ Y WR ++ GV +TQ LL +VG+ + A P A A+NW+
Sbjct: 321 LLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 379
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K++ ++ G+ FD + K R DLL G+E+ T PHLF+P+ AA
Sbjct: 380 LKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTPVFPHLFVPIGAAAG 439
Query: 255 VAKNVAAVTSTSTRTPIYKAF--AKGENIGDVTAKGECVGNIADLLGTGL 302
++ AA+ T I+ F ++ V AKGE G +A L GL
Sbjct: 440 AGRSAAALIQT-----IFAHFMLENANHMRKVIAKGEAQG-MALLYAVGL 483
>gi|410984724|ref|XP_003998676.1| PREDICTED: UPF0420 protein C16orf58 homolog [Felis catus]
Length = 468
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 167/326 (51%), Gaps = 17/326 (5%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+S + + +P GFPDSV+P Y+PY W +++ F G T +L +GV +A+
Sbjct: 70 LSSLIAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A W++KD G +G++IFA +G K D + KQ R D+L ++ +E+ P F
Sbjct: 130 AATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFF 189
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
C +N+AK + +V +TR + A+ N+ DV+AK E + N+A LL + L
Sbjct: 190 TITICTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ + P + F L+ +++++Y+ V+++V+ TLN R + ++ FL+ G V
Sbjct: 250 LPLVSDCPGFSLGCFFFLTALHIYANYRAVRALVIETLNEGRLRLVLKHFLQRGEVLGPT 309
Query: 363 EGNLQENIFSFPWLRDKPVVLG---SRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSK 417
N E +++ W + + LG R + D ++ E +E Y++ ++ S+
Sbjct: 310 SANQMEPLWTGFW-PSQSLSLGVPLHRLTSSVFD------LQQLVEGHQEPYLLRWDLSQ 362
Query: 418 GKVYALLKDQAKSDDILKAAFHAYVL 443
+V +L A + IL+AA H VL
Sbjct: 363 NQVQVVLSQMAGPETILRAATHGLVL 388
>gi|334332835|ref|XP_001370394.2| PREDICTED: UPF0420 protein C16orf58 homolog [Monodelphis domestica]
Length = 442
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 95/345 (27%), Positives = 172/345 (49%), Gaps = 27/345 (7%)
Query: 114 AISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTL 173
++S ++ P + L + + + +P GFPDSV+P Y+PY W ++ F G TQ +
Sbjct: 31 SLSWEAGGGPGQGLRAALTAVFLPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAV 90
Query: 174 LNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELG 232
L VGV A+ A I W+LKDG G +G++ FA G + D + K+ R D+L +
Sbjct: 91 LQGVGVGDAAASVSAATITWLLKDGTGMLGRIGFAWLMGSRLDCEAKKWRLFADVLNDAA 150
Query: 233 AGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---E 289
+E+ P F + C +++AK + V +TR + A+ +N+ DV+AK E
Sbjct: 151 MFLEIVAPGFPSCFTLIVCVSSLAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQE 210
Query: 290 CVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
+ N+A LL + L + + PSL + F L + +++++++ V++V++ T N AR +
Sbjct: 211 TLVNLAGLLVSLLLLPLVSNRPSLSLAAFFLFTILHLYANFRAVRAVIMETFNEARLYLV 270
Query: 349 VESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG--SRFKDAFQDPDAYLAVEPFFE- 405
++ FL+ G V + N R +P+ G S + P L + FE
Sbjct: 271 LKHFLQWGKVLTPPSAN-----------RMEPLWTGCHSALSLSLGVPLHSL-ISSVFEL 318
Query: 406 -------KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
++ Y++ ++ +G++ L + + IL+AA H +L
Sbjct: 319 QQVSEGHQQPYLLRWDRCRGQIQVTLGQEVSTGVILRAATHGLLL 363
>gi|291410995|ref|XP_002721793.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 468
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 94/333 (28%), Positives = 170/333 (51%), Gaps = 23/333 (6%)
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
A ++ +++ +P GFP+SV+P Y+PY W +++ F G TQ +L +GV +
Sbjct: 66 ASRTLAGLQAVFLPQGFPESVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAK 125
Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
AT A W++KD G +G++IFA +G K D + KQ R D+L ++ +E+
Sbjct: 126 ATVSAATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDIAMFLEIMAPVF 185
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLG 299
P F +N+AK + +V +TR + A+ N+ DV+AK E + N+A LL
Sbjct: 186 PVCFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLV 245
Query: 300 TGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
+ L + + PS+ + F LL+ +++++Y+ V+++V+ TLN +R + ++ FL+ G V
Sbjct: 246 SLLMLPLVSGCPSVSLGCFFLLTALHIYANYRAVRALVMETLNESRLRLVLKHFLQQGEV 305
Query: 359 PSLREGNLQENIFSFPWLRDKPVV--------LGSRFKDAFQDPDAYLAVEPFFEKERYV 410
N E +++ P + L S + Q D + +E Y+
Sbjct: 306 LDPTPANQMEPLWTG--FSPSPALSLGVPLHCLVSSMSELQQLIDGH--------QEPYL 355
Query: 411 VTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ ++ S+ +V +L A + +L+AA H VL
Sbjct: 356 LCWDQSRNQVQVVLSQMAGPETVLRAATHGLVL 388
>gi|426254633|ref|XP_004020981.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ovis aries]
Length = 462
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 166/323 (51%), Gaps = 11/323 (3%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+S + + +P GFPDSV+P Y+PY W +++ F G T +L +GV +A+
Sbjct: 64 LSRLLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVS 123
Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A W++KD G +G+++FA G K D + KQ R D+L ++ +E+ +P F
Sbjct: 124 AATATWLVKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCF 183
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
C +N+AK + V +TR + A+ N+ DV+AK E + N+A LL + L
Sbjct: 184 TVTVCISNLAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLM 243
Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ + +PSL + F L+ +++++Y+ V+++V+ TLN R + + FL+ G V
Sbjct: 244 LPLVSDSPSLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQWGEVLGPT 303
Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKV 420
N E +++ W + LG+ + ++ E +E Y++ ++ S+ +V
Sbjct: 304 SANRMEPLWTGFW-SSVSLSLGAPLHCV---TSSVSELQHLVEGHQEPYLLHWDQSRNQV 359
Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
+L A + +L+AA H VL
Sbjct: 360 QVVLSPMAGPEAVLRAATHGLVL 382
>gi|301780856|ref|XP_002925848.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ailuropoda
melanoleuca]
gi|281353232|gb|EFB28816.1| hypothetical protein PANDA_015415 [Ailuropoda melanoleuca]
Length = 468
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 165/323 (51%), Gaps = 11/323 (3%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+S + + +P GFPDSV+P Y+PY W +++ F G T +L +GV +A+
Sbjct: 70 LSGLMAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGNAKASVS 129
Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A W++KD G +G+++FA +G K D + KQ R D+L ++ +E+ P F
Sbjct: 130 AATATWLVKDSTGMLGRILFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFF 189
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
C +N+AK + +V +TR + A+ N+ DV+AK E + N+A LL + L
Sbjct: 190 TMTVCTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLM 249
Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ + PS + F L+ +++++Y+ V+++VL TLN R + ++ FL+ G V
Sbjct: 250 LPLVSACPSFSLGCFFFLTALHIYANYRAVRALVLETLNEGRLRLVLKHFLQRGEVLGPT 309
Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKV 420
N E +++ W + LG + ++ E +E Y++ ++ S+ +V
Sbjct: 310 SANQMEPLWTGFW-PSLSLSLGVPLHCMI---SSVFELQQLVEGHQEPYLLRWDKSQRQV 365
Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
+L A + IL+AA H VL
Sbjct: 366 QVVLSQMAGPETILRAATHGLVL 388
>gi|355756739|gb|EHH60347.1| hypothetical protein EGM_11687 [Macaca fascicularis]
Length = 468
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + +P+A A +S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGGDAGEVGAPRA-ASPPLSGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV+AK E + N+A LL + L + + P + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+S+V+ TLN R + ++ +L+ G V N E +++ W P
Sbjct: 271 HIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ ++ S+ +V +L +A +L+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTVLRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 IL 388
>gi|195035521|ref|XP_001989226.1| GH11606 [Drosophila grimshawi]
gi|193905226|gb|EDW04093.1| GH11606 [Drosophila grimshawi]
Length = 417
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/325 (27%), Positives = 155/325 (47%), Gaps = 14/325 (4%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L +++ +P G+PDSV+ YV Y W L+ F G T +L +GV
Sbjct: 45 LFRVLQNIFLPKGYPDSVSDDYVVYQIWDTLQAFCSTICGTLCTHAILKGIGVGSENINA 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
+ + WILK+G+G VG+++FA QG + D D K+ R D L +L G+E+ PH
Sbjct: 105 YSATVTWILKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
+ C + + K + V +TR+ + + A N+ DV +K CV +A +G
Sbjct: 165 FSTQILCGSTMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L MI K L T F ++ +++++ + V+S+ L T N +R+ +A+E F ++ H+ +
Sbjct: 225 YLLTMI-KSQAMLYTIFYIVVTLHLYANLKAVRSICLRTFNESRYLIALEEFFRSSHMLT 283
Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
++ N E + + + LG K + + +E F ER+++ +
Sbjct: 284 PQQVNKMERVTVGQTVSVSLNIRLGLSVKHLIDEYKSSSVIENIVSSFDPHERFIIAESK 343
Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
VY L + D+LKA F A
Sbjct: 344 KHLGVY--LHFDTRPQDVLKAYFFA 366
>gi|148905902|gb|ABR16113.1| unknown [Picea sitchensis]
Length = 446
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 103/417 (24%), Positives = 195/417 (46%), Gaps = 47/417 (11%)
Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
D+SG F + R L E+ + + +P+GFP+SVT Y+P+ W +L+
Sbjct: 34 DKSGGFTIHRERGRML--------EVWGKLSAAFLPEGFPNSVTSDYLPFQIWDSLQGLS 85
Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQ 220
+ +TQ LL+++GV AT W L+D G +G ++F QG D + K
Sbjct: 86 TYIRCMLSTQALLSAIGVGETTATVVGATFQWFLRDFTGMLGGILFTLYQGSNLDSNAKM 145
Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
R DL+ +LG ++L + P F+ + C ++A++ V S +TR + + FA N
Sbjct: 146 WRLIADLMNDLGMLMDLVSPLYPASFVIILCLGSIARSFTGVASGATRAALTQHFALQHN 205
Query: 281 IGDVTAKGECVGNIADLLGTGLSIMISKRNP----SLITTFALLSCGYVFSSYQEVKSVV 336
D++AK +A ++G L ++++ ++ ++F LL+ ++F++YQ V+ +
Sbjct: 206 AADISAKEGSQETMATMIGMLLGMLLAHVTAGHPGAMWSSFLLLTFFHMFANYQAVRCLC 265
Query: 337 LHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQ---- 392
L++LN R SV ++ F+ G V + ++ + E + W++ + + G F F+
Sbjct: 266 LNSLNEGRTSVLLKCFITNGQVLNPKDVSSMEYLLP-NWMQLRMLTKGRPFNATFKLGAK 324
Query: 393 -------DPDAYLAVE-PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLL 444
D L + + K +Y++ G + +L + S DIL++ H VL
Sbjct: 325 ISSLSQSDMSKLLKLSASCYGKGKYLLLCR--NGTIEVILNKDSVSADILQSYIHGLVLS 382
Query: 445 HCINSS-------NGHRSLALKQHE------------YDNSNLMLTTADLEAHIGES 482
+ + SS N L ++++ ++ L++ + D AH ES
Sbjct: 383 YHMGSSSSVNDANNADSCLWMEKYYSMFISKQLQASGWNTGRLLVPSVDWRAHWVES 439
>gi|302564091|ref|NP_001181525.1| UPF0420 protein C16orf58 [Macaca mulatta]
gi|380815644|gb|AFE79696.1| hypothetical protein LOC64755 [Macaca mulatta]
Length = 468
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + +P+A A +S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPRA-ASPPLSGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV+AK E + N+A LL + L + + P + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+S+V+ TLN R + ++ +L+ G V N E +++ W P
Sbjct: 271 HIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ ++ S+ +V +L +A +L+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTVLRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|355710161|gb|EHH31625.1| hypothetical protein EGK_12728 [Macaca mulatta]
Length = 468
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + +P+A A +S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPRA-ASPPLSGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV+AK E + N+A LL + L + + P + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+S+V+ TLN R + ++ +L+ G V N E +++ W P
Sbjct: 271 HIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ ++ S+ +V +L +A +L+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTVLRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|242036507|ref|XP_002465648.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
gi|241919502|gb|EER92646.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
Length = 440
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 159/323 (49%), Gaps = 17/323 (5%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W L+ + +TQ LL ++GV AT W L
Sbjct: 64 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATFQWFL 123
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G +G ++F QG D + K R D + +LG ++L + P + + C +
Sbjct: 124 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 183
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK----RN 310
++++ V S +TR + + FA +N D++AK +A ++G GL ++++
Sbjct: 184 LSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMIGMGLGMLLAHITRGHA 243
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
S+ T+F L+ +++++Y+ V+S+ L TLN R S+ ++ F + G V + ++ + QE+I
Sbjct: 244 LSVWTSFLSLTVFHMYANYKAVQSLSLTTLNYERASILLQYFKECGDVLAPQKVSQQEHI 303
Query: 371 FSF--PW-------LRDKPVVLGSRFKDAFQDPDAYLA-VEPFFEKERYVVTYNPSKGKV 420
F W L + V LG++ +A +E Y++ +G V
Sbjct: 304 LPFWSTWRKLNKIKLPHERVHLGAKASMLTHSDMLVIAKTRSHYENTNYLLL--NKQGSV 361
Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
Y + A D+L++ H VL
Sbjct: 362 YIFIHKLATPADVLRSFVHGLVL 384
>gi|195996945|ref|XP_002108341.1| hypothetical protein TRIADDRAFT_19061 [Trichoplax adhaerens]
gi|190589117|gb|EDV29139.1| hypothetical protein TRIADDRAFT_19061, partial [Trichoplax
adhaerens]
Length = 272
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 84/270 (31%), Positives = 142/270 (52%), Gaps = 13/270 (4%)
Query: 133 SYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAIN 192
S +P G+P+SV+ Y+ Y W L+ F G TQ +L VGV +AT A A+
Sbjct: 1 SVFLPQGYPESVSGDYLTYQLWDTLQAFCSSITGTLATQAILKGVGVGDEKATVAAAAMT 60
Query: 193 WILKDGAGRVGKMIFA-RQGKKFDYDLKQL------RFAGDLLMELGAGVELATAAVPHL 245
W+LKDG G +G+++FA +G D D K++ R D L + +E+ PHL
Sbjct: 61 WMLKDGTGMIGRILFAWLRGTSLDCDAKKMLIFISYRLFADFLNDGALFIEMIAPNFPHL 120
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGL 302
F L C A+VAK++ V +TR + A+ N+ DV+AK E + N++ L+ GL
Sbjct: 121 FTLLLCVASVAKSIVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLSALI-VGL 179
Query: 303 SIM--ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
++ ++ + T F + + +++++Y+ V SVV+ TLNR R + +F + G + S
Sbjct: 180 FLLPAVTGNARIIWTLFFMFTMLHLYANYKAVSSVVMETLNRERLHILAYNFFRYGSILS 239
Query: 361 LREGNLQENIFSFPWLRDKPVVLGSRFKDA 390
E +E++ ++LG+ FK++
Sbjct: 240 PAEVCQRESVIIMKDQYKLRMILGAEFKES 269
>gi|332262940|ref|XP_003280516.1| PREDICTED: UPF0420 protein C16orf58 homolog [Nomascus leucogenys]
Length = 468
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 182/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + +P A + L S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDAGEVGAPGASSPPL-SGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV+AK E + N+A LL + L + + P + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+++V+ TLN R + ++ +L+ G V N E +++ W P
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ ++ S+ +V +L +A +IL+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKNILRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|308807048|ref|XP_003080835.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
tauri]
gi|116059296|emb|CAL55003.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
tauri]
Length = 1023
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/233 (33%), Positives = 123/233 (52%), Gaps = 13/233 (5%)
Query: 141 PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAG 200
P SVT Y Y +R + + VF TQ +L +VG+ +R P A A+NW+LKDG G
Sbjct: 587 PTSVTSDYWEYARYRFAQRIASSCISVFATQQMLTAVGLGASRTLPAAAAVNWVLKDGLG 646
Query: 201 RVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNV 259
R+GK+ + A G+ FD D+K+ RF ++ + + +E+ T P FLPLA AN+ K+V
Sbjct: 647 RLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYFPQHFLPLATVANIGKSV 706
Query: 260 AAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITT--- 316
T+ R PI + F EN+ +V AK +AD +G L++ +K + T
Sbjct: 707 GITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLALAVGAAKGMSKVATVRPE 766
Query: 317 ---------FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
F L+ + Y+E+K+V L T+N+ R + E+++K +P+
Sbjct: 767 IRRALPLFAFGPLAVLDLICIYKELKAVQLRTINKERAEIIAEAYVKEREIPT 819
>gi|14424560|gb|AAH09308.1| C16orf58 protein [Homo sapiens]
Length = 468
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 181/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + + AP+ L S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDAGEVGASGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVCPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV+AK E + N+A LL + L + + P + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+++V+ TLN R + ++ +L+ G V N E +++ W P
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ ++ S+ +V +L +A IL+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|351711359|gb|EHB14278.1| hypothetical protein GW7_05117 [Heterocephalus glaber]
Length = 469
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 87/323 (26%), Positives = 167/323 (51%), Gaps = 11/323 (3%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+S +++ +P GFP+SV+P Y+PY W +++ F T +++ +GV A+
Sbjct: 71 LSGLKAVFLPQGFPESVSPDYLPYQQWDSVQAFASSLCSALATHSVMLGLGVGNAEASLA 130
Query: 188 AVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A W+++D G +G++IFA +G K D + KQ R D+L + +EL + F
Sbjct: 131 AATATWLVRDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDAAMLLELVAPVLSVSF 190
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLS 303
+ +N+AK + +V +TR + A+ N+ DV+AK E + N+A LL + +
Sbjct: 191 TMIVSISNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLIM 250
Query: 304 IMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ + P L + F LL+ +++++Y+ V+++V+ TLN +R + ++ FL++G V
Sbjct: 251 LPLMSNCPRLSLGCFFLLTALHIYANYRAVRTLVMETLNESRLRLVLKHFLQSGEVLDTT 310
Query: 363 EGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKV 420
N E +++ W + LG + ++ E +E Y++ +N S+ +V
Sbjct: 311 SANQMEPLWTGIW-PSPSLSLGVPLHCVV---SSVSELQQLVEGHQEPYLLCWNQSENQV 366
Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
+L A + +L+AA H +L
Sbjct: 367 QVVLSQMAGPEAVLRAATHGLLL 389
>gi|66818028|ref|XP_642707.1| DUF647 family protein [Dictyostelium discoideum AX4]
gi|74861464|sp|Q86K80.1|U420_DICDI RecName: Full=UPF0420 protein
gi|60470803|gb|EAL68775.1| DUF647 family protein [Dictyostelium discoideum AX4]
Length = 527
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 149/298 (50%), Gaps = 26/298 (8%)
Query: 96 NYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRS-------------YVVPDGFPD 142
N + E+++S I + SP ++ ++F+ + +P+G+PD
Sbjct: 15 NCIIEKNQSNSI----INKFQVASPYDDYDDRLNFLETIEKNSGFYNVLCELFLPNGYPD 70
Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRV 202
SVT Y Y W +++ G T+ +L GV + AT + W+++DG G +
Sbjct: 71 SVTTDYFGYQFWDSIQALCSTITGTLATRAILKGYGVGDSSATVASATTQWLIRDGMGMI 130
Query: 203 GKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAANVAKNVA 260
G+++FA R+G D + K+ R+ D+L +G E+ + LFLPL+C +AK++
Sbjct: 131 GRIVFAWRKGTDLDCNSKKWRYTADILNNIGMAFEMISPLFSSQLFLPLSCIGLIAKSIC 190
Query: 261 AVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS---KRNPSLITT- 316
V T+ + + FAK +N+ DV+AK +L+G LS+++S N SLI T
Sbjct: 191 GVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAVNLVGMLLSVIVSSFINDNTSLIVTW 250
Query: 317 --FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT-GHVPSLREGNLQENIF 371
F + ++F +Y+ V +V L ++NR R + + F+ G +PS E + ENI
Sbjct: 251 LVFLFFTSLHLFCNYRAVSAVQLKSINRYRAYLIYDYFIHNQGSIPSPSEISKLENIL 308
>gi|255081626|ref|XP_002508035.1| predicted protein [Micromonas sp. RCC299]
gi|226523311|gb|ACO69293.1| predicted protein [Micromonas sp. RCC299]
Length = 665
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 84/245 (34%), Positives = 131/245 (53%), Gaps = 13/245 (5%)
Query: 140 FP--DSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
FP +SVT Y Y +R L+ + VF TQ +L ++G+ +R P A AINW+LKD
Sbjct: 199 FPARESVTEDYWEYAKFRFLQRMASSCITVFATQQMLAAIGMGASRRLPAAAAINWVLKD 258
Query: 198 GAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA 256
G GR+GK+ + A G++FD D+K+ RF ++ + + VE+ T P FL LA AN+
Sbjct: 259 GLGRLGKLGVAANFGREFDSDVKRFRFTSSVVYDASSLVEMITPFFPKKFLALATLANIG 318
Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG-------TGLSIMISKR 309
K+V T+ R PI ++F EN+ +V AK +AD LG TGL+ +
Sbjct: 319 KSVGITTANVVRAPIQRSFVLEENLAEVAAKTSAQQVVADNLGLAVAVLATGLTGKVVND 378
Query: 310 NPSLITTFAL---LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
LI F L+ ++ Y+E+K+V L T+N+ R + + ++ G VP + +
Sbjct: 379 RARLIIPFVAFIPLATMDLYCIYRELKAVQLTTINKERAEIICDHWINVGTVPGTKYVSD 438
Query: 367 QENIF 371
E +F
Sbjct: 439 AERLF 443
>gi|257900512|ref|NP_073581.2| UPF0420 protein C16orf58 [Homo sapiens]
gi|143955305|sp|Q96GQ5.2|CP058_HUMAN RecName: Full=UPF0420 protein C16orf58
gi|62897385|dbj|BAD96633.1| hypothetical protein FLJ13868 variant [Homo sapiens]
gi|119572504|gb|EAW52119.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
gi|119572505|gb|EAW52120.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
gi|119572506|gb|EAW52121.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
Length = 468
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 181/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + + AP+ L S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDAGEVGASGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV+AK E + N+A LL + L + + P + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+++V+ TLN R + ++ +L+ G V N E +++ W P
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ ++ S+ +V +L +A IL+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|10436021|dbj|BAB14730.1| unnamed protein product [Homo sapiens]
Length = 468
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 181/362 (50%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + + AP+ L S +++ +P GFPDSV+P Y
Sbjct: 35 EVGGWRWWGLS---RAFTVKPEGRDAGEVGASGAPSPPL-SGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV+AK E + N+A LL + L + + P + F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+++V+ TLN R + ++ +L+ G V N E +++ W P
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW----PAPS 326
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
S + + ++ E +E Y++ ++ S+ +V +L +A IL+AA H
Sbjct: 327 LSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKAGPKTILRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|297844278|ref|XP_002890020.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
lyrata]
gi|297335862|gb|EFH66279.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
lyrata]
Length = 440
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 179/372 (48%), Gaps = 28/372 (7%)
Query: 107 FKRNSIRAISLQSPQAPAEELMSFIRSYV---VPDGFPDSVTPSYVPYMTWRALKHFFGG 163
FK +I A + S Q A R + VP+GFP SVTP YV + W L+
Sbjct: 26 FKTATITASASLSIQRSANRFNHVWRRVLQAFVPEGFPGSVTPDYVGFQFWDTLQGLSTY 85
Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLR 222
+ +TQ LL+++GV AT W L+D G +G ++F QG D + K R
Sbjct: 86 IKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWR 145
Query: 223 FAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
DL+ ++G ++L + P F+ + C +++++ V S +TR + + FA +N
Sbjct: 146 LVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAA 205
Query: 283 DVTAKGECVGNIADLLGTGLSIMISK---RNPSLIT-TFALLSCGYVFSSYQEVKSVVLH 338
D++AK +A ++G L +++++ NP I +F L+ +++++Y+ V+ +VL+
Sbjct: 206 DISAKEGSQETMATMMGMSLGMLLARFTSGNPMAIWFSFLSLTVFHMYANYRAVRCLVLN 265
Query: 339 TLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPW----------LRDKPVVLGSRFK 388
+LN R S+ + F++TG V S + + E + W L K V LG R
Sbjct: 266 SLNFERSSILLTHFMQTGQVLSPEQVSSMEGVLPV-WATSLRSTNSKLLHKRVHLGVRVS 324
Query: 389 -----DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ Q + A ++ +Y++ + KG V +L + + D+LK+ HA VL
Sbjct: 325 SLPRLEMMQLLNGVGASS--YKNAKYLLAHR--KGNVSVILHKDSSAADVLKSYIHAIVL 380
Query: 444 LHCINSSNGHRS 455
+ + S S
Sbjct: 381 ANLMEKSTSFYS 392
>gi|328869935|gb|EGG18310.1| DUF647 family protein [Dictyostelium fasciculatum]
Length = 407
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 95/347 (27%), Positives = 167/347 (48%), Gaps = 28/347 (8%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
+ +R +P G+P+SVT Y Y TW L+ GV T+ +L GV A+
Sbjct: 41 FIEVLRDMFLPKGYPNSVTSDYTGYQTWDTLQALCSSLTGVLATRAILKGYGVGNLNASV 100
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PH 244
+ +I WI +D AG +G++IFA + G D + K RFA DLL + G +E+ + P
Sbjct: 101 ASASIQWITRDAAGMLGRIIFAWKWGTDLDCNSKTWRFAADLLNDFGMTLEMISPLFGPS 160
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGT 300
FLP++C ++K++ V T+ + + FAK +N+ DV+AK + + LLG
Sbjct: 161 YFLPISCLGLISKSICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAIALVGMLLGM 220
Query: 301 GLSIMISKRNPSLITTFA------LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
++ I + S+ F + + ++ +Y+ V V L ++NR R ++ + F +
Sbjct: 221 VMAASIPDDDDSIQVYFKTWLFFIIFTVIHLLCNYRAVSLVQLTSINRYRANLIYQHFNE 280
Query: 355 TGH--VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVT 412
+ +P+ ++ +L+ENI D + LG F D D V + ++Y++
Sbjct: 281 NNNNQIPTPKQISLKENILFRQ--SDLSIELGVSFTDINGDLK---RVSGSDKDDKYMIW 335
Query: 413 YNPSKGKVYAL---LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSL 456
N + K + + + A S DI++ F + L+H GH L
Sbjct: 336 SNNKQKKNHLIKIAISSTATSSDIIEGYFESITLVH------GHSKL 376
>gi|427789495|gb|JAA60199.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 464
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/340 (30%), Positives = 167/340 (49%), Gaps = 32/340 (9%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L +F +P G+PDSV+ Y Y W +++ F G TQ++L+ VGV A+
Sbjct: 41 LRAFFSEIFLPVGYPDSVSRDYTEYQIWDSIQAFASSITGTLATQSVLSGVGVGDQGASV 100
Query: 187 GAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
A WIL+DG G G+++FA QG DYD K+ R D+L ++ +EL +
Sbjct: 101 LAATTTWILRDGTGMTGRILFAWFQGSNLDYDSKKWRLFADILNDVAIFLELMCQYMKGY 160
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGL 302
+ C ++VAK++ V +TR + + A+ N+ DV+AK E + N+A L + L
Sbjct: 161 VTAVLCVSSVAKSIVGVAGGATRAALTQHQARNNNMADVSAKDGSQETLVNLAAFLFSLL 220
Query: 303 SIMISKRNPSLI----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT-GH 357
+ I NP L+ +F LL ++F++Y+ V VV+ T NR+R+++ V FL+T G
Sbjct: 221 LLRIVAGNPWLLYGIFISFTLL---HIFANYRAVSCVVMETFNRSRYAIVVRRFLETAGD 277
Query: 358 VPSLREGNLQENI------------FSFPWLRDKPV--VLGSRFKDAFQDPDAYLAVEPF 403
+ + N +E++ F P P L R +DA D VEP
Sbjct: 278 IAPVTWVNARESVWIQCGKPFVSINFGVPLFSIAPTNKELQCRVEDAAGDRK----VEPA 333
Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ VV + + +L ++ D ++A FHA++L
Sbjct: 334 YLLA--VVRDSSRHYNINIVLSSESTVKDHMEAMFHAFLL 371
>gi|256077542|ref|XP_002575062.1| hypothetical protein [Schistosoma mansoni]
Length = 410
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/230 (32%), Positives = 124/230 (53%), Gaps = 7/230 (3%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
++ + +P G+P SV+ Y+ Y W ++ F G +Q +L VGV + AT + +
Sbjct: 41 LKKFFLPVGYPSSVSDDYLEYQIWDTIQAFASSITGALASQAVLIGVGVGDSSATILSAS 100
Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
+ W+ KDG+G +G++IFA G K D D K RF D+L + +E+ + +LF PL
Sbjct: 101 LTWMFKDGSGMIGRIIFAGYHGIKLDCDCKFWRFVADILNDCALFLEIISPLFNYLFTPL 160
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMI 306
C ANV K++ V ++TR I + A N+ DV+AK E + N+ L + + +
Sbjct: 161 LCLANVLKSLVGVAGSATRAAIVQHQAINNNLADVSAKDGSQETLSNLMAWLLNFILLYM 220
Query: 307 SKRNPSLITTFALLSCG--YVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
N LI F + C +++S+Y+ VK + L T NR RF +A++ + K
Sbjct: 221 VTGNQFLI-WFCFICCTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQWFK 269
>gi|449017769|dbj|BAM81171.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 827
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 166/357 (46%), Gaps = 49/357 (13%)
Query: 135 VVPDGF------PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGA 188
++P+GF P V+ Y+PY +++ A V TQ++L ++GV++ +
Sbjct: 338 LLPEGFGDKTRFPPVVSDDYLPYTVLHFMQNLVHAAASVIATQSMLIAIGVAKQATIGVS 397
Query: 189 VAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A+ W+LKDG GR+ +IF G ++D D K+ R GD+L LG G E+AT VP FL
Sbjct: 398 AAVQWVLKDGLGRIAAIIFGSLLGNRYDADPKRWRMRGDMLYTLGVGTEIATRLVPQYFL 457
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT----GLS 303
+ AN K+V+ + T I + F+ EN GD++AK ++ LLGT GLS
Sbjct: 458 LVGTMANAIKSVSYMMRLPTAAAIRRHFSLRENFGDLSAKANSQEVLSMLLGTFVGIGLS 517
Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP--SL 361
MI +L + + ++ +Y ++ + + TLN R + + + +G V S+
Sbjct: 518 YMIDNSLKALFAFYIIYVQMFILFNYWSLRVLRMRTLNLQRTLIVMRRYWNSGGVETCSV 577
Query: 362 REGNLQENIFSFPWLRDKP---VVLGSRFKDAFQ-------------------------- 392
N EN F P R P V G R +AF+
Sbjct: 578 AMANQAEN-FLLP-SRLGPARRVRFGCRLGEAFRTVGEFEEAIARQRLWRGQYLQQTDVS 635
Query: 393 ----DPDA-YLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLL 444
+ DA + +V ERY++ + +G+V+ +L AK D L+A HA L+
Sbjct: 636 KRNGNADADFTSVPDDVLDERYLLGVDAKRGRVFVVLHRDAKVADELQALMHATYLV 692
>gi|195387984|ref|XP_002052672.1| GJ17681 [Drosophila virilis]
gi|194149129|gb|EDW64827.1| GJ17681 [Drosophila virilis]
Length = 417
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 89/325 (27%), Positives = 154/325 (47%), Gaps = 14/325 (4%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L +++ +P G+PDSV+ Y Y W ++ F G T +L +GV
Sbjct: 45 LFRVLQNIFLPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNA 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATA-AVPH 244
+ + WILK+G+G +G+++FA QG + D D K+ R D L +L G+E+ PH
Sbjct: 105 YSATVTWILKEGSGHLGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
L + C + + K + V +TR + + A N+ DV +K CV IA +G
Sbjct: 165 LSTQILCGSTMLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L MI K L T F ++ +++++ + V++V L T N +R+ +A+E F ++ H+ +
Sbjct: 225 YLLTMI-KSQAMLYTVFYIVVTLHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSHMLT 283
Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
++ N E + + + LG K + + +E F ER+++ +
Sbjct: 284 PQQVNKLERVTVGQMVSVSLNIRLGLSVKHLIDEYKSSNVIENIVSSFDPHERFIIAESK 343
Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
VY L + D+LKA F A
Sbjct: 344 KYLGVY--LHFDTRPQDVLKAYFFA 366
>gi|260815171|ref|XP_002602347.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
gi|229287656|gb|EEN58359.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
Length = 244
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/244 (33%), Positives = 128/244 (52%), Gaps = 5/244 (2%)
Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
+S +P G+P+SV+ Y+ Y W ++ F G TQ +L VGV AT I
Sbjct: 1 QSVFLPQGYPESVSDDYLSYQIWDTVQAFCSSITGTLATQAVLKGVGVGDETATAAGATI 60
Query: 192 NWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA 250
WILKDG G +G+++FA QG D D K+ R D++ + +EL A P F +
Sbjct: 61 TWILKDGTGMLGRILFAWLQGTSLDCDAKRWRLFADIVNDFSIFLELVAPAFPLFFTGIV 120
Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMIS 307
C A V K + V+ +TR + A+ +N+ DV+AK E + N+ LL + +
Sbjct: 121 CVAGVGKAIVGVSGGATRAALTMHQARRDNMADVSAKDGSQETLVNLMALLAGLVVTPMV 180
Query: 308 KRNPSLIT-TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
NP L F + +C ++F++Y V VV+ TLN++R + V +L+TG V + + N+
Sbjct: 181 ADNPLLTWFLFFVFTCLHLFANYNAVTCVVMETLNQSRLHILVSHYLRTGVVLNPAQANV 240
Query: 367 QENI 370
QE +
Sbjct: 241 QEPV 244
>gi|156379270|ref|XP_001631381.1| predicted protein [Nematostella vectensis]
gi|156218420|gb|EDO39318.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 87/258 (33%), Positives = 134/258 (51%), Gaps = 10/258 (3%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
++S +P G+P+SV+ Y+ Y W ++ F G TQ +L GV AT A
Sbjct: 1 LQSAFLPQGYPESVSKDYLTYQFWDTVQAFCSSITGTLATQAMLKGYGVGNESATALAAT 60
Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
+ WIL+ G G VG ++FA QG D + K+ R D++ +L VEL + F L
Sbjct: 61 MTWILRSGTGMVGSILFAWMQGSNLDCNAKKWRLFADIMNDLAILVELISPYFKAYFTLL 120
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMI 306
AC ++VAK++ V STR + + A+ +N+ DV AK E + N+ LL GL I
Sbjct: 121 ACLSSVAKSIVGVAGGSTRAALTQHQARRDNMADVAAKDGSQETLVNLMALLA-GLVITP 179
Query: 307 SKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE- 363
+L+T F + + +++++Y V SVV+ TLN R + FL+TGHV S E
Sbjct: 180 LVTGHTLLTWVLFIIFTFLHLYANYCAVSSVVMETLNTTRLQIVTNEFLRTGHVLSPDEV 239
Query: 364 GNLQENIF--SFPWLRDK 379
G ++ IF W+ +K
Sbjct: 240 GKMEPVIFGKCCHWVYEK 257
>gi|47217805|emb|CAG07219.1| unnamed protein product [Tetraodon nigroviridis]
Length = 412
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 83/267 (31%), Positives = 142/267 (53%), Gaps = 6/267 (2%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
+S +P G+PDSV+ Y+ Y W L+ F G T+ L VGV +AT A
Sbjct: 75 FQSVFLPQGYPDSVSSDYLQYQFWDTLQAFSSSLSGTLATRASLRGVGVGNQQATVAAAT 134
Query: 191 INWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
W+LKDG G G+++FA +G K D + K+ R D+L ++ +++ P F +
Sbjct: 135 TTWLLKDGTGMCGRILFAWLKGTKLDSEAKKWRLFADILNDIAMFMDILAPYFPAFFTLI 194
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMI 306
C + V K++ V +TR + A+ +N+ D++AK E + N+A LL + +
Sbjct: 195 MCVSGVFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLALVSF 254
Query: 307 SKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
N +L ++ F L + ++F++Y+ V+SVV+ TLN AR S+ ++ +LK G + S E N
Sbjct: 255 VADNAALTLSLFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLQEYLKDGRMLSPLEAN 314
Query: 366 LQENIFSFPWLRDKPVVLGSRFKDAFQ 392
+E + + + P+ LG R ++ Q
Sbjct: 315 QREPVL-MEFKKTVPIKLGVRLQEIVQ 340
>gi|270013094|gb|EFA09542.1| hypothetical protein TcasGA2_TC011650 [Tribolium castaneum]
Length = 450
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 160/344 (46%), Gaps = 14/344 (4%)
Query: 113 RAISLQS-PQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQ 171
R IS +S + + ++ F R ++P G+PDSV+ Y Y W ++ F G+FTTQ
Sbjct: 33 REISQESVADSVLQSVLGFFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTTQ 92
Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLME 230
+L VGV A+ + I WI+KDG G VG++IFA +GK D + K+ R DLL +
Sbjct: 93 AILKGVGVGSAEASALSATITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLND 152
Query: 231 LGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG-- 288
+EL+ V HL + C + + K + V +TR I A N+ ++ AK
Sbjct: 153 GAMILELSIPHVTHLSTEILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDGS 212
Query: 289 --ECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFS 346
V IA G L + I TF L+ +++S+YQ V +++ LN R +
Sbjct: 213 QETLVNLIASFTGIFLINYFTDATTQWIVTFLLIFL-HLYSNYQAVTALIFKNLNNLRMT 271
Query: 347 VAVESFLKTGHVPSLREGNLQENIF-----SFPWLRDKPVVLGSRFKDAFQ--DPDAYLA 399
+ ++S+L V N E++F S +VLG + + D +
Sbjct: 272 LILKSYLAVNAVLGPECINKTESVFLGFGTSVEKFCGFKIVLGQALGKTLRRYNSDELKS 331
Query: 400 VEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ ++ ++Y++ + +Y + +++ +HA +L
Sbjct: 332 LTEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGFYHAVLL 375
>gi|297283910|ref|XP_002802508.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Macaca
mulatta]
Length = 429
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 89/315 (28%), Positives = 160/315 (50%), Gaps = 11/315 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFPDSV+P Y+PY W +++ F G TQ +L +GV +AT A W++
Sbjct: 39 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 98
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KD G +G+++FA +G K D + KQ R D+L ++ +E+ P F +N
Sbjct: 99 KDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 158
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNP 311
+AK + +V +TR + A+ N+ DV+AK E + N+A LL + L + + P
Sbjct: 159 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 218
Query: 312 SL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ F L+ +++++Y+ V+S+V+ TLN R + ++ +L+ G V N E +
Sbjct: 219 GFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 278
Query: 371 FSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQA 428
++ W P S + + ++ E +E Y++ ++ S+ +V +L +A
Sbjct: 279 WTGFW----PAPSLSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKA 334
Query: 429 KSDDILKAAFHAYVL 443
+L+AA H +L
Sbjct: 335 GPKTVLRAATHGLML 349
>gi|189241392|ref|XP_968556.2| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Tribolium
castaneum]
Length = 472
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 160/344 (46%), Gaps = 14/344 (4%)
Query: 113 RAISLQS-PQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQ 171
R IS +S + + ++ F R ++P G+PDSV+ Y Y W ++ F G+FTTQ
Sbjct: 33 REISQESVADSVLQSVLGFFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTTQ 92
Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLME 230
+L VGV A+ + I WI+KDG G VG++IFA +GK D + K+ R DLL +
Sbjct: 93 AILKGVGVGSAEASALSATITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLND 152
Query: 231 LGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG-- 288
+EL+ V HL + C + + K + V +TR I A N+ ++ AK
Sbjct: 153 GAMILELSIPHVTHLSTEILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDGS 212
Query: 289 --ECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFS 346
V IA G L + I TF L+ +++S+YQ V +++ LN R +
Sbjct: 213 QETLVNLIASFTGIFLINYFTDATTQWIVTFLLIFL-HLYSNYQAVTALIFKNLNNLRMT 271
Query: 347 VAVESFLKTGHVPSLREGNLQENIF-----SFPWLRDKPVVLGSRFKDAFQ--DPDAYLA 399
+ ++S+L V N E++F S +VLG + + D +
Sbjct: 272 LILKSYLAVNAVLGPECINKTESVFLGFGTSVEKFCGFKIVLGQALGKTLRRYNSDELKS 331
Query: 400 VEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ ++ ++Y++ + +Y + +++ +HA +L
Sbjct: 332 LTEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGFYHAVLL 375
>gi|7339490|emb|CAB82813.1| putative protein [Arabidopsis thaliana]
Length = 631
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 172/378 (45%), Gaps = 58/378 (15%)
Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
+P+ + + + +++P+GFP+SVT Y+ Y WR ++ GV TQ+LL +VG
Sbjct: 182 TPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVG 241
Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
+ + A P A AINW+LKDG G + K++ ++ G+ FD K R DLL G+E+
Sbjct: 242 LGKG-AIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEML 300
Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV-------------- 284
T P F+ + AA ++ AA+ +TR+ FA N +V
Sbjct: 301 TPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVYNNFYMALVLITYQ 360
Query: 285 -------------TAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQE 331
A+G ++ LLG ++ I + F +++ +++++ +
Sbjct: 361 QLFVFLNYSGSLGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKS 420
Query: 332 VKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR-------------- 377
+ + L TLN R S+ +L +G P ++E N +E + FP +R
Sbjct: 421 YQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPL--FPTVRFSNMKSPEKLQDFV 478
Query: 378 ------------DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLK 425
++ + LGS+ D + + +A+ + E Y++T + KG+ +LK
Sbjct: 479 LSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEH--KGRFCVMLK 536
Query: 426 DQAKSDDILKAAFHAYVL 443
+ + D+L++ F L
Sbjct: 537 ESSTPQDMLRSLFQVNYL 554
>gi|170037702|ref|XP_001846695.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880979|gb|EDS44362.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 417
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 77/250 (30%), Positives = 131/250 (52%), Gaps = 7/250 (2%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L F ++ +P G+PDSV+ Y+ Y W ++ F G TT +L VGV + A P
Sbjct: 45 LKRFFQNLFLPAGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANP 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PH 244
+ I W+LKDG G G+++FA +G + D D K+ R D+L ++ GV+L + P
Sbjct: 105 LSATITWVLKDGMGHFGRILFAWWKGTELDIDSKKWRIRADILNDIAMGVDLFVLPLYPK 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
+ + CA + K + V +TR+ + + A N+ DV +K CV IA +G
Sbjct: 165 VSTYILCATSTMKAIVGVAGGATRSALTQHHALRGNLADVASKDSAQETCVNLIASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
GL + + L F ++ +++++ + VK+V L T N AR+ +A+E + K+G + S
Sbjct: 225 GLLTHVQNQRV-LYGLFLFVTLLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGTMLS 283
Query: 361 LREGNLQENI 370
++ N E +
Sbjct: 284 PQQVNKLERV 293
>gi|296233362|ref|XP_002761979.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Callithrix
jacchus]
Length = 468
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 175/362 (48%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + +P A A +S +++ +P GFPDSV+P Y
Sbjct: 35 EVRGWRWWGLS---RAFTVKPEGRDAGEVGAPGA-ASPPLSGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G T +L +GV +AT A W++KD G +G++IFA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIIFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKGECVGNIADLLG----TGLSIMISKRNPSLITTFALLSCG 323
R + A+ N+ DV+AK + +L G + ++S F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGFPGFSFGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+++V+ TLN R + ++ +L+ G V N E +++ W + L
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTTANRMEPLWTGFW-PTPSLSL 329
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
G + ++ E +E Y++ ++ S+ +V +L +A +L+AA H
Sbjct: 330 GVPLHRLV---SSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKADPKTVLRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|297825247|ref|XP_002880506.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
lyrata]
gi|297326345|gb|EFH56765.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
lyrata]
Length = 519
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 90/294 (30%), Positives = 148/294 (50%), Gaps = 27/294 (9%)
Query: 103 RSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGF----------------PDSVTP 146
+SGK R I+ SL+ EE S + DGF P V+
Sbjct: 70 KSGKVSRYFIKGDSLELLCVDEEEDDSTSFCLGLDDGFWKLIRLSSSAAKDFFLPKQVSD 129
Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMI 206
+Y+ Y+ W+ L F A+ V TQ + ++G+ ++R+ + A NWILKDG GR+ + I
Sbjct: 130 NYISYVKWKFLHRVFSSALQVLATQVMFRAIGIGQSRSLASSAAFNWILKDGLGRLSRCI 189
Query: 207 F-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTST 265
+ A FD +LK++RF+ +L L GVEL T P FL LA AN+AK ++
Sbjct: 190 YTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQYFLLLASIANIAKQISLSCYL 249
Query: 266 STRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKRNPSL-----ITT 316
+T + ++++FA +N+G+V+AK + C N+ LL L+++ + N L
Sbjct: 250 ATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAVLLNMLF-QNNQRLQASLPFVL 308
Query: 317 FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ + S + YQ +K + L TL + R + +E +++ VPS E + +E I
Sbjct: 309 YPIFSTFDLLGIYQGLKHINLQTLTKDRLEIILERWIEFRQVPSPAEVSEEEGI 362
>gi|79561141|ref|NP_179928.2| uncharacterized protein [Arabidopsis thaliana]
gi|51970902|dbj|BAD44143.1| unnamed protein product [Arabidopsis thaliana]
gi|330252365|gb|AEC07459.1| uncharacterized protein [Arabidopsis thaliana]
Length = 520
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 90/294 (30%), Positives = 148/294 (50%), Gaps = 27/294 (9%)
Query: 103 RSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGF----------------PDSVTP 146
+SGK R I+ SL+ EE S + DGF P V+
Sbjct: 71 KSGKVSRYFIKGDSLELLCVDEEEDDSTSFCLGLDDGFWKLIRLTSSAAKDFFLPKQVSD 130
Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMI 206
+Y+ Y+ W+ L F A+ V TQ + ++G+ ++R+ + A NWILKDG GR+ + I
Sbjct: 131 NYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSLASSAAFNWILKDGLGRLSRCI 190
Query: 207 F-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTST 265
+ A FD +LK++RF+ +L L GVEL T P FL LA AN+AK ++
Sbjct: 191 YTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQYFLLLASIANIAKQISLSCYL 250
Query: 266 STRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKRNPSL-----ITT 316
+T + ++++FA +N+G+V+AK + C N+ LL L+++ + N L
Sbjct: 251 ATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAVLLNMLF-QHNQRLQACLPFVL 309
Query: 317 FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ + S + YQ +K + L TL + R + +E +++ VPS E + +E I
Sbjct: 310 YPIFSTFDLLGIYQGLKHINLQTLTKDRLEIILERWIEFRQVPSPAEVSEEEGI 363
>gi|224094527|ref|XP_002310175.1| predicted protein [Populus trichocarpa]
gi|222853078|gb|EEE90625.1| predicted protein [Populus trichocarpa]
Length = 333
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 85/276 (30%), Positives = 142/276 (51%), Gaps = 32/276 (11%)
Query: 151 YMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIF-AR 209
Y+ W+ L F A+ V TQ + ++G+ +R+ P A A+NW+LKDG GR+ + I+ A
Sbjct: 3 YVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPSAAALNWVLKDGLGRLSRCIYTAS 62
Query: 210 QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRT 269
FD +LK++RF +L L G+EL T P FL LA AN+AK ++ +T +
Sbjct: 63 LASAFDTNLKRVRFTTSVLFSLSIGIELLTPTFPQYFLLLATLANIAKQISLACYLATGS 122
Query: 270 PIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS---KRNPSLITTFALLSCGY-- 324
++++FA +NIG+V+AK + D LG L+ +++ K N +F +S +
Sbjct: 123 AVHRSFAIADNIGEVSAKAQIQTVSFDNLGLMLAALLNMLFKNNQRFSLSFPTMSFCFCL 182
Query: 325 ------------VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS---------LRE 363
+F YQ ++ V L TL + R + + S++ GHVPS LRE
Sbjct: 183 PFFVYPIFSAIDLFGIYQGLQHVHLQTLTKDRLEIILNSWIDFGHVPSPAERGILLCLRE 242
Query: 364 G----NLQENIFSFPWLRDKPVVLGSRFKDAFQDPD 395
G ++ + ++R K ++ S ++ A +D D
Sbjct: 243 GSGTADVIMGLLQACYIR-KALLFSSMWETAVEDKD 277
>gi|403276814|ref|XP_003930078.1| PREDICTED: UPF0420 protein C16orf58 homolog [Saimiri boliviensis
boliviensis]
Length = 468
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 177/362 (48%), Gaps = 15/362 (4%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
++ G RW ++ R+ K A + +P A A +S +++ +P GFPDSV+P Y
Sbjct: 35 EVRGWRWWGLS---RAFTVKPEGRDAGEVGAPGA-ASPPVSGLQAVFLPQGFPDSVSPDY 90
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+PY W +++ F G TQ +L +GV +AT A W++KD G +G+++FA
Sbjct: 91 LPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
+G K D + KQ R D+L ++ +E+ P F +N+AK + +V +T
Sbjct: 151 WWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGAT 210
Query: 268 RTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL----ITTFALLSCG 323
R + A+ N+ DV+AK + +L G +S+++ F L+
Sbjct: 211 RAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGFPGFSFGCFFFLTAL 270
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL 383
+++++Y+ V+++V+ TLN R + ++ +L+ G V N E +++ W + L
Sbjct: 271 HIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTTANRMEPLWTGFW-PTPSLSL 329
Query: 384 GSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
G + ++ E +E Y++ ++ S+ +V +L +A +L+AA H
Sbjct: 330 GVPLHRLV---SSVSELQQLVEGHQEPYLLCWDQSQNQVQVVLNQKADPKTVLRAATHGL 386
Query: 442 VL 443
+L
Sbjct: 387 ML 388
>gi|8920582|gb|AAF81304.1|AC027656_21 Contains similarity to CG10338 gene product from Drosophila
melanogaster gb|AE003660 [Arabidopsis thaliana]
Length = 438
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/337 (28%), Positives = 166/337 (49%), Gaps = 25/337 (7%)
Query: 139 GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDG 198
GFP SVTP YV + W L+ + +TQ LL+++GV AT W L+D
Sbjct: 59 GFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDF 118
Query: 199 AGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAK 257
G +G ++F QG D + K R DL+ ++G ++L + P F+ + C ++++
Sbjct: 119 TGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSR 178
Query: 258 NVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK---RNPSLI 314
+ V S +TR + + FA +N D++AK +A ++G L +++++ NP I
Sbjct: 179 SFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLARFTSGNPMAI 238
Query: 315 -TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF 373
+F L+ +++++Y+ V+ +VL++LN R S+ + F++TG V S + + E +
Sbjct: 239 WLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEGVLPL 298
Query: 374 PW---LRD-------KPVVLGSRFK-----DAFQDPDAYLAVEPFFEKERYVVTYNPSKG 418
W LR K V LG R D Q + A ++ +Y++ + KG
Sbjct: 299 -WATSLRSTNSKPLHKRVQLGVRVSSLPRLDMLQLLNGVGASS--YKNAKYLLAH--IKG 353
Query: 419 KVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRS 455
V +L +K D+LK+ HA VL + + S S
Sbjct: 354 NVSVILHKDSKPADVLKSYIHAIVLANLMEKSTSFYS 390
>gi|255542032|ref|XP_002512080.1| conserved hypothetical protein [Ricinus communis]
gi|223549260|gb|EEF50749.1| conserved hypothetical protein [Ricinus communis]
Length = 380
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 164/329 (49%), Gaps = 17/329 (5%)
Query: 138 DGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
GFP SVTP YVP+ W L+ + +TQ LL+++GV AT W L+D
Sbjct: 2 QGFPSSVTPDYVPFQVWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRD 61
Query: 198 GAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA 256
G +G ++F QG D + K R DL+ +LG ++L + P F+ + C +++
Sbjct: 62 LTGMLGGILFTFYQGSDLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVVVVCLGSLS 121
Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK---RNP-S 312
++ V S +TR + + FA N D++AK +A + G L +++++ +P +
Sbjct: 122 RSFTGVASGATRAALTQHFALQSNAADISAKEGSQETVATMTGMALGMLLARITIGHPLA 181
Query: 313 LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI-- 370
+ +F L+ +++++Y+ V+ +VL +LN R S+ ++ F++TG V S + + E+I
Sbjct: 182 IWFSFLSLTMFHMYANYRAVRCLVLTSLNMQRSSILLQHFMETGQVLSPEQVSGMEHILP 241
Query: 371 -FSFPW------LRDKPVVLGSRFKDA-FQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
++ W L + LG R Q+ L +K +Y++ KG +
Sbjct: 242 RWTNFWRTKHVKLLHSRMCLGVRVSSLDHQELMELLHSAGSHKKAKYLLM--ERKGTINI 299
Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSN 451
L+ + + D+L++ HA V+ + +N
Sbjct: 300 LIHKNSTTSDVLQSFIHALVMAKLMEENN 328
>gi|403368796|gb|EJY84239.1| hypothetical protein OXYTRI_18022 [Oxytricha trifallax]
Length = 504
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 79/247 (31%), Positives = 128/247 (51%), Gaps = 9/247 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P+SV Y+ + + L AM + Q+L ++G + +A A A W+L
Sbjct: 98 LPVGYPNSVAEGYLKFSIFNNLSALSITAMSFLSAQSLFVAIGSTMTQANLAAAAYTWVL 157
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KDG G++G ++FA R G+ FD D+K+ RF + + + +E+ T P+ FL LA AN
Sbjct: 158 KDGIGQLGAILFASRYGRNFDEDIKKWRFMAMIALNISIYIEILTLRFPNHFLALASIAN 217
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK-----R 309
+ KN+ + S ++R I FAK NIGD++ K + L G G+ +M+SK
Sbjct: 218 IGKNICFLLSAASRASINVQFAKRNNIGDISGKSVSQFTASTLCGVGIGLMLSKLIDISS 277
Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG--HVPSLREGNLQ 367
L F +L+ + +SYQ K + LN R + + K+ + ++ EGN+Q
Sbjct: 278 ISQLYPVFMILTILNIATSYQASKVIDEVYLNNQRAFIIFNEYFKSNKKNFLTVEEGNIQ 337
Query: 368 ENIFSFP 374
E IF P
Sbjct: 338 E-IFYMP 343
>gi|194388018|dbj|BAG65393.1| unnamed protein product [Homo sapiens]
Length = 422
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 88/315 (27%), Positives = 159/315 (50%), Gaps = 11/315 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFPDSV+P Y+PY W +++ F G TQ +L +GV +AT A W++
Sbjct: 32 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 91
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KD +G+++FA +G K D + KQ R D+L ++ +E+ P F +N
Sbjct: 92 KDSTDMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 151
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNP 311
+AK + +V +TR + A+ N+ DV+AK E + N+A LL + L + + P
Sbjct: 152 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 211
Query: 312 SL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ F L+ +++++Y+ V+++V+ TLN R + ++ +L+ G V N E +
Sbjct: 212 GFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 271
Query: 371 FSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQA 428
++ W P S + + ++ E +E Y++ ++ S+ +V +L +A
Sbjct: 272 WTGFW----PAPSLSLGVPLHRLVSSVFELQQLVEGHQESYLLCWDQSQNQVQVVLNQKA 327
Query: 429 KSDDILKAAFHAYVL 443
IL+AA H +L
Sbjct: 328 GPKTILRAATHGLML 342
>gi|321455034|gb|EFX66180.1| hypothetical protein DAPPUDRAFT_302922 [Daphnia pulex]
Length = 315
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 88/268 (32%), Positives = 142/268 (52%), Gaps = 25/268 (9%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
+ S +P G+P SV+ Y+ Y W ++ F G +T+ +L VGV + +TP A
Sbjct: 51 LLSSVFLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAA 110
Query: 190 AINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVEL----ATAAVP- 243
+ W++KDG G VG+++FA G K D D K+ RF DLL + +EL A AAV
Sbjct: 111 TLMWLIKDGTGMVGRILFAWCNGTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGG 170
Query: 244 --HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLL 298
+ C A VAK++ V +TR + + A+ N+ DV+AK E + N+A L+
Sbjct: 171 SNGTMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALI 230
Query: 299 GTGLSIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
T L ++ + +T F L + +VF++ + VK+V + TLNRAR+ + ++ F T
Sbjct: 231 -TSLWLLPILDGATGLTWLMFILFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTK 289
Query: 357 HVPSLREGNLQENIFSFPWLRDKPVVLG 384
+V S++E N R +PV++G
Sbjct: 290 YVASVKEVN-----------RLEPVIIG 306
>gi|449017626|dbj|BAM81028.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 518
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 134/281 (47%), Gaps = 10/281 (3%)
Query: 123 PAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
P +++ ++P +P +V+ Y + W L+H A V TQ++L ++GV
Sbjct: 87 PENWPQRLVQTLLLPADYPSAVSSDYDRFWRWNILRHTLLEAAEVLGTQSMLLALGVGGA 146
Query: 183 RATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAA 241
A P A A WILKDG G K+ A Q + D D K+ RF GD +M LG E+ T A
Sbjct: 147 SALPLAAAWKWILKDGLGYFAKVALATQLAPRVDNDPKRFRFYGDFVMALGTLFEMLTLA 206
Query: 242 VPHLFLPLACAANVAKNVAAV-TSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
P FL LA N+ + A V T + R +Y F+ N GDV++K E + L G
Sbjct: 207 FPPWFLALASTGNLLRKAADVATGPAYRVFLYH-FSIRNNSGDVSSKSESQVVVGRLSGI 265
Query: 301 GLSIMI----SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
G I + S +L+ + LLS G+++++YQ V + L TLNR R + + +
Sbjct: 266 GAGIAVSAVTSHDASALVMAYGLLSAGHLYATYQSVIRLELRTLNRTRLEYVLNVYTASS 325
Query: 357 --HVPSLREGNLQEN-IFSFPWLRDKPVVLGSRFKDAFQDP 394
VPS+ E N E F PW + + + DP
Sbjct: 326 GKRVPSIAEANAAEQKSFLRPWSHVYRIRFVNSLNELLSDP 366
>gi|414586414|tpg|DAA36985.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 288
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/204 (36%), Positives = 114/204 (55%), Gaps = 5/204 (2%)
Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
D +G R SI +Q + ++L+ + + P G+P SV Y+ Y +R L+HF
Sbjct: 40 DAAGSGSR-SISVKVIQDSRPIHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFS 98
Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
+ V +TQ+LL + G+ R TP A A++WILKDG GK+I + G + D + K
Sbjct: 99 SAMLHVLSTQSLLFAAGL---RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKS 155
Query: 221 LRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGEN 280
R D+L +LG +E+ + P LFL +A N AK +A V + +TR PIY +FAK N
Sbjct: 156 WRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGN 215
Query: 281 IGDVTAKGECVGNIADLLGTGLSI 304
+ D+ AKGE + + +++G G I
Sbjct: 216 LSDLFAKGEAISTLFNVMGIGAGI 239
>gi|449452843|ref|XP_004144168.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Cucumis sativus]
Length = 437
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 165/332 (49%), Gaps = 29/332 (8%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W +L+ + +TQ LL+++GV AT W L
Sbjct: 56 VPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVIGATFQWFL 115
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G +G ++F QG D + K R D + +LG ++L + P F+ + C +
Sbjct: 116 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLVSPLFPSAFMLIVCLGS 175
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNP--- 311
++++ V S +TR + + FA N D++AK +A ++G + +++++
Sbjct: 176 LSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGMLLARITMGCP 235
Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
++ +F L+ +++++Y+ V + L +LN R S+A++ +++T V S ++ ++ E++
Sbjct: 236 LAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLSPQQVSMMEHV 295
Query: 371 ---FSFPW-LRDKPVV-----LGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVY 421
++ W LR ++ LG+R + FF + V + KG Y
Sbjct: 296 LPAWASSWSLRSIKLIHTRVRLGARISSLHHTE-----MXAFFNQLMNFVKSSGYKGAKY 350
Query: 422 ALLKDQ----------AKSDDILKAAFHAYVL 443
L++ + A ++DI ++ HA V+
Sbjct: 351 LLVERKETISVIVQKDATANDIFQSFVHALVM 382
>gi|321459639|gb|EFX70690.1| hypothetical protein DAPPUDRAFT_60988 [Daphnia pulex]
Length = 264
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 88/267 (32%), Positives = 142/267 (53%), Gaps = 25/267 (9%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
+ S +P G+P SV+ Y+ Y W ++ F G +T+ +L VGV + +TP A
Sbjct: 1 LSSVFLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAAT 60
Query: 191 INWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVEL----ATAAVP-- 243
+ W++KDG G VG+++FA G K D D K+ RF DLL + +EL A AAV
Sbjct: 61 LMWLIKDGTGMVGRILFAWCNGTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGGS 120
Query: 244 -HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLG 299
+ C A VAK++ V +TR + + A+ N+ DV+AK E + N+A L+
Sbjct: 121 NGTMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALI- 179
Query: 300 TGLSIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
T L ++ + +T F L + +VF++ + VK+V + TLNRAR+ + ++ F T +
Sbjct: 180 TSLWLLPILDGATGLTWLMFFLFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTKY 239
Query: 358 VPSLREGNLQENIFSFPWLRDKPVVLG 384
V S++E N R +PV++G
Sbjct: 240 VASVKEVN-----------RLEPVIMG 255
>gi|195115010|ref|XP_002002060.1| GI17175 [Drosophila mojavensis]
gi|193912635|gb|EDW11502.1| GI17175 [Drosophila mojavensis]
Length = 417
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 135/272 (49%), Gaps = 11/272 (4%)
Query: 108 KRNSIRAISLQSPQAPAEE---LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGA 164
+RN +R + L+ + +E L +++ +P G+PDSV+ Y Y W ++ F
Sbjct: 24 QRNIVR-VPLRGDKLVIQEKFLLFRVLQNIFLPKGYPDSVSDDYAAYQIWDTVQAFCSTI 82
Query: 165 MGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRF 223
G T +L +GV + + WILK+G+G +G++IFA QG + D D K+ R
Sbjct: 83 CGTLCTHAILKGIGVGSENVNAYSATVTWILKEGSGHMGRIIFAWWQGSQLDVDSKKWRL 142
Query: 224 AGDLLMELGAGVEL-ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
D L +L G+E+ PH + C + + K + V +TR + + A N+
Sbjct: 143 RADFLNDLAMGIEIYVLPKYPHFSTQILCGSTLLKAIVGVAGGATRAALTQHHAVRGNLA 202
Query: 283 DVTAKGE----CVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLH 338
DV +K CV IA +G L MI K L T F ++ +++++ + V++V L
Sbjct: 203 DVASKDSSQETCVNLIASFMGLYLLTMI-KSQAMLYTVFYIVVTLHLYANLKAVRAVCLR 261
Query: 339 TLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
T N +R+ + +E F ++ + + ++ N E +
Sbjct: 262 TFNESRYLITLEEFFRSSQMLTPQQVNKLERV 293
>gi|449018513|dbj|BAM81915.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 600
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 104/327 (31%), Positives = 156/327 (47%), Gaps = 40/327 (12%)
Query: 56 AATLLQSESDETTAF---RLVEDQFVDSSLRLICSE-DIDGRRWNYVAERDRSGKFKRNS 111
AA+ L ++ T A R++ Q+ +S R I + D +G+ +R SG R+
Sbjct: 86 AASGLSERTENTDALPEKRVLVQQWSGTSRRRIETYCDYEGK---LSVQRQISGPQVRHG 142
Query: 112 IRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQ 171
+ Q E L + S +P +P SVT SY PY W L + V T
Sbjct: 143 L--------QFSTEPLQRLLASVFLPTNYPHSVTASYAPYAFWHFLHNTLTACNSVLGTT 194
Query: 172 TLLNSVGVSRNRATPGAVAI-------------------NWILKDGAGRVGKMIF-ARQG 211
LL S GV + + + A+ +WILKDG G + ++ F +R
Sbjct: 195 ALLVSAGVVADPSRAASTALTRDPALEHLPTAASASAVTSWILKDGVGHLVRLAFGSRYA 254
Query: 212 KKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPI 271
+FD DLK+LR D+L LG G+EL + + P F+ LA A K +A V STR+ I
Sbjct: 255 HRFDGDLKRLRVLADVLWHLGTGLELVSRSFPDSFIGLASGAYALKGMAHVVFNSTRSTI 314
Query: 272 YKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPSLITTFALLS---CGYVF 326
Y+ A NIGDVTAKG+ A+LLG GLS+ +L T+A+ G ++
Sbjct: 315 YRTVATRSNIGDVTAKGDAQSIAAELLGMSIGLSLYRGFAENNLFWTWAIFGIVVAGQMY 374
Query: 327 SSYQEVKSVVLHTLNRARFSVAVESFL 353
S Q ++ +VL LN R + V++++
Sbjct: 375 SRIQSMRVLVLPLLNLQRAVLLVDAYV 401
>gi|195437813|ref|XP_002066834.1| GK24688 [Drosophila willistoni]
gi|194162919|gb|EDW77820.1| GK24688 [Drosophila willistoni]
Length = 417
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 90/328 (27%), Positives = 160/328 (48%), Gaps = 14/328 (4%)
Query: 125 EELMSF--IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRN 182
E+ M F ++ +P G+PDSV+ Y Y + ++ F G T +L S+GV
Sbjct: 41 EKFMLFRVLQKIFLPKGYPDSVSDDYAAYQIYDTIQAFCSTICGTLCTHAILKSIGVGSE 100
Query: 183 RATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATA 240
+ + WILK+G+G VG+++FA QG + D D K+ R D L +L G+E+
Sbjct: 101 NINAYSATVTWILKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLP 160
Query: 241 AVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
PHL + C + + K V +V +TR+ + + A N+ DV +K +LL +
Sbjct: 161 KYPHLSTQILCGSTMLKAVVSVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLLAS 220
Query: 301 --GLSIMISKRNPS-LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
GL ++ ++P+ L T F L+ +++++ + V++V L T N +R+ +A+E F ++
Sbjct: 221 CVGLYVLTLIKSPAVLYTVFYLVVTLHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSR 280
Query: 358 VPSLREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQD----PDAYLAVEPFFEKERYVVT 412
+ + ++ N E + + + LG K+ + D V F ER+++
Sbjct: 281 MLTPQQVNGMERVTLGQTVSVSLNIRLGLSVKNLVDEYKSSSDIENIVSSFDTHERFIIA 340
Query: 413 YNPSKGKVYALLKDQAKSDDILKAAFHA 440
SK + L + D+LKA F A
Sbjct: 341 --ESKKYLGVYLHFDTRPQDVLKAYFFA 366
>gi|357506015|ref|XP_003623296.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
gi|355498311|gb|AES79514.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
Length = 492
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 114/409 (27%), Positives = 177/409 (43%), Gaps = 36/409 (8%)
Query: 51 GGGGAAATLLQSESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRN 110
GG G + +L TA R + DS LR I + E DRS +
Sbjct: 46 GGEGLSRVILVERYSNGTAKRYIIGD--DSRLRTI------------LIEEDRSTQN--- 88
Query: 111 SIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTT 170
R L SP L ++++++P GFP SV+ Y+ YM + + G T
Sbjct: 89 --RFGVLHSPDKRLSWLPDTVKAFILPAGFPGSVSDDYLQYMLLQFPTNVTGWICHTIVT 146
Query: 171 QTLLNSVGVSRNRATPGAVA---INWILKDGAGRVGKM-IFARQGKKFDYDLKQLRFAGD 226
+LL +VGV T A + I W+ KDG G VG++ I R G FD D KQ R D
Sbjct: 147 SSLLKAVGVGSFSGTTAAASAAAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYAD 206
Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
+ G+ +L T P FLPLA N+ K +A + I FA N+G++ A
Sbjct: 207 FIGSAGSIFDLTTPLYPGYFLPLASLGNLTKAIARGLKDPSSRVIQSHFAISANLGEIAA 266
Query: 287 KGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGY-------VFSSYQEVKSVVLHT 339
K E +A L+G GL I+I P L+ ++ +LS + ++ Y+ + + +T
Sbjct: 267 KEEVWEVVAQLVGLGLGILILD-TPGLVKSYTVLSSTWLIVRLLHLWLRYESLSVLQFNT 325
Query: 340 LNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLA 399
+N R V+S + VP + N +ENI + ++ GS ++
Sbjct: 326 INLKRARTLVKSHVLHSTVPGCMDCNKEENILVWSQFMKPIIIFGSPLDKMDGMERSHFM 385
Query: 400 VE---PFFEKERYVVTYNPSKG--KVYALLKDQAKSDDILKAAFHAYVL 443
VE + E+Y++ N K Y K A S +L++ + + L
Sbjct: 386 VEALLKLYANEKYILVVNQQLDDLKFYVSFKVGATSVSVLRSIWQTFWL 434
>gi|330802843|ref|XP_003289422.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
gi|325080505|gb|EGC34058.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
Length = 552
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 148/297 (49%), Gaps = 20/297 (6%)
Query: 93 RRWNYVAERDRSGK----------FKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPD 142
+NY+ E+++S + R I L++ + + L + + +P G+PD
Sbjct: 9 NNYNYIIEKNQSNSVINKFQVVNPYDDYDDRLILLETKRTKSS-LQNILYDLFLPSGYPD 67
Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRV 202
SVT Y Y W +++ G T+ +L GV + AT + W+++ G +
Sbjct: 68 SVTSDYFRYQFWDSIQALCSSITGTLATRAILKGYGVGNSSATVASATTQWVIQSATGMI 127
Query: 203 GKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVA 260
G++IFA G D + K+ RF D+L ++G E+ + LFLPL+C AK++
Sbjct: 128 GRIIFAWANGTDLDCNSKKWRFMADILNDIGMTFEMISPLFNEALFLPLSCIGLFAKSIC 187
Query: 261 AVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS---KRNPSLITT- 316
V T+ + + FAK +N+ DV+AK LLG LS+ +S + S++ T
Sbjct: 188 GVAGGCTKASLTQHFAKNDNLADVSAKDGSQETAVSLLGMILSVFVSSVLNDDTSILVTW 247
Query: 317 --FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT-GHVPSLREGNLQENI 370
F + + +++ +Y+ V +V L+++N+ R + E +++ G +PS +E + ENI
Sbjct: 248 FIFFIFTGFHLYCNYKAVSAVQLNSINKYRAHLIYEYYIRNQGSIPSPKEISNFENI 304
>gi|340372549|ref|XP_003384806.1| PREDICTED: UPF0420 protein C16orf58-like [Amphimedon queenslandica]
Length = 432
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 164/353 (46%), Gaps = 17/353 (4%)
Query: 119 SPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
+ +A L ++ +P G+P+SV+ Y+ Y W L+ F G T +L VG
Sbjct: 29 TSKARNTRLADTLKEVFLPQGYPNSVSEDYLQYQLWDTLQAFCSYITGTLATHAMLKGVG 88
Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
V + A+P A + WILKDG G G+++FA +G D + K+ R D+L ++ ++L
Sbjct: 89 VGDSSASPVAATLTWILKDGLGMTGRILFAWLRGYDLDCNAKKWRLIADILNDIAICMQL 148
Query: 238 ATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADL 297
+ P FL +AC A++ ++V V +TR + + A+ +N+ DV+AK + +L
Sbjct: 149 VSPFFPSCFLLIACLASITQSVVGVAGGATRAALVQHQARRDNMADVSAKDGSQETLVNL 208
Query: 298 LGTGLSIMISKRNP-------SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVE 350
G + ++I+ SL +F LL +++S+Y+ V V + TLN R + +
Sbjct: 209 CGLLIGLIITPLIAGQTVFVWSLFFSFTLL---HLYSNYKAVSVVSMETLNCNRLHLLMR 265
Query: 351 SFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF-QDPDAYLAVEPFFEKERY 409
+ G + N +E + F R V GS D ++ F + +++
Sbjct: 266 NLFLNGTISEPNIVNREEPLL-FRQERFFTVEYGSSLSSVLIHSSDYSRSILQFNKGQKF 324
Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHA----YVLLHCINSSNGHRSLAL 458
++ + +K ++ + S D LKA +V C SN + AL
Sbjct: 325 IIKLSKTKRQIRVAFHCGSSSSDQLKAGLTVELIEFVCGGCYGDSNYLKDYAL 377
>gi|58392188|ref|XP_319167.2| AGAP010020-PA [Anopheles gambiae str. PEST]
gi|55236239|gb|EAA13864.2| AGAP010020-PA [Anopheles gambiae str. PEST]
Length = 417
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/250 (30%), Positives = 127/250 (50%), Gaps = 7/250 (2%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L F + +P G+PDSV+ Y+ Y W L+ F G TT +L VGV + A P
Sbjct: 45 LKRFFQHLFLPVGYPDSVSSDYLSYQKWDTLQAFCSTISGTLTTHAILKGVGVGSDAANP 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
+ I W+LKDG G +G+++FA +G + D D K+ R D+L ++ ++L P
Sbjct: 105 LSATITWVLKDGTGHLGRILFAWWKGSELDIDSKKWRIRADILNDVAMAIDLFVLPYYPK 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
+ CA K + V +TR+ + + A N+ DV +K CV IA +G
Sbjct: 165 ASTYILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L + + L FA ++ +++++ + VK+V L T N AR+ +A+E + K+G + S
Sbjct: 225 FLLTYL-QNQKVLYGLFAFVTLIHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLS 283
Query: 361 LREGNLQENI 370
++ N E +
Sbjct: 284 PQQVNKLERV 293
>gi|307111229|gb|EFN59464.1| hypothetical protein CHLNCDRAFT_138052 [Chlorella variabilis]
Length = 657
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/213 (33%), Positives = 120/213 (56%), Gaps = 10/213 (4%)
Query: 171 QTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLM 229
++LL +VGV R+ P A INW+LKDG GR+G++ A R G+ FD DLK+ R+A ++
Sbjct: 78 ESLLQAVGVGARRSLPAAATINWVLKDGLGRLGRLTVATRFGESFDSDLKRFRYATSIIY 137
Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
+ +E T P FL +A ANV K++ T +T+ +++F EN+ D++AK +
Sbjct: 138 AVSLSLEFLTPLAPQHFLVMASLANVGKSIGLTTFIATQPAFHRSFCLRENLADISAKTQ 197
Query: 290 CVGNIADLLG----TGLSIMI----SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLN 341
+ D +G GL+ + + R + F LL+ G +++ Y E++S+ L TLN
Sbjct: 198 AQQMVMDNIGLAAAVGLTYLCRHTEAARRALPLVMFPLLAAGDLWAIYSELRSIHLRTLN 257
Query: 342 RARFSVAVESFLKTGHVPSLREGNLQENIFSFP 374
+ R + + +L+ G VPS R+ + +E F P
Sbjct: 258 KERAEIIAQHWLREGRVPSPRQVS-EEERFVLP 289
>gi|358335880|dbj|GAA28290.2| UPF0420 protein C16orf58 [Clonorchis sinensis]
Length = 428
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/237 (31%), Positives = 126/237 (53%), Gaps = 7/237 (2%)
Query: 134 YVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINW 193
Y P G+P+SV+P Y+ Y W ++ G Q++L VGV +AT +++W
Sbjct: 18 YGEPSGYPNSVSPDYLEYQIWDTVQALASSVTGALAAQSVLIGVGVGDVKATILGASLSW 77
Query: 194 ILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA 252
+LKDG+G +GK++F QG D D K+ R D++ + +ELA++ P +F + C
Sbjct: 78 MLKDGSGMIGKIVFVGWQGSNLDCDCKRWRLLADIINDCALFLELASSFFPSVFTFIICL 137
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKR 309
AN+ K+V +V +TR + A N+ DV AK E + N+ L +
Sbjct: 138 ANLCKSVVSVAGGATRAAVTLHQALDNNVADVAAKDGSQETLSNLLALFFNLSVVYFVTG 197
Query: 310 NPSLITT-FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG--HVPSLRE 363
N S+I T F +L+ +V+++Y+ V+ + L NR RF +AV+ + + H PS+ +
Sbjct: 198 NWSIIWTGFVILTFIHVYANYRAVQCLRLSNFNRNRFHIAVQQWFRQRLFHDPSVSD 254
>gi|449461779|ref|XP_004148619.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
gi|449518467|ref|XP_004166263.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 495
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 154/332 (46%), Gaps = 16/332 (4%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGV---SRNR 183
L I+ +++P GFP+SV+ Y+ YM + + G T +LL +VG+ S
Sbjct: 110 LPGLIKDFILPTGFPESVSDDYLQYMIRQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTT 169
Query: 184 ATPGAVAINWILKDGAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
AVAI W+ KDG G VG++ I R G FD D KQ R D + G+ +LAT
Sbjct: 170 TAASAVAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLATPLY 229
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
P FLPLA N+ K VA + I FA N+G++ AK E +A LLG +
Sbjct: 230 PSYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEIAAKEEVWEVVAQLLGLAI 289
Query: 303 SIMISKRNPSLITTFALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
I+I P L+ ++++LS + ++ YQ + + +T+N R + + +
Sbjct: 290 GILILD-TPGLVNSYSVLSATWLSMRLLHLWLRYQSLAVLHFNTINLKRARILARAHVLH 348
Query: 356 GHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF--QDPDAYLAVEPFFEKERYVVTY 413
VP E N +E+I + ++ G ++ + + + + E+Y++
Sbjct: 349 NKVPGTVECNSEESILVWERFTRPSIIFGVPLEEMMGSERSSTVMKLLKLYANEKYILML 408
Query: 414 NPSKG--KVYALLKDQAKSDDILKAAFHAYVL 443
+ KV+ K A S +L++ + Y L
Sbjct: 409 DSQDKDLKVFVSFKVGASSMTVLRSIWQTYWL 440
>gi|387019749|gb|AFJ51992.1| UPF0420 protein C16orf58-like [Crotalus adamanteus]
Length = 437
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 150/323 (46%), Gaps = 38/323 (11%)
Query: 132 RSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI 191
+S +P G+P+SV+ Y+ Y W ++ F G TQ +L VGV +T A A+
Sbjct: 50 KSIFLPQGYPESVSTDYLAYQFWDTIQAFASSITGTLATQAVLKGVGVGDETSTVAAAAV 109
Query: 192 NWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 251
WILKDG G +G++ FA F D+L ++ +E+ A P F + C
Sbjct: 110 TWILKDGTGMLGRIAFAWSKGLF----------ADVLNDVAIFMEIMAPAFPACFTLIVC 159
Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISK 308
+ K + V +TR + A+ +N+ DV+AK E + N+A LL + I +
Sbjct: 160 TSGFFKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLFSLFLIPLVV 219
Query: 309 RNPSLITTF-ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
N L F AL + +++++YQ V++V + TLNRAR + ++ +LK V N Q
Sbjct: 220 DNLPLTYAFYALFTILHLYANYQAVRAVCMETLNRARLHLVLQHYLKREEVLGPAVINPQ 279
Query: 368 ENIFSFPWLRDKPVVLGSR--FKDAFQDP---------DAYLAVEPFFEKERYVVTYNPS 416
E P++LG R K P D A+E Y++ +N
Sbjct: 280 E-----------PLLLGFRQQLKITLGTPLHTVTSSVADFQKALEG--NASNYLIFFNRP 326
Query: 417 KGKVYALLKDQAKSDDILKAAFH 439
G + LL QA S D++KA H
Sbjct: 327 AGVISILLHRQADSVDVIKAYTH 349
>gi|397615116|gb|EJK63231.1| hypothetical protein THAOC_16128 [Thalassiosira oceanica]
Length = 621
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 93/330 (28%), Positives = 149/330 (45%), Gaps = 59/330 (17%)
Query: 141 PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV-GVSRNRATPGAVA--------- 190
P+ VTPSY +M WR L+ F + V TQ+LL + G+ R+ A
Sbjct: 157 PEGVTPSYYRFMRWRILQRFMNANVHVIGTQSLLMGLRGMQRSTVVSAAAKGGAALGAVA 216
Query: 191 -INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
NW+LKD G++ +M +A + G+KFD D K+ R+ + LG G+E++T P FL
Sbjct: 217 ATNWVLKDTLGKLVRMAWASKMGRKFDSDAKRWRYRACFIYALGNGLEVSTYLHPRFFLV 276
Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKG----------------------------EN 280
LA AN K V+ +TS++TR +Y +F + EN
Sbjct: 277 LAMLANSCKQVSMLTSSATRNALYNSFRRMDDDVDTVGTGSNSTLKSSINGGRRRGVVEN 336
Query: 281 IGDVTAKGECVGNIADLLGTGLSIMISK----RNPSLITTFALLSCGYVFSSYQEVKSVV 336
IGD+TAKGE + DLLG I +S+ +++T +++L G ++ Y E++S+V
Sbjct: 337 IGDITAKGEAQIAVVDLLGIASGICLSRAIGVSVQNVLTVWSVLQLGEIYCMYHEIRSIV 396
Query: 337 LHTLNRARFSVAVESFLKT--------------GHVPSLREGNLQENIFSFP-WLRDKPV 381
TLN R + L +P+ + +E +F P L K +
Sbjct: 397 YQTLNFERMYTVLGRLLPVDDESGSANGESCCCDRIPTPDQIAAEEKVFLPPDHLGRKEI 456
Query: 382 VLGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
GS + + + + F ER+ +
Sbjct: 457 AFGSAGRSVLSPGELDVLIRDVFRGERFFL 486
>gi|392564112|gb|EIW57290.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 516
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 167/387 (43%), Gaps = 27/387 (6%)
Query: 79 DSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQ-----APAEELMSFIRS 133
D R + E + GR + + R R I+ P+ AP L I S
Sbjct: 50 DLKSRFVAIERVGGRECHISWSEEAGSSGVRKEWRDIAAVDPEDGKIAAPTNSLTGRISS 109
Query: 134 Y----VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
+ V+P +P SV SY+P+ + + V Q LL SVGVS + GAV
Sbjct: 110 WFGQMVLPTNYPHSVHRSYLPFHILQFFETTVATITSVLCNQALLTSVGVSAEGSVFGAV 169
Query: 190 AINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELAT---AAVPHL 245
A+ WI+KDGAG V K+ F R+ FD K F G+L+ LG+G+++AT A P
Sbjct: 170 AVQWIIKDGAGEVAKLFFIRRFSPYFDSHPKTFTFFGELMGCLGSGLQIATLLIAPSPGN 229
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLS 303
FL A N+ K V +T + F++ N GDV AK E +IA L+G +G+S
Sbjct: 230 FLLCAAGGNIFKIVGNAIWFTTHIKFMRYFSEQGNTGDVAAKDESQSSIAQLVGYASGIS 289
Query: 304 IM-ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
++ +S P L FA ++ + ++ L R S + ++ G V SL
Sbjct: 290 LLAVSHTAPYLYAIFAFAVPVHLAMTAYMMRVATFELLTLPRVSCLAQEYVTRGQVSSLG 349
Query: 363 EGNLQENIFSF-PWLRDKP---VVLGSRFKDAFQDPDAYLAVE-----PFFEKERYVVTY 413
E + F + RDK + L R D V FE +RY++
Sbjct: 350 ELDAAHRTGLFGEFYRDKSDRWLTLAPRVGDVLDSGTELERVRWQICAGVFEDDRYLLF- 408
Query: 414 NPSKGKVYALLKDQAKSDDILKAAFHA 440
P + A +DD+L A +HA
Sbjct: 409 -PRGRNISVFFHPDAANDDMLCAIYHA 434
>gi|345316269|ref|XP_001510147.2| PREDICTED: UPF0420 protein C16orf58 homolog, partial
[Ornithorhynchus anatinus]
Length = 353
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/315 (28%), Positives = 152/315 (48%), Gaps = 13/315 (4%)
Query: 133 SYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAIN 192
S +P GFPDSV+P Y+ Y W ++ F G TQ +L VGV T +
Sbjct: 1 SVFLPRGFPDSVSPDYLSYQLWDTVQAFASSLTGSLATQAVLRGVGVGDGTMTVDSPTGA 60
Query: 193 WILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 251
++K G G +G++ FA ++G K D D K+ R D+L + +E+ A P F C
Sbjct: 61 LLIKHGTGMLGRIGFAWKKGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPAYFTLTVC 120
Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG----TGLSIMIS 307
+++AK + V +TR + A+ N+ DV+AK + +L G L ++S
Sbjct: 121 VSSLAKCIVGVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLMLLPLVS 180
Query: 308 KRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
R + F L + +++++++ V++VV+ TLN +R + +E FL+ G V SL N +
Sbjct: 181 DRPSLCLPLFLLFTSLHLYANFRAVRAVVMETLNESRLRLVLEHFLREGQVLSLASANEK 240
Query: 368 ENIFSFPWLRDKP---VVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
E + W P + LG D + L + ++ Y++ ++ ++G V L
Sbjct: 241 EPL----WTDVGPALTLALGVPLHSLVSDV-SDLQLMKAGHQQAYLLHWDRARGHVQVAL 295
Query: 425 KDQAKSDDILKAAFH 439
A IL+AA H
Sbjct: 296 AQTAGPRAILRAATH 310
>gi|194758785|ref|XP_001961639.1| GF15067 [Drosophila ananassae]
gi|190615336|gb|EDV30860.1| GF15067 [Drosophila ananassae]
Length = 417
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/250 (29%), Positives = 123/250 (49%), Gaps = 7/250 (2%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L ++ +P G+PDSV+ Y Y W + F G T +L +GV
Sbjct: 45 LFRVLQKIFLPKGYPDSVSDDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
+ + WILK+G+G VG++ FA QG + D D K+ R D L +L G+E+ PH
Sbjct: 105 YSATVTWILKEGSGHVGRIFFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
L + C + + K + V +TR + + A N+ DV +K CV +A +G
Sbjct: 165 LSTQILCGSTLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLVASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L +I K L T F ++ +++S+ + V++V L T N +R+ +A+E F ++ + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYSNLKAVRAVCLRTFNESRYLIALEEFFRSSRMLS 283
Query: 361 LREGNLQENI 370
++ N E +
Sbjct: 284 PQQVNAMERV 293
>gi|348675633|gb|EGZ15451.1| hypothetical protein PHYSODRAFT_546234 [Phytophthora sojae]
Length = 408
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 86/258 (33%), Positives = 131/258 (50%), Gaps = 14/258 (5%)
Query: 95 WNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTW 154
W + AE RS SL + + L + +P S+T Y PY W
Sbjct: 83 WTHKAEGLRSAP--------TSLAAARRTGSWLYDVLWDLFLPKDARASLTKDYFPYAKW 134
Query: 155 RALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKK 213
+ A V + Q+LL ++G+ A P A A+NW+LKDG G+ G ++FA +
Sbjct: 135 SFVGSVASCAASVLSMQSLLYAIGLGAG-AIPTAAAVNWVLKDGLGQFGGVLFASIVNNR 193
Query: 214 FDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYK 273
+D D K+ R A + +++ E+ T P FL +A ANVAKNVA +++++TR +
Sbjct: 194 YDADPKRWRVASSVALDVSVLGEILTPLAPGSFLAIASLANVAKNVAWLSASATRAGFHN 253
Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS----LITTFALLSCGYVFSSY 329
+FA EN+ DVTAK + + GTGL I+IS+ S + FA+LS ++ S+Y
Sbjct: 254 SFAIRENLADVTAKAGSQAIASSIFGTGLGILISQFTGSSTLNVAAAFAVLSTVHLISTY 313
Query: 330 QEVKSVVLHTLNRARFSV 347
+ V V L TLN R +
Sbjct: 314 KSVDCVQLRTLNCQRLHI 331
>gi|449542984|gb|EMD33961.1| hypothetical protein CERSUDRAFT_117486 [Ceriporiopsis subvermispora
B]
Length = 505
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 166/369 (44%), Gaps = 34/369 (9%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L ++ R +P +P SV SY+P+ + ++ G + V Q LL SVGVS +
Sbjct: 110 LSAWFRQMFLPTNYPQSVHRSYMPFHVLQFVEGTLGTLVSVLCNQALLTSVGVSAEGSIF 169
Query: 187 GAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV--- 242
GAVA+ WI+KDGAG V K+ F R+ FD K +G++L+ LG+G+++AT V
Sbjct: 170 GAVAVQWIIKDGAGEVAKLFFIRKFSPYFDSHPKTFTLSGEVLVALGSGLQMATLLVNPT 229
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--T 300
P FL A N K + +T + F++ N GDV AK E ++A L G
Sbjct: 230 PLSFLICAAGGNAFKLIGYAIWFTTHIKFIRYFSQQGNTGDVAAKDESQASVAQLAGYAA 289
Query: 301 GLSIMISKRNPS-LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
G+ ++ +PS L T FAL ++ ++ ++ L R S+ E++ + G V
Sbjct: 290 GIGLLTLSHDPSYLYTIFALAVPTHLVATTLMMRCATFELLTIPRLSLLAETYSRKGDVM 349
Query: 360 SLR--EGNLQENIFS--FPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNP 415
L E +F + + K V L R + +D + E+ R+ +
Sbjct: 350 PLSRLEAKRATGMFGEFYGKGQGKLVSLAPRVAEVVKDEEG--------ERARWALCTEV 401
Query: 416 SKGKVYAL--------------LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ- 460
+ + Y L L+ A SDD+L+A HA L + ALK
Sbjct: 402 CRDERYLLYPPAAPQANPASVFLRPDASSDDMLRAVLHAARLRSLLADDIVTLREALKDS 461
Query: 461 HEYDNSNLM 469
HE+ + M
Sbjct: 462 HEWTQEHFM 470
>gi|125986275|ref|XP_001356901.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
gi|195148693|ref|XP_002015302.1| GL19630 [Drosophila persimilis]
gi|54645227|gb|EAL33967.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
gi|194107255|gb|EDW29298.1| GL19630 [Drosophila persimilis]
Length = 417
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 85/325 (26%), Positives = 153/325 (47%), Gaps = 14/325 (4%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L ++ +P G+PDSV+ Y Y W ++ F G T +L +GV
Sbjct: 45 LFRVLQKIFLPKGYPDSVSEDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENINA 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATA-AVPH 244
+ + WI+K+G G VG+++FA +G + D D K+ R D L +L G+E+ PH
Sbjct: 105 YSATVTWIMKEGCGHVGRILFAWWKGCQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
L + C + + K + V +TR+ + + A N+ DV +K CV +A +G
Sbjct: 165 LSTQILCGSTMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L +I K L T F ++ +++++ + V+SV L T N +R+ +A+E + ++ + +
Sbjct: 225 YLLTLI-KSQAVLYTVFYVVVALHLYANLKAVRSVCLRTFNESRYLIALEEYFRSSRMLT 283
Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
++ N E + + V LG K+ + ++E F E +++ +
Sbjct: 284 PQQVNAMERVTLGQTVSVSLNVRLGLSVKNLIDEYKTSSSIENIVSSFDPHEHFIIAESK 343
Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
VY L + + D+LKA F A
Sbjct: 344 KYLGVY--LHFETRQQDVLKAYFFA 366
>gi|320170693|gb|EFW47592.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 710
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 76/214 (35%), Positives = 123/214 (57%), Gaps = 6/214 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SVTP+Y+ Y W+A++ G GV +TQ LL +VG+ A P A A+NWI+
Sbjct: 162 LPKGYPHSVTPNYMGYSRWQAVQSVTGTMTGVLSTQALLYAVGLGAG-AIPLAGALNWIV 220
Query: 196 KDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KDG G++G ++++ KFD D K+ RF + ++ +E+ T P +FL LA +N
Sbjct: 221 KDGLGQLGGVVYSTFISSKFDSDPKRHRFWSNAALQASTLLEILTPLAPGMFLFLASVSN 280
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK---RNP 311
+ KN++ + ++STR ++ + +N+GDVT K L+GTGL I I+ +P
Sbjct: 281 IGKNISWLAASSTRAQMHNSLTLRDNLGDVTGKAGSQAIATSLIGTGLGIAIAPFVGTDP 340
Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
+++ F LS + S+Y+ V + TLN R
Sbjct: 341 IAVLLAFLPLSIVNMVSNYRSNTIVHMRTLNVQR 374
>gi|312082381|ref|XP_003143421.1| hypothetical protein LOAG_07840 [Loa loa]
gi|307761414|gb|EFO20648.1| hypothetical protein LOAG_07840 [Loa loa]
Length = 393
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 86/321 (26%), Positives = 153/321 (47%), Gaps = 11/321 (3%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGA 188
S R +P G+P SV+P Y+ Y W ++ F T+ +L VGV A+ A
Sbjct: 41 SLFRDIFMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALATEAILRGVGVGNETASTMA 100
Query: 189 VAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A+ W+L+DG G + +++FA D D KQ R D +L ++L T P+LF+
Sbjct: 101 AAVAWLLRDGIGMLTRILFAWFYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPNLFM 160
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSI 304
P+ C +++ + V V +TRT + A +N+GDV AK E + NI L + L +
Sbjct: 161 PIVCLSSMVRGVVGVAGGATRTAVVNHQAILDNVGDVAAKDGSQETLINIFALFCSLLLL 220
Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
I N + F + + +++ +Y+ VKS+ TLN++ + V+++ +T + ++ +
Sbjct: 221 PIVNGNICVWLLFCIFTFIHLYGNYRAVKSLQFRTLNQSLLQIIVKNYAETKKIDTVSKV 280
Query: 365 NLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
N +E++ W + G R D D E F ++ V + Y +
Sbjct: 281 NNKESML-LHWNSSRRYY-GCRLSDVVTSSD-----EFSFVCSKFTVICDLRSNYGYVSM 333
Query: 425 KDQAKSDDILKAAFHAYVLLH 445
+ + D L+AA ++L+
Sbjct: 334 ASISDTSDQLRAALCLELMLN 354
>gi|157113953|ref|XP_001652155.1| hypothetical protein AaeL_AAEL006646 [Aedes aegypti]
gi|108877519|gb|EAT41744.1| AAEL006646-PA [Aedes aegypti]
Length = 417
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/247 (30%), Positives = 124/247 (50%), Gaps = 7/247 (2%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
F ++ +P G+PDSV+ Y+ Y W ++ F G TT +L VGV + A P +
Sbjct: 48 FFQNLFLPAGYPDSVSGDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPLSA 107
Query: 190 AINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPHLFL 247
I W+LKDG G G++IFA +G + D D K+ R D+L +L ++L P
Sbjct: 108 TITWVLKDGMGHFGRIIFAWWKGTELDIDSKKWRIRADVLNDLAMSIDLFVLPYYPKAST 167
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLS 303
+ CA K + V +TR+ + + A N+ DV +K CV IA +G L
Sbjct: 168 FILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLFLL 227
Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
+ + L F ++ +++++ + VK+V L T N AR+ +A+E + KTG + S +
Sbjct: 228 THVQDQR-VLYGLFLFVTLMHIYANIKAVKAVCLRTFNEARYLIALEEYFKTGTMLSPEQ 286
Query: 364 GNLQENI 370
N E +
Sbjct: 287 VNKLERV 293
>gi|350646547|emb|CCD58759.1| hypothetical protein Smp_142110 [Schistosoma mansoni]
Length = 423
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 128/243 (52%), Gaps = 20/243 (8%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKH--FFGGAMGVFT-----------TQTLLNSV 177
++ + +P G+P SV+ Y+ Y W ++ FF + +F+ +Q +L V
Sbjct: 41 LKKFFLPVGYPSSVSDDYLEYQIWDTIQAGIFFITKISLFSAFASSITGALASQAVLIGV 100
Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVE 236
GV + AT + ++ W+ KDG+G +G++IFA G K D D K RF D+L + +E
Sbjct: 101 GVGDSSATILSASLTWMFKDGSGMIGRIIFAGYHGIKLDCDCKFWRFVADILNDCALFLE 160
Query: 237 LATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGN 293
+ + +LF PL C ANV K++ V ++TR I + A N+ DV+AK E + N
Sbjct: 161 IISPLFNYLFTPLLCLANVLKSLVGVAGSATRAAIVQHQAINNNLADVSAKDGSQETLSN 220
Query: 294 IADLLGTGLSIMISKRNPSLITTFALLSCG--YVFSSYQEVKSVVLHTLNRARFSVAVES 351
+ L + + + N LI F + C +++S+Y+ VK + L T NR RF +A++
Sbjct: 221 LMAWLLNFILLYMVTGNQFLI-WFCFICCTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQ 279
Query: 352 FLK 354
+ K
Sbjct: 280 WFK 282
>gi|356523306|ref|XP_003530281.1| PREDICTED: UPF0420 protein-like [Glycine max]
Length = 480
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 166/374 (44%), Gaps = 22/374 (5%)
Query: 97 YVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRA 156
++ E DRS R L S L I+ +V+P GFP SV+ Y+ YM +
Sbjct: 65 FLVEEDRS-----TPNRFQDLHSSDESLSWLPEIIKDFVLPAGFPGSVSDDYLDYMLLQF 119
Query: 157 LKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA---INWILKDGAGRVGKM-IFARQGK 212
+ G T +LL +VG+ T A + I W+ KDG G VG++ I R G
Sbjct: 120 PTNVTGWICHTLVTSSLLKAVGIGSFTGTTAAASAAAIRWVSKDGIGAVGRLFIGGRFGS 179
Query: 213 KFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIY 272
FD D KQ R D + G+ +L T P FLPLA N+ K VA + I
Sbjct: 180 LFDDDPKQWRMYADFIGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVIQ 239
Query: 273 KAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGY-------V 325
FA N+G+V AK E +A L+G L I+I P L+ ++ ++S + +
Sbjct: 240 NHFAISGNLGEVAAKEEVWEVVAQLVGLSLGILILD-TPGLVKSYGVISLTWLSMRLLHL 298
Query: 326 FSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGS 385
+ Y+ + + +T+N R + V+S + VP + N +ENI ++ ++ G
Sbjct: 299 WLRYESLSVLQFNTINIKRARILVKSHVLHSTVPGCTDCNREENILAWSQFMKPKIIFGL 358
Query: 386 RFKDAFQDPDAYLAVE---PFFEKERYVVTYNPSKG--KVYALLKDQAKSDDILKAAFHA 440
+ +Y VE + E+Y++ N + Y K A + +L++ + +
Sbjct: 359 PLEKMDGVERSYFMVEALIKLYASEKYILMVNQQTEDLRFYVSFKVGATNVSVLRSVWQS 418
Query: 441 YVLLHCINSSNGHR 454
+ L +S + R
Sbjct: 419 FWLSENWDSDDNVR 432
>gi|417401281|gb|JAA47531.1| Hypothetical protein [Desmodus rotundus]
Length = 457
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 159/322 (49%), Gaps = 20/322 (6%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+S + + +P GFPDSV+P Y+ Y W +++ F G T +L +GV +A+
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A WI+KD G +G++IFA F D+L ++ +E+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGLF----------ADILNDVAMFLEIMAPIYPMCFT 179
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSI 304
C +N+AK V +V +TR + A+ N+ DV+AK E + N+A LL + L +
Sbjct: 180 VTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLML 239
Query: 305 MISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
+ PS + F LL+ +V+++Y+ V+++V+ TLN R + ++ FL+ G V
Sbjct: 240 PLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTS 299
Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVY 421
N E +++ W + LG+ + ++ E +E Y++ ++ S+ +V
Sbjct: 300 ANQMEPLWTGFW-PSLSLSLGAPLHRLI---SSVFELQQLVEGHREPYLLHWDQSQNRVQ 355
Query: 422 ALLKDQAKSDDILKAAFHAYVL 443
+L A + IL+AA H VL
Sbjct: 356 VVLSQMAGPETILRAATHGLVL 377
>gi|195344882|ref|XP_002039005.1| GM17287 [Drosophila sechellia]
gi|194134135|gb|EDW55651.1| GM17287 [Drosophila sechellia]
Length = 417
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 151/325 (46%), Gaps = 14/325 (4%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L ++ +P G+PDSV+ Y Y W + F G T +L +GV
Sbjct: 45 LFRVLQKVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
+ WILK+G+G VG+++FA QG + D D K+ R D L +L G+E+ PH
Sbjct: 105 FSATATWILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
L + C + + K + V +TR+ + + A N+ DV +K CV +A +G
Sbjct: 165 LSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L +I K L T F ++ +++++ + V++V L + N +R+ +A+E F ++ + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLS 283
Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
++ N E + + + LG K+ + + +E F E +++
Sbjct: 284 PQQVNAMERVTVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTK 343
Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
VY L + + D+LKA F A
Sbjct: 344 KCLGVY--LHFETRPQDVLKAYFFA 366
>gi|417401214|gb|JAA47499.1| Hypothetical protein [Desmodus rotundus]
Length = 453
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 159/322 (49%), Gaps = 20/322 (6%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+S + + +P GFPDSV+P Y+ Y W +++ F G T +L +GV +A+
Sbjct: 70 LSGLLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVS 129
Query: 188 AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A WI+KD G +G++IFA F D+L ++ +E+ P F
Sbjct: 130 AATATWIVKDSTGMLGRIIFAWWKGLF----------ADILNDVAMFLEIMAPIYPMCFT 179
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSI 304
C +N+AK V +V +TR + A+ N+ DV+AK E + N+A LL + L +
Sbjct: 180 VTICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLML 239
Query: 305 MISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
+ PS + F LL+ +V+++Y+ V+++V+ TLN R + ++ FL+ G V
Sbjct: 240 PLVSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTS 299
Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVY 421
N E +++ W + LG+ + ++ E +E Y++ ++ S+ +V
Sbjct: 300 ANQMEPLWTGFW-PSLSLSLGAPLHRLI---SSVFELQQLVEGHREPYLLHWDQSQNRVQ 355
Query: 422 ALLKDQAKSDDILKAAFHAYVL 443
+L A + IL+AA H VL
Sbjct: 356 VVLSQMAGPETILRAATHGLVL 377
>gi|328703155|ref|XP_001949028.2| PREDICTED: UPF0420 protein C16orf58 homolog [Acyrthosiphon pisum]
Length = 457
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 105/415 (25%), Positives = 197/415 (47%), Gaps = 45/415 (10%)
Query: 107 FKRNSIRAISLQSPQAPAEELMSFIRSYV-------VPDGFPDSVTPSYVPYMTWRALKH 159
+KR+ R + L+S + ++ F RS V +P GFP+SV+ Y Y W ++
Sbjct: 23 YKRHDQRQLILESERR--KQSTGFTRSSVNILKEVFLPRGFPNSVSKDYTEYQIWDTIQA 80
Query: 160 FFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDL 218
F + T +L VGV +AT A +I +ILKDGAG +G++ FA +G + D
Sbjct: 81 FCSTISNILATNAILKGVGVGDAQATALAASITYILKDGAGMIGRITFAWCKGSELDTQC 140
Query: 219 KQLRFAGDLLMELGAGVELATAA--VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFA 276
K+ R D L +L +EL + + L + ++ AK++ +V +TR + + A
Sbjct: 141 KKWRLRADALNDLAIFIELLLSIPWIRQFSLIILSFSSCAKSIVSVAGGATRAALTQHQA 200
Query: 277 KGENIGDVTAKG----ECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEV 332
+N+GDV+AK C+ IA L+G + ++ R + + +++ ++F++Y+ V
Sbjct: 201 IRDNMGDVSAKDGSQETCINLIAFLIGLIMLPIVENRILLIWMIYLVVTSLHLFANYKAV 260
Query: 333 KSVVLHTLNRARFSVAVESFL----------KTGHVPSLREGNL--QENIFSFPWLRDKP 380
KS+ ++ N AR+ + ++ +L K ++ RE E + SF
Sbjct: 261 KSLNINVFNSARYDLTLKYYLSNDTQNHDVQKPDYINK-REACFLGDEKLCSF------K 313
Query: 381 VVLGSR-----FKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILK 435
+ LG+ + + D +E ++ Y++ + + + +L +++ILK
Sbjct: 314 IQLGTSIHELLYTNTLTTWDIIDHIE-LYKDYLYILIVDTHRDIIRVVLDKNINTENILK 372
Query: 436 AAFHAYVLLHCINSSNGHRSL---ALKQHEYDNSNLMLTTADLEAHIGESCKVVS 487
A FHA +L H I N + +L+ +Y ++N E ++ SC+ V+
Sbjct: 373 AYFHANILGHLICPKNKFSLIKLNSLRSMKYTSTNTQFCCTHSE-YVQLSCEFVN 426
>gi|224002080|ref|XP_002290712.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974134|gb|EED92464.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 656
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 163/363 (44%), Gaps = 67/363 (18%)
Query: 110 NSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFT 169
N A+S S ++ F+ S VP+G VTPSY +M WR L+ F + V
Sbjct: 128 NDTDALSTNSGDTLVQKWTQFLHSAFVPEG----VTPSYYRFMRWRILQRFINANVHVIG 183
Query: 170 TQTLLNSV-GVSRNRATPGAVAI--------------NWILKDGAGRVGKMIFA-RQGKK 213
TQ+LL + G++R+ + A + NW+LKD G++ +M++A + G+K
Sbjct: 184 TQSLLMGLRGMTRSGSVGEAASFAAKEGAALGAAAATNWVLKDTLGKIVRMVWASKMGRK 243
Query: 214 FDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYK 273
FD D K+ RF L+ LG G+E++T P FL LA AN K ++ +TS++TR +Y
Sbjct: 244 FDPDAKRWRFRSALIYALGNGLEVSTYLHPQYFLLLAMLANSCKQMSMLTSSATRNALYN 303
Query: 274 AF---------------AKGEN-------------IGDVTAKGECVGNIADLLGTGLSIM 305
+F A G + IGD+TAKGE + DLLG I
Sbjct: 304 SFKRVGTTSSTVSTSNNASGSSAIVRRGGNGGIENIGDITAKGEAQIAVVDLLGIASGIC 363
Query: 306 ISK----RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL-------K 354
+S+ ++ + +L G +F Y E++S+V T N R + + + K
Sbjct: 364 LSRAVGVSVQNVFAVWVVLQIGEIFCMYHEMRSIVYRTFNFERMYMVLGKLMLDDAADNK 423
Query: 355 TGH-------VPSLREGNLQENIFSFP-WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEK 406
G +PS + E +F P L + + GS + + + V F
Sbjct: 424 GGDSSSTKNLIPSPEQIAANEKVFLPPDHLARRAIAFGSPGRSTLDPAELDVLVNDIFRG 483
Query: 407 ERY 409
E+Y
Sbjct: 484 EKY 486
>gi|281211285|gb|EFA85450.1| DUF647 family protein [Polysphondylium pallidum PN500]
Length = 427
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 97/165 (58%), Gaps = 1/165 (0%)
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
+E+L ++ +P G+P+SV+ Y Y W +++ G + +T LL SVGV +
Sbjct: 99 SEKLYETFKTLFLPSGYPESVSQDYASYQRWIFVQNTLGSVTYMLSTHALLTSVGVGLSA 158
Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
+ P + AI+W+LKDG G ++FA + D+DLK+ +F GD L G +E T +
Sbjct: 159 SLPFSAAISWVLKDGLGASALVLFASKYSTSLDFDLKRFKFRGDFLHNFGVFLETCTPFL 218
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 287
P FL +A +N+AK +A + STR ++K+FA ENIGD+TAK
Sbjct: 219 PGYFLLMASISNLAKGIAGLIYGSTRASLHKSFALKENIGDITAK 263
>gi|409048331|gb|EKM57809.1| hypothetical protein PHACADRAFT_251666 [Phanerochaete carnosa
HHB-10118-sp]
Length = 532
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 110/405 (27%), Positives = 173/405 (42%), Gaps = 49/405 (12%)
Query: 108 KRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGV 167
+ + +A++ S +P L +++R +P +P SV PSY+ + + ++ G +GV
Sbjct: 87 RNDDTKAVTASSRFSPLGRLTAWLRQMFLPTNYPQSVHPSYLKFHLLQFMECIVGTVVGV 146
Query: 168 FTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKM-IFARQGKKFDYDLKQLRFAGD 226
Q LL SVGVS + GAVA+ W++KDGAG V K+ + R FD K F G
Sbjct: 147 LCNQALLVSVGVSAEGSILGAVAVQWVIKDGAGEVAKLWVIRRFSPYFDSHPKTFTFTGA 206
Query: 227 LLMELGAGVELATAAVPHL---FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGD 283
+ LG+G+++A VP FL A N+ K V +T + FA+ N+GD
Sbjct: 207 TMGLLGSGLQIAALVVPPTTLNFLLCAAGGNIFKCVGGAVWLTTHIKFVRFFARQGNMGD 266
Query: 284 VTAKGECVGNIADL----LGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHT 339
V AK E G++ L G GL + +S P L T FAL ++ + +++
Sbjct: 267 VAAKSESQGSVGILSGYAAGVGL-LTVSHSAPYLYTIFALAIPIHLLITQWMLRAATFEL 325
Query: 340 LNRARFSVAVESFLKTG-------HVPSLREGNLQENIFSF-PWLRDK------------ 379
L R S F +P+L E Q I F + + K
Sbjct: 326 LTLPRLSFLAREFASRASRPEAERELPTLGELERQRAIGVFGEYFKTKEDQYVQIAPDLD 385
Query: 380 ----PVVLGSRFK-----DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKS 430
P LG+R + +AF D +P+ Y S+ K+ LL A
Sbjct: 386 AVVSPTDLGARTRWDACVEAFND-------QPYL---LYPYLMQQSECKIAVLLHPSATV 435
Query: 431 DDILKAAFH-AYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTAD 474
DD +++ H A++ H N + + Q DN ++ D
Sbjct: 436 DDRIQSVLHAAFLQRHLHNVMQIDPAESTPQELLDNDKVLKAALD 480
>gi|290999547|ref|XP_002682341.1| predicted protein [Naegleria gruberi]
gi|284095968|gb|EFC49597.1| predicted protein [Naegleria gruberi]
Length = 261
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/240 (32%), Positives = 125/240 (52%), Gaps = 18/240 (7%)
Query: 133 SYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG----- 187
S +P +P SV+ +Y+ Y W+AL++ G GV T Q+LL++ G++ A G
Sbjct: 3 SVFLPKNYPLSVSSNYINYCKWQALQYTAGSFSGVLTMQSLLHAAGITSQLALGGNEYFL 62
Query: 188 ---AVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV- 242
A+ W++KDG G++G ++FA + FD D K R G+ + A +E+ T
Sbjct: 63 AFLGGALAWVIKDGLGQLGGILFASKVNTNFDADPKFWRMGGEYALVASALLEVTTPITG 122
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
P F+ A AN+ KNV+ +++TR + ++FAK +N+ DVTAK L+GT L
Sbjct: 123 PSWFIAQASLANIGKNVSCFAASATRAAMNQSFAKSDNLADVTAKATSQALACSLIGTTL 182
Query: 303 SIMISKR--------NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
I +S + F +LS +FS Y+ V +V L LN+ RF +A +LK
Sbjct: 183 GIFVSSALSLGTQFVFAKVFPIFCVLSVIQLFSLYKAVSNVKLKVLNKQRFLIACSQYLK 242
>gi|24585018|ref|NP_609897.2| CG10338 [Drosophila melanogaster]
gi|22946770|gb|AAF53692.2| CG10338 [Drosophila melanogaster]
gi|384475974|gb|AFH89820.1| FI20169p1 [Drosophila melanogaster]
Length = 417
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 85/325 (26%), Positives = 150/325 (46%), Gaps = 14/325 (4%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L ++ +P G+PDSV+ Y Y W + F G T +L +GV
Sbjct: 45 LFRVLQKVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
+ WILK+G+G VG+++FA QG + D D K+ R D L +L G+E+ PH
Sbjct: 105 FSATATWILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
+ C + + K + V +TR+ + + A N+ DV +K CV +A +G
Sbjct: 165 FSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L +I K L T F ++ +++++ + V++V L + N +R+ +A+E F ++ + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLS 283
Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
++ N E + + + LG K+ + + +E F E +++
Sbjct: 284 PQQVNAMERVTVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTK 343
Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
VY L + + D+LKA F A
Sbjct: 344 KCLGVY--LHFETRPQDVLKAYFFA 366
>gi|195579948|ref|XP_002079818.1| GD24150 [Drosophila simulans]
gi|194191827|gb|EDX05403.1| GD24150 [Drosophila simulans]
Length = 417
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 85/325 (26%), Positives = 150/325 (46%), Gaps = 14/325 (4%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L ++ +P G+PDSV+ Y Y W + F G T +L +GV
Sbjct: 45 LFRVLQKVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
+ WILK+G+G VG+++FA QG + D D K+ R D L +L G+E+ PH
Sbjct: 105 FSATATWILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
+ C + + K + V +TR+ + + A N+ DV +K CV +A +G
Sbjct: 165 FSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L +I K L T F ++ +++++ + V++V L + N +R+ +A+E F ++ + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLS 283
Query: 361 LREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PFFEKERYVVTYNP 415
++ N E + + + LG K+ + + +E F E +++
Sbjct: 284 PQQVNAMERVTVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTK 343
Query: 416 SKGKVYALLKDQAKSDDILKAAFHA 440
VY L + + D+LKA F A
Sbjct: 344 KCLGVY--LHFETRPQDVLKAYFFA 366
>gi|195484210|ref|XP_002090597.1| GE13200 [Drosophila yakuba]
gi|194176698|gb|EDW90309.1| GE13200 [Drosophila yakuba]
Length = 417
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 70/250 (28%), Positives = 123/250 (49%), Gaps = 7/250 (2%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L ++ +P G+PDSV+ Y Y W + F G T +L +GV
Sbjct: 45 LFRVLQKVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
+ WILK+G+G VG+++FA QG + D D K+ R D L +L G+E+ PH
Sbjct: 105 FSATATWILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
+ C + + K + V +TR+ + + A N+ DV +K CV +A +G
Sbjct: 165 FSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L +I K L T F ++ +++++ + V++V L + N +R+ +A+E F ++ + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLS 283
Query: 361 LREGNLQENI 370
++ N E +
Sbjct: 284 PQQVNAMERV 293
>gi|194880075|ref|XP_001974360.1| GG21126 [Drosophila erecta]
gi|190657547|gb|EDV54760.1| GG21126 [Drosophila erecta]
Length = 417
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 70/250 (28%), Positives = 123/250 (49%), Gaps = 7/250 (2%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L ++ +P G+PDSV+ Y Y W + F G T +L +GV
Sbjct: 45 LFRVLQKVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINA 104
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPH 244
+ WILK+G+G VG+++FA QG + D D K+ R D L +L G+E+ PH
Sbjct: 105 FSATATWILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPH 164
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGT 300
+ C + + K + V +TR+ + + A N+ DV +K CV +A +G
Sbjct: 165 FSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGL 224
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
L +I K L T F ++ +++++ + V++V L + N +R+ +A+E F ++ + S
Sbjct: 225 YLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLS 283
Query: 361 LREGNLQENI 370
++ N E +
Sbjct: 284 PQQVNAMERV 293
>gi|218192328|gb|EEC74755.1| hypothetical protein OsI_10514 [Oryza sativa Indica Group]
Length = 479
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 163/363 (44%), Gaps = 48/363 (13%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W L+ + +TQ LL ++GV AT W L
Sbjct: 71 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 130
Query: 196 KDGAGRVGKMIFA------------RQ-----------------GKKFDYDLKQLRFAGD 226
+D G +G ++F RQ G D + K R D
Sbjct: 131 RDLTGMLGGILFTFYQVSNSLANSNRQCHRIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 190
Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
+ +LG ++L + P + + C +++++ V S +TR + + FA N D++A
Sbjct: 191 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 250
Query: 287 K---GECVGNIADLLGTGLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
K E + + + L +++ + + +F L+ +++++Y+ V+S+ L TLN
Sbjct: 251 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 310
Query: 343 ARFSVAVESFLKTGHVPSLREGNLQENIFSF--PW-------LRDKPVVLGSRFKDAFQD 393
R S+ ++ F+ G V + ++ + QE+I F W L + V LG++
Sbjct: 311 ERSSIMLQYFMDNGEVLTPQQVSKQEHILPFWSSWRKFLRIKLPHEHVHLGAK-ASRLTH 369
Query: 394 PD--AYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCIN-SS 450
PD P++ E Y + +G V+ + QA + DIL + H VL H + S
Sbjct: 370 PDLLQIAKTRPYYRNENYFLL--DKEGSVHIFIHKQAAATDILMSFIHGLVLAHLMQKSK 427
Query: 451 NGH 453
+GH
Sbjct: 428 SGH 430
>gi|302766892|ref|XP_002966866.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
gi|300164857|gb|EFJ31465.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
Length = 265
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 135/268 (50%), Gaps = 15/268 (5%)
Query: 107 FKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMG 166
F+R R I AP ++I +P+G+P SV+ Y+ + TW L+
Sbjct: 5 FRREKTRRI------AP----WTWIVLAFLPEGYPSSVSSDYLAFQTWDTLQGLSTYIRS 54
Query: 167 VFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAG 225
+ +TQ LL+ +GV AT W L+D G VG ++F QG D KQ R A
Sbjct: 55 MLSTQALLSGIGVGAATATAIGATFQWFLRDLTGMVGGIVFTLYQGSNLDNRAKQWRLAA 114
Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
D + ++G ++L + P F+ C + A+ V V +TR + + FA+ +N DV+
Sbjct: 115 DFMNDIGMLMDLVSPLFPRAFMFFLCLGSAARAVTGVAGGATRAALTQHFARKQNAADVS 174
Query: 286 AKGECVGNIADLLGTGLSIMISKRNPSLI----TTFALLSCGYVFSSYQEVKSVVLHTLN 341
AK A L+G + + +++ + I T+F L+ +V+++Y+ V ++ L ++N
Sbjct: 175 AKEGSQETAATLVGMIVGMFLARLTANNIVLMWTSFMSLTAFHVYANYKAVCALCLTSIN 234
Query: 342 RARFSVAVESFLKTGHVPSLREGNLQEN 369
R ++ ++SFLK G PS E + QE+
Sbjct: 235 AERMAIVLQSFLKDGKAPSPEEASAQES 262
>gi|449489204|ref|XP_004158245.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 435
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 165/327 (50%), Gaps = 21/327 (6%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W +L+ + +TQ LL+++GV AT W L
Sbjct: 56 VPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVIGATFQWFL 115
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAA 253
+D G +G ++F QG D + K R D + +LG V+ + + P F+ + C
Sbjct: 116 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMYVDGPVSPLFPSAFMLIVCLG 175
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNP-- 311
+++++ V S +TR + + FA N D++AK +A ++G + +++++
Sbjct: 176 SLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGMLLARITMGC 235
Query: 312 --SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
++ +F L+ +++++Y+ V + L +LN R S+A++ +++T V S ++ ++ E+
Sbjct: 236 PLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLSPQQVSMMEH 295
Query: 370 I---FSFPW-LRDKPVV-----LGSRFKDAFQDPDAYL----AVEPFFEKERYVVTYNPS 416
+ ++ W LR ++ LG+R L ++E+Y++
Sbjct: 296 VLPAWASSWSLRSIKLIHTRVRLGARISSLHHTEIKELLHLAGANNNDKQEKYLLV--ER 353
Query: 417 KGKVYALLKDQAKSDDILKAAFHAYVL 443
K + +++ A ++DI ++ HA V+
Sbjct: 354 KETISVIVQKDATANDIFQSFVHALVM 380
>gi|326427989|gb|EGD73559.1| hypothetical protein PTSG_05267 [Salpingoeca sp. ATCC 50818]
Length = 404
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 80/237 (33%), Positives = 125/237 (52%), Gaps = 6/237 (2%)
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
++ L + + +P + SV+P ++ Y W+ + A GV + Q+LL +VGV
Sbjct: 109 SQPLRTRLEHVFLPRDYRASVSPKFLDYCKWQFVHMTTMTASGVLSMQSLLYAVGVGAG- 167
Query: 184 ATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
+ P A AINWILKDG G++G M++ G +FD D K+ RF L +++ V++ T
Sbjct: 168 SVPLAAAINWILKDGLGQLGGMLYGSIFGTRFDDDPKRQRFNAVLSLQVSGIVDIITPLF 227
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
PH FL LA +N KNV+ + S++TR + +F + N+GDVTAK A + GTGL
Sbjct: 228 PHHFLLLASVSNFGKNVSYLASSATRAQMNLSFTRTSNLGDVTAKMTSQSIAASVFGTGL 287
Query: 303 SIMISKRNPS----LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
I++SK + L+ + LS + +Y K V L T N R + + T
Sbjct: 288 GILVSKVTGTEAALLMAAYVPLSLVSIGGNYYSSKFVSLKTFNVQRAEILARHYFTT 344
>gi|357127303|ref|XP_003565322.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
Length = 486
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 95/329 (28%), Positives = 150/329 (45%), Gaps = 16/329 (4%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
I +V+P GFP SV+ Y+ YM W+ + G T +LL +VGV T A +
Sbjct: 95 ITDFVLPAGFPGSVSDDYLQYMLWQFPTNVTGWICHTLVTSSLLKAVGVGSFTGTSAAAS 154
Query: 191 ---INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
I W+ KDG G G+++ R G FD D K+ R D + +G+ EL T P F
Sbjct: 155 AAAIRWVSKDGIGAFGRLLIGGRFGTLFDNDPKKWRMYADFIGSVGSIFELTTPLYPGYF 214
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI 306
LPLA N+AK V + I FAK N+G++ AK E G A LLG + ++I
Sbjct: 215 LPLASLGNLAKAVGRGFRDPSNRVIQNHFAKSGNLGEIAAKEEVWGVGAQLLGLSIGVLI 274
Query: 307 ------SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
P+L T+ + +++ YQ + + T+N R + V S + VP
Sbjct: 275 LDTSGVQSSYPTLTLTWLGVRLLHLWFRYQSLIVLKFRTVNLKRARILVRSHVAHHTVPG 334
Query: 361 LREGNLQENIFSFPWLRDKPVVLG---SRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSK 417
N +ENI ++ + G R + D + ++ E Y++ + S+
Sbjct: 335 YVACNEEENILTWERFLQPQISFGVPMERMLGGGEPSDMVSRLLKLYKNEMYILFVDQSE 394
Query: 418 GKVYALL---KDQAKSDDILKAAFHAYVL 443
K A + K+ A S +L++ + A+ L
Sbjct: 395 SKEPAFVVTFKEAATSMSVLRSLWQAHWL 423
>gi|212274801|ref|NP_001130544.1| uncharacterized protein LOC100191643 [Zea mays]
gi|194689438|gb|ACF78803.1| unknown [Zea mays]
gi|223974155|gb|ACN31265.1| unknown [Zea mays]
gi|413956565|gb|AFW89214.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 442
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 87/323 (26%), Positives = 152/323 (47%), Gaps = 17/323 (5%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W L+ + +TQ LL ++GV AT W L
Sbjct: 62 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATFQWFL 121
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G +G ++F QG D + K R D + +LG ++L + P + + C +
Sbjct: 122 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 181
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGTGLSIMISKRNP 311
++++ V S +TR + + FA N D++AK E + + + L I++ +
Sbjct: 182 LSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLGMLLAQITRGHA 241
Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
S+ +F L+ +++++Y+ V+S+ L TLN R S+ ++ F + G V R+ + QE+I
Sbjct: 242 LSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVLVPRKVSQQEHI 301
Query: 371 FSFPW----------LRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKV 420
W L + V LG++ D + + + E +G V
Sbjct: 302 LP-SWSNWRKLNRIKLPHERVHLGAK-ASMLTHSDMLVIAKTRYHYENANYFLLDKQGIV 359
Query: 421 YALLKDQAKSDDILKAAFHAYVL 443
Y + +A D+L++ H VL
Sbjct: 360 YVFIHKEATPADVLRSFVHGLVL 382
>gi|222624448|gb|EEE58580.1| hypothetical protein OsJ_09902 [Oryza sativa Japonica Group]
Length = 479
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 163/363 (44%), Gaps = 48/363 (13%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W L+ + +TQ LL ++GV AT W L
Sbjct: 71 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 130
Query: 196 KDGAGRVGKMIFA------------RQ-----------------GKKFDYDLKQLRFAGD 226
+D G +G ++F RQ G D + K R D
Sbjct: 131 RDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 190
Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
+ +LG ++L + P + + C +++++ V S +TR + + FA N D++A
Sbjct: 191 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 250
Query: 287 K---GECVGNIADLLGTGLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
K E + + + L +++ + + +F L+ +++++Y+ V+S+ L TLN
Sbjct: 251 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 310
Query: 343 ARFSVAVESFLKTGHVPSLREGNLQENIFSF--PW-------LRDKPVVLGSRFKDAFQD 393
R S+ ++ F+ G V + ++ + QE+I + W L + V LG++
Sbjct: 311 ERSSILLQYFMDNGEVLTPQQVSKQEHILPYWSSWRKFLRIKLPHEHVHLGAK-ASRLTH 369
Query: 394 PD--AYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCIN-SS 450
PD P++ E Y + +G V+ + QA + DIL + H VL H + S
Sbjct: 370 PDLLQIAKTRPYYRNENYFLL--DKEGSVHIFIHKQAAATDILMSFIHGLVLAHLMQKSK 427
Query: 451 NGH 453
+GH
Sbjct: 428 SGH 430
>gi|170582654|ref|XP_001896227.1| cDNA sequence BC017158 [Brugia malayi]
gi|158596613|gb|EDP34929.1| cDNA sequence BC017158, putative [Brugia malayi]
Length = 389
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 75/247 (30%), Positives = 132/247 (53%), Gaps = 5/247 (2%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
+R +P G+P SV+P Y+ Y W ++ F +T+ +L VGV A+ A
Sbjct: 37 LLRDIFMPRGYPQSVSPDYMNYQMWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAA 96
Query: 190 AINWILKDGAGRVGKMIFARQGKKF-DYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
A+ W+LKDG G + +++FA + D D KQ R D +L ++L T P LF+P
Sbjct: 97 AVAWLLKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPSLFMP 156
Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNI-ADLLGTGLSI 304
+ C +++ + V V ++TRT + A +N+GDV AK E + N+ A L L
Sbjct: 157 IICLSSMIRAVVGVAGSATRTTVTNHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLP 216
Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
++S+ S+ F+L + +++ +Y+ VKS+ TLN++ + V+ +++T + ++ E
Sbjct: 217 VVSRNIVSVWLLFSLFTFIHLYGNYRAVKSLQFRTLNQSLLRIVVKDYIETRKISTVSEM 276
Query: 365 NLQENIF 371
N +E I
Sbjct: 277 NSKEPIL 283
>gi|320164887|gb|EFW41786.1| GTP-binding protein [Capsaspora owczarzaki ATCC 30864]
Length = 436
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 80/254 (31%), Positives = 129/254 (50%), Gaps = 14/254 (5%)
Query: 110 NSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFT 169
N +R + + P+A +++ +PDG+PDSV+ Y Y W L+ F G
Sbjct: 29 NLLRLVDEKHPEA--SSFAQLLKAVFLPDGYPDSVSADYASYQRWDTLQAFCSSINGSLA 86
Query: 170 TQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLL 228
T +L VGV AT A A++W+L+DG G +G++ FA Q G D + K+ R D+
Sbjct: 87 TLAVLKGVGVGDETATATAAAVSWMLRDGVGMLGRIFFAWQKGPALDANAKRWRLVADIF 146
Query: 229 MELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG 288
+L VEL + V FL AC +V +++ V +TR I + A+ N GDV+AK
Sbjct: 147 NDLAMFVELLSPLVGEWFLAFACLGSVLRSIVGVAGGATRAAITQHQARRNNHGDVSAKD 206
Query: 289 ---ECVGNIADLL-GTGLSIMISKRNPSLITTFALLSCGY----VFSSYQEVKSVVLHTL 340
E + N+A LL L + +P+L+ LL C + +F++Y+ V++ V T
Sbjct: 207 GSQETLVNLAALLVSLWLLPALGSDHPTLVW---LLFCSFTSLHLFANYRGVRATVFDTF 263
Query: 341 NRARFSVAVESFLK 354
NR R + +++
Sbjct: 264 NRERLFIVASDYMQ 277
>gi|167517110|ref|XP_001742896.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163779520|gb|EDQ93134.1| predicted protein [Monosiga brevicollis MX1]
Length = 272
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 74/198 (37%), Positives = 116/198 (58%), Gaps = 10/198 (5%)
Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLR 222
A GV + Q LL + G+ + P A AINW+LKDG G++G +++ G +FD D K+ R
Sbjct: 3 ASGVLSMQALLYAAGLGAG-SVPLAAAINWVLKDGLGQLGGVLYGTLFGPRFDRDPKRQR 61
Query: 223 FAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
F + ++ EL VPHLFLPLA AAN+AKN++ + S++TR ++ +F + N+G
Sbjct: 62 FWSLVALQGANLTELLIPLVPHLFLPLASAANIAKNISFLASSATRAQMHASFVRRANLG 121
Query: 283 DVTAK--GECVGNIADLLGTGLSIMISKRNPS----LITTFALLSCGYVFSSYQEVKSVV 336
D+TAK + +G A ++GT L I +S S +I F L+ ++ ++ K VV
Sbjct: 122 DITAKIGAQSIG--ASVVGTALGIFVSTHTGSDVTNIIPAFVPLALACLYFNHLSSKHVV 179
Query: 337 LHTLNRARFSVAVESFLK 354
L TLN R +A+ +L+
Sbjct: 180 LSTLNEQRLELAMHPWLQ 197
>gi|413956564|gb|AFW89213.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 485
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 89/334 (26%), Positives = 156/334 (46%), Gaps = 17/334 (5%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
E + S + VP+GFP SVTP YVP+ W L+ + +TQ LL ++GV A
Sbjct: 51 ERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSA 110
Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
T W L+D G +G ++F QG D + K R D + +LG ++L + P
Sbjct: 111 TVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFP 170
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGT 300
+ + C +++++ V S +TR + + FA N D++AK E + + +
Sbjct: 171 SSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLG 230
Query: 301 GLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
L I++ + S+ +F L+ +++++Y+ V+S+ L TLN R S+ ++ F + G V
Sbjct: 231 MLLAQITRGHALSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVL 290
Query: 360 SLREGNLQENIFSFPW----------LRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERY 409
R+ + QE+I W L + V LG++ D + + + E
Sbjct: 291 VPRKVSQQEHILP-SWSNWRKLNRIKLPHERVHLGAK-ASMLTHSDMLVIAKTRYHYENA 348
Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+G VY + +A D+L++ H VL
Sbjct: 349 NYFLLDKQGIVYVFIHKEATPADVLRSFVHGLVL 382
>gi|312374874|gb|EFR22346.1| hypothetical protein AND_15402 [Anopheles darlingi]
Length = 417
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 124/247 (50%), Gaps = 7/247 (2%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
F + +P G+PDSV+ Y+ Y W ++ F G TT +L VGV + T +
Sbjct: 48 FFQHLFLPVGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDSVTALSA 107
Query: 190 AINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL-ATAAVPHLFL 247
+ W+LKDG G +G+++FA +G + D D K+ R D+L ++ ++L P
Sbjct: 108 TVTWVLKDGMGHLGRILFAWWKGTELDIDSKKWRIRADVLNDIAMAIDLFVLPYYPKAST 167
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLS 303
+ C K + V +TR+ + + A N+ DV +K CV IA +G L
Sbjct: 168 YILCGTTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLLLL 227
Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
+ + L FA ++ +++++ + VK+V L T N AR+ +A+E + K+G + S R+
Sbjct: 228 TYLQNQRI-LYGLFAFVTFLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSPRQ 286
Query: 364 GNLQENI 370
N E +
Sbjct: 287 VNKLERV 293
>gi|302847291|ref|XP_002955180.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
nagariensis]
gi|300259472|gb|EFJ43699.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
nagariensis]
Length = 1055
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 86/292 (29%), Positives = 143/292 (48%), Gaps = 13/292 (4%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
++ +PD P VTP Y ++ WR + FF M F+ LL + G+ +A + A
Sbjct: 286 LKEAFLPD--PRDVTPDYWEWLRWRLTQRFFSSTMQNFSFSALLMATGLGAKKAFAASAA 343
Query: 191 INWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
INW+LKDG R+ +M + G+ FD DLK++RF L+ E AT P F+ L
Sbjct: 344 INWLLKDGVSRIVRMSVSTSFGQTFDADLKRMRFITSLIFTACMAGEFATPFYPQHFVAL 403
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS-- 307
A ++V + V +T+ A A G N+ D+T+K + + D+L G+S ++
Sbjct: 404 ASISSVGRAVGLSAFVATQPAFQAALATGGNLADLTSKNQAQHMVMDMLALGVSAGLTWL 463
Query: 308 ----KRNPSLI--TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
R L+ + L + G + + E+K+V L TLNR R + +E +++ G VP
Sbjct: 464 CRGMPRGGLLLPAVMYPLCAAGDLTCIWHELKAVQLRTLNRERAEMLIERWMRRGAVPDA 523
Query: 362 REGNLQENIF--SFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVV 411
E + EN+ S W P+ +G + A +P+ + ++ E YV+
Sbjct: 524 AEISAAENLILPSDVWRGLLPLRIGPLDRIAPANPEQLRKLLREYDGEEYVL 575
>gi|242019564|ref|XP_002430230.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515330|gb|EEB17492.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 527
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/366 (25%), Positives = 157/366 (42%), Gaps = 35/366 (9%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+ + S I+ +P G+P SV+ Y+ Y W + F G T + ++ +GV N A
Sbjct: 34 QHIKSLIKDIFLPKGYPYSVSSDYLEYQIWDFCQAFCSSICGTLTVRAVMKGIGVGSNVA 93
Query: 185 TPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
TP + A+ WI+KDG G +G ++FA +G K D D K+ R D+L ++ +EL+ +
Sbjct: 94 TPMSAALTWIIKDGTGMIGSIMFAWLKGSKLDSDCKKWRLFADILNDIAMFIELSIPIIM 153
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
+ + CAA + K + +STR I A N DV+AK +LLG
Sbjct: 154 DYSMYMLCAATIFKALVGTAGSSTRAAIITHQAVKGNTADVSAKDGSQELFTNLLGFIFG 213
Query: 304 IMISKRNPSLITTFALLSCGYVFS---SYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
I + K + ++ +F +Y+ V+ + ++ N R + + S+L T VP+
Sbjct: 214 IFLLKNVNESNQVWIVMCTATIFHLIFNYKAVRCLKINVFNGERLRLTLLSYLPTSCVPT 273
Query: 361 LREGNLQENIFSFPWLRDKPVVLGSR--------FKDAF------------QDPDAYLAV 400
+ + N R +PV+LG FK F +
Sbjct: 274 IEQVN-----------RREPVILGCGTSDYQLCGFKIKFGVSVNKAVKSEKMTNQQLSTL 322
Query: 401 EPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ 460
F+ ++Y + K + + D+LKA FHA + I+ N + L + +
Sbjct: 323 ANVFQSKKYFIIIQLKKKIINVAFRKHFSDIDVLKAYFHAIKMGIAISIINSNFELQIIK 382
Query: 461 HEYDNS 466
NS
Sbjct: 383 RSRKNS 388
>gi|357612137|gb|EHJ67828.1| hypothetical protein KGM_02553 [Danaus plexippus]
Length = 525
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 151/335 (45%), Gaps = 13/335 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P+SV+ Y+ Y W + F G+ TQ + VGV A+P A + W+
Sbjct: 52 LPQGYPNSVSKDYIFYQIWDTAQAFCSTITGILATQEVFRGVGVGDTNASPLAATVTWVF 111
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KDG G +GK++FA G D K+ R D L + +E+A + C +
Sbjct: 112 KDGCGHIGKILFAYTHGTYLDAYSKKWRLYADTLNDAAMCIEIALPLFKNYITFALCVST 171
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI 314
K + V +TR + + A N+ DV+AK +L+ + ++ + + +
Sbjct: 172 CMKAIVGVAGGATRVAMTQHHALRGNLADVSAKDSAQETAVNLIASFAALFLISLIGNSV 231
Query: 315 TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF- 373
T F +L ++ +Y V++V L TLN RF ++++L+ + + E N E I +
Sbjct: 232 TIFIILLIMHIVFNYMAVRAVCLRTLNEPRFLQVIDTYLRKEVIANPCEINRNEPIIFYQ 291
Query: 374 --PWLRDKP-----VVLGSRFKDAF--QDPDAYL-AVEPFFEKERYVVTYNPSKGKVYAL 423
P L D +++G + Y+ V+ + ++Y++ + +Y
Sbjct: 292 LGPNLLDLKICGFHIIIGDSISKILNPRTNAVYINKVKDIYNDKKYIIHPDTGNRVMYVF 351
Query: 424 LKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLAL 458
K+ A DD+L A F + VLL I + LA+
Sbjct: 352 PKEDASVDDMLCAYFQS-VLLAIITCAINDHQLAI 385
>gi|42567566|ref|NP_195771.2| uncharacterized protein [Arabidopsis thaliana]
gi|209863158|gb|ACI88737.1| At5g01510 [Arabidopsis thaliana]
gi|332002971|gb|AED90354.1| uncharacterized protein [Arabidopsis thaliana]
Length = 509
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 101/332 (30%), Positives = 156/332 (46%), Gaps = 19/332 (5%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS-------RNR 183
+R +V P GFP SV+ Y+ YM W+ + G V T +LL +VGV
Sbjct: 115 VRDFVFPSGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVGVGSFSGTSAAAT 174
Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
A A AI W+ KDG G +G+++ R G FD D KQ R D + G+ +LAT
Sbjct: 175 AAASAAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLY 234
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
P FL LA N+AK VA + I FA N+G+V AK E A L+G G
Sbjct: 235 PSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGF 294
Query: 303 SIMI------SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
I+I K P ++ T+ + +++ YQ + + +T+N R + VES +
Sbjct: 295 GILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHS 354
Query: 357 HVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF---QDPDAYLAVEPFFEKERYVVTY 413
VP + N +ENI + ++ G ++ + A+ + KE+Y++T
Sbjct: 355 VVPGYVDCNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVSKVKALLKMYTKEKYILTL 414
Query: 414 NP-SKGKVYAL-LKDQAKSDDILKAAFHAYVL 443
N +K +++ K A S D+L+ + AY L
Sbjct: 415 NKLNKDTEFSVSFKVNATSRDVLRCLWQAYWL 446
>gi|403362828|gb|EJY81150.1| DUF647 family protein [Oxytricha trifallax]
Length = 507
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/228 (30%), Positives = 115/228 (50%), Gaps = 6/228 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P +V+ SY+ + + + F AMG + Q+L ++G + +A A A W+L
Sbjct: 89 LPAGYPHTVSSSYLKFTIYSNISAFMYTAMGFLSAQSLFVAIGSTMTQANLAAAAYTWVL 148
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KDG G++G ++FA R G+ FD D+K+ RF + + +E+ T + P FL LA AN
Sbjct: 149 KDGLGQLGGILFASRYGRNFDVDIKKWRFMAIFALNISMYIEILTLSFPQYFLALASLAN 208
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK-----R 309
V KN+ + ++++R I AK NIGD++ K + L+G + ISK
Sbjct: 209 VGKNICFLLASASRASINLQLAKNNNIGDISGKSVSQFTASTLIGGACGLGISKIIDITS 268
Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
L F +L+ + +SY K + LN R + F K+
Sbjct: 269 IYQLYPVFFVLTVMNIAASYYSAKVIDELYLNNQRAFLLFNEFFKSNE 316
>gi|195643336|gb|ACG41136.1| hypothetical protein [Zea mays]
Length = 442
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 88/325 (27%), Positives = 154/325 (47%), Gaps = 21/325 (6%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W L+ + +TQ LL ++GV AT W L
Sbjct: 62 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATFQWFL 121
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G +G ++F QG D + K R D + +LG ++L + P + + C +
Sbjct: 122 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 181
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGTGLSIMISKRNP 311
++++ V S +TR + + FA N D++AK E + + + L I++ +
Sbjct: 182 LSRSFTGVASGATRAALTQHFALANNATDISAKEGSQETLATMLGMGLGMLLAQITRGHA 241
Query: 312 -SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
S+ +F L+ +++++Y+ V+S+ L TLN R S+ ++ F + G V R+ + QE+I
Sbjct: 242 LSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVLVPRKVSQQEHI 301
Query: 371 FSFPW----------LRDKPVVLGSRFKDAFQDPDAYLAVEP--FFEKERYVVTYNPSKG 418
W L + V LG++ D + + +E Y + +G
Sbjct: 302 LP-SWSNWRKLNRIKLPHERVHLGAK-ASMLTHSDMLVIAKTRCHYENANYFLL--DKQG 357
Query: 419 KVYALLKDQAKSDDILKAAFHAYVL 443
VY + +A D+L++ H VL
Sbjct: 358 IVYVFIHKEATPADVLRSFVHGLVL 382
>gi|413949257|gb|AFW81906.1| hypothetical protein ZEAMMB73_483620, partial [Zea mays]
Length = 116
Score = 122 bits (307), Expect = 3e-25, Method: Composition-based stats.
Identities = 60/116 (51%), Positives = 84/116 (72%), Gaps = 10/116 (8%)
Query: 385 SRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLL 444
SRF +AFQ+P +++A++P FE ERY+VTYNP+K KVYALLKDQAK DDILKAAFHA+VLL
Sbjct: 1 SRFGEAFQEPASFIAIKPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVLL 60
Query: 445 HCINSSNGHRSLALK-----QHEYDNSNLMLTTADLEAHIGESCKVVSTALWGFQE 495
H IN+++ +R+ + H+Y+ N+ D HI ESCK+V ++ F++
Sbjct: 61 HFINATHANRNARKRANSNLSHQYNPLNM-----DFLPHIEESCKIVMSSYGVFKK 111
>gi|403412621|emb|CCL99321.1| predicted protein [Fibroporia radiculosa]
Length = 4586
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 98/344 (28%), Positives = 160/344 (46%), Gaps = 24/344 (6%)
Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
+ S++R +P +P SV PSY+ + + + FG + V Q LL SVGVS +
Sbjct: 108 KFTSWLRQMFLPTNYPQSVHPSYLRFHALQFFETTFGTVVSVLCNQALLTSVGVSAEGSI 167
Query: 186 PGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV-- 242
GAVA+ WI+KDGAG V K+ F R+ FD K G+ ++ +G+ +++AT +
Sbjct: 168 FGAVAVQWIIKDGAGEVAKLFFIRRFSPYFDSHPKSFTLFGEGIVAIGSCLQMATLLITP 227
Query: 243 -PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG-- 299
P FL A N+ K V +T + F+ N+GDV AK E +IA L G
Sbjct: 228 TPGNFLLCAAGGNIFKLVGYAVWFTTHIKWVRYFSLQGNVGDVAAKDESQTSIAQLCGYA 287
Query: 300 TGLSIMISKRNPS-LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK---- 354
G+ ++ +P+ L + F L+ ++ ++ ++ L R S+ + +
Sbjct: 288 AGIGLLTVSHSPAYLYSLFFALTPLHLTATMFMMRDAKFEALTLPRLSLLAREYAREDGE 347
Query: 355 --TGHVPSLR--EGNLQENIFS--FPWLRDKPVVLGSRFK-----DAFQDPDAYLAVEPF 403
TG V +LR E + Q F F D+ V L R D ++P +
Sbjct: 348 AATGKVNTLREIEESRQTGAFGEFFKRKNDRFVELAPRVTEVIGGDVDREPVLWEMCTSE 407
Query: 404 FEKERYVV--TYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLH 445
F+ E+Y++ + +PS+ + A SDD L++ HA H
Sbjct: 408 FQTEKYLLYPSASPSEKPITVFYHPDATSDDTLRSILHAARFRH 451
>gi|324514354|gb|ADY45840.1| Unknown [Ascaris suum]
Length = 394
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 154/320 (48%), Gaps = 22/320 (6%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGA 188
+F +P G+P SV+ Y+ Y W ++ F T+ +L GV A+ A
Sbjct: 42 AFFVDVFLPRGYPHSVSADYMSYQIWDTVQAFASSMTSALATEAVLRGAGVGDENASALA 101
Query: 189 VAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
+ W+++DG G + ++ FA + + D D K+ R D L +L +EL AA+P LF
Sbjct: 102 ATLTWLIRDGLGMLSRIGFAWLKSAQLDADCKRWRLVADFLNDLAFSLELLAAALPSLFT 161
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNI-ADLLGTGLS 303
L C +++A+++ AV +TRT + + A+ N+ DV+AK E + N+ A L L
Sbjct: 162 LLVCLSSLARSIVAVAGGATRTTVVQHQARCNNVADVSAKDGSQETLVNVTALLCSLILL 221
Query: 304 IMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
+IS R + + L + ++F++Y V ++ TLN+ + F++TG V S++
Sbjct: 222 PLISGRTVLIWMFYILFTGVHLFANYAAVTALHFETLNQKLLRIITNHFVETGDVFSVKV 281
Query: 364 GNLQENIFSFPWLRDKPVVLG--SRFKDA--FQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
N E P++ G SR+ + A+ A+ +++ ++V P
Sbjct: 282 ANELE-----------PLLFGCASRYYGCRLYDVLSAHCALT--LQRDSFIVGCEPKLEY 328
Query: 420 VYALLKDQAKSDDILKAAFH 439
Y + + + +KAAF+
Sbjct: 329 AYIAMASGTQQREHIKAAFY 348
>gi|300176775|emb|CBK25344.2| unnamed protein product [Blastocystis hominis]
Length = 409
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 137/281 (48%), Gaps = 20/281 (7%)
Query: 74 EDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRS 133
ED +++ L SE G+ + +++ S + K+ S ++ + P FI++
Sbjct: 43 EDDKDNAATSLGISEYYKGKLYTLTFDKE-SNRIKKESTTSLRNHTLTLPV-----FIKT 96
Query: 134 -------YVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
+P +P SV Y+ Y +W A A GV +TQ++L ++G+ A P
Sbjct: 97 TKLKLTDIFLPRNYPSSVKGKYLNYCSWAAFGMGLSAAGGVLSTQSMLFAIGMGAG-AVP 155
Query: 187 GAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
A A+NW+LKDG G++G M F A + D D + R L+E+ +E+ T PH
Sbjct: 156 MAAALNWVLKDGLGQLGGMAFTALVNSRLDADSRGWRILSAWLLEISTWLEVMTPLFPHS 215
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
FL LA ANV KN++ + ++TR I F N+GD+TAK + GT L I+
Sbjct: 216 FLLLATLANVGKNISWLAGSATRAGIRYGFVNAHNMGDITAKEGSQTVAITVFGTFLGIV 275
Query: 306 ISK-----RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLN 341
IS ++ + +S +FS YQ ++ V L TLN
Sbjct: 276 ISNLIGHGHMEYVLMSSMCISAVSLFSIYQSLRCVSLPTLN 316
>gi|297810247|ref|XP_002873007.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
lyrata]
gi|297318844|gb|EFH49266.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
lyrata]
Length = 510
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 102/340 (30%), Positives = 154/340 (45%), Gaps = 21/340 (6%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS-------RNR 183
++ +V P GFP SV+ Y+ YM W+ + G V T +LL +VGV
Sbjct: 116 VKDFVFPTGFPASVSDDYLDYMLWQFPTNVTGWICNVLVTSSLLKAVGVGSFSGTSAAAT 175
Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
A A AI W+ KDG G +G+++ R G FD D KQ R D + G+ +LAT
Sbjct: 176 AAASAAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLY 235
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
P FL LA N+AK VA + I FA N+G+V AK E A L+G G
Sbjct: 236 PSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGF 295
Query: 303 SIMISKRNPSLITTFALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
I+I P L+ +F +S + ++ YQ + + +T+N R + VES +
Sbjct: 296 GILIID-TPGLVKSFPFVSLTWTSVRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVH 354
Query: 356 GHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF---QDPDAYLAVEPFFEKERYVVT 412
VP + N +ENI ++ G ++ + A+ + KE+Y++T
Sbjct: 355 SVVPGYVDCNKRENILLGQRFMKPRIIFGVSLEEVSGLEKSVSKVKALLKMYTKEKYILT 414
Query: 413 YNP--SKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSS 450
N S + K A S D+L+ + AY L + S
Sbjct: 415 LNKLNSDTEFSVSFKVNATSRDVLRCLWQAYWLEENMEES 454
>gi|241595406|ref|XP_002404468.1| conserved hypothetical protein [Ixodes scapularis]
gi|215502351|gb|EEC11845.1| conserved hypothetical protein [Ixodes scapularis]
Length = 472
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 160/340 (47%), Gaps = 36/340 (10%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFPD+V+ Y Y W +++ F G TQ +L VGV A+ A WIL
Sbjct: 50 LPSGFPDTVSRDYKEYQIWDSIQAFASSITGTLATQAVLTGVGVGDQTASVLAATTTWIL 109
Query: 196 KDGAGRVGKMIFARQGKKFDY-------------DLKQLRFA-------GDLLMELGAGV 235
KDG G +G+++FA + D L A D+L + +
Sbjct: 110 KDGTGMIGRILFALVSRYLGMCCATHNKYCAHSNDSTTLNHADNYCSLFADILNDAAILL 169
Query: 236 ELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVG 292
EL + + + C ++VAK+V V +TR + + A+ N+ DV+AK E +
Sbjct: 170 ELMSQHLKGYITLVLCISSVAKSVVGVAGGATRAALTQHQARSNNMADVSAKDGSQETLV 229
Query: 293 NIADLLGTGLSIMISKRNPSLIT-TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
N+A L + L + + + L+ F + + ++F++++ V VV+ T NR+R+S+ V+
Sbjct: 230 NLAAFLCSLLLLRLLAGSQILVYGIFTIFTLLHIFANFRAVSCVVMETFNRSRYSLLVKH 289
Query: 352 FL-KTGHVPSLREGNLQENIFSFPWLRDK----PVVLGSRFKDAFQDPDAYLAVEPFFEK 406
FL + G + S+ + N QE++ WLR V LGS D + P ++
Sbjct: 290 FLEEAGDIASVEKVNKQESV----WLRCGRHFVNVNLGSPLSSVAATHDDLCSRAPKEKQ 345
Query: 407 ERYVVTYNPSKGKVY---ALLKDQAKSDDILKAAFHAYVL 443
RY++ + Y +L +Q D ++A FH+++L
Sbjct: 346 ARYLLKVVRDSSRHYDINVVLHEQCTPRDQMEAMFHSFLL 385
>gi|395332415|gb|EJF64794.1| hypothetical protein DICSQDRAFT_80493 [Dichomitus squalens LYAD-421
SS1]
Length = 506
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 151/334 (45%), Gaps = 20/334 (5%)
Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
+ S++R +P +P SV SY + T +AL+ V Q LL SVG+S +
Sbjct: 111 RISSWLRQMFLPTNYPQSVHSSYASFHTLQALETTMATITSVLCNQALLTSVGMSAEGSV 170
Query: 186 PGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV-- 242
GAVA+ WI+KDGAG V K+ F R FD K G++L LG+G+++AT +
Sbjct: 171 FGAVAVQWIIKDGAGEVAKLFFIRHFSTYFDSHPKTFTLFGEVLGCLGSGLQIATVLIAP 230
Query: 243 -PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG-- 299
P FL A N+ K V ++T + F+ N GDV AK E ++A L G
Sbjct: 231 SPLNFLLCAAGGNIFKLVGNAIWSTTHIKFIRYFSMQGNDGDVAAKDESQASVAQLAGYA 290
Query: 300 TGLSIMISKRNPSLITT--FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
G+S++ P+ + FA + ++Y ++ L R S ++++ G
Sbjct: 291 AGISLLTFSHAPAYLYAIFFAAVPLHLTMTAYM-MRVATFELLTLPRISHLAQTYVNDGV 349
Query: 358 VPSLREGNLQENIFSFPWLR----DKPVVLGSRFKDAFQ-----DPDAYLAVEPFFEKER 408
V S ++ + + F D+ + L R DA D + F+ +R
Sbjct: 350 VGSQKDLDREHATGLFGEFYKNNGDRWLTLAPRVGDALNTSSDVDRSTWQVCSQVFQDDR 409
Query: 409 YVV--TYNPSKGKVYALLKDQAKSDDILKAAFHA 440
Y++ +P + L A +DD+L++ HA
Sbjct: 410 YLLLPCDSPRGPLISVLFHPDATTDDMLQSILHA 443
>gi|397563898|gb|EJK43986.1| hypothetical protein THAOC_37516 [Thalassiosira oceanica]
Length = 542
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 112/412 (27%), Positives = 185/412 (44%), Gaps = 70/412 (16%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEE--LMSFIRSYVVPDGFPDSVTP 146
D D RR Y +E GK+ + ++SP A + L SF+ ++ P +P+SV
Sbjct: 77 DDDSRRTLYRSEG--PGKW-------LKVESPPASRQSMSLASFVHHFL-PAKYPESVCS 126
Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMI 206
SY + ++ + G A V +TQ LL +VGV A P A A+NW+LKD AG++G ++
Sbjct: 127 SYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAAALNWLLKDFAGQLGGVL 186
Query: 207 FARQ---------------------------GKKFDY-------DLKQLRFAGDLLMELG 232
F Q KK +Y + K+ R L ++L
Sbjct: 187 FVSQLGIDHEFWKRKINRLFRENNQGNNTGRPKKGNYQRGTADTNPKRWRMVAALALDLS 246
Query: 233 AGVELATAAV-PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG--------D 283
+E+ T + P FLP A ANV KN+ + ++++R I ++ + G + D
Sbjct: 247 TLLEICTPLLGPQYFLPCASVANVGKNIGWLAASASRAAINQSLSAGGSYSSSSGSNLGD 306
Query: 284 VTAKGECVGNIADLLGTGLSIMISKR------NPSLITTFALLSCGYVFSSYQEVKSVVL 337
VTAK +A L+GT L I SK +++ F +LS + ++Y+ + +V L
Sbjct: 307 VTAKSGSQAIVASLVGTALGIFFSKTLYADHGTVGVMSGFIVLSAVHQVATYKALMAVPL 366
Query: 338 HTLNRARFSVAVESFLKTGHVPSLREGNLQENIF-------SFPWLRDKPVVLGSRFKDA 390
+L+R R + + F +T VPS + +E F S WL +L K
Sbjct: 367 RSLDRHRLHIVLSEFSRTSSVPSPSQVATKEKFFPLITPDHSCNWLFVGSPLLEICPKGI 426
Query: 391 FQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYV 442
+ L + + E+Y++ P+ ++ L D A D+L+ HAY+
Sbjct: 427 SELQGLILWRDDAMQHEQYILKVGPASIQLTFL--DDANEVDLLRGQLHAYL 476
>gi|224013726|ref|XP_002296527.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968879|gb|EED87223.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 719
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 110/459 (23%), Positives = 182/459 (39%), Gaps = 122/459 (26%)
Query: 121 QAPAEELMSFIRS----------------YVVPDGFPDSVTPSYVPYMTWRALKHFFGGA 164
Q+P + SFI S + +P +P SV PSY Y ++ L G +
Sbjct: 213 QSPLSTIRSFISSCRTKDMINIQNLPLIHHFLPANYPQSVCPSYATYASYCFLGSIAGSS 272
Query: 165 MGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ----GKKFDY---- 216
V +TQ LL +VGV A P A A+NW++KDG G++G ++FA Q G DY
Sbjct: 273 AMVLSTQALLVAVGVGTQSAAPMAAALNWVMKDGVGQLGGVVFASQLGKGGVDVDYWKVK 332
Query: 217 ----------------------------DLKQLRFAGDLLMELGAGVELATAAV-PHLFL 247
+ K+ R L ++L +E+ T + P FL
Sbjct: 333 LNKLMGSSSSKSTETSARGNFQRGTADSNPKRWRMVAALALDLSTLLEICTPWMGPQWFL 392
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKG----------------------------- 278
P A AN+ KNV + ++++R ++++ G
Sbjct: 393 PCASIANIGKNVGFLAASASRAAVHQSLCTGGSFLPPVPDNTTEKDESYDNPKKGKNVTS 452
Query: 279 --ENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS------LITTFALLSCGYVFSSYQ 330
N+GDVTAK +A LLGT L I +S+ S ++ F +LS + +Y+
Sbjct: 453 SSNNLGDVTAKSGSQAIVASLLGTALGIWLSRTFCSDYGTAGILAGFVILSAVHQVCTYR 512
Query: 331 EVKSVVLHTLNRARFSVAVESFL----KTGHVPSLREGNLQENIFSFPWLRDKPVVLGSR 386
+K+V L +++R R + + +++ ++ L + E P + V
Sbjct: 513 ALKAVPLRSVDRHRLHIILNAYITADGRSVQSMVLTPSQVAEEESFLPMMAPDNSVHWLS 572
Query: 387 FKDAFQD--PDAYLAVEPFF-------EKERYVV---------TYNPSKGKVYALLKDQA 428
D+ D P +E + E+Y++ T G V D A
Sbjct: 573 VGDSLIDICPSGLDELEALLIRRDDISQYEKYIMKVHLADTNATTAAVDGIVQLTFLDGA 632
Query: 429 KSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
+D+L+ FHAY+ R+L + +D+SN
Sbjct: 633 TDNDLLQGMFHAYIA----------RALMISHQCHDDSN 661
>gi|293336742|ref|NP_001169416.1| hypothetical protein [Zea mays]
gi|224029221|gb|ACN33686.1| unknown [Zea mays]
gi|414876408|tpg|DAA53539.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
Length = 481
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 170/376 (45%), Gaps = 30/376 (7%)
Query: 98 VAERDRSGKFKR-----NSIRAISLQSPQAP--------AEELM-SFIRSYVVPDGFPDS 143
+AER R G KR +S + L+ ++P A+ L+ I +V+P GFP S
Sbjct: 44 LAERYRDGVVKRYISDGSSKLQLRLEKHESPVNGVEDENADSLIPQAILDFVLPAGFPGS 103
Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA---INWILKDGAG 200
V+ Y+ YM + + G V T +LL +VGV T A + I W+ KDG G
Sbjct: 104 VSADYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAASAAAIRWVSKDGIG 163
Query: 201 RVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNV 259
G+++ R G FD D K+ R D + G+ EL+T P FLPLA N+AK +
Sbjct: 164 AFGRLLIGGRFGALFDDDPKKWRMYADFIGSAGSIFELSTTLYPGYFLPLASLGNLAKAI 223
Query: 260 AAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNP------SL 313
A + I FA+ N+G+V AK E A LLG + ++I + +L
Sbjct: 224 ARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVLIMDKTGVKSSYLTL 283
Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF 373
+ T+ + +++ YQ + + T+N R + V+S + VP N +ENI ++
Sbjct: 284 VLTWLSVRLLHLWLRYQSISVLKFRTINLKRGRILVKSHVAQHTVPGYVACNEEENILTW 343
Query: 374 PWLRDKPVVLG---SRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL---KDQ 427
V G R + D + ++ E+Y++ + + L K+
Sbjct: 344 ERYLRPQVSFGVPMERMLGGDKSSDMVNRLLKLYKNEKYILFFEQFGSREPTFLVTFKES 403
Query: 428 AKSDDILKAAFHAYVL 443
A S +L++ + A+ L
Sbjct: 404 ATSMSVLRSLWQAHWL 419
>gi|299115992|emb|CBN75993.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 479
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/329 (27%), Positives = 163/329 (49%), Gaps = 25/329 (7%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
V+P GFP SV P Y+ Y W+ A GV +TQ LL ++G+ A P A +NW+
Sbjct: 5 VLPSGFPASVHPPYMAYAGWQFTGMAASAAAGVMSTQALLYAMGLGAG-ALPLAATLNWV 63
Query: 195 LKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAA 253
+KDG G++G + F+ +FD + K R + ++ +EL + P FLP+A A
Sbjct: 64 IKDGLGQLGGVAFSSLVSTRFDANPKLWRVVAAVSLDASMVLELLSPLAPAYFLPIASVA 123
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS- 312
N+ KNV+ + ++++R I+ + N+ DVTAK +A + GTG + +S S
Sbjct: 124 NIGKNVSFLAASASRAAIHNVLSLKGNLADVTAKSGAQTILACMAGTGAGVTLSASLGSE 183
Query: 313 ---LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL---KTGHVPSLREGNL 366
++ LS ++ +++ ++ + + T + R + V ++ +PS ++
Sbjct: 184 FHAIVPACLCLSVVHLGANHMSMRRLGIPTFDHQRLELVVNGMADGDRSRGMPSPQDIAA 243
Query: 367 QENIFSFPWLRDK-------PVVLGSRFKDAFQD----PDAYLAVE--PFFEKERYVVTY 413
QE++ ++ +R+K VV+GSR K+A + + E PF R + +
Sbjct: 244 QEDVVAYA-VREKIGLDLSPTVVVGSRLKEAVKGWGELEELCRVCEGLPFLLSVRGDLAH 302
Query: 414 NPSKGKVYALLKDQAKSDDILKAAFHAYV 442
G+V+ LL + + ++L ++ A V
Sbjct: 303 R--TGEVHVLLLEGCGAREVLAGSYVALV 329
>gi|356566668|ref|XP_003551552.1| PREDICTED: uncharacterized protein LOC100787355 [Glycine max]
Length = 1041
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/338 (29%), Positives = 151/338 (44%), Gaps = 30/338 (8%)
Query: 51 GGGGAAATLLQSES-DETTAFRLVED--QFVDSSLRLI--CSEDIDGRRWNYVAERDRSG 105
GGG ++ ++Q E TA R + D + S++ L+ C D + ++ E D S
Sbjct: 571 GGGQVSSRVIQVERYSNGTAKRFLLDYTKIWASTIPLVSRCVLGDDLQLQTFLVEEDTS- 629
Query: 106 KFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAM 165
R SP L I+ +++P GFP SV+ Y+ YM + + G
Sbjct: 630 ----TPKRFQDSYSPDESLSWLPDTIKDFILPAGFPGSVSDDYLDYMLLQFPTNVTGWIC 685
Query: 166 GVFTTQTLLNSVGV---SRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQL 221
T +LL +VG+ S AT A AI W+ KDG G VG++ R G FD D KQ
Sbjct: 686 HTLVTSSLLKAVGIGSFSGTSATASASAIRWVSKDGIGAVGRLCLGGRFGSLFDDDPKQW 745
Query: 222 RFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENI 281
R D + G+ L T P FLPLA N+ K VA + I FA N+
Sbjct: 746 RMYADFIGSAGSIFYLTTQVYPDYFLPLASLGNLTKAVARGLKDPSFCVIQNHFAISGNL 805
Query: 282 GDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGY-------VFSSYQEVKS 334
G+V AK E +A L+G L I+I PSL+ ++ +LS + ++ Y+ +
Sbjct: 806 GEVAAKEEIWEVVAQLIGLALGILILD-TPSLVKSYGVLSLPWLGMQFLHLWLRYKSLSV 864
Query: 335 VVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS 372
+ +T+N R + VP N +ENI +
Sbjct: 865 LQFNTINLKR--------VLHSTVPGCTYCNREENILT 894
>gi|339250912|ref|XP_003374441.1| conserved hypothetical protein [Trichinella spiralis]
gi|316969248|gb|EFV53376.1| conserved hypothetical protein [Trichinella spiralis]
Length = 340
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 114/224 (50%), Gaps = 5/224 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+PDSVT Y Y W +L+ F F + +VGV A+ A + W+L
Sbjct: 56 LPQGYPDSVTEDYFDYQCWDSLQAFCSSVTNAFALHAVFKAVGVGNESASLLAATVVWLL 115
Query: 196 KDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
++G G + +++FA D D K RF D+L + ++L F+ +AC AN
Sbjct: 116 RNGVGMMSQIMFAWFCAGHLDSDCKSWRFVADVLNDCSIFLDLIAPLSESTFVYVACLAN 175
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI----MISKRN 310
V +++ + +TR I + ++ N+ DV AK + +L+ G ++ +++
Sbjct: 176 VCRSIVGIAGGATRAAITQHQSRRHNMADVAAKDSSQETMVNLIALGFNLALLPLVAHHV 235
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
+ T F + + +++++Y+ ++++ TLN RF + V+SF+K
Sbjct: 236 LLVWTLFLMFTFMHIYANYRAIRALKFETLNEKRFRMLVQSFIK 279
>gi|326428268|gb|EGD73838.1| hypothetical protein PTSG_05532 [Salpingoeca sp. ATCC 50818]
Length = 376
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/313 (26%), Positives = 145/313 (46%), Gaps = 7/313 (2%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
+ + V+P+G+P SV+ Y Y W + F MG +T +L VGV +AT A
Sbjct: 46 LSTLVLPEGYPASVSSDYASYQIWDTAQAFSSSVMGTISTYAVLKGVGVGDAQATVAAAT 105
Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
+L+D G + + FA D + KQ R D++ ++ VEL + +P FLPL
Sbjct: 106 YTNLLRDATGMLAGIWFAWYSSTDMDRNAKQWRLVADVVNDMAMLVELVSPLIPAAFLPL 165
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK- 308
C A+ + + V +TR + + A+ N+ DV+AK + +L +S+ I+
Sbjct: 166 MCMASFLRAIVGVAGGATRAALVRHQARLNNMADVSAKDGSQETLVNLAALVVSLAITPL 225
Query: 309 RNPSLITTFAL---LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
S + +A+ L +++ +Y+ V+++ + N R A F++T VPS+ N
Sbjct: 226 AAESRVMAWAMFLPLVFMHLYCNYRAVRALEMPAFNAHRLHHACAVFMRTNRVPSVCAAN 285
Query: 366 LQENIF-SFPWLRDKP-VVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYAL 423
E +F P + V LGS + A + + E + V + G V+ L
Sbjct: 286 QAEPLFPRAPAIEGHAKVCLGSSWSHIKDGVGAGVQADVVRALETWRVAAVFTDGTVHVL 345
Query: 424 LKDQAKSDDILKA 436
L + +DI++A
Sbjct: 346 LSKHCRHEDIVRA 358
>gi|414879010|tpg|DAA56141.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
Length = 368
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/158 (37%), Positives = 93/158 (58%), Gaps = 1/158 (0%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++PDG+P SV+ Y+ Y WR ++ GV +TQ LL +VG+ + A P A A+NW+
Sbjct: 197 LLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGLGKG-AIPTAAAVNWV 255
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG G + K++ ++ G+ FD + K R D L + G+E+ T PHLF+P+ AA
Sbjct: 256 LKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTPVFPHLFVPIGAAAG 315
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVG 292
++ A++ ++TR+ Y FA N +V AKGE G
Sbjct: 316 AGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQG 353
>gi|186478458|ref|NP_001117280.1| uncharacterized protein [Arabidopsis thaliana]
gi|332190946|gb|AEE29067.1| uncharacterized protein [Arabidopsis thaliana]
Length = 353
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 153/309 (49%), Gaps = 25/309 (8%)
Query: 167 VFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAG 225
+ +TQ LL+++GV AT W L+D G +G ++F QG D + K R
Sbjct: 2 MLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVA 61
Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
DL+ ++G ++L + P F+ + C +++++ V S +TR + + FA +N D++
Sbjct: 62 DLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADIS 121
Query: 286 AKGECVGNIADLLGTGLSIMISK---RNPSLI-TTFALLSCGYVFSSYQEVKSVVLHTLN 341
AK +A ++G L +++++ NP I +F L+ +++++Y+ V+ +VL++LN
Sbjct: 122 AKEGSQETMATMMGMSLGMLLARFTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLN 181
Query: 342 RARFSVAVESFLKTGHVPSLREGNLQENIFSFPW---LRD-------KPVVLGSRFK--- 388
R S+ + F++TG V S + + E + W LR K V LG R
Sbjct: 182 FERSSILLTHFIQTGQVLSPEQVSSMEGVLPL-WATSLRSTNSKPLHKRVQLGVRVSSLP 240
Query: 389 --DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHC 446
D Q + A ++ +Y++ + KG V +L +K D+LK+ HA VL +
Sbjct: 241 RLDMLQLLNGVGASS--YKNAKYLLAH--IKGNVSVILHKDSKPADVLKSYIHAIVLANL 296
Query: 447 INSSNGHRS 455
+ S S
Sbjct: 297 MEKSTSFYS 305
>gi|242056489|ref|XP_002457390.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
gi|241929365|gb|EES02510.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
Length = 624
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 109/376 (28%), Positives = 169/376 (44%), Gaps = 30/376 (7%)
Query: 98 VAERDRSGKFKR-----NSIRAISLQSPQAP--------AEELM-SFIRSYVVPDGFPDS 143
+AER R G KR NS + L+ +P A+ L+ I +V+P GFP S
Sbjct: 188 LAERYRDGVVKRYISDGNSKLQVHLEKHVSPVNAVEDENADSLIPQAIWDFVLPAGFPGS 247
Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA---INWILKDGAG 200
V+ Y+ YM + + G V T +LL +VGV T A + I W+ KDG G
Sbjct: 248 VSVDYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAASAAAIRWVSKDGIG 307
Query: 201 RVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNV 259
G+++ R G FD D K+ R D++ G+ EL T P FLPLA N+AK V
Sbjct: 308 AFGRLLIGGRFGALFDDDPKKWRMYADIIGSAGSIFELMTPLYPGYFLPLASLGNLAKAV 367
Query: 260 AAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPSLITTF 317
A + I FA+ N+G+V AK E A LLG G+ IM + S T
Sbjct: 368 ARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSAGVLIMDTAGVKSSYLTL 427
Query: 318 AL----LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF 373
AL + +++ YQ + + T+N R + V S + VP N +ENI ++
Sbjct: 428 ALTWLSVRLLHLWLRYQSLSVLKFRTINLKRGRILVRSHVAQHTVPGYVACNEEENILTW 487
Query: 374 PWLRDKPVVLG---SRFKDAFQDPDAYLAVEPFFEKERYVVTYNP--SKGKVYAL-LKDQ 427
+ G R + D + ++ E+Y++ + S+ + + K+
Sbjct: 488 ERFLRPQISFGVPMERMLGGDESSDMVDRLLKLYKNEKYILFFEQFGSRDPTFLVTFKES 547
Query: 428 AKSDDILKAAFHAYVL 443
A S +L++ + A+ L
Sbjct: 548 ATSMSVLRSLWQAHWL 563
>gi|147807901|emb|CAN62157.1| hypothetical protein VITISV_025492 [Vitis vinifera]
Length = 582
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 95/336 (28%), Positives = 147/336 (43%), Gaps = 60/336 (17%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
F+R ++P+GFP SVT Y+ Y WR ++ GV TQ LL +VG+ + A P A
Sbjct: 216 FLR-LMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKG-AIPTAA 273
Query: 190 AINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
A+NW+LKDG G + K++ ++ G+ FD K R DLL G+E+ T A PH FL +
Sbjct: 274 AVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLI 333
Query: 250 ACAANVAKNVAAVTSTSTRT---------------PIYKAFAKGENIGDVTAKGECVG-- 292
A ++ AA+ T P+ A G + + GE G
Sbjct: 334 GAVAGAGRSAAALIQLLFHTHNHLSSFALLLGSYFPLNNP-AIGNSKRKMPTHGEGDGGE 392
Query: 293 NIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
N+ LG G ++ ++T ALL Q + V+ R + +
Sbjct: 393 NVTAFLG-GPRLLXGA-----VSTLALL--------LQRNFAEVIAKGEAPRNGLVFSEY 438
Query: 353 LKTGHVPSLREGNLQENIFS-FPWLRDKP-----------------------VVLGSRFK 388
L +G VPS++E N +E +F P L KP + LGS+
Sbjct: 439 LLSGQVPSIKEVNEEEPLFPVVPLLNAKPXYKAQSAVLSTEAKDAAAEIERRLQLGSKLS 498
Query: 389 DAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALL 424
+ + LA+ + E Y++T + KG+ +A L
Sbjct: 499 EVVSSKEDVLALFDLYRNEAYILTEH--KGRFFASL 532
>gi|295659580|ref|XP_002790348.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226281800|gb|EEH37366.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 485
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 155/332 (46%), Gaps = 37/332 (11%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ Y+ Y + +L+ F G+ ++ +L VGV A+P ++ +L
Sbjct: 73 LPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 132
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAA 253
++ GR+ ++FA R G + + K R A D+L + VE + A P HL + L +
Sbjct: 133 QESMGRIATILFAHRLGTSLEPECKLYRLAADVLNDSAMVVECLSPAFPKHLRVVLLTFS 192
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE-CVGNIADLLGTGLSIMISKRNPS 312
++ + + V + S++ + FAK N+G++ A G V +I+ L TGL+++
Sbjct: 193 SILRALCGVAAGSSKASLSSHFAKWGNLGELNACGSLVVSHISTPLTTGLTLI------- 245
Query: 313 LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS 372
F LL ++ ++Y V++V + TLNR R ++ + + G + ++ + E IF
Sbjct: 246 ----FLLLI--HLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTPKQTSKCERIFE 299
Query: 373 ----FPWLRDKPVVLGSRFKDAFQD-----------------PDAYLAVEPFFEKERYVV 411
W + R AFQ+ P + FEKE Y++
Sbjct: 300 RDGILRWKASSTTLGFCRIGIAFQELLRSSNVSCRANSIRDIPIDIPRLLRLFEKEEYIL 359
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+NP K +LK+ K LKA HA ++
Sbjct: 360 WFNPVSKKGTIVLKNNVKPVSQLKAWSHALLV 391
>gi|443714847|gb|ELU07084.1| hypothetical protein CAPTEDRAFT_141059, partial [Capitella teleta]
Length = 266
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 121/249 (48%), Gaps = 7/249 (2%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
++ +P GFP SV+ YV Y W ++ F TQ +L VGV +AT A
Sbjct: 2 LQDVFLPKGFPASVSDDYVQYQIWDTVQAFASSITNTLATQAVLKGVGVGDEQATVLAAT 61
Query: 191 INWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
+ W++KDG G +G+++F QG D D K+ R D+L ++ +++ F +
Sbjct: 62 LTWLMKDGTGMLGRILFTWIQGSYLDCDCKRWRLFADILNDVSILMDICAPFFRVYFTLI 121
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTGLSIM 305
C A V +++ V +TR + + A N+ DV+AK V A L L M
Sbjct: 122 VCVAGVCRSIVGVAGGATRAAVTQHQACRSNMADVSAKDGSQETLVNLAALLCNLALVPM 181
Query: 306 ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL--KTGHVPSLRE 363
+S R + F + +++++++ V+S+ + T N+ R +A + F+ TG+ P ++
Sbjct: 182 VSDRQWLVWILFTCFTTLHIYANFKAVRSLDMPTFNQVRLHLAAQEFISDNTGNFPGVKT 241
Query: 364 GNLQENIFS 372
N +E +
Sbjct: 242 VNQREPVLC 250
>gi|302753580|ref|XP_002960214.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
gi|300171153|gb|EFJ37753.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
Length = 381
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 157/341 (46%), Gaps = 32/341 (9%)
Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA-----INWILKD 197
SV+ Y+ YM W+ + G T +LL +VG+S PG+ A I W+ KD
Sbjct: 2 SVSQDYLQYMLWQLPTNITGWICRALITSSLLQAVGLSTE---PGSAAAASAAIKWVTKD 58
Query: 198 GAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA 256
G G VG++ I R G FD D KQ R +++ G ELAT P FL LA ++
Sbjct: 59 GLGAVGRLFIGGRFGGVFDEDPKQWRLYAEVIGSFGGLFELATPLAPDYFLLLASLGHLT 118
Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITT 316
K VA + I FA EN+GDV AK E L+G L +++ P + T+
Sbjct: 119 KAVARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVVLLS-TPVVATS 177
Query: 317 FALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
++ L+ + ++ YQ + ++ L T+N R +V + ++ + + E N +EN
Sbjct: 178 YSQLALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHIQGRPLSGVGECNKKEN 237
Query: 370 IFSFPWLRDKPVV-LGSRFKDAFQDPDAYLAVEPFFE---KERYVVTY------NPSKGK 419
+ P+ P + LG ++ L +E FE E+YV+ + S
Sbjct: 238 LI-LPYTMLMPQIRLGCSLREVITLGKTPLEIEEAFELYNSEKYVLIVLSQFLSSCSCNI 296
Query: 420 VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ 460
+ + +D ++ +L++ + A+VL H S + LKQ
Sbjct: 297 KFRVCQDGCENTTLLRSIYQAHVLFH----SKSFDTFTLKQ 333
>gi|384252100|gb|EIE25577.1| DUF647-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 286
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 132/271 (48%), Gaps = 11/271 (4%)
Query: 85 ICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSV 144
I E+ DG R RS + + SL + ++ +++ +PDG+PD+V
Sbjct: 5 IVGEEYDGNR------LVRSWVLDQKAGAVESLGVDASNESSVLLWLKKGFLPDGYPDAV 58
Query: 145 TPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGK 204
T YV + W +++ G+ + +L +GV + +P + ++D AG G
Sbjct: 59 TSDYVGFQIWDSIQALCSYVRGMICSNAILTGIGVGKQATSPLVAVFQFFVRDLAGMCGG 118
Query: 205 MIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVT 263
++FA QG + D KQ R D+ +G ++LA+ P FL LAC ++A+ V V
Sbjct: 119 VLFAFTQGSQLDAYAKQWRLFADIFNNVGYALDLASPLFPDQFLLLACLGSLARAVTGVA 178
Query: 264 STSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI----SKRNPSLITTFAL 319
+TR + + F++ +N GD++AK L+G L + S ++ F L
Sbjct: 179 GGATRAALTQHFSRADNAGDISAKEGSQETATTLIGMILGMAFLQAASDSPVAVWAVFLL 238
Query: 320 LSCGYVFSSYQEVKSVVLHTLNRARFSVAVE 350
L+ +++++ + V+++ L +LN +R + ++
Sbjct: 239 LTFLHIYANVRAVRALQLTSLNCSRLDILLK 269
>gi|225440682|ref|XP_002274737.1| PREDICTED: UPF0420 protein-like [Vitis vinifera]
Length = 503
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 151/332 (45%), Gaps = 19/332 (5%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS-------RNR 183
++ +++P GFP SV+ Y+ YM + + T +LL +VGV
Sbjct: 114 VKDFILPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKAVGVGSFSATTAAAS 173
Query: 184 ATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
A A AI W+ KDG G VG++ Q G FD D KQ R DL+ G+ +L+T
Sbjct: 174 AAASAAAIRWVSKDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLY 233
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
P FL LA N+AK VA + I FA N+G+V AK E A LLG L
Sbjct: 234 PAYFLQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLAL 293
Query: 303 SIM------ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
IM ++ P L T+ + +++ YQ + + ++N R + V+S +
Sbjct: 294 GIMVLTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHS 353
Query: 357 HVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF--QDPDAYL-AVEPFFEKERY--VV 411
VP + N +E I S+ ++ G ++ + P + + + + +E+Y VV
Sbjct: 354 IVPGYVDCNRKEKILSWQRFLKPRIIFGVSLEELIGGERPVSKVRTLLKLYSQEKYILVV 413
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ +V+ K A S +L++ + +Y L
Sbjct: 414 KQQGTDFEVFVSFKVGATSLSVLRSIWQSYWL 445
>gi|226292974|gb|EEH48394.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
Pb18]
Length = 495
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/346 (25%), Positives = 166/346 (47%), Gaps = 40/346 (11%)
Query: 125 EELMSFIRSYVV---PDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSR 181
+E S++ S ++ P G+P SV+ Y+ Y + +L+ F G+ ++ +L VGV
Sbjct: 59 KESPSYLTSLLIVFLPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGD 118
Query: 182 NRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATA 240
A+P ++ +L++ GR+ ++FA R G + + K RFA D+L + V+ +
Sbjct: 119 ASASPTVALLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLSP 178
Query: 241 AVP-HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE-CVGNIADLL 298
A P HL + L +++ + + V + S++ + FAK N+G++ A G V +I+ L
Sbjct: 179 AFPKHLRVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGSLVVSHISTPL 238
Query: 299 GTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
TGL+++ F LL ++ ++Y V++V + TLNR R ++ + + G
Sbjct: 239 TTGLALI-----------FLLLI--HLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRA 285
Query: 359 PSLREGNLQENIFS----FPWLRDKPVVLGSRFKDAFQD--------------PDAYLAV 400
+ ++ + E IF W + R ++FQ+ DA + +
Sbjct: 286 LTPKQTSKCERIFERDGILRWKASSTTLGFCRIGNSFQELLCGSNGSYRANSIRDAPIDI 345
Query: 401 E---PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
FEKE Y++ +NP + +LK+ + LKA HA ++
Sbjct: 346 PRLLRLFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAWSHALLV 391
>gi|346467075|gb|AEO33382.1| hypothetical protein [Amblyomma maculatum]
Length = 388
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 69/196 (35%), Positives = 107/196 (54%), Gaps = 5/196 (2%)
Query: 166 GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFA 224
G Q++L VGV A+ A WIL+DGAG G+++FA QG DYD K+ R
Sbjct: 2 GTLAIQSVLTGVGVGDQGASVLAATTTWILRDGAGMTGRILFAWFQGSNLDYDSKKWRLF 61
Query: 225 GDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV 284
D+L + +EL P+ C ++VAK+V V +TR + + A+ N+ DV
Sbjct: 62 ADILNDAAIFLELMCQYFKGYVTPILCISSVAKSVVGVAGGATRAALTQHQARSNNMADV 121
Query: 285 TAKG---ECVGNIADLLGTGLSIMISKRNPSLITT-FALLSCGYVFSSYQEVKSVVLHTL 340
+AK E + N+A L + L + + N L++T F L + ++F++Y+ V VV+ T
Sbjct: 122 SAKDGSQETLVNLAAFLFSLLLLQLVAGNTWLVSTIFILFTLLHIFANYRAVSCVVMETF 181
Query: 341 NRARFSVAVESFLKTG 356
NR+R+SV V FL+ G
Sbjct: 182 NRSRYSVVVRHFLQKG 197
>gi|414586420|tpg|DAA36991.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 306
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 133/278 (47%), Gaps = 13/278 (4%)
Query: 203 GKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAV 262
GK+I + G + D + K R D+L +LG +E+ + P LFL +A N AK +A V
Sbjct: 5 GKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVV 64
Query: 263 TSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI-----MISKRNPSLITTF 317
+ +TR PIY +FAK N+ D+ AKGE + + +++G G I + S LI
Sbjct: 65 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG- 123
Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP--- 374
LLS +++ QE+++ ++TLN R ++ V F+K+G V S E +E++ FP
Sbjct: 124 PLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLL-FPNRV 182
Query: 375 WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDIL 434
V +G + F P + F E+++++ + + Y +L+ A +D L
Sbjct: 183 IEEAGSVKIGQPLRKVF-SPRLVEQLRATFPGEKFLLSQKSN--RTYMVLERSASGEDAL 239
Query: 435 KAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTT 472
+ A S G R A+ Y+ + T
Sbjct: 240 RGWLVAAFASEMERSGAGSRDDAVLSEAYEKMERVFPT 277
>gi|159467675|ref|XP_001692017.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278744|gb|EDP04507.1| predicted protein [Chlamydomonas reinhardtii]
Length = 216
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 93/181 (51%), Gaps = 3/181 (1%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
+R +P G+P +VTP Y+ Y W H G T +LL++VGVS A A++
Sbjct: 1 LRELFLPPGYPQTVTPDYLSYQLWSVPTHITGHLSHALVTSSLLSAVGVSAGPAATVALS 60
Query: 191 --INWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
I WI+KDG G +G++I +R +FD D ++ R +LL G G+E+AT PH FL
Sbjct: 61 ASIKWIVKDGVGALGRLIVGSRFSAEFDEDPRRWRMVAELLSTCGMGLEVATVLYPHYFL 120
Query: 248 PLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
LAC+ A+ + I FA+ +N+G V AK E A + G S+ +
Sbjct: 121 LLACSGKFAQALGKGMGKPVFRVIQTHFARAQNVGAVAAKEEVWEVTAQMAGLAASVAVL 180
Query: 308 K 308
+
Sbjct: 181 R 181
>gi|355736030|gb|AES11869.1| hypothetical protein [Mustela putorius furo]
Length = 218
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 88/165 (53%), Gaps = 1/165 (0%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFPDSV+P Y+PY W +++ F G T +L +GV A+ A W++
Sbjct: 53 LPQGFPDSVSPDYLPYQLWDSVQAFASSLAGSLATHAVLLGIGVGNAEASVSAATATWLV 112
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KD AG +G++IFA +G K D + KQ R D+L ++ +E+ P F + C +N
Sbjct: 113 KDSAGMLGRIIFAWWKGTKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMFFTMIVCTSN 172
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG 299
+AK + +V +TR + A+ N+ DV+AK + +L G
Sbjct: 173 LAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAG 217
>gi|302768148|ref|XP_002967494.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
gi|300165485|gb|EFJ32093.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
Length = 412
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 157/354 (44%), Gaps = 46/354 (12%)
Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA-----INWILKD 197
SV+ Y+ YM W+ + G T +LL +VG+S PG+ A I W+ KD
Sbjct: 21 SVSQDYLQYMVWQLPTNITGWICRALITSSLLQAVGLSTE---PGSAAAASAAIKWVTKD 77
Query: 198 GAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA 256
G G VG++ I R G FD D KQ R +++ G ELAT P FL LA ++
Sbjct: 78 GLGAVGRLFIGGRFGGVFDQDPKQWRMYAEVIGSFGGLFELATPLAPDYFLLLASLGHLT 137
Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITT 316
K VA + I FA EN+GDV AK E L+G L + + P + T+
Sbjct: 138 KAVARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVALLS-TPVVATS 196
Query: 317 FALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
++ L+ + ++ YQ + ++ L T+N R +V + ++ + + E N +EN
Sbjct: 197 YSQLALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHIQGRPLSGVSECNKKEN 256
Query: 370 IFSFPWLRDKPVV-LGSRFKDAFQDPDAYLAVEPFFE---KERYVVTYNPSKGKVYALLK 425
+ P+ P + LG ++ L +E FE E+YV+ SK + LLK
Sbjct: 257 LI-LPYTMLMPQIRLGCSLREVITLGKTPLEIEEAFELYNSEKYVLIVF-SKSNMGILLK 314
Query: 426 -------------------DQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ 460
D ++ +L++ + A+VL H S + LKQ
Sbjct: 315 VLSQFLSSCSCNIKCRVCQDGCENTTLLRSIYQAHVLFH----SKSFDTFTLKQ 364
>gi|302849057|ref|XP_002956059.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
nagariensis]
gi|300258564|gb|EFJ42799.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
nagariensis]
Length = 208
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 95/188 (50%), Gaps = 3/188 (1%)
Query: 122 APAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSR 181
+P + IR +P GFPDSVTP Y+PY W H G T +LL +VGVS
Sbjct: 12 SPLSSIPDSIRDLFLPPGFPDSVTPDYLPYQLWSLPTHVTGHLSHSLATSSLLTAVGVST 71
Query: 182 NRATPGAV--AINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
+ A A+ +I WI+KDG G +G+ I +R +FD D ++ R +LL G G+E+A
Sbjct: 72 SPAATVALSASIKWIIKDGVGALGRFIVGSRFSAEFDEDPRRWRLVAELLSTAGLGLEVA 131
Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
T+ P F+ LA A + + I FA+ +N+G V AK E A ++
Sbjct: 132 TSLYPQSFMLLASAGKFGQALGKGMGKPVFRVIQTHFARSQNVGAVAAKEEVWEVAAQMV 191
Query: 299 GTGLSIMI 306
G S+ +
Sbjct: 192 GLMASVAV 199
>gi|307106356|gb|EFN54602.1| hypothetical protein CHLNCDRAFT_135084 [Chlorella variabilis]
Length = 486
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/227 (28%), Positives = 114/227 (50%), Gaps = 19/227 (8%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P+GFP SV+ Y+PY W +++ G+ ++Q +L VG L
Sbjct: 70 LPEGFPHSVSSDYLPYQLWDSIQGLSSYLRGMLSSQAMLAGVGF--------------FL 115
Query: 196 KDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KD G +G ++FA QG D KQ R D++ ++G +ELA+ +P FL LAC +
Sbjct: 116 KDFTGMLGGILFASTQGSGLDCYAKQWRLLADVMNDIGLALELASPLLPGAFLLLACLGS 175
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR----N 310
+A+ V V ++TR + + FA N D+ AK L+G L + +
Sbjct: 176 LARAVTGVAGSATRMALTQHFALQRNAADIAAKEGSQETAVTLVGMVLGMAFIRAAHGLE 235
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
P + + F L+ +V+++ + ++ + + +LN+AR + + +L+TG
Sbjct: 236 PLIWSAFWALTALHVWANVRAMRCLRIASLNQARLGLLLRHYLRTGE 282
>gi|432112420|gb|ELK35212.1| hypothetical protein MDA_GLEAN10003470, partial [Myotis davidii]
Length = 638
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 141/283 (49%), Gaps = 7/283 (2%)
Query: 166 GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFA 224
G T +L +GV A+ A W+LKD G +G++IFA +G K D + KQ R
Sbjct: 278 GSLATHAVLLGIGVGNASASVSAATATWLLKDSTGMLGRIIFAWWKGSKLDCNAKQWRLF 337
Query: 225 GDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV 284
D+L + +E+ +P F C +N+AK + +V +TR + A+ N+ DV
Sbjct: 338 ADILNDTAMFLEIMAPLLPICFTMTICISNLAKCIVSVAGGATRAALTMHQARRNNMADV 397
Query: 285 TAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTL 340
+AK E + N+A LL + L + + PS + F LL+ +V+++Y+ V+++V+ TL
Sbjct: 398 SAKDGSQETLVNLAGLLVSLLMLPLVSDCPSFSLGCFFLLTALHVYANYRAVRALVIETL 457
Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAV 400
N R + ++ FL+ G V N E +++ W + LG A L
Sbjct: 458 NEGRLWLVLKHFLQRGEVLDPTSANQMEPLWTGFW-PSLSLSLGVPLHHLVSS-VAELQQ 515
Query: 401 EPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+E Y++ ++ S+ +V +L +A + +L+AA H VL
Sbjct: 516 LAEGHQEPYLLRWDQSQNQVQVVLSQEAGPETVLRAATHGLVL 558
>gi|302755442|ref|XP_002961145.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
gi|300172084|gb|EFJ38684.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
Length = 343
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/263 (25%), Positives = 121/263 (46%), Gaps = 33/263 (12%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P+G+P SV+ Y+ + TW L+ + +TQ LL+ +GV AT W L
Sbjct: 55 LPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWFL 114
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G VG ++F QG D KQ R A D +
Sbjct: 115 RDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFM-------------------------- 148
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLI 314
++A V +TR + + FA+ +N DV+AK A L+G + + +++ + I
Sbjct: 149 --NDIAGVAGGATRAALTQHFARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANNI 206
Query: 315 ----TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
T+F L+ +++++Y+ V ++ L ++N R ++ ++SFLK G PS E + QE+
Sbjct: 207 FLMWTSFMSLTAFHMYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQESA 266
Query: 371 FSFPWLRDKPVVLGSRFKDAFQD 393
P+ ++ G++ D
Sbjct: 267 LPRPFTSRSGIIFGAKISSLCGD 289
>gi|159490370|ref|XP_001703152.1| hypothetical protein CHLREDRAFT_195109 [Chlamydomonas reinhardtii]
gi|158270782|gb|EDO96617.1| predicted protein [Chlamydomonas reinhardtii]
Length = 280
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 44/82 (53%), Positives = 63/82 (76%)
Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMI 206
SY PYM WR +++FFGGA+ VFTTQ+LL ++GV+ + A AINW++KDGAGR+G+++
Sbjct: 10 SYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRYSGEAAAAINWVIKDGAGRLGRLL 69
Query: 207 FARQGKKFDYDLKQLRFAGDLL 228
FAR G++ D +LKQ R G L+
Sbjct: 70 FARWGRELDCELKQFRLMGYLV 91
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 46/93 (49%), Gaps = 4/93 (4%)
Query: 323 GYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR--DKP 380
GY+ +S +EV SV L +NRAR + A +L G VP + E N E + PW R
Sbjct: 88 GYLVASRKEVDSVELPYMNRARLAYAARQYLTGGQVPGVAEANHHEPL--LPWGRYNQGR 145
Query: 381 VVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTY 413
+VLG+ + A P L F + Y+ ++
Sbjct: 146 LVLGASVEAACAAPADLLTAAAAFREGPYLASF 178
>gi|224090990|ref|XP_002309136.1| predicted protein [Populus trichocarpa]
gi|222855112|gb|EEE92659.1| predicted protein [Populus trichocarpa]
Length = 500
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 93/353 (26%), Positives = 151/353 (42%), Gaps = 26/353 (7%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L ++ +++P GFP SV+ Y+ YM + + G T +LL +VG T
Sbjct: 98 LPDILKDFILPAGFPGSVSDDYLQYMVLQFPTNITGWICHTLVTSSLLKAVGAGSFTGTD 157
Query: 187 GAVA---INWILKDGAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
A + I W+ KDG G +G++ I R G FD D KQ R D + G+ +L T
Sbjct: 158 AAASAAAIRWVSKDGIGALGRLFIGGRFGDLFDDDPKQWRMYADFIGSAGSIFDLTTQVY 217
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL 302
P FLPLA N+ K VA + I FA N+G+V AK E V + L
Sbjct: 218 PAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEVAAKEE-VWEVGAQLLGLA 276
Query: 303 SIMISKRNPSLITTFALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
++ P L+ +F LL+ + ++ YQ + + T+N R + ++S +
Sbjct: 277 LGILILDTPDLVKSFPLLTSTWMSMRLLHLWLRYQSLSVLRFDTINLKRARILIKSHILQ 336
Query: 356 GHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVE-----------PFF 404
V + N ENI S+ ++ G ++ + ++ +
Sbjct: 337 SRVLGCVDCNQDENILSWETFTRPRIIFGVPLEEMIGGERSVAKIKHGVVIKIKMLLKLY 396
Query: 405 EKERYVVTYNPSKGKVYALL--KDQAKSDDILKAAFHAYVLLHCINSSNGHRS 455
KE++ + N K L+ K A S +L++ + Y LH S G+RS
Sbjct: 397 AKEKHFLVVNQQKTDFEVLVSFKVGATSMSVLRSVWQTY-WLHENWKSFGNRS 448
>gi|108706828|gb|ABF94623.1| expressed protein [Oryza sativa Japonica Group]
Length = 497
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 144/352 (40%), Gaps = 85/352 (24%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W L+ + +TQ LL ++GV AT W L
Sbjct: 148 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 207
Query: 196 KDGAGRVGKMIFA------------RQ-----------------GKKFDYDLKQLRFAGD 226
+D G +G ++F RQ G D + K R D
Sbjct: 208 RDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 267
Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
+ +LG ++L + P + + C +++++ V S +TR + + FA N D++A
Sbjct: 268 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 327
Query: 287 K---GECVGNIADLLGTGLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
K E + + + L +++ + + +F L+ +++++Y+ V+S+ L TLN
Sbjct: 328 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 387
Query: 343 ARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEP 402
R S+ ++ F+ G +EN F L DK
Sbjct: 388 ERSSILLQYFMDNG----------EENYF----LLDK----------------------- 410
Query: 403 FFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCIN-SSNGH 453
+G V+ + QA + DIL + H VL H + S +GH
Sbjct: 411 --------------EGSVHIFIHKQAAATDILMSFIHGLVLAHLMQKSKSGH 448
>gi|323455525|gb|EGB11393.1| hypothetical protein AURANDRAFT_11851, partial [Aureococcus
anophagefferens]
Length = 248
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 80/245 (32%), Positives = 127/245 (51%), Gaps = 9/245 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFP +VTP Y Y + + A GV +T+ LL +VGV A P A A NW +
Sbjct: 2 LPAGFPTTVTPGYGTYAALQFAAYACSSACGVLSTRVLLAAVGVGDGAAAPLAAAANWAI 61
Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELAT-AAVPHLFLPLACAA 253
KDG G +G + FA D +Q R ++L + VELA A P F+P+A A
Sbjct: 62 KDGLGMLGGVAFAATWSNSLDARPRQWRLRSSAALDLASLVELAALPAFPQYFVPIAGLA 121
Query: 254 NVAKNVAAVTSTSTRTPIYKAF---AKGENIGDVTAKGECVGNIADLLGTGLSIMISK-- 308
NVAKNV+ + ++++R I+++ + N+ D+TAK +A L G L + S
Sbjct: 122 NVAKNVSYLAASASRAAIHQSLSARSDASNLADLTAKTGSQTIVASLCGLALGVAASNAC 181
Query: 309 --RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
+ +A LS ++ +Y +K V TL+ AR + V++F +TG P+ + +
Sbjct: 182 GAEASDVWPAWAALSACHLGCTYASLKYVNTTTLDDARLAALVDAFRETGACPTPADVSE 241
Query: 367 QENIF 371
+E++F
Sbjct: 242 RESLF 246
>gi|328773288|gb|EGF83325.1| hypothetical protein BATDEDRAFT_85860 [Batrachochytrium
dendrobatidis JAM81]
Length = 434
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/336 (26%), Positives = 158/336 (47%), Gaps = 31/336 (9%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSR--NRATPG 187
+ R+ +P G+P SV Y W ++ G A+ V T Q +L S+G S +
Sbjct: 35 WFRNMFLPVGYPSSVHACYKKVHMWLFAENVAGSAISVLTAQAMLTSIGASTAAHETAAL 94
Query: 188 AVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHL 245
A+A++W+LKDG G +GKM+ R +FD K+ + G+ GA ++LAT PH
Sbjct: 95 AIAVDWVLKDGIGELGKMLMIQRFAHQFDTHPKRWKLYGEACSISGALMQLATCITNPHH 154
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFA--KGENIGDVTAKGECVGNIADLLG--TG 301
FL A +++ +T T + A G N+GD+ AK + ++A LLG +G
Sbjct: 155 FLFFASIGVGLRSMHYSIWAATHTTFTRNMATHNGVNVGDIVAKADSQLSLAHLLGMVSG 214
Query: 302 LSIM-ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH--- 357
+ ++ +S + +L F++LS + + ++ L++ R + F+ +
Sbjct: 215 IGMLAVSFQPAALFAWFSVLSAIQIVCTGLLLREARFEVLDQYRLVLIPREFMSEKNALR 274
Query: 358 VPSLREGNLQENIFSFPWLRD--------KPVVLGSRFKDAFQDPDAYLAVEPFFEKERY 409
+P+L + EN WL + + +G DAF+ A++ F+ +Y
Sbjct: 275 IPTLDQVVRFEN-----WLNEGLSKGQVVTDLSIGVTVDDAFKHSRLSHALQ-IFKNGKY 328
Query: 410 VVTY-----NPSKGKVYALLKDQAKSDDILKAAFHA 440
++ + S+ KVY +L + +++AAFHA
Sbjct: 329 LLGFKLADSTHSQIKVYVVLHQSIQPVHVIQAAFHA 364
>gi|400593658|gb|EJP61584.1| DUF647 domain protein [Beauveria bassiana ARSEF 2860]
Length = 467
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 150/305 (49%), Gaps = 29/305 (9%)
Query: 90 IDGRRWNYVAERDRSGKFKRNSI-----RAISLQSPQAPAEELMS--------FIRSYVV 136
I G+ V+E D+SG ++ + R I ++ PQ +L S + +
Sbjct: 18 ISGKNDVEVSELDKSGHVQKRWLHEPSGRIIHIR-PQGHTNDLFSRSLREWKKILSDSFL 76
Query: 137 PDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILK 196
P GFP SV+ Y+ Y T+ +L+ FF + + LL +GV ++ + IL+
Sbjct: 77 PVGFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFALLLTILR 136
Query: 197 DGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA----- 250
D RV ++FA R G + + D K+ RF DL + +EL + P+L PLA
Sbjct: 137 DATSRVATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYS---PYLS-PLAKVVVL 192
Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMI 306
+ + V++ +++ + FAK +N+ ++ AK VG I L+GT + + +
Sbjct: 193 SVGEALRALCGVSAGASKAALSMHFAKHDNLAELNAKEASQETAVGLIGLLVGT-IVVKV 251
Query: 307 SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
+ + S+++ LL G+++ +Y V+SV + TLNR R ++ + +L+TG V S E
Sbjct: 252 VEDHRSVVSLMILLVFGHLWMNYLGVRSVQMATLNRQRATILFQEYLRTGKVLSPAEVAK 311
Query: 367 QENIF 371
+E I
Sbjct: 312 REYII 316
>gi|255074355|ref|XP_002500852.1| predicted protein [Micromonas sp. RCC299]
gi|226516115|gb|ACO62110.1| predicted protein [Micromonas sp. RCC299]
Length = 561
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 145/344 (42%), Gaps = 59/344 (17%)
Query: 93 RRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMS--------------------FIR 132
RRW R +G F R Q P+ P EEL +R
Sbjct: 76 RRWG----RTPTGGFDR--------QPPRRPEEELAGQGEATSGRSGFARWDPRLSPRLR 123
Query: 133 SYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV--- 189
+ +P G+PD+V+P Y P++ W F + V T Q+LL ++G+ +TPGA+
Sbjct: 124 AVFLPAGYPDTVSPDYAPFIRWHLGSLMFRNVLEVITAQSLLVALGMG---STPGALPLT 180
Query: 190 -AINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFL 247
A W+LKDG G + + G+K D D K+ + ++ +EL T A P LFL
Sbjct: 181 AATKWVLKDGIGSLATLAAGSLGGQKCDEDPKRWWMVSNAFEDVARVIELVTPAAPALFL 240
Query: 248 PLACAANVAKNVAAVTSTS-TRTPIYKAFAKGENIGDVTAKGECVGN----IADLLGTGL 302
PLA A + A S + + FA+ EN DV A+ E G +A +G G+
Sbjct: 241 PLAATATFVRTAALTGRGSLVNGSLMQHFARNENTADVRARLEVQGRWLALVALPVGIGI 300
Query: 303 SIMISK----------RNPSL-ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
+S+ R+ ++ + + + C ++ + K + TLNRAR +
Sbjct: 301 FRTVSQSFASDPEEPARDVAIAVALYGGVVCAHLVCIWNAAKVLRFETLNRARLVRQARA 360
Query: 352 FLKTGHVPSLREGNL--QENIFSFPWLRDKPVVLGSRFKDAFQD 393
F G +E +F + P VLG+ DA +D
Sbjct: 361 FRDGGAAAIADYNAAGDEEGVFRARFSPGDP-VLGASLPDAARD 403
>gi|401421841|ref|XP_003875409.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491646|emb|CBZ26919.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 675
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 114/228 (50%), Gaps = 13/228 (5%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+PDG+P SVT +P+ W ++ +T+ +L VGV ++A + ++W++
Sbjct: 73 LPDGYPSSVTEDLLPFAFWAHVQVLAKNVTATLSTRAVLLGVGVGESKADLTSSTLSWMM 132
Query: 196 KDGAGRVGKMIFAR---QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP---HLFLPL 249
+DGA +G ++FA QG + K + D + + +EL + +P +F +
Sbjct: 133 QDGARMIGSVVFASVIPQG--LECRAKTWQLVTDFIHDSAHLLELCSPWLPGGQAIFRVV 190
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR 309
AA+V K +A+V TR P + FA N D+ AK G++ L+G L I+I+
Sbjct: 191 LVAASVIKALASVCGNGTRAPFTQHFALRNNGADIAAKAATRGHVGGLIGLALGIVIAYI 250
Query: 310 NPSL-----ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
P+ + FAL S ++F++Y+ V+ V L LN R +E +
Sbjct: 251 VPATSRRLNLAVFALCSAVHIFANYRGVRGVQLRHLNAPRLEWCLEQY 298
>gi|298706870|emb|CBJ25834.1| Unknown protein [Ectocarpus siliculosus]
Length = 673
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 112/232 (48%), Gaps = 15/232 (6%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV P YV Y W L+ GV TQ +L +GV R +TP A + W+
Sbjct: 205 LPAGYPGSVRPEYVRYRCWDILQDLSSHLRGVLATQAVLEGMGVGRAGSTPLAATLQWMA 264
Query: 196 KDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL-----FLPL 249
+DGA +G ++F F ++K R D ++G +E+ A +P FL L
Sbjct: 265 RDGASMIGGLLFTSVASANFGVNIKTWRLFADTSNDIGLALEM-LAPLPAFREGGRFLRL 323
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK- 308
C A+V K V +T I + +A NI D+ AK +A LLG GLS+ ++
Sbjct: 324 ICIASVFKAACGVAGGATGAAITEHWAIDNNIADIGAKNGAQHTVASLLGLGLSVWFARA 383
Query: 309 -------RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
+ +A+L+ ++ ++Y ++++ L +LN R S+ V +L
Sbjct: 384 VSVGGSASGSRVWGWYAVLTLVHLLANYAAMRTLALRSLNPTRASLLVTEYL 435
>gi|444725757|gb|ELW66311.1| hypothetical protein TREES_T100000484 [Tupaia chinensis]
Length = 406
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 83/304 (27%), Positives = 149/304 (49%), Gaps = 27/304 (8%)
Query: 151 YMTWRALKHF--FGGAM-GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIF 207
+ WR + F F ++ G TQ +L +GV RA+ A W++KD G +G+++F
Sbjct: 39 WRWWRFTRPFAAFASSLSGSLATQAVLLGIGVGNARASVSAATATWLVKDSTGMLGRIVF 98
Query: 208 ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTST 267
A F D+L ++ +E+ P F C +N+AK V +V +T
Sbjct: 99 AWWKGLF----------ADILNDVAMFLEIMAPIYPIFFTMTICTSNLAKCVVSVAGGAT 148
Query: 268 RTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITTFALLSCG 323
R + A+ N+ DV AK E + N+A LL + L + + PS+ + F LL+
Sbjct: 149 RAALTMHQARRNNMADVAAKDGSQETLVNLAGLLVSLLMLPLVSDFPSVSLGCFFLLTAL 208
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS--FPWLRDKPV 381
+++++Y+ V+++V+ TLN R + ++ FL+TG+V N E +++ P L
Sbjct: 209 HIYANYRAVRALVMETLNEGRLRLVLKHFLRTGNVLDPTSANQMEPLWTGFCPSLSLSLG 268
Query: 382 VLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFH 439
V R + D ++ E +E Y++ ++ S+ +V +L +A + +L+AA H
Sbjct: 269 VPLHRLISSVSD------LQQLVEGHQEPYLLCWDQSRNQVQVVLSQKAGPETVLRAATH 322
Query: 440 AYVL 443
VL
Sbjct: 323 GLVL 326
>gi|336273098|ref|XP_003351304.1| hypothetical protein SMAC_03608 [Sordaria macrospora k-hell]
gi|380092824|emb|CCC09577.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 498
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 135/275 (49%), Gaps = 11/275 (4%)
Query: 105 GKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGA 164
G+ + ++RA ++P+ L + + +++ P G+P SVTP Y+PY T+ +L+ FF
Sbjct: 21 GESQDTTLRASRTKNPR-----LQALLHAFL-PAGYPHSVTPDYLPYQTYDSLQAFFSSI 74
Query: 165 MGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRF 223
+ + +L +GV + ++P I I D R+ ++FA R G + + K RF
Sbjct: 75 TSLLANRAVLEGLGVGDSSSSPTGALILKIAGDTISRIATILFAHRMGTAIEPECKFYRF 134
Query: 224 AGDLLMELGAGVELATAAVPHL-FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
DLL + ++L T A+P+ L + +A V +++ V + +++ + FA N+
Sbjct: 135 LADLLNDSAQLLDLLTPALPYFPKLGVIVSAGVLRSLCGVAANASKASLSAHFALRGNLA 194
Query: 283 DVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFAL---LSCGYVFSSYQEVKSVVLHT 339
++ AK + LLG + ++ K +AL L ++ +Y V++V T
Sbjct: 195 ELNAKEASQETVVSLLGILVGSLVVKVVQDRGVVWALMVGLLGVHLLMNYWAVRAVRCRT 254
Query: 340 LNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP 374
LNR R +V V +L+ G V E E+I P
Sbjct: 255 LNRQRATVVVRGWLEWGRVLGPEEVARMESILQLP 289
>gi|125568971|gb|EAZ10486.1| hypothetical protein OsJ_00318 [Oryza sativa Japonica Group]
Length = 459
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 76/232 (32%), Positives = 109/232 (46%), Gaps = 20/232 (8%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT---PG 187
I+ +V+P GFP SV+ Y+ YM W+ + G T TLL +VGV T
Sbjct: 120 IKDFVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAAS 179
Query: 188 AVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A AI W+ KDG G G+++ R G FD D K+ R D + G+ EL T P F
Sbjct: 180 AAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYF 239
Query: 247 LPLACAANVAKNV------AAVTSTSTRTPIYKA----FAKGENIGDVTAKGECVGNIAD 296
LPLA N+AK + R P ++ FA+ N+G+V AK E A
Sbjct: 240 LPLASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQ 299
Query: 297 LLG--TGLSIM----ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
LLG G+ IM I P+L T+ + +++ YQ + + T++R
Sbjct: 300 LLGLSIGVFIMDTSGIKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTVDR 351
>gi|327350378|gb|EGE79235.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 497
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 149/332 (44%), Gaps = 24/332 (7%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ Y+ Y + +L+ F G+ ++ +L VGV A+P ++ +L
Sbjct: 74 LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 133
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAA 253
D GR+ ++FA R G + + K R A D+L + + + A P HL + + +
Sbjct: 134 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 193
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL 313
+V + + V + S++ + FA+ N+G++ AK + LLG ++ R +
Sbjct: 194 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRVSTP 253
Query: 314 ITTFALLSCGYVF---SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
T+A L + ++Y V+SV + TLNR R ++ + + G V + ++ + E I
Sbjct: 254 FATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCERI 313
Query: 371 FS----FPWLRDKPVVLGSRFKDAFQD------------PDAYLAVE---PFFEKERYVV 411
F W + R +FQ+ D + + F +E Y++
Sbjct: 314 FERDGVLRWKASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRRLLHLFREEEYIL 373
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+N S +LK+ LKA HA ++
Sbjct: 374 WFNSSDKGGTIVLKNSVTPLSQLKAWSHALMV 405
>gi|261206350|ref|XP_002627912.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239592971|gb|EEQ75552.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 497
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 149/332 (44%), Gaps = 24/332 (7%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ Y+ Y + +L+ F G+ ++ +L VGV A+P ++ +L
Sbjct: 74 LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 133
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAA 253
D GR+ ++FA R G + + K R A D+L + + + A P HL + + +
Sbjct: 134 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 193
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL 313
+V + + V + S++ + FA+ N+G++ AK + LLG ++ R +
Sbjct: 194 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRVSTP 253
Query: 314 ITTFALLSCGYVF---SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
T+A L + ++Y V+SV + TLNR R ++ + + G V + ++ + E I
Sbjct: 254 FATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCERI 313
Query: 371 FS----FPWLRDKPVVLGSRFKDAFQD------------PDAYLAVE---PFFEKERYVV 411
F W + R +FQ+ D + + F +E Y++
Sbjct: 314 FERDGVLRWKASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRRLLHLFREEEYIL 373
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+N S +LK+ LKA HA ++
Sbjct: 374 WFNSSDKGGTIVLKNSVTPLSQLKAWSHALMV 405
>gi|239610850|gb|EEQ87837.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ER-3]
Length = 498
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 149/332 (44%), Gaps = 24/332 (7%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ Y+ Y + +L+ F G+ ++ +L VGV A+P ++ +L
Sbjct: 75 LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 134
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAA 253
D GR+ ++FA R G + + K R A D+L + + + A P HL + + +
Sbjct: 135 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 194
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL 313
+V + + V + S++ + FA+ N+G++ AK + LLG ++ R +
Sbjct: 195 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRVSTP 254
Query: 314 ITTFALLSCGYVF---SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
T+A L + ++Y V+SV + TLNR R ++ + + G V + ++ + E I
Sbjct: 255 FATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCERI 314
Query: 371 FS----FPWLRDKPVVLGSRFKDAFQD------------PDAYLAVE---PFFEKERYVV 411
F W + R +FQ+ D + + F +E Y++
Sbjct: 315 FERDGVLRWKASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRRLLHLFREEEYIL 374
Query: 412 TYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+N S +LK+ LKA HA ++
Sbjct: 375 WFNSSDKGGTIVLKNSVTPLSQLKAWSHALMV 406
>gi|255574377|ref|XP_002528102.1| conserved hypothetical protein [Ricinus communis]
gi|223532491|gb|EEF34281.1| conserved hypothetical protein [Ricinus communis]
Length = 485
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 94/338 (27%), Positives = 145/338 (42%), Gaps = 19/338 (5%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGV-----SRNRAT 185
I+ +++P GFP SV+ Y YM + + G T +LL +VGV S A
Sbjct: 98 IKDFILPAGFPGSVSDDYFQYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFTGSTAAAA 157
Query: 186 PGAVAINWILKDGAGRVGKM-IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
A AI W+ KDG G +G++ I R G FD D KQ R D + G+ +L T P
Sbjct: 158 ASAAAIRWVSKDGIGALGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLITQVYPA 217
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
FLPLA N+ K VA + I FA N+G+V AK E V + L
Sbjct: 218 YFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEE-VWEVGAQLLGLALG 276
Query: 305 MISKRNPSLITTFALLSCGY-------VFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
++ P L+ +F L+ + ++ Y + + T+N R + V+S +
Sbjct: 277 ILILDTPGLVKSFPALASTWLSMRLLHLWLRYLSLSVLQFDTINLKRARLLVKSHILHSV 336
Query: 358 VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVE---PFFEKERYVVTYN 414
V + N +ENI + + G ++ + V+ + KE Y++ N
Sbjct: 337 VLGCVDCNREENILLWETFTKPQITFGVPLQEMVGGERSAAQVKTLLKLYTKENYLLVVN 396
Query: 415 PSKG--KVYALLKDQAKSDDILKAAFHAYVLLHCINSS 450
KG +V+ K A S L++ + Y L SS
Sbjct: 397 QQKGDFEVFVSFKVGATSMSALRSVWQTYWLHENWKSS 434
>gi|308804794|ref|XP_003079709.1| unnamed protein product [Ostreococcus tauri]
gi|116058166|emb|CAL53355.1| unnamed protein product [Ostreococcus tauri]
Length = 499
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/452 (25%), Positives = 193/452 (42%), Gaps = 66/452 (14%)
Query: 62 SESDETTAFRLVEDQFVDSSLRLICSE---DIDGRRWNYVAERDR-SGKFKRNSIRAISL 117
S DE RL + S+ L C+E D G W RD G F R R I
Sbjct: 51 SRDDEMHDSRLSSNA-TASTRTLTCAEVIGDASGAIWM----RDSVGGAFTRT--RPIDA 103
Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
++ + A + +R+ +P GFP +V+ Y ++ W F + + + Q+LL ++
Sbjct: 104 KTGRERAVGVPE-LRAVFLPSGFPQTVSADYADWLRWHLTSLLFRDVIEIMSAQSLLVAL 162
Query: 178 GVSRNRATPGAV----AINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELG 232
GV +TPGA+ A W+LKDG G ++ A G+ +D D K+ + L ++
Sbjct: 163 GVG---STPGALPLTAAAKWVLKDGVGSFATLLAGAFGGRWYDEDPKRWWAVSNALEDVA 219
Query: 233 AGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIY-KAFAKGENIGDVTAKGECV 291
+EL T A P LFLPLA +A + A S + + F + N+GD+ AK E
Sbjct: 220 RAIELVTPAAPGLFLPLAASATFVRAAALTGRGSLMNGTFMQHFGRNNNLGDIRAKLEVQ 279
Query: 292 GNIADLLGTGLSI----MISKRNPSL-------------ITTFALLSCGYVFSSYQEVKS 334
G L+ + I +++ + +L + L+ ++FS +Q +S
Sbjct: 280 GRWLALVALPVGIQVFQLVNAQASALHAEGDELGAAAAAFGGYGLVIGAHIFSCWQAARS 339
Query: 335 VVLHTLNRARFSVAVESFLKTG---HVPSLREGNLQENIF------SFPWLRDKPVVLGS 385
+ +LN R E+++ + +P+ E E ++ + P P +GS
Sbjct: 340 LKFDSLNGFRLLRLAEAYVASDGKIELPNCVEIGEIEGVYKPRRTLTTPTFGASPNEIGS 399
Query: 386 RFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKV-YALLKDQAKSDDILKAAFHAYVLL 444
+ DA++ YV+ ++ S+G V AL+ + A D+L AA L
Sbjct: 400 TW-------DAFIEATNASNGHPYVLGFDSSRGDVASALILESATPRDMLAAALACEKL- 451
Query: 445 HCINSSNGHRSLALKQHEYDNSNLMLTTADLE 476
RS + ++Y +S TAD E
Sbjct: 452 ------RIDRSSVVSAYDYADSQ----TADFE 473
>gi|225683629|gb|EEH21913.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
Pb03]
Length = 509
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 159/343 (46%), Gaps = 35/343 (10%)
Query: 136 VPDGFPDSVTPSYVPYMTW---------RALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
+P G+P SV+ Y+ + +L+ F G+ ++ +L VGV A+P
Sbjct: 73 LPAGYPHSVSDDYMDLLFLALENDRKYKDSLQAFSSSIAGLLASRAVLQGVGVGDASASP 132
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-H 244
++ +L++ GR+ ++FA R G + + K RFA D+L + V+ + A P H
Sbjct: 133 TVALLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLSPAFPKH 192
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL-S 303
L + L +++ + + V + S++ + FAK N+G++ AK + L+G S
Sbjct: 193 LRVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNAKDSSQETVISLMGMLCGS 252
Query: 304 IMISKRNPSLITTFAL--LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
+++S + L T AL L ++ ++Y V++V + TLNR R ++ + + G +
Sbjct: 253 LVVSHISTPLTTGLALIFLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTP 312
Query: 362 REGNLQENIFS----FPWLRDKPVVLGSRFKDAFQD--------------PDAYLAVE-- 401
++ + E IF W + R ++FQ+ DA + +
Sbjct: 313 KQTSKCERIFERDGILRWKASSTTLGFCRIGNSFQELLCGSNGSYRANSIRDAPIDIPRL 372
Query: 402 -PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
FEKE Y++ +NP + +LK+ + LKA HA ++
Sbjct: 373 LRLFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAWSHALLV 415
>gi|115434452|ref|NP_001041984.1| Os01g0141600 [Oryza sativa Japonica Group]
gi|54290177|dbj|BAD61065.1| unknown protein [Oryza sativa Japonica Group]
gi|113531515|dbj|BAF03898.1| Os01g0141600 [Oryza sativa Japonica Group]
Length = 323
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/220 (33%), Positives = 105/220 (47%), Gaps = 10/220 (4%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
I+ +V+P GFP SV+ Y+ YM W+ + G T TLL +VGV T A +
Sbjct: 103 IKDFVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAAS 162
Query: 191 ---INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
I W+ KDG G G+++ R G FD D K+ R D + G+ EL T P F
Sbjct: 163 AAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYF 222
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSI 304
LPLA N+AK V + I FA+ N+G+V AK E A LLG G+ I
Sbjct: 223 LPLASLGNLAKAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVFI 282
Query: 305 M----ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
M I P+L T+ + +++ YQ + + T+
Sbjct: 283 MDTSGIKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTV 322
>gi|449301217|gb|EMC97228.1| hypothetical protein BAUCODRAFT_147370 [Baudoinia compniacensis
UAMH 10762]
Length = 454
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/256 (28%), Positives = 131/256 (51%), Gaps = 10/256 (3%)
Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
+AP + L+ +P GFP SVT YV Y + +L+ F +G+ +++ +L SVGV
Sbjct: 42 KAPWQRLLDIF----LPAGFPHSVTVDYVEYQIYDSLQAFSSNIVGLLSSRAVLLSVGVG 97
Query: 181 RNRATPGAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELAT 239
A+P + I ++ GR+G + F A G + + K R D+ + ++ +
Sbjct: 98 DADASPTYALLLHIAQESLGRIGTIAFVALYGTALEPECKMYRLLADVFNDFAFVLDCLS 157
Query: 240 AAVP-HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL 298
A+P +L + +++ K + V + S + + FA+ N+G++ AK + LL
Sbjct: 158 PAIPKYLRVVTLSFSSLLKALCGVAAGSAKASLSAHFAQWGNLGELNAKDSSQETVISLL 217
Query: 299 G--TGLSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
G G ++ R P + TT LL ++ ++ + V++VV+ TLNR R ++ V L+
Sbjct: 218 GMWVGSLVLSWVRTPQATWTTLILLLAIHLETNRRAVRAVVMRTLNRQRATL-VFHHLQR 276
Query: 356 GHVPSLREGNLQENIF 371
GHVP+ E + QE IF
Sbjct: 277 GHVPTPEEVSKQERIF 292
>gi|380480425|emb|CCF42443.1| hypothetical protein CH063_12442 [Colletotrichum higginsianum]
Length = 484
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/397 (23%), Positives = 178/397 (44%), Gaps = 56/397 (14%)
Query: 98 VAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIR--------SYVVPDGFPDSVTPSYV 149
+AE D++G K + + S P + + L+ F R +PDG+P SVT Y
Sbjct: 10 IAEYDKAGNLKAKYVESASRIGP-SRVDVLLPFKRKKPWLRLLEIFLPDGYPHSVTEDYA 68
Query: 150 PYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR 209
Y + +++ F G G+ +++ + +GV + A+P + ++++ GR + FA
Sbjct: 69 AYQIYDSVQAFAGSIAGMISSRAVWEGLGVGDSLASPTGAMLVQVIRESTGRFATIAFAH 128
Query: 210 Q-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH--LFLPLACAANVAKNVAAVTSTS 266
G + + K R A D+L ++ ++ + PH FL L C +++ + + V +
Sbjct: 129 LFGTSIEAECKAWRLAADVLCDVAMVLDCLSPFFPHGVRFLVL-CFSSLLYSASGVAGNA 187
Query: 267 TRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTGLSIMISKRNPSLITTFALLSC 322
+++ + FAK N+G++ AK + I + GT + +++ + + T LL+
Sbjct: 188 SKSSLSGHFAKWNNLGELNAKDASQETAISLIGMISGTFVVSLLTDQRATWATLIFLLAL 247
Query: 323 GYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS------FPWL 376
++F +++ V++V +LNR R ++A + G V + +E IF F W
Sbjct: 248 -HLFLNWKGVRAVKSRSLNRQRANIAFSALFSKGQVLTPHHVAERERIFERRGGEVFRW- 305
Query: 377 RDKPVVLG-SRFKDAFQDPDAYLA--------------------VEPFFEKERYVV---- 411
+ VLG ++ + Q LA + FEKE YV+
Sbjct: 306 -NSGAVLGHCKYGVSLQSLLEALASGGGGGRERCFRLATVDLSTLLRRFEKEEYVLWCVV 364
Query: 412 ----TYN-PSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+Y+ S+ +V +LKD LKA FH +L
Sbjct: 365 SERTSYDAASRVRVMVVLKDGVSPKSQLKAWFHGLLL 401
>gi|452005321|gb|EMD97777.1| hypothetical protein COCHEDRAFT_1165025 [Cochliobolus
heterostrophus C5]
Length = 421
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 127/261 (48%), Gaps = 17/261 (6%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFP SVT YV Y + +L+ F G+ ++ +L VGV + ATP A + +L
Sbjct: 6 LPSGFPQSVTEDYVEYQIFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTAALLLSVL 65
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA-- 252
++ GR+ + FA R G + + K R D+L + G ++ + P PL A
Sbjct: 66 QESVGRMATIAFADRFGTALEPECKMYRLTADILNDSGMILDCLSPGFPK---PLRVAVL 122
Query: 253 --ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT-GLSIMISKR 309
A+ +++ V + S++ + FA N+G+V AK + LLG S ++S
Sbjct: 123 SFASCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFVVSHI 182
Query: 310 NPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
+ T T LL ++ ++Y V++V + +LNR R ++ L+ G V S +E + +
Sbjct: 183 TSDIATWCTLILLLVIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLSPKEVSAR 242
Query: 368 ENIFS----FPWLRDKPVVLG 384
E IF W D+ VLG
Sbjct: 243 ERIFERGGVLRWADDE--VLG 261
>gi|342881518|gb|EGU82408.1| hypothetical protein FOXB_07090 [Fusarium oxysporum Fo5176]
Length = 448
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/293 (25%), Positives = 142/293 (48%), Gaps = 14/293 (4%)
Query: 98 VAERDRSGKFKRNSIRAIS--LQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWR 155
+ E D+SG R + + + P+ L+ +R +P G+P SV+ Y+ Y +
Sbjct: 16 IVELDKSGNVSRTWNHTVDNHVVAKSKPSIPLIPSLREAFMPVGYPHSVSSDYLNYQFFD 75
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKF 214
+L+ FF + + LL +GV ++ + +LKD R+ ++FA Q G +
Sbjct: 76 SLQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAMSRIATIVFAHQFGLRI 135
Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAANVAKNVAAVTSTSTRTPIYK 273
+ D K+ RF DL + +EL + P+ + + + + V + +++ +
Sbjct: 136 EPDAKRFRFLADLFNDTAFFLELYSPYFGPYGKILALTSGETLRALCGVAAGASKAALSV 195
Query: 274 AFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSY 329
FAK +N+ ++ AK VG I L+GT + + + + S++ +L +++ +Y
Sbjct: 196 HFAKHDNLSELNAKEASQETAVGLIGLLVGT-IVVKYVEDHHSVLFLMTVLVFAHLWMNY 254
Query: 330 QEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVV 382
V++V + LN+ R ++ E +LKTG+V S E +ENI + +PVV
Sbjct: 255 LGVRAVCMDNLNQQRATILFEEYLKTGNVLSPEEVAHRENILLW-----RPVV 302
>gi|170118219|ref|XP_001890291.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164634725|gb|EDQ99048.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 471
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 166/360 (46%), Gaps = 36/360 (10%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
++ I +P G+PDSV+P Y+ Y AL+ F +G+ +++ +L GV A
Sbjct: 43 RRVLDLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLVGLLSSRAILEGFGVGDPSA 102
Query: 185 TPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
T + +L+D R+ ++ A G D+K+ R D+L + ++ + +
Sbjct: 103 TATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDVKKYRLLADMLNDAAVILDTISPRLD 162
Query: 244 HLFLP-----LACAANVAKNVAAVTSTSTRTPIYKAFA---KGE-NIGDVTAKGECVGNI 294
LF P C + + +++ A+++ ++ I FA KG ++GD+ AK +
Sbjct: 163 TLFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKETV 222
Query: 295 ADLLGTGLSIMISKR--NP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
LLG L +I P ++ TT +L ++ +Y V+ +VL TLN R +A +
Sbjct: 223 LALLGMLLGTLIVPHLTTPWTIYTTLLVLVGLHLTINYIGVRGLVLRTLNGQRTWLAWSA 282
Query: 352 FLKT--GHVPSLREGNLQENIFSFPWLRDKP--------VVLGSRFKDAFQDPDAYLAVE 401
+++T G P+ E E IF P P +GS F + P +E
Sbjct: 283 YMRTQPGRAPTPDEVASLERIFERPGAFRDPRSGVLMGRCTIGSSFSEILSGPIPPQLLE 342
Query: 402 PFFEKERYVVTYN---------PS-KG--KVYALLKDQAKSDDILKAAFHAYVLLHCINS 449
F++ERY++ ++ PS +G +++ LK+ ++D LKA HA L I+S
Sbjct: 343 -LFKEERYLLWFDRRCLLRPESPSVEGFLRLHICLKEGYNTEDQLKAWIHAAELCRAISS 401
>gi|255082448|ref|XP_002504210.1| predicted protein [Micromonas sp. RCC299]
gi|226519478|gb|ACO65468.1| predicted protein [Micromonas sp. RCC299]
Length = 226
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 106/216 (49%), Gaps = 7/216 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P+ +P+SV+ Y + W + G TT+ LL VGV AT + A +I
Sbjct: 3 LPEDYPNSVSDDYAAFQAWDTAQGLSSYVRGSLTTKALLEGVGVGATGATAASAAAQFIA 62
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELAT--AAVPHLFLPLACA 252
+D G VG ++F +G+ D + KQ R DL +G +ELA A FL LACA
Sbjct: 63 RDMCGMVGGLVFTLAKGRHLDAEAKQWRLFADLANNVGMAMELAAPLANDRWWFLALACA 122
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK---- 308
+VA+++ + +TR + + FA+ N D+ AK L+G L + ++
Sbjct: 123 GSVARSLCGCAAGATRAALTQHFARARNAADIAAKEGSQETAVTLVGMSLGVAVTTSTDG 182
Query: 309 RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
+ F L+ +V+ + + V+S+V+ +LNR R
Sbjct: 183 KTAWQWAVFVALTVAHVYCNVRAVRSLVIDSLNRER 218
>gi|413956563|gb|AFW89212.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 263
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 82/164 (50%), Gaps = 1/164 (0%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
E + S + VP+GFP SVTP YVP+ W L+ + +TQ LL ++GV A
Sbjct: 51 ERIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSA 110
Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
T W L+D G +G ++F QG D + K R D + +LG ++L + P
Sbjct: 111 TVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFP 170
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 287
+ + C +++++ V S +TR + + FA N D++AK
Sbjct: 171 SSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAK 214
>gi|451846811|gb|EMD60120.1| hypothetical protein COCSADRAFT_151456 [Cochliobolus sativus
ND90Pr]
Length = 482
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 142/296 (47%), Gaps = 16/296 (5%)
Query: 98 VAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRAL 157
V E + G+ KR + + P+ + ++ +I +P GFP SVT Y+ Y + +L
Sbjct: 33 VGEHGKHGEGKR-----VDVIVPKQAHKSVLQYIIDVFLPSGFPQSVTEDYIEYQVFDSL 87
Query: 158 KHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDY 216
+ F G+ ++ +L VGV + ATP A + +L++ GR+ + FA R G +
Sbjct: 88 QAFSSSIAGMLASRAVLQGVGVGDSNATPTAALLLSVLQESVGRMATIAFADRFGTALEP 147
Query: 217 DLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAF 275
+ K R D+L + ++ + P L + + A+ +++ V + S++ + F
Sbjct: 148 ECKMYRLTADILNDSAMILDCLSPGFPKPLRVAVLSFASCLRSLCGVCAGSSKASLSAHF 207
Query: 276 AKGENIGDVTAKGECVGNIADLLGT-GLSIMISKRNPSLIT--TFALLSCGYVFSSYQEV 332
A N+G+V AK + LLG S ++S L T T LL ++ ++Y V
Sbjct: 208 ATAGNLGEVNAKDSSQETVISLLGMLAGSFVVSHITSDLATWCTLILLLAIHLATNYAAV 267
Query: 333 KSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG 384
++V + +LNR R ++ L+ G V + +E + +E IF W D+ VLG
Sbjct: 268 RAVSMQSLNRQRANILFSHILQHGIVLNPKEVSARERIFERGGVLRWADDE--VLG 321
>gi|223974077|gb|ACN31226.1| unknown [Zea mays]
Length = 207
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 78/153 (50%), Gaps = 1/153 (0%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
VP+GFP SVTP YVP+ W L+ + +TQ LL ++GV AT W L
Sbjct: 6 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATFQWFL 65
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D G +G ++F QG D + K R D + +LG ++L + P + + C +
Sbjct: 66 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 125
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 287
++++ V S +TR + + FA N D++AK
Sbjct: 126 LSRSFTGVASGATRAALTQHFALANNAADISAK 158
>gi|297740212|emb|CBI30394.3| unnamed protein product [Vitis vinifera]
Length = 462
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/329 (26%), Positives = 144/329 (43%), Gaps = 16/329 (4%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS----RNRATP 186
++ +++P GFP SV+ Y+ YM + + T +LL + S N
Sbjct: 76 VKDFILPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKASCWSWFFFSNHCCC 135
Query: 187 GAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
+DG G VG++ Q G FD D KQ R DL+ G+ +L+T P
Sbjct: 136 FCCCFCCCHQDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLYPAY 195
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
FL LA N+AK VA + I FA N+G+V AK E A LLG L IM
Sbjct: 196 FLQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIM 255
Query: 306 ------ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVP 359
++ P L T+ + +++ YQ + + ++N R + V+S + VP
Sbjct: 256 VLTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVP 315
Query: 360 SLREGNLQENIFSFPWLRDKPVVLGSRFKDAF--QDPDAYL-AVEPFFEKERY--VVTYN 414
+ N +E I S+ ++ G ++ + P + + + + +E+Y VV
Sbjct: 316 GYVDCNRKEKILSWQRFLKPRIIFGVSLEELIGGERPVSKVRTLLKLYSQEKYILVVKQQ 375
Query: 415 PSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ +V+ K A S +L++ + +Y L
Sbjct: 376 GTDFEVFVSFKVGATSLSVLRSIWQSYWL 404
>gi|358397603|gb|EHK46971.1| hypothetical protein TRIATDRAFT_291240 [Trichoderma atroviride IMI
206040]
Length = 468
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/326 (26%), Positives = 155/326 (47%), Gaps = 17/326 (5%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
I +P GFP SV+ Y+ Y T+ +L+ FF + + LL +GV ++
Sbjct: 57 IMDSFLPVGFPHSVSKDYLAYQTFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFAL 116
Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
+ I+KD R+ ++FA R G + + D K+ RF D+ + +EL + + +
Sbjct: 117 LLTIMKDAMSRIATIVFAHRFGLRIEPDAKRYRFLADVFNDSAFFLELYSPYLGSWGKVI 176
Query: 250 ACAANVA-KNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSI 304
A + A + + V + +++ + FAK +N+ ++ AK VG I L+GT +
Sbjct: 177 ALSTGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVGTLVVK 236
Query: 305 MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
MI + S++ LL ++ ++Y V+SV + TLNR R ++ E FL++G V + +
Sbjct: 237 MIEDHS-SVVILMILLVIAHLGTNYLGVRSVCMATLNRQRATIVFEEFLRSGKVLTPEQV 295
Query: 365 NLQEN-IFSFPWLRD--KPVVLGSRFKDAF------QDPDAYLAVEPFFEKERYVVTYNP 415
+EN IF P +R+ V+ F + + DP L + R++ + P
Sbjct: 296 AERENIIFWNPIMRNLHGKKVVHVEFAEGYSHAMSCSDPHDKLIIVDGINHTRFICSRVP 355
Query: 416 SK-GKVYALLKDQAKSDDILKAAFHA 440
+ + LL A+ D ++A F A
Sbjct: 356 GRVTPIKLLLWVGAQPIDAIRAWFAA 381
>gi|307111734|gb|EFN59968.1| hypothetical protein CHLNCDRAFT_133075 [Chlorella variabilis]
Length = 434
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 142/347 (40%), Gaps = 54/347 (15%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
++ +R++ +P G+P SVTP Y+ Y H G T +++ ++GV AT
Sbjct: 32 LAAVRAFFLPTGWPHSVTPDYLRYQLASVPAHITGWMSHSLATSSMITALGVDAGPATIA 91
Query: 188 AV--AINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP- 243
A A+ W+ KDG G G++I R FD D K+ R + + +G +E+AT P
Sbjct: 92 ATSAAVKWLTKDGLGAAGRLIVGGRLSSVFDEDPKRWRMVAEGVTTVGLAMEIATQIYPG 151
Query: 244 -------------HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGEC 290
F+ LA + K++A T I F+ N+GDV AK E
Sbjct: 152 GLRSKFRVVSVVWSYFIALAGTGTLCKSMAKGMGTPCFRIIQTHFSATNNVGDVAAKEEV 211
Query: 291 VGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVE 350
A LLG S+ + AL + G T+ R ++
Sbjct: 212 WEVAAQLLGLAASV---------VALQALEAAGSP------------DTVVPKRAALMAS 250
Query: 351 SFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEP-------- 402
S ++TG VPS+ + NL E + P + V G A
Sbjct: 251 SHVRTGLVPSVEQVNLSEQVLLPPTMCQPSVHYGCTLDQALGSGGGGFGGGAASGGRGQQ 310
Query: 403 ------FFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
++ E+Y++T++ +G + L+ + A+ D+L+A + A L
Sbjct: 311 LAELVRLYDGEQYMLTWH--RGAAWVLVLEGAQRADLLRAMWQAAWL 355
>gi|223974535|gb|ACN31455.1| unknown [Zea mays]
Length = 115
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/89 (57%), Positives = 66/89 (74%), Gaps = 4/89 (4%)
Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYD---NS 466
+VTYNP+K KVYALLKDQAKSDDI+KAAFHA+VLLH IN+S+ R L LKQ + + +
Sbjct: 1 MVTYNPTKDKVYALLKDQAKSDDIIKAAFHAHVLLHFINASHA-RRLKLKQKQANPDQSE 59
Query: 467 NLMLTTADLEAHIGESCKVVSTALWGFQE 495
+L D AHI ESCK+VS++ F++
Sbjct: 60 HLYSRNVDFLAHIAESCKIVSSSYGTFKK 88
>gi|148685696|gb|EDL17643.1| cDNA sequence BC017158, isoform CRA_a [Mus musculus]
Length = 226
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/194 (31%), Positives = 94/194 (48%), Gaps = 14/194 (7%)
Query: 72 LVEDQFVDSSLRLICSEDIDG-------RRWNYVAERDRSGKFKRNSIRAISLQSPQAPA 124
L +QF + R CS DG RRW +++ + A P A A
Sbjct: 11 LCTEQFGSGAPR-GCSAAADGSLQWDGARRWGWLSR----APIAKPGQHAGGGGGPWA-A 64
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+S +RS ++P GFPDSV+P Y+PY W +++ F G TQ +L +GV +A
Sbjct: 65 LTALSGLRSVLLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKA 124
Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ A W++KD G +G++IFA +G K D + KQ R D+L ++ +E+ P
Sbjct: 125 SVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYP 184
Query: 244 HLFLPLACAANVAK 257
F +N+AK
Sbjct: 185 IFFTMTVSTSNLAK 198
>gi|302837684|ref|XP_002950401.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
nagariensis]
gi|300264406|gb|EFJ48602.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
nagariensis]
Length = 500
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 124/243 (51%), Gaps = 18/243 (7%)
Query: 151 YMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR- 209
+ TW ++ GV T+Q +L VGV + +TP A + ++D AG +G ++F+
Sbjct: 4 FQTWDTIQALSSYVRGVLTSQAILRGVGVGQEASTPLAAVFTFFVRDLAGMLGGILFSYL 63
Query: 210 QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTR 268
+G FD KQ R D+ +LG VELA+ P +FLP+AC +VA+++ V +TR
Sbjct: 64 EGSSFDACAKQWRLFADITNDLGMTVELASPLFPRVMFLPIACLGSVARSLTGVAGGATR 123
Query: 269 TPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSI---MISKRNPSLITT------- 316
+ + FA+ N DV+AK + +LG TG+++ + + P+ T
Sbjct: 124 AALTQHFARRGNAADVSAKEQSQETATTILGMVTGMAVTRLLAAGEGPAGGPTGAAGFVI 183
Query: 317 ----FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS 372
F L+ +V ++ ++ ++L +LN R + V+ +L+ G V S R + E++
Sbjct: 184 AWLVFLGLTAVHVVANVAAMRVLLLTSLNTPRLELLVDRYLQDGRVLSPRAVSQLEDLTP 243
Query: 373 FPW 375
P+
Sbjct: 244 PPF 246
>gi|145347876|ref|XP_001418386.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578615|gb|ABO96679.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 362
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 72/259 (27%), Positives = 113/259 (43%), Gaps = 46/259 (17%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
M +R +P GFP + + YV ++ W+ L F + + + Q+LL ++G+ TPG
Sbjct: 1 MPELRRIFLPSGFPATTSADYVEWLRWQLLSLLFRDVLEIMSAQSLLVALGMGN---TPG 57
Query: 188 AV----AINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
A+ A W+LKDG G +I G+++D D K+ + L ++ +EL T A
Sbjct: 58 ALPLTAAAKWVLKDGVGSFATLIAGSFGGQRYDEDPKRWWGLSNTLEDVARAIELVTPAA 117
Query: 243 PHLFLPLACAANVAKNVAAVTSTSTRTPIY-KAFAKGENIGDVTAKGECVGNIADLLGTG 301
P LFLPLA +A + A S + + F + N+GD+ AK E G L+
Sbjct: 118 PGLFLPLAASATFVRAAALTGRGSLLNGTFMQHFGRNNNLGDIRAKLEVQGRWLALIAL- 176
Query: 302 LSIMISKRNPSLITTFALLSCG---------------------------YVFSSYQEVKS 334
P+ I F L+S +VF+ +Q KS
Sbjct: 177 ---------PAGIQVFQLVSAAAADLAADGDEFGAFVAAFGAYGGVIGVHVFACWQAAKS 227
Query: 335 VVLHTLNRARFSVAVESFL 353
+ TLNR R E+F+
Sbjct: 228 LKFDTLNRFRLLKLAEAFV 246
>gi|414586419|tpg|DAA36990.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 311
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 72/255 (28%), Positives = 122/255 (47%), Gaps = 13/255 (5%)
Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
D+L +LG +E+ + P LFL +A N AK +A V + +TR PIY +FAK N+ D+
Sbjct: 33 DVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLF 92
Query: 286 AKGECVGNIADLLGTGLSI-----MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
AKGE + + +++G G I + S LI LLS +++ QE+++ ++TL
Sbjct: 93 AKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG-PLLSAVHIYGVIQEMRATPVNTL 151
Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFP---WLRDKPVVLGSRFKDAFQDPDAY 397
N R ++ V F+K+G V S E +E++ FP V +G + F P
Sbjct: 152 NPQRTAMIVADFIKSGKVSSPAELRYKEDLL-FPNRVIEEAGSVKIGQPLRKVF-SPRLV 209
Query: 398 LAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLA 457
+ F E+++++ + + Y +L+ A +D L+ A S G R A
Sbjct: 210 EQLRATFPGEKFLLSQKSN--RTYMVLERSASGEDALRGWLVAAFASEMERSGAGSRDDA 267
Query: 458 LKQHEYDNSNLMLTT 472
+ Y+ + T
Sbjct: 268 VLSEAYEKMERVFPT 282
>gi|322700393|gb|EFY92148.1| DUF647 domain-containing protein [Metarhizium acridum CQMa 102]
Length = 466
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 147/312 (47%), Gaps = 29/312 (9%)
Query: 98 VAERDRSGKFKR-------NSIRAISLQSPQAPAEELMSFIRSY-------VVPDGFPDS 143
++E D+SG +R N+I L+S P+ + S +R + +P GFP S
Sbjct: 30 ISELDKSGNVQRRWLHRSDNTIAV--LESETEPSSIIHSSVREWKRFLSDAFLPIGFPHS 87
Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVG 203
V+ Y+ Y T+ +L+ FF + + LL +GV ++ + IL D R+
Sbjct: 88 VSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSSATFAMLITILTDATSRIA 147
Query: 204 KMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA-KNVAA 261
++FA R G + + D K+ RF D+ + +EL + LA + A + +
Sbjct: 148 TIVFAHRFGLRIEPDAKRFRFLADIFNDTAFFLELYSPYFGDYGKILALSVGQALRALCG 207
Query: 262 VTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKRNPSLITTF 317
V + +++ + FAK +N+ ++ AK VG I L+GT L + + + + S++
Sbjct: 208 VAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVGT-LVVKLVQDHQSVVYLM 266
Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR 377
L +++ +Y V+SV + T NR R ++ ++ TG V + +ENI +
Sbjct: 267 VTLVLAHLWMNYLGVRSVCMTTFNRQRATILFREYMTTGTVLTPEAVAARENIVLWA--- 323
Query: 378 DKPVVLGSRFKD 389
PVV GSR +
Sbjct: 324 --PVV-GSRLAE 332
>gi|317037751|ref|XP_001399065.2| hypothetical protein ANI_1_1452164 [Aspergillus niger CBS 513.88]
Length = 525
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 140/303 (46%), Gaps = 17/303 (5%)
Query: 65 DETTAFRLVE-DQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRA-ISLQSPQA 122
+++TA E D+ + I SE G A R+ K N R ++ S A
Sbjct: 5 EDSTAITFTEVDEVNHPTATYIYSEPAAGP-----AHREHGKKHGENWGRVDVAHASSTA 59
Query: 123 PA----EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
PA + + F+ +P G+P SVT Y PY + +L+ F G+ +++ +L VG
Sbjct: 60 PAAWSSKSVQDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVG 119
Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
V A+P A + IL+D +GR+ ++FA R G + + K R A D+ +L ++
Sbjct: 120 VGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMILDC 179
Query: 238 ATAAVP--HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 295
+ +P + + A V + + V S++ + F++ N+ +V AK I
Sbjct: 180 LSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETII 239
Query: 296 DLLGTGLSIMISKRNPSLITTFA---LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
L+G + + R + TT+ +L ++ +Y V+SV + +LNR R ++ +
Sbjct: 240 SLIGMLVGSFVVSRVTNFSTTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTL 299
Query: 353 LKT 355
L T
Sbjct: 300 LNT 302
>gi|452823886|gb|EME30893.1| hypothetical protein Gasu_19060 [Galdieria sulphuraria]
Length = 346
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/236 (26%), Positives = 111/236 (47%), Gaps = 9/236 (3%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L + +R +P G+P+SVT Y + W +L+ V T++ L+ +GV N A P
Sbjct: 60 LKNVLRRLFLPSGYPESVTDDYANFEKWDSLQGLCSYLRQVMVTRSTLSGLGVGNNFAHP 119
Query: 187 GAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH- 244
A A+ + ++G G VG ++F + +FD D K R D++ + +E P
Sbjct: 120 TAAALMLLWREGFGVVGGLLFTWGKSSRFDKDSKSWRLFADVINNVALTMEWTAPLFPTP 179
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
+F L +AN+ ++ + +T T FAK NIGD+ +K +G L+G LS
Sbjct: 180 VFFALHSSANIFHSLCGIAGGATCTAFQAHFAKRNNIGDICSKHGNIGRAVSLVGLLLST 239
Query: 305 -------MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
++ P F L+ +++ + ++S+ L LN+ R + E ++
Sbjct: 240 LCFWSSSLLENSLPFSFGVFCFLTVFHIYFNILALRSLHLAVLNKERGYIVAEHYI 295
>gi|384249834|gb|EIE23315.1| hypothetical protein COCSUDRAFT_15701 [Coccomyxa subellipsoidea
C-169]
Length = 265
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 91/182 (50%), Gaps = 3/182 (1%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGV--SRNRATPG 187
F + +P G+P++VTP Y+ + W H G T +LL +VGV S AT
Sbjct: 4 FCTGFFLPRGYPETVTPDYLRFQLWAIPAHITGWMSTSLATSSLLKAVGVGNSAVGATAA 63
Query: 188 AVAINWILKDGAGRVGK-MIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A AI WI KDG G G+ ++ R G FD D K+ R + + +G +E+ATA P F
Sbjct: 64 AAAIKWITKDGIGAAGRVLVGGRLGNVFDEDPKRWRMVAEAFLTVGLALEIATAFSPANF 123
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI 306
+ LA A N+++ V + I FA N+GDV AK E A LLG S+ +
Sbjct: 124 IVLAGAGNLSRAVGRGMTNPCFRIIQTHFAAASNVGDVAAKEEVWEVSAQLLGLAGSVGL 183
Query: 307 SK 308
K
Sbjct: 184 LK 185
>gi|303280291|ref|XP_003059438.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459274|gb|EEH56570.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 568
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 135/296 (45%), Gaps = 37/296 (12%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV- 189
+R+ +P G+PD+V+ Y ++ W F + V T+Q+LL ++G+ +TPGA+
Sbjct: 146 LRAVFLPAGYPDTVSSDYATFIRWHLASLMFRNVLEVLTSQSLLVALGMG---STPGALP 202
Query: 190 ---AINWILKDGAGRVGKM---IFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
A W+LKDG G + +F G+K+D D K+ + L ++ +EL T A P
Sbjct: 203 LAAATKWVLKDGVGSFATLLAGVFG--GQKYDEDPKRWWAVTNSLEDVARVLELVTPAYP 260
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIY-KAFAKGENIGDVTAKGECVGNIADLLG--- 299
LFLPLA +A + A S + + + +N GDV AK E +G IA G
Sbjct: 261 ALFLPLAASATFVRAGALTGRGSLINGTFMQHLGRNQNFGDVRAKMEALG-IAIFRGVSG 319
Query: 300 ---------------TGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
T +S + R + + + ++F ++ + LNR R
Sbjct: 320 AFGADAAAMAAEGGETAMSELT--RYGGVFGAYGAVLSSHLFCCWKAADVLRFDALNRHR 377
Query: 345 FSVAVESFLKTGH--VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYL 398
+ A E +++ G V S+ + +E ++ P LG+ +DA +D D L
Sbjct: 378 LTRAAEEYVRGGREAVLSVEDAGEREGVYRARVTASTP-TLGASVEDAARDWDDLL 432
>gi|322707810|gb|EFY99388.1| DUF647 domain-containing protein [Metarhizium anisopliae ARSEF 23]
Length = 462
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 84/307 (27%), Positives = 147/307 (47%), Gaps = 28/307 (9%)
Query: 98 VAERDRSGKFKR-------NSIRAISLQSPQAPAEELMSFIRSY-------VVPDGFPDS 143
++E D+SG +R N+I AI L+S P + S +R + +P GFP S
Sbjct: 30 ISELDKSGNVQRRWLHRSDNTI-AI-LESKTEPNSIIHSSVREWKRFLSDAFLPIGFPHS 87
Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVG 203
V+ Y+ Y T+ +L+ FF + + LL +GV ++ + IL D R+
Sbjct: 88 VSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSSATFAMLITILTDATSRIA 147
Query: 204 KMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA-KNVAA 261
++FA R G + + D K+ RF D+ + +EL + LA + A + +
Sbjct: 148 TIVFAHRFGLRIEPDAKRFRFLADIFNDSAFFLELYSPYFGDYGKILALSVGQALRALCG 207
Query: 262 VTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKRNPSLITTF 317
V + +++ + FAK +N+ ++ AK VG I ++GT L + + + + S++
Sbjct: 208 VAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLVVGT-LVVKLVQDHQSVVYLM 266
Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR 377
L +++ +Y V+SV + TLNR R ++ ++ TG V + +ENI WL
Sbjct: 267 VALVFAHLWMNYLGVRSVCMTTLNRQRATILFREYMTTGRVLTPEAVAARENI--VLWL- 323
Query: 378 DKPVVLG 384
PVV G
Sbjct: 324 --PVVGG 328
>gi|7327810|emb|CAB82267.1| putative protein [Arabidopsis thaliana]
Length = 447
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 127/272 (46%), Gaps = 12/272 (4%)
Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL 249
I W+ KDG G +G+++ R G FD D KQ R D + G+ +LAT P FL L
Sbjct: 120 IRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFLLL 179
Query: 250 ACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI--- 306
A N+AK VA + I FA N+G+V AK E A L+G G I+I
Sbjct: 180 ASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGILIIDT 239
Query: 307 ---SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE 363
K P ++ T+ + +++ YQ + + +T+N R + VES + VP +
Sbjct: 240 PGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSVVPGYVD 299
Query: 364 GNLQENIFSFPWLRDKPVVLGSRFKDAF---QDPDAYLAVEPFFEKERYVVTYNP-SKGK 419
N +ENI + ++ G ++ + A+ + KE+Y++T N +K
Sbjct: 300 CNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVSKVKALLKMYTKEKYILTLNKLNKDT 359
Query: 420 VYAL-LKDQAKSDDILKAAFHAYVLLHCINSS 450
+++ K A S D+L+ + AY L + S
Sbjct: 360 EFSVSFKVNATSRDVLRCLWQAYWLEENMEES 391
>gi|429856893|gb|ELA31783.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 500
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 173/369 (46%), Gaps = 47/369 (12%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP-GAVAINWI 194
+PDG+P SVT Y Y + + + F G G+ +++ + +GV + A+P GA+ I +
Sbjct: 82 LPDGYPHSVTDDYAAYQIYDSFQAFAGTIAGMISSRAVWEGMGVGDSMASPIGAMMIQ-V 140
Query: 195 LKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH--LFLPLAC 251
+++ GR+G + FA G + + K R A D+L + ++ + P FL L C
Sbjct: 141 IRESMGRLGTIAFAHLMGTSIEAECKAYRMASDVLCSIAFILDCLSPLFPRSIRFLILCC 200
Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIM--IS 307
++ + + V ++++ + FAK N+G++ AK L+G TG ++ ++
Sbjct: 201 SS-ILFAASGVAGNASKSSLSGHFAKWNNLGELNAKDASQETAISLMGMLTGTLVVSCLT 259
Query: 308 KRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
+ T F L++ Y+F +++ V++V +LNR R ++A + L V + + + +
Sbjct: 260 TPTTTWTTLFILIAI-YLFLNWKGVRAVKSRSLNRQRANIAFSALLSKDQVLTPGKISEK 318
Query: 368 ENIFS------FPWLRD----------------KPVVLGSRFKDAFQDPDAYLA-VEPFF 404
E IF F W D + + G + +FQ P L+ + F
Sbjct: 319 ELIFERRGGEVFRWNADTILGHCKFGVPLESMLQALANGQQNMRSFQLPTVDLSNLMELF 378
Query: 405 EKERYVV--------TYN-PSKGKVYALLKDQAKSDDILKAAFHAYVL---LHCINSSNG 452
E ++Y++ Y+ S+ KV +LK+ + LKA FH +L LH + +
Sbjct: 379 ENQQYIMWCQISEATNYDAASRIKVLIVLKNGVSAKSQLKAWFHGLLLAERLHGLKAPEL 438
Query: 453 HRSLALKQH 461
R+ L QH
Sbjct: 439 RRA-QLGQH 446
>gi|452842814|gb|EME44750.1| hypothetical protein DOTSEDRAFT_130464 [Dothistroma septosporum
NZE10]
Length = 447
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/409 (24%), Positives = 187/409 (45%), Gaps = 53/409 (12%)
Query: 113 RAISLQSPQAPAE----ELMSFIRS---YVVPDGFPDSVTPSYVPYMTWRALKHFFGGAM 165
+A +QSP+ E +++S + +P GFP+SVT Y+ Y + +L+ F
Sbjct: 18 KATYVQSPEGSTERSRVDVISLWQQAFDVFMPVGFPNSVTHDYLEYQIYDSLQAFSSSIA 77
Query: 166 GVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFA 224
+ ++ +L+SVGV A+P A + I + AGR+ ++FA R G + K R
Sbjct: 78 SLLASRAVLSSVGVGDADASPTAAILLSIFQTSAGRLATILFAHRLGTSLGPECKMYRLL 137
Query: 225 GDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGD 283
D+L + G +++ + A P + + + +++ +++ V + S + + FA+ N+ +
Sbjct: 138 ADILNDTGFLLDMISVAFPKPISVIILSFSSILRSLCGVAAGSAKASLSAHFARWGNLAE 197
Query: 284 VTAKGECVGNIADLLGTGL-SIMIS--KRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
+ AK + LLG S++IS + TT +L ++ ++ + V++V + TL
Sbjct: 198 LNAKDSSQETVISLLGMWAGSVVISYITEPTAAWTTLIVLLAIHLETNRRAVRAVTMQTL 257
Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRD---------------------- 378
NR R ++ L+ GHVP+ E QE IF RD
Sbjct: 258 NRQRATLMFH-HLQRGHVPTPVEIASQERIFE----RDGVLRGNDDRILGHCAVGASTER 312
Query: 379 --KPVVLGSRFKDAF--QDPDAYLAVEPFFEKERYVVTYN----PSKGKVYALLKDQAKS 430
K V AF Q P A ++ +EK Y++ + P + ++ +LK A
Sbjct: 313 LLKAVSGAQTSTKAFPSQGP-AMQSILTVYEKSPYILWQDRRSLPGRSSLFIVLKKGATP 371
Query: 431 DDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNLMLTTADLEAHI 479
++L A + A L +++N +S + +Y LM +T D A++
Sbjct: 372 PEMLTAWWQA---LAVASANNIKKS--TQPDQYAELELMRSTMDQAANL 415
>gi|219129806|ref|XP_002185071.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403566|gb|EEC43518.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 500
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 112/237 (47%), Gaps = 7/237 (2%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
+ S +RS +P G+P P Y+ Y W ++ V TQ +L VGV R AT
Sbjct: 77 VQSGLRSTFLPSGYPARTPPGYLRYSVWSWVQDLSTQLRSVLATQRVLEGVGVGREGATA 136
Query: 187 GAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PH 244
+ +N++++DG G + F + +F D+K+ R D+++++G +E+A +A P
Sbjct: 137 LSALLNFLVRDGCGMAASLAFTSAAASRFQSDVKRWRIFADVMVDIGITLEVAASACPPT 196
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
LFLP+ N K + V + + I +AKG +I D+ AK + LG +
Sbjct: 197 LFLPMISLGNACKAMCGVAAGACGGAINLHWAKGSDISDIQAKFGAQHTVTGSLGLVFAA 256
Query: 305 MISKRNPS-----LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTG 356
+K + L +A L+ +V ++ + ++ + N R + + + ++G
Sbjct: 257 FFAKSVSTVPGGRLWILYAALTALHVMANLRCMRIIHFDVFNTTRLRLVLRALFESG 313
>gi|413942078|gb|AFW74727.1| hypothetical protein ZEAMMB73_469739, partial [Zea mays]
Length = 199
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 83/159 (52%), Gaps = 7/159 (4%)
Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELM--SFIRSYVVPDGFPDSVTPSYVPYMTWRALKH 159
+R K S+ + + P + + SF+ + P G+P SV Y+ Y +R L+H
Sbjct: 45 ERFSKLHDKSVTTVKVIQDSRPIHDKLVDSFLNKFF-PSGYPYSVNEGYLTYTKFRVLQH 103
Query: 160 FFGGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDL 218
F + V +TQ+LL +VG+ R TP A A++WILKDG K+I + G + D +
Sbjct: 104 FSSAMLHVLSTQSLLFAVGL---RPTPTQATAVSWILKDGMQHTRKLICSSMGARMDSEP 160
Query: 219 KQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAK 257
K R D+L +LG +E+ + P LFL +A N AK
Sbjct: 161 KSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAK 199
>gi|298708275|emb|CBJ48338.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 284
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 107/220 (48%), Gaps = 5/220 (2%)
Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
+ +F+R +P G+P SV Y+ Y TW ++ GV +L++ GV A+
Sbjct: 62 SMSAFVRRTFLPTGYPSSVRKEYMEYQTWDTVQALCSYLRGVLCLTAMLHASGVGDENAS 121
Query: 186 PGAVAINWILKDGAGRVGKMIFARQG-KKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
A A+ W+L+DG G +G ++F+ D ++K+ R D+ ++G ++L
Sbjct: 122 AIAAAMVWVLRDGFGMIGSLLFSYAASSHMDSNIKEWRLFADIANDVGLTLDLVAPVFRG 181
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIAD-LLGT 300
F ++ A V K + V + T+T I FA +N+ D++AK E N+ LLG
Sbjct: 182 NFAVVSSLATVFKALCGVAAGCTKTSITSHFALKQNMADISAKENAQETAVNVCGMLLGV 241
Query: 301 GLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
++ +S + F S +V+++Y+ + S+ L L
Sbjct: 242 KVASYLSDAPTGRVLIFVFFSLLHVYANYRYMPSIHLLPL 281
>gi|149067629|gb|EDM17181.1| similar to cDNA sequence BC017158, isoform CRA_a [Rattus
norvegicus]
Length = 246
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 98/201 (48%), Gaps = 14/201 (6%)
Query: 66 ETTAFR--LVEDQFVDSSLRLICSEDIDGR-RWNYVAERDRSGKFKRNSIRAISL----- 117
+T + R L +QF + R CS DG +W+ R G F R SI
Sbjct: 3 DTASLRAPLCTEQFGSGAPR-SCSAAADGSLQWD---RAQRWGWFSRASITKPGQHEGGG 58
Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
+ P A A +S +RS ++P GFPDSV+P Y+ Y W +++ F G TQ +L +
Sbjct: 59 RGPWA-ALTTLSGLRSVLLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGL 117
Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVE 236
GV +A+ A W++KD G +G++IFA +G K D + KQ R D+L + +E
Sbjct: 118 GVGNAKASVSAATSTWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLE 177
Query: 237 LATAAVPHLFLPLACAANVAK 257
+ P F +N+AK
Sbjct: 178 IMAPMYPIFFTMTVSTSNLAK 198
>gi|393217163|gb|EJD02652.1| DUF647-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 455
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 87/334 (26%), Positives = 152/334 (45%), Gaps = 31/334 (9%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFPDSVTP Y+ Y + +L+ F G+ +++ +L GV A+ + +L
Sbjct: 51 LPSGFPDSVTPDYIGYQIFDSLQAFSSSIAGLLSSRAVLEGFGVGNENASSTHAVLLTVL 110
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAA 253
+D GR+ ++FA + G + K R A D+ + ++ + A+P L + C A
Sbjct: 111 QDATGRLATILFAWKFGPALAPEAKMYRLAADIFNDSAFVLDCLSPALPTQLRIAALCLA 170
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNIADLLGTGLSIMISKR 309
+ + V + + FA+ N+G++ AK +G L G+ + I+ R
Sbjct: 171 GALRAICGVCGGGAKAALSVHFARTGNVGELNAKDASQETVIGLFGMLCGSFVMAHITSR 230
Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL-KTGHVPSLREGNLQE 368
+ I LL+ ++ ++Y V++V L TL+R R ++A + ++ +V + E +E
Sbjct: 231 RTTWIALIFLLT-AHLVTNYLAVRAVTLTTLSRQRTNIAYGVYCEQSNNVLTPSEVARRE 289
Query: 369 NIFSFPW-LRD--KPVVLG--------------SRFKDAFQDPDAYLAVEPFFEKERYVV 411
IF P +RD + +LG SR +A +A FE E YV+
Sbjct: 290 RIFEVPGAIRDSLQGFLLGTCTICTSPEELLRRSRQGSMASWKNALVA----FEHEAYVL 345
Query: 412 TYNPSK--GKVYALLKDQAKSDDILKAAFHAYVL 443
+ K+ K AK +D LKA +A L
Sbjct: 346 LGIRADFPCKILICFKHGAKPEDYLKAWVNAVEL 379
>gi|302881823|ref|XP_003039822.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
77-13-4]
gi|256720689|gb|EEU34109.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
77-13-4]
Length = 444
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 73/294 (24%), Positives = 140/294 (47%), Gaps = 13/294 (4%)
Query: 98 VAERDRSGKFKRNSI-----RAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYM 152
V + D+SG R + R I P + + +P G+P SV+ Y+ Y
Sbjct: 16 VIDLDKSGNVARRWVHMPSNRIIRKAKPSLFKRSMRKILMDAFLPVGYPHSVSDDYLAYQ 75
Query: 153 TWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-G 211
+ +L+ FF + + LL +GV ++ + +LKD RV +IFA+Q G
Sbjct: 76 FFDSLQAFFSTITALLANRALLQGLGVGDANSSATFAMLLTVLKDAMSRVATIIFAQQFG 135
Query: 212 KKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAANVAKNVAAVTSTSTRTP 270
+ + D K+ RF DL + +EL + P+ + + + + V +++
Sbjct: 136 LRIEPDAKRYRFLADLFNDTAFFMELYSPYFGPYGKILALTSGEALRALCGVAGGASKAA 195
Query: 271 IYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVF 326
+ FAK +N+ ++ AK VG + L+GT + + + S++ +L +++
Sbjct: 196 LSVHFAKHDNLAELNAKEASQETAVGLVGLLVGT-FVVKHVEDHTSVVYLMTVLVLAHLW 254
Query: 327 SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI-FSFPWLRDK 379
+Y V+SV ++ LNR R ++ + +LK+G+V + E +E+I F P +R++
Sbjct: 255 MNYLGVRSVCMNVLNRQRATILYKEYLKSGNVLTPNEVARRESILFWSPVVRNQ 308
>gi|414586421|tpg|DAA36992.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 164
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 89/161 (55%), Gaps = 6/161 (3%)
Query: 203 GKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAV 262
GK+I + G + D + K R D+L +LG +E+ + P LFL +A N AK +A V
Sbjct: 5 GKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVV 64
Query: 263 TSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI-----MISKRNPSLITTF 317
+ +TR PIY +FAK N+ D+ AKGE + + +++G G I + S LI
Sbjct: 65 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG- 123
Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
LLS +++ QE+++ ++TLN R ++ V F+K ++
Sbjct: 124 PLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKVRYL 164
>gi|414879330|tpg|DAA56461.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
Length = 115
Score = 92.8 bits (229), Expect = 3e-16, Method: Composition-based stats.
Identities = 50/88 (56%), Positives = 63/88 (71%), Gaps = 2/88 (2%)
Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHR-SLALKQHEYDNSNL 468
+VTYNP+K KVYALLKDQAKSDDI+KAAFHA+VLLH IN+S+ R KQ D S
Sbjct: 1 MVTYNPTKDKVYALLKDQAKSDDIIKAAFHAHVLLHFINASHARRLKQKQKQANPDRSEY 60
Query: 469 MLT-TADLEAHIGESCKVVSTALWGFQE 495
+ + D AHI ESCK+VS++ F++
Sbjct: 61 LYSRNMDFLAHIAESCKIVSSSYGTFKK 88
>gi|398388772|ref|XP_003847847.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
gi|339467721|gb|EGP82823.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
Length = 457
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/243 (25%), Positives = 127/243 (52%), Gaps = 5/243 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFP SVT Y Y + +L+ F + +++ +L+S+GV +A+P A + +L
Sbjct: 58 LPAGFPHSVTEDYTEYQIYDSLQAFSSSIAAMLSSRAVLSSIGVGDTKASPTAALLLSVL 117
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAA 253
++ GR+ ++FA R G + + K R D+L + +++ + P + + + +
Sbjct: 118 QESVGRLATILFAHRFGTSLEPECKMYRLMADVLNDSAFILDVLSPIFPKPIRIVILSFS 177
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT-GLSIMISKRNPS 312
++ +++ V + S++ + FA+ N+G++ AK + L+G S++IS
Sbjct: 178 SILRSLCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLMGMLAGSLVISWITTP 237
Query: 313 LITTFAL--LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
L T AL L ++ ++ + V++V + TLNR R ++ + VPS++E + E+I
Sbjct: 238 LATWTALIGLLSIHLETNRRAVRAVKMRTLNRQRATLVFHHLQRQQTVPSIKEISSVEHI 297
Query: 371 FSF 373
F +
Sbjct: 298 FEW 300
>gi|358373469|dbj|GAA90067.1| DUF647 domain protein [Aspergillus kawachii IFO 4308]
Length = 527
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 117/238 (49%), Gaps = 6/238 (2%)
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
++ ++ F+ +P G+P SVT Y PY + +L+ F G+ +++ +L VGV
Sbjct: 65 SKSVLDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVGVGNAD 124
Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
A+P A + IL+D +GR+ ++FA R G + + K R A D+ +L ++ + +
Sbjct: 125 ASPTAALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMI 184
Query: 243 P--HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
P + + A V + + V S++ + F++ N+ +V AK I L+G
Sbjct: 185 PAGAPRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGM 244
Query: 301 GLSIMISKRNPSLITTF---ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
+ + R S TT+ +L ++ +Y V+SV + +LNR R ++ + L T
Sbjct: 245 LVGSFVVSRVTSFSTTWICLLMLLALHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNT 302
>gi|167526040|ref|XP_001747354.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774189|gb|EDQ87821.1| predicted protein [Monosiga brevicollis MX1]
Length = 283
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 98/186 (52%), Gaps = 5/186 (2%)
Query: 194 ILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA 252
IL+D G +G + FA QG D + KQ R D+L + +EL + +P FL L C
Sbjct: 4 ILRDATGMIGSISFAWIQGSDLDNNAKQWRLVADVLNDTAMLLELISPLLPAYFLFLVCV 63
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKR 309
A++A+ + V +TR + + A+ N+GDV AK E + N+ LL + + + +
Sbjct: 64 ASLARAIVGVAGGATRAALIQHQARNNNMGDVAAKDGSQETLVNLVALLVNLILVPLVSQ 123
Query: 310 NPSLI-TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQE 368
N ++ + F L +C ++F +Y+ V+S+ + LN R + VE + + + S + N +E
Sbjct: 124 NAQVVWSLFGLSTCCHIFFNYKAVRSLCMTRLNGRRAHLIVEHYTRHQQILSPKIANQRE 183
Query: 369 NIFSFP 374
+F P
Sbjct: 184 PLFFDP 189
>gi|408393145|gb|EKJ72412.1| hypothetical protein FPSE_07436 [Fusarium pseudograminearum CS3096]
Length = 427
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 147/311 (47%), Gaps = 12/311 (3%)
Query: 87 SEDIDGRRWNYVAERDRSGKFKRNSIRA----ISLQSPQAPAEELMSFIRSYVVPDGFPD 142
SE+I + E D+SG F R+ + LQS +A L S +++ P G+P
Sbjct: 19 SENIGHGEPTEIIEYDKSGNFVRSWTHGPDNIVVLQS-KARTSWLPSMHEAFM-PVGYPH 76
Query: 143 SVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRV 202
SV+ Y+ Y + +++ FF + + LL +GV ++ + +LKD RV
Sbjct: 77 SVSSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAISRV 136
Query: 203 GKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAANVAKNVA 260
+ FA+Q G + + D K+ RF DL + +EL + + P+ + + +
Sbjct: 137 ATIAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTGEALRALC 196
Query: 261 AVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR---NPSLITTF 317
V + +++ + FAK +N+ ++ AK L+G + ++ K + +++
Sbjct: 197 GVAAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTIVVKYVEDHNTVVFLM 256
Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLR 377
+L +++ +Y V+ V + LNR R ++ + +LKTG + + E +E+I + W
Sbjct: 257 IVLVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGQIMTPEEVAKRESILLW-WSS 315
Query: 378 DKPVVLGSRFK 388
+ + GS +K
Sbjct: 316 TRIEMAGSYYK 326
>gi|440794597|gb|ELR15757.1| hypothetical protein ACA1_379390, partial [Acanthamoeba castellanii
str. Neff]
Length = 449
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 88/176 (50%), Gaps = 27/176 (15%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
++S +P G+P SV Y+ + W K G+ ++ + +
Sbjct: 72 MLKSLFLPTGYPASVRQDYIHFQKWEVAK-------GIVSSAGFVGHL------------ 112
Query: 190 AINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
G G +G M+ A GK FD + K++R+A D + G +ELAT P FLP
Sbjct: 113 -------VGLGCLGMMLVAGAFGKFFDTETKRIRWAADCIHVAGVALELATPLFPAYFLP 165
Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
LA AN AK +A +T+ +T+ I + FA +N+GD+TAKG G +A L+G G+ I
Sbjct: 166 LASLANSAKGIAGLTTGATKAAINQGFALRDNLGDITAKGHSQGIVAYLMGMGIGI 221
>gi|378729487|gb|EHY55946.1| hypothetical protein HMPREF1120_04055 [Exophiala dermatitidis
NIH/UT8656]
Length = 485
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 152/334 (45%), Gaps = 27/334 (8%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P +VTP Y PY + +L+ F G+ ++ + +GV A+ + +I
Sbjct: 59 LPAGYPSTVTPDYTPYQIYDSLQAFASTIAGLLASRAVFVGMGVGSEEASVLTTMLLYIA 118
Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA-CAA 253
++ GRV ++FA Q ++ + ++K RF D++ ++ ++ + A+P L + C +
Sbjct: 119 QETVGRVATILFAHQFSQRIEAEVKFYRFFADIVNDVAFVLDCMSPAMPILGRVVTLCLS 178
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL-SIMISKRNPS 312
N + V V S++ + FAK NIG++ AK + LLG + +++SK + +
Sbjct: 179 NACRAVCGVAGGSSKAILSSHFAKAGNIGELNAKDGSQETVISLLGMWVGGVVVSKVHGT 238
Query: 313 LITTFAL--LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
L T L L +++++++ V+SV L++ N R V + L+ G + + +E +
Sbjct: 239 LATWSWLVPLLALHLWANWKAVRSVRLNSFNTNRALVFFRAALE-GRTLDMEQVAQEETV 297
Query: 371 FSFP-------WLRDKPVVLGSRFKDAF-----------QDPDAYLAVEPFFEKERYVVT 412
+P L +GS +D Q + F E Y++
Sbjct: 298 LGWPAASRGIYGLEPCYFQVGSSVQDLVHRVYAGADQPSQPASTVGHLVQLFCDEDYLLW 357
Query: 413 YNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHC 446
++ G+ LLKD A K H +L C
Sbjct: 358 FDIETGEALVLLKDTATGITQAKGLCH---MLRC 388
>gi|218187487|gb|EEC69914.1| hypothetical protein OsI_00335 [Oryza sativa Indica Group]
Length = 386
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 102/226 (45%), Gaps = 28/226 (12%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT---PG 187
I+ +V+P GFP SV+ Y+ YM W+ + G T TLL +VGV T
Sbjct: 67 IKDFVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAAS 126
Query: 188 AVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF 246
A AI W+ KDG G G+++ R G FD D K+ R D + G+ EL T P F
Sbjct: 127 AAAIRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYF 186
Query: 247 LPLACAANVAKNV------AAVTSTSTRTPIYKA----FAKGENIGDVTAKGECVGNIAD 296
LPLA N+AK + R P ++ FA+ N+G+V AK D
Sbjct: 187 LPLASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAK--------D 238
Query: 297 LLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
G I P+L T+ + +++ YQ + + T++R
Sbjct: 239 TSG------IKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTVDR 278
>gi|259484769|tpe|CBF81274.1| TPA: DUF647 domain protein (AFU_orthologue; AFUA_6G06850)
[Aspergillus nidulans FGSC A4]
Length = 564
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 111/230 (48%), Gaps = 14/230 (6%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ Y Y + +L+ F G+ ++ +L VGV A+P + + IL
Sbjct: 74 LPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPTSALLLHIL 133
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
+D +GR+ ++FA R G + + K RFA D+ +L ++ + VP F + + A
Sbjct: 134 QDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGFNRVTVLSA 193
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS 312
A V + + V S++ + FA+ N+ +V AK + L+G +++ S
Sbjct: 194 AGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAKDSSQETVISLIG----MLVGSFVVS 249
Query: 313 LITTFALLSCGYVF-------SSYQEVKSVVLHTLNRARFSVAVESFLKT 355
IT+F VF +Y V+SV + TLNR R ++ + L +
Sbjct: 250 HITSFTATWLTLVFLLSMHLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 299
>gi|402582701|gb|EJW76646.1| hypothetical protein WUBG_12446, partial [Wuchereria bancrofti]
Length = 184
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 77/143 (53%), Gaps = 1/143 (0%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
+R +P G+P SV+P Y+ Y W ++ F +T+ +L VGV A+ A
Sbjct: 38 LLRDIFMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAA 97
Query: 190 AINWILKDGAGRVGKMIFARQGKKF-DYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP 248
A+ W+LKDG G + +++FA + D D KQ R D +L ++L T P+LF+P
Sbjct: 98 AVAWLLKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAYCLDLITPIFPNLFMP 157
Query: 249 LACAANVAKNVAAVTSTSTRTPI 271
+ C +++ + V V ++TRT +
Sbjct: 158 IICLSSMIRAVVGVAGSATRTTV 180
>gi|317143185|ref|XP_001819302.2| hypothetical protein AOR_1_114154 [Aspergillus oryzae RIB40]
Length = 510
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 111/226 (49%), Gaps = 6/226 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ YVPY + +L+ F G+ +++ +L VGV A+P A + IL
Sbjct: 76 LPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHIL 135
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
+D +GR+ ++FA R G + + K R A D+ + ++ + +P F + +
Sbjct: 136 QDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLST 195
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS 312
A V + + V S++ + F++ N+ +V AK I L+G + + R S
Sbjct: 196 AGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVGSFVVSRVTS 255
Query: 313 LITTF---ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
T+ +L ++ +Y V+SV + +LNR R ++ + L +
Sbjct: 256 YTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNS 301
>gi|67539168|ref|XP_663358.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
gi|40743657|gb|EAA62847.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
Length = 559
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 114/223 (51%), Gaps = 5/223 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ Y Y + +L+ F G+ ++ +L VGV A+P + + IL
Sbjct: 74 LPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPTSALLLHIL 133
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
+D +GR+ ++FA R G + + K RFA D+ +L ++ + VP F + + A
Sbjct: 134 QDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGFNRVTVLSA 193
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS 312
A V + + V S++ + FA+ N+ +V A+ + I L+G+ + I+ +
Sbjct: 194 AGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAE-TVISLIGMLVGSFVVSHITSFTAT 252
Query: 313 LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
+T LLS ++ +Y V+SV + TLNR R ++ + L +
Sbjct: 253 WLTLVFLLSM-HLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 294
>gi|303278700|ref|XP_003058643.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459803|gb|EEH57098.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 679
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 65/224 (29%), Positives = 107/224 (47%), Gaps = 11/224 (4%)
Query: 207 FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTS 266
FA + D D+K+ RF ++ + + +E+ T P FL LA ANV K+V T+
Sbjct: 306 FACMKRPSDSDVKRFRFTSSVVYDCASLIEMITPFYPSKFLLLATIANVGKSVGITTANV 365
Query: 267 TRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK-----RNPSL-----ITT 316
R PI +FA EN+ ++ AK +AD +G L+++ + NP L +
Sbjct: 366 VRAPIQMSFALEENLAEIAAKTSAQQVLADNIGLALAVVATGWTGKIANPRLRLAVPLLA 425
Query: 317 FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWL 376
FA L+ ++ Y+E+KSV L T+N+ R + F++TG +PS E +F +
Sbjct: 426 FAPLASVDLYCIYRELKSVRLRTINKERGEIIAAGFVETGTIPSAVSVAAAERLFIPARM 485
Query: 377 RDKPVVLG-SRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGK 419
+ + L + +A P+A Y +TY P +GK
Sbjct: 486 DESSLPLKITHLAEACPTPEALRTALGDDPSRPYCLTYLPPRGK 529
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 48/84 (57%), Gaps = 3/84 (3%)
Query: 142 DSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGR 201
++VTP Y Y +R + + VF TQ +L +VG+ R P A A+NW+LKDG GR
Sbjct: 179 NAVTPDYWAYAKFRFAQRVASSCITVFATQQMLQAVGLGATRRLPAAAAVNWVLKDGLGR 238
Query: 202 VGKMIFARQ-GKKFD--YDLKQLR 222
+GK+ A G++F Y L +R
Sbjct: 239 LGKLSVATNFGREFGAFYTLVPIR 262
>gi|320038116|gb|EFW20052.1| hypothetical protein CPSG_03227 [Coccidioides posadasii str.
Silveira]
Length = 478
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 156/340 (45%), Gaps = 26/340 (7%)
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKF 214
+L+ F G+ +++ +L VGV A+P + + IL+D GR+ ++FA Q G
Sbjct: 92 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 151
Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
+ + K R A D+ + ++ + P + + L ++V + + V + S++ +
Sbjct: 152 EPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 211
Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGL-SIMISKRNPSLITTFALLS--CGYVFSSYQ 330
FAK N+ ++ AK + L+G + S+++S L T AL+S ++ ++Y
Sbjct: 212 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLIVHITTNYL 271
Query: 331 EVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVV---- 382
V++V + +LNR R ++ + V + ++ + QE +F W +
Sbjct: 272 AVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGFCQ 331
Query: 383 LGSRFKD----------AFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDD 432
+G FKD A + +A+ FE+E Y++ ++P++ + LK A
Sbjct: 332 IGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYILWFDPARKRGVIGLKTNASPTS 391
Query: 433 ILKAAFHAYVLLHC--INSSNGHRSLALKQHEYDNSNLML 470
LKA HA + + I SN L +Q D S LML
Sbjct: 392 QLKAWSHALLAANSFKIKESNTREPLVREQVGTD-SVLML 430
>gi|46137815|ref|XP_390599.1| hypothetical protein FG10423.1 [Gibberella zeae PH-1]
Length = 425
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 72/310 (23%), Positives = 144/310 (46%), Gaps = 10/310 (3%)
Query: 87 SEDIDGRRWNYVAERDRSGKFKRNSIRA---ISLQSPQAPAEELMSFIRSYVVPDGFPDS 143
SE+I + E D+SG F R+ I + + L S +++ P G+P S
Sbjct: 19 SENIGHGERTEIIEYDKSGNFVRSWTHGPDNIVVLKSKVCTSWLPSMHEAFM-PVGYPHS 77
Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVG 203
V+ Y+ Y + +++ FF + + LL +GV ++ + +LKD RV
Sbjct: 78 VSSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVLKDAISRVA 137
Query: 204 KMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAANVAKNVAA 261
+ FA+Q G + + D K+ RF DL + +EL + + P+ + + +
Sbjct: 138 TIAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTGEALRALCG 197
Query: 262 VTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKR---NPSLITTFA 318
V + +++ + FAK +N+ ++ AK L+G + ++ K + +++
Sbjct: 198 VAAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTLVVKYVEDHNTVVFLMI 257
Query: 319 LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRD 378
+L +++ +Y V+ V + LNR R ++ + +LKTG + + E +E+I + W
Sbjct: 258 VLVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGKIMTPEEVATRESILIW-WSST 316
Query: 379 KPVVLGSRFK 388
+ + GS +K
Sbjct: 317 RIEMAGSYYK 326
>gi|149392593|gb|ABR26099.1| unknown [Oryza sativa Indica Group]
Length = 282
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 125/249 (50%), Gaps = 17/249 (6%)
Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
D+L + G +E+ + P LFL +A N AK +A V + +TR PIY +FAK N+ D+
Sbjct: 1 DVLYDFGTALEVISPLCPQLFLEVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLF 60
Query: 286 AKGECVGNIADLLGTGLSI-----MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTL 340
AKGE + + +++G G I + S LI LLS +++ QE+++ ++TL
Sbjct: 61 AKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAG-PLLSVVHIYGVVQEMRATPVNTL 119
Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQENIFSFP---WLRDKPVVLGSRFKDAFQDPDAY 397
N R ++ V F+K+G V S E +E++ FP V +G + P
Sbjct: 120 NPQRTAMIVADFIKSGKVSSPAELRYREDLL-FPNRLIEEAGSVKIGQPVRRVL-SPQRI 177
Query: 398 LAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILK----AAFHAYVLLHCINSSNGH 453
++ F KE+++++ + Y +L+ A +D L+ AAF + + + S +
Sbjct: 178 EQLKATFSKEKFLLSRKDNSA--YMVLEQSATGEDALRGWLVAAFASEMERSGVGSGDTV 235
Query: 454 RSLALKQHE 462
++A ++ E
Sbjct: 236 LNVAYERME 244
>gi|157869108|ref|XP_001683106.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68223989|emb|CAJ04889.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 678
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 117/268 (43%), Gaps = 22/268 (8%)
Query: 98 VAERDRSGKFKRNSIRAISLQSPQAPA---------EELMSFIRSYVVPDGFPDSVTPSY 148
VAE D F S +P PA +++ + ++PDG+P SVT
Sbjct: 26 VAEVDAQDFFTEPGGLVTSPTTPSTPAGSTSARQARRRVVTTLERILLPDGYPSSVTEDL 85
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+P+ W + +T+ +L VGV ++A + ++W+++DG +G ++FA
Sbjct: 86 LPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADLTSATLSWMMQDGTRMIGSVVFA 145
Query: 209 ---RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP---HLFLPLACAANVAKNVAAV 262
QG + K + D ++ +EL +P F + AA+V K + +V
Sbjct: 146 PLIPQG--LECRAKTWQLVADFTTDIANLLELCAPWLPGGQTTFRVVLVAASVVKALGSV 203
Query: 263 TSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL-----ITTF 317
TR + FA N D+ AK ++ +G L + ++ P+ + F
Sbjct: 204 CRNCTRASFTRHFALRNNAADIAAKASTRSSVGSFIGLVLGVAVAYIVPATSRSLNLVVF 263
Query: 318 ALLSCGYVFSSYQEVKSVVLHTLNRARF 345
AL S ++F+SY+ V+ V L LN R
Sbjct: 264 ALGSVLHIFASYRSVRGVQLKHLNAPRL 291
>gi|303316636|ref|XP_003068320.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
delta SOWgp]
gi|240108001|gb|EER26175.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
delta SOWgp]
Length = 478
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 156/340 (45%), Gaps = 26/340 (7%)
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKF 214
+L+ F G+ +++ +L VGV A+P + + IL+D GR+ ++FA Q G
Sbjct: 92 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 151
Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
+ + K R A D+ + ++ + P + + L ++V + + V + S++ +
Sbjct: 152 EPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 211
Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGL-SIMISKRNPSLITTFALLS--CGYVFSSYQ 330
FAK N+ ++ AK + L+G + S+++S L T AL+S ++ ++Y
Sbjct: 212 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLIVHITTNYL 271
Query: 331 EVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVV---- 382
V++V + +LNR R ++ + V + ++ + QE +F W +
Sbjct: 272 AVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGFCQ 331
Query: 383 LGSRFKD----------AFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDD 432
+G FKD A + +A+ FE+E Y++ ++P++ + LK A
Sbjct: 332 IGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYILWFDPARKRGVIGLKTNASPTS 391
Query: 433 ILKAAFHAYVLLHC--INSSNGHRSLALKQHEYDNSNLML 470
LKA HA + + I SN L +Q D S LML
Sbjct: 392 QLKAWSHALLAANSLKIKESNTREPLVREQVGTD-SVLML 430
>gi|171678219|ref|XP_001904059.1| hypothetical protein [Podospora anserina S mat+]
gi|170937179|emb|CAP61836.1| unnamed protein product [Podospora anserina S mat+]
Length = 448
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 59/241 (24%), Positives = 123/241 (51%), Gaps = 5/241 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P +VT Y+PY T+ +L+ F + ++ +L +GV + A+P I I
Sbjct: 47 LPAGYPHTVTTDYLPYQTYDSLQAFASSITSLLASRAVLEGLGVGSSEASPTGALILKIT 106
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL-FLPLACAA 253
D R+ ++FA R G+ + + K RF D+ + ++L T A+P+L L + +A
Sbjct: 107 GDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDAAQFLDLLTPALPYLPKLGVIVSA 166
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT---GLSIMISKRN 310
V +++ V + +++ + FA N+ ++ AK + LLG L + + +
Sbjct: 167 GVLRSLCGVAANASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLVGSLVVRLVEDK 226
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ +L+ ++ +Y+ V++V + +LNR R ++ +L++G V + + + +E+I
Sbjct: 227 HKVWGLMIILAGCHLAMNYRAVRAVRMTSLNRQRATIVFREWLESGTVLNPAQVSQRESI 286
Query: 371 F 371
Sbjct: 287 L 287
>gi|356510851|ref|XP_003524147.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
Length = 158
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 68/113 (60%)
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LKDG VGK+I + G D + K+ R D L ++G G+E+ + PHLFL +A N
Sbjct: 1 LKDGMQHVGKLICSNWGGTMDSEPKRWRLLADALYDIGTGLEVLSPRCPHLFLEMAGLGN 60
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
AK ++ V + +TR PIY +FAK N+ D+ AKGE + +++G G+ I ++
Sbjct: 61 FAKGMSVVAARATRLPIYSSFAKEGNLSDLLAKGEAFSTLFNVIGIGVGIQLA 113
>gi|393227224|gb|EJD34913.1| DUF647-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 377
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 143/317 (45%), Gaps = 33/317 (10%)
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
+L+ FF G+ + +L +GV AT + I++D GRV ++ A + G
Sbjct: 4 SLQAFFSEIAGLLAARAVLTGLGVGDAGATATGAVLLKIIQDTCGRVATILSAWKLGPML 63
Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
+ + KQ R A DL + G +E + +PH + + L C A V +++ V++ + + +
Sbjct: 64 EPECKQWRLAADLFNDAGIILECLSPTLPHTMKISLLCVARVVRSLCGVSAGGAKAALSQ 123
Query: 274 AF-AKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSL---ITTFALLSCGYVFSSY 329
F G +I D+ AK + L+G + MI R S T L ++++++
Sbjct: 124 HFVGPGGSIADINAKESSQETVVALIGMLVGSMIVPRITSFEGTWTALLFLIGVHLYANW 183
Query: 330 QEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVL------ 383
+ V++V + TLNR R ++ + ++ G V S + +E++F R +VL
Sbjct: 184 RGVRAVAMLTLNRQRATIVLGELIQRGRVLSPEQVRQRESVFC----RSSRLVLDGVTLG 239
Query: 384 ----GSRFKDAF------QDPD----AYLAVEPFFEKERYVV-TYNPSKGKVYAL--LKD 426
G + PD A + F ERY++ KGKV + LK
Sbjct: 240 QCHIGVSLQHLLAQIARVHGPDRSGSALAGLMDRFASERYILHCCCRRKGKVSIIICLKR 299
Query: 427 QAKSDDILKAAFHAYVL 443
+S D+L+A HA VL
Sbjct: 300 GVESRDMLRAWTHALVL 316
>gi|219127360|ref|XP_002183905.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404628|gb|EEC44574.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 891
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 87/387 (22%), Positives = 161/387 (41%), Gaps = 71/387 (18%)
Query: 128 MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPG 187
+ F +P G+P SV Y+ Y + +++ V + +L + GV AT
Sbjct: 431 IRFATELFMPVGYPYSVREGYLSYQIYDSIQGLSSYLRAVLCSAQVLQAAGVGDANATAI 490
Query: 188 AVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL- 245
A+ W LKDG G +G +IF+ D +K+ R D++ ++G +++ VP
Sbjct: 491 MAALTWALKDGLGMIGGLIFSYGVASHLDAYVKEFRLFADVINDVGMLLDMLAPYVPRAD 550
Query: 246 FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM 305
+ ++ AA + K + + + +T++ I + FA N+ D+ AK + +L+G +
Sbjct: 551 LIWISSAATLCKVMCGIAAGATKSSITQHFALQGNMADLNAKEATQETLVNLIGMVCGVT 610
Query: 306 ISKRNPSL---------------------------ITTFALLSCGYVFSSYQEVKSVVLH 338
+++ L F +L+ +++S+++ V + L
Sbjct: 611 LARYLSQLEAGNEEGDTDRIGYRSWIPCISHDSIQWMVFTILTLLHIWSNWKGVSLLRLE 670
Query: 339 TLNRARFSVAV-----------------------------ESFLKT-----GHVPSLREG 364
TLNR R V + E L+T ++P R
Sbjct: 671 TLNRERTKVVLCAAGLWRALVPDCEALRSEKYAELGLLRQEMLLETLQKCIENIP--RPN 728
Query: 365 NLQENIFSFPW--LRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
+++E++ S W L +VL +R KD + A + F E+YVV + +G+
Sbjct: 729 DMRESLASSTWDLLMPTRLVLHARLKDMIA--IDWTATKDFV-NEKYVVVLS-DRGRALV 784
Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINS 449
L A D LKA H ++L C++S
Sbjct: 785 CLLTGAAPMDELKAFMHGLLILKCLDS 811
>gi|356498113|ref|XP_003517898.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
Length = 171
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 69/113 (61%)
Query: 195 LKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
LK+G VGK+I + G D++ K+ R D L ++G G+E+ + PHLFL +A N
Sbjct: 1 LKNGMQHVGKLIGSNWGGTMDFEPKRWRLLADALYDIGTGLEVLSPWCPHLFLEMAGLGN 60
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMIS 307
AK +A V + +TR PIY +FAK N+ D+ AKGE + +++G G+ I ++
Sbjct: 61 FAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLA 113
>gi|156059886|ref|XP_001595866.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980]
gi|154701742|gb|EDO01481.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 477
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 113/449 (25%), Positives = 201/449 (44%), Gaps = 77/449 (17%)
Query: 71 RLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSF 130
+L E + V+ S +LI + Y+ S + R I + P+ P+ + S
Sbjct: 3 KLKEIKEVNQSGKLIAT---------YIQSHAHSTSLDQKDSR-IDIHIPKGPSTWVSSL 52
Query: 131 IRSYVVPDGFPDSVTPSY---VP------YMTWRALKHFFGGAMGVFTTQTLLNSVGVSR 181
+++ P G+PDSVT Y VP Y+ L F G+ ++ +L +GV
Sbjct: 53 TGAFL-PVGYPDSVTEDYTACVPCPIHLLYLLCEKLA-FASTIAGLLASRAVLQGLGVGD 110
Query: 182 NRATP-GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELAT 239
+ A+ GAV +N +L++ AGR+ ++FA R G + + K+ R D+ + ++ +
Sbjct: 111 STASATGAVLLN-VLQESAGRIATILFAHRLGSALEPECKKYRLMADIFNDGAMILDCIS 169
Query: 240 AA---VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 296
A VP +FL A++V K++ V + S++ + FAK N+ ++ AK +
Sbjct: 170 PAFPKVPRVFL--LSASSVCKSLCGVAAGSSKASLSAHFAKQGNLAELNAKDASQETLIS 227
Query: 297 LL----GTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
LL GT + IS + + I +LLS ++ ++Y V+SV + TLNR R ++ + F
Sbjct: 228 LLGMLVGTFVVSRISSQTATWIALISLLSI-HLGTNYLAVRSVTMRTLNRQRANLVISDF 286
Query: 353 L------KTG-HVPSLREGNLQENIFSFPWLRD------KPVVLG--------------- 384
+ KT ++P+ ++ + +E IF RD + VVLG
Sbjct: 287 VSNINAEKTRFNLPTPKDISRKERIFE----RDGAIRHIQGVVLGYCKVGVSLREILSSI 342
Query: 385 ---SRFKDAFQDPDAYLAVEPF--FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFH 439
++++ + + F ++ + YV+ Y+ + +LK+ L A FH
Sbjct: 343 SSSPTVSGSYKEETSSIITSLFKIYKNQGYVMWYSRRQNTFLVILKENTAPSVQLDAWFH 402
Query: 440 AYVLLHCI------NSSNGHRSLALKQHE 462
A L NS+NG LK E
Sbjct: 403 AVWALSVSSSPAVGNSTNGSDQDILKWIE 431
>gi|154292218|ref|XP_001546685.1| hypothetical protein BC1G_14192 [Botryotinia fuckeliana B05.10]
Length = 449
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 97/427 (22%), Positives = 189/427 (44%), Gaps = 84/427 (19%)
Query: 94 RWNYVAERDRSGKFKRNSIRA------------ISLQSPQAPAEELMSFIRSYVVPDGFP 141
R + E ++SGK IR+ I + P+ P+ +MS + +++
Sbjct: 3 RSEEIKEINQSGKLIATYIRSHTTDTSGTKFDRIDVDIPKGPSTWVMSLVGAFM------ 56
Query: 142 DSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGR 201
P++ + G+ ++ +L +GV + A+ + IL++ AGR
Sbjct: 57 ----PAFASTIA------------GLLASRAVLQGLGVGDSTASATHAVLLSILQESAGR 100
Query: 202 VGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAA---VPHLFLPLACAANVAK 257
+ ++FA R G + + K+ R D+ + ++ + A +P +FL A++V K
Sbjct: 101 ISTILFAHRLGSALEPECKKYRLMADIFNDAAMILDCISPAFPKIPRVFL--LSASSVCK 158
Query: 258 NVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLL----GTGLSIMISKRNPSL 313
++ V + S++ + FAK N+ ++ AK + LL GT + IS + +
Sbjct: 159 SLCGVAAGSSKASLSAHFAKMGNLAELNAKDASQETLISLLGMLVGTFVVSKISSQVATW 218
Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH-------VPSLREGNL 366
I ALLS ++ ++Y V+SV + TLNR R ++ + S L + +PS R+ +L
Sbjct: 219 IALLALLSI-HLGTNYMAVRSVTMRTLNRQRANLVISSLLSSPEHEKNKISLPSPRDISL 277
Query: 367 QENIFSFPWLRDKPV-----------VLGSRFKD------------AFQDPDAYL-AVEP 402
QE IF RD + +G +D ++ +P++ L ++
Sbjct: 278 QERIFE----RDGAIRNTHGTILAYCKVGVSLQDLLSSIATPTTAGSYAEPESILTSLLS 333
Query: 403 FFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHE 462
++ + Y++ Y+ S+ +LK++ L A FHA + I S N + K +
Sbjct: 334 IYKTQGYLMWYSRSRNTFLVILKEETAPKVQLDAWFHA---IFAITSGN-QKIQKSKTNH 389
Query: 463 YDNSNLM 469
DN +++
Sbjct: 390 VDNQSIL 396
>gi|392871321|gb|EAS33216.2| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
Length = 407
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/340 (24%), Positives = 155/340 (45%), Gaps = 26/340 (7%)
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKF 214
+L+ F G+ +++ +L VGV A+P + + IL+D GR+ ++FA Q G
Sbjct: 21 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 80
Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
+ + K R A D+ + ++ + P + + L ++V + + V + S++ +
Sbjct: 81 EPECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 140
Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGL-SIMISKRNPSLITTFALLS--CGYVFSSYQ 330
FAK N+ ++ AK + L+G + S+++S L T AL+S ++ ++Y
Sbjct: 141 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLIVHITTNYL 200
Query: 331 EVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVV---- 382
V++V + +LNR R ++ + V + ++ + QE +F W +
Sbjct: 201 AVRAVSMTSLNRQRANIVFSTLFDENRVLTAKQASKQERVFERDGVLRWKASSDTLGFCQ 260
Query: 383 LGSRFKDAFQDPDA----------YLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDD 432
+G FKD + +A+ FE+E Y++ ++P++ + LK A
Sbjct: 261 IGVPFKDVIDHVSSSSPAERTEATIVAILRVFEQEEYILWFDPARKRGVIGLKTNASPTS 320
Query: 433 ILKAAFHAYVLLHC--INSSNGHRSLALKQHEYDNSNLML 470
LKA HA + + I SN L +Q D S LML
Sbjct: 321 QLKAWSHALLAANSLKIKESNTRGPLVREQVGTD-SVLML 359
>gi|327294507|ref|XP_003231949.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
gi|326465894|gb|EGD91347.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
Length = 483
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 88/384 (22%), Positives = 169/384 (44%), Gaps = 41/384 (10%)
Query: 120 PQAPAE-ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
P+ P+ ++ I + +P G+P SVT Y+ Y + +L+ F G+ +++ +L VG
Sbjct: 46 PKLPSSTSYLTSILNVFLPAGYPHSVTDDYIEYQIYDSLQAFSSSIAGLLSSRAVLQGVG 105
Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
V A+P A + +L++ GR+ ++FA R G + + K R A D+L + ++
Sbjct: 106 VGDATASPTAALLLNVLQESMGRIATILFAHRLGTSLEPECKLYRLAADILNDSAMVLDC 165
Query: 238 ATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 296
+ P + + L ++V + + V + S++ + FA+ N+G++ AK +
Sbjct: 166 LSPIFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVIS 225
Query: 297 LLGTGL-SIMISKRNPSLITTFAL--LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
L G + S+++S L T AL L ++ +++ V++V + TLNR R ++
Sbjct: 226 LAGMLVGSVVVSYITSPLETWIALIILLIVHLGTNHAAVRAVKMTTLNRQRANIVFSYLF 285
Query: 354 KTGHVPSLREGNLQENIFS----FPWLRDKPVVLGS-----------RFKDAFQDPDAYL 398
+ V + E + +E IF W + +LG+ D L
Sbjct: 286 EDDRVLTPAEASEEERIFERDGVLRWNAESSGILGTCQIGTSLEQLLLLLPEQADRGGAL 345
Query: 399 -----------------AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
A+ F +E Y++ ++P + +LK A + LKA HA
Sbjct: 346 ETGQTSSTTMNTSTNLTALLELFREEEYILYFDPLSRRGAIVLKAGATAKAQLKAWSHAL 405
Query: 442 VLLHCINSSNGHRSLALKQHEYDN 465
++ + N S A +HE +
Sbjct: 406 LVSRHLAGKN---STATCRHEKQD 426
>gi|402074762|gb|EJT70271.1| hypothetical protein GGTG_12444 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 438
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 28/302 (9%)
Query: 78 VDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVP 137
VD + ++C+ D G V E D S K ++ + P A A I +P
Sbjct: 8 VDQTGNVLCTYDQSGA----VGEDDSSLKTEKTHV-------PLAKA------IFYAFLP 50
Query: 138 DGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD 197
GFP SVT Y+ Y T+ +L+ F + + +L +GV +P A + IL+D
Sbjct: 51 AGFPHSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGNASQSPTAALVLQILQD 110
Query: 198 GAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP---LACAA 253
R+ ++FA R G + + K RF DL + A L FLP + +A
Sbjct: 111 TFSRLATILFAHRMGTAIEPECKTYRFMADLFND--ASQLLDLLVPVLPFLPKVSVMVSA 168
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKR 309
+ +++ V + +++ + FAK N+ ++ AK V + L+GT + M+ +
Sbjct: 169 AILRSLCGVAAGASKASLSAHFAKMGNLAELNAKEASQETVVSLVGMLVGTAVVHMVEDK 228
Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
L++ ++ +Y+ V+ V + TLNR R ++ FL+TG V + + +E+
Sbjct: 229 RAVWCWMIVLIAI-HLMMNYRAVRCVKMLTLNRQRATIVFREFLETGKVLTPGQVAARES 287
Query: 370 IF 371
I
Sbjct: 288 IL 289
>gi|164423593|ref|XP_001728067.1| hypothetical protein NCU11380 [Neurospora crassa OR74A]
gi|157070159|gb|EDO64976.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 357
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 126/265 (47%), Gaps = 12/265 (4%)
Query: 116 SLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLN 175
SL++ Q L + + +++ P G+P SVTP Y+PY T+ +L+ FF + + +L
Sbjct: 25 SLKAHQTKVAPLHALLNAFL-PAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLE 83
Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDL------L 228
+GV ++P A I I D R ++FA R G + + K RF D+ L
Sbjct: 84 GLGVGDASSSPTAALILKITSDTISRFATILFAHRLGTAIEPECKFFRFLADVLNDAAQL 143
Query: 229 MELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFA-KGENIGDVTAK 287
++L V + L L + + V ++ V + +++ + F + ++G++ AK
Sbjct: 144 LDLLMPVMMMGETRYGLKLGVVVGSGVLRSWCGVAAGASKASLSAHFVTQRSSLGELNAK 203
Query: 288 GECVGNIADLLGTGLSIMISKRNPSLITTFAL---LSCGYVFSSYQEVKSVVLHTLNRAR 344
+ LLG + ++ K + + L L ++ +Y+ V+ V L TLNR R
Sbjct: 204 EASQETVVSLLGMLVGSLVVKVVQDRLVVWVLMVGLLGVHLLMNYRGVRCVRLRTLNRQR 263
Query: 345 FSVAVESFLKTGHVPSLREGNLQEN 369
++ V S+L+TG V E L+ +
Sbjct: 264 ATMVVRSWLETGRVLGPEEDGLESH 288
>gi|346318934|gb|EGX88536.1| DUF647 domain-containing protein [Cordyceps militaris CM01]
Length = 467
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 143/295 (48%), Gaps = 27/295 (9%)
Query: 98 VAERDRSGKFKRNSI-----RAISLQSPQAPAEELMS--------FIRSYVVPDGFPDSV 144
V+E D+SG ++ + R + ++ P+ +L S + +P GFP SV
Sbjct: 26 VSELDKSGHVQQKWLYEPSGRIVHIR-PKGHTNDLFSRSLREWKKILSDSFLPVGFPHSV 84
Query: 145 TPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGK 204
+ Y+ Y T+ +L+ FF + + LL +GV ++ + IL+D RV
Sbjct: 85 SSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSSATFALLLTILRDATSRVAT 144
Query: 205 MIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVA-----KN 258
+ FA R G + + D K+ RF DL + +EL + P+L P A ++ +
Sbjct: 145 IAFAHRFGLRIEPDAKRYRFLADLFNDSAFFLELYS---PYLS-PWGKVAALSIGEALRA 200
Query: 259 VAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGTGLSIMISKRNPSLIT 315
+ V++ +++ + FAK +N+ ++ AK E + LL + + + + + S++
Sbjct: 201 LCGVSAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVGTVVVKVVEDHRSVVF 260
Query: 316 TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
LL G+++ +Y V SV + TLNR R ++ + +L+TG V S E + +E I
Sbjct: 261 LMILLVFGHLWMNYLGVCSVEMTTLNRQRATILFQEYLQTGKVLSPAEVSKREYI 315
>gi|310789554|gb|EFQ25087.1| hypothetical protein GLRG_00231 [Glomerella graminicola M1.001]
Length = 484
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 104/438 (23%), Positives = 191/438 (43%), Gaps = 66/438 (15%)
Query: 98 VAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIR---------SYVVPDGFPDSVTPSY 148
+AE D++G K + +I + P +++ +R +PDG+P SV+ Y
Sbjct: 25 IAEYDKAGSLKAKYVESIPIIGH--PRVDVLLPVRRKKIWLQLLEIFLPDGYPHSVSDDY 82
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
Y + +++ F G G+ +++ + +GV + A+P + ++++ GR + FA
Sbjct: 83 AAYQIYDSIQAFAGSIAGMISSRAVWEGLGVGDSLASPTGAMLIQVIRESTGRFATITFA 142
Query: 209 RQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH--LFLPLACAANVAKNVAAVTST 265
G + + K R A D+L ++ ++ + P FL L C +++ + + V
Sbjct: 143 HLFGTSIEAECKAYRLASDILCDVAMVLDCLSPFFPRSIRFLVL-CFSSLLYSASGVAGN 201
Query: 266 STRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPSLITTFALLS-- 321
++++ + FAK N+G++ AK L+G TG + M+S T AL+S
Sbjct: 202 ASKSSLSGHFAKWNNLGELNAKDASQETAISLMGMITG-TFMVSLLTGPKATWVALISLL 260
Query: 322 CGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV-----PSLREGNLQENIFS-FPW 375
++F +++ V++V +LNR R ++A + V S RE +E S F W
Sbjct: 261 ALHLFLNWKGVRAVKSRSLNRQRANIAFSALFSKDQVLTPVIVSEREFIFEERWGSVFRW 320
Query: 376 LRDKPVVLG-SRFKDAFQDPDAYLA-----VEPF-------------FEKERYVVTYNPS 416
K VLG F + Q+ LA ++ F +++E YV+ S
Sbjct: 321 NSGK--VLGHCEFGVSLQEILQSLASGQERMKTFHLSTLELANLLNIYDEEEYVLWCEAS 378
Query: 417 KG---------KVYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSN 467
+ KV +LKD LKA FH +L + +G + L + D
Sbjct: 379 ETVYCDAASRVKVMVVLKDGVTPKSQLKAWFHGLLLTRRL---SGQKESELLRAGMDQQA 435
Query: 468 LMLTTADLEAHIGESCKV 485
++ +HI ES +V
Sbjct: 436 ML-------SHIQESLRV 446
>gi|449690512|ref|XP_002160836.2| PREDICTED: UPF0420 protein-like, partial [Hydra magnipapillata]
Length = 172
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 70/123 (56%), Gaps = 1/123 (0%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFP+SV+ Y+ Y W L+ F G+ TQ +L + GV N AT A I W+L
Sbjct: 50 LPQGFPESVSEDYLRYQLWDTLQAFCSSITGMLATQAMLKAYGVGDNTATVTAATITWML 109
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KDGAG +G+++FA ++G + D D K+ R D+L ++L F+ +AC ++
Sbjct: 110 KDGAGMIGRILFAWKKGTQLDCDAKRWRLVADVLNNCAIMLDLVAPLFKQYFVLIACLSS 169
Query: 255 VAK 257
VA+
Sbjct: 170 VAR 172
>gi|238488016|ref|XP_002375246.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
gi|220700125|gb|EED56464.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
Length = 513
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 113/229 (49%), Gaps = 9/229 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ YVPY + +L+ F G+ +++ +L VGV A+P A + IL
Sbjct: 76 LPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHIL 135
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
+D +GR+ ++FA R G + + K R A D+ + ++ + +P F + +
Sbjct: 136 QDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLST 195
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL----SIMISK 308
A V + + V S++ + F++ N+ +V AK I L+G + S ++S+
Sbjct: 196 AGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVRFVGSFVVSR 255
Query: 309 RNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
T + +L ++ +Y V+SV + +LNR R ++ + L +
Sbjct: 256 VTSYTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNS 304
>gi|412993086|emb|CCO16619.1| predicted protein [Bathycoccus prasinos]
Length = 454
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 72/255 (28%), Positives = 121/255 (47%), Gaps = 27/255 (10%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR---ATPGAVAIN 192
+PDGFP++V+ Y+P++ W+ L FF + V + Q+LL +VG+ N+ A P A A
Sbjct: 41 LPDGFPNTVSRDYLPWLKWQMLSLFFRDVLEVLSAQSLLVAVGMDVNQANAAAPVAGAAK 100
Query: 193 WILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC 251
W+LKDG G + + A KKFD + K + L ++ +EL T P FL +A
Sbjct: 101 WVLKDGTGGLATLALGALNTKKFDENPKFFWSTANSLEDVSRAMELLTPLFPSHFLLIAG 160
Query: 252 AANVAKNVAAVTSTSTRTPIYKA-FAKGENIGDVTAKGECVGNIADL--LGTGLSIMISK 308
A ++ A +S + F++ EN GD+ K E G + L G+++ +
Sbjct: 161 TATFVRSAALTGRSSLINGTFMTHFSRNENAGDIRTKLEVQGRWLQIAALPVGIALFRAV 220
Query: 309 RN--------PSL----------ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVE 350
N SL + T+ + + F Y KS+ TL+R R + +
Sbjct: 221 TNGLELDFATNSLGQEMKGVTVGVATYGSMLFAHSFCCYMAAKSLNFETLSRKRALIQAK 280
Query: 351 SFLKTG--HVPSLRE 363
+F+++ +V S+ E
Sbjct: 281 TFVESNGKNVLSVEE 295
>gi|296473307|tpg|DAA15422.1| TPA: hypothetical protein LOC767862 [Bos taurus]
Length = 232
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 71/135 (52%), Gaps = 1/135 (0%)
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
A S + + +P GFPDSV+P Y+PY W +++ F G T +L +GV +
Sbjct: 60 ASPPFSRLLAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAK 119
Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
A+ A W++KD G +G+++FA G K D + KQ R D+L ++ +E+ +
Sbjct: 120 ASVSAATATWLVKDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPIL 179
Query: 243 PHLFLPLACAANVAK 257
P F C +N+AK
Sbjct: 180 PFCFTITVCISNLAK 194
>gi|428168470|gb|EKX37414.1| hypothetical protein GUITHDRAFT_89666 [Guillardia theta CCMP2712]
Length = 331
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 83/163 (50%), Gaps = 1/163 (0%)
Query: 126 ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
+L + + +P G+P S+ Y Y + L+ G+ +TQ +L +GV + T
Sbjct: 121 KLKRIMYATFLPSGYPASLPDEYTKYQAFNLLQDLSSNLRGILSTQKILEGMGVGKAGVT 180
Query: 186 PGAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
A + W+ +DG+ +G ++F A F D+K+ R DL+ ++ +++ A P
Sbjct: 181 SLAATMQWMARDGSSMLGGLLFTAFASSSFGIDIKRWRLFADLINDVALLLDMLAPAYPD 240
Query: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 287
LFLP+ C ++V K + V + + T I + +A NI DV AK
Sbjct: 241 LFLPIICLSSVFKAMCGVAAGACNTAIVQHWAARGNIADVGAK 283
>gi|116004169|ref|NP_001070441.1| UPF0420 protein C16orf58 homolog [Bos taurus]
gi|61554841|gb|AAX46623.1| hypothetical protein FLJ13868 [Bos taurus]
Length = 232
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 67/123 (54%), Gaps = 1/123 (0%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFPDSV+P Y+PY W +++ F G T +L +GV +A+ A W++
Sbjct: 72 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 131
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KD G +G+++FA G K D + KQ R D+L ++ +E+ +P F C +N
Sbjct: 132 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCISN 191
Query: 255 VAK 257
+AK
Sbjct: 192 LAK 194
>gi|116205700|ref|XP_001228659.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
gi|88182740|gb|EAQ90208.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
Length = 448
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/241 (24%), Positives = 120/241 (49%), Gaps = 5/241 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SVT Y+PY T+ +L+ F + + +L +GV + ++P I I
Sbjct: 45 LPAGYPHSVTDDYLPYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 104
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL-FLPLACAA 253
D R+ ++FA R G+ + + K RF D+ + ++L T A+P+ L + +A
Sbjct: 105 GDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKLGVIVSA 164
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK--RNP 311
V +++ V + +++ + FA N+ ++ AK + LLG + ++ K +
Sbjct: 165 GVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVGSLVVKMVEDK 224
Query: 312 SLITTFALLSCG-YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
++ ++ G ++ +Y V++V + +LNR R ++ +L G V + + + +E+I
Sbjct: 225 QVVWMLMVVLAGVHLAMNYHAVRAVKMRSLNRQRATLVFREWLDRGTVLTPEQVSQRESI 284
Query: 371 F 371
Sbjct: 285 L 285
>gi|340516831|gb|EGR47078.1| predicted protein [Trichoderma reesei QM6a]
Length = 448
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/321 (26%), Positives = 152/321 (47%), Gaps = 17/321 (5%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFP SV+ Y+ Y + +L+ FF + + LL +GV ++ + IL
Sbjct: 75 LPVGFPHSVSKDYLSYQFFDSLQAFFSTISSLLANRALLQGLGVGDASSSATFALLLTIL 134
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
KD R+ ++FA R G + + D K+ RF DL + +EL + + +A +
Sbjct: 135 KDAMSRIATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLGSWGKVVALSTG 194
Query: 255 VA-KNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKR 309
A + + V + +++ + FA+ +N+ ++ AK VG I L+GT L + +
Sbjct: 195 EALRALCGVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVGT-LVVKTVEN 253
Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLRE-GNLQE 368
+ +++ +L ++ +Y V+SV ++T NR R ++ E +LK+G V S + N +
Sbjct: 254 HSAVVLLMIVLVLAHLGMNYLGVRSVCMNTFNRQRATILFEEYLKSGKVLSPEQVANRES 313
Query: 369 NIFSFPWLRD---KPVV---LGSRFKDAFQDPDAY--LAVEPFFEKERYVVTYNPSK-GK 419
IF P +R+ K VV + A DA+ L + R++ + P + +
Sbjct: 314 IIFWSPAIRNAQGKEVVHLEFAESYSHAMSCSDAHEKLIIVDGAMHTRFICSRVPGRLTR 373
Query: 420 VYALLKDQAKSDDILKAAFHA 440
+ LL D A D + A F A
Sbjct: 374 IKLLLWDGASPMDAVLAWFAA 394
>gi|453080329|gb|EMF08380.1| DUF647-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 464
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 65/241 (26%), Positives = 119/241 (49%), Gaps = 6/241 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFP SVT Y+ Y + +L+ F + +++ +L VGV A + IL
Sbjct: 59 LPVGFPQSVTDDYLEYQIYDSLQAFSSSIAAMLSSRAVLEGVGVGDATANATTALLLSIL 118
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAA 253
+D GR+ ++FA R G + + K R D+L + G ++ + P + + +
Sbjct: 119 QDSVGRITTIMFAHRFGMSLEPECKMYRLLADILNDTGFVLDCLSPIFPKPTRVVILSFS 178
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRN 310
++ +++ V + S + + FAK N+G++ AK E V ++A +L I K
Sbjct: 179 SILRSLCGVAAGSAKASLSAHFAKQGNLGELNAKDSSQETVISLAGMLFGSWVITWVKTP 238
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ T LL ++ ++ + V++V + TLNR R ++ + L+ GH+P+ E QE I
Sbjct: 239 VATWTALILLLSIHLETNRRAVRAVKMQTLNRQRATL-LYHHLRKGHLPTPIEVADQELI 297
Query: 371 F 371
F
Sbjct: 298 F 298
>gi|119187997|ref|XP_001244605.1| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
Length = 472
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 88/382 (23%), Positives = 163/382 (42%), Gaps = 62/382 (16%)
Query: 136 VPDGFPDSVTPSYVPY----------MTWR-------------ALKHFFGGAMGVFTTQT 172
+P G+P SV+ Y+ Y + W +L+ F G+ +++
Sbjct: 58 LPVGYPHSVSGDYLEYQVYVRPIFLKLHWSFTDVANNRLFVQDSLQAFSSSIAGLLSSRA 117
Query: 173 LLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMEL 231
+L VGV A+P + + IL+D GR+ ++FA Q G + + K R A D+ +
Sbjct: 118 VLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSLEPECKMYRLAADIFNDS 177
Query: 232 GAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGEC 290
++ + P + + L ++V + + V + S++ + FAK N+ ++ A G
Sbjct: 178 SMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSAHFAKWGNLAELNAVG-- 235
Query: 291 VGNIADLLGTGLSIMISKRNPSLITTFALLS--CGYVFSSYQEVKSVVLHTLNRARFSVA 348
S+++S L T AL+S ++ ++Y V++V + +LNR R ++
Sbjct: 236 ------------SLVVSYITSPLATWVALISLLIVHITTNYLAVRAVSMTSLNRQRANIV 283
Query: 349 VESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVV----LGSRFKDAFQDPDA---- 396
+ V + ++ + QE +F W + +G FKD +
Sbjct: 284 FSTLFDENRVLTAKQASKQERVFERDGVLRWKASSDTLGFCQIGVPFKDVIDHVSSSSPA 343
Query: 397 ------YLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHC--IN 448
+A+ FE+E Y++ ++P++ + LK A LKA HA + + I
Sbjct: 344 ERTEATIVAILRVFEQEEYILWFDPARKRGVIGLKTNASPTSQLKAWSHALLAANSLKIK 403
Query: 449 SSNGHRSLALKQHEYDNSNLML 470
SN L +Q D S LML
Sbjct: 404 ESNTRGPLVREQVGTD-SVLML 424
>gi|346978906|gb|EGY22358.1| hypothetical protein VDAG_03796 [Verticillium dahliae VdLs.17]
Length = 455
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 63/258 (24%), Positives = 120/258 (46%), Gaps = 8/258 (3%)
Query: 129 SFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGA 188
F+R +P +PDSV+P Y+ Y + + + FF G+ + +L +GV +
Sbjct: 75 QFLRDAFLPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNH 134
Query: 189 VAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF- 246
+ LKDG R+ + FA R G + + K+ RF DL + +ELA
Sbjct: 135 ALLLTCLKDGVSRLATITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELANPFFAGWAK 194
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGL 302
+ + +A + + + + +++ + FA+ N+ ++ K +G I L GT +
Sbjct: 195 ILILVSAESLRALCGIAAGASKAALSTHFARRNNLAELNTKEASQETAIGLIGLLAGTAI 254
Query: 303 SIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ R T L+ ++ +Y V ++ L T+NR R ++ ++ +L+T V +
Sbjct: 255 IHYVQNRGAVFFLTIFLVFI-HLGMNYLGVCALQLDTVNRQRATIVMQHYLRTQEVLTPA 313
Query: 363 EGNLQEN-IFSFPWLRDK 379
E + +EN IF P + D+
Sbjct: 314 EASKKENIIFWRPHIADE 331
>gi|358380832|gb|EHK18509.1| hypothetical protein TRIVIDRAFT_14438, partial [Trichoderma virens
Gv29-8]
Length = 451
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/332 (25%), Positives = 154/332 (46%), Gaps = 19/332 (5%)
Query: 124 AEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNR 183
A E + +P GFP SV+ Y+ Y + +L+ FF + + LL +GV
Sbjct: 63 AREWRKVLFDSFLPVGFPHSVSKDYLAYQMFDSLQAFFSTISSLLANRALLQGLGVGDAS 122
Query: 184 ATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV 242
++ + ILKD R+ ++FA R G + + D K+ RF D+ + +EL + +
Sbjct: 123 SSATFALLLTILKDAMSRIATIVFAHRFGLRIEPDAKRYRFLADIFNDSAFFLELYSPYL 182
Query: 243 PHLFLPLACAANVA-KNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADL 297
+A + A + + V + +++ + FA+ +N+ ++ AK VG I L
Sbjct: 183 GSWGKVIALSTGEALRALCGVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLL 242
Query: 298 LGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
+GT L + + + S++ LL ++ +Y V+SV + TLNR R ++ +LK
Sbjct: 243 VGT-LVVKTIEDHSSVVILMILLVFAHLGMNYLGVRSVCMTTLNRQRATILFSEYLKNRV 301
Query: 358 VPSLREGNLQEN-IFSFPWLRD---KPVVLGSRFKDAF------QDPDAYLAVEPFFEKE 407
V + + +E+ IF P +R+ K VV F +++ DP L + +
Sbjct: 302 VLTPEQVAERESIIFWSPIMRNIRGKKVVY-LEFAESYGHAMSCSDPHDQLIIVDGIKHT 360
Query: 408 RYVVTYNPSK-GKVYALLKDQAKSDDILKAAF 438
R++ + P + LL + A+ D ++A F
Sbjct: 361 RFICSRVPKYLTPIKMLLWEGAQPIDAIRAWF 392
>gi|347842141|emb|CCD56713.1| hypothetical protein [Botryotinia fuckeliana]
Length = 376
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 156/333 (46%), Gaps = 50/333 (15%)
Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAG 234
S+GV + A+ + IL++ AGR+ ++FA R G + + K+ R D+ +
Sbjct: 2 SLGVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPECKKYRLMADIFNDAAMI 61
Query: 235 VELATAA---VPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECV 291
++ + A +P +FL A++V K++ V + S++ + FAK N+ ++ AK
Sbjct: 62 LDCISPAFPKIPRVFL--LSASSVCKSLCGVAAGSSKASLSAHFAKMGNLAELNAKDASQ 119
Query: 292 GNIADLL----GTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSV 347
+ LL GT + IS + + I ALLS ++ ++Y V+SV + TLNR R ++
Sbjct: 120 ETLISLLGMLVGTFVVSKISSQVATWIALLALLSI-HLGTNYMAVRSVTMRTLNRQRANL 178
Query: 348 AVESFLKTGH-------VPSLREGNLQENIFSFPWLRDKPV-----------VLGSRFKD 389
+ S L + +PS R+ +LQE IF RD + +G +D
Sbjct: 179 VISSLLSSPEHEKNKISLPSPRDISLQERIFE----RDGAIRNTHGTILAYCKVGVSLQD 234
Query: 390 ------------AFQDPDAYL-AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKA 436
++ +P++ L ++ ++ + Y++ Y+ S+ +LK++ L A
Sbjct: 235 LLSSIATPTTAGSYAEPESILTSLLSIYKTQGYLMWYSRSRNTFLVILKEETAPKFQLDA 294
Query: 437 AFHAYVLLHCINSSNGHRSLALKQHEYDNSNLM 469
FHA + I S N + K + DN +++
Sbjct: 295 WFHA---IFAITSGN-QKIQKSKTNHVDNQSIL 323
>gi|422292654|gb|EKU19956.1| hypothetical protein NGA_2057300, partial [Nannochloropsis gaditana
CCMP526]
Length = 274
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 76/156 (48%), Gaps = 5/156 (3%)
Query: 108 KRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGV 167
K++ +R+I + + L + + + +P GFP SV P Y+ Y W ++ G+
Sbjct: 119 KQSEVRSIVRNAKR----RLKTLLHTVFLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGI 174
Query: 168 FTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGD 226
TQ +L VGV R ATP A + WI +DGA + ++F F + K R D
Sbjct: 175 LATQAILEGVGVGREGATPLAATLQWITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFAD 234
Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAV 262
++++G +++ P FL C A+V K++ +
Sbjct: 235 FMVDIGITLDMLAPLFPKNFLLCICLASVCKSLCGI 270
>gi|387198702|gb|AFJ68865.1| hypothetical protein NGATSA_2057300, partial [Nannochloropsis
gaditana CCMP526]
Length = 277
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 76/156 (48%), Gaps = 5/156 (3%)
Query: 108 KRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGV 167
K++ +R+I + + L + + + +P GFP SV P Y+ Y W ++ G+
Sbjct: 122 KQSEVRSIVRNAKR----RLKTLLHTVFLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGI 177
Query: 168 FTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGD 226
TQ +L VGV R ATP A + WI +DGA + ++F F + K R D
Sbjct: 178 LATQAILEGVGVGREGATPLAATLQWITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFAD 237
Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAV 262
++++G +++ P FL C A+V K++ +
Sbjct: 238 FMVDIGITLDMLAPLFPKNFLLCICLASVCKSLCGI 273
>gi|397627073|gb|EJK68340.1| hypothetical protein THAOC_10484 [Thalassiosira oceanica]
Length = 478
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 103/220 (46%), Gaps = 14/220 (6%)
Query: 148 YVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIF 207
Y+ Y+ L+ V TQ +L VGV R AT A ++++++DG G + ++
Sbjct: 93 YLEYVCMDNLQDLTTALRSVLATQRVLEGVGVGREGATALAATLDFLIRDGCGMIASLVL 152
Query: 208 ARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVP-HLFLPLACAANVAKNVAAVTST 265
F D+K+ + D ++ G +E+ ++P FLP+ C ANV K + V +
Sbjct: 153 TSGFSSTFRRDIKRWKLFADAAVDAGITLEVVAPSLPSRWFLPVLCGANVCKALCGVAAG 212
Query: 266 STRTPIYKAFAKGENIGD-----VTAKGECVGNIADLLGTGLSIMISK-------RNPSL 313
+ P+ +A +GD V AK I G LS + ++ P
Sbjct: 213 AVCGPVQLHWAARLGLGDEGLSEVAAKAGAQRTIVGATGLVLSGLFARWMGGRGANRPVW 272
Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL 353
+ +A+L+ ++ ++Y+ ++ V L+ LNR R + +E FL
Sbjct: 273 VGMYAVLTALHLLANYRSMRIVRLNWLNRERMDLILEDFL 312
>gi|367028787|ref|XP_003663677.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
42464]
gi|347010947|gb|AEO58432.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
42464]
Length = 448
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 121/250 (48%), Gaps = 5/250 (2%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L+ I +P G+P +VT Y+ Y T+ +L+ F + + +L +GV + ++P
Sbjct: 36 LLKAILYAFLPAGYPHTVTGDYLAYQTFDSLQAFASSITSLLANRAVLEGLGVGDSSSSP 95
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
I I D R+ ++FA R G+ + + K RF D+ + ++L T A+P+L
Sbjct: 96 TGALILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYL 155
Query: 246 -FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT---G 301
L + +A V +++ V + +++ + FA N+ ++ AK + LLG
Sbjct: 156 PKLGVIVSAGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVGS 215
Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSL 361
L + + + + +L+ ++ +Y+ V+SV + +LNR R ++ +L G V +
Sbjct: 216 LVVRMVEDKQVVWMLMVILAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGVVLTP 275
Query: 362 REGNLQENIF 371
+ +E+I
Sbjct: 276 EQVAQRESIL 285
>gi|389639940|ref|XP_003717603.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
gi|351643422|gb|EHA51284.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
Length = 479
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/288 (22%), Positives = 133/288 (46%), Gaps = 9/288 (3%)
Query: 89 DIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSY 148
D+D + N + D + NS ++ ++ P L+ I +P GFP +VT Y
Sbjct: 7 DVD-QTGNVLCSYDHGDRIDDNSDGSMKIEKTHVP---LIKAIFYAFLPAGFPHTVTEDY 62
Query: 149 VPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA 208
+ Y + +L+ F + + +L +GV +P A + I++D R+ ++FA
Sbjct: 63 LHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLKIIQDTISRLATIMFA 122
Query: 209 -RQGKKFDYDLKQLRFAGDLLMELGA-GVELATAAVPHLFLPLACAANVAKNVAAVTSTS 266
R G + + K RF D+ + L + + + +++ V + +
Sbjct: 123 HRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLPKVSVLVTSGCLRSLCGVAAGA 182
Query: 267 TRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPSLITTF-ALLSCG 323
++ + FAK N+ ++ AK + L+G TG +++ + + + + +L C
Sbjct: 183 SKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTAVVHMVEDKTAVWCWMVVLICM 242
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
++ +Y+ V++V + TLNR R ++ FL+TG V + + + +E+I
Sbjct: 243 HLMMNYRAVRNVRMLTLNRQRATIVFREFLETGKVLTPAQCSARESIL 290
>gi|320593995|gb|EFX06398.1| duf647 domain containing protein [Grosmannia clavigera kw1407]
Length = 458
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 58/243 (23%), Positives = 123/243 (50%), Gaps = 9/243 (3%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SVT Y+ Y T+ +L+ F + + +L +GV +P A + I+
Sbjct: 48 LPAGYPQSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLKII 107
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC--- 251
+D R+ ++FA R G+ + + K RF DL + ++L T +P P C
Sbjct: 108 QDTFSRMATILFAHRMGQAIEPECKTYRFMADLFNDSALMLDLLTPVLP--LYPKVCIMV 165
Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIM-ISK 308
A+++ +++ + +++++ + FA+ N+ ++ AK + LLG G +++ + +
Sbjct: 166 ASSILRSLCGIAASASKASLSAHFARTGNLAELNAKEASQETVISLLGMLAGTAVVHVVQ 225
Query: 309 RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQE 368
+++ +L ++ +Y+ V+ V + TLNR R ++ +L+TG + S E +E
Sbjct: 226 DKTAVLYWMVMLLMVHLVMNYRGVRCVKMLTLNRQRATIVFREYLETGKILSPGEVAERE 285
Query: 369 NIF 371
++
Sbjct: 286 SVL 288
>gi|340967040|gb|EGS22547.1| diphosphomevalonate decarboxylase-like protein [Chaetomium
thermophilum var. thermophilum DSM 1495]
Length = 425
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 112/228 (49%), Gaps = 5/228 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFP +VT Y+ Y + +L+ F + + +L +GV + ++P I I
Sbjct: 45 LPAGFPHTVTEDYLSYQLYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 104
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL-FLPLACAA 253
D R+ ++FA R G+ + + K RF D+ + ++L T A+P+ L + +A
Sbjct: 105 GDTISRIATILFAHRMGRAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKLSVIVSA 164
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGTGLSIMISKRN 310
V +++ V + +++ + FA N+ ++ AK E V ++ +L L + + +
Sbjct: 165 GVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLAGSLVVRLVEDK 224
Query: 311 PSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
+ LL+ ++ +Y+ V+SV + +LNR R ++ +L G V
Sbjct: 225 QVVWFLMILLAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGTV 272
>gi|428170130|gb|EKX39058.1| hypothetical protein GUITHDRAFT_114933 [Guillardia theta CCMP2712]
Length = 405
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 106/245 (43%), Gaps = 57/245 (23%)
Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
P V+ Y + WR L+ + VF TQ+LL ++GV N+ A A W+ +
Sbjct: 99 LPRGVSEDYYSFTIWRILQRIVSSTVSVFGTQSLLLALGVKTNKVGVAA-ASGWVFANAF 157
Query: 200 GRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
G+ GKMIF AR G+ FD D K+ R
Sbjct: 158 GKFGKMIFAARWGRDFDSDAKRWR------------------------------------ 181
Query: 259 VAAVTSTSTRTPIYKAFA-KGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTF 317
++TR YK FA + EN+GD+TAKG+ +ADL+G + I +++ L T
Sbjct: 182 ------SATRGTFYKQFAGRNENLGDITAKGDAQVAVADLVGLVVGIELAR----LAGTN 231
Query: 318 ALLSCG-YVFSS-------YQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
AL G YV S + + ++V LN R ++ +F+ G + S + + E
Sbjct: 232 ALCVWGSYVVLSALDFIFILKSLDTIVFRFLNLERATMLASAFVSEGIIKSPADISRSEP 291
Query: 370 IFSFP 374
+ S P
Sbjct: 292 VLSRP 296
>gi|336467636|gb|EGO55800.1| hypothetical protein NEUTE1DRAFT_103211 [Neurospora tetrasperma
FGSC 2508]
gi|350287710|gb|EGZ68946.1| hypothetical protein NEUTE2DRAFT_72230 [Neurospora tetrasperma FGSC
2509]
Length = 361
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 123/269 (45%), Gaps = 16/269 (5%)
Query: 116 SLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLN 175
SL++ Q L + + +++ P G+P SVTP Y+PY T+ +L+ FF + + +L
Sbjct: 25 SLRAHQTKIPPLTALLHAFL-PAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLE 83
Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAG 234
+GV ++P I I D RV ++FA R G + + K RF D L +
Sbjct: 84 GLGVGDASSSPTGALILKIASDTISRVATILFAHRLGTAIEPECKFFRFLADCLNDAAQL 143
Query: 235 VELATAAVPH---LFLPLACAANVAKNVAAVTSTSTRTPIYKAF--------AKGENIGD 283
++L A+ L L + A V ++ V +++++ + F G N+ +
Sbjct: 144 LDLLVPAMMERYGLKLGVIVGAGVLRSWCGVAASASKASLSAHFIVVASQQKGGGNNLAE 203
Query: 284 VTAKGECVGNIADLLGTGLSIMISKRNPSLITTFAL---LSCGYVFSSYQEVKSVVLHTL 340
+ AK + LLG + ++ + + + L L ++ +Y V+ V TL
Sbjct: 204 LNAKEASQETVVSLLGMLVGSLVVQVVQDRLVVWVLMVGLLGVHLLMNYWGVRCVRCRTL 263
Query: 341 NRARFSVAVESFLKTGHVPSLREGNLQEN 369
NR R ++ V +L+TG V E L+ +
Sbjct: 264 NRQRATMVVREWLETGRVLGPEEDGLESH 292
>gi|367049005|ref|XP_003654882.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
gi|347002145|gb|AEO68546.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
Length = 451
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 114/237 (48%), Gaps = 5/237 (2%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L+ I +P G+P +VT Y+ Y T+ +L+ F + + +L +GV + ++P
Sbjct: 37 LLKAILYAFLPAGYPHTVTDDYLAYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSP 96
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
I I D R+ ++FA R G+ + + K RF DL + ++L T A+P+
Sbjct: 97 TGALILKITGDTISRIATILFAHRMGQAIEPECKFYRFLADLFNDSAQFLDLLTPALPYF 156
Query: 246 -FLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK---GECVGNIADLLGTG 301
L + +A V +++ V + +++ + FA N+ ++ AK E V ++ +L
Sbjct: 157 PKLAVIVSAGVLRSLCGVAAGASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLAGS 216
Query: 302 LSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHV 358
L + + + + L+ ++ +Y+ V+ V + +LNR R ++ +L G V
Sbjct: 217 LVVRLVEDKQLVWLLMVALAAVHLIMNYRAVRCVRMRSLNRQRATIVFREWLDHGAV 273
>gi|380471002|emb|CCF47486.1| hypothetical protein CH063_04176 [Colletotrichum higginsianum]
Length = 478
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 60/241 (24%), Positives = 117/241 (48%), Gaps = 7/241 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+PDSVTP Y+ Y + +L+ FF G+ + + +L +GV ++ + IL
Sbjct: 69 LPIGYPDSVTPDYIGYQAYDSLQAFFSTITGLLSNRAILQGLGVGDPNSSATYALLLTIL 128
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV-PHLFLPLACAA 253
KDG RV + FA R G + + K+ RF D+ + ++L + P + A
Sbjct: 129 KDGISRVATIAFAYRFGLVIEPECKRYRFLADIFNDSAFFLDLFSPYFDPWTKVAAIVLA 188
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK----GECVGNIADLLGTGLSIMISKR 309
+ + V + +++ + K FA+ N+ ++ AK +G + L+G+ + + R
Sbjct: 189 EALRAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAIGLVGLLVGSIVVRFVEGR 248
Query: 310 NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQEN 369
++ L ++ +Y V+ V L TLN+ R ++ + + +T V + ++ +EN
Sbjct: 249 E-AVFVLMVFLVFVHLAMNYFGVRCVQLDTLNQQRATILFDHYAQTKRVLTPKQVAWREN 307
Query: 370 I 370
I
Sbjct: 308 I 308
>gi|440638734|gb|ELR08653.1| hypothetical protein GMDG_03339 [Geomyces destructans 20631-21]
Length = 714
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 110/224 (49%), Gaps = 8/224 (3%)
Query: 129 SFIRSYV---VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRAT 185
S+ +S++ +P G+P+SVT Y+ Y + +L+ F G+ ++ +L VGV A+
Sbjct: 83 SWAKSFMEIFLPAGYPNSVTEDYMEYQIYDSLQAFSSSIAGLIASRAVLEGVGVGDATAS 142
Query: 186 PGAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
P + +L+D GR+ + F AR G + + K+ R D+ + ++ + A P
Sbjct: 143 PTTALLLSVLQDSTGRIATIFFAARLGLSLEPECKRWRLVADIFNDAAMILDCLSPAFPK 202
Query: 245 -LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TG 301
L + L A+ + + V + S + + FAK N+G++ AK I LLG G
Sbjct: 203 PLRVSLLSASAALRALCGVAAGSAKASLSAHFAKSGNLGELNAKDSSQETIISLLGMLVG 262
Query: 302 LSIMISKRNP-SLITTFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
+++ P T LL ++ ++Y V++V + TLNR R
Sbjct: 263 TALIPYLTTPLQTWTALLLLLSIHLTTNYFAVRAVSMRTLNRQR 306
>gi|413942077|gb|AFW74726.1| hypothetical protein ZEAMMB73_469739 [Zea mays]
Length = 425
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 68/280 (24%), Positives = 106/280 (37%), Gaps = 78/280 (27%)
Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELM--SFIRSYVVPDGFPDSVTPSYVPYMTWRALKH 159
+R K S+ + + P + + SF+ + P G+P SV Y+ Y +R L+H
Sbjct: 45 ERFSKLHDKSVTTVKVIQDSRPIHDKLVDSFLNKFF-PSGYPYSVNEGYLTYTKFRVLQH 103
Query: 160 FFGGAMGVFTTQTLLNS-----------------------VGV-------SRNRATPGA- 188
F + V +TQ + + VG R P A
Sbjct: 104 FSSAMLHVLSTQLVFQNRLLSVLSSLTSAHFNYARQLTAQVGCGSVLAVCCRFATYPDAS 163
Query: 189 -------------VAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAG---------- 225
+ ILKDG K+I + G + D + K R G
Sbjct: 164 NCCKLGKHMMTEMLKSKQILKDGMQHTRKLICSSMGARMDSEPKSWRILGLTAYVLYGCH 223
Query: 226 ---------------------DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTS 264
D+L +LG +E+ + P LFL +A N AK + V +
Sbjct: 224 CLGSHFTIFPIICLYCSILTADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMVVVAA 283
Query: 265 TSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
+TR PIY +FAK N+ D+ AKGE + + +++G G I
Sbjct: 284 RATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGI 323
>gi|67972176|dbj|BAE02430.1| unnamed protein product [Macaca fascicularis]
Length = 326
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 123/249 (49%), Gaps = 11/249 (4%)
Query: 202 VGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVA 260
+G+++FA +G K D + KQ R D+L ++ +E+ P F +N+AK +
Sbjct: 2 LGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIV 61
Query: 261 AVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITT 316
+V +TR + A+ N+ DV+AK E + N+A LL + L + + P +
Sbjct: 62 SVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGC 121
Query: 317 FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWL 376
F L+ +++++Y+ V+S+V+ TLN R + ++ +L+ G V N E +++ W
Sbjct: 122 FFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGFW- 180
Query: 377 RDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSDDIL 434
P S + + ++ E +E Y++ ++ S+ +V +L +A +L
Sbjct: 181 ---PAPSLSLGVPLHRLVSSVFELQQLVEGHQEPYLLCWDQSRNQVQVVLNQKAGPKTVL 237
Query: 435 KAAFHAYVL 443
+AA H +L
Sbjct: 238 RAATHGLML 246
>gi|414586416|tpg|DAA36987.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 155
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 60/102 (58%)
Query: 203 GKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAV 262
GK+I + G + D + K R D+L +LG +E+ + P LFL +A N AK +A V
Sbjct: 5 GKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVV 64
Query: 263 TSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSI 304
+ +TR PIY +FAK N+ D+ AKGE + + +++G G I
Sbjct: 65 AARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGI 106
>gi|310800465|gb|EFQ35358.1| hypothetical protein GLRG_10502 [Glomerella graminicola M1.001]
Length = 416
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 84/325 (25%), Positives = 142/325 (43%), Gaps = 35/325 (10%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+PDSVTP Y+ Y T+ +L+ FF G+ + + +L +GV ++ + IL
Sbjct: 10 LPIGYPDSVTPDYLGYQTFDSLQAFFSTITGLLSNRAILQGLGVGDPNSSATYALLLTIL 69
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELAT------AAVPHLFLP 248
KDG RV ++FA R G + + K+ RF D+ + + L + V L L
Sbjct: 70 KDGISRVATIVFAYRFGLVIEPECKRYRFLADIFNDSAFFLNLFSPYFDPWTKVGALVLA 129
Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK 308
A A + V + +++ + K FA+ N+ ++ AK E A L L +
Sbjct: 130 EALGA-----MCGVAAGASKAALSKHFARRNNLSELNAK-EASQETAVGLVGLLVGSVVV 183
Query: 309 R----NPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREG 364
R ++ L ++ +Y V+ V L TLN+ R ++ E + +T V + +
Sbjct: 184 RFVEGREAVFALMTFLVFVHLGMNYLGVRCVQLATLNQQRATILFEHYARTKQVLTPAQV 243
Query: 365 NLQENI-FSFPWLRDK--PVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVY 421
+ENI F P +++ VV RF ++ D + S G Y
Sbjct: 244 ASRENILFWSPVIKNHRGKVVANIRFAKSYADA---------------MTADVKSAGIRY 288
Query: 422 ALLKDQAKSDDILKAAFHAYVLLHC 446
LK + +S K A ++VL C
Sbjct: 289 ITLKGEGQSKKEAKTAAASFVLCLC 313
>gi|296411034|ref|XP_002835240.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295628015|emb|CAZ79361.1| unnamed protein product [Tuber melanosporum]
Length = 413
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 69/318 (21%), Positives = 138/318 (43%), Gaps = 34/318 (10%)
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
+L+ F G+ +++ +L GV A+ + I +D R+ ++FA R G
Sbjct: 6 SLQAFSSSIAGLLSSRAVLEGFGVGDANASATGALLLTIAQDCVSRLATILFAHRFGPAL 65
Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
+ + K+ R A D+ + ++ + A P + + + C + + V ++ +
Sbjct: 66 EAETKRYRLAADVFNDSAMILDCLSPAFPRSVRILMLCCSGSLRAACGVAGGGSKASLSV 125
Query: 274 AFAKGENIGDVTAKGECVGNIADLLGT-GLSIMISKRNPSLIT--TFALLSCGYVFSSYQ 330
FA+ N+G++ AK + L+G S ++S + + T + LL ++ ++++
Sbjct: 126 HFARSGNVGELNAKDSSQETVIGLMGMLAGSFVVSHISSTWATWISLTLLQAVHLGTNFK 185
Query: 331 EVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPW----------- 375
V +V L TLNR R ++ +F TG + + R+ E IF W
Sbjct: 186 AVTAVALRTLNRQRANIVCSAFFDTGEILTPRQAARHERIFEKDGVLRWGGSGVLGNAEI 245
Query: 376 ----------LRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLK 425
LR P R K+ + + + + F+ E+Y++ Y+ + V LK
Sbjct: 246 GVKLTALIRSLRQSP----ERKKEGEGEGVSPVELASVFQNEKYLLWYDVQRRLVTICLK 301
Query: 426 DQAKSDDILKAAFHAYVL 443
++ ++ LK+ HA VL
Sbjct: 302 EKVSPEEQLKSWVHALVL 319
>gi|452981822|gb|EME81581.1| hypothetical protein MYCFIDRAFT_100197, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 272
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 64/222 (28%), Positives = 116/222 (52%), Gaps = 8/222 (3%)
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
+++ F G+ +++ +L SVGV + A+P + + +L+ R+ K+ FA R GK
Sbjct: 34 SIQAFVSSIAGLLSSRAVLLSVGVGDSSASPTSALLLSMLQQSLSRIAKVAFAYRLGKAI 93
Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPHLFLP-LACAANVAKNVAAVTSTSTRTPIYK 273
+ + K RF DLL + + + +P F P + +++ V V ++ST+ +
Sbjct: 94 EPECKMYRFCADLLHDSSFVLNCLSPMLPKAFRPPILALSSICFAVCDVVASSTKASLSA 153
Query: 274 AFAKGENIGDVTAKGECVGNIADLLG--TGLSIMIS--KRNPSLITTFALLSCGYVFSSY 329
FAK EN+G++ AK I LLG TG S++++ + + T L+ ++ ++
Sbjct: 154 HFAKWENLGELNAKDSSQETIISLLGMLTG-SLLVNWISGHWATWTALILMLSVHLEANR 212
Query: 330 QEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIF 371
+ V+++V+ TLNR R +V V L GH+P E + E IF
Sbjct: 213 RAVRAIVMQTLNRQRATV-VYHRLCVGHIPRPAEVSRVERIF 253
>gi|398015008|ref|XP_003860694.1| hypothetical protein, conserved [Leishmania donovani]
gi|322498916|emb|CBZ33989.1| hypothetical protein, conserved [Leishmania donovani]
Length = 675
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 107/237 (45%), Gaps = 13/237 (5%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
+++ + ++PDG+P SVT +P+ W + +T+ +L VGV ++A
Sbjct: 64 VVTALERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADL 123
Query: 187 GAVAINWILKDGAGRVGKMIFAR---QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ ++W+++DG +G ++FA QG + K + D ++ +EL +P
Sbjct: 124 TSSTLSWMMQDGTRMIGSIVFASLIPQG--LECRAKTWQLVADFTTDIANLLELCAPWLP 181
Query: 244 ---HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
+F + AA+ K + V + TR + FA N D+ AK ++ +G
Sbjct: 182 GCQTIFRVVLVAASAIKALGGVCDSGTRASFTQHFALRNNAADIAAKAATRSSVGSFIGL 241
Query: 301 GLSIMISKRNPSL-----ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
L ++ P+ + FAL S +F+ Y+ V+ V L LN R +E +
Sbjct: 242 ALGTAVAYSVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPRLEWCLEQY 298
>gi|409076043|gb|EKM76417.1| hypothetical protein AGABI1DRAFT_131245 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 466
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 91/370 (24%), Positives = 152/370 (41%), Gaps = 39/370 (10%)
Query: 118 QSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
+S +EL++ I +P G+P++VTP Y+ Y A + F + ++ L +
Sbjct: 41 RSKHQDTKELLTKI---FLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGI 97
Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVE 236
GV AT + +L+D GR+ +I A G + + K+ RF D+L ++ V+
Sbjct: 98 GVGDPSATSTGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVD 157
Query: 237 LATAAVPHLFLPLA-----CAANVAKNVAAVTSTSTRTPIYKAFAKGE----NIGDVTAK 287
+ P A C + + V V + + I FA ++GD+ AK
Sbjct: 158 VFNPVFSAFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAK 217
Query: 288 GECVGNIADLLGTGL-SIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
+ LLG L S+MI +T LL ++ +Y V+ + + + NR R
Sbjct: 218 DSSKETVLALLGMLLGSVMIPHITSPWVTYPVLLLLISVHLVINYIAVRGLAIRSPNRHR 277
Query: 345 FSVAVESFL-KTGHVPSLREGNLQENIFSFPW-LRDKPVVL-------GSRFKDAFQDPD 395
++A + K PS + +E IF P +RD L GS D + P
Sbjct: 278 ATIAWLWYRSKDECAPSPQAVASRELIFERPENIRDPTTSLVLGKCHVGSAPLDVMRGPA 337
Query: 396 AYLAVEPFFEKERYVVTYNPSKGK-----VYALLKDQAKSDDILKAAFHAYVLLHCINSS 450
F ++Y+V +NPS ++ K+ D LK+ HA +
Sbjct: 338 LSQEYLDIFHADKYIVCFNPSHETGHLPIIHICFKEGFTRADELKSWIHAVEIC------ 391
Query: 451 NGHRSLALKQ 460
RS+AL Q
Sbjct: 392 ---RSIALNQ 398
>gi|339898153|ref|XP_001465441.2| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321399425|emb|CAM67862.2| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 675
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 107/237 (45%), Gaps = 13/237 (5%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
+++ + ++PDG+P SVT +P+ W + +T+ +L VGV ++A
Sbjct: 64 VVTALERILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADL 123
Query: 187 GAVAINWILKDGAGRVGKMIFAR---QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ ++W+++DG +G ++FA QG + K + D ++ +EL +P
Sbjct: 124 TSSTLSWMMQDGTRMIGSIVFASLIPQG--LECRAKTWQLVADFTTDIANLLELCAPWLP 181
Query: 244 ---HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
+F + AA+ K + V + TR + FA N D+ AK ++ +G
Sbjct: 182 GCQTIFRVVLVAASAIKALGGVCDSGTRASFAQHFALRNNAADIAAKAATRSSVGSFIGL 241
Query: 301 GLSIMISKRNPSL-----ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
L ++ P+ + FAL S +F+ Y+ V+ V L LN R +E +
Sbjct: 242 ALGTAVAYSVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPRLEWCLEQY 298
>gi|315056527|ref|XP_003177638.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311339484|gb|EFQ98686.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 495
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 88/392 (22%), Positives = 168/392 (42%), Gaps = 58/392 (14%)
Query: 120 PQAPAEE-LMSFIRSYVVPDGFPDSVTPSYVPYMTW------RALKHFFGGAMGVFT--- 169
PQ P+ ++ I + +P G+P SVT Y+ + + L G
Sbjct: 46 PQKPSSSSYLTSILNVFLPVGYPHSVTDDYIESSSLFRDYEKKELTSLISGLTSSILKLN 105
Query: 170 --TQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGD 226
+ + +SVGV A+P A + +L++ GR+ ++FA R G + + K R A D
Sbjct: 106 CWSPVIQSSVGVGDATASPTAALLLNVLQESMGRIATILFAHRLGTSLEPECKMYRLAAD 165
Query: 227 LLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
LL + ++ + A P + + L ++V + + V + S++ + FA+ N+G++
Sbjct: 166 LLNDSAIVLDCMSPAFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELN 225
Query: 286 AKGECVGNIADLLGTGL-SIMISKRNPSLITTFA--LLSCGYVFSSYQEVKSVVLHTLNR 342
AK + L G + S+++S + L T A LL ++ +++ V++V + TLNR
Sbjct: 226 AKDSSQETVISLFGMLVGSVVVSYISSPLATWIALILLLIVHLGTNHAAVRAVKMTTLNR 285
Query: 343 ARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG-------------- 384
R ++ + V + + + +E IF W LG
Sbjct: 286 QRANIVFSYLFEDDRVLTPAQASEEERIFETDGVLRWKAKATGTLGICQIGVSLEQLLLL 345
Query: 385 ------SRFKDAFQDPDAYL--------AVEPFFEKERYVVTYNPSKGKVYALLKDQAKS 430
+ + + Q PD + A+ F++E Y++ ++P+ + +LK A
Sbjct: 346 LPEQVNTNAQGSSQKPDTTMVGASTGLSALLELFKEEEYILYFSPAARRGAIVLKAGATP 405
Query: 431 DDILKAAFHAYVLLHCINSSNGHRSLALKQHE 462
LKA HA ++ R LA ++H+
Sbjct: 406 QAQLKAWSHALLV---------SRHLAREEHD 428
>gi|159124134|gb|EDP49252.1| DUF647 domain protein [Aspergillus fumigatus A1163]
Length = 542
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 113/249 (45%), Gaps = 37/249 (14%)
Query: 119 SPQAPAE--ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNS 176
+P++P L + + +P G+P+SVT Y+P +L
Sbjct: 56 TPRSPWSLMTLFNLLVDVFLPSGYPNSVTDDYLP---------------------AVLQG 94
Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
VGV A+P + + IL+D +GR+ ++FA R G + + K R A D+ +L +
Sbjct: 95 VGVGNANASPTSALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVL 154
Query: 236 ELATAAVPHLF--LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN 293
+ + +P + + AA V + + V S++ + FA+G N+ +V AK
Sbjct: 155 DCLSPMIPAGVGRVGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQET 214
Query: 294 IADLLGTGLSIMISKRNPSLITTFALLSCGYVF-------SSYQEVKSVVLHTLNRARFS 346
I LLG +++ S +T+FA VF +Y V+SV + +LNR R +
Sbjct: 215 IISLLG----MLVGSFIVSHVTSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRAN 270
Query: 347 VAVESFLKT 355
+ + L++
Sbjct: 271 IVFSTLLES 279
>gi|70991459|ref|XP_750578.1| DUF647 domain protein [Aspergillus fumigatus Af293]
gi|66848211|gb|EAL88540.1| DUF647 domain protein [Aspergillus fumigatus Af293]
Length = 542
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 113/249 (45%), Gaps = 37/249 (14%)
Query: 119 SPQAPAE--ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNS 176
+P++P L + + +P G+P+SVT Y+P +L
Sbjct: 56 TPRSPWSLMTLFNLLVDVFLPSGYPNSVTDDYLP---------------------AVLQG 94
Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
VGV A+P + + IL+D +GR+ ++FA R G + + K R A D+ +L +
Sbjct: 95 VGVGNANASPTSALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVL 154
Query: 236 ELATAAVPHLF--LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN 293
+ + +P + + AA V + + V S++ + FA+G N+ +V AK
Sbjct: 155 DCLSPMIPAGVGRVGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQET 214
Query: 294 IADLLGTGLSIMISKRNPSLITTFALLSCGYVF-------SSYQEVKSVVLHTLNRARFS 346
I LLG +++ S +T+FA VF +Y V+SV + +LNR R +
Sbjct: 215 IISLLG----MLVGSFIVSHVTSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRAN 270
Query: 347 VAVESFLKT 355
+ + L++
Sbjct: 271 IVFSTLLES 279
>gi|440465939|gb|ELQ35234.1| hypothetical protein OOU_Y34scaffold00720g28 [Magnaporthe oryzae
Y34]
gi|440485195|gb|ELQ65173.1| hypothetical protein OOW_P131scaffold00517g6 [Magnaporthe oryzae
P131]
Length = 431
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/241 (22%), Positives = 115/241 (47%), Gaps = 5/241 (2%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFP +VT Y+ Y + +L+ F + + +L +GV +P A + I+
Sbjct: 41 LPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAALVLKII 100
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGA-GVELATAAVPHLFLPLACAA 253
+D R+ ++FA R G + + K RF D+ + L + + +
Sbjct: 101 QDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLPKVSVLVTS 160
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNP 311
+++ V + +++ + FAK N+ ++ AK + L+G TG +++ +
Sbjct: 161 GCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTAVVHMVEDK 220
Query: 312 SLITTF-ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+ + + +L C ++ +Y+ V++V + TLNR R ++ FL+TG V + + + +E+I
Sbjct: 221 TAVWCWMVVLICMHLMMNYRAVRNVRMLTLNRQRATIVFREFLETGKVLTPAQCSARESI 280
Query: 371 F 371
Sbjct: 281 L 281
>gi|396461797|ref|XP_003835510.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
gi|312212061|emb|CBX92145.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
Length = 543
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 131/298 (43%), Gaps = 30/298 (10%)
Query: 174 LNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELG 232
L VGV A+P A + +L+D GR ++FA R G F+ + K RF D+L + G
Sbjct: 176 LAGVGVGDANASPTAALLLSVLQDSMGRTATILFADRLGTAFEPECKMYRFTADILNDTG 235
Query: 233 AGVELATAAVPH--LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGEC 290
+E + A P L+C+ +V +++ V + S + + FA+ N+G+V AK
Sbjct: 236 MVLECLSPAFPKPIRVCVLSCS-SVLRSLCGVCAGSAKASLSAHFARKGNLGEVNAKDSS 294
Query: 291 VGNIADLLGT-GLSIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSV 347
+ LLG S+++S + + T T L ++ ++Y V++V + LNR R ++
Sbjct: 295 QETVISLLGMLAGSVVVSWISSPMATWSTLIGLLAIHLATNYAAVRAVSMRCLNRQRANI 354
Query: 348 AVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLGSRFKDAFQDPDAYLAVE-- 401
L+ G + S E + +E +F W D + R + Q +++ +
Sbjct: 355 LFSRLLEHGQLLSPLEVSRRERVFEHDGVLRWSDDMAIGY-CRIGVSLQTLLSHMGLRNA 413
Query: 402 ----------------PFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
F E Y++ S G +LK+ D LKA HA +L
Sbjct: 414 RTGSLHLRGAEISELLHVFVDEAYILWPASSAGHFLIVLKEGCTPMDQLKAWAHALLL 471
>gi|426194386|gb|EKV44317.1| hypothetical protein AGABI2DRAFT_120451 [Agaricus bisporus var.
bisporus H97]
Length = 462
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 97/409 (23%), Positives = 165/409 (40%), Gaps = 41/409 (10%)
Query: 79 DSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPD 138
+ +L L+ +D R Y + ++ S R +S +EL++ I +P
Sbjct: 4 NPALLLVERDDKKQMRELYYSPEKTLHIVEKKSHR--EYRSKHQDTKELLTKI---FLPA 58
Query: 139 GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDG 198
G+P++VTP Y+ Y A + F + ++ L +GV AT + +L+D
Sbjct: 59 GYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPSATSTGAMLISVLQDI 118
Query: 199 AGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA-----CA 252
GR+ +I A G + + K+ RF D+L ++ V++ P A C
Sbjct: 119 FGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVFSAFMFPGARVIGLCT 178
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGE----NIGDVTAKGECVGNIADLLGTGL-SIMIS 307
+ + V V + + I FA ++GD+ AK + LLG L S++I
Sbjct: 179 SAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKETVLALLGMLLGSVLIP 238
Query: 308 KRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFL-KTGHVPSLREG 364
+T LL ++ +Y V+ + + + NR R ++A + K PS +
Sbjct: 239 HITSPWVTYPVLLLLISVHLVINYIAVRGLAIRSPNRHRATIAWLWYRSKDECAPSPQAV 298
Query: 365 NLQENIFSFPW-LRDKPVVL-------GSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPS 416
E IF P +RD L GS D + P F ++Y+V +NPS
Sbjct: 299 ASHELIFERPENIRDPTTSLVLGKCHVGSAPLDVMRGPALSKEYLDIFHADKYIVCFNPS 358
Query: 417 KGK-----VYALLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQ 460
++ K+ D LK+ HA + RS+AL Q
Sbjct: 359 HETGHLPIIHICFKEGFTRADELKSWIHAVEIC---------RSIALNQ 398
>gi|350630832|gb|EHA19204.1| hypothetical protein ASPNIDRAFT_186948 [Aspergillus niger ATCC
1015]
Length = 504
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/303 (24%), Positives = 129/303 (42%), Gaps = 38/303 (12%)
Query: 65 DETTAFRLVE-DQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRA-ISLQSPQA 122
+++TA E D+ + I SE G A R+ K N R ++ S A
Sbjct: 5 EDSTAITFTEVDEVNHPTATYIYSEPAAGP-----AHREHGKKHGENWGRVDVAHASSTA 59
Query: 123 PA----EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVG 178
PA + + F+ +P G+P SVT Y P +L VG
Sbjct: 60 PAAWSSKSVQDFLVEVFLPAGYPHSVTDDYAP---------------------AVLQGVG 98
Query: 179 VSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVEL 237
V A+P A + IL+D +GR+ ++FA R G + + K R A D+ +L ++
Sbjct: 99 VGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMTLDC 158
Query: 238 ATAAVP--HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 295
+ +P + + A V + + V S++ + F++ N+ +V AK I
Sbjct: 159 LSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETII 218
Query: 296 DLLGTGLSIMISKRNPSLITTFA---LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
L+G + + R + TT+ +L ++ +Y V+SV + +LNR R ++ +
Sbjct: 219 SLIGMLVGSFVVSRVTNFSTTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTL 278
Query: 353 LKT 355
L T
Sbjct: 279 LNT 281
>gi|384494915|gb|EIE85406.1| hypothetical protein RO3G_10116 [Rhizopus delemar RA 99-880]
Length = 188
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 54/89 (60%), Gaps = 1/89 (1%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P+SV Y + +W L+ + G A+GV +Q +L S+G+ AT GAVAI W+L
Sbjct: 59 LPVGYPESVHSCYKKFHSWLFLETYVGSAVGVLCSQAMLASLGLGTVEATGGAVAIQWVL 118
Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRF 223
KDG G +GK+ F ++ FD K +F
Sbjct: 119 KDGIGEIGKLFFIKKYASTFDSHPKTWKF 147
>gi|258576451|ref|XP_002542407.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902673|gb|EEP77074.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 442
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 78/354 (22%), Positives = 155/354 (43%), Gaps = 52/354 (14%)
Query: 133 SYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAIN 192
+Y++ + TP P +L+ F G+ +++ +L VGV A+P A +
Sbjct: 20 TYLISSPEASAGTP---PVTIVDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAALLL 76
Query: 193 WILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLAC 251
+L++ GR+ ++FA + + K R A D+ + ++ + P + + +
Sbjct: 77 NVLQESMGRIATILFAHR-----LECKMYRLAADIFNDSAIVMDCLSPMFPKPMRVGVLS 131
Query: 252 AANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG------------ 299
++V + + V + S++ + FA+ N+ ++ AK + L+G
Sbjct: 132 LSSVLRAMCGVAAGSSKASLSAHFARWGNLAELNAKDSSQETVISLMGMLVRPILGFLES 191
Query: 300 -TGL-------SIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAV 349
+GL S+++S+ +L T LL ++ ++Y V++V + TLNR R ++
Sbjct: 192 ISGLILTQQVGSLVVSRVTSTLATWAALILLLTIHITTNYFAVRAVNMTTLNRQRANIVF 251
Query: 350 ESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLGS-RFKDAFQ------------ 392
+ + + QE IF W + P LGS R +FQ
Sbjct: 252 STIFDENRTLTPSQAAHQERIFERDGVLRW-KASPASLGSCRIGVSFQYMVQLIASSESC 310
Query: 393 --DPDAYL-AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
D + + A+ FE+E Y++ +NP++ + +LK A + LKA HA ++
Sbjct: 311 TSDGQSIVAALLHLFEREEYILWFNPTRKRGAIVLKTNATAASQLKAWSHALIV 364
>gi|170102118|ref|XP_001882275.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642647|gb|EDR06902.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 285
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 128/281 (45%), Gaps = 24/281 (8%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
++ I +P G+PDSV+P Y+ Y AL+ F G+ +++ +L GV A
Sbjct: 2 RRVLDLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLAGLLSSRAILEGFGVGDPSA 61
Query: 185 TPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
T + +L+D R+ ++ A G D K+ R D+L + ++ + +
Sbjct: 62 TATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDAKKYRLLADMLNDAAVILDTISPRLD 121
Query: 244 HLFLP-----LACAANVAKNVAAVTSTSTRTPIYKAFA---KGE-NIGDVTAKGECVGNI 294
LF P C + + +++ A+++ ++ I FA KG ++GD+ AK +
Sbjct: 122 TLFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKETV 181
Query: 295 ADLLGTGLSIMISKRNPSLIT------TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
LLG L +I P L T T +L ++ +Y V+ +VL TLN R +A
Sbjct: 182 LALLGMLLGTLIV---PHLTTPWTIYATLLVLVGLHLTINYIGVRGLVLRTLNGQRTWIA 238
Query: 349 VESFLKTG--HVPSLREGNLQENIFSFP-WLRD--KPVVLG 384
++++T P+ E E IF P RD + V+LG
Sbjct: 239 WSAYMRTQPRRAPTPVEVASLERIFERPGGFRDPRRGVILG 279
>gi|407259493|gb|AFT91377.1| hypothetical protein [Emericella rugulosa]
Length = 549
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 57/226 (25%), Positives = 101/226 (44%), Gaps = 22/226 (9%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ Y P + A+ VGV A+P + + IL
Sbjct: 74 LPAGYPHSVSDDYTPALAQPAIH----------------AGVGVGNANASPTSALLLHIL 117
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
+D +GR+ ++FA R G + + K RFA D+ +L ++ + +P F + + A
Sbjct: 118 QDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCLSPMIPAGFNRVTVLSA 177
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPS 312
A V + + V S++ + FA+ N+ DV AK + L+G + + S
Sbjct: 178 AGVLRALCGVAGGSSKASLSAHFARWGNLADVNAKDSSQETVISLIGMLVGSFVVSHITS 237
Query: 313 LITTF---ALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
T+ L ++ +Y V+SV + TLNR R ++ + L +
Sbjct: 238 FTATWLTLVFLVSMHLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 283
>gi|392585079|gb|EIW74420.1| DUF647-domain-containing protein [Coniophora puteana RWD-64-598
SS2]
Length = 515
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 125/291 (42%), Gaps = 44/291 (15%)
Query: 98 VAERDRSGKFKRNSIRAISLQSPQAPAEEL----------MSFIRSYVV----PDGFPDS 143
V ERD SG +R + + Q + L + IR +V P G+P S
Sbjct: 12 VLERDESGHLRRTTFGSDGKQYKASGRYNLRDELGLRVPSLDRIRRVLVDLFLPAGYPAS 71
Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVG 203
VTP Y+PY + AL+ FF G+ ++ L VGV AT + + +L+D GR+
Sbjct: 72 VTPDYLPYHAYNALQAFFSSLAGLIASRGALQGVGVGDASATATSALLLTVLQDVFGRIT 131
Query: 204 KMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELAT------AAVPHLFLPLA---CAA 253
+ A G + K R DL+ + A + L T + H FL LA CA+
Sbjct: 132 TISAAYVFGTSLYPETKTFRLVADLVND--ASIVLDTLCPRLSLSFLHTFL-LAISLCAS 188
Query: 254 NVAKNVAAVTSTSTRTPIYKAF---------AKGE-----NIGDVTAKGECVGNIADLLG 299
+ + + + ++ + F A GE ++G++ AK + L G
Sbjct: 189 AALRALCGLVAGGSKAALTNHFACTVPASTAAGGEAVVAGDVGELNAKDASRETVVGLFG 248
Query: 300 TGLSIMISKRNPSLITTF---ALLSCGYVFSSYQEVKSVVLHTLNRARFSV 347
L ++ R S+ T+ L G++ ++Y V+SV L TLNR R +
Sbjct: 249 MLLGTLLVPRLTSVRATWLTLVFLIAGHLLANYLAVRSVSLRTLNRQRAGI 299
>gi|390341859|ref|XP_795645.3| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2
[Strongylocentrotus purpuratus]
Length = 311
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/220 (25%), Positives = 111/220 (50%), Gaps = 13/220 (5%)
Query: 235 VELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECV 291
+EL + P F+ +AC +++ K++ V +TR + A+ N+ DV+AK E +
Sbjct: 8 IELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAKDGSQETL 67
Query: 292 GNIADLLGTGLSIM-ISKRNPSLI-TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAV 349
N+A L GL I I+ +N L + F L +C +++++Y+ V VV+ TLN+ RF + V
Sbjct: 68 VNLAALF-VGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQNRFHILV 126
Query: 350 ESFLKTGH--VPSLREGNLQENIFSFPWLRDKPVV--LGSRFKDAFQDPDAYLAVEPFFE 405
+ +L + + N +E + W +P+ LG F+ Q
Sbjct: 127 QDYLHSSQFSMSGPDAVNSREPVI---WRLSRPLKLNLGVPFRTVAQSMRDLEMCLQSNG 183
Query: 406 KERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLH 445
+E+++++++ KG + + + S+D +KA A ++++
Sbjct: 184 QEKFLLSFDVKKGCIDIAIHHDSTSEDFIKACLQAEIIMY 223
>gi|119467992|ref|XP_001257802.1| hypothetical protein NFIA_052500 [Neosartorya fischeri NRRL 181]
gi|119405954|gb|EAW15905.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 515
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 100/206 (48%), Gaps = 6/206 (2%)
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
+L+ F G+ +++ +L VGV A+P + + IL+D +GR+ ++FA R G
Sbjct: 86 SLQAFCSSIAGLLSSRAVLQGVGVGNANASPTSALLLHILQDSSGRIATILFAHRVGTAL 145
Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACAANVAKNVAAVTSTSTRTPIY 272
+ + K R A D+ ++ ++ + +P + + AA V + + V S++ +
Sbjct: 146 EPECKMYRLAADVFNDVAMVLDCLSPMIPAGVGRVGVLSAAGVLRALCGVAGGSSKASLS 205
Query: 273 KAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFA---LLSCGYVFSSY 329
FA+G N+ +V AK I L+G + I T+A L ++ +Y
Sbjct: 206 AHFARGGNLAEVNAKDSSQETIISLIGMLVGSFIVSHVTCFAATWASLLFLLTVHLGMNY 265
Query: 330 QEVKSVVLHTLNRARFSVAVESFLKT 355
V+SV + +LNR R ++ + + L++
Sbjct: 266 AAVRSVQMTSLNRQRANIVLSTLLES 291
>gi|414586415|tpg|DAA36986.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 163
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 67/126 (53%), Gaps = 5/126 (3%)
Query: 102 DRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFF 161
D +G R SI +Q + ++L+ + + P G+P SV Y+ Y +R L+HF
Sbjct: 40 DAAGSGSR-SISVKVIQDSRPIHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFS 98
Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQGKKFDYDLKQ 220
+ V +TQ+LL + G+ R TP A A++WILKDG GK+I + G + D + K
Sbjct: 99 SAMLHVLSTQSLLFAAGL---RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKS 155
Query: 221 LRFAGD 226
R G+
Sbjct: 156 WRILGE 161
>gi|390341857|ref|XP_003725545.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1
[Strongylocentrotus purpuratus]
Length = 311
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 56/221 (25%), Positives = 114/221 (51%), Gaps = 15/221 (6%)
Query: 235 VELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECV 291
+EL + P F+ +AC +++ K++ V +TR + A+ N+ DV+AK E +
Sbjct: 8 IELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAKDGSQETL 67
Query: 292 GNIADLLGTGLSIM-ISKRNPSLI-TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAV 349
N+A L GL I I+ +N L + F L +C +++++Y+ V VV+ TLN+ RF + V
Sbjct: 68 VNLAALF-VGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQNRFHILV 126
Query: 350 ESFLKTGHVPSLREGNLQENIFSFPWLRDKPVV--LGSRFK---DAFQDPDAYLAVEPFF 404
+ +L + S+ + + W +P+ LG F+ + +D + L
Sbjct: 127 QDYLHSSQF-SMSGPDAVNSREPVIWRLSRPLKLNLGVPFRTVAKSMRDIEMCLQSNG-- 183
Query: 405 EKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLH 445
+E+++++ + KG + ++ + S+D +KA A ++++
Sbjct: 184 -QEKFLLSLDVKKGCIDIVIHHDSTSEDFIKACLQAEIIMY 223
>gi|391863517|gb|EIT72825.1| hypothetical protein Ao3042_00909 [Aspergillus oryzae 3.042]
Length = 236
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 79/154 (51%), Gaps = 3/154 (1%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ YVPY + +L+ F G+ +++ +L VGV A+P A + IL
Sbjct: 76 LPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHIL 135
Query: 196 KDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACA 252
+D +GR+ ++FA R G + + K R A D+ + ++ + +P F + +
Sbjct: 136 QDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLST 195
Query: 253 ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
A V + + V S++ + F++ N+ +V A
Sbjct: 196 AGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNA 229
>gi|440799746|gb|ELR20790.1| protein C16orf58 family protein [Acanthamoeba castellanii str.
Neff]
Length = 101
Score = 72.4 bits (176), Expect = 5e-10, Method: Composition-based stats.
Identities = 32/74 (43%), Positives = 46/74 (62%)
Query: 135 VVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWI 194
++P+G+P+SV+ Y Y TW ++ G+ T+ LL VGV ATPGA A+ W
Sbjct: 21 LLPEGYPESVSGDYATYQTWDTVQALCSSITGLLCTRALLQGVGVGDVGATPGAAAVQWA 80
Query: 195 LKDGAGRVGKMIFA 208
L+DGAG VG++ FA
Sbjct: 81 LRDGAGMVGRIGFA 94
>gi|407034154|gb|EKE37096.1| hypothetical protein ENU1_209440 [Entamoeba nuttalli P19]
Length = 400
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 73/338 (21%), Positives = 144/338 (42%), Gaps = 39/338 (11%)
Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
Q P ++ + ++ +VP G+P SV Y ++ + + M V + +L
Sbjct: 35 QQPFQKSQNHLQELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL------ 88
Query: 181 RNRATPGAVAINWIL--KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
R PG A +++ + G + +I R G + KQ R +L+ EL +E+A
Sbjct: 89 --RINPGHSAAFFVVFREVMTGMMHLIITERWGTSIVFFAKQWRMRIELISELLRIIEIA 146
Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN----I 294
+ P +F ++ +V + V + +T I +AKG N+ ++T K + + +
Sbjct: 147 FISTP-IFYSISVTCSVIEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVV 205
Query: 295 ADLLGTGLSIMISKRNPSLITTF--ALLS---CGYVFSSYQEVKSVVLHTLNRARFSVAV 349
+ L+G + ++ S I F AL+S C ++ S+ +V TLN R ++ +
Sbjct: 206 SLLIGWVILHVLGDFTESGIIVFICALISHCICNFLLSN-----VIVFKTLNYERLTILM 260
Query: 350 ESFLKTGH------VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPF 403
F+ + V S E ++ N + +G + D Y+ +
Sbjct: 261 SYFMNIKNEILSPVVVSRIESKIKVN-------EMVNIRMGISLNSIPKSSDLYIEISQS 313
Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
+K +Y + Y S +++ +L D D+I ++ AY
Sbjct: 314 IKKYQYYL-YKTSTNQIWVILGDHITKDEIFESLLCAY 350
>gi|194383144|dbj|BAG59128.1| unnamed protein product [Homo sapiens]
Length = 156
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 42/130 (32%), Positives = 69/130 (53%), Gaps = 8/130 (6%)
Query: 79 DSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPD 138
D SL+ ++ G RW ++ R+ K A + + AP+ L S +++ +P
Sbjct: 29 DGSLQW----EVGGWRWWGLS---RAFTVKPEGRDAGEVGASGAPSPPL-SGLQAVFLPQ 80
Query: 139 GFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDG 198
GFPDSV+P Y+PY W +++ F G TQ +L +GV +AT A W++KD
Sbjct: 81 GFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLVKDS 140
Query: 199 AGRVGKMIFA 208
G +G+++FA
Sbjct: 141 TGMLGRIVFA 150
>gi|67467345|ref|XP_649789.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56466294|gb|EAL44403.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449707066|gb|EMD46790.1| Hypothetical protein EHI5A_044670 [Entamoeba histolytica KU27]
Length = 400
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/338 (21%), Positives = 144/338 (42%), Gaps = 39/338 (11%)
Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
Q P ++ + ++ +VP G+P SV Y ++ + + M V + +L
Sbjct: 35 QQPFQKSQNHLQELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL------ 88
Query: 181 RNRATPGAVAINWIL--KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
R PG A +++ + G + +I R G + KQ R +L+ EL +E+A
Sbjct: 89 --RINPGHSAAFFVVFREVMTGMMHLIITERWGTSIVFFAKQWRMRIELISELLRIIEIA 146
Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN----I 294
+ P +F ++ +V + V + +T I +AKG N+ ++T K + + +
Sbjct: 147 FISTP-IFYSISVTCSVIEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVV 205
Query: 295 ADLLGTGLSIMISKRNPSLITTF--ALLS---CGYVFSSYQEVKSVVLHTLNRARFSVAV 349
+ L+G + ++ S I F AL+S C ++ S+ +V TLN R ++ +
Sbjct: 206 SLLIGWLILHVLGDFTESGIIVFICALISHCICNFLLSN-----VIVFKTLNYERLTILM 260
Query: 350 ESFLKTGH------VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPF 403
F+ + V S E ++ N + +G + D Y+ +
Sbjct: 261 SYFMNIKNEILSPVVVSRIESKIKVN-------EMVNIRMGISLNSIPKSSDLYIDISQS 313
Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
+K +Y + Y S +++ +L D D+I ++ AY
Sbjct: 314 IKKYQYYL-YKTSTNQIWVILGDHITKDEIFESLLCAY 350
>gi|240280131|gb|EER43635.1| DUF647 domain-containing protein [Ajellomyces capsulatus H143]
Length = 521
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/393 (23%), Positives = 157/393 (39%), Gaps = 71/393 (18%)
Query: 109 RNSIRAISLQSP-QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGV 167
R+ R + + P + + ++ + + +P G+P SV+ Y+ + G A GV
Sbjct: 43 RHERRRVDIVQPSRKDSSSYLASLLNVFLPVGYPHSVSDDYLDCL---------GFASGV 93
Query: 168 FT----TQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLR 222
T + VGV A+P ++ +L+D GR+ ++FA R G + + K R
Sbjct: 94 LKFNCRTAGVPGWVGVGDASASPTVALLHSVLQDSMGRIATILFAHRLGTSLEPECKMYR 153
Query: 223 FAGDLLMELGAGVELATAAVPHLFLPLACA-ANVAKNVAAVTSTSTRTPIYKAFAKGENI 281
A D+L + + + A+P + A ++V + + V + S++ + FAK N+
Sbjct: 154 LAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVLRALCGVAAGSSKASLSSHFAKCGNL 213
Query: 282 GDVTA---KGECVGNIADLLGTGLSIMIS------KRNPSLITTFALLS--CGYVFSS-- 328
G++ A V I+ G + S +++ S T +LL CG V S
Sbjct: 214 GELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKLQKDSSQETVISLLGILCGSVVVSHI 273
Query: 329 ---------------------YQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQ 367
Y V+SV + TLNR R ++ + + G V + E +
Sbjct: 274 STPFATWTTLLLLLLVHLSTNYAAVRSVNMTTLNRQRTNIVFSTLFEKGSVLTPTEASKC 333
Query: 368 ENIFS----FPWLRDKPVVLGSRFKDAFQD----------------PDAYLAVEPFFEKE 407
E IF W + + +FQ+ D Y + FEKE
Sbjct: 334 ERIFERDGILRWKASSATLGYCQIGGSFQELLRGSIHGANSIRDVGIDIYKLLR-LFEKE 392
Query: 408 RYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
YV+ +NP K +LK+ L+A HA
Sbjct: 393 EYVLWFNPIHKKGTIVLKNNVTPISQLQAWSHA 425
>gi|159474944|ref|XP_001695583.1| hypothetical protein CHLREDRAFT_191958 [Chlamydomonas reinhardtii]
gi|158275594|gb|EDP01370.1| predicted protein [Chlamydomonas reinhardtii]
Length = 487
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 72/126 (57%), Gaps = 2/126 (1%)
Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLR 222
A V + +L +VG+ A P A A+NW+LKDG G++G ++F + FD K
Sbjct: 150 ANSVLASTFMLYAVGLGAG-AIPTAGALNWVLKDGMGQLGTLVFGKTIAHNFDVHSKTWF 208
Query: 223 FAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIG 282
F +L++ A +E+ T VP FL + AN+ K +A + + STR+ + +FA+ NI
Sbjct: 209 FLSAVLLQAAAALEMLTVLVPGHFLLMGSLANMLKGLAWMAAGSTRSVFHLSFARDNNIA 268
Query: 283 DVTAKG 288
DVTAKG
Sbjct: 269 DVTAKG 274
>gi|147803314|emb|CAN66551.1| hypothetical protein VITISV_039658 [Vitis vinifera]
Length = 470
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 140/328 (42%), Gaps = 43/328 (13%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
++ +++P GFP + + YM + K F +G F+ T + A A A
Sbjct: 113 VKDFILPAGFPVFL---FYAYMELKNQK-FVAVGVGSFSATT-------AAASAAASAAA 161
Query: 191 INWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAG----VELATAAVPHLF 246
I W+ KDG G VG++ F G +++ G +L+T P F
Sbjct: 162 IRWVSKDGIGAVGRL-----------------FIGLSVIKRAKGYKSIFDLSTQLYPAYF 204
Query: 247 LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIM- 305
L LA N+AK VA + I FA N+G+V AK E A LLG L IM
Sbjct: 205 LQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMV 264
Query: 306 -----ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPS 360
++ P L T+ + +++ YQ + + ++N R + V+S + VP
Sbjct: 265 LTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPG 324
Query: 361 LREGNLQENIFSFPWLRDKPVVLGSRFKDAF--QDPDAYL-AVEPFFEKERY--VVTYNP 415
+ N +E I S+ ++ G ++ + P + + + + +E+Y VV
Sbjct: 325 YVDCNRKEKILSWQRFLKPRIIFGVSLEELIGGERPVSKVRTLLKLYSQEKYILVVKQQG 384
Query: 416 SKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ +V+ K A S +L++ + +Y L
Sbjct: 385 TDFEVFVSFKVGATSLSVLRSIWQSYWL 412
>gi|429848918|gb|ELA24353.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 440
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 83/354 (23%), Positives = 140/354 (39%), Gaps = 51/354 (14%)
Query: 98 VAERDRSGKFKRNSIRAISLQSPQAPAEE--------LMSFIRSYV----VPDGFPDSVT 145
+ ERD SG +R + + + P E L IR P G+PDSV+
Sbjct: 12 IDERDSSGLIRRRWLHDHDGRLTEMPTENSPTTISRLLTMNIRQVFSDAFFPIGYPDSVS 71
Query: 146 PSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKM 205
Y+ Y + +L+ FF + + +L +GV ++ + ILKDG R+ +
Sbjct: 72 KDYLGYQLYDSLQAFFSTITSLLANRAILQGLGVGDADSSATYALLLTILKDGISRLATI 131
Query: 206 IFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPL---------ACAANV 255
FA R G + + K+ RF D+ + +A LF PL A
Sbjct: 132 GFAYRFGLVIEPECKKYRFLADIFND--------SAFFLDLFSPLFGSWTKVAALVVAEA 183
Query: 256 AKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLIT 315
+ + V + +++ + K FA N+ ++ AK L+G + + + S
Sbjct: 184 LRAMCGVAAGASKAALSKHFALRGNLSELNAKESSQETAVGLIGLIVGSFVVRHVESREA 243
Query: 316 TFAL---LSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS 372
FAL L ++ +Y V+ V L+ N+ R ++ E ++KT GN
Sbjct: 244 VFALMIILVFVHLGMNYLGVRCVQLNNFNQQRATIFFEEYMKT-------MGN------- 289
Query: 373 FPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKD 426
+R P + R F P Y P F K + +Y + Y +LK+
Sbjct: 290 ---VRLTPAEVAKRENIIFWKPVIYHDGRP-FAKIVFAKSYVDAMAPGYVVLKE 339
>gi|121709968|ref|XP_001272600.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
gi|119400750|gb|EAW11174.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
Length = 512
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 61/241 (25%), Positives = 110/241 (45%), Gaps = 23/241 (9%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
+ L+ + +P G+P SVT Y+P + +F A T L VGV A
Sbjct: 63 KTLVDLLVDVFLPSGYPHSVTDDYLP-------RVYFHLATPQPTNNPL--GVGVGNADA 113
Query: 185 TPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
TP + + IL+D +GR+ ++FA R G + + K R A D+ +L ++ + +P
Sbjct: 114 TPTSALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMIP 173
Query: 244 HLF--LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTG 301
+ + A V + + V S++ + FA+G N+ +V AK I L+G
Sbjct: 174 AGVSRITVLSTAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLVG-- 231
Query: 302 LSIMISKRNPSLITTFA-------LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
+++ S +T FA L ++ +Y V++V + +LNR R ++ + L
Sbjct: 232 --MLVGSLVVSHVTGFAATWLSLLFLLAAHLAMNYAAVRAVQMTSLNRQRANIVFSTLLD 289
Query: 355 T 355
+
Sbjct: 290 S 290
>gi|406860777|gb|EKD13834.1| DUF647 domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 501
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 118/237 (49%), Gaps = 18/237 (7%)
Query: 136 VPDGFPDSVTPSYVPYMTW-RALKHFFGGAM----GVFTTQTLLNS--------VGVSRN 182
+P G+P+SVT Y+ Y + RA + + G+ T+ L+S +GV +
Sbjct: 58 LPAGYPNSVTEDYLEYQIYVRAANKPWTQSELISEGLSTSVLKLHSWDAVLESRIGVGDS 117
Query: 183 RATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAA 241
A+P A + +L+ GR+ ++FA R G + + K R A D+ + ++ + A
Sbjct: 118 HASPTAALLLSVLQQSMGRIATIVFAHRLGTALEPECKMYRLAADIFNDAAMILDCLSPA 177
Query: 242 VPHLF-LPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGT 300
+P + L ++V +++ VT+ S++ + FA N+G++ AK + LLG
Sbjct: 178 LPKASRVALLSLSSVLRSMCGVTAGSSKASLSAHFATQGNLGELNAKDSSQETVISLLGM 237
Query: 301 -GLSIMISKRNPSLITTFA--LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
S+++S IT A LL ++ ++Y V++V + TLNR R ++ + + L+
Sbjct: 238 LAGSVVVSYITSKTITWIALILLLATHLGTNYLAVRAVCMQTLNRQRANLVLSNILQ 294
>gi|167378255|ref|XP_001734737.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165903647|gb|EDR29104.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 400
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 72/338 (21%), Positives = 143/338 (42%), Gaps = 39/338 (11%)
Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
Q P ++ + ++ +VP G+P SV Y ++ + + M V + +L
Sbjct: 35 QQPFQKSQNHLQELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL------ 88
Query: 181 RNRATPGAVAINWIL--KDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
R G A +++ + G + +I R G + KQ R DL+ EL +E+A
Sbjct: 89 --RFNSGHSAAFFVVFREVMTGMMHLIITERWGTSIVFFAKQWRMRIDLISELLRIIEIA 146
Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN----I 294
+ P +F ++ +V + V + +T I +AKG N+ ++T K + + +
Sbjct: 147 FISTP-IFYSMSVTCSVIEGVLQASRQVIKTRILNNYAKGNNVAELTEKVQNLETLFRVV 205
Query: 295 ADLLGTGLSIMISKRNPSLITTF--ALLS---CGYVFSSYQEVKSVVLHTLNRARFSVAV 349
+ L+G + ++ S I F AL+S C ++ S+ +V TLN R ++ +
Sbjct: 206 SLLIGWLILHVLGDFTESGIIVFICALISHCVCNFLLSN-----VIVFKTLNYERLTILM 260
Query: 350 ESFLKTGH------VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPF 403
F+ + V S E ++ N + +G + + Y+ +
Sbjct: 261 SYFMNIKNEILSPVVVSRMESKIKVN-------EMVNIRMGISLNSIPKSSNLYIEISQS 313
Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
+K +Y + Y S +++ +L D D+I ++ AY
Sbjct: 314 IKKYQYYL-YKTSTKQIWVILGDHITKDEIFESLLCAY 350
>gi|169604674|ref|XP_001795758.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
gi|111066623|gb|EAT87743.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
Length = 473
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 132/293 (45%), Gaps = 30/293 (10%)
Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
VGV + A+P A + +L++ GR+ ++FA R G + + K R A D+ + +
Sbjct: 113 VGVGDSSASPTAALLLSVLQESMGRIATILFAHRLGTALEPECKMYRLAADVFNDTAMIL 172
Query: 236 ELATAAVPHLFLPLACA-ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNI 294
+ + A P F L + ++V +++ V + S + + FA+ N+G+V A +
Sbjct: 173 DCLSPAFPKPFRVLVLSFSSVLRSLCGVCAGSAKASLSAHFARKGNLGEVNAHITHMNPN 232
Query: 295 ADLLGTGLSI---MISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
D GL I+ N LI ALLS ++ ++Y VK+V + LNR R ++ +
Sbjct: 233 VDFDIEGLEPGNGHIATWNAILI---ALLSI-HLATNYAAVKAVSMRCLNRQRANIVFGN 288
Query: 352 FLKTGHVPSLREGNLQENIFS----FPWLRD-----------KPVVLGS-----RFKDAF 391
+ G V + E + +E IF W+ D V+L S R +
Sbjct: 289 LMHHGSVLTPTEVSQRERIFERDGVLRWVDDCVIGHCTIGVPLGVLLASMSQRYRRTGSL 348
Query: 392 QDPDAYLA-VEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVL 443
+ D L + F ++YV+ + S+ + +LK+ D LKA HA +
Sbjct: 349 ELSDINLPDLMDIFADQKYVLWFTDSEYEAIVVLKEGCTPIDQLKAWAHALLF 401
>gi|194382298|dbj|BAG58904.1| unnamed protein product [Homo sapiens]
Length = 194
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 95/189 (50%), Gaps = 9/189 (4%)
Query: 202 VGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVA 260
+G+++FA +G K D + KQ R D+L ++ +E+ P F +N+AK +
Sbjct: 2 LGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIV 61
Query: 261 AVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNPSL-ITT 316
+V +TR + A+ N+ DV+AK E + N+A LL + L + + P +
Sbjct: 62 SVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGC 121
Query: 317 FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWL 376
F L+ +++++Y+ V+++V+ TLN R + ++ +L+ G V N E + W
Sbjct: 122 FFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL----WT 177
Query: 377 RDKPVVLGS 385
D LGS
Sbjct: 178 GDPAPGLGS 186
>gi|345563243|gb|EGX46246.1| hypothetical protein AOL_s00110g70 [Arthrobotrys oligospora ATCC
24927]
Length = 445
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 72/303 (23%), Positives = 135/303 (44%), Gaps = 54/303 (17%)
Query: 98 VAERDRSG----KFKRNS----------IRAISLQSPQAPAEELMSFIRSYVVPDGFPDS 143
+ ERD+SG ++++ +++ SLQ PQA + + I + +P G+P S
Sbjct: 13 IVERDQSGIITAEYRQQGNQNIVQVHQVVQSNSLQ-PQAISH--IQNIINIFLPAGYPKS 69
Query: 144 VTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVG 203
VT Y+ G + +VG + A+ A + I++D GRV
Sbjct: 70 VTDDYLD------------GDL----------AVGFEDSNASTTAAVLLKIVQDSIGRVA 107
Query: 204 KMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAA 261
++FA R G + + K+ RFA D+ + V+ + P L + L A + +
Sbjct: 108 TILFAHRLGSSLEPECKRFRFAADIFNDSALIVDCLSPVFPKPLRVVLLSTAGTLRALCG 167
Query: 262 VTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGL-----------SIMISKRN 310
V ++ + F+K N+ ++ AK + L+G + S++IS
Sbjct: 168 VAGGGSKAQLSVHFSKWGNVAELNAKDSSQETVISLMGMLMRADQTLRVKAGSLVISSVT 227
Query: 311 PSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQE 368
T + L +++++Y+ VK+V + TLNR R + + +F +TG + + ++ QE
Sbjct: 228 SKTATWASMLFLIAVHLWTNYRAVKAVTMVTLNRQRANSVMSTFAETGVMLTPKDVARQE 287
Query: 369 NIF 371
IF
Sbjct: 288 RIF 290
>gi|397575094|gb|EJK49531.1| hypothetical protein THAOC_31586, partial [Thalassiosira oceanica]
Length = 194
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 79/153 (51%), Gaps = 12/153 (7%)
Query: 59 LLQSESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ 118
++ ++ DE V +Q + SS D D RR Y +E GK+ + ++
Sbjct: 52 IVSTDKDENKV--QVWEQTISSSNANKNGGDDDSRRTLYRSEG--PGKW-------LKVE 100
Query: 119 SPQAPAEEL-MSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSV 177
SP A + + ++ I + +P +P+SV SY + ++ + G A V +TQ LL +V
Sbjct: 101 SPPASRQSMSLTSIVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAV 160
Query: 178 GVSRNRATPGAVAINWILKDGAGRVGKMIFARQ 210
GV A P A A+NW+LKD AG++G ++F Q
Sbjct: 161 GVGSQSAAPMAAALNWLLKDFAGQLGGVLFVSQ 193
>gi|62320358|dbj|BAD94740.1| putative protein [Arabidopsis thaliana]
Length = 295
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 105/230 (45%), Gaps = 11/230 (4%)
Query: 225 GDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDV 284
D + G+ +LAT P FL LA N+AK VA + I FA N+G+V
Sbjct: 3 ADFIGSAGSFFDLATQLYPSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEV 62
Query: 285 TAKGECVGNIADLLGTGLSIMI------SKRNPSLITTFALLSCGYVFSSYQEVKSVVLH 338
AK E A L+G G I+I K P ++ T+ + +++ YQ + + +
Sbjct: 63 AAKEEVWEVAAQLIGLGFGILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFN 122
Query: 339 TLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAF---QDPD 395
T+N R + VES + VP + N +ENI + ++ G ++ +
Sbjct: 123 TVNLKRARIIVESHVVHSVVPGYVDCNKRENILLWQRFMKPRIIFGVSLEELSGLEKSVS 182
Query: 396 AYLAVEPFFEKERYVVTYNP-SKGKVYAL-LKDQAKSDDILKAAFHAYVL 443
A+ + KE+Y++T N +K +++ K A S D+L+ + AY L
Sbjct: 183 KVKALLKMYTKEKYILTLNKLNKDTEFSVSFKVNATSRDVLRCLWQAYWL 232
>gi|296826902|ref|XP_002851048.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
gi|238838602|gb|EEQ28264.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
Length = 479
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 86/406 (21%), Positives = 159/406 (39%), Gaps = 88/406 (21%)
Query: 115 ISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTW-------------------- 154
I+L Q + ++ I + +P G+P SVT Y+ Y +
Sbjct: 41 ITLPPRQPSSTSYLTSILNVFLPSGYPHSVTDDYLEYQIYVSQQPFPSSDRKEKKLTVSY 100
Query: 155 -RALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKK 213
+L+ F G+ +++ +L VGV A+P A + +L++ GR+ ++FA +
Sbjct: 101 QDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAALLLNVLQESMGRIATILFAHR--- 157
Query: 214 FDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA----ANVAKNVAAVTSTSTRT 269
LG +E P P+ ++V + + V + S++
Sbjct: 158 -----------------LGTSLEPEYCLSPVFPKPVRVGFLSLSSVLRALCGVAAGSSKA 200
Query: 270 PIYKAFAKGENIGDVTAKGECVGNIADLLGTGL-SIMISKRNPSLITTFAL--LSCGYVF 326
+ FA+ N+G++ AK + L+G + S++++ + L T AL L ++
Sbjct: 201 SLSAHFARWGNLGELNAKDSSQETVISLVGMLVGSVVVAYVSSPLATWVALLFLLAIHLA 260
Query: 327 SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVV 382
+++ V++V + TLNR R ++ S V + + + +E IF W + V
Sbjct: 261 TNHAAVRAVNMTTLNRQRANIVFSSLFDDDRVLTPAQVSKEERIFERDGVLRWKAEATGV 320
Query: 383 LG--------------------------SRFKDAFQDPDAYLAV-EPFFEKERYVVTYNP 415
LG SR +A D L V F++E Y++ ++P
Sbjct: 321 LGTCRVGVSLEQLLLLLPHSQTNDASKTSRRSEAGADFGVSLPVLVDLFKEEEYLLYFSP 380
Query: 416 SKGKVYALLKDQAKSDDILKAAFHAYVLLH---------CINSSNG 452
+LK + LKA HA ++ CIN G
Sbjct: 381 RDHAAVIILKAGVTAQAQLKAWSHALLVSRRLAGRSTSACINEKQG 426
>gi|414876406|tpg|DAA53537.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
Length = 295
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/230 (24%), Positives = 101/230 (43%), Gaps = 12/230 (5%)
Query: 226 DLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVT 285
D + G+ EL+T P FLPLA N+AK +A + I FA+ N+G+V
Sbjct: 4 DFIGSAGSIFELSTTLYPGYFLPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLGEVA 63
Query: 286 AKGECVGNIADLLGTGLSIMISKRNP------SLITTFALLSCGYVFSSYQEVKSVVLHT 339
AK E A LLG + ++I + +L+ T+ + +++ YQ + + T
Sbjct: 64 AKEEVWEVGAQLLGLSIGVLIMDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLKFRT 123
Query: 340 LNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLG---SRFKDAFQDPDA 396
+N R + V+S + VP N +ENI ++ V G R + D
Sbjct: 124 INLKRGRILVKSHVAQHTVPGYVACNEEENILTWERYLRPQVSFGVPMERMLGGDKSSDM 183
Query: 397 YLAVEPFFEKERYVVTYNPSKGKVYALL---KDQAKSDDILKAAFHAYVL 443
+ ++ E+Y++ + + L K+ A S +L++ + A+ L
Sbjct: 184 VNRLLKLYKNEKYILFFEQFGSREPTFLVTFKESATSMSVLRSLWQAHWL 233
>gi|189196610|ref|XP_001934643.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980522|gb|EDU47148.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 333
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/238 (23%), Positives = 110/238 (46%), Gaps = 11/238 (4%)
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
+L+ F G+ ++ +L VGV + ATP + +++ G++ ++FA R G
Sbjct: 42 SLQAFSSSIAGMLASRAILEGVGVGDSTATPTTALLLSVIQTSVGQIATILFAHRLGTAL 101
Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYK 273
+ + K R A D+ + ++ + A P + + + ++ + + V + S++ +
Sbjct: 102 EPECKMYRLAADVFNDTAMVLDCLSPAFPKSIRVAVLSFSSCLRALCGVCAGSSKASLSA 161
Query: 274 AFAKGENIGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVF---SSYQ 330
FA N+G+V AK + LLG ++ + S + T+ L ++Y
Sbjct: 162 HFAHKGNLGEVNAKDSSQETVISLLGMLAGSLVVSQITSPLATWGTLILLLTIHLATNYA 221
Query: 331 EVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG 384
V++V +H LNR R ++ + + G V S R+ + +E +F W DK VLG
Sbjct: 222 AVRAVNMHCLNRQRANILFSNMFQHGLVLSPRDVSQRERVFERGGVLRWSDDK--VLG 277
>gi|303288586|ref|XP_003063581.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454649|gb|EEH51954.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 290
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 56/248 (22%), Positives = 93/248 (37%), Gaps = 56/248 (22%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGV-SRNRATPGAVAINWI 194
+P G+PD+V+ Y ++ W +FF + V T+Q+LL ++G+ S A P A A W+
Sbjct: 3 LPAGYPDTVSSDYARWLRWHLFSNFFRNVLEVLTSQSLLVALGMGSTPGALPLAAATKWV 62
Query: 195 LKDGAGRVGKMIFARQG-KKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAA 253
LKDG G ++ G +++D D P CA
Sbjct: 63 LKDGVGSFATLLSGVVGAQRYDED------------------------------PKRCAT 92
Query: 254 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK----- 308
V ++ + F + +N GD+ AK E G L I I +
Sbjct: 93 FVRAGALTGRTSLVNGTFMQHFGRNQNAGDIRAKLEVQGKWLALCALPTGIFIFRGVSGA 152
Query: 309 -------------------RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAV 349
R+ +I+ + + G+ F S++ + + LNR R A
Sbjct: 153 FEPLPPETLETMTLLDHLTRSLPVISAYGAVVGGHSFLSWKAASVLRVDVLNRFRLLKAA 212
Query: 350 ESFLKTGH 357
F+ G
Sbjct: 213 REFVDGGE 220
>gi|325088852|gb|EGC42162.1| DUF647 domain-containing protein [Ajellomyces capsulatus H88]
Length = 507
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/320 (24%), Positives = 128/320 (40%), Gaps = 57/320 (17%)
Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
VGV A+P ++ +L+D GR+ ++FA R G + + K R A D+L +
Sbjct: 31 VGVGDASASPTVALLHSVLQDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIF 90
Query: 236 ELATAAVPHLFLPLACA-ANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA---KGECV 291
+ + A+P + A ++V + + V + S++ + FAK N+G++ A V
Sbjct: 91 DCLSPALPKRLRVVVLAFSSVLRALCGVAAGSSKASLSSHFAKCGNLGELNAVRFPNVLV 150
Query: 292 GNIADLLGTGLSIMIS------KRNPSLITTFALLS--CGYVFSS--------------- 328
I+ G + S +++ S T +LL CG V S
Sbjct: 151 IAISGAAQRGFPFLFSDADQKLQKDSSQETVISLLGILCGSVVVSHISTPFATWTTLLLL 210
Query: 329 --------YQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWL 376
Y V+SV + TLNR R ++ + + G V + E + E IF W
Sbjct: 211 LLVHLSTNYAAVRSVNMTTLNRQRTNIVFSTLFEKGSVLTPTEASKCERIFERDGILRWK 270
Query: 377 RDKPVVLGSRFKDAFQD----------------PDAYLAVEPFFEKERYVVTYNPSKGKV 420
+ + +FQ+ D Y + FEKE YV+ +NP K
Sbjct: 271 ASSATLGYCQIGGSFQELLRGSIHGANSIRDVGIDIYKLLR-LFEKEEYVLWFNPIHKKG 329
Query: 421 YALLKDQAKSDDILKAAFHA 440
+LK+ L+A HA
Sbjct: 330 TIVLKNNVTPISQLQAWSHA 349
>gi|330932017|ref|XP_003303615.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
gi|311320239|gb|EFQ88270.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
Length = 391
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 105/217 (48%), Gaps = 11/217 (5%)
Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
VGV + ATP + ++ GR+ ++FA R G + + K RFA D+ + +
Sbjct: 26 VGVGDSTATPTTALLLSVIHSSTGRIATILFAHRLGTALEPECKMYRFAADIFNDTAMVL 85
Query: 236 ELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNI 294
+ + A P + + + ++ + V + S++ + FA+ N+G+V AK +
Sbjct: 86 DCLSPAFPKPIRVAVLSFSSCLSALCGVCAGSSKASLSAHFARKGNLGEVNAKDSSQETV 145
Query: 295 ADLLGT-GLSIMISKRNPSLIT--TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
LLG S++IS L T T LL ++ ++Y V++V++H LNR R ++ +
Sbjct: 146 ISLLGMLAGSVVISYITSPLATWGTLILLLTIHLATNYAAVRAVIMHCLNRQRANILFSN 205
Query: 352 FLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG 384
+ G V S R+ + +E +F W DK VLG
Sbjct: 206 MFQHGLVLSPRDVSQRERVFERGGVLRWSDDK--VLG 240
>gi|299740308|ref|XP_001838896.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
gi|298404184|gb|EAU82827.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
Length = 768
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 57/234 (24%), Positives = 102/234 (43%), Gaps = 13/234 (5%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
++L F ++ +P G+P+SVTP Y+ Y AL+ F + +++ L GV A
Sbjct: 321 QKLRDFGKAVFLPAGYPNSVTPDYLTYQVLNALQAFCSSLASLISSRATLQGFGVGSASA 380
Query: 185 TPGAVAINWILKDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVE-----LA 238
+ + IL D R+ ++ A G + K+ RF D+L ++ ++
Sbjct: 381 SANDALLLTILLDIYSRLTTIVSAHLLGSSLATEAKKYRFVADILNDVAVVLDTLSPVFK 440
Query: 239 TAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFA---KGE-NIGDVTAKGECVGNI 294
P L + C + + + + + ++ I FA KG+ ++GD+ AK +
Sbjct: 441 AQGYPQLRVVALCLSASCRALCGIAAGGSKAAISVHFATPVKGKGDLGDLNAKDSSKETV 500
Query: 295 ADLLGTGLSIMISKRNP---SLITTFALLSCGYVFSSYQEVKSVVLHTLNRARF 345
L G L +I R S TT +L ++ +Y V +VL T+N R
Sbjct: 501 LALFGMLLGSLIVPRLQTAWSTYTTLFILLSLHIAINYVAVTGLVLRTMNWQRM 554
>gi|390599626|gb|EIN09022.1| DUF647-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 461
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/342 (22%), Positives = 136/342 (39%), Gaps = 31/342 (9%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L + +P G+P SV+P Y+ Y T+ A + F + ++ +L GV A+
Sbjct: 29 LKHLLTDVFLPAGYPSSVSPDYLRYQTYNAFQAFCSSLANLIASRAVLEGFGVGNASASA 88
Query: 187 GAVAINWILKDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 245
+ +L+D GR+ + G + K R D+ + A + L +
Sbjct: 89 THAMLLTVLQDVFGRLTTVAGGYYLGTALSPEAKTYRLLADVFND--ASIVLDVFQPFYA 146
Query: 246 FLPLA-------CAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG----ECVGNI 294
+P+A C++ + + V + ++ + FA ++GD+ AK +G +
Sbjct: 147 SVPVAGARVLALCSSGCLRALCGVCAGGSKAALAVHFASAGDLGDLNAKDGSRETVIGLV 206
Query: 295 ADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLK 354
L GT L I + ALL ++ +Y V++V + TL R R S+A +++
Sbjct: 207 GLLSGTILLSYIGTPGATYAALCALLGA-HLALNYLAVRAVAMRTLTRQRASIAWRVWME 265
Query: 355 --TGHVPSLREGNLQENIFSFPWL-------RDKPVVLGSRFKDAF----QDPDAYLAVE 401
G P+ E +E + + P L R G F A ++
Sbjct: 266 RGAGAAPTPEEVAEREGLLADPTLIADGTGRRLGRCSFGGARSQGFIATTGKDSAMVSTS 325
Query: 402 PFFEKERYVVTYNPSKGKV---YALLKDQAKSDDILKAAFHA 440
E RY + + G + LLK +D+LKA HA
Sbjct: 326 RLIEAARYWLLIDSKPGSIPTATILLKRVYGPEDLLKAWVHA 367
>gi|151555803|gb|AAI49327.1| FLJ13868 protein [Bos taurus]
Length = 158
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 43/73 (58%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFPDSV+P Y+PY W +++ F G T +L +GV +A+ A W++
Sbjct: 82 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 141
Query: 196 KDGAGRVGKMIFA 208
KD G +G+++FA
Sbjct: 142 KDSTGMLGRIVFA 154
>gi|3242710|gb|AAC23762.1| hypothetical protein [Arabidopsis thaliana]
Length = 405
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 16/116 (13%)
Query: 103 RSGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGF----------------PDSVTP 146
+SGK R I+ SL+ EE S + DGF P V+
Sbjct: 71 KSGKVSRYFIKGDSLELLCVDEEEDDSTSFCLGLDDGFWKLIRLTSSAAKDFFLPKQVSD 130
Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRV 202
+Y+ Y+ W+ L F A+ V TQ + ++G+ ++R+ + A NWILKDG GR+
Sbjct: 131 NYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSLASSAAFNWILKDGLGRL 186
>gi|302417830|ref|XP_003006746.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261354348|gb|EEY16776.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 228
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 54/110 (49%), Gaps = 1/110 (0%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
F+R +P +PDSV+P Y+ Y + + + FF G+ + +L +GV +
Sbjct: 76 FLRDAFLPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHA 135
Query: 190 AINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELA 238
+ LKDG R+ + FA R G + + K+ RF DL + +ELA
Sbjct: 136 LLLTCLKDGVSRLAMITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELA 185
>gi|115389162|ref|XP_001212086.1| predicted protein [Aspergillus terreus NIH2624]
gi|114194482|gb|EAU36182.1| predicted protein [Aspergillus terreus NIH2624]
Length = 411
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 60/118 (50%), Gaps = 1/118 (0%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L + + +P G+P SVT Y Y + +L+ F G+ +++ +L VGV A+P
Sbjct: 53 LKTLLIEVFLPAGYPHSVTDDYTAYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNPNASP 112
Query: 187 GAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVP 243
+ + IL+D +GR+ ++FA R G + K R A DL +L ++ + A P
Sbjct: 113 TSALLLHILQDTSGRLATILFAHRVGTALQPECKSYRLAADLFNDLAMILDCLSPAAP 170
>gi|452984686|gb|EME84443.1| hypothetical protein MYCFIDRAFT_187449 [Pseudocercospora fijiensis
CIRAD86]
Length = 435
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 102/213 (47%), Gaps = 21/213 (9%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFP SV+ Y+ Y + +L+ F + +++ +L+SVGV + A+P A I IL
Sbjct: 48 LPVGFPHSVSEDYLEYQIYDSLQAFSSSIASLLSSRAVLSSVGVGDSTASPTAAIILSIL 107
Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAAN 254
+D GR+ ++FA G + + K R D P + + + ++
Sbjct: 108 QDSVGRLATILFAHSFGTSLEPECKMYRLLAD----------------P-IRVVVLSLSS 150
Query: 255 VAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNIADLLGTGLSIMISKRNP 311
V +++ V + S++ + FAK N+G++ AK E V ++ ++ L I
Sbjct: 151 VLRSLCGVAAGSSKASLSAHFAKQGNLGELNAKDSSQETVISLVGMMAGSLVISWITTPI 210
Query: 312 SLITTFALLSCGYVFSSYQEVKSVVLHTLNRAR 344
+ T LL ++ ++ + V++V + TL R R
Sbjct: 211 ATWTALILLLSIHLETNRRAVRAVKMRTLTRQR 243
>gi|399218598|emb|CCF75485.1| unnamed protein product [Babesia microti strain RI]
Length = 635
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 117/282 (41%), Gaps = 40/282 (14%)
Query: 107 FKRNSIRAISLQSPQAPAE--------ELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALK 158
++ NS+ L+SP+A + ++ I+ + +P VT +Y Y WR L+
Sbjct: 75 YRNNSLYNKRLKSPEAEKDPTFVTKLSNSLAKIKKRFLNIVYPSGVTSNYYKYANWRLLE 134
Query: 159 HFFGG-----AMGVFTTQTLLNSVGVSRNRATPG-AVAINWILKDGAGRVGKMIFARQ-- 210
F + + LLN S N A I++I + ++ K I +R
Sbjct: 135 RFSHSMSQSISNNPLAAKILLNQSTYSTNDGVINIAYKIDFIATNLMTQIFKEIVSRIVK 194
Query: 211 -------GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA-CAANVAKNVAAV 262
G FD + + F G L+ + + + ++ + + + + +
Sbjct: 195 TMWVGNIGIGFDMNPRAFCFWGSLICTITNTADFLSTLFNFKYISITRHITTLFRQIGML 254
Query: 263 TSTSTRTPIYKAFAKGEN---IGDVTAKGECVGNIADLLGTGLSIMISKRNPSLITTFAL 319
T++++ PIY +F N IG++TAK E V + DLLG+ +IS LI +
Sbjct: 255 TTSASIGPIYGSFKSQNNVSNIGEITAKMEAVTPLCDLLGSATGSVIS----YLIAPYQY 310
Query: 320 LSCGYV---------FSSYQEVKSVVLHTLNRARFSVAVESF 352
G +SSY V+ VV +LN +R V +E F
Sbjct: 311 YYRGIAFAIAALIANYSSYCAVQMVVFRSLNISRCFVVLEDF 352
>gi|403418386|emb|CCM05086.1| predicted protein [Fibroporia radiculosa]
Length = 497
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 95/420 (22%), Positives = 169/420 (40%), Gaps = 86/420 (20%)
Query: 98 VAERDRSGKF-----------KRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTP 146
+AERD SG+ R+ + S++ + L+S + +P G+P SV+P
Sbjct: 6 IAERDDSGRLLATYLCAAGEHTRSDTPSFSIKQKRTDWRRLLSNV---FLPAGYPASVSP 62
Query: 147 SYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMI 206
Y+ Y + AL+ F + ++ +L GV A+ +L+D R+ ++
Sbjct: 63 DYLQYQIYNALQAFCSSLASLIASRAVLEGHGVGNADASATHAIFLTVLQDIFSRLTTIV 122
Query: 207 FARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL---FLPL------------- 249
G + K R D+ + A + L T + PHL L L
Sbjct: 123 SGYYLGTSLFPEAKTYRLLADIFND--AAIVLDTLS-PHLDGVVLSLRYPFVLSGSGSSL 179
Query: 250 ----ACAANVAKNVAAVTSTSTRTPIYKAFAK-GE---NIGDVTAKGECVGNIADLLG-- 299
C + ++ + + + ++ + FA G+ ++GD+ AK + LLG
Sbjct: 180 RVSALCLSGASRALCGMVAGGSKAALTVHFASPGQGTGDVGDLNAKDGSKETVLALLGML 239
Query: 300 TGLSIM--ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGH 357
+G ++M + + + F +L+ ++F +Y V+ VVL +LNR R ++ ++ H
Sbjct: 240 SGTAVMHYVHSARATYLVLF-VLTFLHLFGNYLAVRVVVLRSLNRQRANIVWMAY--RAH 296
Query: 358 VPSLREGN-----------LQENIFSFP-WLRD-----KPVV----LGSRFKD------- 389
+ G +E IF+ P L D P + +GS F
Sbjct: 297 IKGPMGGKRPLVLTPEAVAARERIFANPSALHDTSSPAHPFLGICKVGSSFSSLASRQQS 356
Query: 390 ------AFQDPDAYLAVEPFFEKERYVVTYNPSKG---KVYALLKDQAKSDDILKAAFHA 440
A P + F +E YV+ + P+ G +++ LKD K D LKA HA
Sbjct: 357 APRGALAAPSPPQIHTILEIFSEEPYVLWFGPTSGPSCRLHVFLKDGHKPRDHLKAWTHA 416
>gi|392558453|gb|EIW51641.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 450
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 79/363 (21%), Positives = 146/363 (40%), Gaps = 47/363 (12%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P+G+P +V+P V Y W A++ F G+F ++ +L GV A+ IL
Sbjct: 29 LPEGYPFTVSPQCVRYQIWNAVQAFCSSLAGLFASRAVLQGHGVGNASASATDAIFLTIL 88
Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLAC--- 251
+D R+ ++ G + K R D+L + + + + H L+
Sbjct: 89 QDVFSRLTTIVSGYYLGTSLYPEAKTYRLLADILNDAAMVTDTLSPHLAHFSFSLSYPFV 148
Query: 252 --AANVAKNVAAVTSTSTRTPIYKAFAKGE----------------NIGDVTAKGECVGN 293
A+N + V A+ + + A A G ++GD++AK
Sbjct: 149 VPASNSSLRVVALCMSGAFRALCGAVAGGSKAALTVHFATAGERPGDVGDLSAKDGSKET 208
Query: 294 IADLLG--TGLSIM--ISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAV 349
+ LLG +G ++ + + + F L+ C ++ +++ V+ + + NR R +A
Sbjct: 209 VLALLGMLSGSVVLRYVHTAQATYVVLFGLIFC-HLAANFIAVRVIAMRAFNRQRAGIAW 267
Query: 350 ESFLKTGH----VPSLREGNLQENIFS----FPWLRD-----KPVVLGSRF-----KDAF 391
F ++ VP QE IF+ P + LG F ++ F
Sbjct: 268 CEFRRSFDVVTVVPDYLHVARQERIFTDSARIPISQHARGTYAHCALGVSFLSLSGREPF 327
Query: 392 QDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLH-CINSS 450
D +E F E+Y++ + ++ +LKD D LKA HA+ + + C S
Sbjct: 328 SDEQIATLLE-VFAHEKYILWPASNPPQIAVVLKDGHGPHDHLKAWAHAHEIANICDRES 386
Query: 451 NGH 453
+ H
Sbjct: 387 HAH 389
>gi|326469821|gb|EGD93830.1| hypothetical protein TESG_01362 [Trichophyton tonsurans CBS 112818]
Length = 413
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 72/311 (23%), Positives = 132/311 (42%), Gaps = 40/311 (12%)
Query: 177 VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGV 235
VGV A+P A + +L++ GR+ ++FA R G + + K R A D+L + +
Sbjct: 40 VGVGDATASPTAALLLNVLQESMGRIATILFAHRLGTSLEPECKMYRLAADILNDSAMVL 99
Query: 236 ELATAAVPH-LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNI 294
+ + P + + L ++V + + V + S++ + FA+ N+G++ A G NI
Sbjct: 100 DCLSPIFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVGL---NI 156
Query: 295 --ADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
+ L G LS + S + I LL ++ +++ V++V + TLNR R ++
Sbjct: 157 PPSQLTGVVLSYITSPLE-TWIALIVLLIV-HLGTNHAAVRAVKMTTLNRQRANIVFSHL 214
Query: 353 LKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG------------------------ 384
+ + + E + +E IF W + +LG
Sbjct: 215 FEDNLILTPAETSKEERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRGAL 274
Query: 385 --SRFKDAFQDPDAYL-AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAY 441
SR D L A+ F +E Y++ + P + +LK A + LKA HA
Sbjct: 275 ETSRTTSTTMDASTNLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKAWSHAL 334
Query: 442 VLLHCINSSNG 452
++ + NG
Sbjct: 335 LVSRHLARKNG 345
>gi|321459665|gb|EFX70716.1| hypothetical protein DAPPUDRAFT_327911 [Daphnia pulex]
Length = 269
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 64/136 (47%), Gaps = 21/136 (15%)
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGR-VGKMIFAR-QGKK 213
A++ F G G +T+ +L VGV + +TP A + W++KDG+ R VG+++FA G K
Sbjct: 11 AIQAFVGSLSGSLSTRAILEGVGVGSSTSTPLAATLMWLIKDGSTRMVGRILFAWCNGTK 70
Query: 214 FDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYK 273
D D A + C A VAK++ V +TR + +
Sbjct: 71 LDAD-------------------TAVGGSNGTMTSILCVAGVAKSIVGVAGGATRAALTQ 111
Query: 274 AFAKGENIGDVTAKGE 289
A+ N+ DV+AK E
Sbjct: 112 HQARQGNLADVSAKVE 127
>gi|224010239|ref|XP_002294077.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970094|gb|EED88432.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 354
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 79/170 (46%), Gaps = 19/170 (11%)
Query: 140 FPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGA 199
+P +V P Y+ Y + +L+ GV +T +L++ GV AT + A+ W ++DG
Sbjct: 19 YPHTVAPGYLEYQFYDSLQGLSSYLRGVVSTTAVLSAAGVGDAAATAMSAAMTWAIRDGL 78
Query: 200 GRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKN 258
G VG ++F+ FD +K+ R D+L ++G +++A LP+ + A
Sbjct: 79 GMVGGLLFSYVASSHFDAHVKEFRLVADVLNDVGLTLDMA--------LPVVLSWTRA-- 128
Query: 259 VAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK 308
T+ I F+ N DV+AK + L+G L + +S
Sbjct: 129 --------TKGNITDHFSISGNRADVSAKESTQETLVSLIGMALGVWLSN 170
>gi|56752623|gb|AAW24525.1| SJCHGC08554 protein [Schistosoma japonicum]
Length = 127
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 42/73 (57%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ Y+ Y W ++ F G +Q +L VGV + AT ++ W+
Sbjct: 46 LPVGYPSSVSDDYLEYQIWDTIQAFASSITGALASQAVLIGVGVGNSSATILGASLTWMF 105
Query: 196 KDGAGRVGKMIFA 208
KDG+G +G+++FA
Sbjct: 106 KDGSGMIGRILFA 118
>gi|332372572|gb|AEE61428.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 101/205 (49%), Gaps = 17/205 (8%)
Query: 253 ANVAKNVAAVTST-STRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMI----- 306
A+ +N+ V +TR I A +N+ +++AK I +L+G+ SI +
Sbjct: 28 ASALRNIDYVCYCGATRASITHHQAIRDNMAEISAKDGTQETIVNLIGSFGSIFLLNYFS 87
Query: 307 -SKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGN 365
S+ LI + +L ++++++ VK+++ +T NR R ++ ++S+ G V + + N
Sbjct: 88 SSRAEWILIISLMVL---HLYTNFLAVKALIFNTFNRERLALVLKSYFTIGSVLNPAKVN 144
Query: 366 LQENIFSFPWLRDK-----PVVLGSRFKDAFQDPDA--YLAVEPFFEKERYVVTYNPSKG 418
+ E++ L K +VLG K + D A ++ ++ ++Y++ N +K
Sbjct: 145 VNESVLLGFGLNVKRMCGFDIVLGESIKKSLNDYAAADIKLLQSVYKDKKYMLLVNGTKK 204
Query: 419 KVYALLKDQAKSDDILKAAFHAYVL 443
K+Y L+ + D++ FHA L
Sbjct: 205 KIYVCLEKGETAADVIAGYFHAVCL 229
>gi|429329557|gb|AFZ81316.1| hypothetical protein BEWA_007250 [Babesia equi]
Length = 743
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 57/240 (23%), Positives = 90/240 (37%), Gaps = 30/240 (12%)
Query: 141 PDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAI--------- 191
P T Y Y WR + F + + + +N+ + +
Sbjct: 176 PSVATKQYYKYAHWRMFERFSFSVLQTMASNIKSAPTNLQQNKVENTNIILMLKNKISES 235
Query: 192 -------NWILKDGAGRVGKMI-FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAV- 242
I KD R+ + F+ G FD + K R G +L ++
Sbjct: 236 KMKSILATMIFKDAFSRIFHFLWFSEIGVGFDNNPKAFRLLGSILCSTATLIDFVCNVFT 295
Query: 243 --PHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFA---KGENIGDVTAKGECVGNIADL 297
P + + AC N + + +T ++++ P Y +F NIG++TAK E I D
Sbjct: 296 FGPKMIMG-ACT-NAVRQIGLLTMSASQGPFYNSFHVKNTATNIGEITAKLEAQNPICDF 353
Query: 298 LGTGLSIMIS---KRNPSLI--TTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
G L I ++ P +I TT L S ++Y VKSV LN R V +E F
Sbjct: 354 SGIALGIYLTNLLNEQPFVIQATTVILTSLISNVATYMCVKSVSFKNLNPQRCFVVLEDF 413
>gi|225560567|gb|EEH08848.1| DUF647 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 472
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 129/326 (39%), Gaps = 67/326 (20%)
Query: 200 GRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLACA-ANVAK 257
GR+ ++FA R G + + K R A D+L + + + A+P + A ++V +
Sbjct: 2 GRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVLR 61
Query: 258 NVAAVTSTSTRTPIYKAFAKGENIGDVTA---KGECVGNIADLLGTGLSIMIS------K 308
+ V + S++ + FAK N+G++ A V I+ G + S +
Sbjct: 62 ALCGVAAGSSKASLSSHFAKWGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKLQ 121
Query: 309 RNPSLITTFALLS--CGYVFSS-----------------------YQEVKSVVLHTLNRA 343
++ S T +LL CG V S Y V+SV + TLNR
Sbjct: 122 KDSSQETVISLLGMLCGSVVVSHISTPFATWTTLLLLLLVHLSTNYAAVRSVNMTTLNRQ 181
Query: 344 RFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDKPVVLG-SRFKDAFQD----- 393
R ++ + + G V + E + E IF W + P LG + +FQ+
Sbjct: 182 RANIVFSTLFEKGSVLTPTEASKCERIFERDGILRW-KASPAALGYCQIGGSFQELLRGS 240
Query: 394 -----------PDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYV 442
D Y + FEKE YV+ +NP K +LK+ LKA HA +
Sbjct: 241 THGANSIRDVGIDIYKLLR-LFEKEEYVLWFNPIHKKGTIVLKNNVTPISQLKAWSHALI 299
Query: 443 LLHCINSSNGHRSLALKQHEYDNSNL 468
+ R L++ E D N+
Sbjct: 300 V--------AKRLTDLRREEADRINV 317
>gi|302678497|ref|XP_003028931.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
gi|300102620|gb|EFI94028.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
Length = 278
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/252 (25%), Positives = 106/252 (42%), Gaps = 13/252 (5%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P GFP+SV+P Y+ Y + A++ F G+ +++ LL GV AT + +
Sbjct: 9 LPAGFPNSVSPDYLRYQIYNAIQAFCSSLAGLLSSRALLEGFGVGDANATATNAVLLTVA 68
Query: 196 KDGAGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATA---AVPHLFLPLA- 250
+D R+ +I A G + K RF D+ + + +T A H L +A
Sbjct: 69 QDVFSRLTTIIAAYLLGPSLFPEAKTFRFLADVFNDTAIIFDASTPILRASGHANLRVAA 128
Query: 251 -CAANVAKNVAAVTSTSTRTPIYKAFA---KGE-NIGDVTAKG---ECVGNIADLLGTGL 302
C A ++ + + + FA +G +IGDV AK E + + +L
Sbjct: 129 LCLAGASRAICGTICGGAKAALTLHFATPLRGTGDIGDVNAKDASKETLLALMGMLAGSF 188
Query: 303 SIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLR 362
+ + L ++ +Y V+ V L TLNR R S+A SF + G PS
Sbjct: 189 VVHHLSSTQATHALLLALLAAHLTVNYLGVRGVCLRTLNRQRASLAWTSFREYGSAPSPT 248
Query: 363 EGNLQENIFSFP 374
E I + P
Sbjct: 249 IVAGMEGILARP 260
>gi|255945167|ref|XP_002563351.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588086|emb|CAP86157.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 504
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 106/246 (43%), Gaps = 15/246 (6%)
Query: 125 EELMSFIRSYVVPDGFPDSVT-----PSYVPYMTWRALKH----FFGGAMGVFTTQTLLN 175
+ ++S + +P G+P SV+ P +P + R++ H T T
Sbjct: 59 KPIISSLSDVFLPSGYPQSVSDDYLPPPLLPSGSSRSISHPAQLHRPTQRAKQLTPTQHT 118
Query: 176 SVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAG 234
VGV A+P + + IL+D +GR+ + FA R G + + K R A D+ ++
Sbjct: 119 GVGVGNANASPTSALLLHILQDISGRIATICFAHRIGTALEPECKTYRLAADVFNDIAMI 178
Query: 235 VELATAAVPH--LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVG 292
++ + VP + + A V + + V S++ + FAK N+ ++ AK
Sbjct: 179 LDCLSPGVPAGPARVAVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQE 238
Query: 293 NIADLLGTGLSIMISKRNPSLITTFALLSCGYVFS---SYQEVKSVVLHTLNRARFSVAV 349
+ L G + ++ S TT+ +L +Y V++V + +LNR R ++
Sbjct: 239 TVISLFGMLVGSVVISHITSFSTTWLILLILLALHLSMNYAAVRAVQMTSLNRQRANIVF 298
Query: 350 ESFLKT 355
+ +
Sbjct: 299 STLFDS 304
>gi|17945727|gb|AAL48912.1| RE31721p [Drosophila melanogaster]
Length = 266
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/217 (23%), Positives = 98/217 (45%), Gaps = 13/217 (5%)
Query: 234 GVELATA-AVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE--- 289
G+E+ PH + C + + K + V +TR+ + + A N+ DV +K
Sbjct: 2 GIEIYVLPKYPHFSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQE 61
Query: 290 -CVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVA 348
CV +A +G L +I K L T F ++ +++++ + V++V L + N +R+ +A
Sbjct: 62 TCVNLVASFVGLYLLSLI-KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIA 120
Query: 349 VESFLKTGHVPSLREGNLQENIFSFPWLR-DKPVVLGSRFKDAFQDPDAYLAVE----PF 403
+E F ++ + S ++ N E + + + LG K+ + + +E F
Sbjct: 121 LEEFFRSSRMLSPQQVNAMERVTVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIENIVSSF 180
Query: 404 FEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHA 440
E +++ VY L + + D+LKA F A
Sbjct: 181 DPHEHFIIAQTKKCLGVY--LHFETRPQDVLKAYFFA 215
>gi|440299036|gb|ELP91648.1| hypothetical protein EIN_206450 [Entamoeba invadens IP1]
Length = 360
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 70/347 (20%), Positives = 137/347 (39%), Gaps = 36/347 (10%)
Query: 121 QAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVS 180
Q P + ++ +VP G+P +V Y ++ F AM L+ S+ S
Sbjct: 35 QPPFLTTRNHLQDIIVPVGYPTTVERGYFRFVA-------FSVAMESCNMMLLVMSISAS 87
Query: 181 RNRATPGAVAINWILKD-GAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELAT 239
+ + A I ++ AG + +I R G + KQ R +++ EL +++A
Sbjct: 88 LRIDSSHSAAFFVIFREVMAGTMHLIITERWGSSIVFYAKQWRMRIEIISELLRLIQIAF 147
Query: 240 AAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD--- 296
A P +F + + ++V + +T I +AK N+ ++T K + + +A
Sbjct: 148 IASP-IFYSINVTCSTIESVLNASRQVIKTKILSNYAKENNVAELTEKAQNIETLARVIS 206
Query: 297 -LLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKT 355
L+G + ++ I F L +V ++ + +V +N R + ++ F
Sbjct: 207 LLIGFIVLHLLGDFTEHGIIVFMLALIAHVVCNFLMGRVIVFKAINYERLMILMKYFTTV 266
Query: 356 GH------VPSLREGNLQENIFSFPWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERY 409
+ V S E ++ N K + +G + D + A+ + +Y
Sbjct: 267 KNEILSPLVVSAIESKIRVN-------EMKNIRMGVSLNSIPKSTDLFFAISQSIKTCQY 319
Query: 410 VVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSN--GHR 454
+ Y + + LL D+ + I +A L C +S GHR
Sbjct: 320 FL-YKNTNNTISVLLGDKITKEGIFEA-------LLCAHSQERVGHR 358
>gi|156088815|ref|XP_001611814.1| hypothetical protein [Babesia bovis T2Bo]
gi|154799068|gb|EDO08246.1| hypothetical protein BBOV_III006850 [Babesia bovis]
Length = 540
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 75/170 (44%), Gaps = 12/170 (7%)
Query: 194 ILKDGAGRVGKMI-FARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL--FLPLA 250
+LKD R+ + F + G FD + K R G LL AGV + L + L
Sbjct: 58 VLKDIFSRILNFVWFGQIGAGFDDNPKAFRLLGGLLCS-AAGVACFIGNMMQLEPKVVLT 116
Query: 251 CAANVAKNVAAVTSTSTRTPIYKAFA---KGENIGDVTAKGECVGNIADLL----GTGLS 303
NV K+V AVT +++ + F N+G++TAK E I D L GT L
Sbjct: 117 VLVNVIKHVGAVTMVASQATFHSTFCVKNAATNVGEITAKLETQSPICDFLGMAAGTSLG 176
Query: 304 IMISKRNPSLITT-FALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESF 352
+ + S+ T F + ++Y KSV TLN R V +E+F
Sbjct: 177 ALTANMAASMQTVLFTGMCLTSNIATYMCAKSVCFRTLNPQRCFVLLENF 226
>gi|300175334|emb|CBK20645.2| unnamed protein product [Blastocystis hominis]
Length = 290
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 98/239 (41%), Gaps = 34/239 (14%)
Query: 237 LATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD 296
+ T P+ FL LA +N+ K+++ + + +T I FA NIGDVTAK +
Sbjct: 1 MITPLFPNCFLALATISNIGKSISFLANGATGAGIRYGFANAMNIGDVTAKEGSQTSAVY 60
Query: 297 LLGTGLSIMISKR--NPSLITTFA---LLSCGYVFSSYQEVKSVVLHTLNRARFSVAVES 351
LLG + +S + S+ T FA +SC +++ + ++ V L TLN R +A
Sbjct: 61 LLGMLFGVSVSSLLGDQSMSTIFASVICMSCLSLYAMNKSLRCVTLPTLNTQRSELACVD 120
Query: 352 FLKTGHVPSLREGNLQENIFSFPWLR----DKPVVLGSRFKDAFQDPDAYLAVEP----- 402
+ G + E + + P+L + LG AY ++E
Sbjct: 121 YCSQGIIYQ-PEAVAKLERYYLPYLHVADCGTDIRLGMSLHSVL---SAYRSLEEADDVI 176
Query: 403 -FFEKERYVVTYNPSKGKVY---------------ALLKDQAKSDDILKAAFHAYVLLH 445
F Y+++Y K +Y L A + D L+A HA +L H
Sbjct: 177 QLFGDCNYMLSYCDLKQGMYRSIINRTEKTKCTIAISLNSSASTRDQLQAYLHALLLRH 235
>gi|301123209|ref|XP_002909331.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100093|gb|EEY58145.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 122
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 57/117 (48%), Gaps = 20/117 (17%)
Query: 171 QTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLME 230
Q+LL G+ A P A A+N +L+DG G Q R + ++
Sbjct: 2 QSLLYDAGLVAG-AIPTAAAVNRVLRDGLG-------------------QWRITSSVALD 41
Query: 231 LGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 287
+ E+ T P FL +A NVAKNVA +++++TR + +F EN+ DVTAK
Sbjct: 42 VAVLGEILTPLAPGSFLVIASLTNVAKNVAWLSASATRAGFHNSFVIRENLADVTAK 98
>gi|326479045|gb|EGE03055.1| DUF647 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 494
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 63/297 (21%), Positives = 120/297 (40%), Gaps = 49/297 (16%)
Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLP 248
I + + + GR+ ++FA R G + + K R A D+L + ++ + P + +
Sbjct: 144 IEYQIYESMGRIATILFAHRLGTSLEPECKMYRLAADILNDSAMVLDCLSPIFPKPVRVG 203
Query: 249 LACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIMISK 308
L ++V + + V + S++ + FA+ N+G++ A G S+++S
Sbjct: 204 LLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVG--------------SVVVSY 249
Query: 309 RNPSLITTFALLSCGYVF--SSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNL 366
L T AL+ V +++ V++V + TLNR R ++ + + + E +
Sbjct: 250 ITSPLETWIALIVLLIVHLGTNHAAVRAVKMTTLNRQRANIVFSHLFEDNLILTPAETSK 309
Query: 367 QENIFS----FPWLRDKPVVLG--------------------------SRFKDAFQDPDA 396
+E IF W + +LG SR D
Sbjct: 310 EERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRGALETSRTTSTTMDAST 369
Query: 397 YL-AVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAAFHAYVLLHCINSSNG 452
L A+ F +E Y++ + P + +LK A + LKA HA ++ + NG
Sbjct: 370 NLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKAWSHALLVSRHLARKNG 426
>gi|134084659|emb|CAK43337.1| unnamed protein product [Aspergillus niger]
Length = 484
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 84/196 (42%), Gaps = 22/196 (11%)
Query: 65 DETTAFRLVE-DQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRA-ISLQSPQA 122
+++TA E D+ + I SE G A R+ K N R ++ S A
Sbjct: 5 EDSTAITFTEVDEVNHPTATYIYSEPAAG-----PAHREHGKKHGENWGRVDVAHASSTA 59
Query: 123 PA----EELMSFIRSYVVPDGFPDSVTPSYVP-YMTWRALKHFFGGAMGVFTT-QTLLNS 176
PA + + F+ +P G+P SVT Y P Y T R + ++ +L S
Sbjct: 60 PAAWSSKSVQDFLVEVFLPAGYPHSVTDDYAPPYQTRRLIHEGLSPSLQQLDCWPPILES 119
Query: 177 --------VGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDL 227
VGV A+P A + IL+D +GR+ ++FA R G + + K R A D+
Sbjct: 120 SAARLPLGVGVGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADI 179
Query: 228 LMELGAGVELATAAVP 243
+L ++ + +P
Sbjct: 180 FNDLAMILDCLSPMIP 195
>gi|425769885|gb|EKV08364.1| hypothetical protein PDIP_68690 [Penicillium digitatum Pd1]
gi|425771531|gb|EKV09972.1| hypothetical protein PDIG_59240 [Penicillium digitatum PHI26]
Length = 406
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/188 (23%), Positives = 86/188 (45%), Gaps = 6/188 (3%)
Query: 174 LNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELG 232
++ VGV A+P + + IL+D +GR+ + FA R G + + K R A D+ ++
Sbjct: 3 ISRVGVGNASASPTSALLLQILQDISGRIATICFAHRVGTALEPECKTYRLAADVFNDIA 62
Query: 233 AGVELATAAVPH--LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGEC 290
++ + VP + + A V + + V S++ + FAK N+ ++ AK
Sbjct: 63 MILDCLSPGVPAGPARVIVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSS 122
Query: 291 VGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFS---SYQEVKSVVLHTLNRARFSV 347
+ L G + ++ S TT+ +L +Y V++V + +LNR R ++
Sbjct: 123 QETVISLFGMLVGSVVISHITSFSTTWLVLLFLLALHLSMNYAAVRAVRMTSLNRQRANI 182
Query: 348 AVESFLKT 355
+ L +
Sbjct: 183 VFSALLAS 190
>gi|215259607|gb|ACJ64295.1| conserved hypothetical protein [Culex tarsalis]
Length = 234
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 58/109 (53%), Gaps = 5/109 (4%)
Query: 266 STRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKRNPSLITTFALLS 321
+TR+ + + A N+ DV +K CV IA +G GL + + L F ++
Sbjct: 3 ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHVQNQRV-LYGLFLFVT 61
Query: 322 CGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+++++ + VK+V L T N AR+ +A+E + K+G + S ++ N E +
Sbjct: 62 LLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGSMLSPQQVNKLERV 110
>gi|384494916|gb|EIE85407.1| hypothetical protein RO3G_10117 [Rhizopus delemar RA 99-880]
Length = 247
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/181 (23%), Positives = 81/181 (44%), Gaps = 35/181 (19%)
Query: 273 KAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPS-LITTFALLSCGYVFSSY 329
K F+ NIGD+ AK + + A LLG +G+ ++ +P L FA+LS ++S+
Sbjct: 19 KHFSPNGNIGDIVAKDDAQMSTAHLLGMLSGVGLISISHSPLFLFGAFAVLSPINIWSTV 78
Query: 330 QEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFPWLRDKPVVLGSRFKD 389
+ + + LN+A+ ++ ++ GHV + + LRD+ + G K
Sbjct: 79 KMLHAAEFEILNQAKLTLLAREYIDQGHVATYEK------------LRDREIGFGEWIKP 126
Query: 390 AFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYA----------LLKDQAKSDDILKAAFH 439
+++ ++++ + PS + Y LKD A S D+LK+ H
Sbjct: 127 SYRKKRGAISIK---------IKMGPSAEQAYGSSHEVQEVVHTLKD-AGSKDVLKSVLH 176
Query: 440 A 440
+
Sbjct: 177 S 177
>gi|303281718|ref|XP_003060151.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458806|gb|EEH56103.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 814
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 55/110 (50%), Gaps = 1/110 (0%)
Query: 131 IRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVA 190
+R +P+G+P SV+ Y+ + A + A G+ + LL GV + AT +
Sbjct: 233 LRRAFLPEGYPRSVSSDYLAFQLCDAAQGVCSYARGILCSTALLRGAGVGSSTATAASAT 292
Query: 191 INWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELGAGVELAT 239
++ +D +G + FA +G+ D D K+ R D + +LG +ELA+
Sbjct: 293 AQFVARDLTSMLGGVAFAATRGRAMDADAKRWRLFADAMNDLGMAIELAS 342
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/130 (25%), Positives = 65/130 (50%), Gaps = 21/130 (16%)
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE----CVGNIAD--- 296
+ FL AC +A+++ VTS +TR + + FA G++ G +G G++AD
Sbjct: 408 YYFLVAACLGGLARSLCGVTSRATRAALTQHFAAGDDAGRSFTRGRTGGSTTGSVADVDA 467
Query: 297 ----------LLGTGLSIMISK----RNPSLITTFALLSCGYVFSSYQEVKSVVLHTLNR 342
L+G G +++++ ++ FALL+ +VF + + ++S+ L+T+NR
Sbjct: 468 KEGTQETAATLIGMGAGVLVTRLAADDAAAIWFWFALLTALHVFFNVRAMRSLRLNTMNR 527
Query: 343 ARFSVAVESF 352
+ + F
Sbjct: 528 VVVRILLRRF 537
>gi|342186523|emb|CCC96010.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 596
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 61/276 (22%), Positives = 112/276 (40%), Gaps = 35/276 (12%)
Query: 104 SGKFKRNSIRAISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGG 163
S K + IR +S + ++P + L R + +P G+PDS + Y F
Sbjct: 115 STKARPGEIRLVS-RRKESPLQRL----RLFGMPTGYPDSCAEGFRRYFLLSLCSTFVSS 169
Query: 164 AMGVFTTQTLLNSVGVSRNRATPGAVAINWILKD------GAGRVGKMIFARQGKKFDYD 217
Q++LN + ++P W+LKD A +++ KF +
Sbjct: 170 FASSIAYQSILNGFLLV---SSPQL----WMLKDLAPALAAAYLANRVVSYENRPKFWFV 222
Query: 218 LKQLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAK 277
+ + +++E+ + + VPH L A + + A++ TR + FA
Sbjct: 223 VSVVLHNTSIVLEM-----IIPSVVPHNLLTAAVLTSCVRQSASLMFLVTRASALQHFAV 277
Query: 278 GENIGDVTAK----GECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGY-VFSSYQEV 332
N+ ++T K G + ++ LG + ++ L+T L CG + SY +
Sbjct: 278 SGNLAELTKKFNSFGIVIYTVSTALGIAYTYYVASLTAQLVT--VLFCCGANLVLSYMSM 335
Query: 333 KSVVLHTLNRARFSVAVESFLKT-----GHVPSLRE 363
++ LN SV + +++K GHV S RE
Sbjct: 336 CNIAFRVLNGTTISVILRAYVKEGCLGPGHVLSPRE 371
>gi|83767160|dbj|BAE57299.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 165
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 66/134 (49%), Gaps = 3/134 (2%)
Query: 156 ALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKF 214
+L+ F G+ +++ +L VGV A+P A + IL+D +GR+ ++FA R G
Sbjct: 25 SLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHILQDTSGRISTILFAHRVGTAL 84
Query: 215 DYDLKQLRFAGDLLMELGAGVELATAAVPHLF--LPLACAANVAKNVAAVTSTSTRTPIY 272
+ + K R A D+ + ++ + +P F + + A V + + V S++ +
Sbjct: 85 EPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLSTAGVLRALCGVAGGSSKASLS 144
Query: 273 KAFAKGENIGDVTA 286
F++ N+ +V A
Sbjct: 145 AHFSRWGNLAEVNA 158
>gi|215259609|gb|ACJ64296.1| conserved hypothetical protein [Culex tarsalis]
Length = 234
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 58/109 (53%), Gaps = 5/109 (4%)
Query: 266 STRTPIYKAFAKGENIGDVTAKGE----CVGNIADLLGTGLSIMISKRNPSLITTFALLS 321
+TR+ + + A N+ DV +K CV IA +G GL + + + L F L+
Sbjct: 3 ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHV-QNHRVLYGLFLFLT 61
Query: 322 CGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENI 370
+++ + + V++V L T N AR+ +A+E + K+G + S ++ N E +
Sbjct: 62 LLHIYPNIKAVQAVCLRTFNEARYLIALEEYFKSGSMLSPQQVNKLERV 110
>gi|335308911|ref|XP_003361419.1| PREDICTED: UPF0420 protein C16orf58 homolog, partial [Sus scrofa]
Length = 210
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 31/132 (23%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 314 ITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSF 373
+ F L+ +++++Y+ V+++V+ TLN R + ++ FL+T V N E +++
Sbjct: 3 LCCFFFLTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQTQEVLGPTAANRMEPLWTG 62
Query: 374 PWLRDKPVVLGSRFKDAFQDPDAYLAVEPFFE--KERYVVTYNPSKGKVYALLKDQAKSD 431
W P + S + + L ++ E +E Y++ ++ + +V +L A +
Sbjct: 63 FW----PSLFLSLGPPLHRLTSSVLELQQLVEGHREPYLLRWDQPRNQVQVVLSPMAGPE 118
Query: 432 DILKAAFHAYVL 443
IL+AA H +L
Sbjct: 119 TILRAATHGLLL 130
>gi|159467581|ref|XP_001691970.1| hypothetical protein CHLREDRAFT_170864 [Chlamydomonas reinhardtii]
gi|158278697|gb|EDP04460.1| predicted protein [Chlamydomonas reinhardtii]
Length = 131
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 12/95 (12%)
Query: 199 AGRVGKMIFAR-QGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH-LFLPLACAANVA 256
AG G ++FA +G FD KQ R D+ +LG VELA+ +P LFLP AC ++A
Sbjct: 2 AGMAGGILFAYAEGSSFDACAKQWRLFADIANDLGMTVELASPLLPRALFLPCACFGSIA 61
Query: 257 KNVAAVT-------STSTRTPIYKAFAKGENIGDV 284
++V + P+Y +G++ G V
Sbjct: 62 RSVTVFALILPRALQANAHVPVYL---RGQDGGGV 93
>gi|449543750|gb|EMD34725.1| hypothetical protein CERSUDRAFT_116920 [Ceriporiopsis subvermispora
B]
Length = 507
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 56/281 (19%), Positives = 112/281 (39%), Gaps = 37/281 (13%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P++V+P Y+ Y + L+ + ++ +L GV A+ + I+
Sbjct: 58 LPAGYPNTVSPDYLHYQIFNGLQAGCSSIASLLASRAVLEGHGVGEASASATNAMLVNII 117
Query: 196 KDGAGR---VGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHLFLPLA-- 250
+D R +G + G + K+ R D+ + +++ + H + L
Sbjct: 118 QDVVSRLITIGSGYYL--GTSLFPEAKKYRLLADVYNDAAVVLDVISPLFTHWSISLVYP 175
Query: 251 ---------------CAANVAKNVAAVTSTSTRTPIYKAFAKG----ENIGDVTAKGECV 291
C + + + + + ++ + FA +IGD+ AK
Sbjct: 176 FIRREQGFYLRVLALCLSGSFRALCGMVAGGSKAALTMHFATAGPVPGDIGDLNAKDGSK 235
Query: 292 GNIADLLGTGLSIMISKRNPSLITTFALLS---CGYVFSSYQEVKSVVLHTLNRARFSVA 348
+ L+G L + K S T+A+LS ++ ++ V+ VV+ T NR R + A
Sbjct: 236 ETVLALIGMLLGTFVIKHVDSGFATYAVLSMLILTHLAINFVAVRGVVMRTFNRHRAAAA 295
Query: 349 VESFLKTGHVPSLREGNL--------QENIFSFPWLRDKPV 381
S+ + + EG + +E IF+ P L P+
Sbjct: 296 WASYRRYEQGIAQYEGAILTPELMAVKERIFADPTLLMHPL 336
>gi|449280556|gb|EMC87826.1| UPF0420 protein [Columba livia]
Length = 111
Score = 45.8 bits (107), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 36/67 (53%)
Query: 130 FIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAV 189
++ ++P GFP+SV+ Y+ Y W AL+ G +++L +VGV A+
Sbjct: 18 LVQRVLLPQGFPESVSSDYLGYQLWDALQALCSSLAGAVAARSVLQAVGVGDTNASVTGA 77
Query: 190 AINWILK 196
A+ W+L+
Sbjct: 78 ALGWLLR 84
>gi|154278545|ref|XP_001540086.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150413671|gb|EDN09054.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 425
Score = 45.4 bits (106), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 70/166 (42%), Gaps = 31/166 (18%)
Query: 324 YVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS----FPWLRDK 379
++ ++Y V+SV + TLNR R ++ + + G V + E + E IF W +
Sbjct: 189 HLSTNYAAVRSVNMTTLNRQRANIVFSTLFEKGSVLTPTEASKCERIFERDGILRW-KAS 247
Query: 380 PVVLG-SRFKDAFQD----------------PDAYLAVEPFFEKERYVVTYNPSKGKVYA 422
P LG + +FQ+ D Y + FEKE YV+ +NP K
Sbjct: 248 PAALGYCQIGGSFQELLRGSIHGANSIRDVGIDIYKLLR-LFEKEEYVLWFNPIHKKGTI 306
Query: 423 LLKDQAKSDDILKAAFHAYVLLHCINSSNGHRSLALKQHEYDNSNL 468
+LK+ LKA HA ++ R L++ E D N+
Sbjct: 307 VLKNNVTPISQLKAWSHALIV--------AKRLTDLRREEADMVNV 344
>gi|449682081|ref|XP_002168725.2| PREDICTED: UPF0420 protein C16orf58 homolog [Hydra magnipapillata]
Length = 237
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/189 (21%), Positives = 84/189 (44%), Gaps = 12/189 (6%)
Query: 259 VAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLG--TGLSIMISKRNPSLIT- 315
+ V +TR + A+ N+ DV+AK + +L+ TG I + +T
Sbjct: 7 IVGVAGGATRAALTLHQARRNNMADVSAKDGSQETLVNLMALFTGFLITPLVVDNLFLTW 66
Query: 316 -TFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFSFP 374
F + + +++++YQ V +VV+ T+N R + V+ + G V + +E N + + P
Sbjct: 67 YFFLVFTFLHLYANYQAVSTVVMETINLPRLHILVKHYFIYGKVLTPKEVNKVDPVLRRP 126
Query: 375 WLRDKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDIL 434
+ V LG ++ D L + + + + YN ++ L ++L
Sbjct: 127 GFKCS-VHLGCQYADV-------LKCDEILDLPGFSLGYNSKTSTIHVSLYKDTSEHEVL 178
Query: 435 KAAFHAYVL 443
+++F A ++
Sbjct: 179 QSSFLAMLV 187
>gi|409046959|gb|EKM56438.1| hypothetical protein PHACADRAFT_253558 [Phanerochaete carnosa
HHB-10118-sp]
Length = 464
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 62/276 (22%), Positives = 109/276 (39%), Gaps = 37/276 (13%)
Query: 136 VPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATPGAVAINWIL 195
+P G+P SV+ Y Y AL+ FF G+ + +L GV A+ + IL
Sbjct: 3 LPAGYPASVSGDYTRYQVLNALQGFFSSLGGLVAARAVLEGHGVGSADASATNALLLTIL 62
Query: 196 KDGAGRVGKMIFARQ-GKKFDYDLKQLRFAGDLLMEL------------GAGVELATAAV 242
+D RV ++ G + K RF D+ ++ ++ A
Sbjct: 63 QDIFSRVTAILSGYYLGTALFPEAKTYRFLADIWNDIPLVLDTLSPLLSHVSLDFVWLAF 122
Query: 243 PHLFLPL------------ACAANVAKNVAAVTSTSTRTPIYKAFAKGE------NIGDV 284
P LP C + + +++ AV + ++ + F++ + +I D+
Sbjct: 123 PPFLLPTLVHRPGPWRGIALCFSGIFRSMCAVAAAGSKAALTLHFSQPDHGAASGDISDL 182
Query: 285 TAKGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFS---SYQEVKSVVLHTLN 341
+AK + L+G +I + T LL+ VF +Y V+SVV + N
Sbjct: 183 SAKDGSKETVLALVGMLCGSIIMPLIQTQTQTCGLLAVIIVFHIYINYYAVRSVVFKSYN 242
Query: 342 RARFSVAVESFLKT-GHVPSL--REGNLQENIFSFP 374
R R + +F ++ G L RE E++F P
Sbjct: 243 RQRTCILWSAFRRSEGEHRCLVPREVARHEHLFHNP 278
>gi|402580829|gb|EJW74778.1| hypothetical protein WUBG_14315 [Wuchereria bancrofti]
Length = 191
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 67/137 (48%), Gaps = 6/137 (4%)
Query: 257 KNVAAVTSTSTRTPIYKAFAKGENIGDVTAKG---ECVGNI-ADLLGTGLSIMISKRNPS 312
+ V V ++TRT + A +N+GDV AK E + N+ A L L ++SK
Sbjct: 3 RTVVGVAGSATRTTVINHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLPVVSKNAVF 62
Query: 313 LITTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGHVPSLREGNLQENIFS 372
+ F L + +++ +Y+ VK + TLN++ + V+ +++T + ++ E N +E I
Sbjct: 63 VWLLFCLFTFIHLYGNYRAVKMLQFRTLNQSLLRIVVKDYIQTRKIGTVNEMNNKEPIL- 121
Query: 373 FPWLRDKPVVLGSRFKD 389
W + G R D
Sbjct: 122 LHWSSSRH-YYGCRLSD 137
>gi|291410213|ref|XP_002721396.1| PREDICTED: HEAT repeat containing 3 [Oryctolagus cuniculus]
Length = 756
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 67/296 (22%), Positives = 118/296 (39%), Gaps = 54/296 (18%)
Query: 63 ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAI--SLQSP 120
E ET A + ++ + + C+ED W ++ D S F +S L SP
Sbjct: 428 ELRETIALLTAQQTALEIIVNMCCTEDPSDDEWEELSSSDESDAFMESSFSECGGQLLSP 487
Query: 121 QAPAEELMSFIRSYVVPD------GFPDSV-------TPSYVPYM------TWRALKHFF 161
+ E+ + + +Y++P FP+SV +PS+ P + WRAL
Sbjct: 488 LCLSHEVHTALTNYLIPKKVFEKTAFPNSVAVDICSRSPSWKPLIRKMNTVQWRAL---- 543
Query: 162 GGAMGVFTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLK 219
Q+LL+ + V P L+ A + ++IFA+ K D+ L+
Sbjct: 544 ------VCLQSLLSLLDVDHLGGAPA-------LQTLAQHLSQLIFAQPDFAKHADF-LE 589
Query: 220 QLRFAGDLLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGE 279
+ A L++ A ++ P + L C A V ++T R +
Sbjct: 590 AVSSALRALLQTMASKNISQCMTPEQLMTL-CRAGVHS-----SNTGVRVNVVSILGI-- 641
Query: 280 NIGDVTAKGECVGNIADLLGTGLSIM-ISKRNPSLITTFALLSCGY-VFSSYQEVK 333
G V AK + G + L G ++ ++ ++PSL+ L + VF+ QE +
Sbjct: 642 -TGSVLAKED--GTLETLKTIGCFLLEVATKDPSLVVAGEALDALFDVFADGQEAE 694
>gi|397621626|gb|EJK66381.1| hypothetical protein THAOC_12704 [Thalassiosira oceanica]
Length = 480
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 57/126 (45%), Gaps = 1/126 (0%)
Query: 114 AISLQSPQAPAEELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTL 173
++S SP++ L +P G P + Y+ Y ++ GV +T
Sbjct: 40 SLSSLSPRSVPSALSGLFGDLFLPAGHPSTTAGGYLHYQVCDGVQGLCSYLRGVVSTGAA 99
Query: 174 LNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFA-RQGKKFDYDLKQLRFAGDLLMELG 232
L + GV + AT A+ W ++DG G +G + F+ KFD +K+ R D+ + G
Sbjct: 100 LAAAGVGDSDATAMGAAVTWAVRDGLGMLGGLAFSYYASSKFDSRVKEYRLLADVANDAG 159
Query: 233 AGVELA 238
+++A
Sbjct: 160 MILDMA 165
>gi|326927265|ref|XP_003209813.1| PREDICTED: HEAT repeat-containing protein 3-like [Meleagris
gallopavo]
Length = 667
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 64/269 (23%), Positives = 109/269 (40%), Gaps = 32/269 (11%)
Query: 14 NSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGA-AATLLQS---ESDETTA 69
N AA + ++ + V + IS + P G G G + LL S E E TA
Sbjct: 287 NRLKPAAEAETEENVIDVTDNCAISEDDGMEEIPKGKGSGEHDISDLLPSDKWELKEVTA 346
Query: 70 FRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAIS--LQSPQAPAEEL 127
+ + ++ + + CSED W ++ D S F NS S L SP + E+
Sbjct: 347 LLMAQQTALEIIVNMCCSEDPSDDEWEELSSSDESDLFMENSYSEGSGLLMSPLCLSAEV 406
Query: 128 MSFIRSYVVPD------GFPDSV-------TPSYVPYMTWRALKHFFGGAMGVFTTQTLL 174
S + ++P FP+SV PS+ P + + L A+ + L+
Sbjct: 407 HSAFLNNLIPKKILEKTAFPNSVALDICMHNPSWKPLI--KKLNTVQCRALTCLHSILLV 464
Query: 175 NSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYD-LKQLRFAGDLLMELGA 233
+ V GA A L+ A + ++IF++ D + L+ + A L++ A
Sbjct: 465 SDVD-----CLGGASA----LQSLAQHLSQLIFSKAELPEDTEFLEAITSALRALLQTMA 515
Query: 234 GVELATAAVPHLFLPLACAANVAKNVAAV 262
++ P L L C A++ + A+V
Sbjct: 516 SKNISQCMTPEQLLAL-CEASIHSSNASV 543
>gi|313238188|emb|CBY13282.1| unnamed protein product [Oikopleura dioica]
Length = 93
Score = 41.2 bits (95), Expect = 1.4, Method: Composition-based stats.
Identities = 19/75 (25%), Positives = 33/75 (44%)
Query: 127 LMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRATP 186
L I +P +P+SV+ Y Y W + + G ++ +GV + A+
Sbjct: 18 LKGLICKVFLPKDYPNSVSEDYTTYQVWDSAQALCSSVTGSLAINSVFKGIGVGDSSASS 77
Query: 187 GAVAINWILKDGAGR 201
+ NWI++DG R
Sbjct: 78 LSATSNWIIRDGIDR 92
>gi|261335583|emb|CBH18577.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 595
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 52/238 (21%), Positives = 93/238 (39%), Gaps = 12/238 (5%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
E L+ + + +P G+PDS + Y F Q++LN ++ +
Sbjct: 155 ESLLQRLHLFGMPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSILNGFLLA---S 211
Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELAT-AAVP 243
+P W+LKD A + A + ++ K L L E+ VP
Sbjct: 212 SPQL----WMLKDLAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVP 267
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
+ L A + + A++ TR + FA N+ ++T K G + + T L
Sbjct: 268 NHLLAAAILTSCVRQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALG 327
Query: 304 IMISKRNPSLITTFA--LLSCGYVFS-SYQEVKSVVLHTLNRARFSVAVESFLK-TGH 357
I + PS+ A L+ CG + S+ + ++ LN SV + +++ GH
Sbjct: 328 IAYTSLVPSVTAQLATVLMCCGANLAISHLSMCNIAFRILNETTLSVILRFYMREEGH 385
>gi|74026006|ref|XP_829569.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70834955|gb|EAN80457.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 595
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 52/238 (21%), Positives = 93/238 (39%), Gaps = 12/238 (5%)
Query: 125 EELMSFIRSYVVPDGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVSRNRA 184
E L+ + + +P G+PDS + Y F Q++LN ++ +
Sbjct: 155 ESLLQRLHLFGMPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSILNGFLLA---S 211
Query: 185 TPGAVAINWILKDGAGRVGKMIFARQGKKFDYDLKQLRFAGDLLMELGAGVELAT-AAVP 243
+P W+LKD A + A + ++ K L L E+ VP
Sbjct: 212 SPQL----WMLKDLAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVP 267
Query: 244 HLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLS 303
+ L A + + A++ TR + FA N+ ++T K G + + T L
Sbjct: 268 NHLLAAAILTSCVRQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALG 327
Query: 304 IMISKRNPSLITTFA--LLSCGYVFS-SYQEVKSVVLHTLNRARFSVAVESFLK-TGH 357
I + PS+ A L+ CG + S+ + ++ LN SV + +++ GH
Sbjct: 328 IAYTSLVPSVTAQLATVLMCCGANLAISHLSMCNIAFRILNETTLSVILRFYMREEGH 385
>gi|449473073|ref|XP_002193970.2| PREDICTED: HEAT repeat-containing protein 3 [Taeniopygia guttata]
Length = 638
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 66/292 (22%), Positives = 114/292 (39%), Gaps = 36/292 (12%)
Query: 63 ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAIS--LQSP 120
E E TA + ++ + + CSED W ++ D S F NS S L SP
Sbjct: 311 ELKEVTALLTAQQTALEIIVNMCCSEDPSDDEWEELSSSDESDLFMENSYNEGSGLLLSP 370
Query: 121 QAPAEELMSFIRSYVVPD------GFPDSV-------TPSYVPYMTWRALKHFFGGAMGV 167
++E+ S + ++P FP+SV PS+ P + + L A+
Sbjct: 371 LCLSDEVNSAFLNNLIPKKILEKTAFPNSVALDICMHNPSWKPLI--KKLNAVQCRALTC 428
Query: 168 FTTQTLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYD-LKQLRFAGD 226
+ L++ V GA A L+ A + ++IF++ D + L+ + A
Sbjct: 429 LQSILLVSDVD-----CLGGASA----LQSLAQHLSQLIFSKTELPTDTEFLEAVTSALR 479
Query: 227 LLMELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTA 286
L++ A + P L L C A + ++TS R + G V A
Sbjct: 480 ALLQTMASKNIPQCMTPEQLLAL-CEAGIQS-----SNTSVRVNMVSILGIS---GSVLA 530
Query: 287 KGECVGNIADLLGTGLSIMISKRNPSLITTFALLSCGYVFSSYQEVKSVVLH 338
KG+ ++G L + K + ++ AL + VF+ +E + L
Sbjct: 531 KGQGAAETLKMIGKFLLEVAMKESSLVVAGEALDALFDVFADGEEAERAALQ 582
>gi|449528331|ref|XP_004171158.1| PREDICTED: uncharacterized protein LOC101231720 [Cucumis sativus]
Length = 157
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 2/66 (3%)
Query: 378 DKPVVLGSRFKDAFQDPDAYLAVEPFFEKERYVVTYNPSKGKVYALLKDQAKSDDILKAA 437
+K + LGS+ D + L + F KE Y+++ + +GK +LK+ A D+LKA
Sbjct: 27 EKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEH--RGKYCVMLKESASPVDMLKAV 84
Query: 438 FHAYVL 443
FH L
Sbjct: 85 FHVNYL 90
>gi|397582228|gb|EJK52211.1| hypothetical protein THAOC_28545 [Thalassiosira oceanica]
Length = 263
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 61/140 (43%), Gaps = 13/140 (9%)
Query: 173 LLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIF-ARQGKKFDYDLKQLRFAGDLLMEL 231
+ +GV R AT +++ L+D G + F A + +KQ +F L+++L
Sbjct: 123 IFEGLGVGRASATALVATMSFALRDFFGICSSLAFSAVVSRNLARHVKQWKFFSALMLDL 182
Query: 232 GAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKG-----ENIGDVTA 286
G +++ FLPL C ++ + + VT++ + I +A + I ++ +
Sbjct: 183 GLAMKVTANLKRQWFLPLLCGGSICEAMYGVTTSPCTSIINLHWANTVLGSEDGIAEIIS 242
Query: 287 K-------GECVGNIADLLG 299
K E VG D LG
Sbjct: 243 KRRAQRTLNELVGGRTDGLG 262
>gi|335056143|dbj|BAK39267.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056145|dbj|BAK39268.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056147|dbj|BAK39269.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056473|dbj|BAK39241.1| similar to EMB1879 [Oryza rufipogon]
gi|335056475|dbj|BAK39242.1| similar to EMB1879 [Oryza rufipogon]
gi|335056477|dbj|BAK39243.1| similar to EMB1879 [Oryza rufipogon]
gi|335056479|dbj|BAK39244.1| similar to EMB1879 [Oryza rufipogon]
gi|335056481|dbj|BAK39245.1| similar to EMB1879 [Oryza rufipogon]
gi|335056483|dbj|BAK39246.1| similar to EMB1879 [Oryza rufipogon]
gi|335056485|dbj|BAK39247.1| similar to EMB1879 [Oryza rufipogon]
gi|335056487|dbj|BAK39248.1| similar to EMB1879 [Oryza rufipogon]
gi|335056489|dbj|BAK39249.1| similar to EMB1879 [Oryza rufipogon]
gi|335056491|dbj|BAK39250.1| similar to EMB1879 [Oryza rufipogon]
gi|335056493|dbj|BAK39251.1| similar to EMB1879 [Oryza rufipogon]
gi|335056495|dbj|BAK39252.1| similar to EMB1879 [Oryza rufipogon]
gi|335056497|dbj|BAK39253.1| similar to EMB1879 [Oryza rufipogon]
gi|335056499|dbj|BAK39254.1| similar to EMB1879 [Oryza rufipogon]
gi|335056501|dbj|BAK39255.1| similar to EMB1879 [Oryza rufipogon]
gi|335056503|dbj|BAK39256.1| similar to EMB1879 [Oryza rufipogon]
gi|335056505|dbj|BAK39257.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056507|dbj|BAK39258.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056509|dbj|BAK39259.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056511|dbj|BAK39260.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056513|dbj|BAK39261.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056515|dbj|BAK39262.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056517|dbj|BAK39263.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056519|dbj|BAK39264.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056521|dbj|BAK39265.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056777|dbj|BAK39266.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056779|dbj|BAK39270.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056781|dbj|BAK39271.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056783|dbj|BAK39272.1| similar to EMB1879 [Oryza sativa Japonica Group]
gi|335056785|dbj|BAK39273.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056787|dbj|BAK39274.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056789|dbj|BAK39275.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056791|dbj|BAK39276.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056793|dbj|BAK39277.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056795|dbj|BAK39278.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056797|dbj|BAK39279.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056799|dbj|BAK39280.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056801|dbj|BAK39281.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056803|dbj|BAK39282.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056805|dbj|BAK39283.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056807|dbj|BAK39284.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056809|dbj|BAK39285.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056811|dbj|BAK39286.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056813|dbj|BAK39287.1| similar to EMB1879 [Oryza sativa Indica Group]
gi|335056815|dbj|BAK39288.1| similar to EMB1879 [Oryza sativa Japonica Group]
Length = 21
Score = 39.7 bits (91), Expect = 3.4, Method: Composition-based stats.
Identities = 19/21 (90%), Positives = 19/21 (90%)
Query: 288 GECVGNIADLLGTGLSIMISK 308
GE VGNIADLLGTGLSI ISK
Sbjct: 1 GESVGNIADLLGTGLSIFISK 21
>gi|7020991|dbj|BAA91342.1| unnamed protein product [Homo sapiens]
Length = 594
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)
Query: 63 ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
E ET A + ++ + + C+ED W ++ D S F NS Q SP
Sbjct: 266 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 325
Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
+ E+ + + +Y++P FP+S+ TW+ L K +F Q
Sbjct: 326 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 385
Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
+L++ + V GA A+ + A + +++F++ K D+ L+ + A L+
Sbjct: 386 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 437
Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
+ A ++ P + L A + NV + + I G V AK +
Sbjct: 438 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 488
Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
G + L G ++ ++ ++PSL+
Sbjct: 489 --GTLETLKNIGCFLLEVTTKDPSLV 512
>gi|119603150|gb|EAW82744.1| hypothetical protein FLJ20718, isoform CRA_b [Homo sapiens]
Length = 594
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)
Query: 63 ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
E ET A + ++ + + C+ED W ++ D S F NS Q SP
Sbjct: 266 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 325
Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
+ E+ + + +Y++P FP+S+ TW+ L K +F Q
Sbjct: 326 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 385
Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
+L++ + V GA A+ + A + +++F++ K D+ L+ + A L+
Sbjct: 386 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 437
Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
+ A ++ P + L A + NV + + I G V AK +
Sbjct: 438 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 488
Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
G + L G ++ ++ ++PSL+
Sbjct: 489 --GTLETLKNIGCFLLEVTTKDPSLV 512
>gi|332845876|ref|XP_520630.3| PREDICTED: HEAT repeat-containing protein 3 isoform 2 [Pan
troglodytes]
gi|410210380|gb|JAA02409.1| HEAT repeat containing 3 [Pan troglodytes]
gi|410268164|gb|JAA22048.1| HEAT repeat containing 3 [Pan troglodytes]
gi|410288786|gb|JAA22993.1| HEAT repeat containing 3 [Pan troglodytes]
Length = 683
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)
Query: 63 ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
E ET A + ++ + + C+ED W ++ D S F NS Q SP
Sbjct: 355 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 414
Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
+ E+ + + +Y++P FP+S+ TW+ L K +F Q
Sbjct: 415 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 474
Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
+L++ + V GA A+ + A + +++F++ K D+ L+ + A L+
Sbjct: 475 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 526
Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
+ A ++ P + L A + NV + + I G V AK +
Sbjct: 527 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 577
Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
G + L G ++ ++ ++PSL+
Sbjct: 578 --GTLETLKNIGCFLLEVTTKDPSLV 601
>gi|41281996|ref|NP_891552.1| HEAT repeat-containing protein 3 [Homo sapiens]
gi|45477280|sp|Q7Z4Q2.2|HEAT3_HUMAN RecName: Full=HEAT repeat-containing protein 3
gi|21619962|gb|AAH33077.1| HEAT repeat containing 3 [Homo sapiens]
gi|54311359|gb|AAH18730.2| HEAT repeat containing 3 [Homo sapiens]
gi|119603149|gb|EAW82743.1| hypothetical protein FLJ20718, isoform CRA_a [Homo sapiens]
gi|158260921|dbj|BAF82638.1| unnamed protein product [Homo sapiens]
Length = 680
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)
Query: 63 ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
E ET A + ++ + + C+ED W ++ D S F NS Q SP
Sbjct: 352 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 411
Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
+ E+ + + +Y++P FP+S+ TW+ L K +F Q
Sbjct: 412 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 471
Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
+L++ + V GA A+ + A + +++F++ K D+ L+ + A L+
Sbjct: 472 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 523
Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
+ A ++ P + L A + NV + + I G V AK +
Sbjct: 524 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 574
Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
G + L G ++ ++ ++PSL+
Sbjct: 575 --GTLETLKNIGCFLLEVTTKDPSLV 598
>gi|15680144|gb|AAH14415.1| HEATR3 protein [Homo sapiens]
Length = 726
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)
Query: 63 ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
E ET A + ++ + + C+ED W ++ D S F NS Q SP
Sbjct: 398 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 457
Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
+ E+ + + +Y++P FP+S+ TW+ L K +F Q
Sbjct: 458 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 517
Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
+L++ + V GA A+ + A + +++F++ K D+ L+ + A L+
Sbjct: 518 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 569
Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
+ A ++ P + L A + NV + + I G V AK +
Sbjct: 570 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 620
Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
G + L G ++ ++ ++PSL+
Sbjct: 621 --GTLETLKNIGCFLLEVTTKDPSLV 644
>gi|397498211|ref|XP_003819878.1| PREDICTED: HEAT repeat-containing protein 3 [Pan paniscus]
Length = 766
Score = 38.9 bits (89), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 57/266 (21%), Positives = 108/266 (40%), Gaps = 33/266 (12%)
Query: 63 ESDETTAFRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAISLQ--SP 120
E ET A + ++ + + C+ED W ++ D S F NS Q SP
Sbjct: 438 ELRETIALLTAQQTALEIIVNMCCNEDPSDDEWEELSSSDESDAFMENSFSECGGQLFSP 497
Query: 121 QAPAEELMSFIRSYVVPD------GFPDSVTPSYVPYM-TWRAL--KHFFGGAMGVFTTQ 171
+ E+ + + +Y++P FP+S+ TW+ L K +F Q
Sbjct: 498 LCLSHEVHTALTNYLIPKKIFEKTAFPNSIAVDLCSRNPTWKPLIRKMNTIQCRALFCLQ 557
Query: 172 TLLNSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQ--GKKFDYDLKQLRFAGDLLM 229
+L++ + V GA A+ + A + +++F++ K D+ L+ + A L+
Sbjct: 558 SLVSLLDVEH---LGGAAALQTL----AQHLSQLLFSQPDFAKHVDF-LEAISSALRALL 609
Query: 230 ELGAGVELATAAVPHLFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGE 289
+ A ++ P + L A + NV + + I G V AK +
Sbjct: 610 QTMASKNISQCMTPDQLMTLCKAGIHSSNVGVRVNVVSILGIT---------GSVLAKED 660
Query: 290 CVGNIADLLGTGLSIM-ISKRNPSLI 314
G + L G ++ ++ ++PSL+
Sbjct: 661 --GTLETLKNIGCFLLEVTTKDPSLV 684
>gi|71895555|ref|NP_001025740.1| HEAT repeat-containing protein 3 [Gallus gallus]
gi|53134845|emb|CAG32369.1| hypothetical protein RCJMB04_23o18 [Gallus gallus]
Length = 675
Score = 38.5 bits (88), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 61/269 (22%), Positives = 109/269 (40%), Gaps = 32/269 (11%)
Query: 14 NSTAAAASSAAQDARLLVRETMRISANLATSSSPIGVGGGGA-AATLLQS---ESDETTA 69
N AA + +++ + V + +S + G G G + LL S E E TA
Sbjct: 295 NRLKPAAGAETEESEIDVTDNCALSEDDGMEEVSKGKGSGEHDISDLLPSDKWELKEVTA 354
Query: 70 FRLVEDQFVDSSLRLICSEDIDGRRWNYVAERDRSGKFKRNSIRAIS--LQSPQAPAEEL 127
+ + ++ + + CSED W ++ D S F NS S L SP + E+
Sbjct: 355 LLMAQQTALEIIVNMCCSEDPSDDEWEELSSSDESDLFMENSYSEGSGLLMSPLCLSAEV 414
Query: 128 MSFIRSYVVPD------GFPDSV-------TPSYVPYMTWRALKHFFGGAMGVFTTQTLL 174
S + ++P FP+SV PS+ P + + + A+ + L+
Sbjct: 415 HSAFLNNLIPKKILEKTAFPNSVALDICVHNPSWKPLI--KKMNTVQCRALTCLHSILLV 472
Query: 175 NSVGVSRNRATPGAVAINWILKDGAGRVGKMIFARQGKKFDYD-LKQLRFAGDLLMELGA 233
+ V GA A L+ A + ++IF++ D + L+ + A L++ A
Sbjct: 473 SDVD-----CLGGASA----LQSLAQHLSQLIFSKTELPEDTEFLEAITSALRALLQTMA 523
Query: 234 GVELATAAVPHLFLPLACAANVAKNVAAV 262
++ P L L C A++ + A+V
Sbjct: 524 SKNISQCMTPEQLLAL-CEASIHSSNASV 551
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.132 0.384
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,471,718,190
Number of Sequences: 23463169
Number of extensions: 310087588
Number of successful extensions: 820440
Number of sequences better than 100.0: 506
Number of HSP's better than 100.0 without gapping: 425
Number of HSP's successfully gapped in prelim test: 81
Number of HSP's that attempted gapping in prelim test: 819024
Number of HSP's gapped (non-prelim): 643
length of query: 495
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 348
effective length of database: 8,910,109,524
effective search space: 3100718114352
effective search space used: 3100718114352
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)