BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 045134
(529 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225462223|ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
Length = 713
Score = 722 bits (1864), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/500 (71%), Positives = 412/500 (82%), Gaps = 15/500 (3%)
Query: 27 DSSGSFYSLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIP 86
+++ ++LLFV S+LY F H Q+ TA+++ KE + VWEV+G K K+IP
Sbjct: 199 ETAHMLRAVLLFVFSVLYSFFHFQLDTALSK---------EKEEEGVWEVRGGKWHKIIP 249
Query: 87 DFTKDAFVVAS----ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSL 142
D +KD F+V + A A SS L LW +C+ELF++ MLPEGFP SVTSDYL+Y+L
Sbjct: 250 DSSKDEFLVVTPGIGAVGAPKSSTLP--NLWLQCKELFLRLMLPEGFPHSVTSDYLDYTL 307
Query: 143 WRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHF 202
WR VQGVASQISGVLATQALLYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +GRHF
Sbjct: 308 WRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHF 367
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAG 262
DV+PKGWRLFADLLENAA+GLE+LTPAFPH F+ IGA AGAGRSAAALIQASTRSCFYAG
Sbjct: 368 DVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAG 427
Query: 263 FAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLK 322
FAA+RNFAEVIAKGEAQGMVSK+IGIMLGIALAN IGSS P + ASF+VVT +HM+CNLK
Sbjct: 428 FAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLK 487
Query: 323 SYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVL 382
SYQSI+LRTLNPYRASLVFSEYLLSGQ P +KEVN+EEPLFP K K+Q VL
Sbjct: 488 SYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVL 547
Query: 383 SSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESA 442
S+EAKDAA EIE RLQLGSKLS+VV++KED ALF LY +E YILTEH G+F V+LKES
Sbjct: 548 STEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESC 607
Query: 443 LPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASV 502
PQDMLKS+F +YLYWLERNAGI++ S DC PGGRL+ISL+YVQREFNH+K+DS V
Sbjct: 608 SPQDMLKSVFHVNYLYWLERNAGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFV 667
Query: 503 GWVTDGLIARPLPNRIRPGY 522
GW TDGLIARPLPNRIRPG+
Sbjct: 668 GWATDGLIARPLPNRIRPGH 687
>gi|296082804|emb|CBI21809.3| unnamed protein product [Vitis vinifera]
Length = 537
Score = 721 bits (1860), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/494 (72%), Positives = 410/494 (82%), Gaps = 15/494 (3%)
Query: 34 SLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAF 93
++LLFV S+LY F H Q+ TA+++ KE + VWEV+G K K+IPD +KD F
Sbjct: 4 AVLLFVFSVLYSFFHFQLDTALSK---------EKEEEGVWEVRGGKWHKIIPDSSKDEF 54
Query: 94 VVAS----ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGV 149
+V + A A SS L LW +C+ELF++ MLPEGFP SVTSDYL+Y+LWR VQGV
Sbjct: 55 LVVTPGIGAVGAPKSSTLP--NLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGV 112
Query: 150 ASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGW 209
ASQISGVLATQALLYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +GRHFDV+PKGW
Sbjct: 113 ASQISGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGW 172
Query: 210 RLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNF 269
RLFADLLENAA+GLE+LTPAFPH F+ IGA AGAGRSAAALIQASTRSCFYAGFAA+RNF
Sbjct: 173 RLFADLLENAAYGLEILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNF 232
Query: 270 AEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIEL 329
AEVIAKGEAQGMVSK+IGIMLGIALAN IGSS P + ASF+VVT +HM+CNLKSYQSI+L
Sbjct: 233 AEVIAKGEAQGMVSKSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQL 292
Query: 330 RTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDA 389
RTLNPYRASLVFSEYLLSGQ P +KEVN+EEPLFP K K+Q VLS+EAKDA
Sbjct: 293 RTLNPYRASLVFSEYLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDA 352
Query: 390 AVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLK 449
A EIE RLQLGSKLS+VV++KED ALF LY +E YILTEH G+F V+LKES PQDMLK
Sbjct: 353 AAEIERRLQLGSKLSEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLK 412
Query: 450 SLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGL 509
S+F +YLYWLERNAGI++ S DC PGGRL+ISL+YVQREFNH+K+DS VGW TDGL
Sbjct: 413 SVFHVNYLYWLERNAGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGL 472
Query: 510 IARPLPNRIRPGYV 523
IARPLPNRIRPG++
Sbjct: 473 IARPLPNRIRPGHI 486
>gi|255557849|ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
gi|223541000|gb|EEF42558.1| conserved hypothetical protein [Ricinus communis]
Length = 541
Score = 712 bits (1838), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/493 (75%), Positives = 414/493 (83%), Gaps = 19/493 (3%)
Query: 42 LLYCFCHL---QVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASA 98
LL CF L ++A ART +++ E D+VW VKGSKR +LIPDF KD F+V
Sbjct: 46 LLCCFVALWLQSASSAFARTTLKEKEEEGAE-DSVWVVKGSKRIRLIPDFIKDEFLV--- 101
Query: 99 SNASLSSLLSV----------NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQG 148
N SL S LW +CR LFV+ MLPEG+P SVTSDYL+YSLWR VQG
Sbjct: 102 -NPSLPSSYDDIISSSWLHFGRTLWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQG 160
Query: 149 VASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKG 208
VASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKI+LS +GRHFDVNPKG
Sbjct: 161 VASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKG 220
Query: 209 WRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRN 268
WRLFADLLENAAFGLE+LTPAFPH FVFIGAAAGAGRSAAALIQA+TRSCFYAGFAA+RN
Sbjct: 221 WRLFADLLENAAFGLEILTPAFPHLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRN 280
Query: 269 FAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIE 328
FAEVIAKGEAQGMVSK IGIMLGI LAN IGSS+P ALASFSVVTWIHM+CNLKSYQSI+
Sbjct: 281 FAEVIAKGEAQGMVSKFIGIMLGIGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQ 340
Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKD 388
LRTLNPYRASLVFSEYLLSGQAPP+K+VNDEEPLFPA F KSA+K L+VLS EA+D
Sbjct: 341 LRTLNPYRASLVFSEYLLSGQAPPIKDVNDEEPLFPAV-FPHFKSADKPSLVVLSLEARD 399
Query: 389 AAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDML 448
AA EIE RLQLGSKLSDVVN+KED ALF+LY+DEGYILTE+ G+FCVVLKES QDML
Sbjct: 400 AATEIERRLQLGSKLSDVVNSKEDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDML 459
Query: 449 KSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDG 508
K+LFQ +YLYWLERNAG+ A TSADC GGRL++SL+Y+QREF+HV++DS SVGWV DG
Sbjct: 460 KALFQVNYLYWLERNAGLDARGTSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADG 519
Query: 509 LIARPLPNRIRPG 521
LIARPLPNRI PG
Sbjct: 520 LIARPLPNRIYPG 532
>gi|42565615|ref|NP_190175.2| uncharacterized protein [Arabidopsis thaliana]
gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis thaliana]
gi|30794095|gb|AAP40490.1| unknown protein [Arabidopsis thaliana]
gi|110739240|dbj|BAF01534.1| hypothetical protein [Arabidopsis thaliana]
gi|332644566|gb|AEE78087.1| uncharacterized protein [Arabidopsis thaliana]
Length = 608
Score = 693 bits (1789), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/489 (72%), Positives = 405/489 (82%), Gaps = 4/489 (0%)
Query: 36 LLFVPSLLYCFCH--LQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAF 93
L F+ L CF H L A+AIA+ +S+ +G+ + VWEV+GSKR +L+PDF KD F
Sbjct: 110 LCFLLLGLSCFFHFRLSAASAIAKD-QNSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEF 168
Query: 94 VVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQI 153
V + S LSS L+ L +CR L QF+LPEGFP+SVTSDYL+YSLWR VQG+ASQI
Sbjct: 169 V-SEESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQI 227
Query: 154 SGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFA 213
SGVLATQ+LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFA
Sbjct: 228 SGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFA 287
Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
DLLENAAFG+EMLTP FP FV IGAAAGAGRSAAALIQA+TRSCF AGFA++RNFAEVI
Sbjct: 288 DLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVI 347
Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
AKGEAQGMVSK++GI+LGI +AN IG+S ALA+F VVT IHMY NLKSYQ I+LRTLN
Sbjct: 348 AKGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLN 407
Query: 334 PYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEI 393
PYRASLVFSEYL+SGQAP +KEVNDEEPLFP F +KS K Q VLSSEAK AA +I
Sbjct: 408 PYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFSNMKSPEKLQDFVLSSEAKAAAADI 467
Query: 394 EHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQ 453
E RLQLGSKLSDV++NKE+A ALF LY +EGYILTEH G+FCV+LKES+ PQDML+SLFQ
Sbjct: 468 EERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHKGRFCVMLKESSTPQDMLRSLFQ 527
Query: 454 ASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARP 513
+YLYWLE+NAGI ST +DC PGGRL ISLDYV+REF H K DS SVGWVT+GLIARP
Sbjct: 528 VNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARP 587
Query: 514 LPNRIRPGY 522
LP RIR G+
Sbjct: 588 LPTRIRLGH 596
>gi|356495287|ref|XP_003516510.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
Length = 649
Score = 692 bits (1786), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/499 (72%), Positives = 402/499 (80%), Gaps = 7/499 (1%)
Query: 35 LLLFVPSLLYCFCHL-QVATAIART-ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDA 92
L L S L FCHL A A+T + S+ D + + V+EVKG K TKL+PD T D
Sbjct: 90 LSLLCSSALCFFCHLLHAKLAKAKTLSPSTTADTSLFSEPVYEVKGGKWTKLVPDLTNDV 149
Query: 93 FVVASASNAS-LSSLLSVNKL----WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQ 147
FV A S LSSL ++L W +C ++F + MLPEGFP+SVTSDYL YSLWR+VQ
Sbjct: 150 FVSAQQGFLSELSSLKVPSQLATFVWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQ 209
Query: 148 GVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPK 207
GVA Q+SGVLATQ+LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDV+PK
Sbjct: 210 GVACQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPK 269
Query: 208 GWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARR 267
GWRLFADLLENAAFGLEM TPAFP FV IGA AGA RSAA+LIQASTRSCF+AGFAA+R
Sbjct: 270 GWRLFADLLENAAFGLEMCTPAFPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQR 329
Query: 268 NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSI 327
NFAEVIAKGE QGM S+ IGI LGI L N IGSS P LASF+V+TWIHMYCNLKSYQSI
Sbjct: 330 NFAEVIAKGEVQGMASRFIGIGLGIGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSI 389
Query: 328 ELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAK 387
+LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPA ANK+Q +VLSSEAK
Sbjct: 390 QLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAVPILNATFANKAQSIVLSSEAK 449
Query: 388 DAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDM 447
DAA EIEHRLQLGSKLS++VN+KED ALF LY++EGYIL+E+ GKFCVVLKE+ QDM
Sbjct: 450 DAAAEIEHRLQLGSKLSEIVNSKEDVLALFGLYKNEGYILSEYMGKFCVVLKENCSQQDM 509
Query: 448 LKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTD 507
LK+LFQ +YLYWLE+NAGI T D PGGRL ISLDYV+REFNHVK+D VGWVTD
Sbjct: 510 LKALFQVNYLYWLEKNAGIGGRGTLNDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTD 569
Query: 508 GLIARPLPNRIRPGYVEPS 526
GLIARPLPNRIR G PS
Sbjct: 570 GLIARPLPNRIRIGDTPPS 588
>gi|297815746|ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
lyrata]
gi|297321594|gb|EFH52015.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
lyrata]
Length = 613
Score = 690 bits (1781), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/515 (69%), Positives = 410/515 (79%), Gaps = 3/515 (0%)
Query: 16 WWWHGGNGGDDDSSGSFYSLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEY-DAVW 74
WW++GG DDSS + V L CF H +++ A A S D + VW
Sbjct: 97 WWFNGGGDNSDDSSFNLRYFCFLVLGL-SCFFHFRLSAASAIAKASDSDSSGDTDKETVW 155
Query: 75 EVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVT 134
EV+GSKR +L+PDF KD FV + S LSS L+ L +CR L QF+LPEGFP+SVT
Sbjct: 156 EVRGSKRKRLVPDFVKDEFV-SEESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVT 214
Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIM 194
SDYL+YSLWR VQG+ASQ+SGVLATQ+LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKIM
Sbjct: 215 SDYLDYSLWRGVQGIASQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIM 274
Query: 195 LSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQAS 254
LS +GRHFDV+PKGWRLFADLLENAAFG+EMLTP FP FV IGAAAGAGRSAAALIQA+
Sbjct: 275 LSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAA 334
Query: 255 TRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTW 314
TRSCF AGFA++RNFAEVIAKGEAQGMVSK++GI+LGI +AN IG+S ALA+F VVT
Sbjct: 335 TRSCFNAGFASQRNFAEVIAKGEAQGMVSKSMGILLGIVVANCIGTSTSLALAAFGVVTT 394
Query: 315 IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSA 374
IHMY NLKSYQ I+LRTLNPYRASLVFSEYL+SGQAP +KEVNDEEPLFP F +KS
Sbjct: 395 IHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFLNMKSP 454
Query: 375 NKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKF 434
K Q VLSSEAK AA +IE RLQLGSKLSDV++NKE+A ALF LY +EGYILTEH G+F
Sbjct: 455 EKLQDFVLSSEAKAAAEDIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRF 514
Query: 435 CVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNH 494
CV+LKES+ PQDML+SLFQ +YLYWLE+NAGI ST DC PGGRL ISLDYV+REF H
Sbjct: 515 CVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYTDCKPGGRLHISLDYVRREFEH 574
Query: 495 VKSDSASVGWVTDGLIARPLPNRIRPGYVEPSVAS 529
K DS SVGWVT+GLIARPLP RIR G+ ++S
Sbjct: 575 AKEDSQSVGWVTEGLIARPLPTRIRLGHDREPLSS 609
>gi|356540900|ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
Length = 592
Score = 686 bits (1770), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/462 (74%), Positives = 378/462 (81%), Gaps = 8/462 (1%)
Query: 71 DAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNK------LWDECRELFVQFM 124
+ V+EVKG K TKL+PD T D FV SA LS L S+ +W +C ++F + M
Sbjct: 128 EPVYEVKGGKWTKLVPDPTDDVFV--SAQQGFLSELSSLKPSQLATFVWLKCSDIFTRLM 185
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLK 184
LPEGFP+SVTSDYL YSLWR+VQGVA Q+SGVLATQ+LLYA+GLGKGAIPTAAAINWVLK
Sbjct: 186 LPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLK 245
Query: 185 DGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAG 244
DGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEM TPA P FV IGA AGA
Sbjct: 246 DGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMSTPACPQFFVLIGAVAGAS 305
Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPF 304
RSAA+LIQASTRSCF+AGFAA+RNFAEVIAKGE QGM S+ IGI+LGI L N IGSS P
Sbjct: 306 RSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIVLGIGLGNCIGSSTPL 365
Query: 305 ALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFP 364
LASF+V+TWIHMYCNLKSYQSI+LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFP
Sbjct: 366 VLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFP 425
Query: 365 AFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEG 424
A A+K+Q LSSEAKDAA EIEHRLQLGSKLS++VN+KED ALF LY++EG
Sbjct: 426 AVPILNATFASKAQSFALSSEAKDAAAEIEHRLQLGSKLSEIVNSKEDVLALFGLYKNEG 485
Query: 425 YILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEIS 484
YIL+EH GK+ VVLKE DMLK+LFQ +YLYWLE+NAGI T D PGGRL IS
Sbjct: 486 YILSEHMGKYSVVLKEKCSQLDMLKALFQVNYLYWLEKNAGIEGRGTLNDSKPGGRLHIS 545
Query: 485 LDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPGYVEPS 526
LDYV+REFNHVK+D VGWVTDGLIARPLPNRI G PS
Sbjct: 546 LDYVEREFNHVKNDGELVGWVTDGLIARPLPNRICIGDTAPS 587
>gi|357484331|ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
gi|355513788|gb|AES95411.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
Length = 630
Score = 679 bits (1751), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/503 (71%), Positives = 406/503 (80%), Gaps = 20/503 (3%)
Query: 33 YSLL--LFVPSLLYCFCHLQVATAIARTAT-SSEDDGNKEYDAVWEVKGSKRTKLIPDFT 89
Y+LL L S+ +C C L A+A+T + SSEDD + ++EVKG KL PD
Sbjct: 87 YTLLFTLLFSSVTFCLCQL----AMAKTRSLSSEDDILTQ--PIYEVKGGNLIKLFPDNL 140
Query: 90 KDAFVVASASNASL-SSLLSVNK------LWDECRELFVQFMLPEGFPDSVTSDYLNYSL 142
KD F+ ASN L S L S+N L+++CRE V+ MLPEGFP+SVTSDYL YSL
Sbjct: 141 KDIFI---ASNPGLFSELSSLNSSQVPTFLYNKCREFVVRLMLPEGFPNSVTSDYLEYSL 197
Query: 143 WRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHF 202
WR VQGVA Q+SGVLATQALLYA+GLGKGAIPTAAAINWVLKDGIGYLSKI+LS+FGRHF
Sbjct: 198 WRGVQGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSDFGRHF 257
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAG 262
DVNPKGWRLFADLLENAAFGLEM TPAFPH FV IGA AGA RSAA+LIQASTRSCF+AG
Sbjct: 258 DVNPKGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAFAGASRSAASLIQASTRSCFFAG 317
Query: 263 FAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLK 322
FAA+RNFAEVIAKGE QGMVS+ IGI +GI L N IGSS P LASF VVTW+HMYCNLK
Sbjct: 318 FAAQRNFAEVIAKGEVQGMVSRFIGIGIGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLK 377
Query: 323 SYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANK-SQLLV 381
SYQSI+LRTLNP+RASLVFSEYLLSGQAPPVKEVN EEPLFPA ANK +Q +V
Sbjct: 378 SYQSIQLRTLNPHRASLVFSEYLLSGQAPPVKEVNAEEPLFPAVPILNAPFANKETQSIV 437
Query: 382 LSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKES 441
LSSEAKDAAVEIE RLQLGSKLS+++NNKE+ ALFSLY++EGYIL+EH GKFCVVLKE+
Sbjct: 438 LSSEAKDAAVEIESRLQLGSKLSEIINNKEEVLALFSLYKNEGYILSEHTGKFCVVLKET 497
Query: 442 ALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSAS 501
DMLK+LFQ +YLYWLE+NAGI T DC PGGRL+ISL+Y +REFNHV++D S
Sbjct: 498 CSQLDMLKALFQVNYLYWLEKNAGIEGRGTLYDCKPGGRLQISLEYAEREFNHVRNDGES 557
Query: 502 VGWVTDGLIARPLPNRIRPGYVE 524
VGW+TDGLIARPLPNR RPG E
Sbjct: 558 VGWITDGLIARPLPNRCRPGNTE 580
>gi|7339490|emb|CAB82813.1| putative protein [Arabidopsis thaliana]
Length = 631
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/512 (68%), Positives = 403/512 (78%), Gaps = 27/512 (5%)
Query: 36 LLFVPSLLYCFCH--LQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAF 93
L F+ L CF H L A+AIA+ +S+ +G+ + VWEV+GSKR +L+PDF KD F
Sbjct: 110 LCFLLLGLSCFFHFRLSAASAIAKD-QNSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEF 168
Query: 94 VVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQI 153
V + S LSS L+ L +CR L QF+LPEGFP+SVTSDYL+YSLWR VQG+ASQI
Sbjct: 169 V-SEESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQI 227
Query: 154 SGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFA 213
SGVLATQ+LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFA
Sbjct: 228 SGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFA 287
Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
DLLENAAFG+EMLTP FP FV IGAAAGAGRSAAALIQA+TRSCF AGFA++RNFAEV
Sbjct: 288 DLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVY 347
Query: 274 -----------------------AKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFS 310
+ GEAQGMVSK++GI+LGI +AN IG+S ALA+F
Sbjct: 348 NNFYMALVLITYQQLFVFLNYSGSLGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFG 407
Query: 311 VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFK 370
VVT IHMY NLKSYQ I+LRTLNPYRASLVFSEYL+SGQAP +KEVNDEEPLFP F
Sbjct: 408 VVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFSN 467
Query: 371 IKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEH 430
+KS K Q VLSSEAK AA +IE RLQLGSKLSDV++NKE+A ALF LY +EGYILTEH
Sbjct: 468 MKSPEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEH 527
Query: 431 GGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQR 490
G+FCV+LKES+ PQDML+SLFQ +YLYWLE+NAGI ST +DC PGGRL ISLDYV+R
Sbjct: 528 KGRFCVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRR 587
Query: 491 EFNHVKSDSASVGWVTDGLIARPLPNRIRPGY 522
EF H K DS SVGWVT+GLIARPLP RIR G+
Sbjct: 588 EFEHAKEDSESVGWVTEGLIARPLPTRIRLGH 619
>gi|449455818|ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 611
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/510 (65%), Positives = 392/510 (76%), Gaps = 11/510 (2%)
Query: 14 GSWWWHGGNGGDDDSSGSFYSLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAV 73
G WW S +F L F S+L CFC Q+A A+AR ++E ++
Sbjct: 112 GFGWWQYDGDSPPWSDNAF--LAFFFSSVLGCFCLFQLAVALARNNMNTE--------SI 161
Query: 74 WEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSV 133
WEVKG KR +LI D +D F VA+ +S S VN +W C ++F + MLPEGFPDSV
Sbjct: 162 WEVKGGKRIRLILDTYRDEFHVATGMPSSSLSFSFVN-VWLRCSDIFTRLMLPEGFPDSV 220
Query: 134 TSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKI 193
TSDYL YSLWR VQG+ASQ+SGVLATQALLYA+GLGKGAIPTAAA+NWVLKDG GYLSKI
Sbjct: 221 TSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI 280
Query: 194 MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQA 253
LS +GRHFDV+PKGWRLFADLLENAA+G+EMLTPAFP HFV IGAAAGAGRSAAALIQA
Sbjct: 281 FLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQA 340
Query: 254 STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
+TRSCFYAGFAA+RNFAEVIAKGEAQGMVSK+IG+MLGI LAN I SS AL FS+VT
Sbjct: 341 ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVT 400
Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKS 373
IHM+CNLKSY+SI+LRTLNPYRASLVFSEYLLSG+ P +K+VN+EEPLFPA K+
Sbjct: 401 LIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKA 460
Query: 374 ANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGK 433
+ S+ +LS+EAK++A IE RLQLGSKLSDV +ED L SL+ E YIL+EH GK
Sbjct: 461 PDWSRDFLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGK 520
Query: 434 FCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFN 493
+CV+LKESA P DMLK++F +YL+WLERNAGI A S S DC PGGRL++SL+YV+REF
Sbjct: 521 YCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFK 580
Query: 494 HVKSDSASVGWVTDGLIARPLPNRIRPGYV 523
HVK D GW TDGLIARPL RI +V
Sbjct: 581 HVKYDGELAGWSTDGLIARPLTTRICECHV 610
>gi|38344695|emb|CAE02373.2| OSJNBb0096E05.17 [Oryza sativa Japonica Group]
Length = 593
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 321/473 (67%), Positives = 372/473 (78%), Gaps = 21/473 (4%)
Query: 66 GNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS----------------ASNASLSSLLSV 109
G +E +VWEV+G RT L+PD T ++VVA A A L++L
Sbjct: 121 GGREGASVWEVRGGARTLLVPDPTGTSYVVAGDGRRKQAEGEDAEKAGAGRAELAALRRQ 180
Query: 110 NKL-WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
+L W C + VQ +LP+G+PDSV+SDYL YSLWR VQG+ASQISGVL+TQALLYA+GL
Sbjct: 181 LELSWRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGL 240
Query: 169 GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
GKGAIPTAAA+NWVLKDG+GYLSKI+LS FGRHFDVNPKGWRLFADLLEN A+GLE+LTP
Sbjct: 241 GKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTP 300
Query: 229 AFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
FPH FV IGAAAGAGRSAAALIQA+TRSCFYAGFA +RNFAEVIAKGEAQGMVSK +GI
Sbjct: 301 VFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGI 360
Query: 289 MLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
MLGIALAN IGSS+ AL SF+ VT +HMYCNLKSYQSI+LRTLNPYRASLVFSEYLLSG
Sbjct: 361 MLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 420
Query: 349 QAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVN 408
Q P VKEVNDEEPLF + +K +LS +AKDAA I RLQLGSKLS+++
Sbjct: 421 QVPSVKEVNDEEPLFLNLSIGTSRKESK----ILSPQAKDAAEIICRRLQLGSKLSEIIE 476
Query: 409 NKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVA 468
NKEDA ALF LY++E Y+LTE+ GKFCV+LKE + P+DMLKSLF +YLYWLE+ GI
Sbjct: 477 NKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLKSLFHVNYLYWLEKYLGIKP 536
Query: 469 TSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPG 521
+ ++ C PGGRLE SLDY QREF+HVK D ++ GWV DGLIARPLP RIR G
Sbjct: 537 SDVASACRPGGRLEASLDYTQREFSHVKLDGSNGGWVMDGLIARPLPVRIRIG 589
>gi|414879009|tpg|DAA56140.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
Length = 593
Score = 609 bits (1570), Expect = e-171, Method: Compositional matrix adjust.
Identities = 316/477 (66%), Positives = 375/477 (78%), Gaps = 25/477 (5%)
Query: 60 TSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASN----------ASLSSLLS- 108
TS +D G+ VWEV+G KRT+L+PD T ++++A S+S L+
Sbjct: 123 TSGDDAGD-----VWEVRGGKRTRLMPDPTWTSYLIAGDDGTKREDGDEKVGSISKDLAA 177
Query: 109 ----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLY 164
+ + W E+ V +LP+G+P SV+SDYLNYSLWR VQGVASQISGVL+TQALLY
Sbjct: 178 LRRQLGRSWRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLY 237
Query: 165 AIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLE 224
A+GLGKGAIPTAAA+NWVLKDG+GYLSKIMLS FGRHFDVNPKGWRLFAD LEN A+GLE
Sbjct: 238 AVGLGKGAIPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLE 297
Query: 225 MLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSK 284
+LTP FPH FV IGAAAGAGRSAA+LIQ++TRSCFYAGFA +RNFAEVIAKGEAQGMVSK
Sbjct: 298 ILTPVFPHLFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSK 357
Query: 285 AIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEY 344
+GI LGIALANHIGSS+P AL SF+ +T +HMYCNLKSYQSI+LRTLNPYRASLVFSEY
Sbjct: 358 FVGIGLGIALANHIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEY 417
Query: 345 LLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLS 404
LLSGQ P VKEVNDEEPL FF + Q +LS+EAKDAA I RLQ+GSKLS
Sbjct: 418 LLSGQVPSVKEVNDEEPL-----FFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLS 472
Query: 405 DVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNA 464
+++ +KEDA ALF LY++E Y+L ++ GKFC+VLKE + P+DMLKSLF +YLYWLER
Sbjct: 473 EIIESKEDACALFDLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLKSLFHVNYLYWLERYM 532
Query: 465 GIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPG 521
G ++ +++C PGGRLE+SLDY QREF+HVK D + GWV DGLIARPLP RIR G
Sbjct: 533 GFKPSNVASECRPGGRLEVSLDYAQREFSHVKHDGSDGGWVMDGLIARPLPVRIRIG 589
>gi|242055389|ref|XP_002456840.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
gi|241928815|gb|EES01960.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
Length = 569
Score = 608 bits (1569), Expect = e-171, Method: Compositional matrix adjust.
Identities = 316/489 (64%), Positives = 381/489 (77%), Gaps = 23/489 (4%)
Query: 52 ATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNAS--------- 102
A+A+A +S DD VWEV+G RT+L+PD T ++++A +
Sbjct: 89 ASALAEALGNSGDDAG----GVWEVRGGTRTRLVPDSTWTSYLIAGDDGSKREDEDGKVG 144
Query: 103 -----LSSLL-SVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGV 156
L++L + + W C ++ VQ +LP+G+P SV+SDY+NYSLWR VQGVASQISGV
Sbjct: 145 DGPEGLAALRRQLGRSWRRCADVAVQLLLPDGYPHSVSSDYMNYSLWRGVQGVASQISGV 204
Query: 157 LATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLL 216
L+TQALLYA+GLG+GAIPTAAA+NWVLKDG+GYLSKIMLS FGRHFDVNPKGWRLFAD L
Sbjct: 205 LSTQALLYAVGLGRGAIPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFL 264
Query: 217 ENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKG 276
EN A+GLE+LTP FPH FV IGAAAGA RSAA+LIQA+TRSCFYAGFA +RNFAEVIAKG
Sbjct: 265 ENMAYGLEILTPVFPHLFVPIGAAAGASRSAASLIQAATRSCFYAGFAVQRNFAEVIAKG 324
Query: 277 EAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
EAQGMVSK +GI LGIALAN IGSS+P AL SF+ +T +HMYCNLKSYQSI+LRTLNPYR
Sbjct: 325 EAQGMVSKFVGIGLGIALANQIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYR 384
Query: 337 ASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHR 396
ASLVFSEYLLSGQ P VKEVNDEEPLF F+ S +S+ +LS+EAKDAA I R
Sbjct: 385 ASLVFSEYLLSGQVPSVKEVNDEEPLF--FNLSLGASCQQSK--ILSAEAKDAADRICRR 440
Query: 397 LQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
LQ+GSKLS+++ +KEDA ALF LY++E Y+L ++ GKFC+VLKE + P+DMLKSLF +Y
Sbjct: 441 LQMGSKLSEIIESKEDACALFDLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLKSLFHVNY 500
Query: 457 LYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPN 516
LYWLER G ++ +++C PGGRLE+SLDY QREF+HVK D + GWV DGLIARPLP
Sbjct: 501 LYWLERYMGFKPSNVASECRPGGRLEVSLDYAQREFSHVKHDGSDGGWVMDGLIARPLPV 560
Query: 517 RIRPGYVEP 525
RIR G P
Sbjct: 561 RIRIGDGAP 569
>gi|357167308|ref|XP_003581100.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 604
Score = 605 bits (1559), Expect = e-170, Method: Compositional matrix adjust.
Identities = 320/491 (65%), Positives = 376/491 (76%), Gaps = 23/491 (4%)
Query: 51 VATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS------------- 97
+ +A+A +S DG +VWEV+GS RT L+PD ++ VA
Sbjct: 122 IPSALAEALGASGGDGA----SVWEVRGSSRTLLVPDPDGTSYRVAEDEKHKQEEGMEKV 177
Query: 98 -ASNASLSSLL-SVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISG 155
A L++L + + W C ++ +Q +LP+G+P+SV+SDYL+YSLWR +QGVASQISG
Sbjct: 178 WAGRDDLAALRWQLEQSWRRCTDVALQLLLPDGYPNSVSSDYLHYSLWRGLQGVASQISG 237
Query: 156 VLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADL 215
VL+TQALLYA+GLGKG IPTAAAINWVLKDG+GYLSKI+LS FGRHFDVNPK WRLFADL
Sbjct: 238 VLSTQALLYAVGLGKGVIPTAAAINWVLKDGLGYLSKILLSKFGRHFDVNPKRWRLFADL 297
Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
LEN A+GLE+LTP FPH FV IGAAAGA RSAAALIQA+TRSCFYAGFA +RNFAEVIAK
Sbjct: 298 LENTAYGLEILTPVFPHLFVPIGAAAGAARSAAALIQAATRSCFYAGFAVQRNFAEVIAK 357
Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
GEAQGMVSK +GIMLGIALAN IGSS+P AL SF+ VT +HMYCNLKSYQSI+LRTLNPY
Sbjct: 358 GEAQGMVSKFLGIMLGIALANQIGSSVPLALISFAGVTVVHMYCNLKSYQSIQLRTLNPY 417
Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH 395
RASLVFSEYLLSGQ VKEVNDEEPLF A Q +LS+EAKDAA I
Sbjct: 418 RASLVFSEYLLSGQVSSVKEVNDEEPLF----LNSSMGAKLKQNKILSAEAKDAADRICR 473
Query: 396 RLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQAS 455
RLQLGSKLS++++N+EDA ALF LY+DE Y+L E+ GKFCVVLKE + P+DMLKS+F +
Sbjct: 474 RLQLGSKLSEIIDNEEDACALFDLYKDEQYLLLEYKGKFCVVLKEGSSPEDMLKSVFHVN 533
Query: 456 YLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLP 515
YLYWLE+ G + ++ C PGGRLE+SLDY QREF HVK D + GWV DGLIARPLP
Sbjct: 534 YLYWLEKYMGPKPCNVASACRPGGRLEVSLDYAQREFRHVKQDGSDGGWVMDGLIARPLP 593
Query: 516 NRIRPGYVEPS 526
RIR G V PS
Sbjct: 594 VRIRVGDVVPS 604
>gi|222628554|gb|EEE60686.1| hypothetical protein OsJ_14164 [Oryza sativa Japonica Group]
Length = 609
Score = 603 bits (1554), Expect = e-170, Method: Compositional matrix adjust.
Identities = 321/489 (65%), Positives = 372/489 (76%), Gaps = 37/489 (7%)
Query: 66 GNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS----------------ASNASLSSLLSV 109
G +E +VWEV+G RT L+PD T ++VVA A A L++L
Sbjct: 121 GGREGASVWEVRGGARTLLVPDPTGTSYVVAGDGRRKQAEGEDAEKAGAGRAELAALRRQ 180
Query: 110 NKL-WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
+L W C + VQ +LP+G+PDSV+SDYL YSLWR VQG+ASQISGVL+TQALLYA+GL
Sbjct: 181 LELSWRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGL 240
Query: 169 GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
GKGAIPTAAA+NWVLKDG+GYLSKI+LS FGRHFDVNPKGWRLFADLLEN A+GLE+LTP
Sbjct: 241 GKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTP 300
Query: 229 AFPHHFVFIGAAAGAGRSAAALIQ----------------ASTRSCFYAGFAARRNFAEV 272
FPH FV IGAAAGAGRSAAALIQ A+TRSCFYAGFA +RNFAEV
Sbjct: 301 VFPHLFVPIGAAAGAGRSAAALIQTIFTHFMLENANHMRKAATRSCFYAGFAVQRNFAEV 360
Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
IAKGEAQGMVSK +GIMLGIALAN IGSS+ AL SF+ VT +HMYCNLKSYQSI+LRTL
Sbjct: 361 IAKGEAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTL 420
Query: 333 NPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
NPYRASLVFSEYLLSGQ P VKEVNDEEPLF + +K +LS +AKDAA
Sbjct: 421 NPYRASLVFSEYLLSGQVPSVKEVNDEEPLFLNLSIGTSRKESK----ILSPQAKDAAEI 476
Query: 393 IEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLF 452
I RLQLGSKLS+++ NKEDA ALF LY++E Y+LTE+ GKFCV+LKE + P+DMLKSLF
Sbjct: 477 ICRRLQLGSKLSEIIENKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLKSLF 536
Query: 453 QASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIAR 512
+YLYWLE+ GI + ++ C PGGRLE SLDY QREF+HVK D ++ GWV DGLIAR
Sbjct: 537 HVNYLYWLEKYLGIKPSDVASACRPGGRLEASLDYTQREFSHVKLDGSNGGWVMDGLIAR 596
Query: 513 PLPNRIRPG 521
PLP RIR G
Sbjct: 597 PLPVRIRIG 605
>gi|226491780|ref|NP_001140263.1| uncharacterized protein LOC100272306 [Zea mays]
gi|194698746|gb|ACF83457.1| unknown [Zea mays]
Length = 451
Score = 590 bits (1521), Expect = e-166, Method: Compositional matrix adjust.
Identities = 296/413 (71%), Positives = 344/413 (83%), Gaps = 5/413 (1%)
Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
+ + W E+ V +LP+G+P SV+SDYLNYSLWR VQGVASQISGVL+TQALLYA+GL
Sbjct: 40 LGRSWRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGL 99
Query: 169 GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
GKGAIPTAAA+NWVLKDG+GYLSKIMLS FGRHFDVNPKGWRLFAD LEN A+GLE+LTP
Sbjct: 100 GKGAIPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTP 159
Query: 229 AFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
FPH FV IGAAAGAGRSAA+LIQ++TRSCFYAGFA +RNFAEVIAKGEAQGMVSK +GI
Sbjct: 160 VFPHLFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGI 219
Query: 289 MLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
LGIALANHIGSS+P AL SF+ +T +HMYCNLKSYQSI+LRTLNPYRASLVFSEYLLSG
Sbjct: 220 GLGIALANHIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 279
Query: 349 QAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVN 408
Q P VKEVNDEEPL FF + Q +LS+EAKDAA I RLQ+GSKLS+++
Sbjct: 280 QVPSVKEVNDEEPL-----FFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLSEIIE 334
Query: 409 NKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVA 468
+KEDA ALF LY++E Y+L ++ GKFC+VLKE + P+DMLKSLF +YLYWLER G
Sbjct: 335 SKEDACALFDLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLKSLFHVNYLYWLERYMGFKP 394
Query: 469 TSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPG 521
++ +++C PGGRLE+SLDY QREF+HVK D + GWV DGLIARPLP RIR G
Sbjct: 395 SNVASECRPGGRLEVSLDYAQREFSHVKHDGSDGGWVMDGLIARPLPVRIRIG 447
>gi|218194530|gb|EEC76957.1| hypothetical protein OsI_15243 [Oryza sativa Indica Group]
Length = 836
Score = 516 bits (1328), Expect = e-143, Method: Compositional matrix adjust.
Identities = 268/363 (73%), Positives = 303/363 (83%), Gaps = 4/363 (1%)
Query: 161 ALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAA 220
ALLYA+GLGKGAIPTAAA+NWVLKDG+GYLSKI+LS FGRHFDVNPKGWRLFADLLEN A
Sbjct: 476 ALLYAVGLGKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTA 535
Query: 221 FGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQG 280
+GLE+LTP FPH FV IGAAAGAGRSAAALIQA+TRSCFYAGFA +RNFAEVIAKGEAQG
Sbjct: 536 YGLEILTPVFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQG 595
Query: 281 MVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
MVSK +GIMLGIALAN IGSS+ AL SF+ VT +HMYCNLKSYQSI+LRTLNPYRASLV
Sbjct: 596 MVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLV 655
Query: 341 FSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLG 400
FSEYLLSGQ P VKEVNDEEPLF + +K +LS +AKDAA I RLQLG
Sbjct: 656 FSEYLLSGQVPSVKEVNDEEPLFLNLSVGTSRKESK----ILSPQAKDAAEIICRRLQLG 711
Query: 401 SKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
SKLS+++ NKEDA ALF LY++E Y+LTE+ GKFCV+LKE + P+DMLKSLF +YLYWL
Sbjct: 712 SKLSEIIENKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLKSLFHVNYLYWL 771
Query: 461 ERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRP 520
E+ GI + ++ C PGGRLE SLDY QREF+HVK D ++ GWV DGLIARPLP RIR
Sbjct: 772 EKYLGIKPSDVASACRPGGRLEASLDYTQREFSHVKLDGSNGGWVMDGLIARPLPVRIRI 831
Query: 521 GYV 523
G V
Sbjct: 832 GDV 834
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 146/235 (62%), Positives = 172/235 (73%), Gaps = 24/235 (10%)
Query: 66 GNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS----------------ASNASLSSLL-S 108
G +E +VWEV+G RT L+PD T ++VVA A A L++L
Sbjct: 246 GGREGASVWEVRGGARTLLVPDPTGTSYVVAGDGRRKQAEGEDAEKAGAGRAELAALRRQ 305
Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
+ + W C + VQ +LP+G+PDSV+SDYL YSLWR VQG+ASQISGVL+TQALLYA+GL
Sbjct: 306 LERSWRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGL 365
Query: 169 GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
GKGAIPTAAA+NWVLKDG+GYLSKI+LS FGRHFDVNPKGWRLFADLLEN A+GLE+LTP
Sbjct: 366 GKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTP 425
Query: 229 AFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAAR--RNFAEVIAKGEAQGM 281
FPH FV IGAAAGAGRSAAALIQ +A F + +VIAKGEAQGM
Sbjct: 426 VFPHLFVPIGAAAGAGRSAAALIQT-----IFAHFMLENANHMRKVIAKGEAQGM 475
>gi|168005499|ref|XP_001755448.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693576|gb|EDQ79928.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 503
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 251/479 (52%), Positives = 330/479 (68%), Gaps = 4/479 (0%)
Query: 49 LQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLS 108
LQ +AIA D K +WEVKG + L+ +D FVV + N+ +L
Sbjct: 24 LQQQSAIALLGPREVVD--KARTGIWEVKGGRWKYLVKHPERDEFVVQTVKNSEEEALAY 81
Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
+ EL Q +LP+G+P SVT DY Y+LWR Q +ASQISGVL TQALLYA+GL
Sbjct: 82 EEDVITRFVELGKQIVLPDGYPQSVTDDYFEYTLWRMGQVIASQISGVLTTQALLYAVGL 141
Query: 169 GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
GKGAIPTAAA+NWVL+DGIGYLSKI+LS +GRHFDV+PKGWRL +DL+EN ++GLE+LTP
Sbjct: 142 GKGAIPTAAAVNWVLRDGIGYLSKIVLSKYGRHFDVHPKGWRLVSDLIENMSYGLELLTP 201
Query: 229 AFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
AFPH FV++ AAAGAGRSA+ LIQA+T+SCF AG AA RNFAE+IAKGEAQGMVSK+IGI
Sbjct: 202 AFPHLFVYLAAAAGAGRSASGLIQAATKSCFNAGMAANRNFAEIIAKGEAQGMVSKSIGI 261
Query: 289 MLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
LGIA++ ++GS+ P + +F VT +H++CNLKSYQ+++LRTLNPYRASLV +EY SG
Sbjct: 262 ALGIAVSAYVGSTGPLLVGTFFGVTALHIFCNLKSYQAVQLRTLNPYRASLVLAEYFRSG 321
Query: 349 QAPPVKEVNDEEPLFPAFHFFKIK-SANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVV 407
V+EVN EEP+FP F FF + +K +LS E+K A IE+ L+ G+ V
Sbjct: 322 NVVSVREVNGEEPIFP-FSFFNFRLRGSKGDGTLLSEESKAVAHSIENHLEFGATFGAAV 380
Query: 408 NNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIV 467
+E+A AL +Y+ E Y+L++H + VVLKE A P+DML+++ QA YL +L+
Sbjct: 381 KTREEADALLDIYKSERYLLSQHDDRLLVVLKEGATPRDMLRAMMQAIYLDYLQGTGFTS 440
Query: 468 ATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPGYVEPS 526
+ D GG L +S D+V +F ++ D A VGW+ +GLIARP PNR+ P+
Sbjct: 441 HRNLMHDSGNGGVLRVSHDFVVNQFEQIRQDIAGVGWICEGLIARPAPNRLLESRPSPT 499
>gi|302816607|ref|XP_002989982.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
gi|300142293|gb|EFJ08995.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
Length = 440
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 228/424 (53%), Positives = 291/424 (68%), Gaps = 10/424 (2%)
Query: 96 ASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISG 155
A A + S L V+++ C + MLP G+P +V+ DY Y+LWR Q +ASQISG
Sbjct: 25 AGAVSTGRSRKLGVDEIVSWCLDALKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISG 84
Query: 156 VLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADL 215
VL TQALLYA+GLGKGAIPTAAAINWVLKDGIGYLSKI+LS +GRHFDV+PKGWRL +DL
Sbjct: 85 VLTTQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDL 144
Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
+E+ A GLE+LTP FPH FV + A AGAG+SAA LIQA+TRSCFYAGFA + NFAEVIAK
Sbjct: 145 IEDGACGLELLTPTFPHLFVPLTAVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAK 204
Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
GEAQGMVSK++GI LGI +++ +GSS P + SF VT +H+ CNLKSYQ+++L TLNPY
Sbjct: 205 GEAQGMVSKSLGIALGIGISSKVGSSGPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPY 264
Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH 395
R L+ +EYL + + P V++VN EP+F FF K A S L +S E+K AA +IE
Sbjct: 265 RTGLIMAEYLTTQRIPSVRDVNSVEPIFTVADFFSSKKAPVSTHLSISCESKCAAADIEK 324
Query: 396 RLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT-EHGGKFCVVLKESALPQDMLKSLFQA 454
L+LG + V ++ A AL LY E Y+LT + F VVLK+SA P DML+++ QA
Sbjct: 325 NLKLGVSFGEAVASRGAAQALLDLYSGEEYLLTYDDSHCFKVVLKKSASPHDMLRAVVQA 384
Query: 455 SYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPL 514
SYL + S S D L++S +Q+ FN + + GW T GL+ARP
Sbjct: 385 SYLRHIHNQ-----QSLSKD----ELLKLSYQSMQQSFNELIQGLQAAGWSTSGLVARPG 435
Query: 515 PNRI 518
NR+
Sbjct: 436 SNRL 439
>gi|449484579|ref|XP_004156921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 443
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 228/308 (74%), Positives = 259/308 (84%), Gaps = 1/308 (0%)
Query: 67 NKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLP 126
N +++WEVKG KR +LI D +D F VA+ +S S VN +W C ++F + MLP
Sbjct: 125 NMNTESIWEVKGGKRIRLILDTYRDEFHVATGMPSSSLSFSFVN-VWLRCSDIFTRLMLP 183
Query: 127 EGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDG 186
EGFPDSVTSDYL YSLWR VQG+ASQ+SGVLATQALLYA+GLGKGAIPTAAA+NWVLKDG
Sbjct: 184 EGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDG 243
Query: 187 IGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
GYLSKI LS +GRHFDV+PKGWRLFADLLENAA+G+EMLTPAFP HFV IGAAAGAGRS
Sbjct: 244 FGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRS 303
Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL 306
AAALIQA+TRSCFYAGFAA+RNFAEVIAKGEAQGMVSK+IG+MLGI LAN I SS AL
Sbjct: 304 AAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLAL 363
Query: 307 ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
FS+VT IHM+CNLKSY+SI+LRTLNPYRASLVFSEYLLSG+ P +K+VN+EEPLFPA
Sbjct: 364 GCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAV 423
Query: 367 HFFKIKSA 374
K A
Sbjct: 424 PLLNRKLA 431
>gi|302771033|ref|XP_002968935.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
gi|300163440|gb|EFJ30051.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
Length = 389
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 220/399 (55%), Positives = 278/399 (69%), Gaps = 14/399 (3%)
Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT 175
C + MLP G+P +V+ DY Y+LWR Q +ASQISGVL TQALLYA+GLGKGAIPT
Sbjct: 4 CLDALKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISGVLTTQALLYAVGLGKGAIPT 63
Query: 176 AAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
AAAINWVLKDGIGYLSKI+LS +GRHFDV+PKGWRL +DL+E+ A GLE+LTP FPH FV
Sbjct: 64 AAAINWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFV 123
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
+ A AGAG+SAA LIQA+TRSCFYAGFA + NFAEVIAKGEAQGMVSK++GI LGI ++
Sbjct: 124 PLTAVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGIS 183
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+ +GSS P + SF VT +H+ CNLKSYQ+++L TLNPYR L+ +EYL + + P V++
Sbjct: 184 SKVGSSGPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPYRTGLIMAEYLTTQRIPSVRD 243
Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
VN EP+F FF K A L +S E+K AA +IE L+LG + V ++ A A
Sbjct: 244 VNSVEPIFTVADFFSSKKAP----LSISCESKCAAADIEKNLKLGVSFGEAVASRGAAQA 299
Query: 416 LFSLYEDEGYILT-EHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSAD 474
L LY E Y+LT + F VVLK+SA P DML+++ QASYL + S S D
Sbjct: 300 LLDLYSGEEYLLTYDDSHCFKVVLKKSASPHDMLRAVVQASYLRHIHNQ-----QSLSKD 354
Query: 475 CAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARP 513
L++S +Q+ FN + + GW T GL+ARP
Sbjct: 355 ----ELLKLSYQSMQQSFNELIQGLQAAGWSTSGLVARP 389
>gi|147807901|emb|CAN62157.1| hypothetical protein VITISV_025492 [Vitis vinifera]
Length = 582
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 228/426 (53%), Positives = 280/426 (65%), Gaps = 45/426 (10%)
Query: 27 DSSGSFYSLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIP 86
+++ ++LLFV S+LY F H Q+ A+++ KE + VWEV+G K K+IP
Sbjct: 130 ETAHKLRAVLLFVFSVLYSFFHFQLDAALSK---------EKEEEGVWEVRGGKWHKIIP 180
Query: 87 DFTKDAFVVAS----ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSL 142
D +KD F+V + A A SS L LW +C+ELF++ MLPEGFP SVTSDYL+Y+L
Sbjct: 181 DSSKDEFLVVTPGIGAVGAPKSSTLP--NLWLQCKELFLRLMLPEGFPHSVTSDYLDYTL 238
Query: 143 WRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHF 202
WR VQGVASQISGVLATQALLYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +GRHF
Sbjct: 239 WRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHF 298
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ--------AS 254
DV+PKGWRLFADLLENAA+GLE+LTPAFPH F+ IGA AGAGRSAAALIQ S
Sbjct: 299 DVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGAVAGAGRSAAALIQLLFHTHNHLS 358
Query: 255 TRSCFYAGFAARRNFA------EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALAS 308
+ + + N A ++ GE G + G L S++ L
Sbjct: 359 SFALLLGSYFPLNNPAIGNSKRKMPTHGEGDGG-ENVTAFLGGPRLLXGAVSTLALLLQ- 416
Query: 309 FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHF 368
+++ + + P R LVFSEYLLSGQ P +KEVN+EEPLFP
Sbjct: 417 -------------RNFAEVIAKGEAP-RNGLVFSEYLLSGQVPSIKEVNEEEPLFPVVPL 462
Query: 369 FKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT 428
K K+Q VLS+EAKDAA EIE RLQLGSKLS+VV++KED ALF LY +E YILT
Sbjct: 463 LNAKPXYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALFDLYRNEAYILT 522
Query: 429 EHGGKF 434
EH G+F
Sbjct: 523 EHKGRF 528
>gi|414879010|tpg|DAA56141.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
Length = 368
Score = 290 bits (742), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 162/254 (63%), Positives = 193/254 (75%), Gaps = 27/254 (10%)
Query: 60 TSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASN----------ASLSSLLS- 108
TS +D G+ VWEV+G KRT+L+PD T ++++A S+S L+
Sbjct: 123 TSGDDAGD-----VWEVRGGKRTRLMPDPTWTSYLIAGDDGTKREDGDEKVGSISKDLAA 177
Query: 109 ----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLY 164
+ + W E+ V +LP+G+P SV+SDYLNYSLWR VQGVASQISGVL+TQALLY
Sbjct: 178 LRRQLGRSWRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLY 237
Query: 165 AIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLE 224
A+GLGKGAIPTAAA+NWVLKDG+GYLSKIMLS FGRHFDVNPKGWRLFAD LEN A+GLE
Sbjct: 238 AVGLGKGAIPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLE 297
Query: 225 MLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSK 284
+LTP FPH FV IGAAAGAGRSAA+LIQ++TRSCFYAGFA +RNFAEVIAKGEAQGM +
Sbjct: 298 ILTPVFPHLFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMFTC 357
Query: 285 AIGIMLGIALANHI 298
IA ++HI
Sbjct: 358 -------IATSSHI 364
>gi|452824810|gb|EME31810.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 636
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 123/335 (36%), Positives = 192/335 (57%), Gaps = 25/335 (7%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIP-TAAAINWVL 183
LP+G+P+SVT DYL+YS+WR++Q +A+ + GV AT+ALL+ +GLG+ + P T+AAI+WVL
Sbjct: 280 LPDGYPNSVTKDYLSYSIWRAIQNLAASVMGVFATEALLFGLGLGRKSTPATSAAISWVL 339
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KDG+GY+ K+ + G FDV+PK WRL AD +E+A LE++TP FP HF+ + + A
Sbjct: 340 KDGLGYIGKVFYGSIAGNQFDVDPKSWRLAADAVEDAGGVLEIITPLFPGHFLLLASVAN 399
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
A + AA+ +TR Y A N ++ KGE+QG+ K +G+ LGI ++ +G +
Sbjct: 400 AMKGVAAMTGTATRHAIYKSLALHENQGDIATKGESQGVTCKMVGLGLGIVASSKMGQNY 459
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
L+++++ ++H++ N KS ++ TLN R SL E+L +G P EV+ E +
Sbjct: 460 FALLSAYALGCFVHLFANWKSLSCVQFATLNRQRCSLALQEFLQTGNVPSPYEVSRREKV 519
Query: 363 FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYED 422
+ K L R+ +G+ L + N + +LY D
Sbjct: 520 --------VVPPWKGYL---------------DRIVVGANLKECFQNGRELREAVALYRD 556
Query: 423 EGYILTEHGGKFCVVLKESALPQDMLKSLFQASYL 457
YILT+ K +VL ES PQD+ ++ QA L
Sbjct: 557 CRYILTQRDNKILIVLHESCRPQDIFQAFMQAHKL 591
>gi|301096265|ref|XP_002897230.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107315|gb|EEY65367.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 474
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 191/366 (52%), Gaps = 17/366 (4%)
Query: 91 DAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVA 150
D + AS + L+++ L E +E+F LP G+PDSV+ DYL + W ++Q +
Sbjct: 50 DQLLSTGQGRASNAGCLTLSGLSRELQEMF----LPAGYPDSVSEDYLGFQFWDTLQAMC 105
Query: 151 SQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKG 208
S + GVLATQ++L ++G+G A P AAA+ WVL+DG G + + + F G FDVN K
Sbjct: 106 SYLRGVLATQSVLQSVGVGDDKATPLAAALQWVLRDGSGMIGGLTFAYFVGPKFDVNVKR 165
Query: 209 WRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRN 268
WRLFAD++ + L+M+ P FP + + ++ + +TRS FA R N
Sbjct: 166 WRLFADVVNDVGLTLDMVAPYFPTLVTEVLCVSSVCKAMCGVAAGATRSSLMTHFAKRDN 225
Query: 269 FAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIE 328
A+ AK +Q K G+M G+ AN + SS +F V+T +H+Y N + +
Sbjct: 226 MADCAAKEGSQETAVKLFGLMFGMYFANAVNSSPHAVWMAFLVLTLVHVYANYNAVSCLC 285
Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKD 388
+ T+N R ++ +L+SG P + V D + + I+S N+ + + KD
Sbjct: 286 IPTVNCSRGLILVQRFLMSGNKPGKENVADHQGR------YSIRSVNQEEPIF-----KD 334
Query: 389 AAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDML 448
+ + L +GS+L V D L +YEDE Y+LT G + V+ + A P D L
Sbjct: 335 PVLPVTSHLMMGSELHQAVPTSTDLDRLLQIYEDETYLLTVVGDRVHVMFEAEAKPTDEL 394
Query: 449 KSLFQA 454
++ FQA
Sbjct: 395 RAFFQA 400
>gi|348671592|gb|EGZ11413.1| hypothetical protein PHYSODRAFT_338124 [Phytophthora sojae]
Length = 478
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 115/355 (32%), Positives = 187/355 (52%), Gaps = 17/355 (4%)
Query: 102 SLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQA 161
S + L+++ L E +E+F LP G+PDSV+ DYL++ W ++Q + S + GVLATQ+
Sbjct: 65 SAAGCLTLSGLSRELQEMF----LPAGYPDSVSEDYLSFQFWDTLQAMCSYLRGVLATQS 120
Query: 162 LLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENA 219
+L ++G+G A P AAA+ WVL+DG G + + + G FDVN K WRLFAD++ +
Sbjct: 121 VLQSVGVGDDKATPLAAALQWVLRDGSGMIGGLTFAYLVGPKFDVNVKRWRLFADVINDV 180
Query: 220 AFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
L+M+ P FP + + ++ + +TRS FA + N A+ AK +Q
Sbjct: 181 GLTLDMVAPYFPTLVTEVLCVSSVCKTMCGVAAGATRSSLMTHFAKKDNMADCAAKEGSQ 240
Query: 280 GMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
K G+M G+ AN + SS +F ++T IH+Y N + + + T+N R +
Sbjct: 241 ETAVKLFGLMFGMYFANAVNSSPQAVWVAFLLLTVIHVYANYNAVSCLCIPTVNCSRGLI 300
Query: 340 VFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL 399
+ +L SG P + V D + I+S N+ + + KD + + L +
Sbjct: 301 LVRRFLASGNKPGKENVADHN------GRYSIRSVNQEEPIF-----KDPVLPVTSHLVM 349
Query: 400 GSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
G++L V + D L +Y DE Y+LT G + V+ + A P+D L++ FQA
Sbjct: 350 GAELHQAVPSSHDLDRLLHIYSDEAYLLTVVGDRVLVMFQAEAKPKDELRAFFQA 404
>gi|452823212|gb|EME30224.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 849
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 125/387 (32%), Positives = 206/387 (53%), Gaps = 33/387 (8%)
Query: 74 WEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSV 133
++V+GSKR L D + F +AS L S K W R ++F+ FP+ V
Sbjct: 372 YDVEGSKRRSLEWDESSARFKLASHDEEGLDWSTSW-KTWLTKR---IEFLRQSFFPEGV 427
Query: 134 TSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKI 193
+ DY ++++WR Q + S GV TQALL A+G+ G I AAA +WVLKDG+G L K+
Sbjct: 428 SQDYYSFTMWRVFQRLVSSTVGVFGTQALLLALGIKAGRIGRAAATSWVLKDGLGRLGKM 487
Query: 194 M-LSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
+ S GR FD +PK WR + LL GLE++T FP F+ A + + + L
Sbjct: 488 VWASQMGRDFDADPKRWRFRSALLYALGNGLEIVTQIFPASFLLFATLANSMKQISMLTA 547
Query: 253 ASTRSCFYAGFAAR-RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSV 311
++TR+ Y FA R N A++ AKGEAQ +V+ +G+ LGI L+ IG+S P L ++S
Sbjct: 548 SATRNAMYKNFAGRSENIADITAKGEAQIVVADLLGMALGIQLSKLIGTSRPNVLTAYSF 607
Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKI 371
++ + +Y K ++++ RTLN R+S++ ++ G P EV+ E +F + +
Sbjct: 608 LSVLDIYAFYKELRAVQFRTLNYERSSMIVDYFVRRGYVPRPDEVSQRENIFLGPR-YDV 666
Query: 372 KSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEH- 430
+S + + LSD V+ ++ ++L +++ E +++T
Sbjct: 667 RS-------------------------MFASLSDAVSCPQELNSLIKIFKGEQFMVTRGW 701
Query: 431 GGKFCVVLKESALPQDMLKSLFQASYL 457
G++ +VL+E A D+L++L +L
Sbjct: 702 NGQYRIVLREHARNGDILRALLTLGFL 728
>gi|449016609|dbj|BAM80011.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 496
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 122/404 (30%), Positives = 195/404 (48%), Gaps = 29/404 (7%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTA 176
+ F + LP+G+PDSV DY Y+ WR + GV +TQ+LL A+G+G+ A P A
Sbjct: 118 DRFSRMFLPDGYPDSVGPDYAEYTKWRGRAFFFGGMVGVFSTQSLLLAVGVGRPSAAPIA 177
Query: 177 AAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
AA+ WV++DG+G +++ S G FD K +RL A + N + LE LTP FPH F+
Sbjct: 178 AALQWVIRDGLGRAGRMLFSQVGTGFDAETKQYRLAAAFVLNVSCALEALTPLFPHLFLP 237
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ + A + A+ + ASTR Y F R N ++ AK E G+ +G +G+ L+
Sbjct: 238 LASLANMAKGASTVAAASTRGAIYRSFMRRENLGDITAKQETVGVAGDLLGTAVGVILSR 297
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL-SGQAPPVKE 355
S ++ +F + H+ +S+ELRTLN RA ++ YL + P+++
Sbjct: 298 ASSHSHRLSVIAFIGASLGHLISAYYEVKSVELRTLNRQRAHMLIWNYLEDQDRVLPIRD 357
Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
VN E L + + N +++G++LSD + +
Sbjct: 358 VNRRERLVYRPWLDSLHAPN---------------------VEIGARLSDAAPDADALKF 396
Query: 416 LFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADC 475
L LY DE Y++ GG+ +VL+ A QD LK+ FQ+ + L +G + A
Sbjct: 397 LLRLYRDEKYMMNYAGGRVRIVLRNDATHQDQLKAYFQSRAFWRLYMTSGERFSPNRA-- 454
Query: 476 APGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIR 519
+E S + F D + GW D ++ RPL R++
Sbjct: 455 ----LIEDSYRATNQLFPSFVRDCRAEGWNVDCVLLRPLARRVQ 494
>gi|449528331|ref|XP_004171158.1| PREDICTED: uncharacterized protein LOC101231720 [Cucumis sativus]
Length = 157
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 92/153 (60%), Positives = 114/153 (74%)
Query: 371 IKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEH 430
+K+ + +L +LS+EAK++A IE RLQLGSKLSDV +ED L SL+ E YIL+EH
Sbjct: 4 VKALKEPKLSLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEH 63
Query: 431 GGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQR 490
GK+CV+LKESA P DMLK++F +YL+WLERNAGI A S S DC PGGRL++SL+YV+R
Sbjct: 64 RGKYCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVER 123
Query: 491 EFNHVKSDSASVGWVTDGLIARPLPNRIRPGYV 523
EF HVK D GW TDGLIARPL RI +V
Sbjct: 124 EFKHVKYDGELAGWSTDGLIARPLTTRICECHV 156
>gi|449016577|dbj|BAM79979.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 719
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 132/415 (31%), Positives = 212/415 (51%), Gaps = 34/415 (8%)
Query: 108 SVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIG 167
S L + R +Q LP G+P +VT DYL ++LWR +Q A+ + GV +T+ALL +G
Sbjct: 326 SERSLVERLRNWLIQVYLPHGYPHTVTQDYLAFTLWRLLQNTAASVMGVFSTEALLLGLG 385
Query: 168 LGKGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEML 226
LGK TA A+ W+LKDG G+ KI + G+ FD++P+ WR+ +DLLE+ A LE++
Sbjct: 386 LGKNVSGTAQAVQWILKDGFGHAGKIAYAAVAGKQFDMDPRSWRIMSDLLEDLAGVLEII 445
Query: 227 TPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
TP F F+ + + A R+ AA+ +TR + A N ++ KGE+QG+ +K I
Sbjct: 446 TPIFSGQFLVLASIATTMRAIAAMTGTATRHAIFKSMALAENQGDLATKGESQGVTTKLI 505
Query: 287 GIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
G+ LGI L+ IG L ++ + IH+ N K+ Q +E TLN R +L+ + +
Sbjct: 506 GLGLGITLSKRIGQDYTNLLIAYGIAAVIHLAANFKAMQCVEFSTLNRQRMALLLDAF-M 564
Query: 347 SGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDV 406
S +A ++ + + L P + K + ++ +D EI G+ L V
Sbjct: 565 SARAQTSRQASRQLTL-PT----PREICAKERFMI--PPWRDYRPEIV----FGASLQRV 613
Query: 407 VNNKEDAHALFSLYEDEGYIL-TEHGGKFCVV-LKESALPQDMLKSLFQASYLYWLERNA 464
+ + H +LY +E YI+ E K +V L++ A +D LK+ A Y L+R+A
Sbjct: 614 KHARHFLHCA-ALYRNERYIIYAEPSTKHVLVFLRDGATVKDQLKAFMNAQR-YLLDRDA 671
Query: 465 GIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIR 519
E + ++ QREF+ + + GW T+ + P RIR
Sbjct: 672 -----------------ERAYNFTQREFSTFLQATVASGWSTNVSLLNPTSYRIR 709
>gi|357487621|ref|XP_003614098.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
gi|355515433|gb|AES97056.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
Length = 749
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 127/400 (31%), Positives = 207/400 (51%), Gaps = 34/400 (8%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
F+Q +PEGFP SVT DY+ + +W +QG+++ I +L+TQALL AIG+G K A A
Sbjct: 363 FLQAFVPEGFPSSVTPDYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGAT 422
Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
W L+D G L I+ + + G + D N K WRL ADL+ + +++++P FP FVFI
Sbjct: 423 FQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVFI 482
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
RS + +TR+ FA + N A++ AK +Q V+ IG+ LG+ +A
Sbjct: 483 VCLGSISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVA-R 541
Query: 298 IGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
I + P A+ SF +T HMY N ++ Q + L +LNP R+S++ ++ S Q K+V
Sbjct: 542 ITTGHPLAIWFSFLSLTLFHMYANYRAVQCLSLNSLNPERSSILLQRFMESRQVLSPKQV 601
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLS--DVVNNKEDAH 414
+ E + P QL +SS+ ++ ++ +++LG+++S D + KE
Sbjct: 602 SPLEHVLPI------------QLTSMSSKKVNS---LDSKVRLGTRISSFDEMEIKEHLL 646
Query: 415 ALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSAD 474
+L S Y YIL E G V++ + + D+LKS A L N S +D
Sbjct: 647 SLASYYAKAKYILVEKKGIIDVIVHKDSSSADVLKSFIHALVL----ANNVYKGKSLHSD 702
Query: 475 CAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPL 514
S +++ ++ S+GW T+ L++ P+
Sbjct: 703 ---------SQMWMENQYEVFIQKLKSLGWKTERLLSSPI 733
>gi|449018781|dbj|BAM82183.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 631
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 113/340 (33%), Positives = 174/340 (51%), Gaps = 35/340 (10%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLK 184
LPEGFP+SV YL YSLWR Q V S ++ VL+T ALL A+GL + P AAA +WVLK
Sbjct: 243 LPEGFPESVGPSYLRYSLWRGAQNVVSAMTAVLSTHALLAAVGLS--STPVAAATSWVLK 300
Query: 185 DGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAG 244
DGIG L K++ + GR FD +PK +RL +DLL +A LE+LTP FP +F+ + A
Sbjct: 301 DGIGQLGKLLTARRGRDFDADPKRYRLSSDLLYDAGLSLEILTPLFPQYFLALAALGNFT 360
Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPF 304
+S A + + R+ + F R N ++ AK +AQ +V+ G+ LGI A +
Sbjct: 361 KSVAITVGMACRNSVLSSFVLRGNLGDISAKNDAQNVVTNLTGMGLGIVTARALPPKPSV 420
Query: 305 ALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG--QAPPVKEVNDEEPL 362
L +F+ +T ++ N +S ++++L +N R +++ ++ S V+ VND E +
Sbjct: 421 RLGAFAALTALYSVLNYQSMKAVKLNVVNRQRGAIIADSFVRSHAEDVLDVETVNDLECI 480
Query: 363 FPAF--HFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLY 420
P HF + L+LG L V
Sbjct: 481 IPFVPEHFQWLP------------------------LKLGVSLERV-----GRLPPPQRR 511
Query: 421 EDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
Y+++ + + V L E A P+D+L+ L Q++YL +L
Sbjct: 512 PRARYVMSMNERELRVALHERAAPRDLLQVLLQSAYLRYL 551
>gi|302764074|ref|XP_002965458.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
gi|300166272|gb|EFJ32878.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
Length = 430
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 116/397 (29%), Positives = 195/397 (49%), Gaps = 41/397 (10%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVL 183
++PEGFPDSV Y Y WR +Q GV T++LL+A+G+ GA +A A+NWV+
Sbjct: 71 VVPEGFPDSVLPCYTPYMQWRCLQYFFGGAMGVFTTRSLLHALGVRNGASSSAIAVNWVV 130
Query: 184 KDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGA 243
KDG G + K++ + G+ FD + K R D+L A LE+ T A P F+ + A
Sbjct: 131 KDGAGRIGKMIFARHGKKFDCHLKQIRFAGDVLMQLAAALELATSATPQFFLPLACIANI 190
Query: 244 GRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMP 303
G++ AA+ STR+ Y FA R N ++ AKGE G ++ +G +GI ++ + P
Sbjct: 191 GKNVAAVASTSTRAPIYKAFARRENIGDITAKGECIGNIADLLGTGMGILMSKNFPVFAP 250
Query: 304 FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
F+ +F ++Y + + ++I+L TLN +R + +L +G+ P + E N+ E
Sbjct: 251 FSALAFG-----YVYSSFREVKAIQLPTLNRHRFGIAVDTFLETGKVPGLVEANERE--- 302
Query: 364 PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDE 423
+I + L +L+LG+++SD + +D + SL++ E
Sbjct: 303 ------RIIVGPQWSL---------------EKLELGARVSDAFSAPQDYQYIDSLFKGE 341
Query: 424 GYILT--EHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRL 481
Y+++ + VVLKE A D+L++ F I +SA+ G
Sbjct: 342 NYLVSYNPKRQRTYVVLKERANGDDVLRAAFHGQLF------VNIFDKCSSAE----GAY 391
Query: 482 EISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRI 518
+ S Y+ + H K D+ +GW+ + P R+
Sbjct: 392 QESCKYISPLYEHFKLDAEKLGWIMSETLLSPGNARV 428
>gi|356497988|ref|XP_003517837.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 473
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 124/388 (31%), Positives = 197/388 (50%), Gaps = 28/388 (7%)
Query: 75 EVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVT 134
E GS TKL FT A +S+S S +W + FV PEGFP SVT
Sbjct: 50 EWNGSSPTKLSKTFTIKA---SSSSVLITRSGARFTHVWRRLLQAFV----PEGFPSSVT 102
Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKI 193
+DY+ + +W +QG+++ I +L+TQALL AIG+G K A A W L+D G L I
Sbjct: 103 ADYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRDLTGMLGGI 162
Query: 194 MLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
+ + + G + D N K WRL ADL+ + +++++P FP FVF+ RS +
Sbjct: 163 LFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGSISRSFTGVAS 222
Query: 253 ASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSV 311
+TR+ FA + N A++ AK +Q V+ IG+ LG+ +A + P A+ SF
Sbjct: 223 GATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVAR-LTIGHPLAIWFSFLS 281
Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKI 371
+T HMY N ++ + + L +LNP R+S++ ++ +GQ K+V+ +E + P
Sbjct: 282 LTVFHMYANYRAVRCLALNSLNPERSSILLQHFMETGQVLSPKQVSSQEHVLPI------ 335
Query: 372 KSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS--LYEDEGYILTE 429
Q + SS+ + + ++ LG ++S N + H L + Y Y+L E
Sbjct: 336 ------QFISWSSKKANC---LHKKVCLGMRISSFDNMEIKEHLLCAAPYYTKAKYLLVE 386
Query: 430 HGGKFCVVLKESALPQDMLKSLFQASYL 457
G V++ + + D+LKS F A L
Sbjct: 387 RKGIIDVIVHKDSNAADVLKSFFHALVL 414
>gi|320170693|gb|EFW47592.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 710
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 90/237 (37%), Positives = 144/237 (60%), Gaps = 1/237 (0%)
Query: 111 KLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK 170
+L ++ R ++ LP+G+P SVT +Y+ YS W++VQ V ++GVL+TQALLYA+GLG
Sbjct: 148 RLREDTRHRLIEAFLPKGYPHSVTPNYMGYSRWQAVQSVTGTMTGVLSTQALLYAVGLGA 207
Query: 171 GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPA 229
GAIP A A+NW++KDG+G L ++ S F FD +PK R +++ A+ LE+LTP
Sbjct: 208 GAIPLAGALNWIVKDGLGQLGGVVYSTFISSKFDSDPKRHRFWSNAALQASTLLEILTPL 267
Query: 230 FPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
P F+F+ + + G++ + L +STR+ + R N +V K +Q + + IG
Sbjct: 268 APGMFLFLASVSNIGKNISWLAASSTRAQMHNSLTLRDNLGDVTGKAGSQAIATSLIGTG 327
Query: 290 LGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
LGIA+A +G+ L +F ++ ++M N +S + +RTLN RA +F +LL
Sbjct: 328 LGIAIAPFVGTDPIAVLLAFLPLSIVNMVSNYRSNTIVHMRTLNVQRAERLFMHFLL 384
>gi|356522111|ref|XP_003529693.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 488
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 184/366 (50%), Gaps = 35/366 (9%)
Query: 100 NASLSSLLSVNKLWDECRELF---VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGV 156
N+ L L+ N D E+ +++PEGFPDSVT Y+ Y WR+++ GV
Sbjct: 75 NSFLPVSLNSNHPRDPLHEVLSFVTSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGV 134
Query: 157 LATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADL 215
TQ LL ++G+ + A P A AINW+LKDG G + K++ + G+ FD + K R DL
Sbjct: 135 FTTQTLLSSVGVCRNRAAPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDL 194
Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
L G+E+ T A PH F+ + AA ++ AA+ STR+ Y FA N +V AK
Sbjct: 195 LMELGAGVELATAAVPHLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 254
Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
GE G ++ +G L I +A P + +FS+++ ++ + + +S+ L TLN
Sbjct: 255 GECVGNIADLLGTGLSILIAKR----NPSLVTTFSLLSCGYILSSYREVKSVVLHTLNCG 310
Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH 395
R S+ +L++GQ P ++E N E +F S KD V
Sbjct: 311 RFSVAVEHFLMTGQVPTLQEGNMNENIF-------------------SFPWKDRPV---- 347
Query: 396 RLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQ 453
LGS++ + + A+ L++ E YI+T + K V VLK+ A D+LK+ F
Sbjct: 348 --VLGSRIKEAFQDPSAYVAIEPLFDRERYIVTYNPSKHKVYAVLKDQAKSDDILKAAFH 405
Query: 454 ASYLYW 459
A L++
Sbjct: 406 AHVLFF 411
>gi|452819618|gb|EME26673.1| glutamate N-acetyltransferase isoform 2 [Galdieria sulphuraria]
gi|452819619|gb|EME26674.1| glutamate N-acetyltransferase isoform 1 [Galdieria sulphuraria]
Length = 540
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 114/355 (32%), Positives = 175/355 (49%), Gaps = 29/355 (8%)
Query: 107 LSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI 166
L+ LW++ F LPEGFP+SV YL YS+WR +Q + S IS VL+TQALL A
Sbjct: 127 LNRQNLWEQITSSLSTFFLPEGFPESVGDSYLQYSIWRGIQNIISNISQVLSTQALLLAA 186
Query: 167 GLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEM 225
G+G+ + AAA W KDGIG + ++ + G +D +PK WRL +DLL + LE+
Sbjct: 187 GVGRTHSETLAAATAWAWKDGIGQVGRLFTAVLGNQYDADPKRWRLVSDLLYDIGLSLEI 246
Query: 226 LTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKA 285
L+PAF +F+F+ A +S A I S R F R N E+ AK +AQ +V+
Sbjct: 247 LSPAFSKYFLFMAALGNISKSIAFAIGISCRYSVLKTFVRRENLGEISAKNDAQNVVTGM 306
Query: 286 IGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
+G ++G+ ++ + ++ L F T I+ + N KS Q EL T+N R +++ YL
Sbjct: 307 LGTLIGVWMSKCLPAAPKVRLGVFLFFTAIYSFFNYKSMQVAELTTINRQRGNILAHSYL 366
Query: 346 LSGQAPPVKEVNDEE---PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK 402
+ P V N E PLFP + H ++LG
Sbjct: 367 MFQNVPSVSMSNSYERFVPLFPT------------------------VFQYPH-VRLGVG 401
Query: 403 LSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYL 457
L+++ + E + E YIL+ G+ V L + D+L+ L +++
Sbjct: 402 LTELSISIEKIANYTRQFSKEKYILSLSDGEIRVALHRNHESVDLLQLLLHIAFI 456
>gi|42562046|ref|NP_172832.3| uncharacterized protein [Arabidopsis thaliana]
gi|28393739|gb|AAO42280.1| unknown protein [Arabidopsis thaliana]
gi|28973405|gb|AAO64027.1| unknown protein [Arabidopsis thaliana]
gi|332190945|gb|AEE29066.1| uncharacterized protein [Arabidopsis thaliana]
Length = 440
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 134/451 (29%), Positives = 219/451 (48%), Gaps = 47/451 (10%)
Query: 75 EVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVT 134
E GS TKL K A + AS+S + S N +W + FV PEGFP SVT
Sbjct: 16 EWNGSSSTKLF----KTATITASSSLSIQRSANRFNHVWRRVLQAFV----PEGFPGSVT 67
Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKI 193
DY+ + LW ++QG+++ +L+TQALL AIG+G K A A W L+D G L I
Sbjct: 68 PDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGI 127
Query: 194 MLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
+ + + G + D N K WRL ADL+ + +++L+P FP F+ + RS +
Sbjct: 128 LFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVAS 187
Query: 253 ASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSV 311
+TR+ FA + N A++ AK +Q ++ +G+ LG+ LA S P A+ SF
Sbjct: 188 GATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLA-RFTSGNPMAIWLSFLS 246
Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKI 371
+T HMY N ++ + + L +LN R+S++ + ++ +GQ ++V+ E + P + +
Sbjct: 247 LTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEGVLPLWA-TSL 305
Query: 372 KSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF-----SLYEDEGYI 426
+S N L R+QLG ++S + + D L S Y++ Y+
Sbjct: 306 RSTNSKPL--------------HKRVQLGVRVSSL--PRLDMLQLLNGVGASSYKNAKYL 349
Query: 427 LTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLD 486
L G V+L + + P D+LKS A L A ++ STS
Sbjct: 350 LAHIKGNVSVILHKDSKPADVLKSYIHAIVL------ANLMEKSTSFYSEGEA------- 396
Query: 487 YVQREFNHVKSDSASVGWVTDGLIARPLPNR 517
++ + ++ + S GW T+ L++ + R
Sbjct: 397 WIDKHYDELLHKLRSGGWKTERLLSPSITWR 427
>gi|160333925|ref|NP_001103923.1| uncharacterized protein LOC555936 [Danio rerio]
gi|126631351|gb|AAI34047.1| Zgc:162613 protein [Danio rerio]
Length = 432
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 108/338 (31%), Positives = 166/338 (49%), Gaps = 28/338 (8%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
+F LP+G+P+SV+ DYL Y LW +VQ +S +SG LATQA L +G+G + A AA
Sbjct: 47 VFKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQEATVAAA 106
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
I W+L+DG G L +I+ + G D K WRLFAD+L + A +E+ P FP F
Sbjct: 107 TITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHFPPFFTI 166
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I AG +S + +TR+ A R N A++ AK +Q + G+++ +AL
Sbjct: 167 IVCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLVSLALIP 226
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+ + F + T +H++ N ++ +S+ + TLN R S+V +YLL GQ E
Sbjct: 227 LVTDNPLLTFILFYLFTVLHLFANYRAVRSVVMETLNEARLSIVLHQYLLHGQVLSPAEA 286
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
N EP+F +F V I +LG +L D+++ D
Sbjct: 287 NQREPVFLSFR---------------------RTVPI----KLGVRLGDLIHEPGDLQLA 321
Query: 417 FSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
+ Y++ G CV L A D +K++ QA
Sbjct: 322 LK-NNSKPYLIGIKNGAVCVCLGSDASVDDEIKAVCQA 358
>gi|301622642|ref|XP_002940632.1| PREDICTED: UPF0420 protein C16orf58 [Xenopus (Silurana) tropicalis]
Length = 473
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 113/345 (32%), Positives = 173/345 (50%), Gaps = 28/345 (8%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTA 176
R+ FV +LP G+PDSV+ DYL Y LW ++Q +S ++G LATQ+LL G+G A
Sbjct: 66 RDFFVSLLLPHGYPDSVSKDYLAYQLWDTIQAFSSSVTGSLATQSLLRGSGVGDSTATVA 125
Query: 177 AAIN-WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
AA W+L+DG G + +I+ + G D + K WRLFADLL + A +E++ P FP F
Sbjct: 126 AATITWILRDGTGMVGRILFAWMKGSRLDCDAKRWRLFADLLNDIAIFMEIMAPVFPSCF 185
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
AAG + + +TR+ A R N A+V AK +Q + G+++ + L
Sbjct: 186 TLTVCAAGVFKCIVGVAGGATRAAITVHQAKRDNMADVCAKDGSQETLVNLAGLLVSLFL 245
Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
+ SM F ++T +H+Y N ++ +S+ + TLN R S+V YL G+ +
Sbjct: 246 VPLVSDSMMATYLLFFLLTILHLYANYRAVRSVIMETLNQSRLSIVLHHYLKEGKILDPE 305
Query: 355 EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAK-DAAVEIEHRLQLGSKLSDVVNNKEDA 413
N EPL P L S+A V + + L S + ++ K+D
Sbjct: 306 AANLREPLLPG----------------LGSQAPISLGVPLSY---LASSVPELEQLKKDN 346
Query: 414 HALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLY 458
H+L+ L G+ E G VVL E A DM+++ A L+
Sbjct: 347 HSLYLL----GW--REDTGSLAVVLHERASSVDMIRACVHAEMLH 385
>gi|225423472|ref|XP_002274189.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
gi|297738087|emb|CBI27288.3| unnamed protein product [Vitis vinifera]
Length = 437
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/340 (32%), Positives = 176/340 (51%), Gaps = 21/340 (6%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
+Q +PEGFP SVT DY + LW S+QG+++ I +L+TQALL AIG+G K A A
Sbjct: 51 ILQAFVPEGFPSSVTPDYAPFQLWDSLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGAT 110
Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
W L+D G L I+ + + G + D N K WRL ADL+ + +++++P FP FVFI
Sbjct: 111 FQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVFI 170
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN- 296
RS + +TR+ FA + N A++ AK +Q V+ +G+ G+ LA
Sbjct: 171 VCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMVGMAFGMLLARI 230
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+G S+ SF +T HMY N ++ + + L TLN R+S++ ++ +GQ +EV
Sbjct: 231 TMGHSVAIWF-SFLSLTMFHMYANYRAVRCLILTTLNSERSSILIEHFMETGQVLSPQEV 289
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLS--DVVNNKEDAH 414
+ E + P L SS + A + R+ LG ++S D + + H
Sbjct: 290 SKMEHVLP---------------LWASSWSLKNAKSLHTRIHLGVRVSSLDKLEVTDLLH 334
Query: 415 ALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
+ S Y+ Y+L E G V++ + + D+L+S A
Sbjct: 335 SAESHYKKAKYLLLERKGIISVIMHKDSTASDVLQSFIHA 374
>gi|410903033|ref|XP_003964998.1| PREDICTED: UPF0420 protein C16orf58 homolog [Takifugu rubripes]
Length = 442
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 107/345 (31%), Positives = 168/345 (48%), Gaps = 31/345 (8%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
F LP+G+P+SV+ DYL Y W +VQ S +SG LAT+A L +G+G + A AA
Sbjct: 48 FKSVFLPQGYPESVSGDYLRYQFWDTVQAFFSSLSGTLATRASLRGVGVGNQEATVAAAT 107
Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
+ W+LKDG G +I+ + G D K WRLFAD+L + A +E+L P FP F +
Sbjct: 108 MTWLLKDGTGMCGRILFAWRKGTKLDSEAKKWRLFADILNDVAMFMEILAPYFPAFFTLV 167
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+G +S + +TR+ A R N A++ AK +Q + G+++ + L
Sbjct: 168 MCVSGIFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLVLIRF 227
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
+ + L F + +H++ N K+ +S+ + TLN R S+V +YL +GQ E N
Sbjct: 228 VSDNAALTLGLFFLFAVLHLFANYKAVRSVVMETLNEARLSVVLQQYLENGQILSPLEAN 287
Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
+ EP+F F V I +LG +L +V+ + E+
Sbjct: 288 EREPVFMDFK---------------------KVVPI----RLGVRLQEVIQSSEELKFAL 322
Query: 418 SLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLER 462
L ++ ++L G CV L A QD ++ + QA WL R
Sbjct: 323 KL-SNKPFLLGVKNGGVCVCLGPEASVQDEIRVMCQA---LWLRR 363
>gi|356563749|ref|XP_003550122.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 497
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 129/426 (30%), Positives = 196/426 (46%), Gaps = 54/426 (12%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
E +++PEGFPDSV Y+ Y WR+++ GV TQ LL ++G+ + A
Sbjct: 101 EVLAFVTSYVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRAA 160
Query: 174 PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
P A AINW+LKDG G + K++ + G+ FD + K R DLL G+E+ T A PH
Sbjct: 161 PGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 220
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
F+ + AA ++ AA+ STR+ Y FA N +V AKGE G ++ +G L I
Sbjct: 221 FLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIL 280
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
+A P + +FS+++ ++ + + +S+ L TLN R S+ +L +GQ P +
Sbjct: 281 IAKR----NPSLVTTFSLLSCGYILSSYREVKSVVLHTLNCGRFSVAVESFLRTGQVPTL 336
Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
+E N E +F S KD V LGS++ + +
Sbjct: 337 QEGNMNENIF-------------------SFPWKDRPV------VLGSRIKEAFQDPSAY 371
Query: 414 HALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYLY------WLERNAG 465
A+ L++ E YI+T + K V VLK+ A D+LK+ F A L W E NA
Sbjct: 372 FAIEPLFDRERYIVTYNPSKHKVYAVLKDQAKSDDILKAAFHAHVLLFNLIKSWNESNAS 431
Query: 466 IVA--------TSTSAD---CAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPL 514
+ T T AD C G + V + K+++ GW P
Sbjct: 432 SLKQREDLSNMTHTVADIEACMAG-----TCKTVADSYGFFKNNAKEQGWTMSESHLNPG 486
Query: 515 PNRIRP 520
R+ P
Sbjct: 487 RARLHP 492
>gi|432867528|ref|XP_004071227.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oryzias latipes]
Length = 423
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 161/337 (47%), Gaps = 28/337 (8%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
F LP+G+P+SV+ DYL Y W ++Q AS +S L TQA L +G+G + A AA
Sbjct: 45 FKSVFLPQGYPESVSEDYLQYQFWDTLQAFASTLSWTLVTQASLKGVGVGNQDATVAAAT 104
Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
+ W+LKDG G +I+ + G D K WRLFAD+L + A +E+L P FP F I
Sbjct: 105 VTWILKDGTGMFGRILFAWQKGSRLDSEAKKWRLFADVLNDIAMFIEILAPFFPSFFTLI 164
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
AG ++ + +TR+ A R N A++ AK +Q + G+++ + L
Sbjct: 165 VCTAGVFKAIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLVVSLILIPL 224
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
I + +L F + T +H++ N K+ +S+ + T N R S+V +YL + E N
Sbjct: 225 ITDNSSLSLGLFFLFTILHLFANYKAVRSVVMETFNQARLSIVLQQYLKDKRILSPFEAN 284
Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
EP+F F +I ++LG KL DVV + ED H L
Sbjct: 285 RREPVFLEF-------------------------KITTPIKLGVKLQDVVQSPEDFH-LA 318
Query: 418 SLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
Y+L G CV L A D +++ QA
Sbjct: 319 LRNNSMAYLLGVRNGCICVCLGSGASVHDEIRAACQA 355
>gi|449431896|ref|XP_004133736.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
gi|449478104|ref|XP_004155224.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 507
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 117/406 (28%), Positives = 194/406 (47%), Gaps = 42/406 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
DE +++PEGFPDSVT Y+ Y WR+++ GV TQ LL ++G+ + A
Sbjct: 114 DEMMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVARNKA 173
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P A AINW+LKDG G + K++ + G+ FD + K R DLL G+E+ T A PH
Sbjct: 174 TPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAAPH 233
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + AA ++ +A+ STR+ Y FA N +V AKGE G ++ +LG
Sbjct: 234 LFLPLACAANVAKNVSAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD----LLGT 289
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ I P + +F +++ +++ + + +S+ L TLN R ++ +L +G+ P
Sbjct: 290 GLSIMISKRNPSLVTTFGLLSCGYIFSSYQEVRSVVLHTLNRARFNVAVETFLKTGRVPS 349
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
+++ N E + +F + K E+ + LG + D +
Sbjct: 350 LQKGNMNERIL-SFPWLK-----------------------ENPIVLGPRFKDAFQDAGS 385
Query: 413 AHALFSLYEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAG----- 465
A+ L++ E Y++T + GK +LK+ A D++K+ F A L R++
Sbjct: 386 FLAIEPLFDREKYLVTYNQTKGKVYALLKDQAKSDDIIKAAFHAHVLLHFIRSSSGAQNS 445
Query: 466 ----IVATSTSADCAPGGRLEI--SLDYVQREFNHVKSDSASVGWV 505
+ A S+S EI S V + KS ++ GWV
Sbjct: 446 NQKQVDAFSSSVPTTTNLEAEIAASCKMVSNSYEIFKSKASEQGWV 491
>gi|157422883|gb|AAI53381.1| Zgc:162613 protein [Danio rerio]
Length = 432
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 28/338 (8%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
+F LP+G+P+SV+ DYL Y LW +VQ +S +SG LATQA L +G+G + A AA
Sbjct: 47 VFKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQEATVAAA 106
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
I W+L+DG G L +I+ + G D K WRLFAD+L + A +E+ P FP F
Sbjct: 107 TITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHFPPFFTL 166
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I AG +S + +TR+ A R N A++ AK +Q + G+++ +AL
Sbjct: 167 IVCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLVSLALIP 226
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+ + F + T +H++ N K+ +S+ + TLN R S+V +YLL GQ E
Sbjct: 227 LVTDNPLLTFILFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLHQYLLHGQVLSPAEA 286
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
N EP+F +F V I +LG +L D+++ D
Sbjct: 287 NQREPVFLSFR---------------------RTVPI----KLGVRLGDIIHEPGDLQLA 321
Query: 417 FSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
+ Y++ G CV L A D +K++ QA
Sbjct: 322 LK-NNSKPYLIGIKNGAVCVCLGSDASVDDEIKAVCQA 358
>gi|297844278|ref|XP_002890020.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
lyrata]
gi|297335862|gb|EFH66279.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
lyrata]
Length = 440
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 139/451 (30%), Positives = 220/451 (48%), Gaps = 47/451 (10%)
Query: 75 EVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVT 134
E GS TKL K A + ASAS + S N +W + FV PEGFP SVT
Sbjct: 16 EWNGSSSTKLF----KTATITASASLSIQRSANRFNHVWRRVLQAFV----PEGFPGSVT 67
Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKI 193
DY+ + W ++QG+++ I +L+TQALL AIG+G K A A W L+D G L I
Sbjct: 68 PDYVGFQFWDTLQGLSTYIKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGI 127
Query: 194 MLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
+ + + G + D N K WRL ADL+ + +++L+P FP F+ + RS +
Sbjct: 128 LFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVAS 187
Query: 253 ASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSV 311
+TR+ FA + N A++ AK +Q ++ +G+ LG+ LA S P A+ SF
Sbjct: 188 GATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLAR-FTSGNPMAIWFSFLS 246
Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKI 371
+T HMY N ++ + + L +LN R+S++ + ++ +GQ ++V+ E + P + +
Sbjct: 247 LTVFHMYANYRAVRCLVLNSLNFERSSILLTHFMQTGQVLSPEQVSSMEGVLPVWA-TSL 305
Query: 372 KSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF-----SLYEDEGYI 426
+S N S+LL R+ LG ++S + + + L S Y++ Y+
Sbjct: 306 RSTN-SKLL-------------HKRVHLGVRVSSL--PRLEMMQLLNGVGASSYKNAKYL 349
Query: 427 LTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLD 486
L G V+L + + D+LKS A L A ++ STS E +D
Sbjct: 350 LAHRKGNVSVILHKDSSAADVLKSYIHAIVL------ANLMEKSTSFYSEG----EAWID 399
Query: 487 YVQREFNHVKSDSASVGWVTDGLIARPLPNR 517
EF H S GW T+ L++ + R
Sbjct: 400 KHYDEFLH---KLRSGGWKTERLLSPSITWR 427
>gi|340372549|ref|XP_003384806.1| PREDICTED: UPF0420 protein C16orf58-like [Amphimedon queenslandica]
Length = 432
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 99/299 (33%), Positives = 157/299 (52%), Gaps = 8/299 (2%)
Query: 110 NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG 169
+L D +E+F LP+G+P+SV+ DYL Y LW ++Q S I+G LAT A+L +G+G
Sbjct: 35 TRLADTLKEVF----LPQGYPNSVSEDYLQYQLWDTLQAFCSYITGTLATHAMLKGVGVG 90
Query: 170 -KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLT 227
A P AA + W+LKDG+G +I+ + G D N K WRL AD+L + A +++++
Sbjct: 91 DSSASPVAATLTWILKDGLGMTGRILFAWLRGYDLDCNAKKWRLIADILNDIAICMQLVS 150
Query: 228 PAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIG 287
P FP F+ I A +S + +TR+ A R N A+V AK +Q + G
Sbjct: 151 PFFPSCFLLIACLASITQSVVGVAGGATRAALVQHQARRDNMADVSAKDGSQETLVNLCG 210
Query: 288 IMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
+++G+ + I F + F T +H+Y N K+ + + TLN R L+ L+
Sbjct: 211 LLIGLIITPLIAGQTVFVWSLFFSFTLLHLYSNYKAVSVVSMETLNCNRLHLLMRNLFLN 270
Query: 348 GQAPPVKEVNDEEP-LFPAFHFFKIK-SANKSQLLVLSSEAKDAAVEIEHRLQLGSKLS 404
G VN EEP LF FF ++ ++ S +L+ SS+ + ++ + KLS
Sbjct: 271 GTISEPNIVNREEPLLFRQERFFTVEYGSSLSSVLIHSSDYSRSILQFNKGQKFIIKLS 329
>gi|148905902|gb|ABR16113.1| unknown [Picea sitchensis]
Length = 446
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 114/357 (31%), Positives = 190/357 (53%), Gaps = 25/357 (7%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LPEGFP+SVTSDYL + +W S+QG+++ I +L+TQALL AIG+G+ A A W L
Sbjct: 60 LPEGFPNSVTSDYLPFQIWDSLQGLSTYIRCMLSTQALLSAIGVGETTATVVGATFQWFL 119
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
+D G L I+ + + G + D N K WRL ADL+ + +++++P +P FV I
Sbjct: 120 RDFTGMLGGILFTLYQGSNLDSNAKMWRLIADLMNDLGMLMDLVSPLYPASFVIILCLGS 179
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
RS + +TR+ FA + N A++ AK +Q ++ IG++LG+ LA H+ +
Sbjct: 180 IARSFTGVASGATRAALTQHFALQHNAADISAKEGSQETMATMIGMLLGMLLA-HVTAGH 238
Query: 303 PFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
P A+ +SF ++T+ HM+ N ++ + + L +LN R S++ ++ +GQ K+V+ E
Sbjct: 239 PGAMWSSFLLLTFFHMFANYQAVRCLCLNSLNEGRTSVLLKCFITNGQVLNPKDVSSMEY 298
Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSL-- 419
L P N QL +L+ A +LG+K+S + ++ D L L
Sbjct: 299 LLP----------NWMQLRMLTKGRPFNAT-----FKLGAKISSL--SQSDMSKLLKLSA 341
Query: 420 --YEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSAD 474
Y Y+L G V+L + ++ D+L+S L + ++ V + +AD
Sbjct: 342 SCYGKGKYLLLCRNGTIEVILNKDSVSADILQSYIHGLVLSYHMGSSSSVNDANNAD 398
>gi|356502614|ref|XP_003520113.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Glycine max]
Length = 486
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 119/388 (30%), Positives = 192/388 (49%), Gaps = 37/388 (9%)
Query: 75 EVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVT 134
E GS TKL FT + S A + +W + FV PEGFP SVT
Sbjct: 72 EWNGSSPTKLSKTFT------ITRSGARFT------HVWRRLLQAFV----PEGFPSSVT 115
Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKI 193
+DY+ + +W + G+++ I +L+TQALL AIG+ K A A W L+D G L I
Sbjct: 116 ADYVPFQIWDLLXGLSTYIRTMLSTQALLIAIGVEEKSATVIGATFQWFLRDLTGMLGGI 175
Query: 194 MLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
+ + + G + D K WRL ADL+ + +++++P FP FVF+ RS +
Sbjct: 176 LFTFYQGSNLDSYAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGSISRSFTGVAS 235
Query: 253 ASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSV 311
+TR+ FA + N A++ AK +Q V+ IG+ LG+ +A + P A+ SF
Sbjct: 236 GATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVAR-LTIGHPLAIWFSFLS 294
Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKI 371
+T HMY N ++ Q + L +LNP R+S++F ++ +GQ ++V+ +E + P
Sbjct: 295 LTVFHMYANYRAVQXLTLNSLNPERSSILFQHFMETGQVLSPEQVSSQEHILPI------ 348
Query: 372 KSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS--LYEDEGYILTE 429
Q + SS+ + + ++ LG ++S N + H L + Y Y+L E
Sbjct: 349 ------QFISWSSKKANC---LHKKVNLGMRISSFDNMEIKEHLLSAAPYYTKAKYLLVE 399
Query: 430 HGGKFCVVLKESALPQDMLKSLFQASYL 457
G V++ + + D+LK F A L
Sbjct: 400 RKGIIDVIVHKDSNAADVLKLFFHALVL 427
>gi|255569810|ref|XP_002525869.1| conserved hypothetical protein [Ricinus communis]
gi|223534874|gb|EEF36563.1| conserved hypothetical protein [Ricinus communis]
Length = 520
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/347 (32%), Positives = 177/347 (51%), Gaps = 31/347 (8%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
+E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ K A
Sbjct: 125 EELMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTKTLLSSVGVSKSTA 184
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
IP A AINW+LKDG G + K++ + G+ FD + K R DLL G+E+ T A PH
Sbjct: 185 IPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + AA ++ AA+ STR+ Y FA N +V AKGE G V+ +LG
Sbjct: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVAD----LLGT 300
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ I P + +F++++ +++ + + +S+ L TLN R S+ +L +G+ P
Sbjct: 301 GLSIMISKRNPSLVTTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGRVPS 360
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
++E N E +F +F + K E + +GS+ D +
Sbjct: 361 LQEGNTRENIF-SFPWLK-----------------------ERPIVIGSRFKDAFQDPSA 396
Query: 413 AHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L+E E YI+T + K V +LK+ A D+LK+ F A L
Sbjct: 397 FLAIEPLFEKERYIVTYNPTKDKVYALLKDQAKADDILKAAFHAHVL 443
>gi|242076506|ref|XP_002448189.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
gi|241939372|gb|EES12517.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
Length = 439
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/410 (27%), Positives = 206/410 (50%), Gaps = 35/410 (8%)
Query: 102 SLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQA 161
S+ + + D+ + F+ P G+P SV YL Y+ +R++Q +S + VL+TQ+
Sbjct: 51 SVKVIQDARPIHDKVVDSFLNKFFPSGYPYSVNEGYLTYTRFRALQHFSSAMLHVLSTQS 110
Query: 162 LLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAF 221
LL+A GL + A A++W+LKDG+ + K++ S+ G D PK WR+ AD+L +
Sbjct: 111 LLFAAGL-RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGT 169
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE+++P P F+ + + A + +TR Y+ FA N +++ AKGEA
Sbjct: 170 ALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 229
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ +GI GI LA+ + S+ L + +++ +H+Y ++ ++ + TLNP R +++
Sbjct: 230 LFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSAVHIYGVVQEMRATPVNTLNPQRTAMIV 289
Query: 342 SEYLLSGQ-APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLG 400
++++ SG+ + P + E+ LFP + +E +++G
Sbjct: 290 ADFIKSGKVSSPAELRYREDLLFP-----------------------NRVIEEAGSVKIG 326
Query: 401 SKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
L V+ + L + + DE ++LT+ K +VL++SA +D L+ A++ +
Sbjct: 327 QPLRRVL-SPSLVEQLRATFPDEKFLLTQKSNKTYMVLEQSASGEDALRGWLVAAFASEM 385
Query: 461 ERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
ER+ GIV+ T L+ + + + F S+ S GW TD +
Sbjct: 386 ERS-GIVSRDTV--------LKEAYEKTKMVFPSFVSEVRSRGWYTDQFL 426
>gi|357126248|ref|XP_003564800.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 506
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 105/347 (30%), Positives = 175/347 (50%), Gaps = 32/347 (9%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
+E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ + +
Sbjct: 111 EEIMAFVRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKV 170
Query: 174 -PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P A AINW+LKDG G + K++ + G+ FD + K R +DLL G+E+ T AFP
Sbjct: 171 TPGAIAINWILKDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGIELATAAFPQ 230
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 231 FFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGT 286
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ +I P + SF++++ ++ + + +S+ L TLN R ++ ++ +G P
Sbjct: 287 GLSIYITKRNPSLVTSFALLSCGYLMSSYQEVRSVVLNTLNRARFTVAVDSFIKTGYVPS 346
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
+KE N +E LF H ++ + +GS+ +
Sbjct: 347 LKEGNSQETLF--IHPWR-----------------------HEPVAIGSRFGEAFQEPVS 381
Query: 413 AHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A L+EDE Y++T + K V +LK+ A P D+LK+ F A L
Sbjct: 382 FVATRPLFEDERYMVTYNPTKDKVYALLKDQAKPDDVLKAAFHAHVL 428
>gi|168022981|ref|XP_001764017.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684756|gb|EDQ71156.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 437
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/391 (30%), Positives = 191/391 (48%), Gaps = 29/391 (7%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LPEG+P+SVT DYL + W ++QG+++ I +L+TQALL IG+G+ A A W +
Sbjct: 54 LPEGYPNSVTPDYLPFQTWDTLQGLSTYIRSMLSTQALLGGIGVGETTATVVGATFLWFM 113
Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
+D G L I+ + + G + D + K WRL ADLL + +++++P FP FV I
Sbjct: 114 RDFTGMLGSILFTLHQGSNLDSSAKQWRLTADLLNDVGMLMDLVSPLFPGAFVTILCIGS 173
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
RS + +TR+ FA R+N A+V AK +Q + +G++LG+ LA ++
Sbjct: 174 MARSVTGVASGATRAALTQHFALRKNAADVSAKEGSQETAATMVGMLLGMVLARLTAENV 233
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
SF ++T HMY N ++ +S+ L +LN R S+V S + + P +EV D+E L
Sbjct: 234 VALWVSFLLLTAFHMYANYRAVRSLCLTSLNAERTSIVLSAFKEGRKVPTPREVADQEHL 293
Query: 363 FPAFHFFKIKSANKSQLLVLSSEAKDAAV--EIEHRLQLGS-KLSDVVNNKEDAHALFSL 419
P +L A DA + I +Q+ S KL D + +L
Sbjct: 294 LPRI----------PRLPWQKQRAGDAWLPKSIRFGVQISSLKLDD---GWKSFLSLVER 340
Query: 420 YEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGG 479
Y + Y++ G VVL + A PQD L A L L+ + AD +
Sbjct: 341 YSRDPYLMVPKGAIVHVVLHKLATPQDFLHGYVHALCLADLQLR-------SPADQSE-- 391
Query: 480 RLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
E+++ +++ + GW TD ++
Sbjct: 392 --ELAISWMKNNYTAFLDSLGKSGWATDRIL 420
>gi|348510102|ref|XP_003442585.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oreochromis niloticus]
Length = 433
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 105/339 (30%), Positives = 165/339 (48%), Gaps = 30/339 (8%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
+F LP+G+P+SV+ DYL Y W ++Q +S +SG LATQA L G+G + A AA
Sbjct: 48 VFKSVFLPQGYPESVSDDYLQYQFWDTMQAFSSSLSGTLATQASLRGFGVGNQEATVAAA 107
Query: 178 AINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
+ W+ KDG G L +I+ + G D K WRLFAD+L + A +E+L P FP F F
Sbjct: 108 TVTWLFKDGTGMLGRILFAWRKGSKLDSEAKKWRLFADVLNDIAMFMEILAPYFPACFTF 167
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I AG +S + +TR+ A R N A++ AK +Q + G+++ + L
Sbjct: 168 IVCTAGIFKSIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLILIP 227
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+ + L F + T +H++ N K+ +S+ + T N R S+V +YL + E
Sbjct: 228 LVTDNPVLTLCLFFLFTVLHLFANYKAVRSVVMETFNEARLSIVLQQYLKDKRILSPPEA 287
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
N EP+F F F + ++LG +L ++ + ED
Sbjct: 288 NQREPVF--FEFGQTVP-----------------------MKLGVRLQEIAQSPEDLQ-- 320
Query: 417 FSLYEDE-GYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
+L E+ Y+L G CV L A D ++++ QA
Sbjct: 321 LALKENSMPYLLGIRNGCVCVCLGPEASASDEIRAMCQA 359
>gi|357474239|ref|XP_003607404.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
gi|355508459|gb|AES89601.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
Length = 490
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 111/349 (31%), Positives = 170/349 (48%), Gaps = 34/349 (9%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-- 171
DE +++PEGFP +VT Y+ Y WR+++ GV TQ LL ++G+ +
Sbjct: 95 DEMMSFITSYVVPEGFPHTVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRA 154
Query: 172 -AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
A AINW+LKDG G + K++ + G+ FD + K R DLL G+E+ T A
Sbjct: 155 TPAAAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDLLMELGAGVELATAAM 214
Query: 231 PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
PH F+ + AA ++ A+ STR+ Y FA N +V AKGE G ++ +L
Sbjct: 215 PHLFLPLACAANVLKNVGAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD----LL 270
Query: 291 GIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
G + I S P + +FS+++ ++ + K +S+ L TLN R S+ +L +GQ
Sbjct: 271 GTGFSIMIAKSNPSLITTFSLLSCGYILSSYKEVKSVVLHTLNSARFSVAVDSFLKTGQV 330
Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
P ++E N E +F S KD V LGS++ D +
Sbjct: 331 PTLREGNLNEDIF-------------------SFPWKDRPV------VLGSRIKDAFQSP 365
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L++ E YI+T + K V VLK+ A D+LK+ F A L
Sbjct: 366 SAYVAIEPLFDKERYIVTYNPSKSKVHAVLKDQAKSDDILKAAFHAHVL 414
>gi|168024902|ref|XP_001764974.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683783|gb|EDQ70190.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 116/403 (28%), Positives = 200/403 (49%), Gaps = 37/403 (9%)
Query: 120 FVQ-FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
FV+ +++PEG+PDSV Y Y WR+VQ V T++L++A+G+ +G + +A
Sbjct: 35 FVKAYIVPEGYPDSVAPSYTPYMQWRAVQYFFGGAMSVFTTRSLMHALGVSRGTSASSAV 94
Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
A+NWV+KDG G + K++ + G+ FDV+ K R +L G+E+ T A PH F+ +
Sbjct: 95 AVNWVIKDGAGRIGKMLFARHGKKFDVDLKQLRFKGAMLMQLGAGVELTTMAVPHLFLPL 154
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
AA ++ AA+ +STR+ Y FA R N +V AKGE ++ +G LGI
Sbjct: 155 ACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECISNIADLLGTGLGI----F 210
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
I P +A+F V++ +M + +S+ L TLN R +V +L +GQ P +++ N
Sbjct: 211 ISKRSPSLVATFCVLSCGYMISSFNEIKSVRLATLNRARFGVVVQSFLETGQVPSIEDAN 270
Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
E ++ L + K L+LGS++ +N +
Sbjct: 271 RRE-----------------SVITLPWQEKP--------LELGSRVGQAFSNPREFIRTQ 305
Query: 418 SLYEDEGYILTEHGGK--FCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADC 475
++ E Y++T K VV+KESA D++++ FQA + + R T+++ +
Sbjct: 306 EQFQKENYLVTYRPNKKRAYVVIKESASSDDVVRATFQAHVMLHMLRGLEGTITNSNVED 365
Query: 476 APGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRI 518
A + + D ++ K ++ S GW+ + P +R+
Sbjct: 366 AVAESTKRTYDL----YDSFKEEAKSKGWMMGETLLNPGDSRV 404
>gi|357133637|ref|XP_003568430.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Brachypodium
distachyon]
Length = 521
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 107/349 (30%), Positives = 172/349 (49%), Gaps = 36/349 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
+E +++PEGFPDSVT Y+ Y WR+++ V T+ALL ++G+ + A
Sbjct: 121 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRA 180
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
A AINW+LKDG G + K++ + G+ FD + K R DLL G+E+ T AFPH
Sbjct: 181 TSGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPH 240
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 241 LFLPLACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGT 296
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
++ I P +ASF+ ++ ++ + + +S+ L TLN R ++ ++ +G P
Sbjct: 297 GMSILISKRNPSLVASFAFLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPS 356
Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+KE N EE +F P H + +GS+ +
Sbjct: 357 LKEGNLEETIFNPPWRH---------------------------QPVAIGSRFGEAFQEP 389
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
+ L+EDE YI+T + K V +LK+ A P D+LK+ F A L
Sbjct: 390 ASFVSTSPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 438
>gi|18402585|ref|NP_565718.1| uncharacterized protein [Arabidopsis thaliana]
gi|20197897|gb|AAD20664.2| expressed protein [Arabidopsis thaliana]
gi|21553562|gb|AAM62655.1| unknown [Arabidopsis thaliana]
gi|109134157|gb|ABG25076.1| At2g31190 [Arabidopsis thaliana]
gi|330253411|gb|AEC08505.1| uncharacterized protein [Arabidopsis thaliana]
Length = 433
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 114/396 (28%), Positives = 193/396 (48%), Gaps = 39/396 (9%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
E F+ P G+P SV YL Y+ +R++Q +S VL+TQ+LL+A GL + A
Sbjct: 65 ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 123
Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
++W+LKDG+ ++ K++ SN G D PK WR+ AD+L + GLE+++P PH F+ +
Sbjct: 124 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEM 183
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ A + +TR Y+ FA N +++ AKGEA + GI GI LA+
Sbjct: 184 AGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLAST 243
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
I SSM L S+++ +H+Y ++ + + + TLNP R +L+ + +L +G+ P ++
Sbjct: 244 ICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLR 303
Query: 358 DEEPL-FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
+E L FP + ++ +++G L V E L
Sbjct: 304 FQEDLMFP-----------------------ERPIQDAGNVKVGRALHKAVKPSE-VQRL 339
Query: 417 FSLYEDEGYILTEHGGKFC-VVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADC 475
++ +E ++L+ HG + +VL+ A +D L+ A+Y V + T
Sbjct: 340 KQVFVEEKFLLS-HGKSWTDMVLEHDATGEDALRGWLVAAY----------VKSMTKIYN 388
Query: 476 APGG-RLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
P L+ + D + FN S + GW TD +
Sbjct: 389 DPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTDRFL 424
>gi|334184613|ref|NP_001189649.1| uncharacterized protein [Arabidopsis thaliana]
gi|330253412|gb|AEC08506.1| uncharacterized protein [Arabidopsis thaliana]
Length = 432
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 114/396 (28%), Positives = 193/396 (48%), Gaps = 39/396 (9%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
E F+ P G+P SV YL Y+ +R++Q +S VL+TQ+LL+A GL + A
Sbjct: 64 ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 122
Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
++W+LKDG+ ++ K++ SN G D PK WR+ AD+L + GLE+++P PH F+ +
Sbjct: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEM 182
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ A + +TR Y+ FA N +++ AKGEA + GI GI LA+
Sbjct: 183 AGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLAST 242
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
I SSM L S+++ +H+Y ++ + + + TLNP R +L+ + +L +G+ P ++
Sbjct: 243 ICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLR 302
Query: 358 DEEPL-FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
+E L FP + ++ +++G L V E L
Sbjct: 303 FQEDLMFP-----------------------ERPIQDAGNVKVGRALHKAVKPSE-VQRL 338
Query: 417 FSLYEDEGYILTEHGGKFC-VVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADC 475
++ +E ++L+ HG + +VL+ A +D L+ A+Y V + T
Sbjct: 339 KQVFVEEKFLLS-HGKSWTDMVLEHDATGEDALRGWLVAAY----------VKSMTKIYN 387
Query: 476 APGG-RLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
P L+ + D + FN S + GW TD +
Sbjct: 388 DPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTDRFL 423
>gi|297604521|ref|NP_001055571.2| Os05g0419200 [Oryza sativa Japonica Group]
gi|53982666|gb|AAV25645.1| unknown protein [Oryza sativa Japonica Group]
gi|222631625|gb|EEE63757.1| hypothetical protein OsJ_18576 [Oryza sativa Japonica Group]
gi|255676375|dbj|BAF17485.2| Os05g0419200 [Oryza sativa Japonica Group]
Length = 415
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 107/349 (30%), Positives = 170/349 (48%), Gaps = 36/349 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
+E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ + A
Sbjct: 18 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRA 77
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
A AINW+LKDG G + K++ + G+ FD + K R DLL G+E+ T AFP
Sbjct: 78 TSGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 137
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 138 LFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGT 193
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ I P + SF+ ++ ++ + +S+ L TLN R ++ ++ SG P
Sbjct: 194 GLSILISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPS 253
Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+KE N +E +F P H + +GS+ +
Sbjct: 254 LKEGNSQETIFNPPWRH---------------------------QPVAIGSRFGEAFQEP 286
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L+EDE YI+T + K V +LK+ A P D+LK+ F A L
Sbjct: 287 ASFVAIRPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 335
>gi|218196816|gb|EEC79243.1| hypothetical protein OsI_19997 [Oryza sativa Indica Group]
Length = 401
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 107/349 (30%), Positives = 169/349 (48%), Gaps = 36/349 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ + A
Sbjct: 4 QEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRA 63
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
A AINW+LKDG G + K++ + G+ FD + K R DLL G+E+ T AFP
Sbjct: 64 TSGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 123
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 124 LFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGT 179
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ I P + SF+ ++ ++ + +S+ L TLN R ++ ++ SG P
Sbjct: 180 GLSILISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPS 239
Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+KE N +E +F P H + +GS+ +
Sbjct: 240 LKEGNSQETIFNPPWRH---------------------------QPVAIGSRFGEAFQEP 272
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L+EDE YI+T + K V +LK+ A P D+LK+ F A L
Sbjct: 273 ASFVAIRPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 321
>gi|302775588|ref|XP_002971211.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
gi|300161193|gb|EFJ27809.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
Length = 399
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 98/299 (32%), Positives = 162/299 (54%), Gaps = 17/299 (5%)
Query: 111 KLWDECREL-------FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
+L D+ R L F+ P G+P SV YL YS +R++Q +S I V +TQ+L+
Sbjct: 27 RLLDDNRPLKHKLVDGFLDQFFPVGYPQSVGEGYLTYSQFRALQHFSSAILSVFSTQSLV 86
Query: 164 YAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGL 223
+A GL + A I+WVLKDG+ ++ K++ S G D PK WR+FAD+L +A L
Sbjct: 87 FAAGL-RPTPAQATVISWVLKDGMQHIGKLVGSRMGARMDSEPKRWRIFADVLYDAGTAL 145
Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
EM++PA P +F+ + A + A + +TR Y FA N +++ AKGEA ++
Sbjct: 146 EMISPACPQYFLGVAGLANLAKGVAMVTARATRLPLYTAFAKEGNLSDLYAKGEAMSTLA 205
Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
G+ +GI LA+ +S+ L +++ IH+Y ++ +++ L TLN R +++ ++
Sbjct: 206 NVAGLGVGIRLASSACTSLQGKLLVAPLLSVIHLYSVMEEMRAVPLDTLNAQRTAMLVAD 265
Query: 344 YLLSGQAPPVKEVNDEE----PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQ 398
YL G+ P E+ E P+ P H ++ AN L + S + + E++HR Q
Sbjct: 266 YLELGRVPSPAEIRYRERLILPVKPNVHAGSVRIANDI-LQIPSGKIR----ELKHRFQ 319
>gi|168019221|ref|XP_001762143.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686547|gb|EDQ72935.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 390
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 117/403 (29%), Positives = 204/403 (50%), Gaps = 37/403 (9%)
Query: 120 FVQ-FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGA-IPTAA 177
FV+ +++PE +PDSV Y Y WR+VQ + V T++L++A+G+ +GA +A
Sbjct: 19 FVKSYIVPENYPDSVAPSYTPYMQWRAVQYLFGGAMSVFTTRSLMHALGVSRGASASSAV 78
Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
A+NWV+KDG G + K++ + G+ FDV+ K R +L G+E+ T A PH F+ +
Sbjct: 79 AVNWVIKDGAGRIGKMIFARHGKKFDVDLKQLRFKGAMLMQLGAGVELATMAVPHLFLPL 138
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
AA ++ AA+ +STR+ Y FA R N +V AKGE ++ +G LGI
Sbjct: 139 ACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECISNIADLMGTGLGI----F 194
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
I P +A+F V++ +M + +S+ L TLN R +V +L +G+ P V++ N
Sbjct: 195 ISKKSPSLVATFCVLSCGYMISSFNEIKSVRLATLNRARFGVVVQSFLETGKVPSVEDAN 254
Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
E ++ L + K L+LGS+++ +N ++
Sbjct: 255 RRE-----------------SVITLPWQEKP--------LELGSRVAQAYSNPKEFLRAR 289
Query: 418 SLYEDEGYILTE--HGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADC 475
++ E Y++T + VV+KESA D++++ FQA + + R G+ +T +++
Sbjct: 290 EQFQKEKYLVTYSLKKKRAYVVIKESASSDDVVRATFQAHVMLHILR--GLRSTLITSNV 347
Query: 476 APGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRI 518
G +E S ++ K ++ S GW+ + P +R+
Sbjct: 348 --GAIVEESTKLTHDLYDSFKEEAKSQGWMMGETLLSPGDSRL 388
>gi|359486217|ref|XP_003633414.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
gi|297739449|emb|CBI29631.3| unnamed protein product [Vitis vinifera]
Length = 504
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 109/347 (31%), Positives = 177/347 (51%), Gaps = 31/347 (8%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
DE +++PEGFPDSVT Y+ Y WR+++ GV TQ LL ++G+ + +
Sbjct: 109 DELMSFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNRS 168
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P A AINW+LKDG G + K++ + G+ FD + K RL DLL G+E+ T A PH
Sbjct: 169 APGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVPH 228
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + AA ++ AA+ STR+ Y FA N +V AKGE G ++ +LG
Sbjct: 229 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD----LLGT 284
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ I P + +F++++ ++ + + +S+ L TLN R S+ +L +G+ P
Sbjct: 285 GLSIMISKRNPSLVTTFALLSCGYVLSSYQEVKSVVLHTLNRARFSVAIDSFLKTGRVPS 344
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
++E N +E +F +F + + K + LGS+ D +
Sbjct: 345 LQEGNMKENIF-SFPWLEDKP-----------------------IVLGSRFKDAFQDAGS 380
Query: 413 AHALFSLYEDEGYILTEH--GGKFCVVLKESALPQDMLKSLFQASYL 457
++ L+E E Y++T + GK +LK+ A D+LK+ F A L
Sbjct: 381 YLSIEPLFEKERYLVTYNPSKGKVYALLKDQAKSDDILKAAFHAHML 427
>gi|449502048|ref|XP_004161529.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 428
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/341 (30%), Positives = 179/341 (52%), Gaps = 32/341 (9%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA-- 177
F+ P G+P SV+ YL Y+ +R++Q V S VL+TQ+LL+A GL PTAA
Sbjct: 62 FLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLR----PTAAQA 117
Query: 178 -AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
++WVLKDG+ ++ K++ SN G D PK WR+ AD+L + GLE+++P PH F+
Sbjct: 118 TVVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFLE 177
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ + A + +TR Y+ FA N +++ AKGEA + +GI G+ LA+
Sbjct: 178 MAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLAS 237
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
I SS+ L + +++ +H+YC ++ ++ + TLNP R +++ ++++ +G+ P ++
Sbjct: 238 TICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPADI 297
Query: 357 NDEEPL-FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
+E L FP L EA + V++ L K S +V K+
Sbjct: 298 RYQEDLVFPG---------------RLIEEAGN--VKVGRALHEVIKPSKLVEMKQ---- 336
Query: 416 LFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
++ E ++L + +VL+ A +D L+ A+Y
Sbjct: 337 ---IFPGEKFVLNQSKKWVDMVLEHDASGEDALRGWLVAAY 374
>gi|449462449|ref|XP_004148953.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 428
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/341 (30%), Positives = 179/341 (52%), Gaps = 32/341 (9%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA-- 177
F+ P G+P SV+ YL Y+ +R++Q V S VL+TQ+LL+A GL PTAA
Sbjct: 62 FLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLR----PTAAQA 117
Query: 178 -AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
++WVLKDG+ ++ K++ SN G D PK WR+ AD+L + GLE+++P PH F+
Sbjct: 118 TVVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFLE 177
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ + A + +TR Y+ FA N +++ AKGEA + +GI G+ LA+
Sbjct: 178 MAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLAS 237
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
I SS+ L + +++ +H+YC ++ ++ + TLNP R +++ ++++ +G+ P ++
Sbjct: 238 TICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPADI 297
Query: 357 NDEEPL-FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
+E L FP L EA + V++ L K S +V K+
Sbjct: 298 RYQEDLVFPGR---------------LIEEAGN--VKVGRALHEVIKPSKLVEMKQ---- 336
Query: 416 LFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
++ E ++L + +VL+ A +D L+ A+Y
Sbjct: 337 ---IFPGEKFVLNQSKKWVDMVLEHDASGEDALRGWLVAAY 374
>gi|255567274|ref|XP_002524618.1| conserved hypothetical protein [Ricinus communis]
gi|223536171|gb|EEF37826.1| conserved hypothetical protein [Ricinus communis]
Length = 436
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/408 (26%), Positives = 204/408 (50%), Gaps = 42/408 (10%)
Query: 111 KLWDECRELFVQFM-------LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
K+ D+ R +F + + P G+P SV YL Y+ +R++Q ++S VL+TQ+LL
Sbjct: 54 KVIDDSRSMFHRVVESFQNKFFPSGYPYSVNEGYLRYTQFRALQHLSSAALSVLSTQSLL 113
Query: 164 YAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGL 223
+A GL + A A++WVLKDG+ ++ K++ SN G D PK WR+ AD+L + GL
Sbjct: 114 FAAGL-RPTPAQATAVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGL 172
Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
E+L+P PH F+ + + A + +TR Y+ FA N +++ AKGEA +
Sbjct: 173 EVLSPLCPHLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 232
Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
+G+ +GI LA+ + SS+ L +++ +H++ ++ ++ + TLNP R +++ ++
Sbjct: 233 NVVGLGVGIQLASTVCSSIQGKLVVGPLLSILHVFSVVEEMRAAPVNTLNPQRTAMIVAD 292
Query: 344 YLLSGQ-APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK 402
Y+ +G+ + P E+ +FP +E +++G
Sbjct: 293 YVKAGKISSPADLRYREDLIFPG-----------------------RLIEDAGNVKVGRA 329
Query: 403 LSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLER 462
L + H L ++ +E ++L +VL+++A +D L+ A+Y +E+
Sbjct: 330 LHKAFTPSK-VHELKDIFPEEKFLLNRGNTWTDMVLEQNASGEDALRGWLVAAYATSMEK 388
Query: 463 NAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
++ + C P L+ + + + F+ S+ + GW TD +
Sbjct: 389 SSHM--------CTP-SVLQDAYEKMNSTFDSFLSELQAKGWHTDRFL 427
>gi|226492898|ref|NP_001145431.1| uncharacterized protein LOC100278802 [Zea mays]
gi|195656119|gb|ACG47527.1| hypothetical protein [Zea mays]
gi|224035549|gb|ACN36850.1| unknown [Zea mays]
gi|413945393|gb|AFW78042.1| hypothetical protein ZEAMMB73_132907 [Zea mays]
Length = 511
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/349 (30%), Positives = 171/349 (48%), Gaps = 36/349 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
+E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ + A
Sbjct: 115 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRA 174
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
A AINW+LKDG G + K++ + G+ FD + K R DLL G+E+ T AFP
Sbjct: 175 ASGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 234
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 235 LFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGT 290
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
++ I S P + SF+ ++ ++ + +S+ L TLN R ++ ++ +G P
Sbjct: 291 GMSILISKSNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNRARFTVAVDSFIRTGYVPS 350
Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+K+ N +E +F P H + +GS+ D
Sbjct: 351 LKDGNSQETIFNPPWRH---------------------------EPVAIGSRFGDAFQEP 383
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L+EDE YI+T + K V +LK+ A P D+LK+ F A L
Sbjct: 384 ASFLAIKPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 432
>gi|222619643|gb|EEE55775.1| hypothetical protein OsJ_04341 [Oryza sativa Japonica Group]
Length = 508
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 172/349 (49%), Gaps = 36/349 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
+E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ + +
Sbjct: 116 EEVMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKV 175
Query: 174 -PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P A AINW+LKDG G + K++ + G+ FD + K R +DLL G+E+ T AFP
Sbjct: 176 TPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQ 235
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 236 FFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGT 291
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ I P + SF+ ++ +++ + +S+ L TLN R ++ ++ +G P
Sbjct: 292 GLSIFISKRNPSLVTSFAFLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPS 351
Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+KE N +E +F P H + +GS+ +
Sbjct: 352 LKEGNSQETIFNPPWRH---------------------------EPVAIGSRFGEAFQEP 384
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L+EDE Y++T + K V +LK+ A D++K+ F A L
Sbjct: 385 ASFVAIRPLFEDERYMVTYNPAKDKVYALLKDQAKSDDIIKAAFHAHVL 433
>gi|56784316|dbj|BAD82242.1| unknown protein [Oryza sativa Japonica Group]
gi|56785239|dbj|BAD82127.1| unknown protein [Oryza sativa Japonica Group]
Length = 508
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 172/349 (49%), Gaps = 36/349 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
+E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ + +
Sbjct: 116 EEVMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKV 175
Query: 174 -PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P A AINW+LKDG G + K++ + G+ FD + K R +DLL G+E+ T AFP
Sbjct: 176 TPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQ 235
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 236 FFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGT 291
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ I P + SF+ ++ +++ + +S+ L TLN R ++ ++ +G P
Sbjct: 292 GLSIFISKRNPSLVTSFAFLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPS 351
Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+KE N +E +F P H + +GS+ +
Sbjct: 352 LKEGNSQETIFNPPWRH---------------------------EPVAIGSRFGEAFQEP 384
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L+EDE Y++T + K V +LK+ A D++K+ F A L
Sbjct: 385 ASFVAIRPLFEDERYMVTYNPAKDKVYALLKDQAKSDDIIKAAFHAHVL 433
>gi|449452843|ref|XP_004144168.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Cucumis sativus]
Length = 437
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 108/350 (30%), Positives = 176/350 (50%), Gaps = 31/350 (8%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPT 175
R F F +PEGFP SVT DY+ + +W S+QG+++ I +L+TQALL AIG+G + A
Sbjct: 49 RRFFDAF-VPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVI 107
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
A W L+D G L I+ + + G + D N K WRL AD + + +++++P FP F
Sbjct: 108 GATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLVSPLFPSAF 167
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+ I RS + +TR+ FA + N A++ AK +Q V+ IG+ +G+ L
Sbjct: 168 MLIVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGMLL 227
Query: 295 ANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
A I P A+ SF +T HMY N K+ + L +LNP R S+ Y+ + +
Sbjct: 228 A-RITMGCPLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLSP 286
Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
++V+ E + PA+ SS + + I R++LG+++S + + +
Sbjct: 287 QQVSMMEHVLPAWA---------------SSWSLRSIKLIHTRVRLGARISSL--HHTEM 329
Query: 414 HALF---------SLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
A F S Y+ Y+L E V++++ A D+ +S A
Sbjct: 330 XAFFNQLMNFVKSSGYKGAKYLLVERKETISVIVQKDATANDIFQSFVHA 379
>gi|326527127|dbj|BAK04505.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 114/377 (30%), Positives = 183/377 (48%), Gaps = 29/377 (7%)
Query: 88 FTKDAFVVASASNASLSSLLSVNKL---WDECRELFVQFMLPEGFPDSVTSDYLNYSLWR 144
++ A + ASAS+ L+ N+ W + +PEGFP SVT DY+ + +W
Sbjct: 33 LSRTAVLTASASS------LTSNRFGSRWGRIGSRVLGAFVPEGFPGSVTPDYVPFQMWD 86
Query: 145 SVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHF 202
++QG+++ I +L+TQALL AIG+G K A A W L+D G L I+ + + G +
Sbjct: 87 TLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFLRDLTGMLGGILFAFYQGSNL 146
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAG 262
D N K WRL AD + + +++L+P FP + I RS + +TR+
Sbjct: 147 DSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQH 206
Query: 263 FAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLK 322
FA N A++ AK +Q ++ G+ LG+ LA+ SF +T HMY N K
Sbjct: 207 FALANNAADISAKEGSQETLATMSGMGLGMLLAHVTRGHDLVVWVSFLSLTVFHMYANYK 266
Query: 323 SYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVL 382
+ QS+ L TLN R S++ ++ G+ ++V+ +E + P +
Sbjct: 267 AVQSLSLSTLNYERTSILLQYFMEHGEVLTPQQVSKQEHILPFW---------------- 310
Query: 383 SSEAKDAAVEIEHRL-QLGSKLSDVVNNKEDAHA-LFSLYEDEGYILTEHGGKFCVVLKE 440
SS K V++ H L LG+K S + ++ A S Y + Y L + G C+ + +
Sbjct: 311 SSWRKLLRVKLPHELVNLGAKASMLTHSDMLLIAKTRSYYTNANYFLLDKDGSVCIFIHK 370
Query: 441 SALPQDMLKSLFQASYL 457
A+ D+LKS L
Sbjct: 371 QAVATDVLKSFIHGLVL 387
>gi|226502346|ref|NP_001144348.1| uncharacterized protein LOC100277253 [Zea mays]
gi|195640526|gb|ACG39731.1| hypothetical protein [Zea mays]
Length = 503
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 172/349 (49%), Gaps = 36/349 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
+E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ + +
Sbjct: 105 EEIMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKV 164
Query: 174 -PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P A AINW+LKDG G + K++ + G+ FD + K R +DLL G+E+ T AFP
Sbjct: 165 TPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELTTAAFPQ 224
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 225 FFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGT 280
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ I P + SF+ ++ ++ + + +S+ L TLN R ++ ++ +G P
Sbjct: 281 GLSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPS 340
Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+KE N +E +F P H + +GS+ +
Sbjct: 341 LKEGNSQETIFNPPWRH---------------------------EPVAIGSRFGEAFQEP 373
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L+EDE Y++T + K V +LK+ A D++K+ F A L
Sbjct: 374 ASFLAIRPLFEDERYMVTYNPTKDKVYALLKDQAKSDDIIKAAFHAHVL 422
>gi|414879331|tpg|DAA56462.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
gi|414879332|tpg|DAA56463.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
Length = 503
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 172/349 (49%), Gaps = 36/349 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
+E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ + +
Sbjct: 105 EEIMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKV 164
Query: 174 -PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P A AINW+LKDG G + K++ + G+ FD + K R +DLL G+E+ T AFP
Sbjct: 165 TPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELTTAAFPQ 224
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 225 FFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGT 280
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ I P + SF+ ++ ++ + + +S+ L TLN R ++ ++ +G P
Sbjct: 281 GLSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPS 340
Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+KE N +E +F P H + +GS+ +
Sbjct: 341 LKEGNSQETIFNPPWRH---------------------------EPVAIGSRFGEAFQEP 373
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L+EDE Y++T + K V +LK+ A D++K+ F A L
Sbjct: 374 ASFLAIRPLFEDERYMVTYNPTKDKVYALLKDQAKSDDIIKAAFHAHVL 422
>gi|302756337|ref|XP_002961592.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
gi|300170251|gb|EFJ36852.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
Length = 399
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 97/299 (32%), Positives = 162/299 (54%), Gaps = 17/299 (5%)
Query: 111 KLWDECREL-------FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
+L D+ R L F+ P G+P SV YL YS +R++Q +S I V +TQ+L+
Sbjct: 27 RLLDDNRPLKHKLVDGFLDQFFPVGYPQSVGEGYLTYSQFRALQHFSSAILSVFSTQSLV 86
Query: 164 YAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGL 223
+A GL + A I+WVLKDG+ ++ K++ S G D PK WR+FAD+L +A L
Sbjct: 87 FAAGL-RPTPAQATVISWVLKDGMQHIGKLVGSRMGARMDSEPKRWRIFADVLYDAGTAL 145
Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
E+++PA P +F+ + A + A + +TR Y FA N +++ AKGEA ++
Sbjct: 146 EIISPACPQYFLGVAGLANLAKGVAMVTARATRLPLYTAFAKEGNLSDLYAKGEAMSTLA 205
Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
G+ +GI LA+ +S+ L +++ IH+Y ++ +++ L TLN R +++ ++
Sbjct: 206 NVAGLGVGIRLASSACTSLQGKLLVAPLLSVIHLYSVMEEMRAVPLDTLNAQRTAMLVAD 265
Query: 344 YLLSGQAPPVKEVNDEE----PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQ 398
YL G+ P E+ E P+ P H ++ AN L + S + + E++HR Q
Sbjct: 266 YLELGRVPSPAEIRYRERLILPVKPNVHAGSVRIANDI-LQIPSGKIR----ELKHRFQ 319
>gi|348675633|gb|EGZ15451.1| hypothetical protein PHYSODRAFT_546234 [Phytophthora sojae]
Length = 408
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 101/313 (32%), Positives = 165/313 (52%), Gaps = 5/313 (1%)
Query: 29 SGSFYSLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDF 88
+GS + L PS+ ++ + +S+D+G ++ V+G + +
Sbjct: 16 AGSPHRSALLRPSVCIASTSSSSSSTSSPAPIASQDNGFARGRTIYRVEGDGSQRFHVEK 75
Query: 89 TKDAFVV----ASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWR 144
T+ A + A ++ +SL + + ++ LP+ S+T DY Y+ W
Sbjct: 76 TRGALGLWTHKAEGLRSAPTSLAAARRTGSWLYDVLWDLFLPKDARASLTKDYFPYAKWS 135
Query: 145 SVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
V VAS + VL+ Q+LLYAIGLG GAIPTAAA+NWVLKDG+G ++ ++ +D
Sbjct: 136 FVGSVASCAASVLSMQSLLYAIGLGAGAIPTAAAVNWVLKDGLGQFGGVLFASIVNNRYD 195
Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
+PK WR+ + + + + E+LTP P F+ I + A ++ A L ++TR+ F+ F
Sbjct: 196 ADPKRWRVASSVALDVSVLGEILTPLAPGSFLAIASLANVAKNVAWLSASATRAGFHNSF 255
Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
A R N A+V AK +Q + S G LGI ++ GSS A+F+V++ +H+ KS
Sbjct: 256 AIRENLADVTAKAGSQAIASSIFGTGLGILISQFTGSSTLNVAAAFAVLSTVHLISTYKS 315
Query: 324 YQSIELRTLNPYR 336
++LRTLN R
Sbjct: 316 VDCVQLRTLNCQR 328
>gi|242036507|ref|XP_002465648.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
gi|241919502|gb|EER92646.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
Length = 440
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 107/348 (30%), Positives = 176/348 (50%), Gaps = 32/348 (9%)
Query: 113 WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KG 171
W + + +PEGFP SVT DY+ + +W ++QG+++ I +L+TQALL AIG+G +
Sbjct: 52 WGQIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 111
Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
A A W L+D G L I+ + + G + D N K WRL AD + + +++L+P F
Sbjct: 112 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 171
Query: 231 PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
P + I RS + +TR+ FA +N A++ AK +Q ++ IG+ L
Sbjct: 172 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMIGMGL 231
Query: 291 GIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
G+ LA HI ++ SF +T HMY N K+ QS+ L TLN RAS++ + G
Sbjct: 232 GMLLA-HITRGHALSVWTSFLSLTVFHMYANYKAVQSLSLTTLNYERASILLQYFKECGD 290
Query: 350 APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLGSKLSDVVN 408
++V+ +E + P + S+ K +++ H R+ LG+K S + +
Sbjct: 291 VLAPQKVSQQEHILPFW----------------STWRKLNKIKLPHERVHLGAKASMLTH 334
Query: 409 N------KEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKS 450
+ K +H YE+ Y+L G + + + A P D+L+S
Sbjct: 335 SDMLVIAKTRSH-----YENTNYLLLNKQGSVYIFIHKLATPADVLRS 377
>gi|388508448|gb|AFK42290.1| unknown [Lotus japonicus]
Length = 414
Score = 161 bits (408), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 108/368 (29%), Positives = 188/368 (51%), Gaps = 26/368 (7%)
Query: 99 SNASLSSLL--SVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGV 156
SN LS +L L+ + F+ P G+P SV YL Y+ +RSVQ V S V
Sbjct: 25 SNGQLSVMLVDDSRPLYQKVAGSFMNKFFPSGYPYSVNEGYLRYTQFRSVQHVTSAALSV 84
Query: 157 LATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLL 216
L+TQ+LL+A GL + A A++W+LKDG+ ++ K++ SN+G D PK WRL AD+L
Sbjct: 85 LSTQSLLFAAGL-RPTPAQATAVSWILKDGMQHVGKLICSNWGARMDSEPKRWRLIADVL 143
Query: 217 ENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKG 276
+ GLE+L+P PH F+ + + A + +TR Y+ FA N +++ AKG
Sbjct: 144 YDLGTGLEVLSPLCPHLFLQMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKG 203
Query: 277 EAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
EA + IGI +GI LA+ I +S+ + +++ +H+Y + ++ + TLNP R
Sbjct: 204 EAFSTLCNVIGIGVGIQLASTICASVQGKMVVGPLLSILHIYSVSEEMRATPINTLNPRR 263
Query: 337 ASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHR 396
+++ +++L +G ++ E L F+++ L+ +A +
Sbjct: 264 TAMIVTDFLKAGVVSSPADLRYREDL-----LFRVR---------LTEDAGN-------- 301
Query: 397 LQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
+++G L +V+ L + E ++L G +VL++ A +D L+ A+Y
Sbjct: 302 VRVGRALHEVIKPSRLLE-LEQVLPGEKFLLNRGGKCVDMVLEQDASGEDALRGWLVAAY 360
Query: 457 LYWLERNA 464
+E ++
Sbjct: 361 AAQIENSS 368
>gi|224082810|ref|XP_002306848.1| predicted protein [Populus trichocarpa]
gi|222856297|gb|EEE93844.1| predicted protein [Populus trichocarpa]
Length = 428
Score = 161 bits (407), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 111/394 (28%), Positives = 197/394 (50%), Gaps = 35/394 (8%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
E F+ P G+P SV YL Y+ +R++Q +S VL+TQ+LL+A GL + A
Sbjct: 60 ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 118
Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
A++W+LKDG+ + K++ SN G D PK WR+ AD+L + GLE+L+P PH F+ +
Sbjct: 119 AVSWILKDGMQHAGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFLEV 178
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ A + +TR Y+ FA N +++ AKGEA + +G+ +GI LA+
Sbjct: 179 AGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGVGIQLAST 238
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-APPVKEV 356
+ SSM + +++ +H+ C ++ ++ + TLNP R ++V ++++ +G+ + P
Sbjct: 239 VCSSMQGKFVAGPLLSIVHVCCVIEEMRATPVNTLNPQRTAMVVADFVKTGKISSPADLR 298
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
E+ LFP + Q L HR SKL + L
Sbjct: 299 YHEDLLFPGRLIENAGNVKVGQAL--------------HRAVRPSKLRE----------L 334
Query: 417 FSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCA 476
++ E +IL+ +VL+++A +D L++ A+Y +++++ STS
Sbjct: 335 KEIFPGEKFILSPGNKWTDLVLEQNASGEDALRAWLVAAYASSMKKSS---HESTSVT-- 389
Query: 477 PGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
L+ + + + F+ S+ + GW TD +
Sbjct: 390 ----LQDAYEKMNSVFDPFLSELQAKGWHTDRFL 419
>gi|47217805|emb|CAG07219.1| unnamed protein product [Tetraodon nigroviridis]
Length = 412
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 88/250 (35%), Positives = 136/250 (54%), Gaps = 2/250 (0%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
F LP+G+PDSV+SDYL Y W ++Q +S +SG LAT+A L +G+G + A AA
Sbjct: 75 FQSVFLPQGYPDSVSSDYLQYQFWDTLQAFSSSLSGTLATRASLRGVGVGNQQATVAAAT 134
Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
W+LKDG G +I+ + G D K WRLFAD+L + A +++L P FP F I
Sbjct: 135 TTWLLKDGTGMCGRILFAWLKGTKLDSEAKKWRLFADILNDIAMFMDILAPYFPAFFTLI 194
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+G +S + +TR+ A R N A++ AK +Q + G+++ +AL +
Sbjct: 195 MCVSGVFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLALVSF 254
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
+ + L+ F + T +H++ N K+ +S+ + TLN R S+V EYL G+ E N
Sbjct: 255 VADNAALTLSLFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLQEYLKDGRMLSPLEAN 314
Query: 358 DEEPLFPAFH 367
EP+ F
Sbjct: 315 QREPVLMEFK 324
>gi|242090561|ref|XP_002441113.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
gi|241946398|gb|EES19543.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
Length = 517
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 105/349 (30%), Positives = 172/349 (49%), Gaps = 36/349 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
+E +++PEGFP+SVT Y+ Y WR+++ GV T+ LL ++G+ + A
Sbjct: 121 EEIMSFIRSYVVPEGFPESVTPSYVPYMTWRAMKHFFGGAMGVFTTRTLLNSVGVSQSRA 180
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
A AINW+LKDG G + K++ + G+ FD + K R DLL G+E+ T AFP
Sbjct: 181 ASGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 240
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 241 LFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGT 296
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
++ I S P + SF+V++ ++ + +S+ L TLN R ++ ++ +G P
Sbjct: 297 GMSILISKSNPSLVTSFAVLSCGYLLSSYYEVRSVVLNTLNRARFTVAVDSFIRTGYVPS 356
Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+K+ N +E +F P H + +GS+ +
Sbjct: 357 LKDGNSQETIFNPPWRH---------------------------EPVAIGSRFGEAFQEP 389
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L+EDE YI+T + K V +LK+ A P D+LK+ F A L
Sbjct: 390 ASFIAIKPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 438
>gi|326490149|dbj|BAJ94148.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 519
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/349 (29%), Positives = 171/349 (48%), Gaps = 36/349 (10%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
+E +++PEGFPDSVT Y+ Y WR+++ V T+ALL ++G+ + A
Sbjct: 119 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRA 178
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
A AINW+L+DG G + K++ + G+ FD + K R DL+ G+E+ T AFP
Sbjct: 179 TSGAVAINWILRDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLMMELGAGIELATAAFPQ 238
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +G + I
Sbjct: 239 LFLPLACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI 298
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
++ P +ASF+V++ ++ + + +S+ L TLN R ++ ++ +G P
Sbjct: 299 LMSKR----NPSLVASFAVLSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFIKTGHVPS 354
Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+KE N EE +F P H + +GS+ +
Sbjct: 355 LKEGNLEETIFNPPWRH---------------------------QPVAIGSRFGEAFQEP 387
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A L+EDE YI+T + K V +LK+ A D+LK+ F A L
Sbjct: 388 ASFVATRPLFEDERYIVTYNPAKDKVYALLKDQAKQDDILKAAFHAHVL 436
>gi|8920582|gb|AAF81304.1|AC027656_21 Contains similarity to CG10338 gene product from Drosophila
melanogaster gb|AE003660 [Arabidopsis thaliana]
Length = 438
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 200/406 (49%), Gaps = 40/406 (9%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
F+ F P GFP SVT DY+ + LW ++QG+++ +L+TQALL AIG+G K A A
Sbjct: 52 FLGFSSP-GFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGAT 110
Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
W L+D G L I+ + + G + D N K WRL ADL+ + +++L+P FP F+ +
Sbjct: 111 FQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVV 170
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
RS + +TR+ FA + N A++ AK +Q ++ +G+ LG+ LA
Sbjct: 171 VCLGSLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLA-R 229
Query: 298 IGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
S P A+ SF +T HMY N ++ + + L +LN R+S++ + ++ +GQ ++V
Sbjct: 230 FTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQV 289
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
+ E + P + ++S N L R+QLG ++S + + D L
Sbjct: 290 SSMEGVLPLWA-TSLRSTNSKPL--------------HKRVQLGVRVSSL--PRLDMLQL 332
Query: 417 F-----SLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATST 471
S Y++ Y+L G V+L + + P D+LKS A L A ++ ST
Sbjct: 333 LNGVGASSYKNAKYLLAHIKGNVSVILHKDSKPADVLKSYIHAIVL------ANLMEKST 386
Query: 472 SADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNR 517
S ++ + ++ + S GW T+ L++ + R
Sbjct: 387 SFYSEGEA-------WIDKHYDELLHKLRSGGWKTERLLSPSITWR 425
>gi|170037702|ref|XP_001846695.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880979|gb|EDS44362.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 417
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 107/346 (30%), Positives = 166/346 (47%), Gaps = 26/346 (7%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
+ F LP G+PDSV+SDYL Y W +VQ S ISG L T A+L +G+G A P
Sbjct: 46 KRFFQNLFLPAGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPL 105
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLT-PAFPHH 233
+A I WVLKDG+G+ +I+ + + G D++ K WR+ AD+L + A G+++ P +P
Sbjct: 106 SATITWVLKDGMGHFGRILFAWWKGTELDIDSKKWRIRADILNDIAMGVDLFVLPLYPKV 165
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
+I A ++ + +TRS A R N A+V +K AQ I +G+
Sbjct: 166 STYILCATSTMKAIVGVAGGATRSALTQHHALRGNLADVASKDSAQETCVNLIASFVGLG 225
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
L H+ + F VT +H+Y N+K+ +++ LRT N R + EY SG
Sbjct: 226 LLTHVQNQRVL-YGLFLFVTLLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGTMLSP 284
Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
++VN K++ Q + L++ K E L+ N D
Sbjct: 285 QQVN------------KLERVTIGQTVSLTARVKIGCSARE--------LAQYYRNCYDL 324
Query: 414 HALFSLYED-EGYILTEHGGKFCVVLKESALPQDMLKSLFQ-ASYL 457
+ + ++ E ++L E V L + P D++KS F ASYL
Sbjct: 325 ENVIACFDSREKFLLAETRHYVGVYLHFTVKPLDIIKSYFYVASYL 370
>gi|357164706|ref|XP_003580140.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
distachyon]
Length = 437
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 110/412 (26%), Positives = 204/412 (49%), Gaps = 38/412 (9%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
D+ + F+ P G+P SV YL Y+ +R++Q +S + VL+TQ+LL+A GL +
Sbjct: 61 DKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGL-RPTP 119
Query: 174 PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
A A++W+LKDG+ + K++ S G D PK WR+FAD+L + L+ ++P P
Sbjct: 120 AQATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRIFADVLYDFGTALDFISPLCPQL 179
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
F+ + + A + +TR Y+ FA N +++ AKGEA +S +GI GI
Sbjct: 180 FLEVAGLGNFAKGMAVVAARATRLPIYSAFAKEGNLSDLFAKGEAISTLSNVMGIGAGIG 239
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-APP 352
L++ + S+ L +++ +H+Y ++ ++I + +LNP R ++V ++++ SG+ + P
Sbjct: 240 LSSTVCSTTQGKLIIGPLLSAVHLYGVVQEMRAIPINSLNPQRTAMVVADFIKSGKVSSP 299
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
+ E+ LFP + +E +++G L ++ + +
Sbjct: 300 AELRYREDLLFP-----------------------NRLIEEAGSVKVGQPLCRIL-SPQR 335
Query: 413 AHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTS 472
L + Y +E ++L+ K +VL++SA +D L+ A++ +E++ V + S
Sbjct: 336 VEQLRATYPNEKFLLSLKSNKTYMVLEQSATGEDALRGWLVAAFASEMEKSG--VGSRDS 393
Query: 473 ADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPGYVE 524
L + ++ F S+ S GW TD + NR R Y +
Sbjct: 394 V-------LNEAYQKMEDVFPMFVSEVRSRGWYTDQFLD---GNRSRIAYAK 435
>gi|449018940|dbj|BAM82342.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 738
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 91/235 (38%), Positives = 134/235 (57%), Gaps = 3/235 (1%)
Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
P+ VT+DY ++ WR Q + S GV TQALL A+G+ G I AA I+WVLKDG+
Sbjct: 379 IPEDVTADYYEFAKWRVFQRLMSSTVGVFGTQALLLALGIKSGNKISQAATISWVLKDGL 438
Query: 188 GYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
+ K++ S GR D +PK +R + LL + GLE+ T P F+ + A +
Sbjct: 439 SRVGKMLWASQLGRDMDADPKRFRFGSALLYSLGNGLEIATRIVPQSFLLVATLANTAKL 498
Query: 247 AAALIQASTRSCFYAGFAAR-RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFA 305
+ L ++TR+ Y FA R N A++ AKGEAQ ++ G+ LGI L+ IG+S P
Sbjct: 499 CSMLTASATRNAMYRSFADRNENIADITAKGEAQITIADLGGMALGIQLSKIIGTSRPKV 558
Query: 306 LASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
+A++ +++ I +Y K + IE RTLN R+SLV YL +G P K+V+ E
Sbjct: 559 VATYLILSTIDLYAIWKELRVIEFRTLNLQRSSLVIEHYLKTGCVPTPKQVSAAE 613
>gi|224129512|ref|XP_002328735.1| predicted protein [Populus trichocarpa]
gi|222839033|gb|EEE77384.1| predicted protein [Populus trichocarpa]
Length = 398
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 172/338 (50%), Gaps = 31/338 (9%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINW 181
+++PEGFPDSV Y+ Y WR+++ GV T+ LL ++G+ K AIP A AINW
Sbjct: 12 YVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTKTLLNSVGVSKSQAIPGAVAINW 71
Query: 182 VLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
+LKDG G + K++ + G+ FD + K R DLL +E+ T A PH F+ + AA
Sbjct: 72 ILKDGAGRVGKMLYARQGKKFDYDLKQLRFAGDLLMELGAAVELATAAVPHLFLPLACAA 131
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
++ AA+ STR+ Y FA N +V AKGE G V+ +LG L+ I
Sbjct: 132 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVAD----LLGTGLSIMISKR 187
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
P + +F++++ ++ + + +S+ L TLN R S+ +L +GQ P + E N +E
Sbjct: 188 NPSLVTTFALLSCGYVLSSYQEVKSVVLHTLNGARFSVAVESFLKTGQVPSLHEGNVKEN 247
Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYE 421
+F F + K +S + LG + D + A+ L+E
Sbjct: 248 IF-NFPWLKDRS-----------------------ISLGPRFKDAFQDPRAYLAIEPLFE 283
Query: 422 DEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
E YI+T + K V +LK+ A P D++K+ F A L
Sbjct: 284 RERYIVTYNPSKDKVYALLKDQAKPDDIVKAAFHAHVL 321
>gi|58392188|ref|XP_319167.2| AGAP010020-PA [Anopheles gambiae str. PEST]
gi|55236239|gb|EAA13864.2| AGAP010020-PA [Anopheles gambiae str. PEST]
Length = 417
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 105/346 (30%), Positives = 168/346 (48%), Gaps = 26/346 (7%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PT 175
+ F LP G+PDSV+SDYL+Y W ++Q S ISG L T A+L +G+G A P
Sbjct: 46 KRFFQHLFLPVGYPDSVSSDYLSYQKWDTLQAFCSTISGTLTTHAILKGVGVGSDAANPL 105
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLT-PAFPHH 233
+A I WVLKDG G+L +I+ + + G D++ K WR+ AD+L + A +++ P +P
Sbjct: 106 SATITWVLKDGTGHLGRILFAWWKGSELDIDSKKWRIRADILNDVAMAIDLFVLPYYPKA 165
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
+I A ++ + +TRS A R N A+V +K AQ I +G+
Sbjct: 166 STYILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLF 225
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
L ++ + F+ VT IH+Y N+K+ +++ LRT N R + EY SG
Sbjct: 226 LLTYL-QNQKVLYGLFAFVTLIHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSP 284
Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
++VN K++ Q + L++ K E L++ N D
Sbjct: 285 QQVN------------KLERVTVGQTVTLTARVKIGCSVRE--------LTEFYRNCYDL 324
Query: 414 HALFSLYED-EGYILTEHGGKFCVVLKESALPQDMLKSLFQ-ASYL 457
L + ++ + +++ E V L + P D++KS F ASYL
Sbjct: 325 ENLIACFDSRDKFLIAETRHYVGVYLHFTVKPLDIIKSYFYVASYL 370
>gi|115459438|ref|NP_001053319.1| Os04g0517300 [Oryza sativa Japonica Group]
gi|113564890|dbj|BAF15233.1| Os04g0517300 [Oryza sativa Japonica Group]
gi|116310213|emb|CAH67223.1| OSIGBa0145M07.5 [Oryza sativa Indica Group]
gi|215697660|dbj|BAG91654.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 110/429 (25%), Positives = 209/429 (48%), Gaps = 38/429 (8%)
Query: 98 ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVL 157
A S+ + + D+ + F+ P G+P SV YL Y+ +R++Q +S + VL
Sbjct: 41 AGRVSVKVIQDSRPIHDKMIDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVL 100
Query: 158 ATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLE 217
+TQ+LL+A GL + A A++W+LKDG+ + K++ S G D PK WR+ AD+L
Sbjct: 101 STQSLLFAAGL-RPTPAQATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLY 159
Query: 218 NAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGE 277
+ LE+++P P F+ + + A + +TR Y+ FA N +++ AKGE
Sbjct: 160 DFGTALEVISPLCPQLFLEVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGE 219
Query: 278 AQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRA 337
A + +GI GI LA+ + S+ L + +++ +H+Y ++ ++ + TLNP R
Sbjct: 220 AISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLNPQRT 279
Query: 338 SLVFSEYLLSGQ-APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHR 396
+++ ++++ SG+ + P + E+ LFP + +E
Sbjct: 280 AMIVADFIKSGKVSSPAELRYREDLLFP-----------------------NRLIEEAGS 316
Query: 397 LQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
+++G + V+ + + L + + E ++L+ +VL++SA +D L+ A++
Sbjct: 317 VKIGQPVRRVL-SPQRIEQLKATFSKEKFLLSRKDNSAYMVLEQSATGEDALRGWLVAAF 375
Query: 457 LYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPN 516
+ER +G+ + T L ++ + ++ F ++ S GW TD + N
Sbjct: 376 ASEMER-SGVGSGDTV--------LNVAYERMENVFPMFVAEVKSRGWYTDQFLD---GN 423
Query: 517 RIRPGYVEP 525
R R Y P
Sbjct: 424 RSRIAYANP 432
>gi|355710161|gb|EHH31625.1| hypothetical protein EGK_12728 [Macaca mulatta]
Length = 468
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNAS--L 103
C Q + AR ++ D + W G R + +DA V + AS L
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPRAASPPL 70
Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
S L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L
Sbjct: 71 SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116
Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L + A
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE++ P +P F + + + ++ +TR+ A R N A+V AK +Q
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G+++ + + + F+L F +T +H+Y N ++ +S+ + TLN R LV
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVL 296
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
YL G+ N EPL+ F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321
>gi|302564091|ref|NP_001181525.1| UPF0420 protein C16orf58 [Macaca mulatta]
gi|380815644|gb|AFE79696.1| hypothetical protein LOC64755 [Macaca mulatta]
Length = 468
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNAS--L 103
C Q + AR ++ D + W G R + +DA V + AS L
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPRAASPPL 70
Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
S L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L
Sbjct: 71 SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116
Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L + A
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE++ P +P F + + + ++ +TR+ A R N A+V AK +Q
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G+++ + + + F+L F +T +H+Y N ++ +S+ + TLN R LV
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVL 296
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
YL G+ N EPL+ F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321
>gi|414586417|tpg|DAA36988.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 439
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 107/400 (26%), Positives = 200/400 (50%), Gaps = 34/400 (8%)
Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
+ D+ + F+ P G+P SV YL Y+ +R +Q +S + VL+TQ+LL+A GL +
Sbjct: 60 IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL-RP 118
Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
A A++W+LKDG+ + K++ S+ G D PK WR+ AD+L + LE+++P P
Sbjct: 119 TPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCP 178
Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
F+ + + A + +TR Y+ FA N +++ AKGEA + +GI G
Sbjct: 179 QLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAG 238
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-A 350
I LA+ + S+ L + +++ +H+Y ++ ++ + TLNP R +++ ++++ SG+ +
Sbjct: 239 IGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVS 298
Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
P + E+ LFP + +E +++G L V + +
Sbjct: 299 SPAELRYKEDLLFP-----------------------NRVIEEAGSVKIGQPLRKVFSPR 335
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATS 470
L + + E ++L++ + +VL+ SA +D L+ A++ +ER+ +
Sbjct: 336 L-VEQLRATFPGEKFLLSQKSNRTYMVLERSASGEDALRGWLVAAFASEMERSG-----A 389
Query: 471 TSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
S D A L + + ++R F S+ S GW TD +
Sbjct: 390 GSRDDAV---LSEAYEKMERVFPTFVSEVRSRGWYTDQFL 426
>gi|402908238|ref|XP_003916859.1| PREDICTED: UPF0420 protein C16orf58 homolog [Papio anubis]
Length = 468
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNAS--L 103
C Q + AR ++ D + W G R + +DA V + AS L
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPRAASPPL 70
Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
S L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L
Sbjct: 71 SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116
Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L + A
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE++ P +P F + + + ++ +TR+ A R N A+V AK +Q
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G+++ + + + F+L F +T +H+Y N ++ +S+ + TLN R LV
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVL 296
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
YL G+ N EPL+ F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321
>gi|332262940|ref|XP_003280516.1| PREDICTED: UPF0420 protein C16orf58 homolog [Nomascus leucogenys]
Length = 468
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 105/325 (32%), Positives = 159/325 (48%), Gaps = 18/325 (5%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
C Q + AR ++ D + W G R + +DA V + AS+ L
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPGASSPPL 70
Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
S L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L
Sbjct: 71 SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116
Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L + A
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE++ P +P F + + + ++ +TR+ A R N A+V AK +Q
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G+++ + + + F+L F +T +H+Y N ++ +++ + TLN R LV
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
YL G+ N EPL+ F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321
>gi|225438149|ref|XP_002273202.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
Length = 420
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 112/395 (28%), Positives = 194/395 (49%), Gaps = 37/395 (9%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
E F+ P G+P SV YL Y+ +R++Q +S VL+TQ+LL+A GL + A
Sbjct: 52 ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 110
Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
A++WVLKDG+ ++ K++ SN G D PK WR+ AD L + GLE+L+P PH F+ +
Sbjct: 111 AVSWVLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFLEM 170
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ A + +TR Y+ FA N +++ AKGEA + +G+ GI LA+
Sbjct: 171 AGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLAST 230
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
I SSM + + +++ IH+Y ++ ++ + TLNP R +++ ++++ +G+ ++
Sbjct: 231 ICSSMQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPADLR 290
Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
+E L +E +++G L VV + L
Sbjct: 291 YQEDLIFTGRL----------------------IEDAGNVKVGRDLHKVVKPSKLCK-LK 327
Query: 418 SLYEDEGYILTEHGGKFC-VVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCA 476
++ +E ++L HG K+ +VL+ A +D L+ A+Y+ LE+ D
Sbjct: 328 GVFPEEKFLLC-HGKKWTDMVLEHDATGEDALRGWLVAAYVANLEKT----------DHE 376
Query: 477 PGGR-LEISLDYVQREFNHVKSDSASVGWVTDGLI 510
P L+ + D + F+ S + GW TD +
Sbjct: 377 PSMTVLQEAYDKMNSVFSTFVSKLQAKGWHTDCFL 411
>gi|297744143|emb|CBI37113.3| unnamed protein product [Vitis vinifera]
Length = 424
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 112/395 (28%), Positives = 194/395 (49%), Gaps = 37/395 (9%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
E F+ P G+P SV YL Y+ +R++Q +S VL+TQ+LL+A GL + A
Sbjct: 56 ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 114
Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
A++WVLKDG+ ++ K++ SN G D PK WR+ AD L + GLE+L+P PH F+ +
Sbjct: 115 AVSWVLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFLEM 174
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ A + +TR Y+ FA N +++ AKGEA + +G+ GI LA+
Sbjct: 175 AGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLAST 234
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
I SSM + + +++ IH+Y ++ ++ + TLNP R +++ ++++ +G+ ++
Sbjct: 235 ICSSMQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPADLR 294
Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
+E L +E +++G L VV + L
Sbjct: 295 YQEDLIFTGRL----------------------IEDAGNVKVGRDLHKVVKPSKLCK-LK 331
Query: 418 SLYEDEGYILTEHGGKFC-VVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCA 476
++ +E ++L HG K+ +VL+ A +D L+ A+Y+ LE+ D
Sbjct: 332 GVFPEEKFLLC-HGKKWTDMVLEHDATGEDALRGWLVAAYVANLEKT----------DHE 380
Query: 477 PGGR-LEISLDYVQREFNHVKSDSASVGWVTDGLI 510
P L+ + D + F+ S + GW TD +
Sbjct: 381 PSMTVLQEAYDKMNSVFSTFVSKLQAKGWHTDCFL 415
>gi|78486556|ref|NP_001030594.1| UPF0420 protein C16orf58 homolog [Rattus norvegicus]
gi|123781627|sp|Q499P8.1|CP058_RAT RecName: Full=UPF0420 protein C16orf58 homolog
gi|71682994|gb|AAH99813.1| Similar to cDNA sequence BC017158 [Rattus norvegicus]
Length = 466
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 87/245 (35%), Positives = 134/245 (54%), Gaps = 2/245 (0%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
+LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L +G+G A +AA W+
Sbjct: 75 LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
+KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ + +TR+ A R N A+V AK +Q V G+++ + + +
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
+ +L F ++T +H+Y N ++ +++ L TLN R LV +L G+ N EP
Sbjct: 255 LSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314
Query: 362 LFPAF 366
L+ F
Sbjct: 315 LWTGF 319
>gi|405974674|gb|EKC39300.1| UPF0420 protein C16orf58-like protein [Crassostrea gigas]
Length = 444
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 104/355 (29%), Positives = 172/355 (48%), Gaps = 29/355 (8%)
Query: 113 WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KG 171
W + F LP+G+P+SV+SDYL Y +W ++Q AS I+G LA QALL +G+G +
Sbjct: 33 WTSLSQFFTVVFLPQGYPESVSSDYLQYQIWDTIQAFASSITGTLAAQALLKGVGVGDES 92
Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
A AA + W+LKDG G + +I+ + G + D + K WRLFAD+L + A +E+L P F
Sbjct: 93 ATVMAATLTWILKDGTGMIGRILFAWMQGTNLDCDAKRWRLFADILNDMAIFMEILAPYF 152
Query: 231 PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
+F I AG +S + +TR+ A R N A+V AK +Q + ++
Sbjct: 153 KAYFTPIVCTAGVCKSIVGVAGGATRAALTQHQARRNNMADVSAKDGSQETLVNLAALVC 212
Query: 291 GIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
+ L + + F + T +H++ N + S+ + TLN R ++ YL +G+
Sbjct: 213 SLTLVPLVTGKDVLIWSLFLLFTLLHLFANYSAVTSVVMETLNQARLHILVHHYLQTGEV 272
Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
++EVN +EP+ +K + + + LG+ + V +
Sbjct: 273 LSLQEVNYKEPV-----LWKTRR--------------------KLNVHLGTSIGKVFSRA 307
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYLYWLERN 463
+LY+D Y+L + CV +L + D L S FQA + ++ N
Sbjct: 308 SILEESIALYKDSNYLLDINLQTGCVSIILSPLSTTMDHLTSCFQAEVIDYVYDN 362
>gi|327286048|ref|XP_003227743.1| PREDICTED: UPF0420 protein C16orf58-like [Anolis carolinensis]
Length = 420
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 95/273 (34%), Positives = 151/273 (55%), Gaps = 8/273 (2%)
Query: 97 SASNASLSSLLSVNKLWDE----CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQ 152
S + LS +L ++L DE R+ F LP+G+P+SV+ DYL Y +W +VQ AS
Sbjct: 24 SLPDGRLSCVL--DRLEDEGYRSIRDFFTAVFLPQGYPESVSPDYLAYQIWDTVQAFASS 81
Query: 153 ISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWR 210
I+G LATQA+L +G+G + + TAA + W+LKDG G L +I+ + + G D + K WR
Sbjct: 82 ITGTLATQAVLKGVGVGDETSTVTAATVTWILKDGTGMLGRILFAWSKGSKLDCDAKQWR 141
Query: 211 LFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFA 270
LFAD+L + A +E+L PAFP F I +G + + +TR+ A R N A
Sbjct: 142 LFADVLNDVAILMEILAPAFPACFTLIVCISGFFKCIVGVAGGATRAALTMHQARRDNMA 201
Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELR 330
+V AK +Q + G++ + L + ++ + + + T +H+Y N ++ +++ +
Sbjct: 202 DVSAKDGSQETLVNLAGLLFSLLLIPLVADNLRLTYSLYGLFTILHLYANYRAVRAVCME 261
Query: 331 TLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
TLN R LV +L SG+ P N +EPL
Sbjct: 262 TLNRARLCLVLQSFLKSGEVPGPDVTNPQEPLL 294
>gi|357120354|ref|XP_003561892.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
Length = 452
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 168/344 (48%), Gaps = 20/344 (5%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
+PEGFP SVT DY+ + +W ++QG+++ I +L+TQALL AIG+G K A A W L
Sbjct: 72 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLSAIGVGEKSATVIGATFQWFL 131
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
+D G L I+ + + G + D N K WRL AD + + +++L+P FP + I
Sbjct: 132 RDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 191
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
RS + +TR+ FA +N A++ AK +Q ++ G+ LG+ LA+
Sbjct: 192 LSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMSGMGLGMLLAHVTRGHD 251
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
SF +T HMY N K+ QS+ L TLN R S++ +L G+ ++V+ +E +
Sbjct: 252 LVVWVSFLSLTIFHMYANYKAVQSLSLVTLNYERTSILLQYFLEDGEVRTPQQVSKQEHI 311
Query: 363 FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRL-QLGSKLSDVVNNKE-DAHALFSLY 420
P + SS + ++ H L LG+K S + ++ S Y
Sbjct: 312 LPFW----------------SSWRRLLRTKLPHELVHLGAKASMLTHSDMLLISKTRSYY 355
Query: 421 EDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNA 464
D Y L + G + + + A D+LKS L L R +
Sbjct: 356 TDANYFLLDKEGSVHIFIHKQAAATDVLKSFVHGLVLARLTRKS 399
>gi|356505015|ref|XP_003521288.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 415
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/229 (37%), Positives = 134/229 (58%), Gaps = 1/229 (0%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
F+ P G+P SV YL Y+ +R+VQ V S VL+TQ+LL+A GL + A A+
Sbjct: 48 FMNKFFPSGYPYSVNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGL-RPTPAQATAV 106
Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
+W+LKDG+ ++ K++ SN+G D PK WRL AD L + GLE+L+P PH F+ +
Sbjct: 107 SWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFLEMAG 166
Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
+ A + +TR Y+ FA N +++ AKGEA + IGI +GI LA+ I
Sbjct: 167 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTIC 226
Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
+SM L + +++ IH+Y + ++ + TLNP R ++V +++L +G
Sbjct: 227 ASMQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAG 275
>gi|281201954|gb|EFA76161.1| DUF647 family protein [Polysphondylium pallidum PN500]
Length = 430
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 152/304 (50%), Gaps = 25/304 (8%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
EL LP G+PDSVTSDY++Y +W + Q ++S +SGVLAT+A+L G+G A +
Sbjct: 45 ELLRSMFLPAGYPDSVTSDYMSYQVWDTAQALSSSLSGVLATRAMLRGYGVGNAAASVGS 104
Query: 178 AI-NWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
A+ W+L+DG G + +I + GR D N K WR AD+L + LEML+P F+
Sbjct: 105 AVLQWILRDGAGMIGRIYFAWKTGRQLDCNAKTWRFMADILNDIGMTLEMLSPLSNVLFM 164
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
+ +S + T++ FA N +V AK +Q +G+ LG AL+
Sbjct: 165 PLSVCGIVSKSICGVAGGCTKASLTMHFAKNDNMGDVSAKDGSQETAVGLVGMFLGAALS 224
Query: 296 NHIGSSMPFALASFSVVTWI--------HMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
I + +V W+ H+YCN ++ + L ++N YRA+L++ YL
Sbjct: 225 ASIPDESE-QTTTATVAIWMLFFILIAFHLYCNYRAVTGVVLTSINRYRATLIYKHYLQY 283
Query: 348 GQAPPVKEV---------NDEEPLFPAFHFFKIKSANKSQLLVLSS----EAKDAAVEIE 394
G+ P +V +D++P++ I + NK Q S E+K ++ I
Sbjct: 284 GEIPTPIQVSKRENILFRDDKQPIYLGASLLNIINLNKQQQQQQQSKEIRESKTNSI-IR 342
Query: 395 HRLQ 398
H+L+
Sbjct: 343 HKLE 346
>gi|355756739|gb|EHH60347.1| hypothetical protein EGM_11687 [Macaca fascicularis]
Length = 468
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 157/325 (48%), Gaps = 18/325 (5%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNAS--L 103
C Q + AR ++ D + W G R + DA V + AS L
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGGDAGEVGAPRAASPPL 70
Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
S L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L
Sbjct: 71 SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116
Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L + A
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE++ P +P F + + + ++ +TR+ A R N A+V AK +Q
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G+++ + + + F+L F +T +H+Y N ++ +S+ + TLN R LV
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVL 296
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
YL G+ N EPL+ F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321
>gi|38568058|emb|CAE05447.3| OSJNBa0073E02.7 [Oryza sativa Japonica Group]
gi|125549033|gb|EAY94855.1| hypothetical protein OsI_16651 [Oryza sativa Indica Group]
gi|125590999|gb|EAZ31349.1| hypothetical protein OsJ_15471 [Oryza sativa Japonica Group]
Length = 380
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/407 (26%), Positives = 201/407 (49%), Gaps = 38/407 (9%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
F+ P G+P SV YL Y+ +R++Q +S + VL+TQ+LL+A GL + A A+
Sbjct: 5 FLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGL-RPTPAQATAV 63
Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
+W+LKDG+ + K++ S G D PK WR+ AD+L + LE+++P P F+ +
Sbjct: 64 SWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFLEVAG 123
Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
+ A + +TR Y+ FA N +++ AKGEA + +GI GI LA+ +
Sbjct: 124 FGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVC 183
Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-APPVKEVND 358
S+ L + +++ +H+Y ++ ++ + TLNP R +++ ++++ SG+ + P +
Sbjct: 184 STTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAELRYR 243
Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
E+ LFP + +E +++G + V+ + + L +
Sbjct: 244 EDLLFP-----------------------NRLIEEAGSVKIGQPVRRVL-SPQRIEQLKA 279
Query: 419 LYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPG 478
+ E ++L+ +VL++SA +D L+ A++ +ER+ G+ + T
Sbjct: 280 TFSKEKFLLSRKDNSAYMVLEQSATGEDALRGWLVAAFASEMERS-GVGSGDTV------ 332
Query: 479 GRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPGYVEP 525
L ++ + ++ F ++ S GW TD + NR R Y P
Sbjct: 333 --LNVAYERMENVFPMFVAEVKSRGWYTDQFLD---GNRSRIAYANP 374
>gi|312374874|gb|EFR22346.1| hypothetical protein AND_15402 [Anopheles darlingi]
Length = 417
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 168/346 (48%), Gaps = 26/346 (7%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT- 175
+ F LP G+PDSV+SDYL Y W +VQ S ISG L T A+L +G+G ++
Sbjct: 46 KRFFQHLFLPVGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDSVTAL 105
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLT-PAFPHH 233
+A + WVLKDG+G+L +I+ + + G D++ K WR+ AD+L + A +++ P +P
Sbjct: 106 SATVTWVLKDGMGHLGRILFAWWKGTELDIDSKKWRIRADVLNDIAMAIDLFVLPYYPKA 165
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
+I ++ + +TRS A R N A+V +K AQ I +G+
Sbjct: 166 STYILCGTTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLL 225
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
L ++ + F+ VT++H+Y N+K+ +++ LRT N R + EY SG
Sbjct: 226 LLTYL-QNQRILYGLFAFVTFLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSP 284
Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
++VN K++ Q + L++ K E L++ N D
Sbjct: 285 RQVN------------KLERVTVGQTVSLTARVKIGCSVRE--------LTEFYRNCYDL 324
Query: 414 HALFSLYED-EGYILTEHGGKFCVVLKESALPQDMLKSLFQ-ASYL 457
L + ++ E +++ E V L S P D++KS F ASYL
Sbjct: 325 ENLIACFDSREKFLIAETRHYVGVYLHFSVKPLDIIKSYFYVASYL 370
>gi|297283910|ref|XP_002802508.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Macaca
mulatta]
Length = 429
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/244 (36%), Positives = 133/244 (54%), Gaps = 2/244 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L IG+G A +AA W++
Sbjct: 39 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 98
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 99 KDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 158
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ ++ +TR+ A R N A+V AK +Q + G+++ + + +
Sbjct: 159 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 218
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
F+L F +T +H+Y N ++ +S+ + TLN R LV YL G+ N EPL
Sbjct: 219 GFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 278
Query: 363 FPAF 366
+ F
Sbjct: 279 WTGF 282
>gi|356545865|ref|XP_003541354.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 431
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/364 (29%), Positives = 188/364 (51%), Gaps = 32/364 (8%)
Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
L+ + F+ P G+P SV YL Y+ +R++Q AS VL+TQ+LL+A GL +
Sbjct: 57 LYHRIVDSFLNKFFPSGYPYSVNEGYLRYTQFRALQHTASAALSVLSTQSLLFAAGL-RP 115
Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
A A++WVLKDG+ +L I+ S G D PK WR+ AD+L + GLE+L+P P
Sbjct: 116 TPAQATAVSWVLKDGMQHLGNIICSKLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCP 175
Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
F+ + + + + +TR Y+ FA N +++ A+GEA + +GI +G
Sbjct: 176 QFFLEMAGIGNLAKGMSIVASRATRLPIYSSFAKEGNLSDLFARGEAISTLFNVVGIGIG 235
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-A 350
I LA+ + +SM L +++ IH+Y + ++ + TLNP R +++ +++L +G +
Sbjct: 236 IQLASTVCASMQGKLVVGPLLSIIHIYSVSEEMRATPVNTLNPQRTAMIVADFLKAGSVS 295
Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAA-VEIEHRLQLGSKLSDVVNN 409
P E+ LFP +L+ +DA V + L K S ++ +
Sbjct: 296 SPADLRYREDLLFP------------RRLI------EDAGNVRVGRDLHKVIKPSRLLES 337
Query: 410 KEDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYLYWLERNAGIV 467
K+ ++ E +IL +G C+ VL++ A+ +D L+ ASY + +++ +
Sbjct: 338 KQ-------VFPGEKFIL--NGDNRCIDMVLEQDAIGKDALRGWLVASYAVQIGKSSHEL 388
Query: 468 ATST 471
+TST
Sbjct: 389 STST 392
>gi|426381987|ref|XP_004057610.1| PREDICTED: UPF0420 protein C16orf58 homolog [Gorilla gorilla
gorilla]
Length = 468
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 104/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
C Q + AR ++ D + W G R + +DA V + A + L
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPGAPSPPL 70
Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
S L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L
Sbjct: 71 SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116
Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L + A
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE++ P +P F + + + ++ +TR+ A R N A+V AK +Q
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G+++ + + + F+L F +T +H+Y N ++ +++ + TLN R LV
Sbjct: 237 LVNLAGLLVSLLILPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
YL G+ N EPL+ F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321
>gi|255639094|gb|ACU19847.1| unknown [Glycine max]
Length = 302
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/229 (37%), Positives = 134/229 (58%), Gaps = 1/229 (0%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
F+ P G+P SV YL Y+ +R+VQ V S VL+TQ+LL+A GL + A A+
Sbjct: 48 FMNKFFPSGYPYSVNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGL-RPTPAQATAV 106
Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
+W+LKDG+ ++ K++ SN+G D PK WRL AD L + GLE+L+P PH F+ +
Sbjct: 107 SWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFLEMAG 166
Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
+ A + +TR Y+ FA N +++ AKGEA + IGI +GI LA+ I
Sbjct: 167 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTIC 226
Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
+SM L + +++ IH+Y + ++ + TLNP R ++V +++L +G
Sbjct: 227 ASMQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAG 275
>gi|383859063|ref|XP_003705017.1| PREDICTED: UPF0420 protein C16orf58-like [Megachile rotundata]
Length = 464
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 105/373 (28%), Positives = 180/373 (48%), Gaps = 25/373 (6%)
Query: 90 KDAFVVASASNASLSSLLS----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRS 145
K+ F+V S S++ LLS + + + LP+G+PDSV DY Y +W +
Sbjct: 15 KERFIVKSKDEQSITELLSDTTRTKPFYSGFVSILKEVFLPQGYPDSVHPDYTAYQIWDT 74
Query: 146 VQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
VQ AS I G L T +++ +G+G+ A P AAAI W+LKDG G + +I+ + + G D
Sbjct: 75 VQAFASTIMGTLTTHSIMQGVGVGEPAATPLAAAITWILKDGTGMIGRIVFAWWNGTDLD 134
Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
K WRLFAD+L + A GLE+L P + + + I + A +S + +TR+
Sbjct: 135 GQCKKWRLFADILNDVAMGLELLLPYYSSYSLAILCVSTAMKSIVGVAGGATRAAVTQHQ 194
Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
A + N A+V AK +Q I ++GI + + I + + + + + +H+Y N +
Sbjct: 195 ALQNNLADVSAKDGSQETCVNLIASLVGILILS-IFHNGRYIMELYFFLIIVHLYANYSA 253
Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
+S+ L +LN R +L+ Y+ + P +EVN +E S LL+
Sbjct: 254 VKSLCLNSLNEDRLALIVKSYITNEVIPEPEEVNKKE----------------SVLLLTK 297
Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKES 441
I+ + L + + + + + D L+ D Y+++ + K+
Sbjct: 298 PTMSIYGFNIKMGVSLAALIKNNIISTSDTELSLKLFLDRKYLISIDVQNKTIFICFKKD 357
Query: 442 ALPQDMLKSLFQA 454
A P D+L++ F A
Sbjct: 358 AQPCDVLEAYFHA 370
>gi|114662190|ref|XP_001159530.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan troglodytes]
gi|397472000|ref|XP_003807550.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan paniscus]
gi|410212432|gb|JAA03435.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410264128|gb|JAA20030.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410292910|gb|JAA25055.1| chromosome 16 open reading frame 58 [Pan troglodytes]
gi|410340381|gb|JAA39137.1| chromosome 16 open reading frame 58 [Pan troglodytes]
Length = 468
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
C Q + AR ++ D + W G R + +DA V + A + L
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPGAPSPPL 70
Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
S L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L
Sbjct: 71 SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116
Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L + A
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE++ P +P F + + + ++ +TR+ A R N A+V AK +Q
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G+++ + + + F+L F +T +H+Y N ++ +++ + TLN R LV
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
YL G+ N EPL+ F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321
>gi|257900512|ref|NP_073581.2| UPF0420 protein C16orf58 [Homo sapiens]
gi|143955305|sp|Q96GQ5.2|CP058_HUMAN RecName: Full=UPF0420 protein C16orf58
gi|62897385|dbj|BAD96633.1| hypothetical protein FLJ13868 variant [Homo sapiens]
gi|119572504|gb|EAW52119.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
gi|119572505|gb|EAW52120.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
gi|119572506|gb|EAW52121.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
Length = 468
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
C Q + AR ++ D + W G R + +DA V + A + L
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGASGAPSPPL 70
Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
S L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L
Sbjct: 71 SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116
Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L + A
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE++ P +P F + + + ++ +TR+ A R N A+V AK +Q
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G+++ + + + F+L F +T +H+Y N ++ +++ + TLN R LV
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
YL G+ N EPL+ F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321
>gi|10436021|dbj|BAB14730.1| unnamed protein product [Homo sapiens]
Length = 468
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
C Q + AR ++ D + W G R + +DA V + A + L
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGASGAPSPPL 70
Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
S L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L
Sbjct: 71 SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116
Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L + A
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE++ P +P F + + + ++ +TR+ A R N A+V AK +Q
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G+++ + + + F+L F +T +H+Y N ++ +++ + TLN R LV
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
YL G+ N EPL+ F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321
>gi|81879541|sp|Q91W34.1|CP058_MOUSE RecName: Full=UPF0420 protein C16orf58 homolog
gi|16877856|gb|AAH17158.1| CDNA sequence BC017158 [Mus musculus]
Length = 466
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 134/245 (54%), Gaps = 2/245 (0%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
+LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L +G+G A +AA W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
+KD G L +I+L+ + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 135 VKDSTGMLGRIILAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ + +TR+ A R N A+V AK +Q V G+++ + + +
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
+L F ++T +H+Y N ++ +++ L TLN R LV +L G+ N EP
Sbjct: 255 PSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQMEP 314
Query: 362 LFPAF 366
L+ F
Sbjct: 315 LWTGF 319
>gi|356572258|ref|XP_003554286.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
Length = 419
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 135/229 (58%), Gaps = 1/229 (0%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
F+ P G+P SV YL Y+ +R++Q V S VL+TQ+LL+A GL + A A+
Sbjct: 52 FMNKFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGL-RPTPAQATAV 110
Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
+W+LKDG+ ++ K++ SN+G D PK WRL AD+L + GLE+L+P PH F+ +
Sbjct: 111 SWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADVLYDIGIGLEVLSPLCPHLFLEMAG 170
Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
+ A + +TR Y+ FA N +++ AKGEA + +GI +GI LA+ I
Sbjct: 171 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVVGIGVGIQLASTIC 230
Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
+S+ L + +++ IH+Y + ++ + TLNP R ++V +++L +G
Sbjct: 231 ASIQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAG 279
>gi|395842960|ref|XP_003794274.1| PREDICTED: UPF0420 protein C16orf58 homolog [Otolemur garnettii]
Length = 468
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/334 (32%), Positives = 167/334 (50%), Gaps = 24/334 (7%)
Query: 41 SLLYCFCHLQVATAIARTATSSEDDGNKEYDAV----WEVKGSKRTKLIPDFTKDAFVVA 96
SL+ C Q + AR ++ D G+ ++DA W G+ K P+ + V A
Sbjct: 6 SLVAPLCSEQFGSGAARRCHAAAD-GSLQWDARGWRWWGFSGAFTVK--PEGRDEGGVGA 62
Query: 97 -SASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISG 155
A++ LS L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG
Sbjct: 63 PGAASPPLSGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSG 108
Query: 156 VLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFA 213
LATQA+L +G+G A +AA W++KD G L +I+ + + G D N K WRLFA
Sbjct: 109 SLATQAVLLGLGVGNAKASVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFA 168
Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
D+L + A LE++ P +P F + + + ++ +TR+ A R N A+V
Sbjct: 169 DILNDVAMFLEIMAPIYPICFTMTVSTSNLAKCIVSVAGGATRAALTMHQARRNNMADVS 228
Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
AK +Q + G+++ + + + F+L F +T +H+Y N ++ ++I + TLN
Sbjct: 229 AKDSSQETLVNLAGLLVSLLILPLVSDCPSFSLGCFFFLTALHIYANYRAVRAIVMETLN 288
Query: 334 PYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFH 367
R LV +L G+ N EPL+ F
Sbjct: 289 QGRLRLVLKHFLQRGEVLDPTSANKMEPLWTGFR 322
>gi|449489204|ref|XP_004158245.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 435
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 109/346 (31%), Positives = 175/346 (50%), Gaps = 25/346 (7%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPT 175
R F F +PEGFP SVT DY+ + +W S+QG+++ I +L+TQALL AIG+G + A
Sbjct: 49 RRFFDAF-VPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVI 107
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLE-MLTPAFPHH 233
A W L+D G L I+ + + G + D N K WRL AD + + ++ ++P FP
Sbjct: 108 GATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMYVDGPVSPLFPSA 167
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
F+ I RS + +TR+ FA + N A++ AK +Q V+ IG+ +G+
Sbjct: 168 FMLIVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGML 227
Query: 294 LANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
LA I P A+ SF +T HMY N K+ + L +LNP R S+ Y+ + +
Sbjct: 228 LA-RITMGCPLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLS 286
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN--K 410
++V+ E + PA+ SS + + I R++LG+++S + + K
Sbjct: 287 PQQVSMMEHVLPAWA---------------SSWSLRSIKLIHTRVRLGARISSLHHTEIK 331
Query: 411 EDAHALFSLYED--EGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
E H + D E Y+L E V++++ A D+ +S A
Sbjct: 332 ELLHLAGANNNDKQEKYLLVERKETISVIVQKDATANDIFQSFVHA 377
>gi|149067630|gb|EDM17182.1| similar to cDNA sequence BC017158, isoform CRA_b [Rattus
norvegicus]
Length = 466
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 86/245 (35%), Positives = 133/245 (54%), Gaps = 2/245 (0%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
+LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L +G+G A +AA W+
Sbjct: 75 LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
+KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ + +TR+ A R N A+V AK +Q V G+++ + + +
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
+ + F ++T +H+Y N ++ +++ L TLN R LV +L G+ N EP
Sbjct: 255 LSLSFGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314
Query: 362 LFPAF 366
L+ F
Sbjct: 315 LWTGF 319
>gi|226494744|ref|NP_001146435.1| uncharacterized protein LOC100280018 [Zea mays]
gi|219887187|gb|ACL53968.1| unknown [Zea mays]
Length = 439
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/400 (26%), Positives = 199/400 (49%), Gaps = 34/400 (8%)
Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
+ D+ + F+ P G+P SV YL Y+ +R +Q +S + VL+TQ+LL+A GL +
Sbjct: 60 IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL-RP 118
Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
A A++W+LKDG+ + K++ S+ G D PK W + AD+L + LE+++P P
Sbjct: 119 TPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWTILADVLYDLGTALEVVSPLCP 178
Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
F+ + + A + +TR Y+ FA N +++ AKGEA + +GI G
Sbjct: 179 QLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAG 238
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-A 350
I LA+ + S+ L + +++ +H+Y ++ ++ + TLNP R +++ ++++ SG+ +
Sbjct: 239 IGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVS 298
Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
P + E+ LFP + +E +++G L V + +
Sbjct: 299 SPAELRYKEDLLFP-----------------------NRVIEEAGSVKIGQPLRKVFSPR 335
Query: 411 EDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATS 470
L + + E ++L++ + +VL+ SA +D L+ A++ +ER+ +
Sbjct: 336 L-VEQLRATFPGEKFLLSQKSNRTYMVLERSASGEDALRGWLVAAFASEMERSG-----A 389
Query: 471 TSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
S D A L + + ++R F S+ S GW TD +
Sbjct: 390 GSRDDAV---LSEAYEKMERVFPTFVSEVRSRGWYTDQFL 426
>gi|168027023|ref|XP_001766030.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682673|gb|EDQ69089.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 397
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 81/244 (33%), Positives = 139/244 (56%), Gaps = 1/244 (0%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
F+ P G+P+SV YL+YS +R+VQ AS + VL+TQ+LL+A GL + A +
Sbjct: 26 FLSRFFPLGYPNSVAEGYLDYSKYRAVQHFASAVMHVLSTQSLLFAAGL-RPTPAQATIV 84
Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
+WVLKDG+ ++ K++ S G D PK WR+FAD + + GLE+++P P HF+ +
Sbjct: 85 SWVLKDGMQHVGKLLCSRMGARMDSEPKRWRIFADFMYDVGAGLEVISPLCPQHFLPVAG 144
Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
A + A + +TR Y+ F N +++ AKGEA +S +G+ +GI LA++
Sbjct: 145 MANLAKGMALVSARATRLPVYSSFVKEGNLSDLYAKGEAISTLSNVLGLGVGIHLASNFA 204
Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
S++ L +++ IH+YC + ++ + TLN R +++ ++++ +G E+
Sbjct: 205 STIQGKLIWAPILSAIHLYCVQQEMRAAPINTLNSQRTAMLVADFVKTGSVSSPAELRYR 264
Query: 360 EPLF 363
E L
Sbjct: 265 ERLI 268
>gi|197101105|ref|NP_001126133.1| UPF0420 protein C16orf58 homolog [Pongo abelii]
gi|75041556|sp|Q5R8F6.1|CP058_PONAB RecName: Full=UPF0420 protein C16orf58 homolog
gi|55730464|emb|CAH91954.1| hypothetical protein [Pongo abelii]
Length = 468
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 88/244 (36%), Positives = 134/244 (54%), Gaps = 2/244 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L IG+G A +AA W++
Sbjct: 78 LPQGFPDSVSPDYLPYQLWDSVQAFASGLSGSLATQAVLLGIGVGNAKATVSAATATWLV 137
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 138 KDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 197
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ ++ +TR+ A R N A+V AK +Q + +G+++ + + +
Sbjct: 198 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCP 257
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
F+L F +T +H+Y N ++ +++ + TLN R LV YL G+ N EPL
Sbjct: 258 GFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLNPTAANRMEPL 317
Query: 363 FPAF 366
+ F
Sbjct: 318 WTGF 321
>gi|340711243|ref|XP_003394188.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus terrestris]
Length = 464
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 178/373 (47%), Gaps = 25/373 (6%)
Query: 90 KDAFVVASASNASLSSLLS----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRS 145
KD F V S S++ LLS + + + LP+G+PDSV DY +Y +W +
Sbjct: 15 KDRFFVKSKDEQSITELLSDTTRTKSFYSGFISIVKEVFLPQGYPDSVHPDYTSYQIWDT 74
Query: 146 VQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
VQ AS I G L T +++ +G+G+ A P AAAI W+LKDG G + +I+ + + G D
Sbjct: 75 VQAFASTIMGTLTTHSIMQGVGVGEASATPLAAAITWILKDGTGMIGRIVFAWWNGTDLD 134
Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
K WRLFAD+L + A GLE+ P F + + I + A +S + +TR+
Sbjct: 135 GQCKKWRLFADILNDLAMGLELFLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTQHQ 194
Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
A + N A+V AK +Q I +GI + + I + + + + + IH+Y N +
Sbjct: 195 ALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHNGRYIMELYLFLVMIHLYANYMA 253
Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
+++ L +LN R +L+ Y+ + P +EVN +E S LL+
Sbjct: 254 VKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKE----------------SVLLLTK 297
Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKES 441
I+ + L + + + + D L+ D Y+++ + LK+
Sbjct: 298 PTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDRKYLISIDVQNKTIFICLKKD 357
Query: 442 ALPQDMLKSLFQA 454
A P D+L++ F A
Sbjct: 358 AQPYDVLEAYFHA 370
>gi|296233362|ref|XP_002761979.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Callithrix
jacchus]
Length = 468
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 166/337 (49%), Gaps = 31/337 (9%)
Query: 39 VPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFT-----KDAF 93
+P+LL C Q + AR ++ D + WEV+G + L FT +DA
Sbjct: 7 LPALL---CSEQFGSGEARGCRAAADGSLQ-----WEVRGWRWWGLSRAFTVKPEGRDAG 58
Query: 94 VVASASNAS--LSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVAS 151
V + AS LS L +V LP+GFPDSV+ DYL Y LW SVQ AS
Sbjct: 59 EVGAPGAASPPLSGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFAS 104
Query: 152 QISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGW 209
+SG LAT A+L IG+G A +AA W++KD G L +I+ + + G D N K W
Sbjct: 105 SLSGSLATHAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQW 164
Query: 210 RLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNF 269
RLFAD+L + A LE++ P +P F + + + ++ +TR+ A R N
Sbjct: 165 RLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNM 224
Query: 270 AEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIEL 329
A+V AK +Q + G+++ + + + F+ F +T +H+Y N ++ +++ +
Sbjct: 225 ADVSAKDSSQETLVNLAGLLVSLLMLPLVSGFPGFSFGCFFFLTALHIYANYRAVRALVM 284
Query: 330 RTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
TLN R LV YL G+ N EPL+ F
Sbjct: 285 ETLNEGRLRLVLKHYLQRGEVLDPTTANRMEPLWTGF 321
>gi|417401458|gb|JAA47615.1| Hypothetical protein [Desmodus rotundus]
Length = 468
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 87/249 (34%), Positives = 134/249 (53%), Gaps = 2/249 (0%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
+ LP+GFPDSV+ DYL+Y LW SVQ AS +SG LAT A+L IG+G A +AA
Sbjct: 73 LLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVSAAT 132
Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
W++KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F
Sbjct: 133 ATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCFTVT 192
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ + ++ +TR+ A R N A+V AK +Q + G+++ + +
Sbjct: 193 ICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
+ F+L F ++T +H+Y N ++ Q++ + TLN R LV +L G+ N
Sbjct: 253 VSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTSAN 312
Query: 358 DEEPLFPAF 366
EPL+ F
Sbjct: 313 QMEPLWTGF 321
>gi|452823170|gb|EME30183.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
Length = 519
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 174/370 (47%), Gaps = 27/370 (7%)
Query: 91 DAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVA 150
D F+ S + ++ + +W + +L LPE +V+ DY ++ + +Q V
Sbjct: 137 DYFLNPSNGDLLIAPSIDSRTVWRKILDLLNSSFLPENCKKTVSKDYFWFTTFNFLQNVV 196
Query: 151 SQISGVLATQALLYAIGL-GKGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKG 208
+ GV++TQ LL A+GL I +A++NWVLKDGIG L I+ +F G FD +PK
Sbjct: 197 NCCIGVISTQQLLRAVGLSASQGIGKSASLNWVLKDGIGRLGAIVFGSFIGNRFDADPKR 256
Query: 209 WRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRN 268
WRL+ D+L G E+++P P +F+ + + A ++ + +++ T + FA R N
Sbjct: 257 WRLWGDILYAFGIGTEIISPLLPRYFLLVASLANMVKATSYMLRLPTTAAIRRSFAKREN 316
Query: 269 FAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIE 328
F ++ AK +Q ++S G LGI + +G S ++ V + N + +
Sbjct: 317 FGDISAKANSQEVLSNLFGTFLGIGFSIILGHSWWQLSVAYFVYFIMFFLLNYLGVKGLH 376
Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVN-DEEPLFPAFHFFKIKSANKSQLLVLSSEAK 387
LRTLN R+ L+ Y S Q V+EVN E + P+ F
Sbjct: 377 LRTLNLQRSLLLGWSYWKSSQVSTVEEVNYRENVILPSIMSFG----------------- 419
Query: 388 DAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDM 447
++ G L D+ + AL Y Y+LT G+ +V A +D
Sbjct: 420 -------RYIRFGVSLRDIHLTGSELAALLDKYRQAKYLLTVRHGQVYIVFHMEARVKDE 472
Query: 448 LKSLFQASYL 457
L+++ QA++L
Sbjct: 473 LQAILQATFL 482
>gi|74201602|dbj|BAE28428.1| unnamed protein product [Mus musculus]
Length = 466
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 87/245 (35%), Positives = 133/245 (54%), Gaps = 2/245 (0%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
+LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L +G+G A +AA W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
+KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ + +TR+ A R N A+V AK +Q V G+++ + + +
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
+L F ++T +H+Y N ++ +++ L TLN R LV +L G+ N EP
Sbjct: 255 PSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQMEP 314
Query: 362 LFPAF 366
L+ F
Sbjct: 315 LWTGF 319
>gi|322787029|gb|EFZ13253.1| hypothetical protein SINV_10096 [Solenopsis invicta]
Length = 468
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 111/375 (29%), Positives = 183/375 (48%), Gaps = 33/375 (8%)
Query: 89 TKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQG 148
+KD F AS + SL+ L++ +F Q LP+G+PDSV DY Y +W +VQ
Sbjct: 20 SKDGFS-ASETKLSLTESLAIR---SRITSVFKQIFLPQGYPDSVHPDYTAYQIWDTVQA 75
Query: 149 VASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNP 206
AS I+G L T +++ IG+G+ A P AAAI W+LK G G + IM + + G D
Sbjct: 76 FASTITGTLTTHSIMKGIGVGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTKLDGQC 135
Query: 207 KGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAAR 266
K WRLFAD+L++ A G+E+L P F + I + +S + +TR+ A +
Sbjct: 136 KKWRLFADILDDIAKGIELLVPFFSSYSTVILCISTTMKSIVGVAGGATRTALINHQAIQ 195
Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALAN--HIGSSMPFALASFSVVTWIHMYCNLKSY 324
N A+V AK +Q I GI + + H G + L + + +H+Y N +
Sbjct: 196 NNVADVSAKDGSQETCVNLIASFFGIFILSFFHDGR---YLLELYLFLVAVHLYANYSAV 252
Query: 325 QSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSS 384
+++ L TLN R +++ Y+++ Q P +VN EE +F +L++
Sbjct: 253 KALCLDTLNEDRLAIIVKNYMINEQIPKPGKVNKEESIF-----------------LLTN 295
Query: 385 EAKDA-AVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT---EHGGKFCVVLKE 440
++ I+ + S L + + + L +E+ YI+T E F ++LK+
Sbjct: 296 PGRNVHGFNIKIGVSFTSILKRNIISLTEMEFLLKFFENRKYIITIDIEKKNIF-IILKK 354
Query: 441 SALPQDMLKSLFQAS 455
++L+ F AS
Sbjct: 355 DIQSIEILEGYFYAS 369
>gi|225579041|ref|NP_663565.2| UPF0420 protein C16orf58 homolog [Mus musculus]
gi|74204001|dbj|BAE29003.1| unnamed protein product [Mus musculus]
gi|148685697|gb|EDL17644.1| cDNA sequence BC017158, isoform CRA_b [Mus musculus]
Length = 466
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 87/245 (35%), Positives = 133/245 (54%), Gaps = 2/245 (0%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
+LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L +G+G A +AA W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
+KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ + +TR+ A R N A+V AK +Q V G+++ + + +
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
+L F ++T +H+Y N ++ +++ L TLN R LV +L G+ N EP
Sbjct: 255 PSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQMEP 314
Query: 362 LFPAF 366
L+ F
Sbjct: 315 LWTGF 319
>gi|14424560|gb|AAH09308.1| C16orf58 protein [Homo sapiens]
Length = 468
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 104/325 (32%), Positives = 157/325 (48%), Gaps = 18/325 (5%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
C Q + AR ++ D + W G R + +DA V + A + L
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGASGAPSPPL 70
Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
S L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L
Sbjct: 71 SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116
Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L + A
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
LE++ P P F + + + ++ +TR+ A R N A+V AK +Q
Sbjct: 177 FLEIMAPVCPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G+++ + + + F+L F +T +H+Y N ++ +++ + TLN R LV
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
YL G+ N EPL+ F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321
>gi|308807048|ref|XP_003080835.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
tauri]
gi|116059296|emb|CAL55003.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
tauri]
Length = 1023
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 126/429 (29%), Positives = 199/429 (46%), Gaps = 51/429 (11%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
C E +LPE P SVTSDY Y+ +R Q +AS V ATQ +L A+GLG +
Sbjct: 574 RCVERASHALLPE--PTSVTSDYWEYARYRFAQRIASSCISVFATQQMLTAVGLGASRTL 631
Query: 174 PTAAAINWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P AAA+NWVLKDG+G L K+ + +NFGR FD + K +R + ++ +A+ +E++TP FP
Sbjct: 632 PAAAAVNWVLKDGLGRLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYFPQ 691
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
HF+ + A G+S R+ F N AEV AK AQ +++ IG+ L +
Sbjct: 692 HFLPLATVANIGKSVGITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLALAV 751
Query: 293 ALAN----------HIGSSMP-FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
A I ++P FA +V+ I +Y LK+ Q LRT+N RA ++
Sbjct: 752 GAAKGMSKVATVRPEIRRALPLFAFGPLAVLDLICIYKELKAVQ---LRTINKERAEIIA 808
Query: 342 SEYLLSGQAPPVKEV-NDEEPLFPA-----------FHFFKIKSANKSQLLVLSSEAKDA 389
Y+ + P V N E L PA ++ KS + LS + + A
Sbjct: 809 EAYVKEREIPTRARVANAERLLIPARLDKSNLPLKVTSIGEVCPTPKSLIRALSGKGRRA 868
Query: 390 AV-----------EIEHRLQLGSKL---SDVVNNKEDAHALFSLYEDEGYILTEHGGKFC 435
+ + L G ++ VN K+ A F G + + G+
Sbjct: 869 STPPYILAYEPGASKKRVLVDGPRIGAPKTPVNGKKKMSAKFQFPWQRGRVGLK--GRAL 926
Query: 436 VVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHV 495
+ L E A +D+L+++ Q +++ L + +AD A L+ S +R+ +
Sbjct: 927 LALSEDATSKDVLRAVLQVAHMRALPYRPDL-----TADEAYVWALQESESLAKRDIDDF 981
Query: 496 KSDSASVGW 504
++ + GW
Sbjct: 982 TANLSKKGW 990
>gi|357612137|gb|EHJ67828.1| hypothetical protein KGM_02553 [Danaus plexippus]
Length = 525
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 110/375 (29%), Positives = 179/375 (47%), Gaps = 29/375 (7%)
Query: 90 KDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGV 149
K+ + V +A L +L VN+ + LF + LP+G+P+SV+ DY+ Y +W + Q
Sbjct: 19 KERYYVKAADQ--LPIVLVVNEKSRDVAGLFAKIFLPQGYPNSVSKDYIFYQIWDTAQAF 76
Query: 150 ASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPK 207
S I+G+LATQ + +G+G A P AA + WV KDG G++ KI+ + G + D K
Sbjct: 77 CSTITGILATQEVFRGVGVGDTNASPLAATVTWVFKDGCGHIGKILFAYTHGTYLDAYSK 136
Query: 208 GWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARR 267
WRL+AD L +AA +E+ P F ++ F + ++ + +TR A R
Sbjct: 137 KWRLYADTLNDAAMCIEIALPLFKNYITFALCVSTCMKAIVGVAGGATRVAMTQHHALRG 196
Query: 268 NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSI 327
N A+V AK AQ I + L + IG+S+ F ++ +H+ N + +++
Sbjct: 197 NLADVSAKDSAQETAVNLIASFAALFLISLIGNSVTI----FIILLIMHIVFNYMAVRAV 252
Query: 328 ELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAK 387
LRTLN R V YL E+N EP+ F+++ N L +
Sbjct: 253 CLRTLNEPRFLQVIDTYLRKEVIANPCEINRNEPII----FYQL-GPNLLDLKICG---- 303
Query: 388 DAAVEIEHRLQLGSKLSDVVNNKEDA---HALFSLYEDEGYILTEHGGK--FCVVLKESA 442
+ +G +S ++N + +A + + +Y D+ YI+ G V KE A
Sbjct: 304 -------FHIIIGDSISKILNPRTNAVYINKVKDIYNDKKYIIHPDTGNRVMYVFPKEDA 356
Query: 443 LPQDMLKSLFQASYL 457
DML + FQ+ L
Sbjct: 357 SVDDMLCAYFQSVLL 371
>gi|417401377|gb|JAA47577.1| Hypothetical protein [Desmodus rotundus]
Length = 464
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 87/249 (34%), Positives = 134/249 (53%), Gaps = 2/249 (0%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
+ LP+GFPDSV+ DYL+Y LW SVQ AS +SG LAT A+L IG+G A +AA
Sbjct: 73 LLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVSAAT 132
Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
W++KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F
Sbjct: 133 ATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCFTVT 192
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ + ++ +TR+ A R N A+V AK +Q + G+++ + +
Sbjct: 193 ICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
+ F+L F ++T +H+Y N ++ Q++ + TLN R LV +L G+ N
Sbjct: 253 VSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTSAN 312
Query: 358 DEEPLFPAF 366
EPL+ F
Sbjct: 313 QMEPLWTGF 321
>gi|403276814|ref|XP_003930078.1| PREDICTED: UPF0420 protein C16orf58 homolog [Saimiri boliviensis
boliviensis]
Length = 468
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 108/334 (32%), Positives = 162/334 (48%), Gaps = 28/334 (8%)
Query: 42 LLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFT-----KDAFVVA 96
L C Q + AR + D + WEV+G + L FT +DA V
Sbjct: 7 LQTALCSEQFGSGEARGCRVAADGSLQ-----WEVRGWRWWGLSRAFTVKPEGRDAGEVG 61
Query: 97 SASNAS--LSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQIS 154
+ AS +S L +V LP+GFPDSV+ DYL Y LW SVQ AS +S
Sbjct: 62 APGAASPPVSGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLS 107
Query: 155 GVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLF 212
G LATQA+L IG+G A +AA W++KD G L +I+ + + G D N K WRLF
Sbjct: 108 GSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLF 167
Query: 213 ADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV 272
AD+L + A LE++ P +P F + + + ++ +TR+ A R N A+V
Sbjct: 168 ADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADV 227
Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
AK +Q + G+++ + + + F+ F +T +H+Y N ++ +++ + TL
Sbjct: 228 SAKDSSQETLVNLAGLLVSLLMLPLVSGFPGFSFGCFFFLTALHIYANYRAVRALVMETL 287
Query: 333 NPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
N R LV YL G+ N EPL+ F
Sbjct: 288 NEGRLRLVLKHYLQRGEVLDPTTANRMEPLWTGF 321
>gi|297826561|ref|XP_002881163.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
lyrata]
gi|297327002|gb|EFH57422.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 130/228 (57%), Gaps = 1/228 (0%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
E F+ P G+P SV YL Y+ +R++Q +S VL+TQ+LL+A GL + A
Sbjct: 64 ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 122
Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
++W+LKDG+ ++ K++ SN G D PK WR+ AD+L + GLE+++P PH F+ +
Sbjct: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEM 182
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ A + +TR Y+ FA N +++ AKGEA + GI GI LA+
Sbjct: 183 AGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLAST 242
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
I SSM L S+++ +H+Y ++ + + + TLNP R +L+ + YL
Sbjct: 243 ICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANYL 290
>gi|354505561|ref|XP_003514836.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cricetulus griseus]
gi|344256981|gb|EGW13085.1| UPF0420 protein C16orf58-like [Cricetulus griseus]
Length = 466
Score = 154 bits (390), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 87/245 (35%), Positives = 134/245 (54%), Gaps = 2/245 (0%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
+LP+GFPDSV++DYL Y LW SVQ AS +SG LATQA+L +G+G A +AA W+
Sbjct: 75 LLPQGFPDSVSTDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGDAKASVSAATSTWL 134
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
LKD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 135 LKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADVLNDIAMFLEIMAPMYPIFFTMTVSIS 194
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ + +TR+ A R N A+V AK +Q V G+++ + + +
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
+L F ++T +H+Y N ++ +++ L TLN R L+ +L G+ N EP
Sbjct: 255 PSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLLLKHFLQRGEVLAPASANQMEP 314
Query: 362 LFPAF 366
L+ F
Sbjct: 315 LWTGF 319
>gi|348585092|ref|XP_003478306.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cavia porcellus]
Length = 468
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 99/331 (29%), Positives = 160/331 (48%), Gaps = 30/331 (9%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAIN-WV 182
+LP+GFPDSV+ DYL Y LW SVQ AS +S LAT A++ +G+G AAA W+
Sbjct: 77 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSSALATHAVMLGLGVGNAEASLAAATTTWL 136
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
++D G L +I+ + + G D N K WRLFAD+L +AA LE+ P P F + +
Sbjct: 137 VRDSTGMLGRIVFAWWKGSKLDCNVKQWRLFADILNDAAMFLEIAAPTLPFCFTVTVSTS 196
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ ++ +TR+ A R N A+V AK +Q + G++ + + +
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLFSLVMLPLVSGC 256
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
F+L F ++T +H+Y N ++ +++ + TLN R LV +L G+ N EP
Sbjct: 257 PSFSLGCFFLLTGLHIYANYRAVRALVMETLNESRLQLVLKHFLQRGEILDTTSANQMEP 316
Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYE 421
L+ F L LG L D++++ + L ++
Sbjct: 317 LWTGFW-------------------------PSLSLSLGVPLHDLISSVSELQQLVEGHQ 351
Query: 422 DEGYILT--EHGGKFCVVLKESALPQDMLKS 450
E Y+L+ + K VVL A P+ +L++
Sbjct: 352 -EPYLLSWNQSRNKVQVVLSHVAGPEAVLRA 381
>gi|345801540|ref|XP_547054.3| PREDICTED: UPF0420 protein C16orf58 homolog [Canis lupus
familiaris]
Length = 468
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 90/251 (35%), Positives = 136/251 (54%), Gaps = 3/251 (1%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTA 176
L V F LP+GFPDSV+ DYL Y LW SVQ AS +SG LAT A+L IG+G A +A
Sbjct: 72 RLLVVF-LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAEASVSA 130
Query: 177 AAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
A W++KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F
Sbjct: 131 ATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFT 190
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
+ + ++ +TR+ A R N A+V AK +Q + G+++ + +
Sbjct: 191 MTVCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLML 250
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+ + F+L+ F +T +H+Y N ++ +++ L TLN R LV +L G+
Sbjct: 251 PLVSACPSFSLSCFFFLTALHIYANYRAVRALVLETLNEDRLRLVLKHFLQRGEVLDPTS 310
Query: 356 VNDEEPLFPAF 366
N EPL+ F
Sbjct: 311 ANQMEPLWTGF 321
>gi|194388018|dbj|BAG65393.1| unnamed protein product [Homo sapiens]
Length = 422
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/244 (35%), Positives = 132/244 (54%), Gaps = 2/244 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L IG+G A +AA W++
Sbjct: 32 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 91
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KD L +I+ + + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 92 KDSTDMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 151
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ ++ +TR+ A R N A+V AK +Q + G+++ + + +
Sbjct: 152 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 211
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
F+L F +T +H+Y N ++ +++ + TLN R LV YL G+ N EPL
Sbjct: 212 GFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 271
Query: 363 FPAF 366
+ F
Sbjct: 272 WTGF 275
>gi|431902902|gb|ELK09111.1| hypothetical protein PAL_GLEAN10000828 [Pteropus alecto]
Length = 468
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/249 (35%), Positives = 135/249 (54%), Gaps = 2/249 (0%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
+ LP+GFPDSV+ DYL Y LW SVQ AS +SG LAT A+L IG+G A +AA
Sbjct: 73 LLTVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNANASVSAAT 132
Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
W++KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F I
Sbjct: 133 ATWLVKDSAGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPICFTMI 192
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ + ++ +TR+ A R N A+V AK +Q + G+++ + +
Sbjct: 193 VCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
+ F+L F ++T +H+Y N ++ +++ + TLN R LV +L SG+ N
Sbjct: 253 VSECPSFSLGCFFLLTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQSGEVLDPTSAN 312
Query: 358 DEEPLFPAF 366
EPL+ F
Sbjct: 313 KMEPLWTGF 321
>gi|255542032|ref|XP_002512080.1| conserved hypothetical protein [Ricinus communis]
gi|223549260|gb|EEF50749.1| conserved hypothetical protein [Ricinus communis]
Length = 380
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 176/335 (52%), Gaps = 26/335 (7%)
Query: 127 EGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKD 185
+GFP SVT DY+ + +W +QG+++ I +L+TQALL AIG+G K A A W L+D
Sbjct: 2 QGFPSSVTPDYVPFQVWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRD 61
Query: 186 GIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAG 244
G L I+ + + G D N K WRL ADL+ + +++++P FP FV +
Sbjct: 62 LTGMLGGILFTFYQGSDLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVVVVCLGSLS 121
Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPF 304
RS + +TR+ FA + N A++ AK +Q V+ G+ LG+ LA I P
Sbjct: 122 RSFTGVASGATRAALTQHFALQSNAADISAKEGSQETVATMTGMALGMLLAR-ITIGHPL 180
Query: 305 AL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
A+ SF +T HMY N ++ + + L +LN R+S++ ++ +GQ ++V+ E +
Sbjct: 181 AIWFSFLSLTMFHMYANYRAVRCLVLTSLNMQRSSILLQHFMETGQVLSPEQVSGMEHIL 240
Query: 364 PAF-HFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLGSKLSDVVNNKEDAHALFSL-- 419
P + +F++ K V++ H R+ LG ++S ++++E L S
Sbjct: 241 PRWTNFWRTKH-----------------VKLLHSRMCLGVRVSS-LDHQELMELLHSAGS 282
Query: 420 YEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
++ Y+L E G +++ +++ D+L+S A
Sbjct: 283 HKKAKYLLMERKGTINILIHKNSTTSDVLQSFIHA 317
>gi|157113953|ref|XP_001652155.1| hypothetical protein AaeL_AAEL006646 [Aedes aegypti]
gi|108877519|gb|EAT41744.1| AAEL006646-PA [Aedes aegypti]
Length = 417
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/343 (30%), Positives = 163/343 (47%), Gaps = 24/343 (6%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
F LP G+PDSV+ DYL Y W +VQ S ISG L T A+L +G+G A P +A
Sbjct: 48 FFQNLFLPAGYPDSVSGDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPLSA 107
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLT-PAFPHHFV 235
I WVLKDG+G+ +I+ + + G D++ K WR+ AD+L + A +++ P +P
Sbjct: 108 TITWVLKDGMGHFGRIIFAWWKGTELDIDSKKWRIRADVLNDLAMSIDLFVLPYYPKAST 167
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
FI A ++ + +TRS A R N A+V +K AQ I +G+ L
Sbjct: 168 FILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLFLL 227
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
H+ F VT +H+Y N+K+ +++ LRT N R + EY +G ++
Sbjct: 228 THVQDQRVL-YGLFLFVTLMHIYANIKAVKAVCLRTFNEARYLIALEEYFKTGTMLSPEQ 286
Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
VN K++ Q + L++ K E ++ V + E+ A
Sbjct: 287 VN------------KLERVTIGQTVTLTARVKIGCSAREL-----AQYYRVCYDLENLMA 329
Query: 416 LFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQ-ASYL 457
F + +++ E V L +A P D++KS F ASYL
Sbjct: 330 CFD--SRDKFLIAETRNYVGVYLHFTAKPLDIIKSYFYVASYL 370
>gi|334332835|ref|XP_001370394.2| PREDICTED: UPF0420 protein C16orf58 homolog [Monodelphis domestica]
Length = 442
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 90/253 (35%), Positives = 133/253 (52%), Gaps = 2/253 (0%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIP-T 175
R LP GFPDSV+ DYL Y LW +VQ AS ++G LATQA+L +G+G A +
Sbjct: 45 RAALTAVFLPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVLQGVGVGDAAASVS 104
Query: 176 AAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
AA I W+LKDG G L +I + G D K WRLFAD+L +AA LE++ P FP F
Sbjct: 105 AATITWLLKDGTGMLGRIGFAWLMGSRLDCEAKKWRLFADVLNDAAMFLEIVAPGFPSCF 164
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
I + + + +TR+ A R N A+V AK +Q + G+++ + L
Sbjct: 165 TLIVCVSSLAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLVSLLL 224
Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
+ + +LA+F + T +H+Y N ++ +++ + T N R LV +L G+
Sbjct: 225 LPLVSNRPSLSLAAFFLFTILHLYANFRAVRAVIMETFNEARLYLVLKHFLQWGKVLTPP 284
Query: 355 EVNDEEPLFPAFH 367
N EPL+ H
Sbjct: 285 SANRMEPLWTGCH 297
>gi|302835848|ref|XP_002949485.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
nagariensis]
gi|300265312|gb|EFJ49504.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
nagariensis]
Length = 495
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 172/353 (48%), Gaps = 32/353 (9%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGA 172
D R L ++LP GFPDSV Y Y WR VQ V TQ+LL A+G+ G+
Sbjct: 103 DHFRALIHAYLLPNGFPDSVGPQYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRFQ 162
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
AAAINWV+KDG G L +++ + +GR D K +RL DLL A LE+ T P
Sbjct: 163 GEAAAAINWVIKDGAGRLGRLLFARWGRELDCELKQFRLAGDLLMEAGAALELATVYAPP 222
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + A ++ AA+ +STR+ Y FA + N A++ AKGE+ ++ +G + GI
Sbjct: 223 AFLPLACTANLSKNLAAVAASSTRAPIYRTFALQNNLADITAKGESVANLADILGTVAGI 282
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
AL+ MP +F V++ ++ + K S+EL +N R + YL G P
Sbjct: 283 ALSRMKLPRMP----TFCVLSAGYLLSSRKEVDSVELPYMNRARLAYATQRYLSDGFIPG 338
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
V E N EPL P N+++L++ A+VE S E
Sbjct: 339 VAEANHNEPLLP------WGRHNQNRLVL------GASVE-----------SACAGPSEL 375
Query: 413 AHALFSLYEDE-GYILT--EHGGKFCVVLKESALPQDMLK-SLFQASYLYWLE 461
AHA+ E E Y++T K V+L+E A D L+ S F +L+ L+
Sbjct: 376 AHAVQRFPEPEYRYLVTYRPDTKKAYVLLREGATSMDCLQASFFGHVFLHLLD 428
>gi|413956564|gb|AFW89213.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 485
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 123/428 (28%), Positives = 203/428 (47%), Gaps = 48/428 (11%)
Query: 59 ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
A+ E + + V E GS + L ++ A + ASAS+ + S W+
Sbjct: 3 ASHPEGEATAGWLTVEEWSGSSASAL----SRTAVLTASASSLTAHRFGS---RWERIGS 55
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
+ +PEGFP SVT DY+ + +W ++QG+++ I +L+TQALL AIG+G + A A
Sbjct: 56 RMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGA 115
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
W L+D G L I+ + + G + D N K WRL AD + + +++L+P FP +
Sbjct: 116 TFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIV 175
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I RS + +TR+ FA N A++ AK +Q ++ +G+ LG+ LA
Sbjct: 176 IMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLGMLLAQ 235
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
ASF +T HMY N K+ QS+ L TLN RAS++ + G+ ++V
Sbjct: 236 ITRGHALSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVLVPRKV 295
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLGSKLSDVVNN------ 409
+ +E + P++ S+ K +++ H R+ LG+K S + ++
Sbjct: 296 SQQEHILPSW----------------SNWRKLNRIKLPHERVHLGAKASMLTHSDMLVIA 339
Query: 410 KEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVAT 469
K H YE+ Y L + G V + + A P D+L+S G+V
Sbjct: 340 KTRYH-----YENANYFLLDKQGIVYVFIHKEATPADVLRSFVH-----------GLVLA 383
Query: 470 STSADCAP 477
S++ + P
Sbjct: 384 SSTQNSKP 391
>gi|212274801|ref|NP_001130544.1| uncharacterized protein LOC100191643 [Zea mays]
gi|194689438|gb|ACF78803.1| unknown [Zea mays]
gi|223974155|gb|ACN31265.1| unknown [Zea mays]
gi|413956565|gb|AFW89214.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 442
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 122/428 (28%), Positives = 202/428 (47%), Gaps = 48/428 (11%)
Query: 59 ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
A+ E + + V E GS + L ++ A + ASAS+ + W+
Sbjct: 3 ASHPEGEATAGWLTVEEWSGSSASAL----SRTAVLTASASSLTAHRF---GSRWERIGS 55
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
+ +PEGFP SVT DY+ + +W ++QG+++ I +L+TQALL AIG+G + A A
Sbjct: 56 RMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGA 115
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
W L+D G L I+ + + G + D N K WRL AD + + +++L+P FP +
Sbjct: 116 TFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIV 175
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I RS + +TR+ FA N A++ AK +Q ++ +G+ LG+ LA
Sbjct: 176 IMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLGMLLAQ 235
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
ASF +T HMY N K+ QS+ L TLN RAS++ + G+ ++V
Sbjct: 236 ITRGHALSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVLVPRKV 295
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLGSKLSDVVNN------ 409
+ +E + P++ S+ K +++ H R+ LG+K S + ++
Sbjct: 296 SQQEHILPSW----------------SNWRKLNRIKLPHERVHLGAKASMLTHSDMLVIA 339
Query: 410 KEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVAT 469
K H YE+ Y L + G V + + A P D+L+S G+V
Sbjct: 340 KTRYH-----YENANYFLLDKQGIVYVFIHKEATPADVLRSFVH-----------GLVLA 383
Query: 470 STSADCAP 477
S++ + P
Sbjct: 384 SSTQNSKP 391
>gi|291410995|ref|XP_002721793.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 468
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 85/244 (34%), Positives = 133/244 (54%), Gaps = 2/244 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFP+SV+ DYL Y LW SVQ AS +SG LATQA+L +G+G A +AA W++
Sbjct: 78 LPQGFPESVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAKATVSAATATWLV 137
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KD G L +I+ + + G D N K WRLFAD+L + A LE++ P FP F + +
Sbjct: 138 KDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDIAMFLEIMAPVFPVCFTMTVSTSN 197
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ ++ +TR+ A R N A+V AK +Q + G+++ + + +
Sbjct: 198 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 257
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
+L F ++T +H+Y N ++ +++ + TLN R LV +L G+ N EPL
Sbjct: 258 SVSLGCFFLLTALHIYANYRAVRALVMETLNESRLRLVLKHFLQQGEVLDPTPANQMEPL 317
Query: 363 FPAF 366
+ F
Sbjct: 318 WTGF 321
>gi|332030173|gb|EGI69967.1| UPF0420 protein C16orf58-like protein [Acromyrmex echinatior]
Length = 471
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 103/345 (29%), Positives = 172/345 (49%), Gaps = 29/345 (8%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAA 177
+F Q LP+G+P+SV DY Y +W +VQ AS I+G L T +++ IG+G+ A P AA
Sbjct: 52 VFKQIFLPQGYPNSVHPDYTAYQIWDTVQAFASTITGTLTTHSIMKGIGVGESNATPLAA 111
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
AI W+LK G G + IM + + G D K WRLFAD+L++ A G+E+L P F + V
Sbjct: 112 AITWILKSGTGMIGSIMFAWWNGTELDGQCKKWRLFADILDDIAKGIELLVPYFSSYSVI 171
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I + +S + +TR+ A + N A+V AK +Q I GI + +
Sbjct: 172 ILCISTTMKSIVGVAGGATRTALIHHQAIQNNVADVSAKDGSQETCVNLIASFFGIFILS 231
Query: 297 --HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
H G + L + + +H+Y N + +++ L TLN R +++ +Y+++ Q P
Sbjct: 232 LFHDGQ---YVLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAIIVKKYMMNEQIPEPW 288
Query: 355 EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDA-AVEIEHRLQLGSKLSDVVNNKEDA 413
+VN EE +F +L + +D +I+ + + L + + +
Sbjct: 289 KVNKEESVF-----------------LLKNPTRDIHGFDIKIGVSFANILKRNIISFTEM 331
Query: 414 HALFSLYEDEGYIL---TEHGGKFCVVLKESALPQDMLKSLFQAS 455
L +E+ YI+ E F V+LK+ ++L++ F AS
Sbjct: 332 EFLLKFFENRKYIILIDIEKRNIF-VILKKDIQSVEILEAYFYAS 375
>gi|74215416|dbj|BAE41911.1| unnamed protein product [Mus musculus]
Length = 337
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/251 (35%), Positives = 135/251 (53%), Gaps = 5/251 (1%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
+LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L +G+G A +AA W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
+KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ + +TR+ A R N A+V AK +Q V G+++ + + +
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
+L F ++T +H+Y N ++ +++ L TLN R LV +L G+ N EP
Sbjct: 255 PSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQMEP 314
Query: 362 LF---PAFHFF 369
L+ P+ F
Sbjct: 315 LWIGDPSLWFL 325
>gi|195643336|gb|ACG41136.1| hypothetical protein [Zea mays]
Length = 442
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 121/428 (28%), Positives = 201/428 (46%), Gaps = 48/428 (11%)
Query: 59 ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
A+ E + + V E GS + L ++ A + ASAS+ + W+
Sbjct: 3 ASHPEGEATAGWLTVEEWSGSSASAL----SRTAVLTASASSLTAHRF---GSRWERIGS 55
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
+ +PEGFP SVT DY+ + +W ++QG+++ I +L+TQALL AIG+G + A A
Sbjct: 56 RMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGA 115
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
W L+D G L I+ + + G + D N K WRL AD + + +++L+P FP +
Sbjct: 116 TFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIV 175
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I RS + +TR+ FA N ++ AK +Q ++ +G+ LG+ LA
Sbjct: 176 IMCLGSLSRSFTGVASGATRAALTQHFALANNATDISAKEGSQETLATMLGMGLGMLLAQ 235
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
ASF +T HMY N K+ QS+ L TLN RAS++ + G+ ++V
Sbjct: 236 ITRGHALSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVLVPRKV 295
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLGSKLSDVVNN------ 409
+ +E + P++ S+ K +++ H R+ LG+K S + ++
Sbjct: 296 SQQEHILPSW----------------SNWRKLNRIKLPHERVHLGAKASMLTHSDMLVIA 339
Query: 410 KEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVAT 469
K H YE+ Y L + G V + + A P D+L+S G+V
Sbjct: 340 KTRCH-----YENANYFLLDKQGIVYVFIHKEATPADVLRSFVH-----------GLVLA 383
Query: 470 STSADCAP 477
S++ + P
Sbjct: 384 SSTQNSKP 391
>gi|410984724|ref|XP_003998676.1| PREDICTED: UPF0420 protein C16orf58 homolog [Felis catus]
Length = 468
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 86/249 (34%), Positives = 132/249 (53%), Gaps = 2/249 (0%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
+ LP+GFPDSV+ DYL Y LW SVQ AS +SG LAT A+L IG+G A +AA
Sbjct: 73 LIAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVSAAT 132
Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
W++KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F
Sbjct: 133 ATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTIT 192
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ + ++ +TR+ A R N A+V AK +Q + G+++ + +
Sbjct: 193 ICTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
+ F+L F +T +H+Y N ++ +++ + TLN R LV +L G+ N
Sbjct: 253 VSDCPGFSLGCFFFLTALHIYANYRAVRALVIETLNEGRLRLVLKHFLQRGEVLGPTSAN 312
Query: 358 DEEPLFPAF 366
EPL+ F
Sbjct: 313 QMEPLWTGF 321
>gi|149725803|ref|XP_001495510.1| PREDICTED: UPF0420 protein C16orf58 homolog [Equus caballus]
Length = 468
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 104/327 (31%), Positives = 161/327 (49%), Gaps = 22/327 (6%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAV----WEVKGSKRTKLIPDFTKDAFVVASASNA 101
C Q + AR ++ D G+ +++A W G+ K A+AS A
Sbjct: 11 LCSEQYGSGAARGCCAAVD-GSLQWEAWGWRWWGFSGAFTAKPEGPDGGGEGAPATAS-A 68
Query: 102 SLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQA 161
LS LL++ LP+GFPDSV+ DYL Y LW SVQ AS +SG LAT A
Sbjct: 69 PLSRLLAL--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHA 114
Query: 162 LLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENA 219
+L +G+G A +AA W++KD G L +I+ + + G D N K WRLFAD+L +
Sbjct: 115 VLLGLGVGDAKASVSAATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKHWRLFADILNDV 174
Query: 220 AFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
A LE++ P +P F I + + ++ +TR+ A R N A+V AK +Q
Sbjct: 175 AMFLEIMAPVYPICFTMIVCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQ 234
Query: 280 GMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
+ G+++ + + + F+L F + T +H+Y N ++ +++ + TLN R L
Sbjct: 235 ETLVNLAGLLVSLLMLPLVSGCPSFSLGCFFLFTALHIYANYRAVRALVIETLNEGRLWL 294
Query: 340 VFSEYLLSGQAPPVKEVNDEEPLFPAF 366
V +L G+ N EPL+ F
Sbjct: 295 VLKHFLQRGEVLGPTSANRMEPLWTGF 321
>gi|440913323|gb|ELR62787.1| hypothetical protein M91_19503, partial [Bos grunniens mutus]
Length = 437
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 86/244 (35%), Positives = 129/244 (52%), Gaps = 2/244 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFPDSV+ DYL Y LW SVQ AS +SG LAT A+L IG+G A +AA W++
Sbjct: 47 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 106
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KD G L +I+ + + G D N K WRLFAD+L + A LE++ P P F +
Sbjct: 107 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCISN 166
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ + +TR+ A R N A+V AK +Q + G+++ + + + S
Sbjct: 167 LAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPMVSDSP 226
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
+L F +T +H+Y N ++ +++ + TLN R LV +L G+ N EPL
Sbjct: 227 SLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQRGEVLGPTSANQMEPL 286
Query: 363 FPAF 366
+ F
Sbjct: 287 WTGF 290
>gi|301780856|ref|XP_002925848.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ailuropoda
melanoleuca]
gi|281353232|gb|EFB28816.1| hypothetical protein PANDA_015415 [Ailuropoda melanoleuca]
Length = 468
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 86/244 (35%), Positives = 132/244 (54%), Gaps = 2/244 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFPDSV+ DYL Y LW SVQ AS +SG LAT A+L +G+G A +AA W++
Sbjct: 78 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGNAKASVSAATATWLV 137
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F +
Sbjct: 138 KDSTGMLGRILFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTMTVCTSN 197
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ ++ +TR+ A R N A+V AK +Q + G+++ + + + +
Sbjct: 198 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSACP 257
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
F+L F +T +H+Y N ++ +++ L TLN R LV +L G+ N EPL
Sbjct: 258 SFSLGCFFFLTALHIYANYRAVRALVLETLNEGRLRLVLKHFLQRGEVLGPTSANQMEPL 317
Query: 363 FPAF 366
+ F
Sbjct: 318 WTGF 321
>gi|345316269|ref|XP_001510147.2| PREDICTED: UPF0420 protein C16orf58 homolog, partial
[Ornithorhynchus anatinus]
Length = 353
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 86/244 (35%), Positives = 132/244 (54%), Gaps = 8/244 (3%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI----PTAAAIN 180
LP GFPDSV+ DYL+Y LW +VQ AS ++G LATQA+L +G+G G + PT A +
Sbjct: 4 LPRGFPDSVSPDYLSYQLWDTVQAFASSLTGSLATQAVLRGVGVGDGTMTVDSPTGALL- 62
Query: 181 WVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
+K G G L +I + G D + K WRLFAD+L +AA LE++ PAFP +F
Sbjct: 63 --IKHGTGMLGRIGFAWKKGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPAYFTLTVC 120
Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
+ + + +TR+ A R N A+V AK +Q + G+++ + L +
Sbjct: 121 VSSLAKCIVGVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLMLLPLVS 180
Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
L F + T +H+Y N ++ +++ + TLN R LV +L GQ + N++
Sbjct: 181 DRPSLCLPLFLLFTSLHLYANFRAVRAVVMETLNESRLRLVLEHFLREGQVLSLASANEK 240
Query: 360 EPLF 363
EPL+
Sbjct: 241 EPLW 244
>gi|207080128|ref|NP_001128778.1| DKFZP468D197 protein [Pongo abelii]
gi|55727564|emb|CAH90537.1| hypothetical protein [Pongo abelii]
Length = 466
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 89/244 (36%), Positives = 134/244 (54%), Gaps = 4/244 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L IG+G A +AA W++
Sbjct: 78 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 137
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F + +
Sbjct: 138 KDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 197
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ ++ +TR+ A R N A+V AK +Q + +G+++ + + + S
Sbjct: 198 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLVGLLVSLLMLPLV-SGC 256
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
P L F +T +H+Y N ++ +++ + TLN R LV YL G+ N EPL
Sbjct: 257 P-GLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLNPTAANRMEPL 315
Query: 363 FPAF 366
+ F
Sbjct: 316 WTGF 319
>gi|156379270|ref|XP_001631381.1| predicted protein [Nematostella vectensis]
gi|156218420|gb|EDO39318.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 78/241 (32%), Positives = 128/241 (53%), Gaps = 2/241 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
LP+G+P+SV+ DYL Y W +VQ S I+G LATQA+L G+G + A AA + W+L
Sbjct: 6 LPQGYPESVSKDYLTYQFWDTVQAFCSSITGTLATQAMLKGYGVGNESATALAATMTWIL 65
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
+ G G + I+ + G + D N K WRLFAD++ + A +E+++P F +F + +
Sbjct: 66 RSGTGMVGSILFAWMQGSNLDCNAKKWRLFADIMNDLAILVELISPYFKAYFTLLACLSS 125
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+S + STR+ A R N A+V AK +Q + + ++ G+ + +
Sbjct: 126 VAKSIVGVAGGSTRAALTQHQARRDNMADVAAKDGSQETLVNLMALLAGLVITPLVTGHT 185
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
F + T++H+Y N + S+ + TLN R +V +E+L +G EV EP+
Sbjct: 186 LLTWVLFIIFTFLHLYANYCAVSSVVMETLNTTRLQIVTNEFLRTGHVLSPDEVGKMEPV 245
Query: 363 F 363
Sbjct: 246 I 246
>gi|18423070|ref|NP_568713.1| uncharacterized protein [Arabidopsis thaliana]
gi|16604643|gb|AAL24114.1| unknown protein [Arabidopsis thaliana]
gi|23296703|gb|AAN13151.1| unknown protein [Arabidopsis thaliana]
gi|332008476|gb|AED95859.1| uncharacterized protein [Arabidopsis thaliana]
Length = 497
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 117/433 (27%), Positives = 195/433 (45%), Gaps = 44/433 (10%)
Query: 105 SLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLY 164
SL S +DE +++PEGFP SV Y+ Y WR+++ GV TQ LL
Sbjct: 93 SLESPQTPFDEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLN 152
Query: 165 AIGLGKGAIPTAAA-INWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGL 223
++G + + +AA INW+LKDG G + K++ + G+ FD + K R DLL G+
Sbjct: 153 SVGASRNSSASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGV 212
Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
E+ T A PH F+ + AA ++ AA+ STR+ Y FA N +V AKGE G ++
Sbjct: 213 ELATAAVPHLFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 272
Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
+G I I P + +F +++ ++ + + +S+ L TLN R ++
Sbjct: 273 DLMGTGFSIL----ISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVES 328
Query: 344 YLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKL 403
+L +G+ P ++E N +E +F F + + + LG++
Sbjct: 329 FLKTGRVPSLQEGNIQEKIF-TFPWVD-----------------------DRPVMLGARF 364
Query: 404 SDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQASYLYWLE 461
D + A+ ++ E Y++T GK +LK A D+LK+ F A L
Sbjct: 365 KDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALLKHQANSDDILKAAFHAHVLLHFM 424
Query: 462 RNAGIVATSTSADCAPG---------GRLEISLDYVQREFNHVKSDSASVGW-VTDGLIA 511
+ + P R+ S + V + KS +A GW +++ L+
Sbjct: 425 NQSKDGNPRSVEQLDPAFAPTEYELESRIAESCEMVSTSYGVFKSRAAEQGWRMSESLLN 484
Query: 512 RPLPNRIRPGYVE 524
P R R +V+
Sbjct: 485 ---PGRARLCHVK 494
>gi|395514351|ref|XP_003761381.1| PREDICTED: UPF0420 protein C16orf58 homolog [Sarcophilus harrisii]
Length = 455
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 88/241 (36%), Positives = 132/241 (54%), Gaps = 2/241 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIP-TAAAINWVL 183
LP GFPDSV+ DYL Y LW +VQ AS ++G LATQA+L +G+G A +AA I W+L
Sbjct: 66 LPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVLQGVGVGDAAASVSAATITWLL 125
Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KDG G L +I + G D + K WRLFAD+L +AA LE++ PAFP F I +
Sbjct: 126 KDGTGMLGRIGFAWLMGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPSCFTLIVCVSS 185
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ + +TR+ A R N A+V AK +Q + G+++ + L +
Sbjct: 186 LAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLVSLLLLPLVSERP 245
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
+LA+F + T +H+Y N ++ +++ + T N R LV ++ G+ N EPL
Sbjct: 246 RLSLAAFFLFTALHLYANYRAVRAVIMETFNETRLYLVLKHFIQWGEVLTPPSANRMEPL 305
Query: 363 F 363
+
Sbjct: 306 W 306
>gi|344294431|ref|XP_003418921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Loxodonta africana]
Length = 468
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 103/331 (31%), Positives = 161/331 (48%), Gaps = 22/331 (6%)
Query: 42 LLYCFCHLQVATAIARTATSSEDDGNKEYDA----VWEVKGSKRTKLIPDFTKDAFVVAS 97
L C + + AR ++ D G+ +++A W + G+ KL +
Sbjct: 7 LAVTLCSERFGSGTARGCRAAAD-GSLQWEAGAWRWWGLSGAFSVKLAGRDGGRGGTPGT 65
Query: 98 ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVL 157
AS+ LS LL+V LP+GFPDSV+ DYL Y LW SVQ AS +SG L
Sbjct: 66 ASS-PLSGLLAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGAL 110
Query: 158 ATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADL 215
ATQA+L IG+G A +AA W++KD G L +I+ + + G D N K WRLFAD+
Sbjct: 111 ATQAVLLGIGVGDAKASVSAATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADI 170
Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
L + A LE++ P P F + + ++ +TR+ A R N A+V AK
Sbjct: 171 LNDVAMFLEIMAPVLPICFTITVCTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAK 230
Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
+Q + G+++ + + + +L F +T +H+Y N ++ +++ + TLN
Sbjct: 231 DSSQETLVNLAGLLVSLLMLPLVSDCPSVSLGCFFFLTALHIYANYRAVRALVMETLNEG 290
Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
R L+ +L G+ N EPL+ F
Sbjct: 291 RLWLILKHFLQKGEVLSPTSANQMEPLWTGF 321
>gi|307205774|gb|EFN84004.1| UPF0420 protein C16orf58 [Harpegnathos saltator]
Length = 466
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 178/366 (48%), Gaps = 39/366 (10%)
Query: 107 LSVNKLWDECRE------LFVQFM----LPEGFPDSVTSDYLNYSLWRSVQGVASQISGV 156
SVN++ E E FV F+ LP+G+PDSV +DYL Y +W +VQ AS I G
Sbjct: 24 FSVNEVKSESTESQDIYFRFVSFVKEVFLPQGYPDSVRADYLAYQIWDTVQAFASTIMGT 83
Query: 157 LATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFAD 214
L T +++ IG+G+ A P AAAI W+LK G G + IM + + G D K WRLFAD
Sbjct: 84 LTTHSVMQGIGVGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTQLDGQCKKWRLFAD 143
Query: 215 LLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 274
+L + A GLE++ P + V I + +S + +TR+ A R N A+V A
Sbjct: 144 ILNDIAMGLELMVPYLSPYSVPILCTSTIMKSVVGVAGGATRAALTQHQAIRNNLADVSA 203
Query: 275 KGEAQGMVSKAIGIMLGIALAN--HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
K +Q I +GI + + H G + + + + +H+Y N + +++ L TL
Sbjct: 204 KDSSQETCVNLIASFVGIFVLSLFHDGR---YLIELYLFLVAVHLYANYSAVKALCLDTL 260
Query: 333 NPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKD-AAV 391
N R +L+ Y+++ + +++N EE +F +L + +K+
Sbjct: 261 NEDRLALIIKNYMMNERISEPQKINKEESVF-----------------LLGNPSKEICGF 303
Query: 392 EIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT---EHGGKFCVVLKESALPQDML 448
I+ S L V + + L +ED Y++T E F ++LK+ P ++L
Sbjct: 304 NIKIGASFASLLKRNVISSMEMEFLLKFFEDRRYLITIDIETRTIF-IILKKDIKPIEVL 362
Query: 449 KSLFQA 454
+ F A
Sbjct: 363 EGYFYA 368
>gi|387019749|gb|AFJ51992.1| UPF0420 protein C16orf58-like [Crotalus adamanteus]
Length = 437
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 88/271 (32%), Positives = 142/271 (52%), Gaps = 13/271 (4%)
Query: 100 NASLSSLLSVNKLW--DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVL 157
N L+S +K W + +F LP+G+P+SV++DYL Y W ++Q AS I+G L
Sbjct: 27 NGKLNSKTDASKDWHLNSFHRIFKSIFLPQGYPESVSTDYLAYQFWDTIQAFASSITGTL 86
Query: 158 ATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLL 216
ATQA+L +G+G + + AAA+ W+LKDG G L +I + KG LFAD+L
Sbjct: 87 ATQAVLKGVGVGDETSTVAAAAVTWILKDGTGMLGRIAFAW--------SKG--LFADVL 136
Query: 217 ENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKG 276
+ A +E++ PAFP F I +G + + +TR+ A R N A+V AK
Sbjct: 137 NDVAIFMEIMAPAFPACFTLIVCTSGFFKCIVGVAGGATRAALTMHQARRDNMADVSAKD 196
Query: 277 EAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
+Q + G++ + L + ++P A +++ T +H+Y N ++ +++ + TLN R
Sbjct: 197 GSQETLVNLAGLLFSLFLIPLVVDNLPLTYAFYALFTILHLYANYQAVRAVCMETLNRAR 256
Query: 337 ASLVFSEYLLSGQAPPVKEVNDEEPLFPAFH 367
LV YL + +N +EPL F
Sbjct: 257 LHLVLQHYLKREEVLGPAVINPQEPLLLGFR 287
>gi|328778772|ref|XP_394441.3| PREDICTED: UPF0420 protein C16orf58 homolog [Apis mellifera]
Length = 464
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 110/373 (29%), Positives = 179/373 (47%), Gaps = 25/373 (6%)
Query: 90 KDAFVVASASNASLSSLLS----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRS 145
KD F V S S++ LLS + + + LP+G+PDSV DY +Y +W +
Sbjct: 15 KDRFFVKSKDEQSITELLSDTTRTKSFYSGLISIIKEVFLPQGYPDSVHPDYTSYQIWDT 74
Query: 146 VQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
VQ AS I G L T +++ +G+G+ A P AAAI W+LKDG G + +IM + + G D
Sbjct: 75 VQAFASTIMGTLTTHSIMQGVGVGEAAATPLAAAITWILKDGTGMVGRIMFAWWNGTDLD 134
Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
K WRLFAD+L + A GLE+L P F + + I + A +S + +TR+
Sbjct: 135 GQCKKWRLFADILNDLAMGLELLLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTQHQ 194
Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
A + N A+V AK +Q I +GI + + I + + + + + IH+Y N +
Sbjct: 195 ALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHNGQYIMELYLFLVIIHLYANYSA 253
Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
+++ L +LN R +L+ Y+ + P +EVN +E S LL+
Sbjct: 254 VKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKE----------------SVLLLTK 297
Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKES 441
S I+ + L + + + D L+ D Y+++ + LK+
Sbjct: 298 STMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFLDRKYLISIDVQNKTIFICLKKD 357
Query: 442 ALPQDMLKSLFQA 454
A P D+L++ F A
Sbjct: 358 AQPYDVLEAYFHA 370
>gi|297795749|ref|XP_002865759.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
gi|297311594|gb|EFH42018.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 169/356 (47%), Gaps = 31/356 (8%)
Query: 105 SLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLY 164
SL S +DE +++PEGFP SV Y+ Y WR+++ GV TQ LL
Sbjct: 96 SLESPQTPFDEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLN 155
Query: 165 AIGLGKGAIPTAAA-INWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGL 223
++G + + +AA INW+LKDG G + K++ + G+ FD + K R DLL G+
Sbjct: 156 SVGASRNSSASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGV 215
Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
E+ T A PH F+ + AA ++ AA+ STR+ Y FA N +V AKGE G ++
Sbjct: 216 ELATAAVPHLFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 275
Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
+G I I P + +F +++ ++ + + +S+ L TLN R ++
Sbjct: 276 DLMGTGFSIL----ISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVES 331
Query: 344 YLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKL 403
+L +G+ P ++E N +E +F F + + + LG++
Sbjct: 332 FLKTGRVPSLQEGNIQEKIF-TFPWVD-----------------------DRPVMLGARF 367
Query: 404 SDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQASYL 457
D + A+ ++ E Y++T GK +LK A D+LK+ F A L
Sbjct: 368 KDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALLKHQANSDDILKAAFHAHVL 423
>gi|8978258|dbj|BAA98149.1| unnamed protein product [Arabidopsis thaliana]
Length = 407
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 114/435 (26%), Positives = 195/435 (44%), Gaps = 44/435 (10%)
Query: 103 LSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQAL 162
+ S+ L E +++PEGFP SV Y+ Y WR+++ GV TQ L
Sbjct: 1 MCSIFVFGDLLQEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTL 60
Query: 163 LYAIGLGKGAIPTAAA-INWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAF 221
L ++G + + +AA INW+LKDG G + K++ + G+ FD + K R DLL
Sbjct: 61 LNSVGASRNSSASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGA 120
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
G+E+ T A PH F+ + AA ++ AA+ STR+ Y FA N +V AKGE G
Sbjct: 121 GVELATAAVPHLFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN 180
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
++ ++G + I P + +F +++ ++ + + +S+ L TLN R ++
Sbjct: 181 IAD----LMGTGFSILISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAV 236
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGS 401
+L +G+ P ++E N +E +F F + + + LG+
Sbjct: 237 ESFLKTGRVPSLQEGNIQEKIF-TFPWVD-----------------------DRPVMLGA 272
Query: 402 KLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQASYLYW 459
+ D + A+ ++ E Y++T GK +LK A D+LK+ F A L
Sbjct: 273 RFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALLKHQANSDDILKAAFHAHVLLH 332
Query: 460 LERNAGIVATSTSADCAPG---------GRLEISLDYVQREFNHVKSDSASVGW-VTDGL 509
+ + P R+ S + V + KS +A GW +++ L
Sbjct: 333 FMNQSKDGNPRSVEQLDPAFAPTEYELESRIAESCEMVSTSYGVFKSRAAEQGWRMSESL 392
Query: 510 IARPLPNRIRPGYVE 524
+ P R R +V+
Sbjct: 393 LN---PGRARLCHVK 404
>gi|345492163|ref|XP_001602603.2| PREDICTED: UPF0420 protein C16orf58 homolog [Nasonia vitripennis]
Length = 466
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 165/341 (48%), Gaps = 23/341 (6%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
F + LP+GFPDSV DY+ Y +W ++Q AS I+G L T +++ +G+G+ A P AA
Sbjct: 49 FFKEVFLPQGFPDSVHKDYVPYQIWDTLQAFASTINGTLTTHSIMRGVGVGESTATPLAA 108
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
AI W+LKDG G + +I+ + + G + D K WRLFAD+L +AA LE+ P +
Sbjct: 109 AITWILKDGTGMIGRIIFAWWQGNNLDSQCKKWRLFADILNDAAMALEVTVPYISTFSSY 168
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
++ + +TR+ A + N A+V AK +QG + G ++GI +
Sbjct: 169 TLCLTTGMKAIVGIAGGATRTAIMQHHAIKDNMADVSAKEHSQGTLVNLAGSIVGILILL 228
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
I + V+ +H+ N + S+EL TLN R L+ +Y+L+ V+++
Sbjct: 229 LINEKFFIYVCLLLVI--VHIVSNYFAVTSLELNTLNEDRLCLIIEDYILNHTMSNVEDI 286
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDA-AVEIEHRLQLGSKLSDVVNNKEDAHA 415
N E S +++L AK +IE + S L + N +D
Sbjct: 287 NSRE----------------SVIILLEKPAKKIFGFDIEIGVSFESVLKMNLINTKDVRF 330
Query: 416 LFSLYEDEGY--ILTEHGGKFCVVLKESALPQDMLKSLFQA 454
L +L++ Y ++ K ++L + A +LK+ F
Sbjct: 331 LLALFQRRKYLPVMDLKAKKIHIILSKDADDDIILKAYFHC 371
>gi|412993492|emb|CCO14003.1| predicted protein [Bathycoccus prasinos]
Length = 584
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 92/250 (36%), Positives = 135/250 (54%), Gaps = 15/250 (6%)
Query: 129 FPD--SVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKD 185
FP SVTSDY +Y+ +R Q +AS VLATQ +L AIGLG +IP AAA+NWVLKD
Sbjct: 173 FPQKTSVTSDYWDYAKYRFTQRIASSCMTVLATQQMLAAIGLGASRSIPAAAALNWVLKD 232
Query: 186 GIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAG 244
G+G L K+ + +NFGR FD + K R + L+ A +E +TP FP HF+ I A G
Sbjct: 233 GLGRLGKLSVATNFGRSFDSDVKRLRFSSSLVYTGAVFIETITPFFPKHFLAIATVANVG 292
Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH------- 297
+S R FA N E+ AK AQ +++ IG+ LG+++
Sbjct: 293 KSIGITTGNVVRPPIQKTFALEENLGEIAAKTSAQQVLADNIGLALGVSVMKFQTKFVSL 352
Query: 298 -IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-APPVKE 355
+ ++P AL F ++ I + K +S++LRT+N RA ++ ++ + A ++
Sbjct: 353 ALQRAVPLAL--FPILASIDLVSIHKQLKSVQLRTVNKERAEIIAEAFVREKKLATQLQV 410
Query: 356 VNDEEPLFPA 365
E LFPA
Sbjct: 411 AEKERLLFPA 420
>gi|195115010|ref|XP_002002060.1| GI17175 [Drosophila mojavensis]
gi|193912635|gb|EDW11502.1| GI17175 [Drosophila mojavensis]
Length = 417
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 168/337 (49%), Gaps = 24/337 (7%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAINWVL 183
LP+G+PDSV+ DY Y +W +VQ S I G L T A+L IG+G + +A + W+L
Sbjct: 54 LPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNAYSATVTWIL 113
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIGAAA 241
K+G G++ +I+ + + G DV+ K WRL AD L + A G+E+ + P +PH I +
Sbjct: 114 KEGSGHMGRIIFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCGS 173
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
++ + +TR+ A R N A+V +K +Q I +G+ L I S
Sbjct: 174 TLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFMGLYLLTMIKSQ 233
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
+ VVT +H+Y NLK+ +++ LRT N R + E+ S Q ++VN
Sbjct: 234 AMLYTVFYIVVT-LHLYANLKAVRAVCLRTFNESRYLITLEEFFRSSQMLTPQQVN---- 288
Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYE 421
K++ Q++ +S + + + ++H + K SDV+ ED + F +E
Sbjct: 289 --------KLERVTLGQMVSVSLKIR-LGLSVKHLID-EYKSSDVI---EDIVSSFDPHE 335
Query: 422 DEGYILTEHGGKFCVVLKESALPQDMLKS-LFQASYL 457
+I+ E V L PQD+LK+ F SYL
Sbjct: 336 H--FIIAESKKFLGVYLHFDTRPQDVLKAYFFGVSYL 370
>gi|443714847|gb|ELU07084.1| hypothetical protein CAPTEDRAFT_141059, partial [Capitella teleta]
Length = 266
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 85/255 (33%), Positives = 135/255 (52%), Gaps = 4/255 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
LP+GFP SV+ DY+ Y +W +VQ AS I+ LATQA+L +G+G + A AA + W++
Sbjct: 7 LPKGFPASVSDDYVQYQIWDTVQAFASSITNTLATQAVLKGVGVGDEQATVLAATLTWLM 66
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KDG G L +I+ + G + D + K WRLFAD+L + + +++ P F +F I AG
Sbjct: 67 KDGTGMLGRILFTWIQGSYLDCDCKRWRLFADILNDVSILMDICAPFFRVYFTLIVCVAG 126
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
RS + +TR+ A R N A+V AK +Q + ++ +AL +
Sbjct: 127 VCRSIVGVAGGATRAAVTQHQACRSNMADVSAKDGSQETLVNLAALLCNLALVPMVSDRQ 186
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL--SGQAPPVKEVNDEE 360
F+ T +H+Y N K+ +S+++ T N R L E++ +G P VK VN E
Sbjct: 187 WLVWILFTCFTTLHIYANFKAVRSLDMPTFNQVRLHLAAQEFISDNTGNFPGVKTVNQRE 246
Query: 361 PLFPAFHFFKIKSAN 375
P+ + + S +
Sbjct: 247 PVLCSESLLSVPSTS 261
>gi|198431163|ref|XP_002119233.1| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Ciona
intestinalis]
Length = 414
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 168/356 (47%), Gaps = 41/356 (11%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
++F LP+G+P+SV+ DYL+Y +W +VQ S I+G LAT A+L G+G A+
Sbjct: 47 QIFKDIFLPKGYPESVSEDYLSYQIWDTVQAFCSSITGTLATHAVLKGSGVGDETANVAS 106
Query: 178 A-INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-HF 234
A I W+LKDG G L +I + F G D N K WRLFAD++ + A ++++ P P +F
Sbjct: 107 ATITWLLKDGTGMLGRIAFAYFKGSSLDCNAKQWRLFADVMNDCAIFIQLVAPVLPKAYF 166
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+ AG S + TR+ A N A+V AK +Q + ++ G+ +
Sbjct: 167 TLVMCLAGLAFSLVGVAGGCTRAALTMHQAKCNNMADVSAKDGSQETLVNLAALLTGLVI 226
Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
+ +++P F + T++H++ N K+ S+ + N R S V + + K
Sbjct: 227 MPIVATNIPLMWLLFLIFTFLHVFANYKAVASVVMELFNRNRLSKVIKVFCDMNKVLTPK 286
Query: 355 EVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
E N++E +F P++ + L LG KL D+ +
Sbjct: 287 EANEKEAIFFPPSYSW----------------------------LNLGCKLEDLGLTSSE 318
Query: 413 AHALFSLYEDEG--YIL--TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNA 464
H +D G Y++ + G+ V L S D+L++ ++ SYL + R +
Sbjct: 319 FH---QKVDDGGVRYLIKRNQKTGQIMVSLNHSCTASDILEAFYE-SYLLHINRTS 370
>gi|328872765|gb|EGG21132.1| DUF647 family protein [Dictyostelium fasciculatum]
Length = 534
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 84/264 (31%), Positives = 138/264 (52%), Gaps = 10/264 (3%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAI 173
+ ++F + LP+G+P SVT DY Y W +Q + VL+T ALL ++G+G ++
Sbjct: 110 KLTKMFQKLFLPKGYPQSVTHDYSQYMKWMFLQNTLGSTTYVLSTHALLTSVGVGLSTSL 169
Query: 174 PTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P AAAI+WVLKDG+G + ++ S + D N K ++ D+L N LEMLTP P
Sbjct: 170 PFAAAISWVLKDGLGASALVLFASKYSTSLDYNLKSFKFRGDILHNVGVMLEMLTPFVPG 229
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
+F+ + + + + A LI +TR+ F+ + N ++ AK ++Q M S G+ LG
Sbjct: 230 YFLPLASVSNLSKGLAGLIYGATRASLNRSFSLQENLGDITAKYQSQSMASYLTGMGLGT 289
Query: 293 ALANHIGSSMPFA-LASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
+ + A L + +++ +HM C++KS +SI L+TLN R+ ++ +L
Sbjct: 290 TIGLLMSPMTTTANLCAVFIISLLHMSCSVKSLKSICLKTLNSQRSKIIIDHWLREKIIL 349
Query: 352 PVKEVNDEEPLF-------PAFHF 368
+ VN E P HF
Sbjct: 350 SPEIVNKNENFMFSSSTDTPKIHF 373
>gi|260815171|ref|XP_002602347.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
gi|229287656|gb|EEN58359.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
Length = 244
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 82/240 (34%), Positives = 132/240 (55%), Gaps = 2/240 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
LP+G+P+SV+ DYL+Y +W +VQ S I+G LATQA+L +G+G + A A I W+L
Sbjct: 5 LPQGYPESVSDDYLSYQIWDTVQAFCSSITGTLATQAVLKGVGVGDETATAAGATITWIL 64
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KDG G L +I+ + G D + K WRLFAD++ + + LE++ PAFP F I AG
Sbjct: 65 KDGTGMLGRILFAWLQGTSLDCDAKRWRLFADIVNDFSIFLELVAPAFPLFFTGIVCVAG 124
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
G++ + +TR+ A R N A+V AK +Q + + ++ G+ + + +
Sbjct: 125 VGKAIVGVSGGATRAALTMHQARRDNMADVSAKDGSQETLVNLMALLAGLVVTPMVADNP 184
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
F V T +H++ N + + + TLN R ++ S YL +G + N +EP+
Sbjct: 185 LLTWFLFFVFTCLHLFANYNAVTCVVMETLNQSRLHILVSHYLRTGVVLNPAQANVQEPV 244
>gi|380030107|ref|XP_003698699.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
[Apis florea]
Length = 464
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/373 (29%), Positives = 178/373 (47%), Gaps = 25/373 (6%)
Query: 90 KDAFVVASASNASLSSLLS----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRS 145
KD F V S S++ LLS + + + LP+GFPDSV DY +Y +W +
Sbjct: 15 KDRFFVKSKDEQSITELLSDTTRTKSFYSGLISIIKEVFLPQGFPDSVHPDYTSYQVWDT 74
Query: 146 VQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
VQ AS I G L T +++ +G+G+ A P AAAI W+LKDG G + +IM + + G D
Sbjct: 75 VQAFASTIMGTLTTHSIMQGVGVGEAAATPLAAAITWILKDGTGMVGRIMFAWWNGTDLD 134
Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
K WRLFAD+L + A GLE+L P F + + I + A +S + +TR+
Sbjct: 135 GQCKKWRLFADILNDLAMGLELLLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTNIK 194
Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
A + N A+V AK +Q I +GI + + I + + + + + IH+Y N +
Sbjct: 195 ALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHNGQYIMELYLFLVIIHLYANYSA 253
Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
+++ L +LN R +L+ Y+ + P +EVN +E S LL+
Sbjct: 254 VKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKE----------------SVLLLTK 297
Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKES 441
I+ + L + + + D L+ D Y+++ + LK+
Sbjct: 298 PTMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFLDRKYLISIDVQNKTIFICLKKD 357
Query: 442 ALPQDMLKSLFQA 454
A P D+L++ F A
Sbjct: 358 AQPYDVLEAYFHA 370
>gi|145349487|ref|XP_001419164.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579395|gb|ABO97457.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 269
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 89/261 (34%), Positives = 138/261 (52%), Gaps = 16/261 (6%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
C E +LPE P SVT+DY Y+ +R Q +AS V ATQ +L A+GLG +
Sbjct: 11 RCVERAQHALLPE--PSSVTNDYWEYAKFRFAQRIASSCISVFATQQMLTAVGLGASRTL 68
Query: 174 PTAAAINWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P AAA+NWVLKDG+G L K+ + +NFGR FD + K +R + ++ +A+ +E++TP FP
Sbjct: 69 PAAAAVNWVLKDGLGRLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYFPQ 128
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
HF+ + A G+S R+ F N AEV AK AQ +++ IG+ L +
Sbjct: 129 HFLPLATVANIGKSVGITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLALAV 188
Query: 293 ALAN----------HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFS 342
A I ++P + +F + + + C K ++++LRT+N RA ++
Sbjct: 189 GAAKAVSKVATVRPEIRRALP--VFAFGPLAALDLICIYKELKAVQLRTINKERAEIIAE 246
Query: 343 EYLLSGQAPPVKEVNDEEPLF 363
++ + P V D E LF
Sbjct: 247 AFVKEREIPSRAHVADAERLF 267
>gi|66818028|ref|XP_642707.1| DUF647 family protein [Dictyostelium discoideum AX4]
gi|74861464|sp|Q86K80.1|U420_DICDI RecName: Full=UPF0420 protein
gi|60470803|gb|EAL68775.1| DUF647 family protein [Dictyostelium discoideum AX4]
Length = 527
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/260 (29%), Positives = 138/260 (53%), Gaps = 6/260 (2%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA 178
+ + LP G+PDSVT+DY Y W S+Q + S I+G LAT+A+L G+G + A+A
Sbjct: 58 VLCELFLPNGYPDSVTTDYFGYQFWDSIQALCSTITGTLATRAILKGYGVGDSSATVASA 117
Query: 179 -INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH-FV 235
W+++DG+G + +I+ + G D N K WR AD+L N EM++P F F+
Sbjct: 118 TTQWLIRDGMGMIGRIVFAWRKGTDLDCNSKKWRYTADILNNIGMAFEMISPLFSSQLFL 177
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
+ +S + T++ FA R N A+V AK +Q +G++L + ++
Sbjct: 178 PLSCIGLIAKSICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAVNLVGMLLSVIVS 237
Query: 296 NHI--GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS-GQAPP 352
+ I +S+ F T +H++CN ++ +++L+++N YRA L++ ++ + G P
Sbjct: 238 SFINDNTSLIVTWLVFLFFTSLHLFCNYRAVSAVQLKSINRYRAYLIYDYFIHNQGSIPS 297
Query: 353 VKEVNDEEPLFPAFHFFKIK 372
E++ E + + I+
Sbjct: 298 PSEISKLENILFSIKELDIR 317
>gi|357133639|ref|XP_003568431.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Brachypodium
distachyon]
Length = 508
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 99/348 (28%), Positives = 164/348 (47%), Gaps = 47/348 (13%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
+E +++PEGFPDSVT Y+ Y WR+++ V T+ALL ++G+ +
Sbjct: 121 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQS-- 178
Query: 174 PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
+DG G + K++ + G+ FD + K R DLL G+E+ T AFPH
Sbjct: 179 ----------RDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHL 228
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 229 FLPLACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGTG 284
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
++ I P +ASF+ ++ ++ + + +S+ L TLN R ++ ++ +G P +
Sbjct: 285 MSILISKRNPSLVASFAFLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSL 344
Query: 354 KEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 411
KE N EE +F P H + +GS+ +
Sbjct: 345 KEGNLEETIFNPPWRH---------------------------QPVAIGSRFGEAFQEPA 377
Query: 412 DAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
+ L+EDE YI+T + K V +LK+ A P D+LK+ F A L
Sbjct: 378 SFVSTSPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 425
>gi|449017769|dbj|BAM81171.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 827
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 177/362 (48%), Gaps = 28/362 (7%)
Query: 120 FVQFMLPEGF------PDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGA- 172
V +LPEGF P V+ DYL Y++ +Q + + V+ATQ++L AIG+ K A
Sbjct: 334 LVGSLLPEGFGDKTRFPPVVSDDYLPYTVLHFMQNLVHAAASVIATQSMLIAIGVAKQAT 393
Query: 173 IPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
I +AA+ WVLKDG+G ++ I+ S G +D +PK WR+ D+L G E+ T P
Sbjct: 394 IGVSAAVQWVLKDGLGRIAAIIFGSLLGNRYDADPKRWRMRGDMLYTLGVGTEIATRLVP 453
Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
+F+ +G A A +S + +++ T + F+ R NF ++ AK +Q ++S +G +G
Sbjct: 454 QYFLLVGTMANAIKSVSYMMRLPTAAAIRRHFSLRENFGDLSAKANSQEVLSMLLGTFVG 513
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG--Q 349
I L+ I +S+ A + + + + N S + + +RTLN R +V Y SG +
Sbjct: 514 IGLSYMIDNSLKALFAFYIIYVQMFILFNYWSLRVLRMRTLNLQRTLIVMRRYWNSGGVE 573
Query: 350 APPVKEVNDEEP-LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE-----HRLQLGSKL 403
V N E L P+ ++ A + + EA E E RL G L
Sbjct: 574 TCSVAMANQAENFLLPS----RLGPARRVRFGCRLGEAFRTVGEFEEAIARQRLWRGQYL 629
Query: 404 --SDVVNNKEDAHALFSLYE----DEGYIL--TEHGGKFCVVLKESALPQDMLKSLFQAS 455
+DV +A A F+ DE Y+L G+ VVL A D L++L A+
Sbjct: 630 QQTDVSKRNGNADADFTSVPDDVLDERYLLGVDAKRGRVFVVLHRDAKVADELQALMHAT 689
Query: 456 YL 457
YL
Sbjct: 690 YL 691
>gi|328869935|gb|EGG18310.1| DUF647 family protein [Dictyostelium fasciculatum]
Length = 407
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 83/257 (32%), Positives = 139/257 (54%), Gaps = 11/257 (4%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTA 176
E+ LP+G+P+SVTSDY Y W ++Q + S ++GVLAT+A+L G+G A +
Sbjct: 43 EVLRDMFLPKGYPNSVTSDYTGYQTWDTLQALCSSLTGVLATRAILKGYGVGNLNASVAS 102
Query: 177 AAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHF 234
A+I W+ +D G L +I+ + +G D N K WR ADLL + LEM++P F P +F
Sbjct: 103 ASIQWITRDAAGMLGRIIFAWKWGTDLDCNSKTWRFAADLLNDFGMTLEMISPLFGPSYF 162
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+ I +S + T++ FA R N A+V AK +Q +G++LG+ +
Sbjct: 163 LPISCLGLISKSICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAIALVGMLLGMVM 222
Query: 295 ANHI---GSSMPFALAS---FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL-- 346
A I S+ + F + T IH+ CN ++ ++L ++N YRA+L++ +
Sbjct: 223 AASIPDDDDSIQVYFKTWLFFIIFTVIHLLCNYRAVSLVQLTSINRYRANLIYQHFNENN 282
Query: 347 SGQAPPVKEVNDEEPLF 363
+ Q P K+++ +E +
Sbjct: 283 NNQIPTPKQISLKENIL 299
>gi|312082381|ref|XP_003143421.1| hypothetical protein LOAG_07840 [Loa loa]
gi|307761414|gb|EFO20648.1| hypothetical protein LOAG_07840 [Loa loa]
Length = 393
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/277 (32%), Positives = 147/277 (53%), Gaps = 8/277 (2%)
Query: 89 TKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQG 148
T DA+ S NA + L SV W + LF +P G+P SV+ DY+NY +W ++Q
Sbjct: 17 TVDAYTTNSKYNA-FTRLESV-AYW---KSLFRDIFMPRGYPQSVSPDYMNYQIWDTIQA 71
Query: 149 VASQISGVLATQALLYAIGLGKGAIPT-AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNP 206
AS +S LAT+A+L +G+G T AAA+ W+L+DGIG L++I+ + F + D +
Sbjct: 72 FASSMSSALATEAILRGVGVGNETASTMAAAVAWLLRDGIGMLTRILFAWFYSPYLDADC 131
Query: 207 KGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAAR 266
K WRL AD + AF L+++TP FP+ F+ I + R + +TR+ A
Sbjct: 132 KQWRLIADCFNDLAFCLDLITPIFPNLFMPIVCLSSMVRGVVGVAGGATRTAVVNHQAIL 191
Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQS 326
N +V AK +Q + + + L + ++ L F + T+IH+Y N ++ +S
Sbjct: 192 DNVGDVAAKDGSQETLINIFALFCSLLLLPIVNGNICVWLL-FCIFTFIHLYGNYRAVKS 250
Query: 327 IELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
++ RTLN ++ Y + + V +VN++E +
Sbjct: 251 LQFRTLNQSLLQIIVKNYAETKKIDTVSKVNNKESML 287
>gi|42567566|ref|NP_195771.2| uncharacterized protein [Arabidopsis thaliana]
gi|209863158|gb|ACI88737.1| At5g01510 [Arabidopsis thaliana]
gi|332002971|gb|AED90354.1| uncharacterized protein [Arabidopsis thaliana]
Length = 509
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 175/371 (47%), Gaps = 45/371 (12%)
Query: 110 NKLW--DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIG 167
N LW D R+ F+ P GFP SV+ DYL+Y LW+ + I VL T +LL A+G
Sbjct: 107 NILWLPDVVRD----FVFPSGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVG 162
Query: 168 LGK--------GAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLEN 218
+G A +AAAI WV KDGIG L ++++ FG FD +PK WR++AD + +
Sbjct: 163 VGSFSGTSAAATAAASAAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGS 222
Query: 219 AAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEA 278
A ++ T +P F+ + + ++ A ++ + FA N EV AK E
Sbjct: 223 AGSFFDLATQLYPSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEV 282
Query: 279 QGMVSKAIGIMLGIALANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
+ ++ IG+ GI + + G S PF L +++ + +H++ +S ++ T+N R
Sbjct: 283 WEVAAQLIGLGFGILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKR 342
Query: 337 ASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHR 396
A ++ +++ P + N E + F K + V +E
Sbjct: 343 ARIIVESHVVHSVVPGYVDCNKRENILLWQRFMKPRII--------------FGVSLE-- 386
Query: 397 LQLGSKLSDVVNNKEDAHALFSLYEDEGYILT----EHGGKFCVVLKESALPQDMLKSLF 452
+LS + + AL +Y E YILT +F V K +A +D+L+ L+
Sbjct: 387 -----ELSGLEKSVSKVKALLKMYTKEKYILTLNKLNKDTEFSVSFKVNATSRDVLRCLW 441
Query: 453 QASYLYWLERN 463
QA YWLE N
Sbjct: 442 QA---YWLEEN 449
>gi|195035521|ref|XP_001989226.1| GH11606 [Drosophila grimshawi]
gi|193905226|gb|EDW04093.1| GH11606 [Drosophila grimshawi]
Length = 417
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 165/340 (48%), Gaps = 30/340 (8%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAINWVL 183
LP+G+PDSV+ DY+ Y +W ++Q S I G L T A+L IG+G I +A + W+L
Sbjct: 54 LPKGYPDSVSDDYVVYQIWDTLQAFCSTICGTLCTHAILKGIGVGSENINAYSATVTWIL 113
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIGAAA 241
K+G G++ +I+ + + G DV+ K WRL AD L + A G+E+ + P +PH I +
Sbjct: 114 KEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCGS 173
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS- 300
++ + +TRS A R N A+V +K +Q + +G+ L I S
Sbjct: 174 TMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLTMIKSQ 233
Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
+M + + F +V +H+Y NLK+ +SI LRT N R + E+ S ++VN E
Sbjct: 234 AMLYTI--FYIVVTLHLYANLKAVRSICLRTFNESRYLIALEEFFRSSHMLTPQQVNKME 291
Query: 361 PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK-LSDVVNNKEDAHALFSL 419
+ + + +V + RL L K L D + + S
Sbjct: 292 RV---------------------TVGQTVSVSLNIRLGLSVKHLIDEYKSSSVIENIVSS 330
Query: 420 YE-DEGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
++ E +I+ E V L PQD+LK+ F A SYL
Sbjct: 331 FDPHERFIIAESKKHLGVYLHFDTRPQDVLKAYFFAVSYL 370
>gi|330802843|ref|XP_003289422.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
gi|325080505|gb|EGC34058.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
Length = 552
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 82/273 (30%), Positives = 138/273 (50%), Gaps = 6/273 (2%)
Query: 106 LLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYA 165
LL + + + LP G+PDSVTSDY Y W S+Q + S I+G LAT+A+L
Sbjct: 42 LLETKRTKSSLQNILYDLFLPSGYPDSVTSDYFRYQFWDSIQALCSSITGTLATRAILKG 101
Query: 166 IGLGKGAIPTAAA-INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGL 223
G+G + A+A WV++ G + +I+ + G D N K WR AD+L +
Sbjct: 102 YGVGNSSATVASATTQWVIQSATGMIGRIIFAWANGTDLDCNSKKWRFMADILNDIGMTF 161
Query: 224 EMLTPAFPHHFVFIGAAAGA-GRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMV 282
EM++P F + G +S + T++ FA N A+V AK +Q
Sbjct: 162 EMISPLFNEALFLPLSCIGLFAKSICGVAGGCTKASLTQHFAKNDNLADVSAKDGSQETA 221
Query: 283 SKAIGIMLGIALANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
+G++L + +++ + +S+ F + T H+YCN K+ +++L ++N YRA L+
Sbjct: 222 VSLLGMILSVFVSSVLNDDTSILVTWFIFFIFTGFHLYCNYKAVSAVQLNSINKYRAHLI 281
Query: 341 FSEYLLS-GQAPPVKEVNDEEPLFPAFHFFKIK 372
+ Y+ + G P KE+++ E + FKI+
Sbjct: 282 YEYYIRNQGSIPSPKEISNFENILFNSKDFKIR 314
>gi|426254633|ref|XP_004020981.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ovis aries]
Length = 462
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 86/244 (35%), Positives = 129/244 (52%), Gaps = 2/244 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFPDSV+ DYL Y LW SVQ AS +SG LAT A+L IG+G A +AA W++
Sbjct: 72 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 131
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KD G L +I+ + + G D N K WRLFAD+L + A LE++ P P F +
Sbjct: 132 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTVTVCISN 191
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ + +TR+ A R N A+V AK +Q + G+++ + + + S
Sbjct: 192 LAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPLVSDSP 251
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
+L F +T +H+Y N ++ +++ + TLN R LV +L G+ N EPL
Sbjct: 252 SLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQWGEVLGPTSANRMEPL 311
Query: 363 FPAF 366
+ F
Sbjct: 312 WTGF 315
>gi|297810247|ref|XP_002873007.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
lyrata]
gi|297318844|gb|EFH49266.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
lyrata]
Length = 510
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 172/370 (46%), Gaps = 41/370 (11%)
Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
N LW ++ F+ P GFP SV+ DYL+Y LW+ V I VL T +LL A+G+
Sbjct: 107 TNTLW--LPDVVKDFVFPTGFPASVSDDYLDYMLWQFPTNVTGWICNVLVTSSLLKAVGV 164
Query: 169 GK--------GAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENA 219
G A +AAAI WV KDGIG L ++++ FG FD +PK WR++AD + +A
Sbjct: 165 GSFSGTSAAATAAASAAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSA 224
Query: 220 AFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
++ T +P F+ + + ++ A ++ + FA N EV AK E
Sbjct: 225 GSFFDLATQLYPSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVW 284
Query: 280 GMVSKAIGIMLGIALANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRA 337
+ ++ IG+ GI + + G S PF +++ V +H++ +S ++ T+N RA
Sbjct: 285 EVAAQLIGLGFGILIIDTPGLVKSFPFVSLTWTSVRLVHLWLRYQSLAVLQFNTVNLKRA 344
Query: 338 SLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRL 397
++ +++ P + N E + F K + V +E
Sbjct: 345 RIIVESHVVHSVVPGYVDCNKRENILLGQRFMKPRII--------------FGVSLEEVS 390
Query: 398 QLGSKLSDVVNNKEDAHALFSLYEDEGYILT----EHGGKFCVVLKESALPQDMLKSLFQ 453
L +S V AL +Y E YILT +F V K +A +D+L+ L+Q
Sbjct: 391 GLEKSVSKV-------KALLKMYTKEKYILTLNKLNSDTEFSVSFKVNATSRDVLRCLWQ 443
Query: 454 ASYLYWLERN 463
A YWLE N
Sbjct: 444 A---YWLEEN 450
>gi|195387984|ref|XP_002052672.1| GJ17681 [Drosophila virilis]
gi|194149129|gb|EDW64827.1| GJ17681 [Drosophila virilis]
Length = 417
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 165/337 (48%), Gaps = 24/337 (7%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAINWVL 183
LP+G+PDSV+ DY Y +W +VQ S I G L T A+L IG+G + +A + W+L
Sbjct: 54 LPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNAYSATVTWIL 113
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIGAAA 241
K+G G+L +I+ + + G DV+ K WRL AD L + A G+E+ + P +PH I +
Sbjct: 114 KEGSGHLGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCGS 173
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
++ + +TR+ A R N A+V +K +Q I +G+ L I S
Sbjct: 174 TMLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFVGLYLLTMIKSQ 233
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
+ VVT +H+Y NLK+ +++ LRT N R + E+ S ++VN
Sbjct: 234 AMLYTVFYIVVT-LHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSHMLTPQQVN---- 288
Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYE 421
K++ Q++ +S + + ++H + K S+V+ N + S
Sbjct: 289 --------KLERVTVGQMVSVSLNIR-LGLSVKHLID-EYKSSNVIEN-----IVSSFDP 333
Query: 422 DEGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
E +I+ E V L PQD+LK+ F A SYL
Sbjct: 334 HERFIIAESKKYLGVYLHFDTRPQDVLKAYFFAVSYL 370
>gi|189241392|ref|XP_968556.2| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Tribolium
castaneum]
Length = 472
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 95/343 (27%), Positives = 161/343 (46%), Gaps = 22/343 (6%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
F + +LP G+PDSV+ DY Y LW +VQ S I+G+ TQA+L +G+G A +A
Sbjct: 51 FFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTTQAILKGVGVGSAEASALSA 110
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
I W++KDG G + +I+ + + G+ D N K WRLFADLL + A LE+ P H
Sbjct: 111 TITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLNDGAMILELSIPHVTHLSTE 170
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I + ++ + +TR+ A + N AE+ AK +Q + I GI L N
Sbjct: 171 ILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDGSQETLVNLIASFTGIFLIN 230
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+ + + +F ++ ++H+Y N ++ ++ + LN R +L+ YL + +
Sbjct: 231 YFTDATTQWIVTFLLI-FLHLYSNYQAVTALIFKNLNNLRMTLILKSYLAVNAVLGPECI 289
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
N E +F F +S K +I LG L N ++ +L
Sbjct: 290 NKTESVFLGFG---------------TSVEKFCGFKIVLGQALGKTLRRY--NSDELKSL 332
Query: 417 FSLYEDEGYILTE--HGGKFCVVLKESALPQDMLKSLFQASYL 457
+Y+D+ Y++ V ++ PQ+++ + A L
Sbjct: 333 TEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGFYHAVLL 375
>gi|350411769|ref|XP_003489447.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus impatiens]
Length = 464
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 177/373 (47%), Gaps = 25/373 (6%)
Query: 90 KDAFVVASASNASLSSLLS----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRS 145
KD F V S S++ LLS + + + LP+G+PDSV DY Y +W +
Sbjct: 15 KDRFFVKSKDEQSITELLSDTTRTKSFYSGFISIVKEVFLPQGYPDSVHPDYTPYQIWDT 74
Query: 146 VQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
VQ AS I G L T +++ +G+G+ A P AAAI W+LKDG G + +I+ + + G D
Sbjct: 75 VQAFASTIMGTLTTHSIMQGVGVGEAAATPLAAAITWILKDGTGMIGRIVFAWWNGTDLD 134
Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
K WRLFAD+L + A GLE+ P F + + I + A +S + +TR+
Sbjct: 135 GQCKKWRLFADILNDLAMGLELFLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTQHQ 194
Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
A + N A+V AK +Q I +GI + + I + + + + + IH+Y N +
Sbjct: 195 ALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHNGRYIMELYLFLVMIHLYANYMA 253
Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
+++ L +LN R +L+ Y+ + P +EVN +E S LL+
Sbjct: 254 VKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKE----------------SVLLLTK 297
Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKES 441
I+ + L + + + + D L+ D Y+++ + LK+
Sbjct: 298 PTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDRKYLISIDVQNKTIFICLKKD 357
Query: 442 ALPQDMLKSLFQA 454
A P D+L++ F A
Sbjct: 358 AQPYDVLEAYFHA 370
>gi|270013094|gb|EFA09542.1| hypothetical protein TcasGA2_TC011650 [Tribolium castaneum]
Length = 450
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/343 (27%), Positives = 161/343 (46%), Gaps = 22/343 (6%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
F + +LP G+PDSV+ DY Y LW +VQ S I+G+ TQA+L +G+G A +A
Sbjct: 51 FFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTTQAILKGVGVGSAEASALSA 110
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
I W++KDG G + +I+ + + G+ D N K WRLFADLL + A LE+ P H
Sbjct: 111 TITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLNDGAMILELSIPHVTHLSTE 170
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I + ++ + +TR+ A + N AE+ AK +Q + I GI L N
Sbjct: 171 ILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDGSQETLVNLIASFTGIFLIN 230
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+ + + +F ++ ++H+Y N ++ ++ + LN R +L+ YL + +
Sbjct: 231 YFTDATTQWIVTFLLI-FLHLYSNYQAVTALIFKNLNNLRMTLILKSYLAVNAVLGPECI 289
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
N E +F F +S K +I LG L N ++ +L
Sbjct: 290 NKTESVFLGFG---------------TSVEKFCGFKIVLGQALGKTLRRY--NSDELKSL 332
Query: 417 FSLYEDEGYILTE--HGGKFCVVLKESALPQDMLKSLFQASYL 457
+Y+D+ Y++ V ++ PQ+++ + A L
Sbjct: 333 TEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGFYHAVLL 375
>gi|307191039|gb|EFN74793.1| UPF0420 protein C16orf58-like protein [Camponotus floridanus]
Length = 825
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/338 (28%), Positives = 160/338 (47%), Gaps = 21/338 (6%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAIN 180
+ LP+G+PDSV SDY Y +W ++Q AS I LAT +++ IG+G+ A P AAAI
Sbjct: 406 EIFLPQGYPDSVHSDYTAYQIWDTIQASASTIMNTLATHSIMIGIGVGESNATPLAAAIT 465
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
W+LK+G G + IM + + G D K WRLFAD+L + A G+E+L F + V I
Sbjct: 466 WILKNGAGMVGSIMFAWWNGTKLDGQCKKWRLFADILNDMAAGMELLVSYFSSYSVVILC 525
Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
+ +S + +TR A R N A+V AK +Q M I +GI + + +
Sbjct: 526 ISTIMKSMVGVAGGATRVALTQHQAIRNNLADVAAKDGSQEMCVNLIASFIGIFILS-LF 584
Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
+ + + + +H+Y N + + + L LN R +L+ Y+++ + P K +N E
Sbjct: 585 HDRQYLIELYLFLVAVHLYANYSAVKVLCLDILNEDRLALIIKYYMINERIPEPKRLNKE 644
Query: 360 EPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSL 419
E S L+ + I+ + S L V + + L
Sbjct: 645 E----------------SVSLLDNPTKSICGFNIKIGVSFASVLRKNVISSMEMEFLLKF 688
Query: 420 YEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQAS 455
+ED Y++ V+LK+ ++L++ F AS
Sbjct: 689 FEDRKYLIAIDVKNRNIFVILKKDIQATEILEAYFYAS 726
>gi|66816549|ref|XP_642284.1| DUF647 family protein [Dictyostelium discoideum AX4]
gi|60470350|gb|EAL68330.1| DUF647 family protein [Dictyostelium discoideum AX4]
Length = 567
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 144/290 (49%), Gaps = 50/290 (17%)
Query: 107 LSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYA- 165
L NKL+D ++LF LP+G+P +VTSDY+ Y W +Q V + VL+T ALL +
Sbjct: 114 LDDNKLFDSIKKLF----LPKGYPSTVTSDYIGYQKWMFIQNVIGSTTYVLSTHALLTSV 169
Query: 166 IGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGL 223
IG+ A+P AAAI+WVLKDG+G + ++ +N + FD + K ++ DLL N L
Sbjct: 170 IGISSTSALPFAAAISWVLKDGLGATALVLFANKYSTLFDYDIKKYKFRGDLLHNFGVLL 229
Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM-- 281
EM TP F F+ + + + + A LI STR+ F+ + N ++ AK ++Q M
Sbjct: 230 EMFTPFFQQFFLPLASISNLSKGLAGLIYGSTRASLNRSFSLKENIGDITAKYQSQSMAA 289
Query: 282 ------VSKAIGIML---------------GIALANHIGSSMPFALASFSVVTWIHMYCN 320
+ +IG++L G++ +IH++C
Sbjct: 290 YLSGMAIGSSIGMILSVFPLTSVSNLSMVFGLS--------------------FIHIFCG 329
Query: 321 LKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFK 370
+ + +SI+L+TLN R S++ +L S K VN EE + K
Sbjct: 330 INAIKSIDLKTLNQQRTSILIDNWLKSNLILTSKIVNQEEKFITNDKYLK 379
>gi|300176775|emb|CBK25344.2| unnamed protein product [Blastocystis hominis]
Length = 409
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 92/292 (31%), Positives = 145/292 (49%), Gaps = 12/292 (4%)
Query: 88 FTKDAFVVASASNASLSS-LLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSV 146
F K++ + S SL + L++ + LP +P SV YLNY W +
Sbjct: 68 FDKESNRIKKESTTSLRNHTLTLPVFIKTTKLKLTDIFLPRNYPSSVKGKYLNYCSWAAF 127
Query: 147 QGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVN 205
S GVL+TQ++L+AIG+G GA+P AAA+NWVLKDG+G L + + D +
Sbjct: 128 GMGLSAAGGVLSTQSMLFAIGMGAGAVPMAAALNWVLKDGLGQLGGMAFTALVNSRLDAD 187
Query: 206 PKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAA 265
+GWR+ + L + LE++TP FPH F+ + A G++ + L ++TR+ GF
Sbjct: 188 SRGWRILSAWLLEISTWLEVMTPLFPHSFLLLATLANVGKNISWLAGSATRAGIRYGFVN 247
Query: 266 RRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG-SSMPFALASFSVVTWIHMYCNLKSY 324
N ++ AK +Q + G LGI ++N IG M + L S ++ + ++ +S
Sbjct: 248 AHNMGDITAKEGSQTVAITVFGTFLGIVISNLIGHGHMEYVLMSSMCISAVSLFSIYQSL 307
Query: 325 QSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANK 376
+ + L TLN S LLS ND+ L +H F ++ A K
Sbjct: 308 RCVSLPTLNYQVGRSSVSRCLLSSS-------NDQ--LKAYYHGFLLRKAIK 350
>gi|125986275|ref|XP_001356901.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
gi|195148693|ref|XP_002015302.1| GL19630 [Drosophila persimilis]
gi|54645227|gb|EAL33967.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
gi|194107255|gb|EDW29298.1| GL19630 [Drosophila persimilis]
Length = 417
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 103/342 (30%), Positives = 160/342 (46%), Gaps = 28/342 (8%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAIN 180
+ LP+G+PDSV+ DY Y +W +VQ S I G L T A+L IG+G I +A +
Sbjct: 51 KIFLPKGYPDSVSEDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENINAYSATVT 110
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
W++K+G G++ +I+ + + G DV+ K WRL AD L + A G+E+ + P +PH I
Sbjct: 111 WIMKEGCGHVGRILFAWWKGCQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ ++ + +TRS A R N A+V +K +Q + +G+ L I
Sbjct: 171 CGSTMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLTLI 230
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
S F VV +H+Y NLK+ +S+ LRT N R + EY S + ++VN
Sbjct: 231 -KSQAVLYTVFYVVVALHLYANLKAVRSVCLRTFNESRYLIALEEYFRSSRMLTPQQVNA 289
Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK-LSDVVNNKEDAHALF 417
E + + + +V + RL L K L D +
Sbjct: 290 MERV---------------------TLGQTVSVSLNVRLGLSVKNLIDEYKTSSSIENIV 328
Query: 418 SLYE-DEGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
S ++ E +I+ E V L QD+LK+ F A SYL
Sbjct: 329 SSFDPHEHFIIAESKKYLGVYLHFETRQQDVLKAYFFAVSYL 370
>gi|195437813|ref|XP_002066834.1| GK24688 [Drosophila willistoni]
gi|194162919|gb|EDW77820.1| GK24688 [Drosophila willistoni]
Length = 417
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 101/342 (29%), Positives = 166/342 (48%), Gaps = 28/342 (8%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAIN 180
+ LP+G+PDSV+ DY Y ++ ++Q S I G L T A+L +IG+G I +A +
Sbjct: 51 KIFLPKGYPDSVSDDYAAYQIYDTIQAFCSTICGTLCTHAILKSIGVGSENINAYSATVT 110
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
W+LK+G G++ +I+ + + G DV+ K WRL AD L + A G+E+ + P +PH I
Sbjct: 111 WILKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ ++ ++ +TRS A R N A+V +K +Q + +G+ + I
Sbjct: 171 CGSTMLKAVVSVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLLASCVGLYVLTLI 230
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
S + VVT +H+Y NLK+ +++ LRT N R + E+ S + ++VN
Sbjct: 231 KSPAVLYTVFYLVVT-LHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSRMLTPQQVNG 289
Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK-LSDVVNNKEDAHALF 417
E + + + +V + RL L K L D + D +
Sbjct: 290 MERV---------------------TLGQTVSVSLNIRLGLSVKNLVDEYKSSSDIENIV 328
Query: 418 SLYE-DEGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
S ++ E +I+ E V L PQD+LK+ F A SYL
Sbjct: 329 SSFDTHERFIIAESKKYLGVYLHFDTRPQDVLKAYFFAVSYL 370
>gi|168013684|ref|XP_001759421.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689351|gb|EDQ75723.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 712
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 94/285 (32%), Positives = 149/285 (52%), Gaps = 20/285 (7%)
Query: 98 ASNASLSSLL--SVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISG 155
A N S+S + SV KL + +FV PD V YL Y W+ V S +
Sbjct: 257 ARNESVSGQVAGSVLKLRQTLQRIFV--------PDQVWPHYLVYLKWKLVHRFLSSVLH 308
Query: 156 VLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFA 213
TQA+L+A+G+G K +P AAA+NWV+KDG+G L K++ + NFGR FD + K R +
Sbjct: 309 FQCTQAMLWAVGVGAKRRLPAAAALNWVMKDGVGRLGKLLFTANFGRTFDSDLKRVRFWT 368
Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
+L +++ GLE+LTP FP HF+ + A +S A +T S + FA N A++
Sbjct: 369 SVLFSSSVGLEILTPLFPQHFLLLATLANIAKSIAYAAYLATSSAIHRSFALGDNLADIS 428
Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMP-----FALASFSVVTWIHMYCNLKSYQSIE 328
AKG AQ +V+ +G+ + L+ I + P LA + V+ ++ + Q++
Sbjct: 429 AKGMAQTVVADNLGLAAAVCLSQFI-HNFPKIEKFLPLAMYPVLASAELFAIYQQLQAVH 487
Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE--PLFPAFHFFKI 371
L+TLN R ++ + ++ + P +EV+ E L F F +
Sbjct: 488 LQTLNKERLEIIINIWVKDRRIPSFEEVSKIENVALLDLFRVFGV 532
>gi|147768659|emb|CAN60615.1| hypothetical protein VITISV_003254 [Vitis vinifera]
Length = 470
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/347 (29%), Positives = 160/347 (46%), Gaps = 65/347 (18%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
DE +++PEGFPDSVT Y+ Y WR+++ GV TQ LL ++G+ + +
Sbjct: 109 DELMSFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNRS 168
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
P A AINW+LKDG G + K++ + G+ FD + K RL DLL G+E+ T A PH
Sbjct: 169 APGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVPH 228
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F+ + AA ++ AA+ STR+ Y FA N +V AKGE G ++ +
Sbjct: 229 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLV------ 282
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
+S+ L TLN R S+ +L +G+ P
Sbjct: 283 --------------------------------KSVVLHTLNRARYSVAIDCFLKTGRVPS 310
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
++E N +E +F +F + + K + LGS+ D +
Sbjct: 311 LQEGNMKENIF-SFPWLEDKP-----------------------IVLGSRFKDAFQDAGS 346
Query: 413 AHALFSLYEDEGYILTEH--GGKFCVVLKESALPQDMLKSLFQASYL 457
++ L+E E Y++T + GK +LK+ A D+LK+ F A L
Sbjct: 347 YLSIEPLFEKERYLVTYNPSKGKVYALLKDQAKSDDILKAAFHAHML 393
>gi|195996945|ref|XP_002108341.1| hypothetical protein TRIADDRAFT_19061 [Trichoplax adhaerens]
gi|190589117|gb|EDV29139.1| hypothetical protein TRIADDRAFT_19061, partial [Trichoplax
adhaerens]
Length = 272
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 85/273 (31%), Positives = 139/273 (50%), Gaps = 15/273 (5%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
LP+G+P+SV+ DYL Y LW ++Q S I+G LATQA+L +G+G + A AAA+ W+L
Sbjct: 4 LPQGYPESVSGDYLTYQLWDTLQAFCSSITGTLATQAILKGVGVGDEKATVAAAAMTWML 63
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPK------GWRLFADLLENAAFGLEMLTPAFPHHFVF 236
KDG G + +I+ + G D + K +RLFAD L + A +EM+ P FPH F
Sbjct: 64 KDGTGMIGRILFAWLRGTSLDCDAKKMLIFISYRLFADFLNDGALFIEMIAPNFPHLFTL 123
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ A +S + +TR+ A R N A+V AK +Q + +++G+ L
Sbjct: 124 LLCVASVAKSIVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLSALIVGLFLLP 183
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+ + F + T +H+Y N K+ S+ + TLN R ++ + G EV
Sbjct: 184 AVTGNARIIWTLFFMFTMLHLYANYKAVSSVVMETLNRERLHILAYNFFRYGSILSPAEV 243
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDA 389
E + +K++ ++L +E K++
Sbjct: 244 CQRESVIIMKDQYKLR-------MILGAEFKES 269
>gi|226505832|ref|NP_001140509.1| uncharacterized protein LOC100272571 [Zea mays]
gi|194699768|gb|ACF83968.1| unknown [Zea mays]
Length = 491
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 96/348 (27%), Positives = 163/348 (46%), Gaps = 47/348 (13%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
+E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ +
Sbjct: 106 EEIMSFIRSYVVPEGFPDSVTPSYVPYMNWRALKHFFGGAMGVFTTRTLLSSVGVSQS-- 163
Query: 174 PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
KDG G + K++ + G+ FD + K R +DLL G+E+ T AFP
Sbjct: 164 ----------KDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGIELTTAAFPQF 213
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
F+ + A ++ AA+ STR+ Y +A N +V AKGE+ G ++ +LG
Sbjct: 214 FLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGTG 269
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
L+ I P + SF+ ++ ++ + + +S+ L TLN R ++ ++ +G P +
Sbjct: 270 LSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSL 329
Query: 354 KEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 411
KE N +E +F P H + +GS+ +
Sbjct: 330 KEGNSQETIFNPPWRH---------------------------ETVAIGSRFGEAFQEPA 362
Query: 412 DAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
A+ L+EDE Y++T + K V +LK+ A D++K+ F A L
Sbjct: 363 SFLAIRPLFEDERYMVTYNPTKDKVYALLKDQAKSDDIIKAAFHAHVL 410
>gi|398388772|ref|XP_003847847.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
gi|339467721|gb|EGP82823.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
Length = 457
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/353 (29%), Positives = 182/353 (51%), Gaps = 37/353 (10%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
LP GFP SVT DY Y ++ S+Q +S I+ +L+++A+L +IG+G A PTAA + VL
Sbjct: 58 LPAGFPHSVTEDYTEYQIYDSLQAFSSSIAAMLSSRAVLSSIGVGDTKASPTAALLLSVL 117
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAA 241
++ +G L+ I+ ++ FG + K +RL AD+L ++AF L++L+P FP + I + +
Sbjct: 118 QESVGRLATILFAHRFGTSLEPECKMYRLMADVLNDSAFILDVLSPIFPKPIRIVILSFS 177
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
RS + S+++ A FA N E+ AK +Q V +G++ G + + I +
Sbjct: 178 SILRSLCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLMGMLAGSLVISWITTP 237
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
+ A +++ IH+ N ++ +++++RTLN RA+LVF P +KE++ E
Sbjct: 238 LATWTALIGLLS-IHLETNRRAVRAVKMRTLNRQRATLVFHHLQRQQTVPSIKEISSVE- 295
Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAA------------VEIEHRLQLGSKLSDVVNN 409
H F+ VL + +D +E Q +K S +
Sbjct: 296 -----HIFEWDG-------VLRTSTRDIMGYCDIGTPFLRLLEAVSESQTSTKASHI--Q 341
Query: 410 KEDAHALFSLYEDEGYIL------TEHGGKFCVVLKESALPQDMLKSLFQASY 456
++ + SLY YIL T+ +F ++LK+ A P+D+L + +QA +
Sbjct: 342 QQTLSQILSLYNSSRYILWHDRRSTKDVPRFNIILKKGAEPKDLLIAWWQALF 394
>gi|170582654|ref|XP_001896227.1| cDNA sequence BC017158 [Brugia malayi]
gi|158596613|gb|EDP34929.1| cDNA sequence BC017158, putative [Brugia malayi]
Length = 389
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 80/247 (32%), Positives = 138/247 (55%), Gaps = 2/247 (0%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
L +P G+P SV+ DY+NY +W ++Q AS +S L+T+A+L +G+G K A AA
Sbjct: 37 LLRDIFMPRGYPQSVSPDYMNYQMWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAA 96
Query: 178 AINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
A+ W+LKDGIG L++I+ + + + D + K WRL AD + AF L+++TP FP F+
Sbjct: 97 AVAWLLKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPSLFMP 156
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I + R+ + ++TR+ A N +V AK +Q + ++ + L
Sbjct: 157 IICLSSMIRAVVGVAGSATRTTVTNHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLP 216
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+ ++ FS+ T+IH+Y N ++ +S++ RTLN +V +Y+ + + V E+
Sbjct: 217 VVSRNIVSVWLLFSLFTFIHLYGNYRAVKSLQFRTLNQSLLRIVVKDYIETRKISTVSEM 276
Query: 357 NDEEPLF 363
N +EP+
Sbjct: 277 NSKEPIL 283
>gi|242056489|ref|XP_002457390.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
gi|241929365|gb|EES02510.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
Length = 624
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/395 (29%), Positives = 190/395 (48%), Gaps = 47/395 (11%)
Query: 82 TKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYS 141
+KL K V + + + SL+ +WD F+LP GFP SV+ DYL+Y
Sbjct: 206 SKLQVHLEKHVSPVNAVEDENADSLIP-QAIWD--------FVLPAGFPGSVSVDYLDYM 256
Query: 142 LWRSVQGVASQISGVLATQALLYAIGL----GKGAIPTAAAINWVLKDGIGYLSKIML-S 196
L + V I VL T +LL A+G+ G A +AAAI WV KDGIG ++++
Sbjct: 257 LLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAASAAAIRWVSKDGIGAFGRLLIGG 316
Query: 197 NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTR 256
FG FD +PK WR++AD++ +A E++TP +P +F+ + + ++ A + +
Sbjct: 317 RFGALFDDDPKKWRMYADIIGSAGSIFELMTPLYPGYFLPLASLGNLAKAVARGFKDPSF 376
Query: 257 SCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG---SSMPFALASFSVVT 313
FA N EV AK E + ++ +G+ G+ + + G S + AL S V
Sbjct: 377 RVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSAGVLIMDTAGVKSSYLTLALTWLS-VR 435
Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKS 373
+H++ +S ++ RT+N R ++ ++ P N+EE + F + +
Sbjct: 436 LLHLWLRYQSLSVLKFRTINLKRGRILVRSHVAQHTVPGYVACNEEENILTWERFLRPQI 495
Query: 374 ANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHG 431
+ V +E R+ G + SD+V+ L LY++E YIL + G
Sbjct: 496 S--------------FGVPME-RMLGGDESSDMVDR------LLKLYKNEKYILFFEQFG 534
Query: 432 GK---FCVVLKESALPQDMLKSLFQASYLYWLERN 463
+ F V KESA +L+SL+QA +WL +N
Sbjct: 535 SRDPTFLVTFKESATSMSVLRSLWQA---HWLHKN 566
>gi|255081626|ref|XP_002508035.1| predicted protein [Micromonas sp. RCC299]
gi|226523311|gb|ACO69293.1| predicted protein [Micromonas sp. RCC299]
Length = 665
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 86/247 (34%), Positives = 133/247 (53%), Gaps = 14/247 (5%)
Query: 129 FP--DSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGA-IPTAAAINWVLKD 185
FP +SVT DY Y+ +R +Q +AS V ATQ +L AIG+G +P AAAINWVLKD
Sbjct: 199 FPARESVTEDYWEYAKFRFLQRMASSCITVFATQQMLAAIGMGASRRLPAAAAINWVLKD 258
Query: 186 GIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAG 244
G+G L K+ + +NFGR FD + K +R + ++ +A+ +EM+TP FP F+ + A G
Sbjct: 259 GLGRLGKLGVAANFGREFDSDVKRFRFTSSVVYDASSLVEMITPFFPKKFLALATLANIG 318
Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS--- 301
+S R+ F N AEV AK AQ +V+ +G+ + + G
Sbjct: 319 KSVGITTANVVRAPIQRSFVLEENLAEVAAKTSAQQVVADNLGLAVAVLATGLTGKVVND 378
Query: 302 -----MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+PF +F + + +YC + ++++L T+N RA ++ ++ G P K V
Sbjct: 379 RARLIIPF--VAFIPLATMDLYCIYRELKAVQLTTINKERAEIICDHWINVGTVPGTKYV 436
Query: 357 NDEEPLF 363
+D E LF
Sbjct: 437 SDAERLF 443
>gi|357127303|ref|XP_003565322.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
Length = 486
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/353 (29%), Positives = 171/353 (48%), Gaps = 36/353 (10%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG----KGAIPTAAA 178
F+LP GFP SV+ DYL Y LW+ V I L T +LL A+G+G A +AAA
Sbjct: 98 FVLPAGFPGSVSDDYLQYMLWQFPTNVTGWICHTLVTSSLLKAVGVGSFTGTSAAASAAA 157
Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
I WV KDGIG ++++ FG FD +PK WR++AD + + E+ TP +P +F+ +
Sbjct: 158 IRWVSKDGIGAFGRLLIGGRFGTLFDNDPKKWRMYADFIGSVGSIFELTTPLYPGYFLPL 217
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ ++ + + FA N E+ AK E G+ ++ +G+ +G+ + +
Sbjct: 218 ASLGNLAKAVGRGFRDPSNRVIQNHFAKSGNLGEIAAKEEVWGVGAQLLGLSIGVLILDT 277
Query: 298 IG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
G SS P ++ V +H++ +S ++ RT+N RA ++ ++ P
Sbjct: 278 SGVQSSYPTLTLTWLGVRLLHLWFRYQSLIVLKFRTVNLKRARILVRSHVAHHTVPGYVA 337
Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
N+EE + F + + + V +E R+ G + SD+V+
Sbjct: 338 CNEEENILTWERFLQPQIS--------------FGVPME-RMLGGGEPSDMVSR------ 376
Query: 416 LFSLYEDEGYIL-----TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERN 463
L LY++E YIL F V KE+A +L+SL+QA +WL +N
Sbjct: 377 LLKLYKNEMYILFVDQSESKEPAFVVTFKEAATSMSVLRSLWQA---HWLHKN 426
>gi|307105811|gb|EFN54059.1| hypothetical protein CHLNCDRAFT_8919, partial [Chlorella
variabilis]
Length = 325
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 162/346 (46%), Gaps = 31/346 (8%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTA 176
+L ++LP+ FP SV Y +Y RSVQ V T++LL ++G+ K + A
Sbjct: 5 DLLADYLLPQDFPHSVAPQYSDYMRARSVQYFFGGAMSVFTTRSLLASLGVANKHSSEAA 64
Query: 177 AAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
AAINWV+KDG G L + + + +GR D K +RL D+L LE+ T P F+
Sbjct: 65 AAINWVVKDGAGRLGRFLFARWGRELDCELKQFRLMGDVLMETGAALELSTVLMPRMFLP 124
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ A ++ AA+ +STR+ Y FA + N A+V AKGE+ ++ +G GIALA
Sbjct: 125 LACTANLAKNLAAVTASSTRAPIYRTFAKQNNLADVTAKGESVANLADVVGTAFGIALAK 184
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+++P L +F+ ++ ++ + + S+ L LN R S + +G+ P E
Sbjct: 185 ---ANLPV-LPTFAALSVGYLIASRREVDSVVLPYLNRARLSYTTRAFYSTGRVPETLEG 240
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
N EPL P S+ + R+ LG+ + + + H
Sbjct: 241 NYREPLMP------------------WSDPHNG------RVVLGATVEEACAGPQQLHDA 276
Query: 417 FSLYEDEGYILTEHGG--KFCVVLKESALPQDMLKSLFQASYLYWL 460
+ + Y LT K +LK+ A P+ + ++ A L W+
Sbjct: 277 LAAFSGRQYALTYRPDTRKCYALLKQGASPRSVQQAAMDAHALLWM 322
>gi|351711359|gb|EHB14278.1| hypothetical protein GW7_05117 [Heterocephalus glaber]
Length = 469
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 78/241 (32%), Positives = 129/241 (53%), Gaps = 2/241 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI-NWVL 183
LP+GFP+SV+ DYL Y W SVQ AS + LAT +++ +G+G AAA W++
Sbjct: 79 LPQGFPESVSPDYLPYQQWDSVQAFASSLCSALATHSVMLGLGVGNAEASLAAATATWLV 138
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
+D G L +I+ + + G D N K WRLFAD+L +AA LE++ P F I + +
Sbjct: 139 RDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDAAMLLELVAPVLSVSFTMIVSISN 198
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ ++ +TR+ A R N A+V AK +Q + G+++ + + + +
Sbjct: 199 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLIMLPLMSNCP 258
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
+L F ++T +H+Y N ++ +++ + TLN R LV +L SG+ N EPL
Sbjct: 259 RLSLGCFFLLTALHIYANYRAVRTLVMETLNESRLRLVLKHFLQSGEVLDTTSANQMEPL 318
Query: 363 F 363
+
Sbjct: 319 W 319
>gi|299115992|emb|CBN75993.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 479
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/224 (34%), Positives = 129/224 (57%), Gaps = 1/224 (0%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
+ +LP GFP SV Y+ Y+ W+ AS +GV++TQALLYA+GLG GA+P AA +
Sbjct: 1 MLGLVLPSGFPASVHPPYMAYAGWQFTGMAASAAAGVMSTQALLYAMGLGAGALPLAATL 60
Query: 180 NWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
NWV+KDG+G L + S+ FD NPK WR+ A + +A+ LE+L+P P +F+ I
Sbjct: 61 NWVIKDGLGQLGGVAFSSLVSTRFDANPKLWRVVAAVSLDASMVLELLSPLAPAYFLPIA 120
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ A G++ + L +++R+ + + + N A+V AK AQ +++ G G+ L+ +
Sbjct: 121 SVANIGKNVSFLAASASRAAIHNVLSLKGNLADVTAKSGAQTILACMAGTGAGVTLSASL 180
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFS 342
GS + + ++ +H+ N S + + + T + R LV +
Sbjct: 181 GSEFHAIVPACLCLSVVHLGANHMSMRRLGIPTFDHQRLELVVN 224
>gi|414586418|tpg|DAA36989.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 297
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 72/234 (30%), Positives = 132/234 (56%), Gaps = 1/234 (0%)
Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
+ D+ + F+ P G+P SV YL Y+ +R +Q +S + VL+TQ+LL+A GL +
Sbjct: 60 IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL-RP 118
Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
A A++W+LKDG+ + K++ S+ G D PK WR+ AD+L + LE+++P P
Sbjct: 119 TPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCP 178
Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
F+ + + A + +TR Y+ FA N +++ AKGEA + +GI G
Sbjct: 179 QLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAG 238
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
I LA+ + S+ L + +++ +H+Y ++ ++ + TLNP R +++ ++++
Sbjct: 239 IGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFI 292
>gi|242019564|ref|XP_002430230.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212515330|gb|EEB17492.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 527
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/248 (31%), Positives = 126/248 (50%), Gaps = 3/248 (1%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
+ L LP+G+P SV+SDYL Y +W Q S I G L +A++ IG+G A P
Sbjct: 37 KSLIKDIFLPKGYPYSVSSDYLEYQIWDFCQAFCSSICGTLTVRAVMKGIGVGSNVATPM 96
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
+AA+ W++KDG G + IM + G D + K WRLFAD+L + A +E+ P +
Sbjct: 97 SAALTWIIKDGTGMIGSIMFAWLKGSKLDSDCKKWRLFADILNDIAMFIELSIPIIMDYS 156
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+++ AA ++ +STR+ A + N A+V AK +Q + + +G + GI L
Sbjct: 157 MYMLCAATIFKALVGTAGSSTRAAIITHQAVKGNTADVSAKDGSQELFTNLLGFIFGIFL 216
Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
++ S + T H+ N K+ + +++ N R L YL + P ++
Sbjct: 217 LKNVNESNQVWIV-MCTATIFHLIFNYKAVRCLKINVFNGERLRLTLLSYLPTSCVPTIE 275
Query: 355 EVNDEEPL 362
+VN EP+
Sbjct: 276 QVNRREPV 283
>gi|452005321|gb|EMD97777.1| hypothetical protein COCHEDRAFT_1165025 [Cochliobolus
heterostrophus C5]
Length = 421
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 99/343 (28%), Positives = 171/343 (49%), Gaps = 13/343 (3%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
+ LP GFP SVT DY+ Y ++ S+Q +S I+G+LA++A+L +G+G A PTAA
Sbjct: 1 MIDVFLPSGFPQSVTEDYVEYQIFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTAAL 60
Query: 179 INWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VF 236
+ VL++ +G ++ I ++ FG + K +RL AD+L ++ L+ L+P FP V
Sbjct: 61 LLSVLQESVGRMATIAFADRFGTALEPECKMYRLTADILNDSGMILDCLSPGFPKPLRVA 120
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ + A RS + S+++ A FA N EV AK +Q V +G++ G + +
Sbjct: 121 VLSFASCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFVVS 180
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
HI S + + ++ IH+ N + +++ +++LN RA+++FS L G KEV
Sbjct: 181 HITSDIA-TWCTLILLLVIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLSPKEV 239
Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE----- 411
+ E +F + + + + I R +L L+ + +
Sbjct: 240 SARERIFERGGVLRWADDEVLGKCRIGTSLSELLGRIGRRDKLSGSLAFCSSGADGESEV 299
Query: 412 DAHALFSLYEDEGYILTEHGG----KFCVVLKESALPQDMLKS 450
D L S++E E Y+L GG + +VLK P D L++
Sbjct: 300 DVFDLLSIFEAEDYVLWASGGPAAYEAVIVLKAGCGPIDQLRA 342
>gi|326427989|gb|EGD73559.1| hypothetical protein PTSG_05267 [Salpingoeca sp. ATCC 50818]
Length = 404
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 81/251 (32%), Positives = 132/251 (52%), Gaps = 6/251 (2%)
Query: 103 LSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQAL 162
LSSL S R LP + SV+ +L+Y W+ V SGVL+ Q+L
Sbjct: 104 LSSLTS-----QPLRTRLEHVFLPRDYRASVSPKFLDYCKWQFVHMTTMTASGVLSMQSL 158
Query: 163 LYAIGLGKGAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAF 221
LYA+G+G G++P AAAINW+LKDG+G L ++ S FG FD +PK R A L +
Sbjct: 159 LYAVGVGAGSVPLAAAINWILKDGLGQLGGMLYGSIFGTRFDDDPKRQRFNAVLSLQVSG 218
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
++++TP FPHHF+ + + + G++ + L ++TR+ F N +V AK +Q +
Sbjct: 219 IVDIITPLFPHHFLLLASVSNFGKNVSYLASSATRAQMNLSFTRTSNLGDVTAKMTSQSI 278
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ G LGI ++ G+ +A++ ++ + + N S + + L+T N RA ++
Sbjct: 279 AASVFGTGLGILVSKVTGTEAALLMAAYVPLSLVSIGGNYYSSKFVSLKTFNVQRAEILA 338
Query: 342 SEYLLSGQAPP 352
Y + P
Sbjct: 339 RHYFTTKVCTP 349
>gi|293336742|ref|NP_001169416.1| hypothetical protein [Zea mays]
gi|224029221|gb|ACN33686.1| unknown [Zea mays]
gi|414876408|tpg|DAA53539.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
Length = 481
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 116/405 (28%), Positives = 192/405 (47%), Gaps = 43/405 (10%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL----GKGAIPT 175
+ F+LP GFP SV++DYL+Y L + V I VL T +LL A+G+ G A +
Sbjct: 91 ILDFVLPAGFPGSVSADYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAAS 150
Query: 176 AAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
AAAI WV KDGIG ++++ FG FD +PK WR++AD + +A E+ T +P +F
Sbjct: 151 AAAIRWVSKDGIGAFGRLLIGGRFGALFDDDPKKWRMYADFIGSAGSIFELSTTLYPGYF 210
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+ + + ++ A + + FA N EV AK E + ++ +G+ +G+ +
Sbjct: 211 LPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVLI 270
Query: 295 ANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
+ G SS + ++ V +H++ +S ++ RT+N R ++ ++ P
Sbjct: 271 MDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLKFRTINLKRGRILVKSHVAQHTVPG 330
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
N+EE + + + + + V +E R+ G K SD+VN
Sbjct: 331 YVACNEEENILTWERYLRPQVS--------------FGVPME-RMLGGDKSSDMVNR--- 372
Query: 413 AHALFSLYEDEGYIL--TEHGGK---FCVVLKESALPQDMLKSLFQASYLYWLERNAGIV 467
L LY++E YIL + G + F V KESA +L+SL+QA +WL +N
Sbjct: 373 ---LLKLYKNEKYILFFEQFGSREPTFLVTFKESATSMSVLRSLWQA---HWLHKN---- 422
Query: 468 ATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIAR 512
+ P LE S+ ++ F S GW D +I +
Sbjct: 423 -QLRQDEVFPW--LEESIVALENGFTDFLEQLGSAGWDQDLIILK 464
>gi|321455034|gb|EFX66180.1| hypothetical protein DAPPUDRAFT_302922 [Daphnia pulex]
Length = 315
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 151/321 (47%), Gaps = 28/321 (8%)
Query: 62 SEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFV 121
+E DG K Y S++ + D T++ V + S L N + L
Sbjct: 9 TEKDGEKGY--------SRQYVKVTDTTENGVVC-------IDSDLDSNGKFRGITHLLS 53
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
LP+G+P SV+ DYL Y +W ++Q AS +SG L+T+A+L +G+G + P AA +
Sbjct: 54 SVFLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAATLM 113
Query: 181 WVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTP-------AFPH 232
W++KDG G + +I+ + G D + K WR FADLL + A LE+ P
Sbjct: 114 WLIKDGTGMVGRILFAWCNGTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGGSNG 173
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
I AG +S + +TR+ A + N A+V AK +Q + ++ +
Sbjct: 174 TMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALITSL 233
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L + + F + T +H++ NLK+ +++ + TLN R ++ ++ +
Sbjct: 234 WLLPILDGATGLTWLMFILFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTKYVAS 293
Query: 353 VKEVNDEEP----LFPAFHFF 369
VKEVN EP L P +F
Sbjct: 294 VKEVNRLEPVIIGLIPTGNFI 314
>gi|417401281|gb|JAA47531.1| Hypothetical protein [Desmodus rotundus]
Length = 457
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 83/248 (33%), Positives = 130/248 (52%), Gaps = 11/248 (4%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
+ LP+GFPDSV+ DYL+Y LW SVQ AS +SG LAT A+L IG+G A +AA
Sbjct: 73 LLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVSAAT 132
Query: 179 INWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
W++KD G L +I+ + + KG LFAD+L + A LE++ P +P F
Sbjct: 133 ATWIVKDSTGMLGRIIFAWW--------KG--LFADILNDVAMFLEIMAPIYPMCFTVTI 182
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ + ++ +TR+ A R N A+V AK +Q + G+++ + + +
Sbjct: 183 CISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLV 242
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
F+L F ++T +H+Y N ++ Q++ + TLN R LV +L G+ N
Sbjct: 243 SDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTSANQ 302
Query: 359 EEPLFPAF 366
EPL+ F
Sbjct: 303 MEPLWTGF 310
>gi|302766892|ref|XP_002966866.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
gi|300164857|gb|EFJ31465.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
Length = 265
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 77/242 (31%), Positives = 133/242 (54%), Gaps = 2/242 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA-INWVL 183
LPEG+P SV+SDYL + W ++QG+++ I +L+TQALL IG+G A W L
Sbjct: 24 LPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWFL 83
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
+D G + I+ + + G + D K WRL AD + + +++++P FP F+F
Sbjct: 84 RDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFMNDIGMLMDLVSPLFPRAFMFFLCLGS 143
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
A R+ + +TR+ FA ++N A+V AK +Q + +G+++G+ LA +++
Sbjct: 144 AARAVTGVAGGATRAALTQHFARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANNI 203
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
SF +T H+Y N K+ ++ L ++N R ++V +L G+AP +E + +E
Sbjct: 204 VLMWTSFMSLTAFHVYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQESA 263
Query: 363 FP 364
P
Sbjct: 264 LP 265
>gi|451846811|gb|EMD60120.1| hypothetical protein COCSADRAFT_151456 [Cochliobolus sativus
ND90Pr]
Length = 482
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 111/402 (27%), Positives = 194/402 (48%), Gaps = 13/402 (3%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTA 176
+ + LP GFP SVT DY+ Y ++ S+Q +S I+G+LA++A+L +G+G A PTA
Sbjct: 59 QYIIDVFLPSGFPQSVTEDYIEYQVFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTA 118
Query: 177 AAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF- 234
A + VL++ +G ++ I ++ FG + K +RL AD+L ++A L+ L+P FP
Sbjct: 119 ALLLSVLQESVGRMATIAFADRFGTALEPECKMYRLTADILNDSAMILDCLSPGFPKPLR 178
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
V + + A RS + S+++ A FA N EV AK +Q V +G++ G +
Sbjct: 179 VAVLSFASCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFV 238
Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
+HI S + + ++ IH+ N + +++ +++LN RA+++FS L G K
Sbjct: 239 VSHITSDLA-TWCTLILLLAIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLNPK 297
Query: 355 EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH 414
EV+ E +F + + + + I R +L L+ + +D
Sbjct: 298 EVSARERIFERGGVLRWADDEVLGKCRIGTSLSELLGRIGRRDKLSGSLAFCSSGGDDGS 357
Query: 415 A-----LFSLYEDEGYILTEHGG----KFCVVLKESALPQDMLKSLFQASYLYWLERNAG 465
L S++E E Y+L GG + +VLK P D L++ A + + G
Sbjct: 358 EVDVFDLLSIFEAEDYVLWATGGPAAYEAVIVLKAGCGPIDQLRAWAHALLVAEKRQRGG 417
Query: 466 IVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTD 507
+ + G L+ +L V+R F+ + + GW D
Sbjct: 418 DSGGTRAKHGLVVGDLKDTLYEVRRMFDKYGDEMRAKGWNLD 459
>gi|339250912|ref|XP_003374441.1| conserved hypothetical protein [Trichinella spiralis]
gi|316969248|gb|EFV53376.1| conserved hypothetical protein [Trichinella spiralis]
Length = 340
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/247 (29%), Positives = 128/247 (51%), Gaps = 4/247 (1%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAIN 180
+ LP+G+PDSVT DY +Y W S+Q S ++ A A+ A+G+G + A AA +
Sbjct: 53 KIFLPQGYPDSVTEDYFDYQCWDSLQAFCSSVTNAFALHAVFKAVGVGNESASLLAATVV 112
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
W+L++G+G +S+IM + F H D + K WR AD+L + + L+++ P FV++
Sbjct: 113 WLLRNGVGMMSQIMFAWFCAGHLDSDCKSWRFVADVLNDCSIFLDLIAPLSESTFVYVAC 172
Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
A RS + +TR+ + R N A+V AK +Q + I + +AL +
Sbjct: 173 LANVCRSIVGIAGGATRAAITQHQSRRHNMADVAAKDSSQETMVNLIALGFNLALLPLVA 232
Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP--VKEVN 357
+ F + T++H+Y N ++ ++++ TLN R ++ ++ V+ VN
Sbjct: 233 HHVLLVWTLFLMFTFMHIYANYRAIRALKFETLNEKRFRMLVQSFIKDNTNAKMGVEIVN 292
Query: 358 DEEPLFP 364
EE L P
Sbjct: 293 YEESLVP 299
>gi|417401214|gb|JAA47499.1| Hypothetical protein [Desmodus rotundus]
Length = 453
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/248 (33%), Positives = 130/248 (52%), Gaps = 11/248 (4%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
+ LP+GFPDSV+ DYL+Y LW SVQ AS +SG LAT A+L IG+G A +AA
Sbjct: 73 LLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVSAAT 132
Query: 179 INWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
W++KD G L +I+ + + KG LFAD+L + A LE++ P +P F
Sbjct: 133 ATWIVKDSTGMLGRIIFAWW--------KG--LFADILNDVAMFLEIMAPIYPMCFTVTI 182
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ + ++ +TR+ A R N A+V AK +Q + G+++ + + +
Sbjct: 183 CISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLV 242
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
F+L F ++T +H+Y N ++ Q++ + TLN R LV +L G+ N
Sbjct: 243 SDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTSANQ 302
Query: 359 EEPLFPAF 366
EPL+ F
Sbjct: 303 MEPLWTGF 310
>gi|330805147|ref|XP_003290548.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
gi|325079335|gb|EGC32940.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
Length = 493
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/241 (31%), Positives = 133/241 (55%), Gaps = 3/241 (1%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI-GLG-KGAIPTAAAIN 180
+ LP+G+P+SV+ DY++Y W Q V ++ VL+T ALL + G+ A+P AAAI+
Sbjct: 99 YFLPKGYPNSVSPDYIHYYKWLFCQNVVGSVTYVLSTHALLTTVVGMSVASALPFAAAIS 158
Query: 181 WVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
WVLKDG+G + + +N F D + K ++ D+L N LEM TP +F+ + +
Sbjct: 159 WVLKDGLGAFALVFFANKFSTLLDFDLKKYKFRGDILHNLGVLLEMCTPFIQGYFLPLAS 218
Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
+ + A LI STR+ F+ + N ++ AK ++Q M + +G+ +G +
Sbjct: 219 ISNLSKGLAGLIYGSTRASLNKSFSIKDNIGDITAKYQSQSMAAYLMGMGIGSTFGLLLS 278
Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
S ++ ++++HM+ K+ +SI L++LN R S++ +LL+ + K VN+
Sbjct: 279 GSTFSGISIVFSLSFVHMFLGYKAVKSINLKSLNKQRLSIIIDNWLLNSEILNSKIVNNN 338
Query: 360 E 360
E
Sbjct: 339 E 339
>gi|256077542|ref|XP_002575062.1| hypothetical protein [Schistosoma mansoni]
Length = 410
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 133/261 (50%), Gaps = 6/261 (2%)
Query: 87 DFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSV 146
D F + + + +SSL ++ W ++ F LP G+P SV+ DYL Y +W ++
Sbjct: 12 DEVSRVFTLEKSQSRQISSLFDLHLSWVTLKKFF----LPVGYPSSVSDDYLEYQIWDTI 67
Query: 147 QGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDV 204
Q AS I+G LA+QA+L +G+G A +A++ W+ KDG G + +I+ + + G D
Sbjct: 68 QAFASSITGALASQAVLIGVGVGDSSATILSASLTWMFKDGSGMIGRIIFAGYHGIKLDC 127
Query: 205 NPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFA 264
+ K WR AD+L + A LE+++P F + F + A +S + ++TR+ A
Sbjct: 128 DCKFWRFVADILNDCALFLEIISPLFNYLFTPLLCLANVLKSLVGVAGSATRAAIVQHQA 187
Query: 265 ARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSY 324
N A+V AK +Q +S + +L L + + F T IH+Y N ++
Sbjct: 188 INNNLADVSAKDGSQETLSNLMAWLLNFILLYMVTGNQFLIWFCFICCTSIHLYSNYRAV 247
Query: 325 QSIELRTLNPYRASLVFSEYL 345
+ ++LRT N R L ++
Sbjct: 248 KCLKLRTFNRTRFHLAIQQWF 268
>gi|358335880|dbj|GAA28290.2| UPF0420 protein C16orf58 [Clonorchis sinensis]
Length = 428
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 153/307 (49%), Gaps = 20/307 (6%)
Query: 126 PEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLK 184
P G+P+SV+ DYL Y +W +VQ +AS ++G LA Q++L +G+G A A+++W+LK
Sbjct: 21 PSGYPNSVSPDYLEYQIWDTVQALASSVTGALAAQSVLIGVGVGDVKATILGASLSWMLK 80
Query: 185 DGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGA 243
DG G + KI+ + G + D + K WRL AD++ + A LE+ + FP F FI A
Sbjct: 81 DGSGMIGKIVFVGWQGSNLDCDCKRWRLLADIINDCALFLELASSFFPSVFTFIICLANL 140
Query: 244 GRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMP 303
+S ++ +TR+ A N A+V AK +Q +S + + +++ + +
Sbjct: 141 CKSVVSVAGGATRAAVTLHQALDNNVADVAAKDGSQETLSNLLALFFNLSVVYFVTGNWS 200
Query: 304 FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG--QAPPVKEVNDEEP 361
F ++T+IH+Y N ++ Q + L N R + ++ P V +
Sbjct: 201 IIWTGFVILTFIHVYANYRAVQCLRLSNFNRNRFHIAVQQWFRQRLFHDPSVSDPT---- 256
Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE-DAHALFSLY 420
A F I+ N + ++L A DA R+ LG L + N ++ + L +++
Sbjct: 257 ---ATPFPSIEWVNSREPILL--PASDA------RIHLGCSLQTLPNKEQYNLRDLINVF 305
Query: 421 EDEGYIL 427
E Y+L
Sbjct: 306 SQEKYLL 312
>gi|427789495|gb|JAA60199.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 464
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 139/296 (46%), Gaps = 14/296 (4%)
Query: 111 KLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG- 169
+ W R F + LP G+PDSV+ DY Y +W S+Q AS I+G LATQ++L +G+G
Sbjct: 36 RKWSSLRAFFSEIFLPVGYPDSVSRDYTEYQIWDSIQAFASSITGTLATQSVLSGVGVGD 95
Query: 170 KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
+GA AA W+L+DG G +I+ + F G + D + K WRLFAD+L + A LE++
Sbjct: 96 QGASVLAATTTWILRDGTGMTGRILFAWFQGSNLDYDSKKWRLFADILNDVAIFLELMCQ 155
Query: 229 AFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
+ + + +S + +TR+ A N A+V AK +Q +
Sbjct: 156 YMKGYVTAVLCVSSVAKSIVGVAGGATRAALTQHQARNNNMADVSAKDGSQETLVNLAAF 215
Query: 289 MLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL-LS 347
+ + L + + F T +H++ N ++ + + T N R ++V +L +
Sbjct: 216 LFSLLLLRIVAGNPWLLYGIFISFTLLHIFANYRAVSCVVMETFNRSRYAIVVRRFLETA 275
Query: 348 GQAPPVKEVNDEEPL-------FPAFHF----FKIKSANKSQLLVLSSEAKDAAVE 392
G PV VN E + F + +F F I NK + A D VE
Sbjct: 276 GDIAPVTWVNARESVWIQCGKPFVSINFGVPLFSIAPTNKELQCRVEDAAGDRKVE 331
>gi|449461779|ref|XP_004148619.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
gi|449518467|ref|XP_004166263.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
Length = 495
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 114/416 (27%), Positives = 187/416 (44%), Gaps = 54/416 (12%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA 178
L F+LP GFP+SV+ DYL Y + + V I L T +LL A+G+G + T AA
Sbjct: 113 LIKDFILPTGFPESVSDDYLQYMIRQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTTTAA 172
Query: 179 ----INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
I WV KDGIG + ++ + FG FD +PK WR++AD + +A ++ TP +P +
Sbjct: 173 SAVAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLATPLYPSY 232
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
F+ + + ++ A ++ + FA N E+ AK E +V++ +G+ +GI
Sbjct: 233 FLPLASLGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEIAAKEEVWEVVAQLLGLAIGIL 292
Query: 294 LANHIGSSMPFALASFSVV--TW-----IHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
+ + P + S+SV+ TW +H++ +S + T+N RA ++ ++L
Sbjct: 293 ILD-----TPGLVNSYSVLSATWLSMRLLHLWLRYQSLAVLHFNTINLKRARILARAHVL 347
Query: 347 SGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDV 406
+ P E N EE + F + + V +E +GS+ S
Sbjct: 348 HNKVPGTVECNSEESILVWERF--------------TRPSIIFGVPLEE--MMGSERSST 391
Query: 407 VNNKEDAHALFSLYEDEGYILT----EHGGKFCVVLKESALPQDMLKSLFQASYLYWLER 462
V L LY +E YIL + K V K A +L+S++Q YWL +
Sbjct: 392 V------MKLLKLYANEKYILMLDSQDKDLKVFVSFKVGASSMTVLRSIWQT---YWLNK 442
Query: 463 NAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRI 518
+ +L SL ++ +FN GW T L + +PN I
Sbjct: 443 HLDTTENVIH-------QLAQSLSEMEDKFNEFVQLLEGAGWDTHQL-SLKVPNNI 490
>gi|357506015|ref|XP_003623296.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
gi|355498311|gb|AES79514.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
Length = 492
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 104/356 (29%), Positives = 169/356 (47%), Gaps = 43/356 (12%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA---- 178
F+LP GFP SV+ DYL Y L + V I + T +LL A+G+G + TAAA
Sbjct: 110 FILPAGFPGSVSDDYLQYMLLQFPTNVTGWICHTIVTSSLLKAVGVGSFSGTTAAASAAA 169
Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
I WV KDGIG + ++ + FG FD +PK WR++AD + +A ++ TP +P +F+ +
Sbjct: 170 IRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLTTPLYPGYFLPL 229
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ ++ A ++ + + FA N E+ AK E +V++ +G+ LGI + +
Sbjct: 230 ASLGNLTKAIARGLKDPSSRVIQSHFAISANLGEIAAKEEVWEVVAQLVGLGLGILILDT 289
Query: 298 IGSSMPFALASFS--VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
G + + S + +V +H++ +S ++ T+N RA + ++L P +
Sbjct: 290 PGLVKSYTVLSSTWLIVRLLHLWLRYESLSVLQFNTINLKRARTLVKSHVLHSTVPGCMD 349
Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH- 414
N EE +LV S K + GS L D ++ E +H
Sbjct: 350 CNKEE-----------------NILVWSQFMKPIII-------FGSPL-DKMDGMERSHF 384
Query: 415 ---ALFSLYEDEGYILTEHGG----KFCVVLKESALPQDMLKSLFQASYLYWLERN 463
AL LY +E YIL + KF V K A +L+S++Q +WL N
Sbjct: 385 MVEALLKLYANEKYILVVNQQLDDLKFYVSFKVGATSVSVLRSIWQT---FWLSEN 437
>gi|194880075|ref|XP_001974360.1| GG21126 [Drosophila erecta]
gi|190657547|gb|EDV54760.1| GG21126 [Drosophila erecta]
Length = 417
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 102/340 (30%), Positives = 164/340 (48%), Gaps = 24/340 (7%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT-AAAIN 180
+ LP+G+PDSV+ DY Y +W + Q S I G L T A+L IG+G I +A
Sbjct: 51 KVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATAT 110
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
W+LK+G G++ +I+ + + G DV+ K WRL AD L + A G+E+ + P +PH I
Sbjct: 111 WILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ ++ + +TRS A R N A+V +K +Q + +G+ L + I
Sbjct: 171 CCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
S F VV +H+Y NLK+ +++ LR+ N R + E+ S + ++VN
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNA 289
Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
E + ++ + S + L K+ E + S + D+V++ D H
Sbjct: 290 MERVTVG------QTVSVSLNIKLGLSVKNLIDEYKS----SSVIEDIVSSF-DPH---- 334
Query: 419 LYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
E +I+ E V L PQD+LK+ F A SYL
Sbjct: 335 ----EHFIIAETKKCLGVYLHFETRPQDVLKAYFFAVSYL 370
>gi|195484210|ref|XP_002090597.1| GE13200 [Drosophila yakuba]
gi|194176698|gb|EDW90309.1| GE13200 [Drosophila yakuba]
Length = 417
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 96/344 (27%), Positives = 161/344 (46%), Gaps = 32/344 (9%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT-AAAIN 180
+ LP+G+PDSV+ DY Y +W + Q S I G L T A+L IG+G I +A
Sbjct: 51 KVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATAT 110
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
W+LK+G G++ +I+ + + G DV+ K WRL AD L + A G+E+ + P +PH I
Sbjct: 111 WILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ ++ + +TRS A R N A+V +K +Q + +G+ L + I
Sbjct: 171 CCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
S F VV +H+Y NLK+ +++ LR+ N R + E+ S + ++VN
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNA 289
Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
E + V + ++LG + ++++ + + + +
Sbjct: 290 MERV-----------------------TVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIEN 326
Query: 419 LYED----EGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
+ E +I+ E V L PQD+LK+ F A SYL
Sbjct: 327 IVSSFDPHEHFIIAETKKCLGVYLHFETRPQDVLKAYFFAVSYL 370
>gi|219119716|ref|XP_002180612.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408085|gb|EEC48020.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 408
Score = 134 bits (338), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 167/383 (43%), Gaps = 34/383 (8%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL---GKGAIPTA 176
+Q LP FP SV Y + + + V+ VL+TQ LL AIG+ A A
Sbjct: 26 LIQHFLPAQFPASVAPGYAQFCGFCFLASVSGSAGMVLSTQTLLLAIGVVGNAHSASVMA 85
Query: 177 AAINWVLKDGIGYLSKIML-SNFG--RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
A+NWVLKDGIG L ++ S G R FD NPK WR+ L ++A L++L+P P
Sbjct: 86 GALNWVLKDGIGQLGGVIFASRMGETRRFDANPKKWRMMGALCLDSASLLDILSPFAPSA 145
Query: 234 FVF-IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
V + A A ++ L ++R+ + N +V AK +Q + + +G +GI
Sbjct: 146 VVLPLACFANALKNIGYLTVGASRAALHQALTRAGNLGDVTAKAGSQSIAAGLLGTGVGI 205
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
L+ +G + F ++ IH CN + Q ++L N +R LV YL G+
Sbjct: 206 GLSAAMGHDAGNFVLGFCGLSLIHQLCNFAALQHVDLHHFNRHRLYLVLDHYLRHGKVLS 265
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVV-NNKE 411
+E+ +E FP L E A+ L +GS L+ +
Sbjct: 266 PREIATKESFFP-----------------LLREDDTAS-----WLSIGSALTAISPGGPN 303
Query: 412 DAHALFSLYEDEGYIL-TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATS 470
+ L DE YI+ + GK VV + A +D++K +F A + L + G+
Sbjct: 304 ELTQLLKACPDESYIINCTNDGKIHVVFLQDAQGEDLVKGMFHACF---LRKKTGLSIDG 360
Query: 471 TSADCAPGGRLEISLDYVQREFN 493
A A ++ + E N
Sbjct: 361 VEAVVATHEETKVRFPHFLEELN 383
>gi|194758785|ref|XP_001961639.1| GF15067 [Drosophila ananassae]
gi|190615336|gb|EDV30860.1| GF15067 [Drosophila ananassae]
Length = 417
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 103/342 (30%), Positives = 164/342 (47%), Gaps = 28/342 (8%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAIN 180
+ LP+G+PDSV+ DY Y +W + Q S I G L T A+L IG+G I +A +
Sbjct: 51 KIFLPKGYPDSVSDDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAYSATVT 110
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
W+LK+G G++ +I + + G DV+ K WRL AD L + A G+E+ + P +PH I
Sbjct: 111 WILKEGSGHVGRIFFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ ++ + +TR+ A R N A+V +K +Q + +G+ L + I
Sbjct: 171 CGSTLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
S F VV +H+Y NLK+ +++ LRT N R + E+ S + ++VN
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYSNLKAVRAVCLRTFNESRYLIALEEFFRSSRMLSPQQVNA 289
Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
E + ++ + S + L K+ E + S + D + + D H
Sbjct: 290 MERVTIG------QTVSVSLNIKLGLSVKNLIDEYK-----SSSVIDNIVSSFDPH---- 334
Query: 419 LYEDEGYILTEHGGKFC--VVLKESALPQDMLKSLFQA-SYL 457
E +I+ E GK C V L P D+LK+ F A SYL
Sbjct: 335 ----EHFIIAE--GKKCLGVYLHFDTRPLDVLKAYFFAVSYL 370
>gi|195579948|ref|XP_002079818.1| GD24150 [Drosophila simulans]
gi|194191827|gb|EDX05403.1| GD24150 [Drosophila simulans]
Length = 417
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 161/344 (46%), Gaps = 32/344 (9%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT-AAAIN 180
+ LP+G+PDSV+ DY Y +W + Q S I G L T A+L IG+G I +A
Sbjct: 51 KVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATAT 110
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
W+LK+G G++ +I+ + + G DV+ K WRL AD L + A G+E+ + P +PH I
Sbjct: 111 WILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ ++ + +TRS A R N A+V +K +Q + +G+ L + I
Sbjct: 171 CCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
S F VV +H+Y NLK+ +++ LR+ N R + E+ S + ++VN
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNA 289
Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
E + V + ++LG + ++++ + + + +
Sbjct: 290 MERV-----------------------TVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIEN 326
Query: 419 LYED----EGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
+ E +I+ + V L PQD+LK+ F A SYL
Sbjct: 327 IVSSFDPHEHFIIAQTKKCLGVYLHFETRPQDVLKAYFFAVSYL 370
>gi|24585018|ref|NP_609897.2| CG10338 [Drosophila melanogaster]
gi|22946770|gb|AAF53692.2| CG10338 [Drosophila melanogaster]
gi|384475974|gb|AFH89820.1| FI20169p1 [Drosophila melanogaster]
Length = 417
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 161/344 (46%), Gaps = 32/344 (9%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT-AAAIN 180
+ LP+G+PDSV+ DY Y +W + Q S I G L T A+L IG+G I +A
Sbjct: 51 KVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATAT 110
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
W+LK+G G++ +I+ + + G DV+ K WRL AD L + A G+E+ + P +PH I
Sbjct: 111 WILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ ++ + +TRS A R N A+V +K +Q + +G+ L + I
Sbjct: 171 CCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
S F VV +H+Y NLK+ +++ LR+ N R + E+ S + ++VN
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNA 289
Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
E + V + ++LG + ++++ + + + +
Sbjct: 290 MERV-----------------------TVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIEN 326
Query: 419 LYED----EGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
+ E +I+ + V L PQD+LK+ F A SYL
Sbjct: 327 IVSSFDPHEHFIIAQTKKCLGVYLHFETRPQDVLKAYFFAVSYL 370
>gi|307106356|gb|EFN54602.1| hypothetical protein CHLNCDRAFT_135084 [Chlorella variabilis]
Length = 486
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 76/242 (31%), Positives = 125/242 (51%), Gaps = 14/242 (5%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
F LPEGFP SV+SDYL Y LW S+QG++S + G+L++QA+L +G
Sbjct: 65 FAACFLPEGFPHSVSSDYLPYQLWDSIQGLSSYLRGMLSSQAMLAGVG------------ 112
Query: 180 NWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
+ LKD G L I+ S G D K WRL AD++ + LE+ +P P F+ +
Sbjct: 113 -FFLKDFTGMLGGILFASTQGSGLDCYAKQWRLLADVMNDIGLALELASPLLPGAFLLLA 171
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
R+ + ++TR FA +RN A++ AK +Q +G++LG+A
Sbjct: 172 CLGSLARAVTGVAGSATRMALTQHFALQRNAADIAAKEGSQETAVTLVGMVLGMAFIRAA 231
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
P ++F +T +H++ N+++ + + + +LN R L+ YL +G+A ++
Sbjct: 232 HGLEPLIWSAFWALTALHVWANVRAMRCLRIASLNQARLGLLLRHYLRTGEALTPGQMAA 291
Query: 359 EE 360
EE
Sbjct: 292 EE 293
>gi|195344882|ref|XP_002039005.1| GM17287 [Drosophila sechellia]
gi|194134135|gb|EDW55651.1| GM17287 [Drosophila sechellia]
Length = 417
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 161/344 (46%), Gaps = 32/344 (9%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT-AAAIN 180
+ LP+G+PDSV+ DY Y +W + Q S I G L T A+L IG+G I +A
Sbjct: 51 KVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATAT 110
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
W+LK+G G++ +I+ + + G DV+ K WRL AD L + A G+E+ + P +PH I
Sbjct: 111 WILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ ++ + +TRS A R N A+V +K +Q + +G+ L + I
Sbjct: 171 CCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
S F VV +H+Y NLK+ +++ LR+ N R + E+ S + ++VN
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNA 289
Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
E + V + ++LG + ++++ + + + +
Sbjct: 290 MERV-----------------------TVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIEN 326
Query: 419 LYED----EGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
+ E +I+ + V L PQD+LK+ F A SYL
Sbjct: 327 IVSSFDPHEHFIIAQTKKCLGVYLHFETRPQDVLKAYFFAVSYL 370
>gi|222624448|gb|EEE58580.1| hypothetical protein OsJ_09902 [Oryza sativa Japonica Group]
Length = 479
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 166/361 (45%), Gaps = 48/361 (13%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
+PEGFP SVT DY+ + +W ++QG+++ I +L+TQALL AIG+G K A A W L
Sbjct: 71 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 130
Query: 184 KDGIGYLSKIMLSNF-----------------------------GRHFDVNPKGWRLFAD 214
+D G L I+ + + G + D N K WRL AD
Sbjct: 131 RDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 190
Query: 215 LLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 274
+ + +++L+P FP + I RS + +TR+ FA N A++ A
Sbjct: 191 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 250
Query: 275 KGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNP 334
K +Q ++ +G+ LG+ LA+ SF +T HMY N K+ QS+ L TLN
Sbjct: 251 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 310
Query: 335 YRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE 394
R+S++ ++ +G+ ++V+ +E + P + SS K +++
Sbjct: 311 ERSSILLQYFMDNGEVLTPQQVSKQEHILPYW----------------SSWRKFLRIKLP 354
Query: 395 H-RLQLGSKLSDVVN-NKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLF 452
H + LG+K S + + + Y +E Y L + G + + + A D+L S
Sbjct: 355 HEHVHLGAKASRLTHPDLLQIAKTRPYYRNENYFLLDKEGSVHIFIHKQAAATDILMSFI 414
Query: 453 Q 453
Sbjct: 415 H 415
>gi|452823886|gb|EME30893.1| hypothetical protein Gasu_19060 [Galdieria sulphuraria]
Length = 346
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 77/251 (30%), Positives = 133/251 (52%), Gaps = 8/251 (3%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
+ + + LP G+P+SVT DY N+ W S+QG+ S + V+ T++ L +G+G A PT
Sbjct: 61 KNVLRRLFLPSGYPESVTDDYANFEKWDSLQGLCSYLRQVMVTRSTLSGLGVGNNFAHPT 120
Query: 176 AAAINWVLKDGIGYLSKIMLSNFGR--HFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
AAA+ + ++G G + ++ + +G+ FD + K WRLFAD++ N A +E P FP
Sbjct: 121 AAALMLLWREGFGVVGGLLFT-WGKSSRFDKDSKSWRLFADVINNVALTMEWTAPLFPTP 179
Query: 234 FVF-IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
F + ++A S + +T + F A FA R N ++ +K G +G++L
Sbjct: 180 VFFALHSSANIFHSLCGIAGGATCTAFQAHFAKRNNIGDICSKHGNIGRAVSLVGLLLST 239
Query: 293 AL---ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
++ + +S+PF+ F +T H+Y N+ + +S+ L LN R +V Y+ Q
Sbjct: 240 LCFWSSSLLENSLPFSFGVFCFLTVFHIYFNILALRSLHLAVLNKERGYIVAEHYICHKQ 299
Query: 350 APPVKEVNDEE 360
V V+ +E
Sbjct: 300 LLDVSSVSKKE 310
>gi|218192328|gb|EEC74755.1| hypothetical protein OsI_10514 [Oryza sativa Indica Group]
Length = 479
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 167/365 (45%), Gaps = 48/365 (13%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
+PEGFP SVT DY+ + +W ++QG+++ I +L+TQALL AIG+G K A A W L
Sbjct: 71 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 130
Query: 184 KDGIGYLSKIMLSNF-----------------------------GRHFDVNPKGWRLFAD 214
+D G L I+ + + G + D N K WRL AD
Sbjct: 131 RDLTGMLGGILFTFYQVSNSLANSNRQCHRIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 190
Query: 215 LLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 274
+ + +++L+P FP + I RS + +TR+ FA N A++ A
Sbjct: 191 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 250
Query: 275 KGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNP 334
K +Q ++ +G+ LG+ LA+ SF +T HMY N K+ QS+ L TLN
Sbjct: 251 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 310
Query: 335 YRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE 394
R+S++ ++ +G+ ++V+ +E + P + SS K +++
Sbjct: 311 ERSSIMLQYFMDNGEVLTPQQVSKQEHILPFW----------------SSWRKFLRIKLP 354
Query: 395 H-RLQLGSKLSDVVN-NKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLF 452
H + LG+K S + + + Y +E Y L + G + + + A D+L S
Sbjct: 355 HEHVHLGAKASRLTHPDLLQIAKTRPYYRNENYFLLDKEGSVHIFIHKQAAATDILMSFI 414
Query: 453 QASYL 457
L
Sbjct: 415 HGLVL 419
>gi|384252100|gb|EIE25577.1| DUF647-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 286
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 121/229 (52%), Gaps = 2/229 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAINWVL 183
LP+G+PD+VTSDY+ + +W S+Q + S + G++ + A+L IG+GK A P A + +
Sbjct: 50 LPDGYPDAVTSDYVGFQIWDSIQALCSYVRGMICSNAILTGIGVGKQATSPLVAVFQFFV 109
Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
+D G ++ + G D K WRLFAD+ N + L++ +P FP F+ +
Sbjct: 110 RDLAGMCGGVLFAFTQGSQLDAYAKQWRLFADIFNNVGYALDLASPLFPDQFLLLACLGS 169
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
R+ + +TR+ F+ N ++ AK +Q + IG++LG+A S
Sbjct: 170 LARAVTGVAGGATRAALTQHFSRADNAGDISAKEGSQETATTLIGMILGMAFLQAASDSP 229
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
A F ++T++H+Y N+++ ++++L +LN R ++ S P
Sbjct: 230 VAVWAVFLLLTFLHIYANVRAVRALQLTSLNCSRLDILLKHSSASKVTP 278
>gi|321459639|gb|EFX70690.1| hypothetical protein DAPPUDRAFT_60988 [Daphnia pulex]
Length = 264
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 79/258 (30%), Positives = 130/258 (50%), Gaps = 13/258 (5%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+G+P SV+ DYL Y +W ++Q AS +SG L+T+A+L +G+G + P AA + W++
Sbjct: 6 LPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAATLMWLI 65
Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTP-------AFPHHFV 235
KDG G + +I+ + G D + K WR FADLL + A LE+ P
Sbjct: 66 KDGTGMVGRILFAWCNGTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGGSNGTMT 125
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
I AG +S + +TR+ A + N A+V AK +Q + ++ + L
Sbjct: 126 SILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALITSLWLL 185
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+ + F + T +H++ NLK+ +++ + TLN R ++ ++ + VKE
Sbjct: 186 PILDGATGLTWLMFFLFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTKYVASVKE 245
Query: 356 VNDEEP----LFPAFHFF 369
VN EP L P +F
Sbjct: 246 VNRLEPVIMGLIPTGNFI 263
>gi|449018513|dbj|BAM81915.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 600
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 118/453 (26%), Positives = 200/453 (44%), Gaps = 55/453 (12%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL----G 169
+ + L LP +P SVT+ Y Y+ W + + + VL T ALL + G+
Sbjct: 148 EPLQRLLASVFLPTNYPHSVTASYAPYAFWHFLHNTLTACNSVLGTTALLVSAGVVADPS 207
Query: 170 KGA------------IPTAAA----INWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLF 212
+ A +PTAA+ +W+LKDG+G+L ++ S + FD + K R+
Sbjct: 208 RAASTALTRDPALEHLPTAASASAVTSWILKDGVGHLVRLAFGSRYAHRFDGDLKRLRVL 267
Query: 213 ADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV 272
AD+L + GLE+++ +FP F+ + + A A + A ++ STRS Y A R N +V
Sbjct: 268 ADVLWHLGTGLELVSRSFPDSFIGLASGAYALKGMAHVVFNSTRSTIYRTVATRSNIGDV 327
Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPF-ALASFSVVTWIHMYCNLKSYQSIELRT 331
AKG+AQ + ++ +G+ +G++L + F A F +V MY ++S + + L
Sbjct: 328 TAKGDAQSIAAELLGMSIGLSLYRGFAENNLFWTWAIFGIVVAGQMYSRIQSMRVLVLPL 387
Query: 332 LNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIK-SANKSQLLVLSSEAKDAA 390
LN RA L+ Y+ S A P+ + ++ + LLVLS E
Sbjct: 388 LNLQRAVLLVDAYVASRLAAAATASEAIRPIRKQEYVSNVRENGTMKTLLVLSPEEVAQR 447
Query: 391 VEIEHRLQLGSKLSDVV---------NNKEDAHALFSLYEDEGYIL--------TEHGGK 433
H S + VV + D L ++E E +++ + +
Sbjct: 448 ERFLHWQSHWSTVPQVVPDVRMHWFSESASDVGTLLRVFETEAFVVGRCRRQHPAQSKEQ 507
Query: 434 FCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFN 493
V+K A ++ L++L A +L + +T D RLE++ D+ +
Sbjct: 508 VGFVMKTGAGAREALRALIAAIFLVRQPQ-------ATERDALLFSRLELTEDFFR---- 556
Query: 494 HVKSDSASVGWVTDGLIARPLPNRIRPGYVEPS 526
++AS GW D + NR+ VEPS
Sbjct: 557 ----EAASRGWQLDCVHGVGQRNRLLCNAVEPS 585
>gi|186478458|ref|NP_001117280.1| uncharacterized protein [Arabidopsis thaliana]
gi|332190946|gb|AEE29067.1| uncharacterized protein [Arabidopsis thaliana]
Length = 353
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 177/370 (47%), Gaps = 39/370 (10%)
Query: 156 VLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFA 213
+L+TQALL AIG+G K A A W L+D G L I+ + + G + D N K WRL A
Sbjct: 2 MLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVA 61
Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
DL+ + +++L+P FP F+ + RS + +TR+ FA + N A++
Sbjct: 62 DLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADIS 121
Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTL 332
AK +Q ++ +G+ LG+ LA S P A+ SF +T HMY N ++ + + L +L
Sbjct: 122 AKEGSQETMATMMGMSLGMLLAR-FTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSL 180
Query: 333 NPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
N R+S++ + ++ +GQ ++V+ E + P + ++S N L
Sbjct: 181 NFERSSILLTHFIQTGQVLSPEQVSSMEGVLPLWA-TSLRSTNSKPL------------- 226
Query: 393 IEHRLQLGSKLSDVVNNKEDAHALF-----SLYEDEGYILTEHGGKFCVVLKESALPQDM 447
R+QLG ++S + + D L S Y++ Y+L G V+L + + P D+
Sbjct: 227 -HKRVQLGVRVSSL--PRLDMLQLLNGVGASSYKNAKYLLAHIKGNVSVILHKDSKPADV 283
Query: 448 LKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTD 507
LKS A L A ++ STS ++ + ++ + S GW T+
Sbjct: 284 LKSYIHAIVL------ANLMEKSTSFYSEGEA-------WIDKHYDELLHKLRSGGWKTE 330
Query: 508 GLIARPLPNR 517
L++ + R
Sbjct: 331 RLLSPSITWR 340
>gi|328703155|ref|XP_001949028.2| PREDICTED: UPF0420 protein C16orf58 homolog [Acyrthosiphon pisum]
Length = 457
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 102/396 (25%), Positives = 178/396 (44%), Gaps = 43/396 (10%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
+ LP GFP+SV+ DY Y +W ++Q S IS +LAT A+L +G+G A AA+I
Sbjct: 54 EVFLPRGFPNSVSKDYTEYQIWDTIQAFCSTISNILATNAILKGVGVGDAQATALAASIT 113
Query: 181 WVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEML--TPAFPHHFVFI 237
++LKDG G + +I + G D K WRL AD L + A +E+L P + I
Sbjct: 114 YILKDGAGMIGRITFAWCKGSELDTQCKKWRLRADALNDLAIFIELLLSIPWIRQFSLII 173
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ + +S ++ +TR+ A R N +V AK +Q I ++G+ +
Sbjct: 174 LSFSSCAKSIVSVAGGATRAALTQHQAIRDNMGDVSAKDGSQETCINLIAFLIGLIMLPI 233
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
+ + + + VVT +H++ N K+ +S+ + N R L YL + + +
Sbjct: 234 VENRILLIWMIYLVVTSLHLFANYKAVKSLNINVFNSARYDLTLKYYL----SNDTQNHD 289
Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE----DA 413
++P + NK + L E ++QLG+ + +++ D
Sbjct: 290 VQKPDY----------INKREACFLGDEKL-----CSFKIQLGTSIHELLYTNTLTTWDI 334
Query: 414 HALFSLYEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQASYLYWL----------- 460
LY+D YIL H VVL ++ +++LK+ F A+ L L
Sbjct: 335 IDHIELYKDYLYILIVDTHRDIIRVVLDKNINTENILKAYFHANILGHLICPKNKFSLIK 394
Query: 461 ---ERNAGIVATSTSADCAPGGRLEISLDYVQREFN 493
R+ +T+T C +++S ++V + F+
Sbjct: 395 LNSLRSMKYTSTNTQFCCTHSEYVQLSCEFVNKNFD 430
>gi|356523306|ref|XP_003530281.1| PREDICTED: UPF0420 protein-like [Glycine max]
Length = 480
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 167/360 (46%), Gaps = 41/360 (11%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
E+ F+LP GFP SV+ DYL+Y L + V I L T +LL A+G+G TAA
Sbjct: 92 EIIKDFVLPAGFPGSVSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKAVGIGSFTGTTAA 151
Query: 178 A----INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
A I WV KDGIG + ++ + FG FD +PK WR++AD + +A ++ T +P
Sbjct: 152 ASAAAIRWVSKDGIGAVGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLTTQLYPA 211
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
+F+ + + ++ A ++ + FA N EV AK E +V++ +G+ LGI
Sbjct: 212 YFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVVAQLVGLSLGI 271
Query: 293 ALANHIGSSMPFALASFSVVTW-----IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
+ + G + + S +TW +H++ +S ++ T+N RA ++ ++L
Sbjct: 272 LILDTPGLVKSYGVIS---LTWLSMRLLHLWLRYESLSVLQFNTINIKRARILVKSHVLH 328
Query: 348 GQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVV 407
P + N EE + F K K + L E D VE + +
Sbjct: 329 STVPGCTDCNREENILAWSQFMKPKI-----IFGLPLEKMD-GVERSYFM---------- 372
Query: 408 NNKEDAHALFSLYEDEGYILTEHGG----KFCVVLKESALPQDMLKSLFQASYLYWLERN 463
AL LY E YIL + +F V K A +L+S++Q+ +WL N
Sbjct: 373 -----VEALIKLYASEKYILMVNQQTEDLRFYVSFKVGATNVSVLRSVWQS---FWLSEN 424
>gi|298709448|emb|CBJ31354.1| putative glutamate/ornithine acetyltransferase (ISS) [Ectocarpus
siliculosus]
Length = 789
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 133/278 (47%), Gaps = 38/278 (13%)
Query: 112 LWDECRELFV---------------------QFMLPEGF-PDSVTSDYLNYSLWRSVQGV 149
+WDE +E FV Q +L F P+ VT DY +++ WR Q
Sbjct: 147 VWDESKERFVEADSLRDRSNSQRVRGAVGGLQLLLRHWFIPEDVTPDYYHFTFWRMFQRF 206
Query: 150 ASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIM-LSNFGRHFDVNPKG 208
+ V TQALL A+G+ + AAA++WVLKD +G +I+ S GR FD + K
Sbjct: 207 VAGTINVFGTQALLLALGIKAKRLGAAAAMSWVLKDALGKFGRILWASKMGRRFDSDAKR 266
Query: 209 WRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRN 268
WR + LL A GLE++T FP F+ + A A + + + L ++TR+ Y FA N
Sbjct: 267 WRFRSSLLYAAGNGLEIVTYVFPASFLVLAAMANSFKQMSMLTSSATRNTIYRSFARGEN 326
Query: 269 FAEVIAK---------------GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
++ AK GEAQ V +G++ GI L +G+S +A++ +++
Sbjct: 327 IGDITAKACPPAAAMPNAKKHEGEAQIAVVDVLGMLSGIGLCTLVGTSRQTIVAAYVILS 386
Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
+ + +++ LN R L+ +++ SG P
Sbjct: 387 LVDISAIYNEIRAVCFSVLNHERTHLLVRDFVASGSDP 424
>gi|449017626|dbj|BAM81028.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 518
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 83/266 (31%), Positives = 134/266 (50%), Gaps = 7/266 (2%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPT 175
+ L +LP +P +V+SDY + W ++ + + VL TQ++L A+G+G A+P
Sbjct: 92 QRLVQTLLLPADYPSAVSSDYDRFWRWNILRHTLLEAAEVLGTQSMLLALGVGGASALPL 151
Query: 176 AAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
AAA W+LKDG+GY +K+ L + D +PK +R + D + EMLT AFP F
Sbjct: 152 AAAWKWILKDGLGYFAKVALATQLAPRVDNDPKRFRFYGDFVMALGTLFEMLTLAFPPWF 211
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+ + + R AA + F F+ R N +V +K E+Q +V + GI GIA+
Sbjct: 212 LALASTGNLLRKAADVATGPAYRVFLYHFSIRNNSGDVSSKSESQVVVGRLSGIGAGIAV 271
Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG--QAPP 352
+ + ++ +++ H+Y +S +ELRTLN R V + Y S + P
Sbjct: 272 SAVTSHDASALVMAYGLLSAGHLYATYQSVIRLELRTLNRTRLEYVLNVYTASSGKRVPS 331
Query: 353 VKEVNDEEP---LFPAFHFFKIKSAN 375
+ E N E L P H ++I+ N
Sbjct: 332 IAEANAAEQKSFLRPWSHVYRIRFVN 357
>gi|327350378|gb|EGE79235.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 497
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 179/358 (50%), Gaps = 31/358 (8%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP G+P SV+ DYL Y ++ S+Q +S I+G+LA++A+L +G+G A PT A ++ VL
Sbjct: 74 LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 133
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAA 241
D +G ++ I+ ++ G + K +RL AD+L ++A + L+PAFP H + + A +
Sbjct: 134 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 193
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
R+ + S+++ + FA N E+ AK +Q V +G++ G + + + S
Sbjct: 194 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRV--S 251
Query: 302 MPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
PFA A+ + +H+ N + +S+ + TLN RA++VFS G+ K+ + E
Sbjct: 252 TPFATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCE 311
Query: 361 PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE---HRLQLGS--KLSDVVNNKEDAHA 415
+F + K +S A + I L GS +++ + + D
Sbjct: 312 RIFERDGVLRWK----------ASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRR 361
Query: 416 LFSLYEDEGYIL----TEHGGKFCVVLKESALPQDMLK----SLFQASYLYWLERNAG 465
L L+ +E YIL ++ GG +VLK S P LK +L A + L AG
Sbjct: 362 LLHLFREEEYILWFNSSDKGGT--IVLKNSVTPLSQLKAWSHALMVAKRIADLREGAG 417
>gi|239610850|gb|EEQ87837.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ER-3]
Length = 498
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 179/358 (50%), Gaps = 31/358 (8%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP G+P SV+ DYL Y ++ S+Q +S I+G+LA++A+L +G+G A PT A ++ VL
Sbjct: 75 LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 134
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAA 241
D +G ++ I+ ++ G + K +RL AD+L ++A + L+PAFP H + + A +
Sbjct: 135 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 194
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
R+ + S+++ + FA N E+ AK +Q V +G++ G + + + S
Sbjct: 195 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRV--S 252
Query: 302 MPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
PFA A+ + +H+ N + +S+ + TLN RA++VFS G+ K+ + E
Sbjct: 253 TPFATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCE 312
Query: 361 PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE---HRLQLGS--KLSDVVNNKEDAHA 415
+F + K +S A + I L GS +++ + + D
Sbjct: 313 RIFERDGVLRWK----------ASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRR 362
Query: 416 LFSLYEDEGYIL----TEHGGKFCVVLKESALPQDMLK----SLFQASYLYWLERNAG 465
L L+ +E YIL ++ GG +VLK S P LK +L A + L AG
Sbjct: 363 LLHLFREEEYILWFNSSDKGGT--IVLKNSVTPLSQLKAWSHALMVAKRMADLREGAG 418
>gi|261206350|ref|XP_002627912.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239592971|gb|EEQ75552.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 497
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 179/358 (50%), Gaps = 31/358 (8%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP G+P SV+ DYL Y ++ S+Q +S I+G+LA++A+L +G+G A PT A ++ VL
Sbjct: 74 LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 133
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAA 241
D +G ++ I+ ++ G + K +RL AD+L ++A + L+PAFP H + + A +
Sbjct: 134 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 193
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
R+ + S+++ + FA N E+ AK +Q V +G++ G + + + S
Sbjct: 194 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRV--S 251
Query: 302 MPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
PFA A+ + +H+ N + +S+ + TLN RA++VFS G+ K+ + E
Sbjct: 252 TPFATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCE 311
Query: 361 PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE---HRLQLGS--KLSDVVNNKEDAHA 415
+F + K +S A + I L GS +++ + + D
Sbjct: 312 RIFERDGVLRWK----------ASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRR 361
Query: 416 LFSLYEDEGYIL----TEHGGKFCVVLKESALPQDMLK----SLFQASYLYWLERNAG 465
L L+ +E YIL ++ GG +VLK S P LK +L A + L AG
Sbjct: 362 LLHLFREEEYILWFNSSDKGGT--IVLKNSVTPLSQLKAWSHALMVAKRIADLREGAG 417
>gi|324514354|gb|ADY45840.1| Unknown [Ascaris suum]
Length = 394
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/249 (30%), Positives = 126/249 (50%), Gaps = 2/249 (0%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPT 175
+ FV LP G+P SV++DY++Y +W +VQ AS ++ LAT+A+L G+G + A
Sbjct: 41 KAFFVDVFLPRGYPHSVSADYMSYQIWDTVQAFASSMTSALATEAVLRGAGVGDENASAL 100
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
AA + W+++DG+G LS+I + D + K WRL AD L + AF LE+L A P F
Sbjct: 101 AATLTWLIRDGLGMLSRIGFAWLKSAQLDADCKRWRLVADFLNDLAFSLELLAAALPSLF 160
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+ + RS A+ +TR+ A N A+V AK +Q + ++ + L
Sbjct: 161 TLLVCLSSLARSIVAVAGGATRTTVVQHQARCNNVADVSAKDGSQETLVNVTALLCSLIL 220
Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
I + + T +H++ N + ++ TLN ++ + ++ +G VK
Sbjct: 221 LPLISGRTVLIWMFYILFTGVHLFANYAAVTALHFETLNQKLLRIITNHFVETGDVFSVK 280
Query: 355 EVNDEEPLF 363
N+ EPL
Sbjct: 281 VANELEPLL 289
>gi|225452488|ref|XP_002278851.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
Length = 514
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 111/419 (26%), Positives = 189/419 (45%), Gaps = 37/419 (8%)
Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
+N L+ C F +P+ V+ +Y++Y W+ + V S VLATQA+ AIG+
Sbjct: 102 LNNLFALCSSAVRDFFIPQ----KVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAIGI 157
Query: 169 G-KGAIPTAAAINWVLKDGIGYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEML 226
G ++P+AAA+NWVLKDG+G L + I ++ FD N K R +L + + G+E+L
Sbjct: 158 GYSRSLPSAAALNWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELL 217
Query: 227 TPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
TPAFP++F+ + A + +T S + FA N EV AK + Q + +
Sbjct: 218 TPAFPNYFLLLATIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCFDNL 277
Query: 287 GIMLGIAL------ANHIGSSMPFALAS-FSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
G+ML L + + +PF + FS + +Y LK + L+TL R +
Sbjct: 278 GLMLAAVLNILCKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLK---HVHLQTLTKDRLEI 334
Query: 340 VFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL 399
+ + ++ G P EV+ EE + +F + K + ++ L + + + + L
Sbjct: 335 ILNMWIQLGYVPSPAEVSKEEGI--SFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSL 392
Query: 400 GSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY--- 456
S+ D +A + E +G +L L+E A D++ L QA Y
Sbjct: 393 SSE--DFYFICMEACRRLTRPEQQGILLC---------LREGASTTDVISGLLQACYVRK 441
Query: 457 --LYWLERNAGIVATSTSADCAPGGRLEISLD---YVQREFNHVKSDSASVGWVTDGLI 510
L R IV ++D ++ D QR+ N + + +GW ++
Sbjct: 442 KLLLNRTRWKNIVEDCDTSDTFVREWFKVVEDSKQSAQRDLNLLNEEMLRLGWTVKNIL 500
>gi|219129733|ref|XP_002185036.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403531|gb|EEC43483.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 627
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 102/344 (29%), Positives = 164/344 (47%), Gaps = 27/344 (7%)
Query: 25 DDDSSGSFYSLLLF---VPSLLYCFCHLQVATAIARTATSSEDD---GNKE---YDAVWE 75
D DS+ S L+F + L C Q + +AT SE D G + YD + E
Sbjct: 91 DRDSALVSASDLVFNGSTSAALTCSPEEQTSRTSQSSATFSEVDVLYGRRAVLVYDPLQE 150
Query: 76 --VKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSV 133
VK S++ + + D TK V A +SL+ ++ L L + F+ P V
Sbjct: 151 RYVKVSEKNR-VADSTKQESVALRARRSSLARFITTKIL----PRLSLAFL-----PSGV 200
Query: 134 TSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT---AAAINWVLKDGIGYL 190
T+DY + WR +Q + V TQ+LL A+ + A +AA+NWVLKD +G +
Sbjct: 201 TNDYYRFVRWRILQRFVNANLNVFGTQSLLLALRIKSSASQLGALSAALNWVLKDALGKI 260
Query: 191 SKIM-LSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAA 249
+++ S GR FD + K WR + + A GLE++T FP F+ A + +
Sbjct: 261 VRMLWASRMGRRFDSDAKRWRFRSSFVFAAGNGLEIITYVFPSLFLLWATLANCCKQISM 320
Query: 250 LIQASTRSCFYAGF--AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALA 307
L +STR+ Y F +R N ++ AKGEAQ + +GI G+ L+ +G+S+ LA
Sbjct: 321 LTSSSTRTSIYNSFRDGSRENIGDITAKGEAQIAIVDLLGIASGVTLSRTVGTSIRAVLA 380
Query: 308 SFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
+ + I + C +++ R +N R V +++ + Q P
Sbjct: 381 VYVTLQAIEIVCVYHQLRAVTYRVMNFERMISVVADFCQARQGP 424
>gi|296087683|emb|CBI34939.3| unnamed protein product [Vitis vinifera]
Length = 559
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 112/419 (26%), Positives = 189/419 (45%), Gaps = 37/419 (8%)
Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
+N L+ C F +P+ V+ +Y++Y W+ + V S VLATQA+ AIG+
Sbjct: 147 LNNLFALCSSAVRDFFIPQ----KVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAIGI 202
Query: 169 G-KGAIPTAAAINWVLKDGIGYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEML 226
G ++P+AAA+NWVLKDG+G L + I ++ FD N K R +L + + G+E+L
Sbjct: 203 GYSRSLPSAAALNWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELL 262
Query: 227 TPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
TPAFP++F+ + A + +T S + FA N EV AK + Q + +
Sbjct: 263 TPAFPNYFLLLATIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCFDNL 322
Query: 287 GIMLGIAL------ANHIGSSMPFALAS-FSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
G+ML L + + +PF + FS + +Y LK + L+TL R +
Sbjct: 323 GLMLAAVLNILCKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLK---HVHLQTLTKDRLEI 379
Query: 340 VFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL 399
+ + ++ G P EV+ EE + +F + K + ++ L + + + + L
Sbjct: 380 ILNMWIQLGYVPSPAEVSKEEGI--SFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSL 437
Query: 400 GSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY--- 456
S+ D +A + E +G +L L+E A D++ L QA Y
Sbjct: 438 SSE--DFYFICMEACRRLTRPEQQGILLC---------LREGASTTDVISGLLQACYVRK 486
Query: 457 --LYWLERNAGIVATSTSADCAPG---GRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
L R IV ++D +E S QR+ N + + +GW ++
Sbjct: 487 KLLLNRTRWKNIVEDCDTSDTFVREWFKVVEDSKQSAQRDLNLLNEEMLRLGWTVKNIL 545
>gi|255562731|ref|XP_002522371.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
communis]
gi|223538449|gb|EEF40055.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
communis]
Length = 523
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 102/341 (29%), Positives = 159/341 (46%), Gaps = 30/341 (8%)
Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGI 187
P VT +Y++Y W+ + V S VLATQA+ A+G+G ++P+AAA+NWVLKDG+
Sbjct: 131 IPKQVTENYMDYVKWKLLHRVFSSALQVLATQAMFRAVGIGYSRSLPSAAALNWVLKDGL 190
Query: 188 GYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
G LS+ I ++ FD N K R +L + GLE+LTPAFP F+ + A +
Sbjct: 191 GRLSRCIYTASLASAFDTNLKRVRFSTSILFSLGIGLELLTPAFPKQFLLLATIANIAKQ 250
Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA------NHIGS 300
+ +T S + FA N EV AK + Q + +G+ML L + +
Sbjct: 251 ISLACHLATGSAIHRSFAVADNLGEVSAKAQIQTVCFDNLGLMLAALLNMLFKNNQRLLA 310
Query: 301 SMPFALAS-FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
+PF + FS + +Y LK + L+TL R ++ ++ G+ P EV+ +
Sbjct: 311 GLPFVVYPIFSAIDLFGIYQGLK---HVHLQTLTKDRLEMILDSWIALGRVPTPAEVSKK 367
Query: 360 EPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVN---NKEDAHAL 416
E F F + K + + AKD SKLS + + ED + +
Sbjct: 368 E----GFEFLRTKGKELWPIRIGCLNAKDQI----------SKLSVIAMQSLSSEDYYFI 413
Query: 417 FSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYL 457
G T+ G + L+E A D++ L QA Y+
Sbjct: 414 CVENSYRGLTRTKQQG-ILLSLREGAGTADVIMGLLQACYI 453
>gi|403368796|gb|EJY84239.1| hypothetical protein OXYTRI_18022 [Oxytricha trifallax]
Length = 504
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 122/436 (27%), Positives = 207/436 (47%), Gaps = 22/436 (5%)
Query: 94 VVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQI 153
V + N +S L NK++ E F LP G+P+SV YL +S++ ++ ++
Sbjct: 71 VTVISKNDRISWGLVKNKIYIGLFETF----LPVGYPNSVAEGYLKFSIFNNLSALSITA 126
Query: 154 SGVLATQALLYAIG-LGKGAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRL 211
L+ Q+L AIG A AAA WVLKDGIG L I+ S +GR+FD + K WR
Sbjct: 127 MSFLSAQSLFVAIGSTMTQANLAAAAYTWVLKDGIGQLGAILFASRYGRNFDEDIKKWRF 186
Query: 212 FADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAE 271
A + N + +E+LT FP+HF+ + + A G++ L+ A++R+ FA R N +
Sbjct: 187 MAMIALNISIYIEILTLRFPNHFLALASIANIGKNICFLLSAASRASINVQFAKRNNIGD 246
Query: 272 VIAKGEAQGMVSKAIGIMLGIALANHIG-SSMPFALASFSVVTWIHMYCNLKSYQSIELR 330
+ K +Q S G+ +G+ L+ I SS+ F ++T +++ + ++ + I+
Sbjct: 247 ISGKSVSQFTASTLCGVGIGLMLSKLIDISSISQLYPVFMILTILNIATSYQASKVIDEV 306
Query: 331 TLNPYRASLVFSEYLLSGQAP--PVKEVNDEE----PLFPAFHFFK-IKSANKSQLLVLS 383
LN RA ++F+EY S + V+E N +E P + ++ K IK +++ VL+
Sbjct: 307 YLNNQRAFIIFNEYFKSNKKNFLTVEEGNIQEIFYMPNYLNMNYCKFIKFGHRTIGKVLA 366
Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKES-- 441
+ +K LQL + V + + L + Y + T +G KF + L S
Sbjct: 367 T-SKPHYYTKSVILQLEKNDRNFVYHVK----LTNWYTRKLNFFTNNGRKFEIHLNMSKN 421
Query: 442 ALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSAS 501
A D++K+ + A L N V A+ +++ + FN ++
Sbjct: 422 ATNFDIMKAYYFARLLDEQLFNFQKVQ-RIDANIIEQSIKTAEIEFQETSFNEIELKLKQ 480
Query: 502 VGWVTDGLIARPLPNR 517
GW D + P NR
Sbjct: 481 HGWSLDYIYLDPRKNR 496
>gi|453080329|gb|EMF08380.1| DUF647-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 464
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 187/367 (50%), Gaps = 37/367 (10%)
Query: 107 LSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI 166
+S+ LW + ++F LP GFP SVT DYL Y ++ S+Q +S I+ +L+++A+L +
Sbjct: 45 ISIASLWHKTVDVF----LPVGFPQSVTDDYLEYQIYDSLQAFSSSIAAMLSSRAVLEGV 100
Query: 167 GLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLE 224
G+G A T A + +L+D +G ++ IM ++ FG + K +RL AD+L + F L+
Sbjct: 101 GVGDATANATTALLLSILQDSVGRITTIMFAHRFGMSLEPECKMYRLLADILNDTGFVLD 160
Query: 225 MLTPAFPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
L+P FP V I + + RS + S ++ A FA + N E+ AK +Q V
Sbjct: 161 CLSPIFPKPTRVVILSFSSILRSLCGVAAGSAKASLSAHFAKQGNLGELNAKDSSQETVI 220
Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
G++ G + + + + A +++ IH+ N ++ ++++++TLN RA+L++
Sbjct: 221 SLAGMLFGSWVITWVKTPVATWTALILLLS-IHLETNRRAVRAVKMQTLNRQRATLLY-H 278
Query: 344 YLLSGQAPPVKEVNDEEPLFPAFHFFKIKS----------ANKSQLLVLSSEAKDAAVEI 393
+L G P EV D+E +F + + A S+LL AAV
Sbjct: 279 HLRKGHLPTPIEVADQELIFERDGILRSREGRAIGHCSVGAPMSRLL--------AAVGQ 330
Query: 394 EHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL----TEHGGKFCV--VLKESALPQDM 447
+H+ + S V++++ + + LY D I+ + GK V VLK++A + +
Sbjct: 331 QHK----TTKSIRVDDQQHLNRILELYSDAQCIVWCDPSSRPGKASVFIVLKKAADSKAL 386
Query: 448 LKSLFQA 454
+ +QA
Sbjct: 387 TLAWWQA 393
>gi|241595406|ref|XP_002404468.1| conserved hypothetical protein [Ixodes scapularis]
gi|215502351|gb|EEC11845.1| conserved hypothetical protein [Ixodes scapularis]
Length = 472
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 159/376 (42%), Gaps = 55/376 (14%)
Query: 111 KLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG- 169
+ W R +F + LP GFPD+V+ DY Y +W S+Q AS I+G LATQA+L +G+G
Sbjct: 36 RKWSSLRAVFSEIFLPSGFPDTVSRDYKEYQIWDSIQAFASSITGTLATQAVLTGVGVGD 95
Query: 170 KGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKG--------------------W 209
+ A AA W+LKDG G + +I+ + R+ +
Sbjct: 96 QTASVLAATTTWILKDGTGMIGRILFALVSRYLGMCCATHNKYCAHSNDSTTLNHADNYC 155
Query: 210 RLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNF 269
LFAD+L +AA LE+++ + + + +S + +TR+ A N
Sbjct: 156 SLFADILNDAAILLELMSQHLKGYITLVLCISSVAKSVVGVAGGATRAALTQHQARSNNM 215
Query: 270 AEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIEL 329
A+V AK +Q + + + L + S F++ T +H++ N ++ + +
Sbjct: 216 ADVSAKDGSQETLVNLAAFLCSLLLLRLLAGSQILVYGIFTIFTLLHIFANFRAVSCVVM 275
Query: 330 RTLNPYRASLVFSEYLL-SGQAPPVKEVNDEEP--LFPAFHFFKIKSANKSQLLVLSSEA 386
T N R SL+ +L +G V++VN +E L HF +
Sbjct: 276 ETFNRSRYSLLVKHFLEEAGDIASVEKVNKQESVWLRCGRHFVNV--------------- 320
Query: 387 KDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL-----TEHGGKFCVVLKES 441
LGS LS V +D + + Y+L + VVL E
Sbjct: 321 -----------NLGSPLSSVAATHDDLCSRAPKEKQARYLLKVVRDSSRHYDINVVLHEQ 369
Query: 442 ALPQDMLKSLFQASYL 457
P+D ++++F + L
Sbjct: 370 CTPRDQMEAMFHSFLL 385
>gi|356566391|ref|XP_003551415.1| PREDICTED: uncharacterized protein LOC100783455 [Glycine max]
Length = 525
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 79/251 (31%), Positives = 128/251 (50%), Gaps = 10/251 (3%)
Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIP 174
C + F +P G VT +Y+ Y W+ + V S VLATQA+ A+G+G ++P
Sbjct: 119 CGSVVWDFFIPRG----VTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSCSLP 174
Query: 175 TAAAINWVLKDGIGYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
+AAA+NWVLKDG+G LS+ I ++ FD N K R +L A+ GLE+LTPAFP
Sbjct: 175 SAAALNWVLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAFPRC 234
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG-- 291
F+ + A + + +TRS + FA N E+ AK + Q + +G+ML
Sbjct: 235 FLLLATIANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLMLAAL 294
Query: 292 --IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
+ + NH F + + ++ + +++ L+TL R ++ S ++ G
Sbjct: 295 VNLWIENHRRQQAGFHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIECGY 354
Query: 350 APPVKEVNDEE 360
P EV+++E
Sbjct: 355 VPSPAEVSEKE 365
>gi|350646547|emb|CCD58759.1| hypothetical protein Smp_142110 [Schistosoma mansoni]
Length = 423
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 133/274 (48%), Gaps = 19/274 (6%)
Query: 87 DFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSV 146
D F + + + +SSL ++ W ++ F LP G+P SV+ DYL Y +W ++
Sbjct: 12 DEVSRVFTLEKSQSRQISSLFDLHLSWVTLKKFF----LPVGYPSSVSDDYLEYQIWDTI 67
Query: 147 QG-------------VASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSK 192
Q AS I+G LA+QA+L +G+G A +A++ W+ KDG G + +
Sbjct: 68 QAGIFFITKISLFSAFASSITGALASQAVLIGVGVGDSSATILSASLTWMFKDGSGMIGR 127
Query: 193 IMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALI 251
I+ + + G D + K WR AD+L + A LE+++P F + F + A +S +
Sbjct: 128 IIFAGYHGIKLDCDCKFWRFVADILNDCALFLEIISPLFNYLFTPLLCLANVLKSLVGVA 187
Query: 252 QASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSV 311
++TR+ A N A+V AK +Q +S + +L L + + F
Sbjct: 188 GSATRAAIVQHQAINNNLADVSAKDGSQETLSNLMAWLLNFILLYMVTGNQFLIWFCFIC 247
Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
T IH+Y N ++ + ++LRT N R L ++
Sbjct: 248 CTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQWF 281
>gi|167517110|ref|XP_001742896.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163779520|gb|EDQ93134.1| predicted protein [Monosiga brevicollis MX1]
Length = 272
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 73/193 (37%), Positives = 109/193 (56%), Gaps = 1/193 (0%)
Query: 154 SGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLF 212
SGVL+ QALLYA GLG G++P AAAINWVLKDG+G L ++ FG FD +PK R +
Sbjct: 4 SGVLSMQALLYAAGLGAGSVPLAAAINWVLKDGLGQLGGVLYGTLFGPRFDRDPKRQRFW 63
Query: 213 ADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV 272
+ + A E+L P PH F+ + +AA ++ + L ++TR+ +A F R N ++
Sbjct: 64 SLVALQGANLTELLIPLVPHLFLPLASAANIAKNISFLASSATRAQMHASFVRRANLGDI 123
Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
AK AQ + + +G LGI ++ H GS + + +F + +Y N S + + L TL
Sbjct: 124 TAKIGAQSIGASVVGTALGIFVSTHTGSDVTNIIPAFVPLALACLYFNHLSSKHVVLSTL 183
Query: 333 NPYRASLVFSEYL 345
N R L +L
Sbjct: 184 NEQRLELAMHPWL 196
>gi|225440682|ref|XP_002274737.1| PREDICTED: UPF0420 protein-like [Vitis vinifera]
Length = 503
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 165/359 (45%), Gaps = 45/359 (12%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--------GAIP 174
F+LP GFP SV+ DYL Y L + V + I L T +LL A+G+G A
Sbjct: 117 FILPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKAVGVGSFSATTAAASAAA 176
Query: 175 TAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
+AAAI WV KDGIG + ++ + FG FD +PK WR++ADL+ +A ++ T +P +
Sbjct: 177 SAAAIRWVSKDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLYPAY 236
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
F+ + + ++ A ++ + FA N EV AK E + ++ +G+ LGI
Sbjct: 237 FLQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIM 296
Query: 294 LANH--IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
+ + +S P ++ + +H++ +S + +++N RA ++ ++L P
Sbjct: 297 VLTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVP 356
Query: 352 PVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 411
+ N +E + F K R+ G L +++ +
Sbjct: 357 GYVDCNRKEKILSWQRFLK------------------------PRIIFGVSLEELIGGER 392
Query: 412 ---DAHALFSLYEDEGYILT--EHGGKFCVVL--KESALPQDMLKSLFQASYLYWLERN 463
L LY E YIL + G F V + K A +L+S++Q+ YWL+ N
Sbjct: 393 PVSKVRTLLKLYSQEKYILVVKQQGTDFEVFVSFKVGATSLSVLRSIWQS---YWLQEN 448
>gi|326428268|gb|EGD73838.1| hypothetical protein PTSG_05532 [Salpingoeca sp. ATCC 50818]
Length = 376
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 72/244 (29%), Positives = 127/244 (52%), Gaps = 2/244 (0%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI-NW 181
+LPEG+P SV+SDY +Y +W + Q +S + G ++T A+L +G+G AAA
Sbjct: 49 LVLPEGYPASVSSDYASYQIWDTAQAFSSSVMGTISTYAVLKGVGVGDAQATVAAATYTN 108
Query: 182 VLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAA 240
+L+D G L+ I + + D N K WRL AD++ + A +E+++P P F+ +
Sbjct: 109 LLRDATGMLAGIWFAWYSSTDMDRNAKQWRLVADVVNDMAMLVELVSPLIPAAFLPLMCM 168
Query: 241 AGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS 300
A R+ + +TR+ A N A+V AK +Q + +++ +A+
Sbjct: 169 ASFLRAIVGVAGGATRAALVRHQARLNNMADVSAKDGSQETLVNLAALVVSLAITPLAAE 228
Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
S A A F + ++H+YCN ++ +++E+ N +R + ++ + + P V N E
Sbjct: 229 SRVMAWAMFLPLVFMHLYCNYRAVRALEMPAFNAHRLHHACAVFMRTNRVPSVCAANQAE 288
Query: 361 PLFP 364
PLFP
Sbjct: 289 PLFP 292
>gi|356524511|ref|XP_003530872.1| PREDICTED: uncharacterized protein LOC100783222 [Glycine max]
Length = 513
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 77/244 (31%), Positives = 125/244 (51%), Gaps = 10/244 (4%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINW 181
F +P G VT +Y+ Y W+ + V S VLATQA+ A+G+G ++P+AAA+NW
Sbjct: 115 FFIPRG----VTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSNSLPSAAALNW 170
Query: 182 VLKDGIGYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAA 240
VLKDG+G LS+ I ++ FD N K R +L A+ GLE+LTPAFP F+ +
Sbjct: 171 VLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAFPRCFLLLATI 230
Query: 241 AGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG----IALAN 296
A + + +TRS + FA N E+ AK + Q + +G+ML + + N
Sbjct: 231 ANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLMLAALVNLWIEN 290
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
H + + ++ + +++ L+TL R ++ S ++ G P EV
Sbjct: 291 HRRQQAGLHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIECGYVPSPAEV 350
Query: 357 NDEE 360
+++E
Sbjct: 351 SEKE 354
>gi|298708275|emb|CBJ48338.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 284
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 113/210 (53%), Gaps = 2/210 (0%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
LP G+P SV +Y+ Y W +VQ + S + GVL A+L+A G+G + A AAA+ WVL
Sbjct: 72 LPTGYPSSVRKEYMEYQTWDTVQALCSYLRGVLCLTAMLHASGVGDENASAIAAAMVWVL 131
Query: 184 KDGIGYLSKIMLSNFGR-HFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
+DG G + ++ S H D N K WRLFAD+ + L+++ P F +F + + A
Sbjct: 132 RDGFGMIGSLLFSYAASSHMDSNIKEWRLFADIANDVGLTLDLVAPVFRGNFAVVSSLAT 191
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
++ + T++ + FA ++N A++ AK AQ G++LG+ +A+++ +
Sbjct: 192 VFKALCGVAAGCTKTSITSHFALKQNMADISAKENAQETAVNVCGMLLGVKVASYLSDAP 251
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTL 332
+ F + +H+Y N + SI L L
Sbjct: 252 TGRVLIFVFFSLLHVYANYRYMPSIHLLPL 281
>gi|414586414|tpg|DAA36985.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 288
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 116/207 (56%), Gaps = 1/207 (0%)
Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
+ D+ + F+ + P G+P SV YL Y+ +R +Q +S + VL+TQ+LL+A GL +
Sbjct: 60 IHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL-RP 118
Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
A A++W+LKDG+ + K++ S+ G D PK WR+ AD+L + LE+++P P
Sbjct: 119 TPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCP 178
Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
F+ + + A + +TR Y+ FA N +++ AKGEA + +GI G
Sbjct: 179 QLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAG 238
Query: 292 IALANHIGSSMPFALASFSVVTWIHMY 318
I LA+ + S+ L + +++ +H++
Sbjct: 239 IGLASTVCSTTQGKLIAGPLLSAVHIW 265
>gi|440638734|gb|ELR08653.1| hypothetical protein GMDG_03339 [Geomyces destructans 20631-21]
Length = 714
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 126/221 (57%), Gaps = 4/221 (1%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
F++ LP G+P+SVT DY+ Y ++ S+Q +S I+G++A++A+L +G+G A PT A
Sbjct: 88 FMEIFLPAGYPNSVTEDYMEYQIYDSLQAFSSSIAGLIASRAVLEGVGVGDATASPTTAL 147
Query: 179 INWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VF 236
+ VL+D G ++ I + G + K WRL AD+ +AA L+ L+PAFP V
Sbjct: 148 LLSVLQDSTGRIATIFFAARLGLSLEPECKRWRLVADIFNDAAMILDCLSPAFPKPLRVS 207
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ +A+ A R+ + S ++ A FA N E+ AK +Q + +G+++G AL
Sbjct: 208 LLSASAALRALCGVAAGSAKASLSAHFAKSGNLGELNAKDSSQETIISLLGMLVGTALIP 267
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRA 337
++ + + A +++ IH+ N + +++ +RTLN RA
Sbjct: 268 YLTTPLQTWTALLLLLS-IHLTTNYFAVRAVSMRTLNRQRA 307
>gi|357505287|ref|XP_003622932.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
gi|355497947|gb|AES79150.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
Length = 590
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 175/396 (44%), Gaps = 31/396 (7%)
Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGI 187
P V +Y+++ W+ + V S VLATQA+ AIG+G ++P+AAA+NWVLKDG+
Sbjct: 116 IPREVNENYVDFVKWKLLHRVFSSALQVLATQAMFTAIGVGYSSSLPSAAALNWVLKDGL 175
Query: 188 GYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
G LSK I ++ FD N K R +L + GLE+LTP FP F+ + A +
Sbjct: 176 GRLSKCIYTASLASAFDTNLKRVRFTTSVLFVGSIGLELLTPTFPRFFLLLATIANIAKQ 235
Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG----IALANHIGSSM 302
+ +TRS + FA N E+ AK + Q + +G+ML + + NH
Sbjct: 236 ISLACYLATRSAVHQSFAKADNLGEISAKAQIQTVCFDILGLMLAALVNMWIGNHQRPQA 295
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
+ + ++ ++ + + + L+TL R ++ ++ G P +V+++E
Sbjct: 296 GLHFFVYPLFAFMDLFGIYQGLKHVHLKTLTKDRLEIILKTWIEFGYVPSPADVSEKE-- 353
Query: 363 FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYED 422
+F +K + + + KD E K + +++ ++E
Sbjct: 354 --VINFMGVKGKSLWPVRIGCISPKDQIPE------WSMKTIQSITDEDYYFVCMEIFEG 405
Query: 423 EGYILTEHGGKFCVVL--KESALPQDMLKSLFQASY-----LYW-LERNAGIVATSTSAD 474
+ G+ ++L +E A + ++ L QA Y L W + R + S D
Sbjct: 406 -----LKRTGQLRILLSIREGAESKHIIMGLLQACYIRRTLLKWNISRGENNASNSALED 460
Query: 475 CAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
+ +E Y +R+ + +GWV ++
Sbjct: 461 WSV--IVEDGKRYAERDVTDLIEQMVGMGWVVKNIL 494
>gi|298706870|emb|CBJ25834.1| Unknown protein [Ectocarpus siliculosus]
Length = 673
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 81/245 (33%), Positives = 132/245 (53%), Gaps = 12/245 (4%)
Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIP 174
R + LP G+P SV +Y+ Y W +Q ++S + GVLATQA+L +G+G+ G+ P
Sbjct: 196 VRGGLMNTFLPAGYPGSVRPEYVRYRCWDILQDLSSHLRGVLATQAVLEGMGVGRAGSTP 255
Query: 175 TAAAINWVLKDGIGYLSKIMLSNFGR-HFDVNPKGWRLFADLLENAAFGLEMLT--PAFP 231
AA + W+ +DG + ++ ++ +F VN K WRLFAD + LEML PAF
Sbjct: 256 LAATLQWMARDGASMIGGLLFTSVASANFGVNIKTWRLFADTSNDIGLALEMLAPLPAFR 315
Query: 232 H--HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
F+ + A ++A + +T + +A N A++ AK AQ V+ +G+
Sbjct: 316 EGGRFLRLICIASVFKAACGVAGGATGAAITEHWAIDNNIADIGAKNGAQHTVASLLGLG 375
Query: 290 LGIALANHI----GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
L + A + +S ++V+T +H+ N + +++ LR+LNP RASL+ +EYL
Sbjct: 376 LSVWFARAVSVGGSASGSRVWGWYAVLTLVHLLANYAAMRTLALRSLNPTRASLLVTEYL 435
Query: 346 LSGQA 350
GQA
Sbjct: 436 --GQA 438
>gi|307111229|gb|EFN59464.1| hypothetical protein CHLNCDRAFT_138052 [Chlorella variabilis]
Length = 657
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 151/339 (44%), Gaps = 50/339 (14%)
Query: 160 QALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLE 217
++LL A+G+G + ++P AA INWVLKDG+G L ++ + + FG FD + K +R ++
Sbjct: 78 ESLLQAVGVGARRSLPAAATINWVLKDGLGRLGRLTVATRFGESFDSDLKRFRYATSIIY 137
Query: 218 NAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGE 277
+ LE LTP P HF+ + + A G+S +T+ F+ F R N A++ AK +
Sbjct: 138 AVSLSLEFLTPLAPQHFLVMASLANVGKSIGLTTFIATQPAFHRSFCLRENLADISAKTQ 197
Query: 278 AQGMVSKAIGIMLGIA---LANHI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
AQ MV IG+ + L H + L F ++ ++ +SI LRTLN
Sbjct: 198 AQQMVMDNIGLAAAVGLTYLCRHTEAARRALPLVMFPLLAAGDLWAIYSELRSIHLRTLN 257
Query: 334 PYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEI 393
RA ++ +L G+ P ++V++EE H + + L+ + D AV+
Sbjct: 258 KERAEIIAQHWLREGRVPSPRQVSEEERFVLPPHI-------EVGRMPLTISSLDQAVQS 310
Query: 394 EHRLQLGSKLSDVVNNKEDAHAL--------------------------------FSLYE 421
LQL + ++ D + L
Sbjct: 311 SQDLQLLEQ-----QDRGDHYFLTFTSPGGSSSSPSSSSSSSDGSGPAGRGRRGLLPPPL 365
Query: 422 DEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
+ L H G V L++ A D+++++ QA+YL L
Sbjct: 366 PLPWQLGRHAGSVRVCLRQDAAEPDIVQAVLQAAYLRQL 404
>gi|355736030|gb|AES11869.1| hypothetical protein [Mustela putorius furo]
Length = 218
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/166 (39%), Positives = 94/166 (56%), Gaps = 2/166 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFPDSV+ DYL Y LW SVQ AS ++G LAT A+L IG+G A +AA W++
Sbjct: 53 LPQGFPDSVSPDYLPYQLWDSVQAFASSLAGSLATHAVLLGIGVGNAEASVSAATATWLV 112
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F I +
Sbjct: 113 KDSAGMLGRIIFAWWKGTKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMFFTMIVCTSN 172
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
+ ++ +TR+ A R N A+V AK +Q + G+
Sbjct: 173 LAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGL 218
>gi|156059886|ref|XP_001595866.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980]
gi|154701742|gb|EDO01481.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 477
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 181/356 (50%), Gaps = 35/356 (9%)
Query: 125 LPEGFPDSVTSDY---------LNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIP 174
LP G+PDSVT DY L Y L + AS I+G+LA++A+L +G+G A
Sbjct: 57 LPVGYPDSVTEDYTACVPCPIHLLYLLCEKL-AFASTIAGLLASRAVLQGLGVGDSTASA 115
Query: 175 TAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH- 232
T A + VL++ G ++ I+ ++ G + K +RL AD+ + A L+ ++PAFP
Sbjct: 116 TGAVLLNVLQESAGRIATILFAHRLGSALEPECKKYRLMADIFNDGAMILDCISPAFPKV 175
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
VF+ +A+ +S + S+++ A FA + N AE+ AK +Q + +G+++G
Sbjct: 176 PRVFLLSASSVCKSLCGVAAGSSKASLSAHFAKQGNLAELNAKDASQETLISLLGMLVGT 235
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA-- 350
+ + I S +A S+++ IH+ N + +S+ +RTLN RA+LV S+++ + A
Sbjct: 236 FVVSRISSQTATWIALISLLS-IHLGTNYLAVRSVTMRTLNRQRANLVISDFVSNINAEK 294
Query: 351 -----PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSD 405
P K+++ +E +F + + Q +VL V + L S
Sbjct: 295 TRFNLPTPKDISRKERIFE-----RDGAIRHIQGVVLG--YCKVGVSLREILSSISSSPT 347
Query: 406 VVNN-KEDAH----ALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLKSLFQA 454
V + KE+ +LF +Y+++GY++ + F V+LKE+ P L + F A
Sbjct: 348 VSGSYKEETSSIITSLFKIYKNQGYVMWYSRRQNTFLVILKENTAPSVQLDAWFHA 403
>gi|401421841|ref|XP_003875409.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491646|emb|CBZ26919.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 675
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 142/272 (52%), Gaps = 10/272 (3%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
+ LP+G+P SVT D L ++ W VQ +A ++ L+T+A+L +G+G+ A T++ ++
Sbjct: 70 RIFLPDGYPSSVTEDLLPFAFWAHVQVLAKNVTATLSTRAVLLGVGVGESKADLTSSTLS 129
Query: 181 WVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH---FVF 236
W+++DG + ++ ++ + + K W+L D + ++A LE+ +P P F
Sbjct: 130 WMMQDGARMIGSVVFASVIPQGLECRAKTWQLVTDFIHDSAHLLELCSPWLPGGQAIFRV 189
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ AA ++ A++ TR+ F FA R N A++ AK +G V IG+ LGI +A
Sbjct: 190 VLVAASVIKALASVCGNGTRAPFTQHFALRNNGADIAAKAATRGHVGGLIGLALGIVIAY 249
Query: 297 HI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+ +S LA F++ + +H++ N + + ++LR LN R +Y L +
Sbjct: 250 IVPATSRRLNLAVFALCSAVHIFANYRGVRGVQLRHLNAPRLEWCLEQYYLFKAKVLIAS 309
Query: 356 VNDEEPL--FPAFHFFKIKSANKSQ-LLVLSS 384
D PL P +SAN ++ LL+L S
Sbjct: 310 KKDSGPLSVMPILD-VSPRSANAAERLLILPS 340
>gi|452842814|gb|EME44750.1| hypothetical protein DOTSEDRAFT_130464 [Dothistroma septosporum
NZE10]
Length = 447
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/376 (28%), Positives = 187/376 (49%), Gaps = 22/376 (5%)
Query: 90 KDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGV 149
+ A V S ++ S + V LW + ++F +P GFP+SVT DYL Y ++ S+Q
Sbjct: 17 EKATYVQSPEGSTERSRVDVISLWQQAFDVF----MPVGFPNSVTHDYLEYQIYDSLQAF 72
Query: 150 ASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPK 207
+S I+ +LA++A+L ++G+G A PTAA + + + G L+ I+ ++ G K
Sbjct: 73 SSSIASLLASRAVLSSVGVGDADASPTAAILLSIFQTSAGRLATILFAHRLGTSLGPECK 132
Query: 208 GWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAAR 266
+RL AD+L + F L+M++ AFP V I + + RS + S ++ A FA
Sbjct: 133 MYRLLADILNDTGFLLDMISVAFPKPISVIILSFSSILRSLCGVAAGSAKASLSAHFARW 192
Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQS 326
N AE+ AK +Q V +G+ G + ++I A + V+ IH+ N ++ ++
Sbjct: 193 GNLAELNAKDSSQETVISLLGMWAGSVVISYITEPTA-AWTTLIVLLAIHLETNRRAVRA 251
Query: 327 IELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEA 386
+ ++TLN RA+L+F +L G P E+ +E +F + N + +L A
Sbjct: 252 VTMQTLNRQRATLMF-HHLQRGHVPTPVEIASQERIFERDGVLR---GNDDR--ILGHCA 305
Query: 387 KDAAVEIEHRLQLGSKLSDVV--NNKEDAHALFSLYEDEGYILTEH------GGKFCVVL 438
A+ E + G++ S + ++ ++YE YIL + +VL
Sbjct: 306 VGASTERLLKAVSGAQTSTKAFPSQGPAMQSILTVYEKSPYILWQDRRSLPGRSSLFIVL 365
Query: 439 KESALPQDMLKSLFQA 454
K+ A P +ML + +QA
Sbjct: 366 KKGATPPEMLTAWWQA 381
>gi|259484769|tpe|CBF81274.1| TPA: DUF647 domain protein (AFU_orthologue; AFUA_6G06850)
[Aspergillus nidulans FGSC A4]
Length = 564
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/235 (31%), Positives = 132/235 (56%), Gaps = 5/235 (2%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
R++ ++ LP G+P SV+ DY Y ++ S+Q +S I+G+LA++A+L +G+G A PT
Sbjct: 66 RDVLIEVFLPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPT 125
Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
+A + +L+D G ++ I+ ++ G + K +R AD+ + A L+ +P P F
Sbjct: 126 SALLLHILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGF 185
Query: 235 --VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
V + +AAG R+ + S+++ A FA N AEV AK +Q V IG+++G
Sbjct: 186 NRVTVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAKDSSQETVISLIGMLVGS 245
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
+ +HI +S + + +H+ N + +S+++ TLN RA++VFS L S
Sbjct: 246 FVVSHI-TSFTATWLTLVFLLSMHLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 299
>gi|108706828|gb|ABF94623.1| expressed protein [Oryza sativa Japonica Group]
Length = 497
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/255 (31%), Positives = 126/255 (49%), Gaps = 30/255 (11%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
+PEGFP SVT DY+ + +W ++QG+++ I +L+TQALL AIG+G K A A W L
Sbjct: 148 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 207
Query: 184 KDGIGYLSKIM-----------------------------LSNFGRHFDVNPKGWRLFAD 214
+D G L I+ ++ G + D N K WRL AD
Sbjct: 208 RDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 267
Query: 215 LLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 274
+ + +++L+P FP + I RS + +TR+ FA N A++ A
Sbjct: 268 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 327
Query: 275 KGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNP 334
K +Q ++ +G+ LG+ LA+ SF +T HMY N K+ QS+ L TLN
Sbjct: 328 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 387
Query: 335 YRASLVFSEYLLSGQ 349
R+S++ ++ +G+
Sbjct: 388 ERSSILLQYFMDNGE 402
>gi|302849057|ref|XP_002956059.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
nagariensis]
gi|300258564|gb|EFJ42799.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
nagariensis]
Length = 208
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 106/201 (52%), Gaps = 11/201 (5%)
Query: 98 ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVL 157
A AS S L S+ D R+LF LP GFPDSVT DYL Y LW V +S L
Sbjct: 6 APGASPSPLSSIP---DSIRDLF----LPPGFPDSVTPDYLPYQLWSLPTHVTGHLSHSL 58
Query: 158 ATQALLYAIGLGKGAIPT---AAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFA 213
AT +LL A+G+ T +A+I W++KDG+G L + ++ S F FD +P+ WRL A
Sbjct: 59 ATSSLLTAVGVSTSPAATVALSASIKWIIKDGVGALGRFIVGSRFSAEFDEDPRRWRLVA 118
Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
+LL A GLE+ T +P F+ + +A G++ + FA +N V
Sbjct: 119 ELLSTAGLGLEVATSLYPQSFMLLASAGKFGQALGKGMGKPVFRVIQTHFARSQNVGAVA 178
Query: 274 AKGEAQGMVSKAIGIMLGIAL 294
AK E + ++ +G+M +A+
Sbjct: 179 AKEEVWEVAAQMVGLMASVAV 199
>gi|356566668|ref|XP_003551552.1| PREDICTED: uncharacterized protein LOC100787355 [Glycine max]
Length = 1041
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 93/317 (29%), Positives = 142/317 (44%), Gaps = 46/317 (14%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG----KGAIPTAAA 178
F+LP GFP SV+ DYL+Y L + V I L T +LL A+G+G A +A+A
Sbjct: 654 FILPAGFPGSVSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTSATASASA 713
Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
I WV KDGIG + ++ L FG FD +PK WR++AD + +A + T +P +F+ +
Sbjct: 714 IRWVSKDGIGAVGRLCLGGRFGSLFDDDPKQWRMYADFIGSAGSIFYLTTQVYPDYFLPL 773
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ ++ A ++ + FA N EV AK E +V++ IG+ LGI + +
Sbjct: 774 ASLGNLTKAVARGLKDPSFCVIQNHFAISGNLGEVAAKEEIWEVVAQLIGLALGILILD- 832
Query: 298 IGSSMPFALASFSVVT-------WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
P + S+ V++ ++H++ KS ++ T+N R +L
Sbjct: 833 ----TPSLVKSYGVLSLPWLGMQFLHLWLRYKSLSVLQFNTINLKR--------VLHSTV 880
Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
P N EE + F K K I L L K+ V +
Sbjct: 881 PGCTYCNREENILTRSQFMKPK--------------------INFGLPL-EKIDGVERSH 919
Query: 411 EDAHALFSLYEDEGYIL 427
AL LY E YIL
Sbjct: 920 FMVEALLKLYASEKYIL 936
>gi|317143185|ref|XP_001819302.2| hypothetical protein AOR_1_114154 [Aspergillus oryzae RIB40]
Length = 510
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 78/260 (30%), Positives = 138/260 (53%), Gaps = 5/260 (1%)
Query: 121 VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAI 179
V+ LP G+P SV+ DY+ Y ++ S+Q +S I+G+L+++A+L +G+G A PTAA +
Sbjct: 72 VEVFLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALL 131
Query: 180 NWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF--VF 236
+L+D G +S I+ ++ G + K +RL AD+ +AA L+ L+P P F V
Sbjct: 132 LHILQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVT 191
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ + AG R+ + S+++ A F+ N AEV AK +Q + IG+++G + +
Sbjct: 192 VLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVGSFVVS 251
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+ +S S ++ +H+ N + +S+++ +LN RA++VFS L S +
Sbjct: 252 RV-TSYTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNSDTDLDIANF 310
Query: 357 NDEEPLFPAFHFFKIKSANK 376
N P K + K
Sbjct: 311 NPTHETHPTPKHSKATQSQK 330
>gi|403362828|gb|EJY81150.1| DUF647 family protein [Oxytricha trifallax]
Length = 507
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 78/260 (30%), Positives = 133/260 (51%), Gaps = 5/260 (1%)
Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIG- 167
VN+L + LP G+P +V+S YL ++++ ++ G L+ Q+L AIG
Sbjct: 73 VNQLNKNIYNGLFETFLPAGYPHTVSSSYLKFTIYSNISAFMYTAMGFLSAQSLFVAIGS 132
Query: 168 LGKGAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEML 226
A AAA WVLKDG+G L I+ S +GR+FDV+ K WR A N + +E+L
Sbjct: 133 TMTQANLAAAAYTWVLKDGLGQLGGILFASRYGRNFDVDIKKWRFMAIFALNISMYIEIL 192
Query: 227 TPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
T +FP +F+ + + A G++ L+ +++R+ A N ++ K +Q S I
Sbjct: 193 TLSFPQYFLALASLANVGKNICFLLASASRASINLQLAKNNNIGDISGKSVSQFTASTLI 252
Query: 287 GIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
G G+ ++ I + + L F V+T +++ + S + I+ LN RA L+F+E+
Sbjct: 253 GGACGLGISKIIDITSIYQLYPVFFVLTVMNIAASYYSAKVIDELYLNNQRAFLLFNEFF 312
Query: 346 LSGQA--PPVKEVNDEEPLF 363
S + + +ND E +
Sbjct: 313 KSNEKLIHTAEYINDIEAFY 332
>gi|449460628|ref|XP_004148047.1| PREDICTED: UPF0420 protein C16orf58-like [Cucumis sativus]
Length = 523
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/240 (29%), Positives = 125/240 (52%), Gaps = 10/240 (4%)
Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
P +V+ Y+ Y W+ + V S V+ATQA+ AIG+G ++ +AAA+NWVLKDG+
Sbjct: 125 IPKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAIGVGHSRSLASAAALNWVLKDGL 184
Query: 188 GYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
G LS+ + + + FD N K R +L + + G+E+LTPAFP +F+ + + A +
Sbjct: 185 GRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVELLTPAFPQYFLLLASIANIVKQ 244
Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL------ANHIGS 300
+ ST S + FA N EV AK + Q + +G++L L + +
Sbjct: 245 ISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFDNLGLVLAAFLNFLSKNDRRLQA 304
Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
++PF + + + + ++ + + + L+TL R ++ S ++ G P EV++ E
Sbjct: 305 ALPFVV--YPIFAAMDLFGTYQGLKHVHLQTLTKDRLEIILSNWIEQGYVPTPAEVSERE 362
>gi|449509553|ref|XP_004163622.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58-like
[Cucumis sativus]
Length = 523
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 71/240 (29%), Positives = 125/240 (52%), Gaps = 10/240 (4%)
Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
P +V+ Y+ Y W+ + V S V+ATQA+ AIG+G ++ +AAA+NWVLKDG+
Sbjct: 125 IPKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAIGVGHSRSLASAAALNWVLKDGL 184
Query: 188 GYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
G LS+ + + + FD N K R +L + + G+E+LTPAFP +F+ + + A +
Sbjct: 185 GRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVELLTPAFPQYFLLLASIANIVKQ 244
Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL------ANHIGS 300
+ ST S + FA N EV AK + Q + +G++L L + +
Sbjct: 245 ISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFDNLGLVLAAFLNFLSKNDRRLQA 304
Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
++PF + + + + ++ + + + L+TL R ++ S ++ G P EV++ E
Sbjct: 305 ALPFVV--YPIFAAMDLFGTYQGLKHVHLQTLTKDRLEIILSNWIEQGYVPTPAEVSERE 362
>gi|225683629|gb|EEH21913.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
Pb03]
Length = 509
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 187/398 (46%), Gaps = 37/398 (9%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLW---------RSVQGVASQISGVLATQALLYAIGLG 169
L + F LP G+P SV+ DY++ S+Q +S I+G+LA++A+L +G+G
Sbjct: 68 LLIVF-LPAGYPHSVSDDYMDLLFLALENDRKYKDSLQAFSSSIAGLLASRAVLQGVGVG 126
Query: 170 KG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLT 227
A PT A ++ VL++ IG ++ I+ ++ G + K +R AD+L ++A ++ L+
Sbjct: 127 DASASPTVALLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLS 186
Query: 228 PAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
PAFP H V + + R+ + S+++ + FA N E+ AK +Q V +
Sbjct: 187 PAFPKHLRVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNAKDSSQETVISLM 246
Query: 287 GIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
G++ G + +HI + + LA + IH+ N + +++ + TLN RA++VFS
Sbjct: 247 GMLCGSLVVSHISTPLTTGLA-LIFLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFE 305
Query: 347 SGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDV 406
G+A K+ + E +F + K+++ + + L GS S
Sbjct: 306 EGRALTPKQTSKCERIFERDGILRWKASSTTLGFCRIGNSF-------QELLCGSNGSYR 358
Query: 407 VNNKEDA----HALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLK----SLFQASY 456
N+ DA L L+E E YIL + +VLK P LK +L A +
Sbjct: 359 ANSIRDAPIDIPRLLRLFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAWSHALLVAKH 418
Query: 457 LYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNH 494
L +A + +P + E+ L+ R+ N+
Sbjct: 419 LTKTTEDA------EKNNNSPIKKGEVKLERQDRKKNY 450
>gi|340967040|gb|EGS22547.1| diphosphomevalonate decarboxylase-like protein [Chaetomium
thermophilum var. thermophilum DSM 1495]
Length = 425
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 78/250 (31%), Positives = 131/250 (52%), Gaps = 20/250 (8%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
LP GFP +VT DYL+Y L+ S+Q AS I+ +LA +A+L +G+G + PT A I +
Sbjct: 45 LPAGFPHTVTEDYLSYQLYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 104
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH-FVFIGAAA 241
D I ++ I+ ++ GR + K +R AD+ ++A L++LTPA P+ + + +A
Sbjct: 105 GDTISRIATILFAHRMGRAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKLSVIVSA 164
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
G RS + ++++ A FA N AE+ AK +Q V +G++ G S
Sbjct: 165 GVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLAG---------S 215
Query: 302 MPFALASFSVVTW--------IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
+ L V W +H+ N ++ +S+++R+LN RA++VF E+L G
Sbjct: 216 LVVRLVEDKQVVWFLMILLAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGTVLTP 275
Query: 354 KEVNDEEPLF 363
+V E +
Sbjct: 276 DQVAKRESIL 285
>gi|295659580|ref|XP_002790348.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226281800|gb|EEH37366.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 485
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/385 (25%), Positives = 183/385 (47%), Gaps = 35/385 (9%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
L + F LP G+P SV+ DY+ Y ++ S+Q +S I+G+LA++A+L +G+G A PT A
Sbjct: 68 LLIVF-LPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVA 126
Query: 178 AINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-V 235
++ VL++ +G ++ I+ ++ G + K +RL AD+L ++A +E L+PAFP H V
Sbjct: 127 LLHSVLQESMGRIATILFAHRLGTSLEPECKLYRLAADVLNDSAMVVECLSPAFPKHLRV 186
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
+ + R+ + S+++ + FA N E+ A G +
Sbjct: 187 VLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGS---------------LVV 231
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+HI + + L + + IH+ N + +++ + TLN RA++VFS G+A K+
Sbjct: 232 SHISTPLTTGL-TLIFLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTPKQ 290
Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
+ E +F + K+++ + A E+ + + + + + D
Sbjct: 291 TSKCERIFERDGILRWKASSTTLGFC---RIGIAFQELLRSSNVSCRANSIRDIPIDIPR 347
Query: 416 LFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLK----SLFQASYLYWLERNAGIVAT 469
L L+E E YIL K +VLK + P LK +L A +L +A
Sbjct: 348 LLRLFEKEEYILWFNPVSKKGTIVLKNNVKPVSQLKAWSHALLVAKHLMKTTEDA----- 402
Query: 470 STSADCAPGGRLEISLDYVQREFNH 494
+ +P + E+ L+ R+ N+
Sbjct: 403 -EKNNNSPITKGEVKLERQNRKKNY 426
>gi|320164887|gb|EFW41786.1| GTP-binding protein [Capsaspora owczarzaki ATCC 30864]
Length = 436
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/231 (32%), Positives = 120/231 (51%), Gaps = 3/231 (1%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTA 176
+L LP+G+PDSV++DY +Y W ++Q S I+G LAT A+L +G+G + A TA
Sbjct: 46 QLLKAVFLPDGYPDSVSADYASYQRWDTLQAFCSSINGSLATLAVLKGVGVGDETATATA 105
Query: 177 AAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
AA++W+L+DG+G L +I + G D N K WRL AD+ + A +E+L+P F+
Sbjct: 106 AAVSWMLRDGVGMLGRIFFAWQKGPALDANAKRWRLVADIFNDLAMFVELLSPLVGEWFL 165
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
RS + +TR+ A R N +V AK +Q + +++ + L
Sbjct: 166 AFACLGSVLRSIVGVAGGATRAAITQHQARRNNHGDVSAKDGSQETLVNLAALLVSLWLL 225
Query: 296 NHIGSSMP-FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
+GS P F T +H++ N + ++ T N R +V S+Y+
Sbjct: 226 PALGSDHPTLVWLLFCSFTSLHLFANYRGVRATVFDTFNRERLFIVASDYM 276
>gi|290999547|ref|XP_002682341.1| predicted protein [Naegleria gruberi]
gi|284095968|gb|EFC49597.1| predicted protein [Naegleria gruberi]
Length = 261
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/255 (30%), Positives = 129/255 (50%), Gaps = 15/255 (5%)
Query: 121 VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIG------LGKGAIP 174
+ LP+ +P SV+S+Y+NY W+++Q A SGVL Q+LL+A G LG
Sbjct: 2 ISVFLPKNYPLSVSSNYINYCKWQALQYTAGSFSGVLTMQSLLHAAGITSQLALGGNEYF 61
Query: 175 TA---AAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
A A+ WV+KDG+G L I+ S +FD +PK WR+ + A+ LE+ TP
Sbjct: 62 LAFLGGALAWVIKDGLGQLGGILFASKVNTNFDADPKFWRMGGEYALVASALLEVTTPIT 121
Query: 231 -PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
P F+ + A G++ + ++TR+ FA N A+V AK +Q + IG
Sbjct: 122 GPSWFIAQASLANIGKNVSCFAASATRAAMNQSFAKSDNLADVTAKATSQALACSLIGTT 181
Query: 290 LGIALANHIGSSMPFALAS----FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
LGI +++ + F A F V++ I ++ K+ +++L+ LN R + S+YL
Sbjct: 182 LGIFVSSALSLGTQFVFAKVFPIFCVLSVIQLFSLYKAVSNVKLKVLNKQRFLIACSQYL 241
Query: 346 LSGQAPPVKEVNDEE 360
+V++ E
Sbjct: 242 KDRAVLDPSKVSERE 256
>gi|281211285|gb|EFA85450.1| DUF647 family protein [Polysphondylium pallidum PN500]
Length = 427
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/174 (34%), Positives = 97/174 (55%), Gaps = 2/174 (1%)
Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
L ++ E F LP G+P+SV+ DY +Y W VQ ++ +L+T ALL ++G+G
Sbjct: 98 LSEKLYETFKTLFLPSGYPESVSQDYASYQRWIFVQNTLGSVTYMLSTHALLTSVGVGLS 157
Query: 172 A-IPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 229
A +P +AAI+WVLKDG+G + ++ S + D + K ++ D L N LE TP
Sbjct: 158 ASLPFSAAISWVLKDGLGASALVLFASKYSTSLDFDLKRFKFRGDFLHNFGVFLETCTPF 217
Query: 230 FPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
P +F+ + + + + A LI STR+ + FA + N ++ AK ++Q M S
Sbjct: 218 LPGYFLLMASISNLAKGIAGLIYGSTRASLHKSFALKENIGDITAKYQSQAMSS 271
>gi|328773288|gb|EGF83325.1| hypothetical protein BATDEDRAFT_85860 [Batrachochytrium
dendrobatidis JAM81]
Length = 434
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 161/352 (45%), Gaps = 36/352 (10%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
+ F LP G+P SV + Y +W + VA VL QA+L +IG A TAA
Sbjct: 34 QWFRNMFLPVGYPSSVHACYKKVHMWLFAENVAGSAISVLTAQAMLTSIGASTAAHETAA 93
Query: 178 ---AINWVLKDGIGYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PH 232
A++WVLKDGIG L K +M+ F FD +PK W+L+ + + +++ T PH
Sbjct: 94 LAIAVDWVLKDGIGELGKMLMIQRFAHQFDTHPKRWKLYGEACSISGALMQLATCITNPH 153
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARR--NFAEVIAKGEAQGMVSKAIGIML 290
HF+F + RS I A+T + F A N +++AK ++Q ++ +G++
Sbjct: 154 HFLFFASIGVGLRSMHYSIWAATHTTFTRNMATHNGVNVGDIVAKADSQLSLAHLLGMVS 213
Query: 291 GIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
GI + S P AL A FSV++ I + C + L+ YR L+ E++
Sbjct: 214 GIGMLA--VSFQPAALFAWFSVLSAIQIVCTGLLLREARFEVLDQYRLVLIPREFMSEKN 271
Query: 350 APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN 409
A + ++ F + + +K Q++ L +G + D +
Sbjct: 272 ALRIPTLDQ----VVRFENWLNEGLSKGQVVT--------------DLSIGVTVDDAFKH 313
Query: 410 KEDAHALFSLYEDEGYIL-------TEHGGKFCVVLKESALPQDMLKSLFQA 454
+HAL ++++ Y+L T K VVL +S P ++++ F A
Sbjct: 314 SRLSHAL-QIFKNGKYLLGFKLADSTHSQIKVYVVLHQSIQPVHVIQAAFHA 364
>gi|255082448|ref|XP_002504210.1| predicted protein [Micromonas sp. RCC299]
gi|226519478|gb|ACO65468.1| predicted protein [Micromonas sp. RCC299]
Length = 226
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 69/226 (30%), Positives = 113/226 (50%), Gaps = 4/226 (1%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI-NW 181
LPE +P+SV+ DY + W + QG++S + G L T+ALL +G+G A+A +
Sbjct: 1 MFLPEDYPNSVSDDYAAFQAWDTAQGLSSYVRGSLTTKALLEGVGVGATGATAASAAAQF 60
Query: 182 VLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH--FVFIG 238
+ +D G + ++ + GRH D K WRLFADL N +E+ P F+ +
Sbjct: 61 IARDMCGMVGGLVFTLAKGRHLDAEAKQWRLFADLANNVGMAMELAAPLANDRWWFLALA 120
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
A RS +TR+ FA RN A++ AK +Q +G+ LG+A+
Sbjct: 121 CAGSVARSLCGCAAGATRAALTQHFARARNAADIAAKEGSQETAVTLVGMSLGVAVTTST 180
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEY 344
+ A F +T H+YCN+++ +S+ + +LN R + + Y
Sbjct: 181 DGKTAWQWAVFVALTVAHVYCNVRAVRSLVIDSLNRERVVRLLARY 226
>gi|238488016|ref|XP_002375246.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
gi|220700125|gb|EED56464.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
Length = 513
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 82/268 (30%), Positives = 138/268 (51%), Gaps = 18/268 (6%)
Query: 121 VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAI 179
V+ LP G+P SV+ DY+ Y ++ S+Q +S I+G+L+++A+L +G+G A PTAA +
Sbjct: 72 VEVFLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALL 131
Query: 180 NWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF--VF 236
+L+D G +S I+ ++ G + K +RL AD+ +AA L+ L+P P F V
Sbjct: 132 LHILQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVT 191
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ + AG R+ + S+++ A F+ N AEV AK +Q + IG+ L
Sbjct: 192 VLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGM-----LVR 246
Query: 297 HIGSSMPFALASFSVVTWI--------HMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
+GS + + S++ TWI H+ N + +S+++ +LN RA++VFS L S
Sbjct: 247 FVGSFVVSRVTSYT-ATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNSD 305
Query: 349 QAPPVKEVNDEEPLFPAFHFFKIKSANK 376
+ N P K + K
Sbjct: 306 TDLDIANFNPTHETHPTPKHSKATQSQK 333
>gi|392564112|gb|EIW57290.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 516
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 172/383 (44%), Gaps = 32/383 (8%)
Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
N L F Q +LP +P SV YL + + + + + I+ VL QALL ++G+
Sbjct: 100 TNSLTGRISSWFGQMVLPTNYPHSVHRSYLPFHILQFFETTVATITSVLCNQALLTSVGV 159
Query: 169 -GKGAIPTAAAINWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLE-- 224
+G++ A A+ W++KDG G ++K+ + F +FD +PK + F +L+ GL+
Sbjct: 160 SAEGSVFGAVAVQWIIKDGAGEVAKLFFIRRFSPYFDSHPKTFTFFGELMGCLGSGLQIA 219
Query: 225 -MLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
+L P +F+ A + I +T F F+ + N +V AK E+Q ++
Sbjct: 220 TLLIAPSPGNFLLCAAGGNIFKIVGNAIWFTTHIKFMRYFSEQGNTGDVAAKDESQSSIA 279
Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIH--MYCNLKSYQSIELRTLNPYRASLVF 341
+ +G GI+L + + P+ A F+ +H M + + EL TL R S +
Sbjct: 280 QLVGYASGISLLA-VSHTAPYLYAIFAFAVPVHLAMTAYMMRVATFELLTLP--RVSCLA 336
Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLG 400
EY+ GQ + E++ F++ KS L + D+ E+E R Q+
Sbjct: 337 QEYVTRGQVSSLGELDAAHRTGLFGEFYRDKSDRWLTLAPRVGDVLDSGTELERVRWQIC 396
Query: 401 SKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
+ ++ED+ Y+L G V A DML +++ A+ L
Sbjct: 397 A----------------GVFEDDRYLLFPRGRNISVFFHPDAANDDMLCAIYHAA----L 436
Query: 461 ERNAGIVATSTSADCAPG-GRLE 482
R+A I ++S D P G LE
Sbjct: 437 VRDALINREASSTDVPPAEGLLE 459
>gi|297602390|ref|NP_001052390.2| Os04g0290800 [Oryza sativa Japonica Group]
gi|255675293|dbj|BAF14304.2| Os04g0290800, partial [Oryza sativa Japonica Group]
Length = 107
Score = 116 bits (290), Expect = 4e-23, Method: Composition-based stats.
Identities = 57/72 (79%), Positives = 63/72 (87%)
Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELR 330
+VIAKGEAQGMVSK +GIMLGIALAN IGSS+ AL SF+ VT +HMYCNLKSYQSI+LR
Sbjct: 29 QVIAKGEAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLR 88
Query: 331 TLNPYRASLVFS 342
TLNPYRAS V S
Sbjct: 89 TLNPYRASNVLS 100
>gi|320593995|gb|EFX06398.1| duf647 domain containing protein [Grosmannia clavigera kw1407]
Length = 458
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 77/250 (30%), Positives = 136/250 (54%), Gaps = 5/250 (2%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PT 175
+ +F F LP G+P SVT DYL Y + S+Q +S I+ +LA +A+L +G+G + PT
Sbjct: 41 KAIFYAF-LPAGYPQSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPT 99
Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP-HH 233
AA + +++D ++ I+ ++ G+ + K +R ADL ++A L++LTP P +
Sbjct: 100 AALVLKIIQDTFSRMATILFAHRMGQAIEPECKTYRFMADLFNDSALMLDLLTPVLPLYP 159
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
V I A+ RS + +++++ A FA N AE+ AK +Q V +G++ G A
Sbjct: 160 KVCIMVASSILRSLCGIAASASKASLSAHFARTGNLAELNAKEASQETVISLLGMLAGTA 219
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
+ H+ L ++ +H+ N + + +++ TLN RA++VF EYL +G+
Sbjct: 220 VV-HVVQDKTAVLYWMVMLLMVHLVMNYRGVRCVKMLTLNRQRATIVFREYLETGKILSP 278
Query: 354 KEVNDEEPLF 363
EV + E +
Sbjct: 279 GEVAERESVL 288
>gi|327294507|ref|XP_003231949.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
gi|326465894|gb|EGD91347.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
Length = 483
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 138/247 (55%), Gaps = 4/247 (1%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
+ LP G+P SVT DY+ Y ++ S+Q +S I+G+L+++A+L +G+G A PTAA
Sbjct: 58 ILNVFLPAGYPHSVTDDYIEYQIYDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAAL 117
Query: 179 INWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VF 236
+ VL++ +G ++ I+ ++ G + K +RL AD+L ++A L+ L+P FP V
Sbjct: 118 LLNVLQESMGRIATILFAHRLGTSLEPECKLYRLAADILNDSAMVLDCLSPIFPKPVRVG 177
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ + + R+ + S+++ A FA N E+ AK +Q V G+++G + +
Sbjct: 178 LLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLAGMLVGSVVVS 237
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
+I S + +A ++ +H+ N + +++++ TLN RA++VFS + E
Sbjct: 238 YITSPLETWIA-LIILLIVHLGTNHAAVRAVKMTTLNRQRANIVFSYLFEDDRVLTPAEA 296
Query: 357 NDEEPLF 363
++EE +F
Sbjct: 297 SEEERIF 303
>gi|302755442|ref|XP_002961145.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
gi|300172084|gb|EFJ38684.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
Length = 343
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 79/242 (32%), Positives = 124/242 (51%), Gaps = 30/242 (12%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA-INWVL 183
LPEG+P SV+SDYL + W ++QG+++ I +L+TQALL IG+G A W L
Sbjct: 55 LPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWFL 114
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
+D G + I+ + + G + D K WRL AD + + A G A G
Sbjct: 115 RDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFMNDIA-----------------GVAGG 157
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
A R AAL Q FA ++N A+V AK +Q + +G+++G+ LA +++
Sbjct: 158 ATR--AALTQH---------FARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANNI 206
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
SF +T HMY N K+ ++ L ++N R ++V +L G+AP +E + +E
Sbjct: 207 FLMWTSFMSLTAFHMYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQESA 266
Query: 363 FP 364
P
Sbjct: 267 LP 268
>gi|116205700|ref|XP_001228659.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
gi|88182740|gb|EAQ90208.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
Length = 448
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 131/243 (53%), Gaps = 6/243 (2%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
LP G+P SVT DYL Y + S+Q AS I+ +LA +A+L +G+G + PT A I +
Sbjct: 45 LPAGYPHSVTDDYLPYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 104
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
D I ++ I+ ++ G+ + K +R AD+ ++A L++LTPA P +F +G
Sbjct: 105 GDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALP-YFPKLGVIVS 163
Query: 243 AG--RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS 300
AG RS + ++++ A FA N AE+ AK +Q V +G+++G +L +
Sbjct: 164 AGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVG-SLVVKMVE 222
Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
V+ +H+ N + +++++R+LN RA+LVF E+L G ++V+ E
Sbjct: 223 DKQVVWMLMVVLAGVHLAMNYHAVRAVKMRSLNRQRATLVFREWLDRGTVLTPEQVSQRE 282
Query: 361 PLF 363
+
Sbjct: 283 SIL 285
>gi|393217163|gb|EJD02652.1| DUF647-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 455
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 114/410 (27%), Positives = 184/410 (44%), Gaps = 35/410 (8%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAIN 180
+ LP GFPDSVT DY+ Y ++ S+Q +S I+G+L+++A+L G+G + A T A +
Sbjct: 48 EIFLPSGFPDSVTPDYIGYQIFDSLQAFSSSIAGLLSSRAVLEGFGVGNENASSTHAVLL 107
Query: 181 WVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG- 238
VL+D G L+ I+ + FG K +RL AD+ ++AF L+ L+PA P
Sbjct: 108 TVLQDATGRLATILFAWKFGPALAPEAKMYRLAADIFNDSAFVLDCLSPALPTQLRIAAL 167
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
AGA R+ + ++ FA N E+ AK +Q V G++ G + HI
Sbjct: 168 CLAGALRAICGVCGGGAKAALSVHFARTGNVGELNAKDASQETVIGLFGMLCGSFVMAHI 227
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL-LSGQAPPVKEVN 357
S +A ++T H+ N + +++ L TL+ R ++ + Y S EV
Sbjct: 228 TSRRTTWIALIFLLT-AHLVTNYLAVRAVTLTTLSRQRTNIAYGVYCEQSNNVLTPSEVA 286
Query: 358 DEEPLFPAFHFFKIKSANKSQL---LVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH 414
E F++ A + L L+ + + E+ R + GS S +
Sbjct: 287 RRE------RIFEVPGAIRDSLQGFLLGTCTICTSPEELLRRSRQGSMAS-------WKN 333
Query: 415 ALFSLYEDEGYIL----TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAG--IVA 468
AL + +E E Y+L + K + K A P+D LK+ A L R AG +
Sbjct: 334 ALVA-FEHEAYVLLGIRADFPCKILICFKHGAKPEDYLKAWVNAVELC---RIAGSRLQV 389
Query: 469 TSTSADCAPG--GRLEISLDYVQREFNHVKSDSASVGW--VTDGLIARPL 514
S + PG L+ SL+ +F + GW TD + P+
Sbjct: 390 GQESNNVTPGSFSALQKSLECTNTKFLDFLDKANKAGWDLKTDAICTMPV 439
>gi|157869108|ref|XP_001683106.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68223989|emb|CAJ04889.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 678
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 141/272 (51%), Gaps = 10/272 (3%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
+ +LP+G+P SVT D L ++LW Q +A ++ L+T+A+L +G+G+ A T+A ++
Sbjct: 70 RILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADLTSATLS 129
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP---HHFVF 236
W+++DG + ++ + + + K W+L AD + A LE+ P P F
Sbjct: 130 WMMQDGTRMIGSVVFAPLIPQGLECRAKTWQLVADFTTDIANLLELCAPWLPGGQTTFRV 189
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ AA ++ ++ + TR+ F FA R N A++ AK + V IG++LG+A+A
Sbjct: 190 VLVAASVVKALGSVCRNCTRASFTRHFALRNNAADIAAKASTRSSVGSFIGLVLGVAVAY 249
Query: 297 HI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+ +S L F++ + +H++ + +S + ++L+ LN R +Y L +
Sbjct: 250 IVPATSRSLNLVVFALGSVLHIFASYRSVRGVQLKHLNAPRLEWCLVQYSLFEAKVLIAS 309
Query: 356 VNDEEPL--FPAFHFFKIKSANKSQ-LLVLSS 384
D PL P +SAN ++ LL+L S
Sbjct: 310 KKDSGPLSVMPILD-VSPRSANAAERLLILPS 340
>gi|400593658|gb|EJP61584.1| DUF647 domain protein [Beauveria bassiana ARSEF 2860]
Length = 467
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 169/361 (46%), Gaps = 31/361 (8%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
E +++ LP GFP SV+SDYL Y + S+Q S I+ +LA +ALL +G+G +
Sbjct: 66 EWKKILSDSFLPVGFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSS 125
Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-P 231
T A + +L+D ++ I+ ++ FG + + K +R ADL + AF LE+ +P P
Sbjct: 126 ATFALLLTILRDATSRVATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLSP 185
Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
V + + A R+ + ++++ FA N AE+ AK +Q IG+++G
Sbjct: 186 LAKVVVLSVGEALRALCGVSAGASKAALSMHFAKHDNLAELNAKEASQETAVGLIGLLVG 245
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
+ + ++ ++ + H++ N +S+++ TLN RA+++F EYL +G+
Sbjct: 246 TIVVKVVEDHRS-VVSLMILLVFGHLWMNYLGVRSVQMATLNRQRATILFQEYLRTGKVL 304
Query: 352 PVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 411
EV E + + S+ V+S+ +IE G +S V
Sbjct: 305 SPAEVAKRE-------YIIVWSS------VVSNRHGQEVAKIEMAKSFGHAMSSSVGR-- 349
Query: 412 DAHALFSLYEDEGYILT------EHGGKFCVVLKESALPQDMLKSLFQASYLYW-LERNA 464
+ + + Y +T C++L + LP+ + + F+A + W ++R
Sbjct: 350 -----VKILDGDKYSITATPAVPHKKTTLCILLWDDVLPKQAIMAWFEAVEMAWVMDRQT 404
Query: 465 G 465
G
Sbjct: 405 G 405
>gi|226292974|gb|EEH48394.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
Pb18]
Length = 495
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 103/389 (26%), Positives = 181/389 (46%), Gaps = 43/389 (11%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
L + F LP G+P SV+ DY+ Y ++ S+Q +S I+G+LA++A+L +G+G A PT A
Sbjct: 68 LLIVF-LPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVA 126
Query: 178 AINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-V 235
++ VL++ IG ++ I+ ++ G + K +R AD+L ++A ++ L+PAFP H V
Sbjct: 127 LLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLSPAFPKHLRV 186
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
+ + R+ + S+++ + FA N E+ A G +
Sbjct: 187 VLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGS---------------LVV 231
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+HI + + LA + IH+ N + +++ + TLN RA++VFS G+A K+
Sbjct: 232 SHISTPLTTGLA-LIFLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTPKQ 290
Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA-- 413
+ E +F + K+++ + + L GS S N+ DA
Sbjct: 291 TSKCERIFERDGILRWKASSTTLGFCRIGNSF-------QELLCGSNGSYRANSIRDAPI 343
Query: 414 --HALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLK----SLFQASYLYWLERNAG 465
L L+E E YIL + +VLK P LK +L A +L +A
Sbjct: 344 DIPRLLRLFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAWSHALLVAKHLTKTTEDA- 402
Query: 466 IVATSTSADCAPGGRLEISLDYVQREFNH 494
+ +P + E+ L+ R+ N+
Sbjct: 403 -----EKNNNSPIKKGEVKLERQDRKKNY 426
>gi|413956563|gb|AFW89212.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
Length = 263
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 82/262 (31%), Positives = 130/262 (49%), Gaps = 9/262 (3%)
Query: 59 ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
A+ E + + V E GS + L ++ A + ASAS+ + W+
Sbjct: 3 ASHPEGEATAGWLTVEEWSGSSASAL----SRTAVLTASASSLTAHRF---GSRWERIGS 55
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
+ +PEGFP SVT DY+ + +W ++QG+++ I +L+TQALL AIG+G + A A
Sbjct: 56 RMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGA 115
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
W L+D G L I+ + + G + D N K WRL AD + + +++L+P FP +
Sbjct: 116 TFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIV 175
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
I RS + +TR+ FA N A++ AK +Q ++ +G+ LG+ LA
Sbjct: 176 IMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLGMLLAQ 235
Query: 297 HIGSSMPFALASFSVVTWIHMY 318
ASF +T HMY
Sbjct: 236 ITRGHALSVWASFLSLTMFHMY 257
>gi|302881823|ref|XP_003039822.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
77-13-4]
gi|256720689|gb|EEU34109.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
77-13-4]
Length = 444
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 71/251 (28%), Positives = 127/251 (50%), Gaps = 4/251 (1%)
Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIP 174
R++ + LP G+P SV+ DYL Y + S+Q S I+ +LA +ALL +G+G +
Sbjct: 50 MRKILMDAFLPVGYPHSVSDDYLAYQFFDSLQAFFSTITALLANRALLQGLGVGDANSSA 109
Query: 175 TAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PH 232
T A + VLKD + ++ I+ + FG + + K +R ADL + AF +E+ +P F P+
Sbjct: 110 TFAMLLTVLKDAMSRVATIIFAQQFGLRIEPDAKRYRFLADLFNDTAFFMELYSPYFGPY 169
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
+ + A R+ + ++++ FA N AE+ AK +Q +G+++G
Sbjct: 170 GKILALTSGEALRALCGVAGGASKAALSVHFAKHDNLAELNAKEASQETAVGLVGLLVGT 229
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
+ H+ + +V+ H++ N +S+ + LN RA++++ EYL SG
Sbjct: 230 FVVKHVEDHTS-VVYLMTVLVLAHLWMNYLGVRSVCMNVLNRQRATILYKEYLKSGNVLT 288
Query: 353 VKEVNDEEPLF 363
EV E +
Sbjct: 289 PNEVARRESIL 299
>gi|429848918|gb|ELA24353.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 440
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 75/250 (30%), Positives = 129/250 (51%), Gaps = 9/250 (3%)
Query: 99 SNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLA 158
S ++S LL++N R++F P G+PDSV+ DYL Y L+ S+Q S I+ +LA
Sbjct: 41 SPTTISRLLTMN-----IRQVFSDAFFPIGYPDSVSKDYLGYQLYDSLQAFFSTITSLLA 95
Query: 159 TQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLL 216
+A+L +G+G + T A + +LKDGI L+ I + FG + K +R AD+
Sbjct: 96 NRAILQGLGVGDADSSATYALLLTILKDGISRLATIGFAYRFGLVIEPECKKYRFLADIF 155
Query: 217 ENAAFGLEMLTPAFPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
++AF L++ +P F V A A R+ + ++++ FA R N +E+ AK
Sbjct: 156 NDSAFFLDLFSPLFGSWTKVAALVVAEALRAMCGVAAGASKAALSKHFALRGNLSELNAK 215
Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
+Q IG+++G + H+ S A ++ ++H+ N + ++L N
Sbjct: 216 ESSQETAVGLIGLIVGSFVVRHV-ESREAVFALMIILVFVHLGMNYLGVRCVQLNNFNQQ 274
Query: 336 RASLVFSEYL 345
RA++ F EY+
Sbjct: 275 RATIFFEEYM 284
>gi|449301217|gb|EMC97228.1| hypothetical protein BAUCODRAFT_147370 [Baudoinia compniacensis
UAMH 10762]
Length = 454
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 75/254 (29%), Positives = 137/254 (53%), Gaps = 9/254 (3%)
Query: 113 WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG- 171
W ++F LP GFP SVT DY+ Y ++ S+Q +S I G+L+++A+L ++G+G
Sbjct: 45 WQRLLDIF----LPAGFPHSVTVDYVEYQIYDSLQAFSSNIVGLLSSRAVLLSVGVGDAD 100
Query: 172 AIPTAAAINWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
A PT A + + ++ +G + I ++ +G + K +RL AD+ + AF L+ L+PA
Sbjct: 101 ASPTYALLLHIAQESLGRIGTIAFVALYGTALEPECKMYRLLADVFNDFAFVLDCLSPAI 160
Query: 231 PHHFVFIGAAAGAGRSAAALIQA-STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
P + + + + A + A S ++ A FA N E+ AK +Q V +G+
Sbjct: 161 PKYLRVVTLSFSSLLKALCGVAAGSAKASLSAHFAQWGNLGELNAKDSSQETVISLLGMW 220
Query: 290 LGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
+G + + + + + ++ IH+ N ++ +++ +RTLN RA+LVF +L G
Sbjct: 221 VGSLVLSWVRTPQA-TWTTLILLLAIHLETNRRAVRAVVMRTLNRQRATLVF-HHLQRGH 278
Query: 350 APPVKEVNDEEPLF 363
P +EV+ +E +F
Sbjct: 279 VPTPEEVSKQERIF 292
>gi|342881518|gb|EGU82408.1| hypothetical protein FOXB_07090 [Fusarium oxysporum Fo5176]
Length = 448
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 130/251 (51%), Gaps = 8/251 (3%)
Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIP 174
RE F +P G+P SV+SDYLNY + S+Q S I+ +LA +ALL +G+G +
Sbjct: 51 LREAF----MPVGYPHSVSSDYLNYQFFDSLQAFFSTITSLLANRALLQGLGVGDANSSA 106
Query: 175 TAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PH 232
T A + VLKD + ++ I+ ++ FG + + K +R ADL + AF LE+ +P F P+
Sbjct: 107 TFAMLLTVLKDAMSRIATIVFAHQFGLRIEPDAKRFRFLADLFNDTAFFLELYSPYFGPY 166
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
+ + R+ + ++++ FA N +E+ AK +Q IG+++G
Sbjct: 167 GKILALTSGETLRALCGVAAGASKAALSVHFAKHDNLSELNAKEASQETAVGLIGLLVGT 226
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
+ ++ L +V+ + H++ N +++ + LN RA+++F EYL +G
Sbjct: 227 IVVKYVEDHHS-VLFLMTVLVFAHLWMNYLGVRAVCMDNLNQQRATILFEEYLKTGNVLS 285
Query: 353 VKEVNDEEPLF 363
+EV E +
Sbjct: 286 PEEVAHRENIL 296
>gi|380480425|emb|CCF42443.1| hypothetical protein CH063_12442 [Colletotrichum higginsianum]
Length = 484
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 131/257 (50%), Gaps = 8/257 (3%)
Query: 110 NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG 169
K W E+F LP+G+P SVT DY Y ++ SVQ A I+G+++++A+ +G+G
Sbjct: 44 KKPWLRLLEIF----LPDGYPHSVTEDYAAYQIYDSVQAFAGSIAGMISSRAVWEGLGVG 99
Query: 170 KG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLT 227
A PT A + V+++ G + I ++ FG + K WRL AD+L + A L+ L+
Sbjct: 100 DSLASPTGAMLVQVIRESTGRFATIAFAHLFGTSIEAECKAWRLAADVLCDVAMVLDCLS 159
Query: 228 PAFPHHFVFIGAA-AGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
P FPH F+ + SA+ + +++S FA N E+ AK +Q I
Sbjct: 160 PFFPHGVRFLVLCFSSLLYSASGVAGNASKSSLSGHFAKWNNLGELNAKDASQETAISLI 219
Query: 287 GIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
G++ G + + + + A+ + +H++ N K ++++ R+LN RA++ FS
Sbjct: 220 GMISGTFVVSLL-TDQRATWATLIFLLALHLFLNWKGVRAVKSRSLNRQRANIAFSALFS 278
Query: 347 SGQAPPVKEVNDEEPLF 363
GQ V + E +F
Sbjct: 279 KGQVLTPHHVAERERIF 295
>gi|367028787|ref|XP_003663677.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
42464]
gi|347010947|gb|AEO58432.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
42464]
Length = 448
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 131/242 (54%), Gaps = 4/242 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
LP G+P +VT DYL Y + S+Q AS I+ +LA +A+L +G+G + PT A I +
Sbjct: 45 LPAGYPHTVTGDYLAYQTFDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 104
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-HFVFIGAAA 241
D I ++ I+ ++ G+ + K +R AD+ ++A L++LTPA P+ + + +A
Sbjct: 105 GDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYLPKLGVIVSA 164
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
G RS + ++++ A FA N AE+ AK +Q V +G+++G +L +
Sbjct: 165 GVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVG-SLVVRMVED 223
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
++ +H+ N ++ +S+++R+LN RA++VF E+L G ++V E
Sbjct: 224 KQVVWMLMVILAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGVVLTPEQVAQRES 283
Query: 362 LF 363
+
Sbjct: 284 IL 285
>gi|297825247|ref|XP_002880506.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
lyrata]
gi|297326345|gb|EFH56765.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
lyrata]
Length = 519
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 75/241 (31%), Positives = 125/241 (51%), Gaps = 12/241 (4%)
Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
P V+ +Y++Y W+ + V S VLATQ + AIG+G+ ++ ++AA NW+LKDG+
Sbjct: 123 LPKQVSDNYISYVKWKFLHRVFSSALQVLATQVMFRAIGIGQSRSLASSAAFNWILKDGL 182
Query: 188 GYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
G LS+ I ++ FD N K R +L + + G+E++TP FP +F+ + + A +
Sbjct: 183 GRLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQYFLLLASIANIAKQ 242
Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA------NHIGS 300
+ +T S + FA N EV AK + Q + +G++L + L + +
Sbjct: 243 ISLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAVLLNMLFQNNQRLQA 302
Query: 301 SMPFALAS-FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
S+PF L FS + +Y LK I L+TL R ++ ++ Q P EV++E
Sbjct: 303 SLPFVLYPIFSTFDLLGIYQGLK---HINLQTLTKDRLEIILERWIEFRQVPSPAEVSEE 359
Query: 360 E 360
E
Sbjct: 360 E 360
>gi|79561141|ref|NP_179928.2| uncharacterized protein [Arabidopsis thaliana]
gi|51970902|dbj|BAD44143.1| unnamed protein product [Arabidopsis thaliana]
gi|330252365|gb|AEC07459.1| uncharacterized protein [Arabidopsis thaliana]
Length = 520
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 73/241 (30%), Positives = 122/241 (50%), Gaps = 12/241 (4%)
Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
P V+ +Y++Y W+ + V S VLATQA+ AIG+G+ ++ ++AA NW+LKDG+
Sbjct: 124 LPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSLASSAAFNWILKDGL 183
Query: 188 GYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
G LS+ I ++ FD N K R +L + + G+E++TP FP +F+ + + A +
Sbjct: 184 GRLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQYFLLLASIANIAKQ 243
Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMV------SKAIGIMLGIALANHIGS 300
+ +T S + FA N EV AK + Q + A+ + + + +
Sbjct: 244 ISLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAVLLNMLFQHNQRLQA 303
Query: 301 SMPFALAS-FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
+PF L FS + +Y LK I L+TL R ++ ++ Q P EV++E
Sbjct: 304 CLPFVLYPIFSTFDLLGIYQGLK---HINLQTLTKDRLEIILERWIEFRQVPSPAEVSEE 360
Query: 360 E 360
E
Sbjct: 361 E 361
>gi|449690512|ref|XP_002160836.2| PREDICTED: UPF0420 protein-like, partial [Hydra magnipapillata]
Length = 172
Score = 111 bits (278), Expect = 9e-22, Method: Composition-based stats.
Identities = 67/174 (38%), Positives = 97/174 (55%), Gaps = 11/174 (6%)
Query: 74 WEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSV 133
+E GS K++ + +D F + + LSV + + LF LP+GFP+SV
Sbjct: 8 YESYGSNEEKIV--YYEDGFCKKTKN-------LSVIGFYTTLKLLFQNVFLPQGFPESV 58
Query: 134 TSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSK 192
+ DYL Y LW ++Q S I+G+LATQA+L A G+G A TAA I W+LKDG G + +
Sbjct: 59 SEDYLRYQLWDTLQAFCSSITGMLATQAMLKAYGVGDNTATVTAATITWMLKDGAGMIGR 118
Query: 193 IMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGR 245
I+ + G D + K WRL AD+L N A L+++ P F +FV I + R
Sbjct: 119 ILFAWKKGTQLDCDAKRWRLVADVLNNCAIMLDLVAPLFKQYFVLIACLSSVAR 172
>gi|317037751|ref|XP_001399065.2| hypothetical protein ANI_1_1452164 [Aspergillus niger CBS 513.88]
Length = 525
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 130/242 (53%), Gaps = 21/242 (8%)
Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIP 174
++ V+ LP G+P SVT DY Y L+ S+Q +S I+G+L+++A+L +G+G A P
Sbjct: 68 VQDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASP 127
Query: 175 TAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP-- 231
TAA + +L+D G ++ I+ ++ G + K +RL AD+ + A L+ L+P P
Sbjct: 128 TAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMILDCLSPMIPAG 187
Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
V + + AG R+ + S+++ A F+ N AEV AK +Q + IG+++G
Sbjct: 188 APRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVG 247
Query: 292 IALANHIGSSMPFALASFSVVTWI--------HMYCNLKSYQSIELRTLNPYRASLVFSE 343
+ + + +FS TWI H+ N + +S+++ +LN RA++VFS
Sbjct: 248 SFVVSRV--------TNFS-TTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFST 298
Query: 344 YL 345
L
Sbjct: 299 LL 300
>gi|219127360|ref|XP_002183905.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404628|gb|EEC44574.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 891
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 129/273 (47%), Gaps = 30/273 (10%)
Query: 99 SNASLSSLLSVNKLWDECRELFVQF----MLPEGFPDSVTSDYLNYSLWRSVQGVASQIS 154
S+ + S LS W + ++F +P G+P SV YL+Y ++ S+QG++S +
Sbjct: 409 SSHTFYSNLSSISFWQHLSQTPIRFATELFMPVGYPYSVREGYLSYQIYDSIQGLSSYLR 468
Query: 155 GVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLF 212
VL + +L A G+G A AA+ W LKDG+G + ++ S H D K +RLF
Sbjct: 469 AVLCSAQVLQAAGVGDANATAIMAALTWALKDGLGMIGGLIFSYGVASHLDAYVKEFRLF 528
Query: 213 ADLLENAAFGLEMLTPAFPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAE 271
AD++ + L+ML P P ++I +AA + + +T+S FA + N A+
Sbjct: 529 ADVINDVGMLLDMLAPYVPRADLIWISSAATLCKVMCGIAAGATKSSITQHFALQGNMAD 588
Query: 272 VIAKGEAQGMVSKAIGIMLGIALANHIG-----------------SSMP------FALAS 308
+ AK Q + IG++ G+ LA ++ S +P
Sbjct: 589 LNAKEATQETLVNLIGMVCGVTLARYLSQLEAGNEEGDTDRIGYRSWIPCISHDSIQWMV 648
Query: 309 FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
F+++T +H++ N K + L TLN R +V
Sbjct: 649 FTILTLLHIWSNWKGVSLLRLETLNRERTKVVL 681
>gi|171678219|ref|XP_001904059.1| hypothetical protein [Podospora anserina S mat+]
gi|170937179|emb|CAP61836.1| unnamed protein product [Podospora anserina S mat+]
Length = 448
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/242 (30%), Positives = 131/242 (54%), Gaps = 4/242 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP G+P +VT+DYL Y + S+Q AS I+ +LA++A+L +G+G A PT A I +
Sbjct: 47 LPAGYPHTVTTDYLPYQTYDSLQAFASSITSLLASRAVLEGLGVGSSEASPTGALILKIT 106
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-HFVFIGAAA 241
D I ++ I+ ++ G+ + K +R AD+ +AA L++LTPA P+ + + +A
Sbjct: 107 GDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDAAQFLDLLTPALPYLPKLGVIVSA 166
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
G RS + ++++ A FA N AE+ AK +Q V +G+++G +L +
Sbjct: 167 GVLRSLCGVAANASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLVG-SLVVRLVED 225
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
++ H+ N ++ +++ + +LN RA++VF E+L SG +V+ E
Sbjct: 226 KHKVWGLMIILAGCHLAMNYRAVRAVRMTSLNRQRATIVFREWLESGTVLNPAQVSQRES 285
Query: 362 LF 363
+
Sbjct: 286 IL 287
>gi|67539168|ref|XP_663358.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
gi|40743657|gb|EAA62847.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
Length = 559
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/235 (31%), Positives = 130/235 (55%), Gaps = 10/235 (4%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
R++ ++ LP G+P SV+ DY Y ++ S+Q +S I+G+LA++A+L +G+G A PT
Sbjct: 66 RDVLIEVFLPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPT 125
Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
+A + +L+D G ++ I+ ++ G + K +R AD+ + A L+ +P P F
Sbjct: 126 SALLLHILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGF 185
Query: 235 --VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
V + +AAG R+ + S+++ A FA N AEV A+ V IG+++G
Sbjct: 186 NRVTVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAE-----TVISLIGMLVGS 240
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
+ +HI +S + + +H+ N + +S+++ TLN RA++VFS L S
Sbjct: 241 FVVSHI-TSFTATWLTLVFLLSMHLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 294
>gi|367049005|ref|XP_003654882.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
gi|347002145|gb|AEO68546.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
Length = 451
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 127/242 (52%), Gaps = 4/242 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
LP G+P +VT DYL Y + S+Q AS I+ +LA +A+L +G+G + PT A I +
Sbjct: 46 LPAGYPHTVTDDYLAYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 105
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH-FVFIGAAA 241
D I ++ I+ ++ G+ + K +R ADL ++A L++LTPA P+ + + +A
Sbjct: 106 GDTISRIATILFAHRMGQAIEPECKFYRFLADLFNDSAQFLDLLTPALPYFPKLAVIVSA 165
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
G RS + ++++ A FA N AE+ AK +Q V +G++ G + +
Sbjct: 166 GVLRSLCGVAAGASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLAGSLVVRLVEDK 225
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
L ++ +H+ N ++ + + +R+LN RA++VF E+L G +V E
Sbjct: 226 QLVWLLMVALAA-VHLIMNYRAVRCVRMRSLNRQRATIVFREWLDHGAVLTPAQVARRES 284
Query: 362 LF 363
+
Sbjct: 285 IL 286
>gi|115434452|ref|NP_001041984.1| Os01g0141600 [Oryza sativa Japonica Group]
gi|54290177|dbj|BAD61065.1| unknown protein [Oryza sativa Japonica Group]
gi|113531515|dbj|BAF03898.1| Os01g0141600 [Oryza sativa Japonica Group]
Length = 323
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 70/218 (32%), Positives = 114/218 (52%), Gaps = 7/218 (3%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL----GKGAIPTAAA 178
F+LP GFP SV+ DYL Y LW+ V I L T LL A+G+ G A +AAA
Sbjct: 106 FVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAASAAA 165
Query: 179 INWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
I WV KDGIG ++++ FG FD +PK WR++AD + +A E++TP +P +F+ +
Sbjct: 166 IRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYFLPL 225
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ ++ + + FA N EV AK E + ++ +G+ +G+ + +
Sbjct: 226 ASLGNLAKAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVFIMDT 285
Query: 298 --IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
I SS P ++ V +H++ +S ++ RT+
Sbjct: 286 SGIKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTVR 323
>gi|7327810|emb|CAB82267.1| putative protein [Arabidopsis thaliana]
Length = 447
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 134/292 (45%), Gaps = 31/292 (10%)
Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
I WV KDGIG L ++++ FG FD +PK WR++AD + +A ++ T +P F+ +
Sbjct: 120 IRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFLLL 179
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
+ ++ A ++ + FA N EV AK E + ++ IG+ GI + +
Sbjct: 180 ASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGILIIDT 239
Query: 298 IG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
G S PF L +++ + +H++ +S ++ T+N RA ++ +++ P +
Sbjct: 240 PGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSVVPGYVD 299
Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
N E + F K + V +E +LS + + A
Sbjct: 300 CNKRENILLWQRFMKPRII--------------FGVSLE-------ELSGLEKSVSKVKA 338
Query: 416 LFSLYEDEGYILT----EHGGKFCVVLKESALPQDMLKSLFQASYLYWLERN 463
L +Y E YILT +F V K +A +D+L+ L+QA YWLE N
Sbjct: 339 LLKMYTKEKYILTLNKLNKDTEFSVSFKVNATSRDVLRCLWQA---YWLEEN 387
>gi|380471002|emb|CCF47486.1| hypothetical protein CH063_04176 [Colletotrichum higginsianum]
Length = 478
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 128/249 (51%), Gaps = 6/249 (2%)
Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK- 170
L+ ++ LP G+PDSVT DY+ Y + S+Q S I+G+L+ +A+L +G+G
Sbjct: 56 LFTNMNQILFDAFLPIGYPDSVTPDYIGYQAYDSLQAFFSTITGLLSNRAILQGLGVGDP 115
Query: 171 GAIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 229
+ T A + +LKDGI ++ I + FG + K +R AD+ ++AF L++ +P
Sbjct: 116 NSSATYALLLTILKDGISRVATIAFAYRFGLVIEPECKRYRFLADIFNDSAFFLDLFSPY 175
Query: 230 F-PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
F P V A A R+ + ++++ FA R N +E+ AK +Q +G+
Sbjct: 176 FDPWTKVAAIVLAEALRAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAIGLVGL 235
Query: 289 MLGIALANHI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
++G + + G F L F V ++H+ N + ++L TLN RA+++F Y +
Sbjct: 236 LVGSIVVRFVEGREAVFVLMVFLV--FVHLAMNYFGVRCVQLDTLNQQRATILFDHYAQT 293
Query: 348 GQAPPVKEV 356
+ K+V
Sbjct: 294 KRVLTPKQV 302
>gi|149067629|gb|EDM17181.1| similar to cDNA sequence BC017158, isoform CRA_a [Rattus
norvegicus]
Length = 246
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 74/115 (64%), Gaps = 2/115 (1%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
+LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L +G+G A +AA W+
Sbjct: 75 LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
+KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTM 189
>gi|432112420|gb|ELK35212.1| hypothetical protein MDA_GLEAN10003470, partial [Myotis davidii]
Length = 638
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 114/225 (50%), Gaps = 2/225 (0%)
Query: 144 RSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAIN-WVLKDGIGYLSKIMLSNF-GRH 201
+ V AS +SG LAT A+L IG+G + +AA W+LKD G L +I+ + + G
Sbjct: 267 KPVWAFASSLSGSLATHAVLLGIGVGNASASVSAATATWLLKDSTGMLGRIIFAWWKGSK 326
Query: 202 FDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYA 261
D N K WRLFAD+L + A LE++ P P F + + ++ +TR+
Sbjct: 327 LDCNAKQWRLFADILNDTAMFLEIMAPLLPICFTMTICISNLAKCIVSVAGGATRAALTM 386
Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
A R N A+V AK +Q + G+++ + + + F+L F ++T +H+Y N
Sbjct: 387 HQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPLVSDCPSFSLGCFFLLTALHVYANY 446
Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
++ +++ + TLN R LV +L G+ N EPL+ F
Sbjct: 447 RAVRALVIETLNEGRLWLVLKHFLQRGEVLDPTSANQMEPLWTGF 491
>gi|159467675|ref|XP_001692017.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278744|gb|EDP04507.1| predicted protein [Chlamydomonas reinhardtii]
Length = 216
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 105/192 (54%), Gaps = 14/192 (7%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT- 175
RELF LP G+P +VT DYL+Y LW + +S L T +LL A+G+ G T
Sbjct: 2 RELF----LPPGYPQTVTPDYLSYQLWSVPTHITGHLSHALVTSSLLSAVGVSAGPAATV 57
Query: 176 --AAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
+A+I W++KDG+G L ++++ S F FD +P+ WR+ A+LL GLE+ T +PH
Sbjct: 58 ALSASIKWIVKDGVGALGRLIVGSRFSAEFDEDPRRWRMVAELLSTCGMGLEVATVLYPH 117
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCF---YAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
+F+ + A +G+ A AL + + F FA +N V AK E + ++ G+
Sbjct: 118 YFLLL---ACSGKFAQALGKGMGKPVFRVIQTHFARAQNVGAVAAKEEVWEVTAQMAGLA 174
Query: 290 LGIALANHIGSS 301
+A+ + +S
Sbjct: 175 ASVAVLRALEAS 186
>gi|358373469|dbj|GAA90067.1| DUF647 domain protein [Aspergillus kawachii IFO 4308]
Length = 527
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 83/263 (31%), Positives = 141/263 (53%), Gaps = 26/263 (9%)
Query: 95 VASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQIS 154
VA AS+ +L++ S + L + V+ LP G+P SVT DY Y L+ S+Q +S I+
Sbjct: 52 VAHASSTTLAAWSSKSVL-----DFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIA 106
Query: 155 GVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLF 212
G+L+++A+L +G+G A PTAA + +L+D G ++ I+ ++ G + K +RL
Sbjct: 107 GLLSSRAVLQGVGVGNADASPTAALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLA 166
Query: 213 ADLLENAAFGLEMLTPAFP--HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFA 270
AD+ + A L+ L+P P V + + AG R+ + S+++ A F+ N A
Sbjct: 167 ADVFNDLAMILDCLSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLA 226
Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWI--------HMYCNLK 322
EV AK +Q + IG+++G + + + SFS TWI H+ N
Sbjct: 227 EVNAKDSSQETIISLIGMLVGSFVVSRV--------TSFS-TTWICLLMLLALHLSLNYA 277
Query: 323 SYQSIELRTLNPYRASLVFSEYL 345
+ +S+++ +LN RA++VFS L
Sbjct: 278 AVRSVQMTSLNRQRANIVFSTLL 300
>gi|428170130|gb|EKX39058.1| hypothetical protein GUITHDRAFT_114933 [Guillardia theta CCMP2712]
Length = 405
Score = 108 bits (271), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 85/310 (27%), Positives = 140/310 (45%), Gaps = 62/310 (20%)
Query: 121 VQFMLPEGF-PDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
V F + + F P V+ DY ++++WR +Q + S V TQ+LL A+G+ + AAA
Sbjct: 90 VAFKVKQSFLPRGVSEDYYSFTIWRILQRIVSSTVSVFGTQSLLLALGVKTNKVGVAAAS 149
Query: 180 NWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
WV + G K++ + +GR FD + K WR
Sbjct: 150 GWVFANAFGKFGKMIFAARWGRDFDSDAKRWR---------------------------- 181
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAAR-RNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
++TR FY FA R N ++ AKG+AQ V+ +G+++GI LA
Sbjct: 182 --------------SATRGTFYKQFAGRNENLGDITAKGDAQVAVADLVGLVVGIELARL 227
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
G++ S+ V++ + LKS +I R LN RA+++ S ++ G +++
Sbjct: 228 AGTNALCVWGSYVVLSALDFIFILKSLDTIVFRFLNLERATMLASAFVSEGIIKSPADIS 287
Query: 358 DEEPLFPAFHFFKIKSANK----------SQLLVLSSEAKD-----AAVEIEH-RLQLGS 401
EP+ K++ K SQ +VL AKD + + +EH RL L S
Sbjct: 288 RSEPVLSRPR-HKLRETFKGYWASTFRSISQCVVLKQSAKDLDILQSLLALEHFRLHLSS 346
Query: 402 KLSDVVNNKE 411
+ SD ++ +E
Sbjct: 347 RQSDSLDFEE 356
>gi|408393145|gb|EKJ72412.1| hypothetical protein FPSE_07436 [Fusarium pseudograminearum CS3096]
Length = 427
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 120/242 (49%), Gaps = 4/242 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
+P G+P SV+SDYLNY + S+Q S I+ +LA +ALL +G+G + T A + VL
Sbjct: 70 MPVGYPHSVSSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVL 129
Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAA 241
KD I ++ I + FG + + K +R ADL + AF +E+ +P P+ +
Sbjct: 130 KDAISRVATIAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTG 189
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
A R+ + ++++ FA N AE+ AK +Q IG+ +G + ++
Sbjct: 190 EALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTIVVKYVEDH 249
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
V+ H++ N + + + LN RA+++F EYL +GQ +EV E
Sbjct: 250 NTVVFLMI-VLVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGQIMTPEEVAKRES 308
Query: 362 LF 363
+
Sbjct: 309 IL 310
>gi|297740212|emb|CBI30394.3| unnamed protein product [Vitis vinifera]
Length = 462
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 153/356 (42%), Gaps = 42/356 (11%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYA-----IGLGKGAIPTAA 177
F+LP GFP SV+ DYL Y L + V + I L T +LL A
Sbjct: 79 FILPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKASCWSWFFFSNHCCCFCC 138
Query: 178 AINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
+DGIG + ++ + FG FD +PK WR++ADL+ +A ++ T +P +F+
Sbjct: 139 CFCCCHQDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLYPAYFLQ 198
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ + ++ A ++ + FA N EV AK E + ++ +G+ LGI +
Sbjct: 199 LASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMVLT 258
Query: 297 H--IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
+ +S P ++ + +H++ +S + +++N RA ++ ++L P
Sbjct: 259 SPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPGYV 318
Query: 355 EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE--- 411
+ N +E + F K R+ G L +++ +
Sbjct: 319 DCNRKEKILSWQRFLK------------------------PRIIFGVSLEELIGGERPVS 354
Query: 412 DAHALFSLYEDEGYILT--EHGGKFCVVL--KESALPQDMLKSLFQASYLYWLERN 463
L LY E YIL + G F V + K A +L+S++Q+ YWL+ N
Sbjct: 355 KVRTLLKLYSQEKYILVVKQQGTDFEVFVSFKVGATSLSVLRSIWQS---YWLQEN 407
>gi|303280291|ref|XP_003059438.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459274|gb|EEH56570.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 568
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 79/253 (31%), Positives = 125/253 (49%), Gaps = 16/253 (6%)
Query: 113 WDECRELFVQ-FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK- 170
WD + ++ LP G+PD+V+SDY + W + + VL +Q+LL A+G+G
Sbjct: 138 WDPRKSPRLRAVFLPAGYPDTVSSDYATFIRWHLASLMFRNVLEVLTSQSLLVALGMGST 197
Query: 171 -GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
GA+P AAA WVLKDG+G + ++ F G+ +D +PK W + LE+ A LE++TP
Sbjct: 198 PGALPLAAATKWVLKDGVGSFATLLAGVFGGQKYDEDPKRWWAVTNSLEDVARVLELVTP 257
Query: 229 AFPHHFVFIGAAAGAGRSAAALIQAS-TRSCFYAGFAARRNFAEVIAKGEAQGM-----V 282
A+P F+ + A+A R+ A + S F +NF +V AK EA G+ V
Sbjct: 258 AYPALFLPLAASATFVRAGALTGRGSLINGTFMQHLGRNQNFGDVRAKMEALGIAIFRGV 317
Query: 283 SKAIGIMLGIALANHIGSSMP------FALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
S A G A ++M ++ V H++C K+ + LN +R
Sbjct: 318 SGAFGADAAAMAAEGGETAMSELTRYGGVFGAYGAVLSSHLFCCWKAADVLRFDALNRHR 377
Query: 337 ASLVFSEYLLSGQ 349
+ EY+ G+
Sbjct: 378 LTRAAEEYVRGGR 390
>gi|148685696|gb|EDL17643.1| cDNA sequence BC017158, isoform CRA_a [Mus musculus]
Length = 226
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 74/115 (64%), Gaps = 2/115 (1%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
+LP+GFPDSV+ DYL Y LW SVQ AS +SG LATQA+L +G+G A +AA W+
Sbjct: 75 LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
+KD G L +I+ + + G D N K WRLFAD+L + A LE++ P +P F
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTM 189
>gi|223974077|gb|ACN31226.1| unknown [Zea mays]
Length = 207
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 69/196 (35%), Positives = 106/196 (54%), Gaps = 2/196 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
+PEGFP SVT DY+ + +W ++QG+++ I +L+TQALL AIG+G + A A W L
Sbjct: 6 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATFQWFL 65
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
+D G L I+ + + G + D N K WRL AD + + +++L+P FP + I
Sbjct: 66 RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 125
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
RS + +TR+ FA N A++ AK +Q ++ +G+ LG+ LA
Sbjct: 126 LSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLGMLLAQITRGHA 185
Query: 303 PFALASFSVVTWIHMY 318
ASF +T HMY
Sbjct: 186 LSVWASFLSLTMFHMY 201
>gi|346978906|gb|EGY22358.1| hypothetical protein VDAG_03796 [Verticillium dahliae VdLs.17]
Length = 455
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 141/286 (49%), Gaps = 28/286 (9%)
Query: 100 NASLSSLLSVNKLWDECRELFV----------------QFM----LPEGFPDSVTSDYLN 139
+A ++S ++ + + R+LF QF+ LP +PDSV+ DY+
Sbjct: 37 DAQVTSYRTIGTILEPLRQLFSPANAVHRVESLTRGTPQFLRDAFLPVDYPDSVSPDYIK 96
Query: 140 YSLWRSVQGVASQISGVLATQALLYAIGLG--KGAIPTAAAINWVLKDGIGYLSKIMLS- 196
Y ++ S Q S I+G+LA +A+L +G+G G++ A + LKDG+ L+ I +
Sbjct: 97 YQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHALLLT-CLKDGVSRLATITFAY 155
Query: 197 NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-HFVFIGAAAGAGRSAAALIQAST 255
FG + N K +R ADL + AF LE+ P F + I +A + R+ + ++
Sbjct: 156 RFGSVIEANAKKYRFLADLFNDTAFFLELANPFFAGWAKILILVSAESLRALCGIAAGAS 215
Query: 256 RSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS-MPFALASFSVVTW 314
++ FA R N AE+ K +Q IG++ G A+ +++ + F L F V +
Sbjct: 216 KAALSTHFARRNNLAELNTKEASQETAIGLIGLLAGTAIIHYVQNRGAVFFLTIFLV--F 273
Query: 315 IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
IH+ N +++L T+N RA++V YL + + E + +E
Sbjct: 274 IHLGMNYLGVCALQLDTVNRQRATIVMQHYLRTQEVLTPAEASKKE 319
>gi|358397603|gb|EHK46971.1| hypothetical protein TRIATDRAFT_291240 [Trichoderma atroviride IMI
206040]
Length = 468
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 78/273 (28%), Positives = 136/273 (49%), Gaps = 10/273 (3%)
Query: 91 DAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVA 150
D ++ SN+ S + + E R+ + LP GFP SV+ DYL Y + S+Q
Sbjct: 34 DGHILPKTSNSLFS------RSFREWRKAIMDSFLPVGFPHSVSKDYLAYQTFDSLQAFF 87
Query: 151 SQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKG 208
S IS +LA +ALL +G+G + T A + ++KD + ++ I+ ++ FG + + K
Sbjct: 88 STISSLLANRALLQGLGVGDASSSATFALLLTIMKDAMSRIATIVFAHRFGLRIEPDAKR 147
Query: 209 WRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG-AGRSAAALIQASTRSCFYAGFAARR 267
+R AD+ ++AF LE+ +P I + G A R+ + ++++ FA
Sbjct: 148 YRFLADVFNDSAFFLELYSPYLGSWGKVIALSTGQALRALCGVAAGASKAALSVHFAKHD 207
Query: 268 NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSI 327
N AE+ AK +Q IG+++G + I + +V H+ N +S+
Sbjct: 208 NLAELNAKEASQETAVGLIGLLVGTLVVKMIEDHSSVVILMILLVI-AHLGTNYLGVRSV 266
Query: 328 ELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
+ TLN RA++VF E+L SG+ ++V + E
Sbjct: 267 CMATLNRQRATIVFEEFLRSGKVLTPEQVAERE 299
>gi|125568971|gb|EAZ10486.1| hypothetical protein OsJ_00318 [Oryza sativa Japonica Group]
Length = 459
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/228 (32%), Positives = 117/228 (51%), Gaps = 17/228 (7%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL----GKGAIPTAAA 178
F+LP GFP SV+ DYL Y LW+ V I L T LL A+G+ G A +AAA
Sbjct: 123 FVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAASAAA 182
Query: 179 INWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
I WV KDGIG ++++ FG FD +PK WR++AD + +A E++TP +P +F+ +
Sbjct: 183 IRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYFLPL 242
Query: 238 GAAAGAGRS---AAALIQASTR-------SCFYAGFAARRNFAEVIAKGEAQGMVSKAIG 287
+ + + L QA R FA N EV AK E + ++ +G
Sbjct: 243 ASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLG 302
Query: 288 IMLGIALANH--IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
+ +G+ + + I SS P ++ V +H++ +S ++ RT++
Sbjct: 303 LSIGVFIMDTSGIKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTVD 350
>gi|414586420|tpg|DAA36991.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 306
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 78/320 (24%), Positives = 151/320 (47%), Gaps = 33/320 (10%)
Query: 192 KIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALI 251
K++ S+ G D PK WR+ AD+L + LE+++P P F+ + + A +
Sbjct: 6 KLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVA 65
Query: 252 QASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSV 311
+TR Y+ FA N +++ AKGEA + +GI GI LA+ + S+ L + +
Sbjct: 66 ARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPL 125
Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN-DEEPLFPAFHFFK 370
++ +H+Y ++ ++ + TLNP R +++ ++++ SG+ E+ E+ LFP
Sbjct: 126 LSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLLFP------ 179
Query: 371 IKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEH 430
+ +E +++G L V + + L + + E ++L++
Sbjct: 180 -----------------NRVIEEAGSVKIGQPLRKVFSPRL-VEQLRATFPGEKFLLSQK 221
Query: 431 GGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQR 490
+ +VL+ SA +D L+ A++ +ER+ + S D A L + + ++R
Sbjct: 222 SNRTYMVLERSASGEDALRGWLVAAFASEMERSG-----AGSRDDAV---LSEAYEKMER 273
Query: 491 EFNHVKSDSASVGWVTDGLI 510
F S+ S GW TD +
Sbjct: 274 VFPTFVSEVRSRGWYTDQFL 293
>gi|387198702|gb|AFJ68865.1| hypothetical protein NGATSA_2057300, partial [Nannochloropsis
gaditana CCMP526]
Length = 277
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 79/136 (58%), Gaps = 2/136 (1%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPT 175
+ L LP GFP SV +YL Y W VQ +++ + G+LATQA+L +G+G+ GA P
Sbjct: 138 KTLLHTVFLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGILATQAILEGVGVGREGATPL 197
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
AA + W+ +DG +S ++ ++F +F VN K WRLFAD + + L+ML P FP +F
Sbjct: 198 AATLQWITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFADFMVDIGITLDMLAPLFPKNF 257
Query: 235 VFIGAAAGAGRSAAAL 250
+ A +S +
Sbjct: 258 LLCICLASVCKSLCGI 273
>gi|422292654|gb|EKU19956.1| hypothetical protein NGA_2057300, partial [Nannochloropsis gaditana
CCMP526]
Length = 274
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 79/136 (58%), Gaps = 2/136 (1%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPT 175
+ L LP GFP SV +YL Y W VQ +++ + G+LATQA+L +G+G+ GA P
Sbjct: 135 KTLLHTVFLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGILATQAILEGVGVGREGATPL 194
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
AA + W+ +DG +S ++ ++F +F VN K WRLFAD + + L+ML P FP +F
Sbjct: 195 AATLQWITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFADFMVDIGITLDMLAPLFPKNF 254
Query: 235 VFIGAAAGAGRSAAAL 250
+ A +S +
Sbjct: 255 LLCICLASVCKSLCGI 270
>gi|255574377|ref|XP_002528102.1| conserved hypothetical protein [Ricinus communis]
gi|223532491|gb|EEF34281.1| conserved hypothetical protein [Ricinus communis]
Length = 485
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 159/357 (44%), Gaps = 43/357 (12%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA---- 178
F+LP GFP SV+ DY Y L + V I L T +LL A+G+G TAAA
Sbjct: 101 FILPAGFPGSVSDDYFQYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFTGSTAAAAASA 160
Query: 179 --INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
I WV KDGIG L ++ + FG FD +PK WR++AD + +A +++T +P +F+
Sbjct: 161 AAIRWVSKDGIGALGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLITQVYPAYFL 220
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
+ + ++ A ++ + FA N EV AK E + ++ +G+ LGI +
Sbjct: 221 PLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVGAQLLGLALGILIL 280
Query: 296 NHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
+ G S P +++ + +H++ S ++ T+N RA L+ ++L
Sbjct: 281 DTPGLVKSFPALASTWLSMRLLHLWLRYLSLSVLQFDTINLKRARLLVKSHILHSVVLGC 340
Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
+ N EE + F K ++ G L ++V + A
Sbjct: 341 VDCNREENILLWETFTK------------------------PQITFGVPLQEMVGGERSA 376
Query: 414 ---HALFSLYEDEGYILT--EHGGKFCVVL--KESALPQDMLKSLFQASYLYWLERN 463
L LY E Y+L + G F V + K A L+S++Q YWL N
Sbjct: 377 AQVKTLLKLYTKENYLLVVNQQKGDFEVFVSFKVGATSMSALRSVWQT---YWLHEN 430
>gi|412993086|emb|CCO16619.1| predicted protein [Bathycoccus prasinos]
Length = 454
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/270 (30%), Positives = 131/270 (48%), Gaps = 24/270 (8%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG----K 170
+ R F + LP+GFP++V+ DYL + W+ + + VL+ Q+LL A+G+
Sbjct: 31 QFRAQFKRIFLPDGFPNTVSRDYLPWLKWQMLSLFFRDVLEVLSAQSLLVAVGMDVNQAN 90
Query: 171 GAIPTAAAINWVLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPA 229
A P A A WVLKDG G L+ + L + FD NPK + A+ LE+ + +E+LTP
Sbjct: 91 AAAPVAGAAKWVLKDGTGGLATLALGALNTKKFDENPKFFWSTANSLEDVSRAMELLTPL 150
Query: 230 FPHHFVFIGAAAGAGRSAAALIQAS-TRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
FP HF+ I A RSAA ++S F F+ N ++ K E QG + +
Sbjct: 151 FPSHFLLIAGTATFVRSAALTGRSSLINGTFMTHFSRNENAGDIRTKLEVQGRWLQIAAL 210
Query: 289 MLGIAL-------------ANHIGSSMP---FALASFSVVTWIHMYCNLKSYQSIELRTL 332
+GIAL N +G M +A++ + + H +C + +S+ TL
Sbjct: 211 PVGIALFRAVTNGLELDFATNSLGQEMKGVTVGVATYGSMLFAHSFCCYMAAKSLNFETL 270
Query: 333 NPYRASLVFSEYLLSG--QAPPVKEVNDEE 360
+ RA + ++ S V+EV+D E
Sbjct: 271 SRKRALIQAKTFVESNGKNVLSVEEVSDVE 300
>gi|302847291|ref|XP_002955180.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
nagariensis]
gi|300259472|gb|EFJ43699.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
nagariensis]
Length = 1055
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 147/323 (45%), Gaps = 22/323 (6%)
Query: 57 RTATSSEDDGNKEYDAVWE---VKGSKRTKLIPDFTKDAFVVASASNASLS-SLLSVNKL 112
R A+ +G+ A E V K +L+P + D + + S A+L S ++ +L
Sbjct: 217 REASERHSNGHPSAPASLERLPVAVKKGDQLLP-YVWDGRRITAMSEAALQDSDVTTGRL 275
Query: 113 WDECRELFVQFMLPEGF---PDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG 169
+ L VQ L E F P VT DY + WR Q S + ALL A GLG
Sbjct: 276 RNWL--LGVQRQLKEAFLPDPRDVTPDYWEWLRWRLTQRFFSSTMQNFSFSALLMATGLG 333
Query: 170 KGAIPTAAA-INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLT 227
A+A INW+LKDG+ + ++ +S +FG+ FD + K R L+ A E T
Sbjct: 334 AKKAFAASAAINWLLKDGVSRIVRMSVSTSFGQTFDADLKRMRFITSLIFTACMAGEFAT 393
Query: 228 PAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSK--A 285
P +P HFV + + + GR+ +T+ F A A N A++ +K +AQ MV A
Sbjct: 394 PFYPQHFVALASISSVGRAVGLSAFVATQPAFQAALATGGNLADLTSKNQAQHMVMDMLA 453
Query: 286 IGIMLGIALANHIGSSMP-----FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
+G+ G+ + MP + + + C ++++LRTLN RA ++
Sbjct: 454 LGVSAGLTW---LCRGMPRGGLLLPAVMYPLCAAGDLTCIWHELKAVQLRTLNRERAEML 510
Query: 341 FSEYLLSGQAPPVKEVNDEEPLF 363
++ G P E++ E L
Sbjct: 511 IERWMRRGAVPDAAEISAAENLI 533
>gi|358380832|gb|EHK18509.1| hypothetical protein TRIVIDRAFT_14438, partial [Trichoderma virens
Gv29-8]
Length = 451
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/251 (29%), Positives = 130/251 (51%), Gaps = 4/251 (1%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
E R++ LP GFP SV+ DYL Y ++ S+Q S IS +LA +ALL +G+G +
Sbjct: 65 EWRKVLFDSFLPVGFPHSVSKDYLAYQMFDSLQAFFSTISSLLANRALLQGLGVGDASSS 124
Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
T A + +LKD + ++ I+ ++ FG + + K +R AD+ ++AF LE+ +P
Sbjct: 125 ATFALLLTILKDAMSRIATIVFAHRFGLRIEPDAKRYRFLADIFNDSAFFLELYSPYLGS 184
Query: 233 HFVFIGAAAG-AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
I + G A R+ + ++++ FA + N AE+ AK +Q IG+++G
Sbjct: 185 WGKVIALSTGEALRALCGVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVG 244
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
+ I + +V + H+ N +S+ + TLN RA+++FSEYL +
Sbjct: 245 TLVVKTIEDHSSVVILMILLV-FAHLGMNYLGVRSVCMTTLNRQRATILFSEYLKNRVVL 303
Query: 352 PVKEVNDEEPL 362
++V + E +
Sbjct: 304 TPEQVAERESI 314
>gi|224090990|ref|XP_002309136.1| predicted protein [Populus trichocarpa]
gi|222855112|gb|EEE92659.1| predicted protein [Populus trichocarpa]
Length = 500
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/412 (25%), Positives = 185/412 (44%), Gaps = 30/412 (7%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIG----LGKGAI 173
++ F+LP GFP SV+ DYL Y + + + I L T +LL A+G G A
Sbjct: 100 DILKDFILPAGFPGSVSDDYLQYMVLQFPTNITGWICHTLVTSSLLKAVGAGSFTGTDAA 159
Query: 174 PTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
+AAAI WV KDGIG L ++ + FG FD +PK WR++AD + +A ++ T +P
Sbjct: 160 ASAAAIRWVSKDGIGALGRLFIGGRFGDLFDDDPKQWRMYADFIGSAGSIFDLTTQVYPA 219
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
+F+ + + ++ A ++ + FA N EV AK E + ++ +G+ LGI
Sbjct: 220 YFLPLASLGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEVAAKEEVWEVGAQLLGLALGI 279
Query: 293 ALAN--HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
+ + + S P +++ + +H++ +S + T+N RA ++ ++L +
Sbjct: 280 LILDTPDLVKSFPLLTSTWMSMRLLHLWLRYQSLSVLRFDTINLKRARILIKSHILQSRV 339
Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
+ N +E + F + + L + + A +I+H + + K+
Sbjct: 340 LGCVDCNQDENILSWETFTRPRIIFGVPLEEMIGGERSVA-KIKHGVVIKIKM------- 391
Query: 411 EDAHALFSLYEDEGYILTEHGGK----FCVVLKESALPQDMLKSLFQASYLYWLERNAGI 466
L LY E + L + K V K A +L+S++Q YWL N
Sbjct: 392 -----LLKLYAKEKHFLVVNQQKTDFEVLVSFKVGATSMSVLRSVWQT---YWLHENWKS 443
Query: 467 VATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRI 518
+S + +L SL ++ +F +VGW T I +P +I
Sbjct: 444 FGNRSSD--SDYDQLAQSLLEMEVKFEDFMQQLDAVGWDTSQ-INLKVPRKI 492
>gi|46137815|ref|XP_390599.1| hypothetical protein FG10423.1 [Gibberella zeae PH-1]
Length = 425
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 120/242 (49%), Gaps = 4/242 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
+P G+P SV+SDYLNY + S+Q S I+ +LA +ALL +G+G + T A + VL
Sbjct: 70 MPVGYPHSVSSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVL 129
Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAA 241
KD I ++ I + FG + + K +R ADL + AF +E+ +P P+ +
Sbjct: 130 KDAISRVATIAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTG 189
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
A R+ + ++++ FA N AE+ AK +Q IG+ +G + ++
Sbjct: 190 EALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTLVVKYVEDH 249
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
V+ H++ N + + + LN RA+++F EYL +G+ +EV E
Sbjct: 250 NTVVFLMI-VLVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGKIMTPEEVATRES 308
Query: 362 LF 363
+
Sbjct: 309 IL 310
>gi|310789554|gb|EFQ25087.1| hypothetical protein GLRG_00231 [Glomerella graminicola M1.001]
Length = 484
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 171/361 (47%), Gaps = 18/361 (4%)
Query: 110 NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG 169
K+W + E+F LP+G+P SV+ DY Y ++ S+Q A I+G+++++A+ +G+G
Sbjct: 59 KKIWLQLLEIF----LPDGYPHSVSDDYAAYQIYDSIQAFAGSIAGMISSRAVWEGLGVG 114
Query: 170 KG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLT 227
A PT A + V+++ G + I ++ FG + K +RL +D+L + A L+ L+
Sbjct: 115 DSLASPTGAMLIQVIRESTGRFATITFAHLFGTSIEAECKAYRLASDILCDVAMVLDCLS 174
Query: 228 PAFPHHFVFIGAA-AGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
P FP F+ + SA+ + +++S FA N E+ AK +Q +
Sbjct: 175 PFFPRSIRFLVLCFSSLLYSASGVAGNASKSSLSGHFAKWNNLGELNAKDASQETAISLM 234
Query: 287 GIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
G++ G + + + +A S++ +H++ N K ++++ R+LN RA++ FS
Sbjct: 235 GMITGTFMVSLLTGPKATWVALISLLA-LHLFLNWKGVRAVKSRSLNRQRANIAFSALFS 293
Query: 347 SGQAPPVKEVNDEEPLFP-----AFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL-- 399
Q V++ E +F F + K + V E + + R++
Sbjct: 294 KDQVLTPVIVSEREFIFEERWGSVFRWNSGKVLGHCEFGVSLQEILQSLASGQERMKTFH 353
Query: 400 --GSKLSDVVN-NKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
+L++++N E+ + L+ + Y K VVLK+ P+ LK+ F
Sbjct: 354 LSTLELANLLNIYDEEEYVLWCEASETVYCDAASRVKVMVVLKDGVTPKSQLKAWFHGLL 413
Query: 457 L 457
L
Sbjct: 414 L 414
>gi|336273098|ref|XP_003351304.1| hypothetical protein SMAC_03608 [Sordaria macrospora k-hell]
gi|380092824|emb|CCC09577.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 498
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 78/250 (31%), Positives = 129/250 (51%), Gaps = 6/250 (2%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTA 176
+ + LP G+P SVT DYL Y + S+Q S I+ +LA +A+L +G+G + PT
Sbjct: 39 QALLHAFLPAGYPHSVTPDYLPYQTYDSLQAFFSSITSLLANRAVLEGLGVGDSSSSPTG 98
Query: 177 AAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
A I + D I ++ I+ ++ G + K +R ADLL ++A L++LTPA P +F
Sbjct: 99 ALILKIAGDTISRIATILFAHRMGTAIEPECKFYRFLADLLNDSAQLLDLLTPALP-YFP 157
Query: 236 FIGAAAGAG--RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
+G AG RS + ++++ A FA R N AE+ AK +Q V +GI++G +
Sbjct: 158 KLGVIVSAGVLRSLCGVAANASKASLSAHFALRGNLAELNAKEASQETVVSLLGILVG-S 216
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
L + A + +H+ N + +++ RTLN RA++V +L G+
Sbjct: 217 LVVKVVQDRGVVWALMVGLLGVHLLMNYWAVRAVRCRTLNRQRATVVVRGWLEWGRVLGP 276
Query: 354 KEVNDEEPLF 363
+EV E +
Sbjct: 277 EEVARMESIL 286
>gi|323455525|gb|EGB11393.1| hypothetical protein AURANDRAFT_11851, partial [Aureococcus
anophagefferens]
Length = 248
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/251 (31%), Positives = 127/251 (50%), Gaps = 18/251 (7%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP GFP +VT Y Y+ + S GVL+T+ LL A+G+G G A P AAA NW +
Sbjct: 2 LPAGFPTTVTPGYGTYAALQFAAYACSSACGVLSTRVLLAAVGVGDGAAAPLAAAANWAI 61
Query: 184 KDGIGYLSKIML-SNFGRHFDVNPKGWRL-------FADLLENAAFGLEMLTPAFPHHFV 235
KDG+G L + + + D P+ WRL A L+E AA PAFP +FV
Sbjct: 62 KDGLGMLGGVAFAATWSNSLDARPRQWRLRSSAALDLASLVELAAL------PAFPQYFV 115
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAAR---RNFAEVIAKGEAQGMVSKAIGIMLGI 292
I A ++ + L +++R+ + +AR N A++ AK +Q +V+ G+ LG+
Sbjct: 116 PIAGLANVAKNVSYLAASASRAAIHQSLSARSDASNLADLTAKTGSQTIVASLCGLALGV 175
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
A +N G+ +++ ++ H+ C S + + TL+ R + + + +G P
Sbjct: 176 AASNACGAEASDVWPAWAALSACHLGCTYASLKYVNTTTLDDARLAALVDAFRETGACPT 235
Query: 353 VKEVNDEEPLF 363
+V++ E LF
Sbjct: 236 PADVSERESLF 246
>gi|389639940|ref|XP_003717603.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
gi|351643422|gb|EHA51284.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
Length = 479
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 71/250 (28%), Positives = 130/250 (52%), Gaps = 5/250 (2%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PT 175
+ +F F LP GFP +VT DYL+Y ++ S+Q +S I+ +LA +A+L +G+G + PT
Sbjct: 43 KAIFYAF-LPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPT 101
Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAA-FGLEMLTPAFPHH 233
AA + +++D I L+ IM ++ G + K +R AD+ +AA ++
Sbjct: 102 AALVLKIIQDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLP 161
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
V + +G RS + ++++ A FA N AE+ AK +Q V +G++ G A
Sbjct: 162 KVSVLVTSGCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTA 221
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
+ H+ V+ +H+ N ++ +++ + TLN RA++VF E+L +G+
Sbjct: 222 VV-HMVEDKTAVWCWMVVLICMHLMMNYRAVRNVRMLTLNRQRATIVFREFLETGKVLTP 280
Query: 354 KEVNDEEPLF 363
+ + E +
Sbjct: 281 AQCSARESIL 290
>gi|145347876|ref|XP_001418386.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578615|gb|ABO96679.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 362
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 69/239 (28%), Positives = 122/239 (51%), Gaps = 21/239 (8%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--GA 172
E R +F LP GFP + ++DY+ + W+ + + + +++ Q+LL A+G+G GA
Sbjct: 3 ELRRIF----LPSGFPATTSADYVEWLRWQLLSLLFRDVLEIMSAQSLLVALGMGNTPGA 58
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
+P AA WVLKDG+G + ++ +FG + +D +PK W ++ LE+ A +E++TPA P
Sbjct: 59 LPLTAAAKWVLKDGVGSFATLIAGSFGGQRYDEDPKRWWGLSNTLEDVARAIELVTPAAP 118
Query: 232 HHFVFIGAAAGAGRSAAALIQAS-TRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
F+ + A+A R+AA + S F F N ++ AK E QG I +
Sbjct: 119 GLFLPLAASATFVRAAALTGRGSLLNGTFMQHFGRNNNLGDIRAKLEVQGRWLALIALPA 178
Query: 291 GIALANHIG-------------SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
GI + + + A ++ V +H++ ++ +S++ TLN +R
Sbjct: 179 GIQVFQLVSAAAADLAADGDEFGAFVAAFGAYGGVIGVHVFACWQAAKSLKFDTLNRFR 237
>gi|440465939|gb|ELQ35234.1| hypothetical protein OOU_Y34scaffold00720g28 [Magnaporthe oryzae
Y34]
gi|440485195|gb|ELQ65173.1| hypothetical protein OOW_P131scaffold00517g6 [Magnaporthe oryzae
P131]
Length = 431
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 71/248 (28%), Positives = 129/248 (52%), Gaps = 5/248 (2%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAA 177
+F F LP GFP +VT DYL+Y ++ S+Q +S I+ +LA +A+L +G+G + PTAA
Sbjct: 36 IFYAF-LPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAA 94
Query: 178 AINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAA-FGLEMLTPAFPHHFV 235
+ +++D I L+ IM ++ G + K +R AD+ +AA ++ V
Sbjct: 95 LVLKIIQDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLPKV 154
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
+ +G RS + ++++ A FA N AE+ AK +Q V +G++ G A+
Sbjct: 155 SVLVTSGCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTAVV 214
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
H+ V+ +H+ N ++ +++ + TLN RA++VF E+L +G+ +
Sbjct: 215 -HMVEDKTAVWCWMVVLICMHLMMNYRAVRNVRMLTLNRQRATIVFREFLETGKVLTPAQ 273
Query: 356 VNDEEPLF 363
+ E +
Sbjct: 274 CSARESIL 281
>gi|398015008|ref|XP_003860694.1| hypothetical protein, conserved [Leishmania donovani]
gi|322498916|emb|CBZ33989.1| hypothetical protein, conserved [Leishmania donovani]
Length = 675
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 139/272 (51%), Gaps = 10/272 (3%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
+ +LP+G+P SVT D L ++LW Q +A ++ L+T+A+L +G+G+ A T++ ++
Sbjct: 70 RILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADLTSSTLS 129
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP---HHFVF 236
W+++DG + I+ ++ + + K W+L AD + A LE+ P P F
Sbjct: 130 WMMQDGTRMIGSIVFASLIPQGLECRAKTWQLVADFTTDIANLLELCAPWLPGCQTIFRV 189
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ AA A ++ + + TR+ F FA R N A++ AK + V IG+ LG A+A
Sbjct: 190 VLVAASAIKALGGVCDSGTRASFTQHFALRNNAADIAAKAATRSSVGSFIGLALGTAVAY 249
Query: 297 HI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+ +S LA+F++ + ++ + + + ++LR LN R +Y L +
Sbjct: 250 SVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPRLEWCLEQYSLFKAKVLIAS 309
Query: 356 VNDEEPL--FPAFHFFKIKSANKSQ-LLVLSS 384
D PL P +SAN ++ LL+L S
Sbjct: 310 RKDSGPLSVMPILD-VSPRSANAAERLLILPS 340
>gi|339898153|ref|XP_001465441.2| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321399425|emb|CAM67862.2| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 675
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 139/272 (51%), Gaps = 10/272 (3%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
+ +LP+G+P SVT D L ++LW Q +A ++ L+T+A+L +G+G+ A T++ ++
Sbjct: 70 RILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADLTSSTLS 129
Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP---HHFVF 236
W+++DG + I+ ++ + + K W+L AD + A LE+ P P F
Sbjct: 130 WMMQDGTRMIGSIVFASLIPQGLECRAKTWQLVADFTTDIANLLELCAPWLPGCQTIFRV 189
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ AA A ++ + + TR+ F FA R N A++ AK + V IG+ LG A+A
Sbjct: 190 VLVAASAIKALGGVCDSGTRASFAQHFALRNNAADIAAKAATRSSVGSFIGLALGTAVAY 249
Query: 297 HI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+ +S LA+F++ + ++ + + + ++LR LN R +Y L +
Sbjct: 250 SVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPRLEWCLEQYSLFKAKVLIAS 309
Query: 356 VNDEEPL--FPAFHFFKIKSANKSQ-LLVLSS 384
D PL P +SAN ++ LL+L S
Sbjct: 310 RKDSGPLSVMPILD-VSPRSANAAERLLILPS 340
>gi|397563898|gb|EJK43986.1| hypothetical protein THAOC_37516 [Thalassiosira oceanica]
Length = 542
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 134/291 (46%), Gaps = 46/291 (15%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
FV LP +P+SV S Y ++ + +A + VL+TQALL A+G+G + A P AAA
Sbjct: 111 FVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAAA 170
Query: 179 INWVLKDGIGYLSKIML-------------------------SNFGR---------HFDV 204
+NW+LKD G L ++ +N GR D
Sbjct: 171 LNWLLKDFAGQLGGVLFVSQLGIDHEFWKRKINRLFRENNQGNNTGRPKKGNYQRGTADT 230
Query: 205 NPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAAGAGRSAAALIQASTRSCF---- 259
NPK WR+ A L + + LE+ TP P +F+ + A G++ L +++R+
Sbjct: 231 NPKRWRMVAALALDLSTLLEICTPLLGPQYFLPCASVANVGKNIGWLAASASRAAINQSL 290
Query: 260 ----YAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFA--LASFSVVT 313
++ N +V AK +Q +V+ +G LGI + + + ++ F V++
Sbjct: 291 SAGGSYSSSSGSNLGDVTAKSGSQAIVASLVGTALGIFFSKTLYADHGTVGVMSGFIVLS 350
Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFP 364
+H K+ ++ LR+L+ +R +V SE+ + P +V +E FP
Sbjct: 351 AVHQVATYKALMAVPLRSLDRHRLHIVLSEFSRTSSVPSPSQVATKEKFFP 401
>gi|154292218|ref|XP_001546685.1| hypothetical protein BC1G_14192 [Botryotinia fuckeliana B05.10]
Length = 449
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 165/337 (48%), Gaps = 53/337 (15%)
Query: 150 ASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPK 207
AS I+G+LA++A+L +G+G A T A + +L++ G +S I+ ++ G + K
Sbjct: 60 ASTIAGLLASRAVLQGLGVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPECK 119
Query: 208 GWRLFADLLENAAFGLEMLTPAFPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAAR 266
+RL AD+ +AA L+ ++PAFP VF+ +A+ +S + S+++ A FA
Sbjct: 120 KYRLMADIFNDAAMILDCISPAFPKIPRVFLLSASSVCKSLCGVAAGSSKASLSAHFAKM 179
Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQS 326
N AE+ AK +Q + +G+++G + + I S + +A ++++ IH+ N + +S
Sbjct: 180 GNLAELNAKDASQETLISLLGMLVGTFVVSKISSQVATWIALLALLS-IHLGTNYMAVRS 238
Query: 327 IELRTLNPYRASLVFSEYLLSGQ-------APPVKEVNDEEPLFPAFHFFKIKSANKSQL 379
+ +RTLN RA+LV S L S + P ++++ +E +F
Sbjct: 239 VTMRTLNRQRANLVISSLLSSPEHEKNKISLPSPRDISLQERIFE--------------- 283
Query: 380 LVLSSEAKDAAVEIEH-----RLQLGSKLSDVVNN-------------KEDAHALFSLYE 421
+D A+ H ++G L D++++ + +L S+Y+
Sbjct: 284 -------RDGAIRNTHGTILAYCKVGVSLQDLLSSIATPTTAGSYAEPESILTSLLSIYK 336
Query: 422 DEGYIL--TEHGGKFCVVLKESALPQDMLKSLFQASY 456
+GY++ + F V+LKE P+ L + F A +
Sbjct: 337 TQGYLMWYSRSRNTFLVILKEETAPKVQLDAWFHAIF 373
>gi|322707810|gb|EFY99388.1| DUF647 domain-containing protein [Metarhizium anisopliae ARSEF 23]
Length = 462
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 67/238 (28%), Positives = 123/238 (51%), Gaps = 4/238 (1%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
E + LP GFP SV++DYL Y + S+Q S I+ +LA +ALL +G+G +
Sbjct: 70 EWKRFLSDAFLPIGFPHSVSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSS 129
Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
T A + +L D ++ I+ ++ FG + + K +R AD+ ++AF LE+ +P F
Sbjct: 130 ATFAMLITILTDATSRIATIVFAHRFGLRIEPDAKRFRFLADIFNDSAFFLELYSPYFGD 189
Query: 233 HFVFIGAAAG-AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
+ + + G A R+ + ++++ FA N AE+ AK +Q IG+++G
Sbjct: 190 YGKILALSVGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLVVG 249
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
+ + ++V + H++ N +S+ + TLN RA+++F EY+ +G+
Sbjct: 250 TLVVKLVQDHQSVVYLMVALV-FAHLWMNYLGVRSVCMTTLNRQRATILFREYMTTGR 306
>gi|340516831|gb|EGR47078.1| predicted protein [Trichoderma reesei QM6a]
Length = 448
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 128/252 (50%), Gaps = 4/252 (1%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
E R+ LP GFP SV+ DYL+Y + S+Q S IS +LA +ALL +G+G +
Sbjct: 65 EWRKALFDSFLPVGFPHSVSKDYLSYQFFDSLQAFFSTISSLLANRALLQGLGVGDASSS 124
Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
T A + +LKD + ++ I+ ++ FG + + K +R ADL + AF LE+ +P
Sbjct: 125 ATFALLLTILKDAMSRIATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLGS 184
Query: 233 HFVFIGAAAG-AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
+ + G A R+ + ++++ FA + N AE+ AK +Q IG+++G
Sbjct: 185 WGKVVALSTGEALRALCGVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVG 244
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
+ + + L +V H+ N +S+ + T N RA+++F EYL SG+
Sbjct: 245 TLVVKTVENHSAVVLLMIVLVL-AHLGMNYLGVRSVCMNTFNRQRATILFEEYLKSGKVL 303
Query: 352 PVKEVNDEEPLF 363
++V + E +
Sbjct: 304 SPEQVANRESII 315
>gi|296473307|tpg|DAA15422.1| TPA: hypothetical protein LOC767862 [Bos taurus]
Length = 232
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 55/113 (48%), Positives = 71/113 (62%), Gaps = 2/113 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFPDSV+ DYL Y LW SVQ AS +SG LAT A+L IG+G A +AA W++
Sbjct: 72 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 131
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
KD G L +I+ + + G D N K WRLFAD+L + A LE++ P P F
Sbjct: 132 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFT 184
>gi|116004169|ref|NP_001070441.1| UPF0420 protein C16orf58 homolog [Bos taurus]
gi|61554841|gb|AAX46623.1| hypothetical protein FLJ13868 [Bos taurus]
Length = 232
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 55/113 (48%), Positives = 71/113 (62%), Gaps = 2/113 (1%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP+GFPDSV+ DYL Y LW SVQ AS +SG LAT A+L IG+G A +AA W++
Sbjct: 72 LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 131
Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
KD G L +I+ + + G D N K WRLFAD+L + A LE++ P P F
Sbjct: 132 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFT 184
>gi|302768148|ref|XP_002967494.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
gi|300165485|gb|EFJ32093.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
Length = 412
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/389 (24%), Positives = 175/389 (44%), Gaps = 24/389 (6%)
Query: 132 SVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK---GAIPTAAAINWVLKDGIG 188
SV+ DYL Y +W+ + I L T +LL A+GL A +AAI WV KDG+G
Sbjct: 21 SVSQDYLQYMVWQLPTNITGWICRALITSSLLQAVGLSTEPGSAAAASAAIKWVTKDGLG 80
Query: 189 YLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSA 247
+ ++ + FG FD +PK WR++A+++ + E+ TP P +F+ + + ++
Sbjct: 81 AVGRLFIGGRFGGVFDQDPKQWRMYAEVIGSFGGLFELATPLAPDYFLLLASLGHLTKAV 140
Query: 248 AALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH--IGSSMPFA 305
A ++ + FA N +V AK E + + IG+ LG+AL + + +S
Sbjct: 141 ARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVALLSTPVVATSYSQL 200
Query: 306 LASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPA 365
S+ ++ H++ ++ ++EL T+N R +++ E++ V E N +E L
Sbjct: 201 ALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHIQGRPLSGVSECNKKENLILP 260
Query: 366 FHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL--GSKLSDVVNNKEDAHALFSLYEDE 423
+ + L + + K +EIE +L K +V +K + L +
Sbjct: 261 YTMLMPQIRLGCSLREVITLGK-TPLEIEEAFELYNSEKYVLIVFSKSNMGILLKVLSQ- 318
Query: 424 GYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEI 483
++ + C V ++ +L+S++QA L+ S S D L+
Sbjct: 319 -FLSSCSCNIKCRVCQDGCENTTLLRSIYQAHVLF----------HSKSFDTFT---LKQ 364
Query: 484 SLDYVQREFNHVKSDSASVGWVTDGLIAR 512
SL +Q+ F + + GW L+ +
Sbjct: 365 SLTSMQKSFASFLDELVAAGWNPSKLVVK 393
>gi|395332415|gb|EJF64794.1| hypothetical protein DICSQDRAFT_80493 [Dichomitus squalens LYAD-421
SS1]
Length = 506
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/411 (26%), Positives = 182/411 (44%), Gaps = 40/411 (9%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAAIN 180
Q LP +P SV S Y ++ ++++ + I+ VL QALL ++G+ +G++ A A+
Sbjct: 118 QMFLPTNYPQSVHSSYASFHTLQALETTMATITSVLCNQALLTSVGMSAEGSVFGAVAVQ 177
Query: 181 WVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF---PHHFVF 236
W++KDG G ++K+ + +F +FD +PK + LF ++L GL++ T P +F+
Sbjct: 178 WIIKDGAGEVAKLFFIRHFSTYFDSHPKTFTLFGEVLGCLGSGLQIATVLIAPSPLNFLL 237
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
A + I ++T F F+ + N +V AK E+Q V++ G GI+L
Sbjct: 238 CAAGGNIFKLVGNAIWSTTHIKFIRYFSMQGNDGDVAAKDESQASVAQLAGYAAGISLLT 297
Query: 297 HIGSSMPFALASFSVVTWIH--MYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
+ + A F +H M + + EL TL R S + Y+ G K
Sbjct: 298 -FSHAPAYLYAIFFAAVPLHLTMTAYMMRVATFELLTLP--RISHLAQTYVNDGVVGSQK 354
Query: 355 EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH 414
+++ E F+K N + L L+ DA + SDV ++
Sbjct: 355 DLDREHATGLFGEFYK---NNGDRWLTLAPRVGDAL----------NTSSDV--DRSTWQ 399
Query: 415 ALFSLYEDEGYIL----TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATS 470
+++D+ Y+L + G V+ A DML+S+ A+ + L S
Sbjct: 400 VCSQVFQDDRYLLLPCDSPRGPLISVLFHPDATTDDMLQSILHAASVRSL--------LS 451
Query: 471 TSADC-APGGRLEISLDYV--QREFNHVKSDSASVGWVTDGLIARPLPNRI 518
A C P R +S V Q++F+ K GW TD L L +R+
Sbjct: 452 RDASCPTPSLRTVLSETRVQAQQDFSAFKQALHERGWRTDELCFADLGHRV 502
>gi|402074762|gb|EJT70271.1| hypothetical protein GGTG_12444 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 438
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 132/252 (52%), Gaps = 9/252 (3%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PT 175
+ +F F LP GFP SVT DYL Y + S+Q +S I+ +LA +A+L +G+G + PT
Sbjct: 42 KAIFYAF-LPAGFPHSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGNASQSPT 100
Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPA---FP 231
AA + +L+D L+ I+ ++ G + K +R ADL +A+ L++L P P
Sbjct: 101 AALVLQILQDTFSRLATILFAHRMGTAIEPECKTYRFMADLFNDASQLLDLLVPVLPFLP 160
Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
V + AA RS + ++++ A FA N AE+ AK +Q V +G+++G
Sbjct: 161 KVSVMVSAA--ILRSLCGVAAGASKASLSAHFAKMGNLAELNAKEASQETVVSLVGMLVG 218
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
A+ H+ V+ IH+ N ++ + +++ TLN RA++VF E+L +G+
Sbjct: 219 TAVV-HMVEDKRAVWCWMIVLIAIHLMMNYRAVRCVKMLTLNRQRATIVFREFLETGKVL 277
Query: 352 PVKEVNDEEPLF 363
+V E +
Sbjct: 278 TPGQVAARESIL 289
>gi|302753580|ref|XP_002960214.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
gi|300171153|gb|EFJ37753.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
Length = 381
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/399 (24%), Positives = 175/399 (43%), Gaps = 56/399 (14%)
Query: 132 SVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK---GAIPTAAAINWVLKDGIG 188
SV+ DYL Y LW+ + I L T +LL A+GL A +AAI WV KDG+G
Sbjct: 2 SVSQDYLQYMLWQLPTNITGWICRALITSSLLQAVGLSTEPGSAAAASAAIKWVTKDGLG 61
Query: 189 YLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSA 247
+ ++ + FG FD +PK WRL+A+++ + E+ TP P +F+ + + ++
Sbjct: 62 AVGRLFIGGRFGGVFDEDPKQWRLYAEVIGSFGGLFELATPLAPDYFLLLASLGHLTKAV 121
Query: 248 AALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH--IGSSMPFA 305
A ++ + FA N +V AK E + + IG+ LG+ L + + +S
Sbjct: 122 ARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVVLLSTPVVATSYSQL 181
Query: 306 LASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPA 365
S+ ++ H++ ++ ++EL T+N R +++ E++ Q P+ V +
Sbjct: 182 ALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHI---QGRPLSGVGE------- 231
Query: 366 FHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVN---NKEDAHALFSLYED 422
NK + L+L + +++LG L +V+ + F LY
Sbjct: 232 --------CNKKENLILPYTM------LMPQIRLGCSLREVITLGKTPLEIEEAFELYNS 277
Query: 423 EGYIL---------TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSA 473
E Y+L KF V ++ +L+S++QA L+ S S
Sbjct: 278 EKYVLIVLSQFLSSCSCNIKFRVC-QDGCENTTLLRSIYQAHVLF----------HSKSF 326
Query: 474 DCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIAR 512
D L+ SL +Q+ F + + GW L+ +
Sbjct: 327 DTF---TLKQSLTSMQKSFASFLDELVAAGWNPSKLVVK 362
>gi|308804794|ref|XP_003079709.1| unnamed protein product [Ostreococcus tauri]
gi|116058166|emb|CAL53355.1| unnamed protein product [Ostreococcus tauri]
Length = 499
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 136/270 (50%), Gaps = 26/270 (9%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--GA 172
E R +F LP GFP +V++DY ++ W + + +++ Q+LL A+G+G GA
Sbjct: 115 ELRAVF----LPSGFPQTVSADYADWLRWHLTSLLFRDVIEIMSAQSLLVALGVGSTPGA 170
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
+P AA WVLKDG+G + ++ FG R +D +PK W ++ LE+ A +E++TPA P
Sbjct: 171 LPLTAAAKWVLKDGVGSFATLLAGAFGGRWYDEDPKRWWAVSNALEDVARAIELVTPAAP 230
Query: 232 HHFVFIGAAAGAGRSAAALIQAS-TRSCFYAGFAARRNFAEVIAKGEAQG----MVSKAI 286
F+ + A+A R+AA + S F F N ++ AK E QG +V+ +
Sbjct: 231 GLFLPLAASATFVRAAALTGRGSLMNGTFMQHFGRNNNLGDIRAKLEVQGRWLALVALPV 290
Query: 287 GIMLGIALANHIGSSM----------PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
GI + L N S++ A + +V H++ ++ +S++ +LN +R
Sbjct: 291 GIQV-FQLVNAQASALHAEGDELGAAAAAFGGYGLVIGAHIFSCWQAARSLKFDSLNGFR 349
Query: 337 ASLVFSEYLLSG---QAPPVKEVNDEEPLF 363
+ Y+ S + P E+ + E ++
Sbjct: 350 LLRLAEAYVASDGKIELPNCVEIGEIEGVY 379
>gi|402582701|gb|EJW76646.1| hypothetical protein WUBG_12446, partial [Wuchereria bancrofti]
Length = 184
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 89/143 (62%), Gaps = 2/143 (1%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPT 175
+ L +P G+P SV+ DY+NY +W ++Q AS +S L+T+A+L +G+G K A
Sbjct: 36 KLLLRDIFMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALSTEAILRGVGVGNKAASTM 95
Query: 176 AAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
AAA+ W+LKDGIG L++I+ + + + D + K WRL AD + A+ L+++TP FP+ F
Sbjct: 96 AAAVAWLLKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAYCLDLITPIFPNLF 155
Query: 235 VFIGAAAGAGRSAAALIQASTRS 257
+ I + R+ + ++TR+
Sbjct: 156 MPIICLSSMIRAVVGVAGSATRT 178
>gi|322700393|gb|EFY92148.1| DUF647 domain-containing protein [Metarhizium acridum CQMa 102]
Length = 466
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/237 (27%), Positives = 119/237 (50%), Gaps = 4/237 (1%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
E + LP GFP SV++DYL Y + S+Q S I+ +LA +ALL +G+G +
Sbjct: 70 EWKRFLSDAFLPIGFPHSVSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSS 129
Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
T A + +L D ++ I+ ++ FG + + K +R AD+ + AF LE+ +P F
Sbjct: 130 ATFAMLITILTDATSRIATIVFAHRFGLRIEPDAKRFRFLADIFNDTAFFLELYSPYFGD 189
Query: 233 HFVFIGAAAG-AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
+ + + G A R+ + ++++ FA N AE+ AK +Q IG+++G
Sbjct: 190 YGKILALSVGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVG 249
Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
+ + ++V H++ N +S+ + T N RA+++F EY+ +G
Sbjct: 250 TLVVKLVQDHQSVVYLMVTLVL-AHLWMNYLGVRSVCMTTFNRQRATILFREYMTTG 305
>gi|310800465|gb|EFQ35358.1| hypothetical protein GLRG_10502 [Glomerella graminicola M1.001]
Length = 416
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 77/252 (30%), Positives = 131/252 (51%), Gaps = 7/252 (2%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPT 175
R++ LP G+PDSVT DYL Y + S+Q S I+G+L+ +A+L +G+G + T
Sbjct: 2 RQIIYDAFLPIGYPDSVTPDYLGYQTFDSLQAFFSTITGLLSNRAILQGLGVGDPNSSAT 61
Query: 176 AAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHH 233
A + +LKDGI ++ I+ + FG + K +R AD+ ++AF L + +P F P
Sbjct: 62 YALLLTILKDGISRVATIVFAYRFGLVIEPECKRYRFLADIFNDSAFFLNLFSPYFDPWT 121
Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
V A A + + ++++ FA R N +E+ AK +Q +G+++G
Sbjct: 122 KVGALVLAEALGAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAVGLVGLLVGSV 181
Query: 294 LANHI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA-P 351
+ + G FAL +F V ++H+ N + ++L TLN RA+++F Y + Q
Sbjct: 182 VVRFVEGREAVFALMTFLV--FVHLGMNYLGVRCVQLATLNQQRATILFEHYARTKQVLT 239
Query: 352 PVKEVNDEEPLF 363
P + + E LF
Sbjct: 240 PAQVASRENILF 251
>gi|384249834|gb|EIE23315.1| hypothetical protein COCSUDRAFT_15701 [Coccomyxa subellipsoidea
C-169]
Length = 265
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 75/238 (31%), Positives = 118/238 (49%), Gaps = 12/238 (5%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG---AIPTAAAI 179
F LP G+P++VT DYL + LW + +S LAT +LL A+G+G A AAAI
Sbjct: 8 FFLPRGYPETVTPDYLRFQLWAIPAHITGWMSTSLATSSLLKAVGVGNSAVGATAAAAAI 67
Query: 180 NWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
W+ KDGIG ++++ G FD +PK WR+ A+ LE+ T P +F+ +
Sbjct: 68 KWITKDGIGAAGRVLVGGRLGNVFDEDPKRWRMVAEAFLTVGLALEIATAFSPANFIVL- 126
Query: 239 AAAGAGRSAAALIQASTRSCF---YAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
AGAG + A+ + T CF FAA N +V AK E + ++ +G+ + L
Sbjct: 127 --AGAGNLSRAVGRGMTNPCFRIIQTHFAAASNVGDVAAKEEVWEVSAQLLGLAGSVGLL 184
Query: 296 NHI-GSSMPF-ALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
I + P L ++ V +H KS ++ ++N RA L+ + ++ P
Sbjct: 185 KAIEATGKPENVLGVWAAVQVLHSLLRYKSLAVLQFPSVNQKRACLLAAAHVAGNPLP 242
>gi|255074355|ref|XP_002500852.1| predicted protein [Micromonas sp. RCC299]
gi|226516115|gb|ACO62110.1| predicted protein [Micromonas sp. RCC299]
Length = 561
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 73/231 (31%), Positives = 114/231 (49%), Gaps = 17/231 (7%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--GAIPTAAAINWV 182
LP G+PD+V+ DY + W + + V+ Q+LL A+G+G GA+P AA WV
Sbjct: 127 LPAGYPDTVSPDYAPFIRWHLGSLMFRNVLEVITAQSLLVALGMGSTPGALPLTAATKWV 186
Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
LKDGIG L+ + + G+ D +PK W + ++ E+ A +E++TPA P F+ + A A
Sbjct: 187 LKDGIGSLATLAAGSLGGQKCDEDPKRWWMVSNAFEDVARVIELVTPAAPALFLPLAATA 246
Query: 242 GAGRSAAALIQAS-TRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS 300
R+AA + S FA N A+V A+ E QG + + +GI + +
Sbjct: 247 TFVRTAALTGRGSLVNGSLMQHFARNENTADVRARLEVQGRWLALVALPVGIGIFRTVSQ 306
Query: 301 S-----------MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
S + A+A + V H+ C + + + TLN RA LV
Sbjct: 307 SFASDPEEPARDVAIAVALYGGVVCAHLVCIWNAAKVLRFETLN--RARLV 355
>gi|346318934|gb|EGX88536.1| DUF647 domain-containing protein [Cordyceps militaris CM01]
Length = 467
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/253 (30%), Positives = 126/253 (49%), Gaps = 12/253 (4%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
E +++ LP GFP SV+SDYL Y + S+Q S I+ +LA +ALL +G+G +
Sbjct: 66 EWKKILSDSFLPVGFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSS 125
Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-P 231
T A + +L+D ++ I ++ FG + + K +R ADL ++AF LE+ +P P
Sbjct: 126 ATFALLLTILRDATSRVATIAFAHRFGLRIEPDAKRYRFLADLFNDSAFFLELYSPYLSP 185
Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
V + A R+ + ++++ FA N AE+ AK +Q IG+++G
Sbjct: 186 WGKVAALSIGEALRALCGVSAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVG 245
Query: 292 IALA----NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
+ +H L F H++ N S+E+ TLN RA+++F EYL +
Sbjct: 246 TVVVKVVEDHRSVVFLMILLVFG-----HLWMNYLGVCSVEMTTLNRQRATILFQEYLQT 300
Query: 348 GQAPPVKEVNDEE 360
G+ EV+ E
Sbjct: 301 GKVLSPAEVSKRE 313
>gi|429856893|gb|ELA31783.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 500
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 80/308 (25%), Positives = 151/308 (49%), Gaps = 24/308 (7%)
Query: 69 EYDAVWEVKGSKRTKLIP----------DFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
EYD G+ +TK I D A V + S+ L ++ K W + +
Sbjct: 26 EYDRA----GNLKTKFIESAPRAGPARVDILSCAMHVINGSSVQLPTIR--KKPWRKVLD 79
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
+F LP+G+P SVT DY Y ++ S Q A I+G+++++A+ +G+G A P A
Sbjct: 80 IF----LPDGYPHSVTDDYAAYQIYDSFQAFAGTIAGMISSRAVWEGMGVGDSMASPIGA 135
Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
+ V+++ +G L I ++ G + K +R+ +D+L + AF L+ L+P FP F
Sbjct: 136 MMIQVIRESMGRLGTIAFAHLMGTSIEAECKAYRMASDVLCSIAFILDCLSPLFPRSIRF 195
Query: 237 IGAAAGAGRSAAALIQA-STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
+ + AA+ + +++S FA N E+ AK +Q +G++ G +
Sbjct: 196 LILCCSSILFAASGVAGNASKSSLSGHFAKWNNLGELNAKDASQETAISLMGMLTGTLVV 255
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+ + + F ++ I+++ N K ++++ R+LN RA++ FS L Q +
Sbjct: 256 SCLTTPTTTWTTLFILIA-IYLFLNWKGVRAVKSRSLNRQRANIAFSALLSKDQVLTPGK 314
Query: 356 VNDEEPLF 363
++++E +F
Sbjct: 315 ISEKELIF 322
>gi|224013726|ref|XP_002296527.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220968879|gb|EED87223.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 719
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 106/463 (22%), Positives = 180/463 (38%), Gaps = 124/463 (26%)
Query: 94 VVASASNASLSSLLSVNKLWDECREL---------FVQFMLPEGFPDSVTSDYLNYSLWR 144
V + +S + S L ++ CR + LP +P SV Y Y+ +
Sbjct: 204 VPSKSSGGTQSPLSTIRSFISSCRTKDMINIQNLPLIHHFLPANYPQSVCPSYATYASYC 263
Query: 145 SVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLS------- 196
+ +A + VL+TQALL A+G+G + A P AAA+NWV+KDG+G L ++ +
Sbjct: 264 FLGSIAGSSAMVLSTQALLVAVGVGTQSAAPMAAALNWVMKDGVGQLGGVVFASQLGKGG 323
Query: 197 ----------------------------NFGR-HFDVNPKGWRLFADLLENAAFGLEMLT 227
NF R D NPK WR+ A L + + LE+ T
Sbjct: 324 VDVDYWKVKLNKLMGSSSSKSTETSARGNFQRGTADSNPKRWRMVAALALDLSTLLEICT 383
Query: 228 PAF-PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF----------------------- 263
P P F+ + A G++ L +++R+ +
Sbjct: 384 PWMGPQWFLPCASIANIGKNVGFLAASASRAAVHQSLCTGGSFLPPVPDNTTEKDESYDN 443
Query: 264 --------AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFA--LASFSVVT 313
++ N +V AK +Q +V+ +G LGI L+ S A LA F +++
Sbjct: 444 PKKGKNVTSSSNNLGDVTAKSGSQAIVASLLGTALGIWLSRTFCSDYGTAGILAGFVILS 503
Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYL------LSGQAPPVKEVNDEEPLFPAFH 367
+H C ++ +++ LR+++ +R ++ + Y+ + +V +EE P
Sbjct: 504 AVHQVCTYRALKAVPLRSVDRHRLHIILNAYITADGRSVQSMVLTPSQVAEEESFLPMM- 562
Query: 368 FFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA-HALFSLYED---- 422
A D +V H L +G L D+ + D AL +D
Sbjct: 563 ------------------APDNSV---HWLSVGDSLIDICPSGLDELEALLIRRDDISQY 601
Query: 423 EGYILTEH-----------GGKFCVVLKESALPQDMLKSLFQA 454
E YI+ H G + + A D+L+ +F A
Sbjct: 602 EKYIMKVHLADTNATTAAVDGIVQLTFLDGATDNDLLQGMFHA 644
>gi|224094527|ref|XP_002310175.1| predicted protein [Populus trichocarpa]
gi|222853078|gb|EEE90625.1| predicted protein [Populus trichocarpa]
Length = 333
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 70/233 (30%), Positives = 114/233 (48%), Gaps = 15/233 (6%)
Query: 138 LNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSK-IML 195
+ Y W+ + V S VLATQA+ AIG+G ++P+AAA+NWVLKDG+G LS+ I
Sbjct: 1 MGYVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPSAAALNWVLKDGLGRLSRCIYT 60
Query: 196 SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQAST 255
++ FD N K R +L + + G+E+LTP FP +F+ + A + + +T
Sbjct: 61 ASLASAFDTNLKRVRFTTSVLFSLSIGIELLTPTFPQYFLLLATLANIAKQISLACYLAT 120
Query: 256 RSCFYAGFAARRNFAEVIAKGEAQGMVSKAIG----IMLGIALANHIGSSMPFALASFS- 310
S + FA N EV AK + Q + +G +L + N+ S+ F SF
Sbjct: 121 GSAVHRSFAIADNIGEVSAKAQIQTVSFDNLGLMLAALLNMLFKNNQRFSLSFPTMSFCF 180
Query: 311 --------VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+ + I ++ + Q + L+TL R ++ + ++ G P E
Sbjct: 181 CLPFFVYPIFSAIDLFGIYQGLQHVHLQTLTKDRLEIILNSWIDFGHVPSPAE 233
>gi|303278700|ref|XP_003058643.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459803|gb|EEH57098.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 679
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/303 (27%), Positives = 124/303 (40%), Gaps = 71/303 (23%)
Query: 131 DSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGY 189
++VT DY Y+ +R Q VAS V ATQ +L A+GLG +P AAA+NWVLKDG+G
Sbjct: 179 NAVTPDYWAYAKFRFAQRVASSCITVFATQQMLQAVGLGATRRLPAAAAVNWVLKDGLGR 238
Query: 190 LSKI-MLSNFGRHF-------DVNPKGWRLFADLLEN----------------AAFGLEM 225
L K+ + +NFGR F + P+ WR L AF L +
Sbjct: 239 LGKLSVATNFGREFGAFYTLVPIRPR-WRGERRSLRTFPGGSLAHHTVSIPALGAFQLHL 297
Query: 226 L----TPAF-----------------------------------PHHFVFIGAAAGAGRS 246
TP F P F+ + A G+S
Sbjct: 298 TPFDSTPTFACMKRPSDSDVKRFRFTSSVVYDCASLIEMITPFYPSKFLLLATIANVGKS 357
Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS------ 300
R+ FA N AE+ AK AQ +++ IG+ L + G
Sbjct: 358 VGITTANVVRAPIQMSFALEENLAEIAAKTSAQQVLADNIGLALAVVATGWTGKIANPRL 417
Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
+ L +F+ + + +YC + +S+ LRT+N R ++ + ++ +G P V E
Sbjct: 418 RLAVPLLAFAPLASVDLYCIYRELKSVRLRTINKERGEIIAAGFVETGTIPSAVSVAAAE 477
Query: 361 PLF 363
LF
Sbjct: 478 RLF 480
>gi|378729487|gb|EHY55946.1| hypothetical protein HMPREF1120_04055 [Exophiala dermatitidis
NIH/UT8656]
Length = 485
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 73/268 (27%), Positives = 141/268 (52%), Gaps = 15/268 (5%)
Query: 102 SLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQA 161
S SS + + ++C F LP G+P +VT DY Y ++ S+Q AS I+G+LA++A
Sbjct: 40 SYSSPRTGTSILNKCMSPF----LPAGYPSTVTPDYTPYQIYDSLQAFASTIAGLLASRA 95
Query: 162 LLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENA 219
+ +G+G + A + ++ ++ +G ++ I+ ++ F + + K +R FAD++ +
Sbjct: 96 VFVGMGVGSEEASVLTTMLLYIAQETVGRVATILFAHQFSQRIEAEVKFYRFFADIVNDV 155
Query: 220 AFGLEMLTPAFP-HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEA 278
AF L+ ++PA P V + A R+ + S+++ + FA N E+ AK +
Sbjct: 156 AFVLDCMSPAMPILGRVVTLCLSNACRAVCGVAGGSSKAILSSHFAKAGNIGELNAKDGS 215
Query: 279 QGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT---WIHMYCNLKSYQSIELRTLNPY 335
Q V +G+ +G + + + + LA++S + +H++ N K+ +S+ L + N
Sbjct: 216 QETVISLLGMWVGGVVVSKVHGT----LATWSWLVPLLALHLWANWKAVRSVRLNSFNTN 271
Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLF 363
RA LVF L G+ +++V EE +
Sbjct: 272 RA-LVFFRAALEGRTLDMEQVAQEETVL 298
>gi|396461797|ref|XP_003835510.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
gi|312212061|emb|CBX92145.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
Length = 543
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 146/308 (47%), Gaps = 39/308 (12%)
Query: 163 LYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAA 220
L +G+G A PTAA + VL+D +G + I+ ++ G F+ K +R AD+L +
Sbjct: 176 LAGVGVGDANASPTAALLLSVLQDSMGRTATILFADRLGTAFEPECKMYRFTADILNDTG 235
Query: 221 FGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
LE L+PAFP V + + + RS + S ++ A FA + N EV AK +Q
Sbjct: 236 MVLECLSPAFPKPIRVCVLSCSSVLRSLCGVCAGSAKASLSAHFARKGNLGEVNAKDSSQ 295
Query: 280 GMVSKAIGIMLGIALANHIGSSMPFALASFSVVTW---IHMYCNLKSYQSIELRTLNPYR 336
V +G++ G + + I S M A++S + IH+ N + +++ +R LN R
Sbjct: 296 ETVISLLGMLAGSVVVSWISSPM----ATWSTLIGLLAIHLATNYAAVRAVSMRCLNRQR 351
Query: 337 ASLVFSEYLLSGQAPPVKEVNDEEPLFP-----------AFHFFKIKSANKSQLLVLS-S 384
A+++FS L GQ EV+ E +F A + +I + ++ L +
Sbjct: 352 ANILFSRLLEHGQLLSPLEVSRRERVFEHDGVLRWSDDMAIGYCRIGVSLQTLLSHMGLR 411
Query: 385 EAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHGGKFCVVLKESA 442
A+ ++ + G+++S+ L ++ DE YIL G F +VLKE
Sbjct: 412 NARTGSLHLR-----GAEISE----------LLHVFVDEAYILWPASSAGHFLIVLKEGC 456
Query: 443 LPQDMLKS 450
P D LK+
Sbjct: 457 TPMDQLKA 464
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 29/39 (74%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
LP G+P SVT DY+ Y ++ S+Q +S I+ +LA++A+L
Sbjct: 58 LPAGYPHSVTEDYIQYQIYDSLQAFSSSIASMLASRAVL 96
>gi|315056527|ref|XP_003177638.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311339484|gb|EFQ98686.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 495
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 171/355 (48%), Gaps = 25/355 (7%)
Query: 120 FVQFMLPEGFPDSVTSDYL-NYSLWRSVQG--VASQISGVLAT--------QALLYAIGL 168
+ LP G+P SVT DY+ + SL+R + + S ISG+ ++ + ++G+
Sbjct: 58 ILNVFLPVGYPHSVTDDYIESSSLFRDYEKKELTSLISGLTSSILKLNCWSPVIQSSVGV 117
Query: 169 GKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEML 226
G A PTAA + VL++ +G ++ I+ ++ G + K +RL ADLL ++A L+ +
Sbjct: 118 GDATASPTAALLLNVLQESMGRIATILFAHRLGTSLEPECKMYRLAADLLNDSAIVLDCM 177
Query: 227 TPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKA 285
+PAFP V + + + R+ + S+++ A FA N E+ AK +Q V
Sbjct: 178 SPAFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISL 237
Query: 286 IGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
G+++G + ++I S + +A ++ +H+ N + +++++ TLN RA++VFS
Sbjct: 238 FGMLVGSVVVSYISSPLATWIA-LILLLIVHLGTNHAAVRAVKMTTLNRQRANIVFSYLF 296
Query: 346 LSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS------SEAKDAAVEIEHRLQL 399
+ + ++EE +F + K+ L + ++ Q
Sbjct: 297 EDDRVLTPAQASEEERIFETDGVLRWKAKATGTLGICQIGVSLEQLLLLLPEQVNTNAQG 356
Query: 400 GSKLSD--VVNNKEDAHALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLKS 450
S+ D +V AL L+++E YIL + + +VLK A PQ LK+
Sbjct: 357 SSQKPDTTMVGASTGLSALLELFKEEEYILYFSPAARRGAIVLKAGATPQAQLKA 411
>gi|307111734|gb|EFN59968.1| hypothetical protein CHLNCDRAFT_133075 [Chlorella variabilis]
Length = 434
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 158/364 (43%), Gaps = 75/364 (20%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIP---TAAAI 179
F LP G+P SVT DYL Y L + +S LAT +++ A+G+ G T+AA+
Sbjct: 38 FFLPTGWPHSVTPDYLRYQLASVPAHITGWMSHSLATSSMITALGVDAGPATIAATSAAV 97
Query: 180 NWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH----- 233
W+ KDG+G ++++ FD +PK WR+ A+ + +E+ T +P
Sbjct: 98 KWLTKDGLGAAGRLIVGGRLSSVFDEDPKRWRMVAEGVTTVGLAMEIATQIYPGGLRSKF 157
Query: 234 ------FVFIGAAAGAGRSAAALIQASTRSCF---YAGFAARRNFAEVIAKGEAQGMVSK 284
+ + A AG G ++ + CF F+A N +V AK E + ++
Sbjct: 158 RVVSVVWSYFIALAGTGTLCKSMAKGMGTPCFRIIQTHFSATNNVGDVAAKEEVWEVAAQ 217
Query: 285 AIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELR----TLNPYRASLV 340
+G+ + SVV + Q++E T+ P RA+L+
Sbjct: 218 LLGL-------------------AASVV----------ALQALEAAGSPDTVVPKRAALM 248
Query: 341 FSEYLLSGQAPPVKEVNDEEPLF-------PAFHFFKIKSANKSQLLVLSSEAKDAAVEI 393
S ++ +G P V++VN E + P+ H+ Q L
Sbjct: 249 ASSHVRTGLVPSVEQVNLSEQVLLPPTMCQPSVHY----GCTLDQALGSGGGGFGGGAAS 304
Query: 394 EHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQ 453
R Q +L+++V LY+ E Y+LT H G V++ E A D+L++++Q
Sbjct: 305 GGRGQ---QLAELVR----------LYDGEQYMLTWHRGAAWVLVLEGAQRADLLRAMWQ 351
Query: 454 ASYL 457
A++L
Sbjct: 352 AAWL 355
>gi|392585079|gb|EIW74420.1| DUF647-domain-containing protein [Coniophora puteana RWD-64-598
SS2]
Length = 515
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 168/384 (43%), Gaps = 56/384 (14%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
D R + V LP G+P SVT DYL Y + ++Q S ++G++A++ L +G+G A
Sbjct: 53 DRIRRVLVDLFLPAGYPASVTPDYLPYHAYNALQAFFSSLAGLIASRGALQGVGVGDASA 112
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTP--- 228
T+A + VL+D G ++ I + FG K +RL ADL+ +A+ L+ L P
Sbjct: 113 TATSALLLTVLQDVFGRITTISAAYVFGTSLYPETKTFRLVADLVNDASIVLDTLCPRLS 172
Query: 229 -AFPHHFV-------------FIGAAAGAGRSA-----AALIQASTRSCFYAGFAARRNF 269
+F H F+ G AG ++A A + AST + A A +
Sbjct: 173 LSFLHTFLLAISLCASAALRALCGLVAGGSKAALTNHFACTVPASTAAGGEAVVAG--DV 230
Query: 270 AEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIEL 329
E+ AK ++ V G++LG L + S L ++ H+ N + +S+ L
Sbjct: 231 GELNAKDASRETVVGLFGMLLGTLLVPRLTSVRATWLTLVFLIAG-HLLANYLAVRSVSL 289
Query: 330 RTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFH----------FFKIKSANKSQL 379
RTLN RA ++++ + A ++E D + + P+ + + + ++
Sbjct: 290 RTLNRQRAGILWTAW-----AARMREREDADDVSPSISSLAKAGGQLALTPAEVSRRERI 344
Query: 380 LVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSL-------YEDEGYIL----- 427
L L +DA +LG+ +S VV + +L + E Y+L
Sbjct: 345 LALPGTLRDARGHSIGYCELGASVSRVVARGGAENVRRALQSSSDAKFAKERYMLFALPR 404
Query: 428 --TEHGGKFCVVLKESALPQDMLK 449
E + V KE A P+D LK
Sbjct: 405 RRVESKVELTVCFKEGATPRDQLK 428
>gi|444725757|gb|ELW66311.1| hypothetical protein TREES_T100000484 [Tupaia chinensis]
Length = 406
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 110/224 (49%), Gaps = 11/224 (4%)
Query: 144 RSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFGRHF 202
R AS +SG LATQA+L IG+G A +AA W++KD G L +I+ + +
Sbjct: 46 RPFAAFASSLSGSLATQAVLLGIGVGNARASVSAATATWLVKDSTGMLGRIVFAWW---- 101
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAG 262
KG LFAD+L + A LE++ P +P F + + ++ +TR+
Sbjct: 102 ----KG--LFADILNDVAMFLEIMAPIYPIFFTMTICTSNLAKCVVSVAGGATRAALTMH 155
Query: 263 FAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLK 322
A R N A+V AK +Q + G+++ + + + +L F ++T +H+Y N +
Sbjct: 156 QARRNNMADVAAKDGSQETLVNLAGLLVSLLMLPLVSDFPSVSLGCFFLLTALHIYANYR 215
Query: 323 SYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
+ +++ + TLN R LV +L +G N EPL+ F
Sbjct: 216 AVRALVMETLNEGRLRLVLKHFLRTGNVLDPTSANQMEPLWTGF 259
>gi|449542984|gb|EMD33961.1| hypothetical protein CERSUDRAFT_117486 [Ceriporiopsis subvermispora
B]
Length = 505
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 108/409 (26%), Positives = 175/409 (42%), Gaps = 59/409 (14%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAA 178
F Q LP +P SV Y+ + + + V+G + VL QALL ++G+ +G+I A A
Sbjct: 114 FRQMFLPTNYPQSVHRSYMPFHVLQFVEGTLGTLVSVLCNQALLTSVGVSAEGSIFGAVA 173
Query: 179 INWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEM---LTPAFPHHF 234
+ W++KDG G ++K+ + F +FD +PK + L ++L GL+M L P F
Sbjct: 174 VQWIIKDGAGEVAKLFFIRKFSPYFDSHPKTFTLSGEVLVALGSGLQMATLLVNPTPLSF 233
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+ A A + I +T F F+ + N +V AK E+Q V++ G GI L
Sbjct: 234 LICAAGGNAFKLIGYAIWFTTHIKFIRYFSQQGNTGDVAAKDESQASVAQLAGYAAGIGL 293
Query: 295 ------ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
+++ + A+ + V T + M C + EL T+ R SL+ Y G
Sbjct: 294 LTLSHDPSYLYTIFALAVPTHLVATTLMMRCA-----TFELLTIP--RLSLLAETYSRKG 346
Query: 349 QAPPVK--EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDV 406
P+ E +F F+ K Q ++S L ++++V
Sbjct: 347 DVMPLSRLEAKRATGMFGEFY-------GKGQGKLVS---------------LAPRVAEV 384
Query: 407 VNNKEDAHALFSL----YEDEGYIL----TEHGGKFCVVLKESALPQDMLKSLFQASYLY 458
V ++E A ++L DE Y+L V L+ A DML+++ A+ L
Sbjct: 385 VKDEEGERARWALCTEVCRDERYLLYPPAAPQANPASVFLRPDASSDDMLRAVLHAARLR 444
Query: 459 WLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTD 507
L A + T A L+ S ++ Q F K GW TD
Sbjct: 445 SL--LADDIVTLREA-------LKDSHEWTQEHFMTFKGALDDKGWRTD 484
>gi|302837684|ref|XP_002950401.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
nagariensis]
gi|300264406|gb|EFJ48602.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
nagariensis]
Length = 500
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 117/243 (48%), Gaps = 19/243 (7%)
Query: 139 NYSLWRSVQGVASQISGVLATQALLYAIGLGKGA-IPTAAAINWVLKDGIGYLSKIMLSN 197
+ W ++Q ++S + GVL +QA+L +G+G+ A P AA + ++D G L I+ S
Sbjct: 3 GFQTWDTIQALSSYVRGVLTSQAILRGVGVGQEASTPLAAVFTFFVRDLAGMLGGILFSY 62
Query: 198 F-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-HFVFIGAAAGAGRSAAALIQAST 255
G FD K WRLFAD+ + +E+ +P FP F+ I RS + +T
Sbjct: 63 LEGSSFDACAKQWRLFADITNDLGMTVELASPLFPRVMFLPIACLGSVARSLTGVAGGAT 122
Query: 256 RSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM-----PFALASFS 310
R+ FA R N A+V AK ++Q + +G++ G+A+ + + P A F
Sbjct: 123 RAALTQHFARRGNAADVSAKEQSQETATTILGMVTGMAVTRLLAAGEGPAGGPTGAAGF- 181
Query: 311 VVTWIHMYCNLK---------SYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
V+ W+ ++ L + + + L +LN R L+ YL G+ + V+ E
Sbjct: 182 VIAWL-VFLGLTAVHVVANVAAMRVLLLTSLNTPRLELLVDRYLQDGRVLSPRAVSQLED 240
Query: 362 LFP 364
L P
Sbjct: 241 LTP 243
>gi|296411034|ref|XP_002835240.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295628015|emb|CAZ79361.1| unnamed protein product [Tuber melanosporum]
Length = 413
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/398 (24%), Positives = 178/398 (44%), Gaps = 36/398 (9%)
Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
S+Q +S I+G+L+++A+L G+G A T A + + +D + L+ I+ ++ FG
Sbjct: 6 SLQAFSSSIAGLLSSRAVLEGFGVGDANASATGALLLTIAQDCVSRLATILFAHRFGPAL 65
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYA 261
+ K +RL AD+ ++A L+ L+PAFP + + +G+ R+A + +++
Sbjct: 66 EAETKRYRLAADVFNDSAMILDCLSPAFPRSVRILMLCCSGSLRAACGVAGGGSKASLSV 125
Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
FA N E+ AK +Q V +G++ G + +HI S+ + S +++ +H+ N
Sbjct: 126 HFARSGNVGELNAKDSSQETVIGLMGMLAGSFVVSHISSTWATWI-SLTLLQAVHLGTNF 184
Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLV 381
K+ ++ LRTLN RA++V S + +G+ ++ E +F K +L
Sbjct: 185 KAVTAVALRTLNRQRANIVCSAFFDTGEILTPRQAARHERIF-----------EKDGVLR 233
Query: 382 LSSEAKDAAVEIEHRLQ-LGSKLSDVVNNKEDAHA-------LFSLYEDEGYIL--TEHG 431
EI +L L L K++ L S++++E Y+L
Sbjct: 234 WGGSGVLGNAEIGVKLTALIRSLRQSPERKKEGEGEGVSPVELASVFQNEKYLLWYDVQR 293
Query: 432 GKFCVVLKESALPQDMLKSLFQASYL------YWLERNAGIVATSTSADCAPGGRLEISL 485
+ LKE P++ LKS A L +W + A ++ + A + SL
Sbjct: 294 RLVTICLKEKVSPEEQLKSWVHALVLAREIRHHWPKEGAD--NSNRPSQQALLDTVRRSL 351
Query: 486 DYVQREFNHVKSDSASVGWVTDGLIARPLPN---RIRP 520
D + F + GW D + +P IRP
Sbjct: 352 DDIGMSFTPMLGRLKGAGWDLDAAVLETVPGYRVSIRP 389
>gi|170118219|ref|XP_001890291.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164634725|gb|EDQ99048.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 471
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 169/368 (45%), Gaps = 47/368 (12%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTA 176
+L + LP G+PDSV+ DYL Y + ++Q + + G+L+++A+L G+G A T
Sbjct: 47 DLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLVGLLSSRAILEGFGVGDPSATATN 106
Query: 177 AAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-----F 230
A + VL+D L+ I+ ++ G + K +RL AD+L +AA L+ ++P F
Sbjct: 107 ALLLTVLQDVFSRLTTIVAAHILGSSLAPDVKKYRLLADMLNDAAVILDTISPRLDTLFF 166
Query: 231 PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAA----RRNFAEVIAKGEAQGMVSKAI 286
P V + RS A+ +++ FA + ++ AK ++ V +
Sbjct: 167 PGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKETVLALL 226
Query: 287 GIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
G++LG + H+ + P+ + + V+ +H+ N + + LRTLN R L +S Y+
Sbjct: 227 GMLLGTLIVPHL--TTPWTIYTTLLVLVGLHLTINYIGVRGLVLRTLNGQRTWLAWSAYM 284
Query: 346 LS--GQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKL 403
+ G+AP EV E +F F+ +S +L+ R +GS
Sbjct: 285 RTQPGRAPTPDEVASLERIFERPGAFR---DPRSGVLM-------------GRCTIGSSF 328
Query: 404 SDVVNNKEDAHALFSLYEDEGYIL--------------TEHGGKFCVVLKESALPQDMLK 449
S++++ L L+++E Y+L E + + LKE +D LK
Sbjct: 329 SEILSGPIPPQ-LLELFKEERYLLWFDRRCLLRPESPSVEGFLRLHICLKEGYNTEDQLK 387
Query: 450 SLFQASYL 457
+ A+ L
Sbjct: 388 AWIHAAEL 395
>gi|164423593|ref|XP_001728067.1| hypothetical protein NCU11380 [Neurospora crassa OR74A]
gi|157070159|gb|EDO64976.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 357
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 121/242 (50%), Gaps = 26/242 (10%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP G+P SVT DYL Y + S+Q S IS +LA +A+L +G+G + PTAA I +
Sbjct: 44 LPAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLEGLGVGDASSSPTAALILKIT 103
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF----PHHFVFIG 238
D I + I+ ++ G + K +R AD+L +AA L++L P + + +G
Sbjct: 104 SDTISRFATILFAHRLGTAIEPECKFFRFLADVLNDAAQLLDLLMPVMMMGETRYGLKLG 163
Query: 239 AAAGAG--RSAAALIQASTRSCFYAGFAARR-NFAEVIAKGEAQGMVSKAIGIMLGIALA 295
G+G RS + ++++ A F +R + E+ AK +Q V +G+++G
Sbjct: 164 VVVGSGVLRSWCGVAAGASKASLSAHFVTQRSSLGELNAKEASQETVVSLLGMLVG---- 219
Query: 296 NHIGSSMPFALASFSVVTW--------IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
S+ + +V W +H+ N + + + LRTLN RA++V +L +
Sbjct: 220 -----SLVVKVVQDRLVVWVLMVGLLGVHLLMNYRGVRCVRLRTLNRQRATMVVRSWLET 274
Query: 348 GQ 349
G+
Sbjct: 275 GR 276
>gi|218187487|gb|EEC69914.1| hypothetical protein OsI_00335 [Oryza sativa Indica Group]
Length = 386
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 71/220 (32%), Positives = 107/220 (48%), Gaps = 21/220 (9%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL----GKGAIPTAAA 178
F+LP GFP SV+ DYL Y LW+ V I L T LL A+G+ G A +AAA
Sbjct: 70 FVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAASAAA 129
Query: 179 INWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
I WV KDGIG ++++ FG FD +PK WR++AD + +A E++TP +P +F+ +
Sbjct: 130 IRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYFLPL 189
Query: 238 GAAAGAGRS---AAALIQASTRSCFYAGFAARRN-FAEVIAKGEAQGMVSKAIGIMLGIA 293
+ + + L QA R F +N FAE GE +
Sbjct: 190 ASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAKDTSG-------- 241
Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
I SS P ++ V +H++ +S ++ RT++
Sbjct: 242 ----IKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTVD 277
>gi|397615116|gb|EJK63231.1| hypothetical protein THAOC_16128 [Thalassiosira oceanica]
Length = 621
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 118/272 (43%), Gaps = 45/272 (16%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI-GLGKGAI 173
+CR+ + +PEG VT Y + WR +Q + V+ TQ+LL + G+ + +
Sbjct: 146 KCRQSLHKNFVPEG----VTPSYYRFMRWRILQRFMNANVHVIGTQSLLMGLRGMQRSTV 201
Query: 174 PTAAA-----------INWVLKDGIGYLSKIM-LSNFGRHFDVNPKGWRLFADLLENAAF 221
+AAA NWVLKD +G L ++ S GR FD + K WR A +
Sbjct: 202 VSAAAKGGAALGAVAATNWVLKDTLGKLVRMAWASKMGRKFDSDAKRWRYRACFIYALGN 261
Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF------------------ 263
GLE+ T P F+ + A + + + L ++TR+ Y F
Sbjct: 262 GLEVSTYLHPRFFLVLAMLANSCKQVSMLTSSATRNALYNSFRRMDDDVDTVGTGSNSTL 321
Query: 264 -----AARR-----NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
RR N ++ AKGEAQ V +GI GI L+ IG S+ L +SV+
Sbjct: 322 KSSINGGRRRGVVENIGDITAKGEAQIAVVDLLGIASGICLSRAIGVSVQNVLTVWSVLQ 381
Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
+YC +SI +TLN R V L
Sbjct: 382 LGEIYCMYHEIRSIVYQTLNFERMYTVLGRLL 413
>gi|428168470|gb|EKX37414.1| hypothetical protein GUITHDRAFT_89666 [Guillardia theta CCMP2712]
Length = 331
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 92/165 (55%), Gaps = 3/165 (1%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT- 175
R ++ F LP G+P S+ +Y Y + +Q ++S + G+L+TQ +L +G+GK + +
Sbjct: 124 RIMYATF-LPSGYPASLPDEYTKYQAFNLLQDLSSNLRGILSTQKILEGMGVGKAGVTSL 182
Query: 176 AAAINWVLKDGIGYLSKIMLSNFGRH-FDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
AA + W+ +DG L ++ + F F ++ K WRLFADL+ + A L+ML PA+P F
Sbjct: 183 AATMQWMARDGSSMLGGLLFTAFASSSFGIDIKRWRLFADLINDVALLLDMLAPAYPDLF 242
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
+ I + ++ + + + +AAR N A+V AK AQ
Sbjct: 243 LPIICLSSVFKAMCGVAAGACNTAIVQHWAARGNIADVGAKTGAQ 287
>gi|413942078|gb|AFW74727.1| hypothetical protein ZEAMMB73_469739, partial [Zea mays]
Length = 199
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 50/132 (37%), Positives = 79/132 (59%), Gaps = 6/132 (4%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
D+ + F+ P G+P SV YL Y+ +R +Q +S + VL+TQ+LL+A+GL
Sbjct: 69 DKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAVGLR--PT 126
Query: 174 PT-AAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
PT A A++W+LKDG+ + K++ S+ G D PK WR+ AD+L + LE+++P P
Sbjct: 127 PTQATAVSWILKDGMQHTRKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQ 186
Query: 233 HFVFIGAAAGAG 244
F+ + AG G
Sbjct: 187 LFLEV---AGLG 195
>gi|403418386|emb|CCM05086.1| predicted protein [Fibroporia radiculosa]
Length = 497
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 98/394 (24%), Positives = 178/394 (45%), Gaps = 65/394 (16%)
Query: 107 LSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI 166
S+ + + R L LP G+P SV+ DYL Y ++ ++Q S ++ ++A++A+L
Sbjct: 34 FSIKQKRTDWRRLLSNVFLPAGYPASVSPDYLQYQIYNALQAFCSSLASLIASRAVLEGH 93
Query: 167 GLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLE 224
G+G A T A VL+D L+ I+ + G K +RL AD+ +AA L+
Sbjct: 94 GVGNADASATHAIFLTVLQDIFSRLTTIVSGYYLGTSLFPEAKTYRLLADIFNDAAIVLD 153
Query: 225 MLTP-------AFPHHFVFIGAA----------AGAGRSAAALIQASTRSCFYAGFAA-- 265
L+P + + FV G+ +GA R+ ++ +++ FA+
Sbjct: 154 TLSPHLDGVVLSLRYPFVLSGSGSSLRVSALCLSGASRALCGMVAGGSKAALTVHFASPG 213
Query: 266 --RRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
+ ++ AK ++ V +G++ G A+ +++ S+ L F V+T++H++ N +
Sbjct: 214 QGTGDVGDLNAKDGSKETVLALLGMLSGTAVMHYVHSARATYLVLF-VLTFLHLFGNYLA 272
Query: 324 YQSIELRTLNPYRASLVFSEYL------LSGQAPPV---KEVNDEEPLFPAFHFFKIKSA 374
+ + LR+LN RA++V+ Y + G+ P V + V E +F A
Sbjct: 273 VRVVVLRSLNRQRANIVWMAYRAHIKGPMGGKRPLVLTPEAVAARERIF----------A 322
Query: 375 NKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA-------------HALFSLYE 421
N S L SS A ++GS S + + ++ A H + ++
Sbjct: 323 NPSALHDTSSPAH----PFLGICKVGSSFSSLASRQQSAPRGALAAPSPPQIHTILEIFS 378
Query: 422 DEGYIL--TEHGGKFC---VVLKESALPQDMLKS 450
+E Y+L G C V LK+ P+D LK+
Sbjct: 379 EEPYVLWFGPTSGPSCRLHVFLKDGHKPRDHLKA 412
>gi|409048331|gb|EKM57809.1| hypothetical protein PHACADRAFT_251666 [Phanerochaete carnosa
HHB-10118-sp]
Length = 532
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 103/440 (23%), Positives = 178/440 (40%), Gaps = 54/440 (12%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAAIN 180
Q LP +P SV YL + L + ++ + + GVL QALL ++G+ +G+I A A+
Sbjct: 112 QMFLPTNYPQSVHPSYLKFHLLQFMECIVGTVVGVLCNQALLVSVGVSAEGSILGAVAVQ 171
Query: 181 WVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLE---MLTPAFPHHFVF 236
WV+KDG G ++K+ ++ F +FD +PK + + GL+ ++ P +F+
Sbjct: 172 WVIKDGAGEVAKLWVIRRFSPYFDSHPKTFTFTGATMGLLGSGLQIAALVVPPTTLNFLL 231
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
A + + +T F FA + N +V AK E+QG V G G+ L
Sbjct: 232 CAAGGNIFKCVGGAVWLTTHIKFVRFFARQGNMGDVAAKSESQGSVGILSGYAAGVGLLT 291
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG-------Q 349
+ S P+ F++ IH+ ++ L R S + E+ +
Sbjct: 292 -VSHSAPYLYTIFALAIPIHLLITQWMLRAATFELLTLPRLSFLAREFASRASRPEAERE 350
Query: 350 APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN 409
P + E+ + + +FK K +D V+I L +D +
Sbjct: 351 LPTLGELERQRAIGVFGEYFKTK--------------EDQYVQIAPDLDAVVSPTD-LGA 395
Query: 410 KEDAHALFSLYEDEGYIL-----TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNA 464
+ A + D+ Y+L + K V+L SA D ++S+ A++ L+R+
Sbjct: 396 RTRWDACVEAFNDQPYLLYPYLMQQSECKIAVLLHPSATVDDRIQSVLHAAF---LQRHL 452
Query: 465 GIVATSTSADCAP------GGRLEISLDYVQR----EFNHVKSDSASVGWVTDGLIARPL 514
V A+ P L+ +LD +R K+ W TD + L
Sbjct: 453 HNVMQIDPAESTPQELLDNDKVLKAALDDAKRLVALYLREFKTQLVEKEWRTDEIAVPEL 512
Query: 515 PNRI--------RPGYVEPS 526
R+ +P VE S
Sbjct: 513 GRRVLWGTAAIQKPKRVEKS 532
>gi|70991459|ref|XP_750578.1| DUF647 domain protein [Aspergillus fumigatus Af293]
gi|66848211|gb|EAL88540.1| DUF647 domain protein [Aspergillus fumigatus Af293]
Length = 542
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/236 (29%), Positives = 117/236 (49%), Gaps = 26/236 (11%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
L V LP G+P+SVT DYL A+L +G+G A PT+A
Sbjct: 69 LLVDVFLPSGYPNSVTDDYL---------------------PAVLQGVGVGNANASPTSA 107
Query: 178 AINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
+ +L+D G ++ I+ ++ G + K +RL AD+ + A L+ L+P P
Sbjct: 108 LLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVLDCLSPMIPAGVGR 167
Query: 237 IG--AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+G +AAG R+ + S+++ A FA N AEV AK +Q + +G+++G +
Sbjct: 168 VGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLLGMLVGSFI 227
Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
+H+ +S S + +H+ N + +S+++ +LN RA++VFS L S A
Sbjct: 228 VSHV-TSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRANIVFSTLLESDPA 282
>gi|159124134|gb|EDP49252.1| DUF647 domain protein [Aspergillus fumigatus A1163]
Length = 542
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/236 (29%), Positives = 117/236 (49%), Gaps = 26/236 (11%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
L V LP G+P+SVT DYL A+L +G+G A PT+A
Sbjct: 69 LLVDVFLPSGYPNSVTDDYL---------------------PAVLQGVGVGNANASPTSA 107
Query: 178 AINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
+ +L+D G ++ I+ ++ G + K +RL AD+ + A L+ L+P P
Sbjct: 108 LLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVLDCLSPMIPAGVGR 167
Query: 237 IG--AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+G +AAG R+ + S+++ A FA N AEV AK +Q + +G+++G +
Sbjct: 168 VGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLLGMLVGSFI 227
Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
+H+ +S S + +H+ N + +S+++ +LN RA++VFS L S A
Sbjct: 228 VSHV-TSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRANIVFSTLLESDPA 282
>gi|413949258|gb|AFW81907.1| hypothetical protein ZEAMMB73_025152 [Zea mays]
Length = 253
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 72/132 (54%), Gaps = 1/132 (0%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
+E +++PEGFPDSVT Y+ Y WR+++ GV T+ LL ++G+ + A
Sbjct: 116 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRA 175
Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
A AINW+LKDG G + K+ + G+ FD + K R DLL G+E+ T AFP
Sbjct: 176 ASGAVAINWILKDGAGRVGKMFFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 235
Query: 233 HFVFIGAAAGAG 244
F+ + A G
Sbjct: 236 FFLPMACIANVG 247
>gi|224002080|ref|XP_002290712.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974134|gb|EED92464.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 656
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 75/282 (26%), Positives = 122/282 (43%), Gaps = 45/282 (15%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI-GLGK-G 171
D + + QF+ P+ VT Y + WR +Q + V+ TQ+LL + G+ + G
Sbjct: 139 DTLVQKWTQFLHSAFVPEGVTPSYYRFMRWRILQRFINANVHVIGTQSLLMGLRGMTRSG 198
Query: 172 AIPTAAA--------------INWVLKDGIGYLSKIM-LSNFGRHFDVNPKGWRLFADLL 216
++ AA+ NWVLKD +G + +++ S GR FD + K WR + L+
Sbjct: 199 SVGEAASFAAKEGAALGAAAATNWVLKDTLGKIVRMVWASKMGRKFDPDAKRWRFRSALI 258
Query: 217 ENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFA------------ 264
GLE+ T P +F+ + A + + + L ++TR+ Y F
Sbjct: 259 YALGNGLEVSTYLHPQYFLLLAMLANSCKQMSMLTSSATRNALYNSFKRVGTTSSTVSTS 318
Query: 265 ---------ARRNFA-------EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALAS 308
RR ++ AKGEAQ V +GI GI L+ +G S+ A
Sbjct: 319 NNASGSSAIVRRGGNGGIENIGDITAKGEAQIAVVDLLGIASGICLSRAVGVSVQNVFAV 378
Query: 309 FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
+ V+ ++C +SI RT N R +V + +L A
Sbjct: 379 WVVLQIGEIFCMYHEMRSIVYRTFNFERMYMVLGKLMLDDAA 420
>gi|407259493|gb|AFT91377.1| hypothetical protein [Emericella rugulosa]
Length = 549
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 119/235 (50%), Gaps = 21/235 (8%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
R++ + LP G+P SV+ DY + LA A+ +G+G A PT
Sbjct: 66 RDVLIDVFLPAGYPHSVSDDY----------------TPALAQPAIHAGVGVGNANASPT 109
Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
+A + +L+D G ++ I+ ++ G + K +R AD+ + A L+ L+P P F
Sbjct: 110 SALLLHILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCLSPMIPAGF 169
Query: 235 --VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
V + +AAG R+ + S+++ A FA N A+V AK +Q V IG+++G
Sbjct: 170 NRVTVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLADVNAKDSSQETVISLIGMLVGS 229
Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
+ +HI S L +V+ +H+ N + +S+++ TLN RA++VFS L S
Sbjct: 230 FVVSHITSFTATWLTLVFLVS-MHLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 283
>gi|397627073|gb|EJK68340.1| hypothetical protein THAOC_10484 [Thalassiosira oceanica]
Length = 478
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 115/223 (51%), Gaps = 13/223 (5%)
Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKI 193
+ YL Y ++Q + + + VLATQ +L +G+G+ GA AA ++++++DG G ++ +
Sbjct: 91 TGYLEYVCMDNLQDLTTALRSVLATQRVLEGVGVGREGATALAATLDFLIRDGCGMIASL 150
Query: 194 ML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
+L S F F + K W+LFAD +A LE++ P+ P + F+ GA A
Sbjct: 151 VLTSGFSSTFRRDIKRWKLFADAAVDAGITLEVVAPSLPSRW-FLPVLCGANVCKALCGV 209
Query: 253 ASTRSC--FYAGFAAR-----RNFAEVIAKGEAQGMVSKAIGIMLGIALANHI---GSSM 302
A+ C +AAR +EV AK AQ + A G++L A + G++
Sbjct: 210 AAGAVCGPVQLHWAARLGLGDEGLSEVAAKAGAQRTIVGATGLVLSGLFARWMGGRGANR 269
Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
P + ++V+T +H+ N +S + + L LN R L+ ++L
Sbjct: 270 PVWVGMYAVLTALHLLANYRSMRIVRLNWLNRERMDLILEDFL 312
>gi|392871321|gb|EAS33216.2| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
Length = 407
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 160/321 (49%), Gaps = 32/321 (9%)
Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
S+Q +S I+G+L+++A+L +G+G A PT+A + +L+D +G ++ I+ ++ G
Sbjct: 21 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 80
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYA 261
+ K +RL AD+ +++ ++ L+P FP V + + + R+ + S+++ A
Sbjct: 81 EPECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 140
Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
FA N AE+ AK +Q V +G+++G + ++I S + +A S++ +H+ N
Sbjct: 141 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLI-VHITTNY 199
Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLV 381
+ +++ + +LN RA++VFS + K+ + +E +F + K+++ +
Sbjct: 200 LAVRAVSMTSLNRQRANIVFSTLFDENRVLTAKQASKQERVFERDGVLRWKASSDTLGFC 259
Query: 382 LSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH----------ALFSLYEDEGYIL-TEH 430
Q+G DV+++ + A+ ++E E YIL +
Sbjct: 260 ----------------QIGVPFKDVIDHVSSSSPAERTEATIVAILRVFEQEEYILWFDP 303
Query: 431 GGKFCVV-LKESALPQDMLKS 450
K V+ LK +A P LK+
Sbjct: 304 ARKRGVIGLKTNASPTSQLKA 324
>gi|320038116|gb|EFW20052.1| hypothetical protein CPSG_03227 [Coccidioides posadasii str.
Silveira]
Length = 478
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 160/321 (49%), Gaps = 32/321 (9%)
Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
S+Q +S I+G+L+++A+L +G+G A PT+A + +L+D +G ++ I+ ++ G
Sbjct: 92 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 151
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYA 261
+ K +RL AD+ +++ ++ L+P FP V + + + R+ + S+++ A
Sbjct: 152 EPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 211
Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
FA N AE+ AK +Q V +G+++G + ++I S + +A S++ +H+ N
Sbjct: 212 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLI-VHITTNY 270
Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLV 381
+ +++ + +LN RA++VFS + K+ + +E +F + K+++ +
Sbjct: 271 LAVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGFC 330
Query: 382 LSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH----------ALFSLYEDEGYIL-TEH 430
Q+G DV+++ + A+ ++E E YIL +
Sbjct: 331 ----------------QIGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYILWFDP 374
Query: 431 GGKFCVV-LKESALPQDMLKS 450
K V+ LK +A P LK+
Sbjct: 375 ARKRGVIGLKTNASPTSQLKA 395
>gi|303316636|ref|XP_003068320.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
delta SOWgp]
gi|240108001|gb|EER26175.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
delta SOWgp]
Length = 478
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 160/321 (49%), Gaps = 32/321 (9%)
Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
S+Q +S I+G+L+++A+L +G+G A PT+A + +L+D +G ++ I+ ++ G
Sbjct: 92 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 151
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYA 261
+ K +RL AD+ +++ ++ L+P FP V + + + R+ + S+++ A
Sbjct: 152 EPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 211
Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
FA N AE+ AK +Q V +G+++G + ++I S + +A S++ +H+ N
Sbjct: 212 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLI-VHITTNY 270
Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLV 381
+ +++ + +LN RA++VFS + K+ + +E +F + K+++ +
Sbjct: 271 LAVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGFC 330
Query: 382 LSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH----------ALFSLYEDEGYIL-TEH 430
Q+G DV+++ + A+ ++E E YIL +
Sbjct: 331 ----------------QIGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYILWFDP 374
Query: 431 GGKFCVV-LKESALPQDMLKS 450
K V+ LK +A P LK+
Sbjct: 375 ARKRGVIGLKTNASPTSQLKA 395
>gi|346467075|gb|AEO33382.1| hypothetical protein [Amblyomma maculatum]
Length = 388
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 103/213 (48%), Gaps = 3/213 (1%)
Query: 154 SGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRL 211
+G LA Q++L +G+G +GA AA W+L+DG G +I+ + F G + D + K WRL
Sbjct: 1 TGTLAIQSVLTGVGVGDQGASVLAATTTWILRDGAGMTGRILFAWFQGSNLDYDSKKWRL 60
Query: 212 FADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAE 271
FAD+L +AA LE++ F + I + +S + +TR+ A N A+
Sbjct: 61 FADILNDAAIFLELMCQYFKGYVTPILCISSVAKSVVGVAGGATRAALTQHQARSNNMAD 120
Query: 272 VIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRT 331
V AK +Q + + + L + + F + T +H++ N ++ + + T
Sbjct: 121 VSAKDGSQETLVNLAAFLFSLLLLQLVAGNTWLVSTIFILFTLLHIFANYRAVSCVVMET 180
Query: 332 LNPYRASLVFSEYLLSGQA-PPVKEVNDEEPLF 363
N R S+V +L G PV VN E ++
Sbjct: 181 FNRSRYSVVVRHFLQKGGGIAPVGWVNARESVW 213
>gi|159474944|ref|XP_001695583.1| hypothetical protein CHLREDRAFT_191958 [Chlamydomonas reinhardtii]
gi|158275594|gb|EDP01370.1| predicted protein [Chlamydomonas reinhardtii]
Length = 487
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 81/135 (60%), Gaps = 1/135 (0%)
Query: 146 VQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDV 204
+ +A + VLA+ +LYA+GLG GAIPTA A+NWVLKDG+G L ++ +FDV
Sbjct: 143 LNNIAVTANSVLASTFMLYAVGLGAGAIPTAGALNWVLKDGMGQLGTLVFGKTIAHNFDV 202
Query: 205 NPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFA 264
+ K W + +L AA LEMLT P HF+ +G+ A + A + STRS F+ FA
Sbjct: 203 HSKTWFFLSAVLLQAAAALEMLTVLVPGHFLLMGSLANMLKGLAWMAAGSTRSVFHLSFA 262
Query: 265 ARRNFAEVIAKGEAQ 279
N A+V AKG +Q
Sbjct: 263 RDNNIADVTAKGTSQ 277
>gi|119467992|ref|XP_001257802.1| hypothetical protein NFIA_052500 [Neosartorya fischeri NRRL 181]
gi|119405954|gb|EAW15905.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 515
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 114/210 (54%), Gaps = 5/210 (2%)
Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
S+Q S I+G+L+++A+L +G+G A PT+A + +L+D G ++ I+ ++ G
Sbjct: 86 SLQAFCSSIAGLLSSRAVLQGVGVGNANASPTSALLLHILQDSSGRIATILFAHRVGTAL 145
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG--AAAGAGRSAAALIQASTRSCFY 260
+ K +RL AD+ + A L+ L+P P +G +AAG R+ + S+++
Sbjct: 146 EPECKMYRLAADVFNDVAMVLDCLSPMIPAGVGRVGVLSAAGVLRALCGVAGGSSKASLS 205
Query: 261 AGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCN 320
A FA N AEV AK +Q + IG+++G + +H+ + AS + +H+ N
Sbjct: 206 AHFARGGNLAEVNAKDSSQETIISLIGMLVGSFIVSHV-TCFAATWASLLFLLTVHLGMN 264
Query: 321 LKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
+ +S+++ +LN RA++V S L S A
Sbjct: 265 YAAVRSVQMTSLNRQRANIVLSTLLESDLA 294
>gi|452984686|gb|EME84443.1| hypothetical protein MYCFIDRAFT_187449 [Pseudocercospora fijiensis
CIRAD86]
Length = 435
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 90/357 (25%), Positives = 171/357 (47%), Gaps = 60/357 (16%)
Query: 121 VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAI 179
V LP GFP SV+ DYL Y ++ S+Q +S I+ +L+++A+L ++G+G A PTAA I
Sbjct: 44 VDVFLPVGFPHSVSEDYLEYQIYDSLQAFSSSIASLLSSRAVLSSVGVGDSTASPTAAII 103
Query: 180 NWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
+L+D +G L+ I+ ++ FG + K +RL AD + L + + G
Sbjct: 104 LSILQDSVGRLATILFAHSFGTSLEPECKMYRLLADPIRVVVLSLSSVLRS------LCG 157
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
AAG S+++ A FA + N E+ AK +Q V +G+M G + + I
Sbjct: 158 VAAG-----------SSKASLSAHFAKQGNLGELNAKDSSQETVISLVGMMAGSLVISWI 206
Query: 299 GSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
+ P A + ++ IH+ N ++ +++++RTL RA+L++ + + + P +
Sbjct: 207 --TTPIATWTALILLLSIHLETNRRAVRAVKMRTLTRQRAALIYYQ-VQQNKTPTPDFIA 263
Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA-- 415
+E +F + +L ++A E+ +G L+ ++N+ + H+
Sbjct: 264 SQERIF-----------ERDGVL------RNAKAEVLGFCSVGVPLARLLNSIGEQHSQT 306
Query: 416 ------------LFSLYEDEGYIL------TEHGGKFCVVLKESALPQDMLKSLFQA 454
+ LY+ +IL +V+K+SA+ +D++ + +QA
Sbjct: 307 KSSTMASEKFTRILHLYQTSAFILWSDRHSRSRRNSLFIVVKKSAVSRDLILAWWQA 363
>gi|347842141|emb|CCD56713.1| hypothetical protein [Botryotinia fuckeliana]
Length = 376
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 153/322 (47%), Gaps = 53/322 (16%)
Query: 165 AIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFG 222
++G+G A T A + +L++ G +S I+ ++ G + K +RL AD+ +AA
Sbjct: 2 SLGVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPECKKYRLMADIFNDAAMI 61
Query: 223 LEMLTPAFPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
L+ ++PAFP VF+ +A+ +S + S+++ A FA N AE+ AK +Q
Sbjct: 62 LDCISPAFPKIPRVFLLSASSVCKSLCGVAAGSSKASLSAHFAKMGNLAELNAKDASQET 121
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
+ +G+++G + + I S + +A ++++ IH+ N + +S+ +RTLN RA+LV
Sbjct: 122 LISLLGMLVGTFVVSKISSQVATWIALLALLS-IHLGTNYMAVRSVTMRTLNRQRANLVI 180
Query: 342 SEYLLSGQ-------APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE 394
S L S + P ++++ +E +F +D A+
Sbjct: 181 SSLLSSPEHEKNKISLPSPRDISLQERIFE----------------------RDGAIRNT 218
Query: 395 H-----RLQLGSKLSDVVNN-------------KEDAHALFSLYEDEGYIL--TEHGGKF 434
H ++G L D++++ + +L S+Y+ +GY++ + F
Sbjct: 219 HGTILAYCKVGVSLQDLLSSIATPTTAGSYAEPESILTSLLSIYKTQGYLMWYSRSRNTF 278
Query: 435 CVVLKESALPQDMLKSLFQASY 456
V+LKE P+ L + F A +
Sbjct: 279 LVILKEETAPKFQLDAWFHAIF 300
>gi|170102118|ref|XP_001882275.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642647|gb|EDR06902.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 285
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 132/263 (50%), Gaps = 22/263 (8%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
+L + LP G+PDSV+ DYL Y + ++Q + ++G+L+++A+L G+G P+A
Sbjct: 6 DLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLAGLLSSRAILEGFGVGD---PSAT 62
Query: 178 AIN----WVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPA--- 229
A N VL+D L+ I+ ++ G + K +RL AD+L +AA L+ ++P
Sbjct: 63 ATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDAKKYRLLADMLNDAAVILDTISPRLDT 122
Query: 230 --FPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAA----RRNFAEVIAKGEAQGMVS 283
FP V + RS A+ +++ FA + ++ AK ++ V
Sbjct: 123 LFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKETVL 182
Query: 284 KAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFS 342
+G++LG + H+ + P+ + A+ V+ +H+ N + + LRTLN R + +S
Sbjct: 183 ALLGMLLGTLIVPHL--TTPWTIYATLLVLVGLHLTINYIGVRGLVLRTLNGQRTWIAWS 240
Query: 343 EYLLSG--QAPPVKEVNDEEPLF 363
Y+ + +AP EV E +F
Sbjct: 241 AYMRTQPRRAPTPVEVASLERIF 263
>gi|426194386|gb|EKV44317.1| hypothetical protein AGABI2DRAFT_120451 [Agaricus bisporus var.
bisporus H97]
Length = 462
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/416 (23%), Positives = 175/416 (42%), Gaps = 49/416 (11%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAI 173
+ +EL + LP G+P++VT DYL Y + + Q + ++ +LA++A L IG+G A
Sbjct: 46 DTKELLTKIFLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPSAT 105
Query: 174 PTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
T A + VL+D G ++ I+ + + G K +R AD+L + A +++ P F
Sbjct: 106 STGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVF-S 164
Query: 233 HFVFIGA------AAGAGRSAAALIQASTRSCFYAGFAARR----NFAEVIAKGEAQGMV 282
F+F GA + A R+ ++ ++ FA R + ++ AK ++ V
Sbjct: 165 AFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKETV 224
Query: 283 SKAIGIMLGIALANHIGS---SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
+G++LG L HI S + P L SV H+ N + + + +R+ N +RA++
Sbjct: 225 LALLGMLLGSVLIPHITSPWVTYPVLLLLISV----HLVINYIAVRGLAIRSPNRHRATI 280
Query: 340 VFSEYLLSGQ-APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQ 398
+ Y + AP + V E +F + + + LVL +
Sbjct: 281 AWLWYRSKDECAPSPQAVASHELIFERPENIR----DPTTSLVLG------------KCH 324
Query: 399 LGSKLSDVVNNKEDAHALFSLYEDEGYIL-------TEHGGKFCVVLKESALPQDMLKSL 451
+GS DV+ + ++ + YI+ T H + KE D LK
Sbjct: 325 VGSAPLDVMRGPALSKEYLDIFHADKYIVCFNPSHETGHLPIIHICFKEGFTRADELK-- 382
Query: 452 FQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTD 507
S+++ +E I + + V+ +++H GW D
Sbjct: 383 ---SWIHAVEICRSIALNQVPVQKGALEHIRSTFQVVKEKYDHFVDHMKKAGWNFD 435
>gi|393227224|gb|EJD34913.1| DUF647-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 377
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 144/325 (44%), Gaps = 18/325 (5%)
Query: 142 LWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLS-NFG 199
++ S+Q S+I+G+LA +A+L +G+G GA T A + +++D G ++ I+ + G
Sbjct: 1 MFDSLQAFFSEIAGLLAARAVLTGLGVGDAGATATGAVLLKIIQDTCGRVATILSAWKLG 60
Query: 200 RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSC 258
+ K WRL ADL +A LE L+P PH + + A RS + ++
Sbjct: 61 PMLEPECKQWRLAADLFNDAGIILECLSPTLPHTMKISLLCVARVVRSLCGVSAGGAKAA 120
Query: 259 FYAGFAA-RRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHM 317
F + A++ AK +Q V IG+++G + I +S + + +H+
Sbjct: 121 LSQHFVGPGGSIADINAKESSQETVVALIGMLVGSMIVPRI-TSFEGTWTALLFLIGVHL 179
Query: 318 YCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKS 377
Y N + +++ + TLN RA++V E + G+ ++V E +F +
Sbjct: 180 YANWRGVRAVAMLTLNRQRATIVLGELIQRGRVLSPEQVRQRESVFCRSSRLVLDGVTLG 239
Query: 378 QLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL---TEHGGKF 434
Q + V ++H L +++ + L + E YIL GK
Sbjct: 240 QCHI--------GVSLQHLLAQIARVHGPDRSGSALAGLMDRFASERYILHCCCRRKGKV 291
Query: 435 CVV--LKESALPQDMLKSLFQASYL 457
++ LK +DML++ A L
Sbjct: 292 SIIICLKRGVESRDMLRAWTHALVL 316
>gi|336467636|gb|EGO55800.1| hypothetical protein NEUTE1DRAFT_103211 [Neurospora tetrasperma
FGSC 2508]
gi|350287710|gb|EGZ68946.1| hypothetical protein NEUTE2DRAFT_72230 [Neurospora tetrasperma FGSC
2509]
Length = 361
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 120/247 (48%), Gaps = 30/247 (12%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
LP G+P SVT DYL Y + S+Q S IS +LA +A+L +G+G + PT A I +
Sbjct: 44 LPAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLEGLGVGDASSSPTGALILKIA 103
Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAA 241
D I ++ I+ ++ G + K +R AD L +AA L++L PA + + +G
Sbjct: 104 SDTISRVATILFAHRLGTAIEPECKFFRFLADCLNDAAQLLDLLVPAMMERYGLKLGVIV 163
Query: 242 GAG--RSAAALIQASTRSCFYAGF--------AARRNFAEVIAKGEAQGMVSKAIGIMLG 291
GAG RS + +++++ A F N AE+ AK +Q V +G+++G
Sbjct: 164 GAGVLRSWCGVAASASKASLSAHFIVVASQQKGGGNNLAELNAKEASQETVVSLLGMLVG 223
Query: 292 IALANHIGSSMPFALASFSVVTW--------IHMYCNLKSYQSIELRTLNPYRASLVFSE 343
S+ + +V W +H+ N + + RTLN RA++V E
Sbjct: 224 ---------SLVVQVVQDRLVVWVLMVGLLGVHLLMNYWGVRCVRCRTLNRQRATMVVRE 274
Query: 344 YLLSGQA 350
+L +G+
Sbjct: 275 WLETGRV 281
>gi|67972176|dbj|BAE02430.1| unnamed protein product [Macaca fascicularis]
Length = 326
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 82/168 (48%)
Query: 199 GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSC 258
G D N K WRLFAD+L + A LE++ P +P F + + + ++ +TR+
Sbjct: 12 GSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAA 71
Query: 259 FYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMY 318
A R N A+V AK +Q + G+++ + + + F+L F +T +H+Y
Sbjct: 72 LTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIY 131
Query: 319 CNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
N ++ +S+ + TLN R LV YL G+ N EPL+ F
Sbjct: 132 ANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGF 179
>gi|391863517|gb|EIT72825.1| hypothetical protein Ao3042_00909 [Aspergillus oryzae 3.042]
Length = 236
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 91/156 (58%), Gaps = 4/156 (2%)
Query: 121 VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAI 179
V+ LP G+P SV+ DY+ Y ++ S+Q +S I+G+L+++A+L +G+G A PTAA +
Sbjct: 72 VEVFLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALL 131
Query: 180 NWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF--VF 236
+L+D G +S I+ ++ G + K +RL AD+ +AA L+ L+P P F V
Sbjct: 132 LHILQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVT 191
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV 272
+ + AG R+ + S+++ A F+ N AEV
Sbjct: 192 VLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEV 227
>gi|409076043|gb|EKM76417.1| hypothetical protein AGABI1DRAFT_131245 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 466
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 129/265 (48%), Gaps = 21/265 (7%)
Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAI 173
+ +EL + LP G+P++VT DYL Y + + Q + ++ +LA++A L IG+G A
Sbjct: 46 DTKELLTKIFLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPSAT 105
Query: 174 PTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
T A + VL+D G ++ I+ + + G K +R AD+L + A +++ P F
Sbjct: 106 STGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVF-S 164
Query: 233 HFVFIGA------AAGAGRSAAALIQASTRSCFYAGFAARR----NFAEVIAKGEAQGMV 282
F+F GA + A R+ ++ ++ FA R + ++ AK ++ V
Sbjct: 165 AFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKETV 224
Query: 283 SKAIGIMLGIALANHIGS---SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
+G++LG + HI S + P L SV H+ N + + + +R+ N +RA++
Sbjct: 225 LALLGMLLGSVMIPHITSPWVTYPVLLLLISV----HLVINYIAVRGLAIRSPNRHRATI 280
Query: 340 VFSEYLLSGQ-APPVKEVNDEEPLF 363
+ Y + AP + V E +F
Sbjct: 281 AWLWYRSKDECAPSPQAVASRELIF 305
>gi|403412621|emb|CCL99321.1| predicted protein [Fibroporia radiculosa]
Length = 4586
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 92/414 (22%), Positives = 174/414 (42%), Gaps = 54/414 (13%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAAIN 180
Q LP +P SV YL + + + + VL QALL ++G+ +G+I A A+
Sbjct: 115 QMFLPTNYPQSVHPSYLRFHALQFFETTFGTVVSVLCNQALLTSVGVSAEGSIFGAVAVQ 174
Query: 181 WVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEM----LTPAFPHHFV 235
W++KDG G ++K+ + F +FD +PK + LF + + L+M +TP P +F+
Sbjct: 175 WIIKDGAGEVAKLFFIRRFSPYFDSHPKSFTLFGEGIVAIGSCLQMATLLITPT-PGNFL 233
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
A + + +T + F+ + N +V AK E+Q +++ G GI L
Sbjct: 234 LCAAGGNIFKLVGYAVWFTTHIKWVRYFSLQGNVGDVAAKDESQTSIAQLCGYAAGIGLL 293
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEY------LLSGQ 349
+ S + + F +T +H+ + + + L R SL+ EY +G+
Sbjct: 294 T-VSHSPAYLYSLFFALTPLHLTATMFMMRDAKFEALTLPRLSLLAREYAREDGEAATGK 352
Query: 350 APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN 409
++E+ + FFK K+ + ++L ++++V+
Sbjct: 353 VNTLREIEESRQTGAFGEFFKRKN--------------------DRFVELAPRVTEVIGG 392
Query: 410 KEDAHALF-----SLYEDEGYIL----TEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
D + S ++ E Y+L + V A D L+S+ A+
Sbjct: 393 DVDREPVLWEMCTSEFQTEKYLLYPSASPSEKPITVFYHPDATSDDTLRSILHAARF--- 449
Query: 461 ERNAGIVATSTSAD---CAPGGRLEISL----DYVQREFNHVKSDSASVGWVTD 507
R+ + + +S D + G L+++L + ++ F K + GW TD
Sbjct: 450 -RHDLLESRLSSLDSPWSSRTGELQVALASSHAWTRQHFGAFKQELDDKGWRTD 502
>gi|219129806|ref|XP_002185071.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403566|gb|EEC43518.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 500
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 127/259 (49%), Gaps = 7/259 (2%)
Query: 96 ASASNASLSSLLSVNK-LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQIS 154
+S+ + L+S L+V + L + LP G+P YL YS+W VQ +++Q+
Sbjct: 56 SSSGRSELNSRLNVGRVLLRNVQSGLRSTFLPSGYPARTPPGYLRYSVWSWVQDLSTQLR 115
Query: 155 GVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLF 212
VLATQ +L +G+G+ GA +A +N++++DG G + + ++ F + K WR+F
Sbjct: 116 SVLATQRVLEGVGVGREGATALSALLNFLVRDGCGMAASLAFTSAAASRFQSDVKRWRIF 175
Query: 213 ADLLENAAFGLEMLTPAFPHH-FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAE 271
AD++ + LE+ A P F+ + + A ++ + + +A + ++
Sbjct: 176 ADVMVDIGITLEVAASACPPTLFLPMISLGNACKAMCGVAAGACGGAINLHWAKGSDISD 235
Query: 272 VIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL--ASFSVVTWIHMYCNLKSYQSIEL 329
+ AK AQ V+ ++G++ A + S++P ++ +T +H+ NL+ + I
Sbjct: 236 IQAKFGAQHTVTGSLGLVFAAFFAKSV-STVPGGRLWILYAALTALHVMANLRCMRIIHF 294
Query: 330 RTLNPYRASLVFSEYLLSG 348
N R LV SG
Sbjct: 295 DVFNTTRLRLVLRALFESG 313
>gi|294462406|gb|ADE76751.1| unknown [Picea sitchensis]
Length = 78
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 55/77 (71%)
Query: 447 MLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVT 506
ML+++FQ +LYWLER+ G+ + DC GG+L +S D++ REF++V+ +A+VGW
Sbjct: 1 MLRAMFQVCHLYWLERHLGVESKKIINDCMAGGKLALSHDFMVREFDNVQKKAATVGWYP 60
Query: 507 DGLIARPLPNRIRPGYV 523
+GL+ARPLP R+ Y+
Sbjct: 61 EGLVARPLPFRVHLNYI 77
>gi|414586419|tpg|DAA36990.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 311
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 140/299 (46%), Gaps = 33/299 (11%)
Query: 213 ADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV 272
AD+L + LE+++P P F+ + + A + +TR Y+ FA N +++
Sbjct: 32 ADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDL 91
Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
AKGEA + +GI GI LA+ + S+ L + +++ +H+Y ++ ++ + TL
Sbjct: 92 FAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTL 151
Query: 333 NPYRASLVFSEYLLSGQ-APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAV 391
NP R +++ ++++ SG+ + P + E+ LFP + +
Sbjct: 152 NPQRTAMIVADFIKSGKVSSPAELRYKEDLLFP-----------------------NRVI 188
Query: 392 EIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSL 451
E +++G L V + + L + + E ++L++ + +VL+ SA +D L+
Sbjct: 189 EEAGSVKIGQPLRKVFSPRL-VEQLRATFPGEKFLLSQKSNRTYMVLERSASGEDALRGW 247
Query: 452 FQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
A++ +ER+ + S D A L + + ++R F S+ S GW TD +
Sbjct: 248 LVAAFASEMERSG-----AGSRDDAV---LSEAYEKMERVFPTFVSEVRSRGWYTDQFL 298
>gi|414586421|tpg|DAA36992.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
Length = 164
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 84/154 (54%)
Query: 192 KIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALI 251
K++ S+ G D PK WR+ AD+L + LE+++P P F+ + + A +
Sbjct: 6 KLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVA 65
Query: 252 QASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSV 311
+TR Y+ FA N +++ AKGEA + +GI GI LA+ + S+ L + +
Sbjct: 66 ARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPL 125
Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
++ +H+Y ++ ++ + TLNP R +++ ++++
Sbjct: 126 LSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFI 159
>gi|167526040|ref|XP_001747354.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774189|gb|EDQ87821.1| predicted protein [Monosiga brevicollis MX1]
Length = 283
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 87/186 (46%), Gaps = 1/186 (0%)
Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
+ +L+D G + I + G D N K WRL AD+L + A LE+++P P +F+F+
Sbjct: 1 MTQILRDATGMIGSISFAWIQGSDLDNNAKQWRLVADVLNDTAMLLELISPLLPAYFLFL 60
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
A R+ + +TR+ A N +V AK +Q + + +++ + L
Sbjct: 61 VCVASLARAIVGVAGGATRAALIQHQARNNNMGDVAAKDGSQETLVNLVALLVNLILVPL 120
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
+ + + F + T H++ N K+ +S+ + LN RA L+ Y Q K N
Sbjct: 121 VSQNAQVVWSLFGLSTCCHIFFNYKAVRSLCMTRLNGRRAHLIVEHYTRHQQILSPKIAN 180
Query: 358 DEEPLF 363
EPLF
Sbjct: 181 QREPLF 186
>gi|119187997|ref|XP_001244605.1| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
Length = 472
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 85/379 (22%), Positives = 169/379 (44%), Gaps = 70/379 (18%)
Query: 110 NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLW-----------------------RSV 146
+ W + LP G+P SV+ DYL Y ++ S+
Sbjct: 43 RRTWSSYFTAILDAFLPVGYPHSVSGDYLEYQVYVRPIFLKLHWSFTDVANNRLFVQDSL 102
Query: 147 QGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDV 204
Q +S I+G+L+++A+L +G+G A PT+A + +L+D +G ++ I+ ++ G +
Sbjct: 103 QAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSLEP 162
Query: 205 NPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGF 263
K +RL AD+ +++ ++ L+P FP V + + + R+ + S+++ A F
Sbjct: 163 ECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSAHF 222
Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
A N AE+ A +G + ++I S + +A S++ +H+ N +
Sbjct: 223 AKWGNLAELNA---------------VGSLVVSYITSPLATWVALISLLI-VHITTNYLA 266
Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
+++ + +LN RA++VFS + K+ + +E +F + K+++ +
Sbjct: 267 VRAVSMTSLNRQRANIVFSTLFDENRVLTAKQASKQERVFERDGVLRWKASSDTLGFC-- 324
Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH----------ALFSLYEDEGYIL-TEHGG 432
Q+G DV+++ + A+ ++E E YIL +
Sbjct: 325 --------------QIGVPFKDVIDHVSSSSPAERTEATIVAILRVFEQEEYILWFDPAR 370
Query: 433 KFCVV-LKESALPQDMLKS 450
K V+ LK +A P LK+
Sbjct: 371 KRGVIGLKTNASPTSQLKA 389
>gi|258576451|ref|XP_002542407.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902673|gb|EEP77074.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 442
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 158/332 (47%), Gaps = 40/332 (12%)
Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFD 203
S+Q +S I+G+L+++A+L +G+G A PTAA + VL++ +G ++ I+ ++
Sbjct: 40 SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAALLLNVLQESMGRIATILFAH-----R 94
Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAG 262
+ K +RL AD+ ++A ++ L+P FP V + + + R+ + S+++ A
Sbjct: 95 LECKMYRLAADIFNDSAIVMDCLSPMFPKPMRVGVLSLSSVLRAMCGVAAGSSKASLSAH 154
Query: 263 FAARRNFAEVIAKGEAQGMVSKAIGIML-----------GIALANHIGSSMPFALASFSV 311
FA N AE+ AK +Q V +G+++ G+ L +G S+ + + ++
Sbjct: 155 FARWGNLAELNAKDSSQETVISLMGMLVRPILGFLESISGLILTQQVG-SLVVSRVTSTL 213
Query: 312 VTW--------IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
TW IH+ N + +++ + TLN RA++VFS + + +E +F
Sbjct: 214 ATWAALILLLTIHITTNYFAVRAVNMTTLNRQRANIVFSTIFDENRTLTPSQAAHQERIF 273
Query: 364 PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL-GSKLSDVVNNKEDAHALFSLYED 422
+ K++ S + V ++ +QL S S + + AL L+E
Sbjct: 274 ERDGVLRWKASPASL------GSCRIGVSFQYMVQLIASSESCTSDGQSIVAALLHLFER 327
Query: 423 EGYIL----TEHGGKFCVVLKESALPQDMLKS 450
E YIL T G +VLK +A LK+
Sbjct: 328 EEYILWFNPTRKRG--AIVLKTNATAASQLKA 357
>gi|356510851|ref|XP_003524147.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
Length = 158
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 70/120 (58%)
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
LKDG+ ++ K++ SN+G D PK WRL AD L + GLE+L+P PH F+ +
Sbjct: 1 LKDGMQHVGKLICSNWGGTMDSEPKRWRLLADALYDIGTGLEVLSPRCPHLFLEMAGLGN 60
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ + + +TR Y+ FA N ++++AKGEA + IGI +GI LA+ I +SM
Sbjct: 61 FAKGMSVVAARATRLPIYSSFAKEGNLSDLLAKGEAFSTLFNVIGIGVGIQLASTICASM 120
>gi|390599626|gb|EIN09022.1| DUF647-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 461
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 66/257 (25%), Positives = 125/257 (48%), Gaps = 12/257 (4%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
+ L LP G+P SV+ DYL Y + + Q S ++ ++A++A+L G+G A T
Sbjct: 30 KHLLTDVFLPAGYPSSVSPDYLRYQTYNAFQAFCSSLANLIASRAVLEGFGVGNASASAT 89
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-- 232
A + VL+D G L+ + + G K +RL AD+ +A+ L++ P +
Sbjct: 90 HAMLLTVLQDVFGRLTTVAGGYYLGTALSPEAKTYRLLADVFNDASIVLDVFQPFYASVP 149
Query: 233 ---HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
V ++G R+ + +++ FA+ + ++ AK ++ V +G++
Sbjct: 150 VAGARVLALCSSGCLRALCGVCAGGSKAALAVHFASAGDLGDLNAKDGSRETVIGLVGLL 209
Query: 290 LGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL-- 346
G L ++IG+ P A A+ + H+ N + +++ +RTL RAS+ + ++
Sbjct: 210 SGTILLSYIGT--PGATYAALCALLGAHLALNYLAVRAVAMRTLTRQRASIAWRVWMERG 267
Query: 347 SGQAPPVKEVNDEEPLF 363
+G AP +EV + E L
Sbjct: 268 AGAAPTPEEVAEREGLL 284
>gi|440794597|gb|ELR15757.1| hypothetical protein ACA1_379390, partial [Acanthamoeba castellanii
str. Neff]
Length = 449
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 104/226 (46%), Gaps = 33/226 (14%)
Query: 88 FTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQ 147
F + A +VA N L V+ + + LF LP G+P SV DY+++ W +
Sbjct: 48 FEESAAIVAREKNQHERKPLRVDLM---LKSLF----LPTGYPASVRQDYIHFQKWEVAK 100
Query: 148 GVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNP 206
G+ S +A + ++ G+G L ++++ FG+ FD
Sbjct: 101 GIVS-----------------------SAGFVGHLV--GLGCLGMMLVAGAFGKFFDTET 135
Query: 207 KGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAAR 266
K R AD + A LE+ TP FP +F+ + + A + + A L +T++ GFA R
Sbjct: 136 KRIRWAADCIHVAGVALELATPLFPAYFLPLASLANSAKGIAGLTTGATKAAINQGFALR 195
Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVV 312
N ++ AKG +QG+V+ +G+ +GI G + A + V+
Sbjct: 196 DNLGDITAKGHSQGIVAYLMGMGIGIGTTYVTGGVLGALFAVYGVL 241
>gi|194382298|dbj|BAG58904.1| unnamed protein product [Homo sapiens]
Length = 194
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 81/165 (49%)
Query: 199 GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSC 258
G D N K WRLFAD+L + A LE++ P +P F + + + ++ +TR+
Sbjct: 12 GSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAA 71
Query: 259 FYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMY 318
A R N A+V AK +Q + G+++ + + + F+L F +T +H+Y
Sbjct: 72 LTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIY 131
Query: 319 CNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
N ++ +++ + TLN R LV YL G+ N EPL+
Sbjct: 132 ANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLW 176
>gi|296826902|ref|XP_002851048.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
gi|238838602|gb|EEQ28264.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
Length = 479
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 103/434 (23%), Positives = 184/434 (42%), Gaps = 73/434 (16%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLW---------------------RSVQGVASQISGVLA 158
+ LP G+P SVT DYL Y ++ S+Q +S I+G+L+
Sbjct: 57 ILNVFLPSGYPHSVTDDYLEYQIYVSQQPFPSSDRKEKKLTVSYQDSLQAFSSSIAGLLS 116
Query: 159 TQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLE 217
++A+L +G+G A PTAA + VL++ +G ++ I LFA L
Sbjct: 117 SRAVLQGVGVGDATASPTAALLLNVLQESMGRIATI-----------------LFAHRLG 159
Query: 218 NAAFGLEMLTPAFPHHFVFIG--AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
+ L+P FP V +G + + R+ + S+++ A FA N E+ AK
Sbjct: 160 TSLEPEYCLSPVFPKP-VRVGFLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAK 218
Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
+Q V +G+++G + ++ S + +A ++ IH+ N + +++ + TLN
Sbjct: 219 DSSQETVISLVGMLVGSVVVAYVSSPLATWVALLFLLA-IHLATNHAAVRAVNMTTLNRQ 277
Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH 395
RA++VFS + +V+ EE +F + K+ VL + ++E
Sbjct: 278 RANIVFSSLFDDDRVLTPAQVSKEERIFERDGVLRWKAEATG---VLGTCRVGVSLEQLL 334
Query: 396 RLQLGSKLSDVVNNKEDAHA----------LFSLYEDEGYIL----TEHGGKFCVVLKES 441
L S+ +D + A L L+++E Y+L +H ++LK
Sbjct: 335 LLLPHSQTNDASKTSRRSEAGADFGVSLPVLVDLFKEEEYLLYFSPRDHAA--VIILKAG 392
Query: 442 ALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQRE--------FN 493
Q LK+ S+ + R +TS + GG+ E+S+ + R+ F+
Sbjct: 393 VTAQAQLKAW---SHALLVSRRLAGRSTSACINEKQGGQNELSVLSILRDTLREHSKAFD 449
Query: 494 HVKSDSASVGWVTD 507
GW TD
Sbjct: 450 DRVERLKQAGWETD 463
>gi|149392593|gb|ABR26099.1| unknown [Oryza sativa Indica Group]
Length = 282
Score = 85.5 bits (210), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 71/314 (22%), Positives = 144/314 (45%), Gaps = 37/314 (11%)
Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
D+L + LE+++P P F+ + + A + +TR Y+ FA N +++
Sbjct: 1 DVLYDFGTALEVISPLCPQLFLEVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLF 60
Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
AKGEA + +GI GI LA+ + S+ L + +++ +H+Y ++ ++ + TLN
Sbjct: 61 AKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLN 120
Query: 334 PYRASLVFSEYLLSGQAPPVKEVN-DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
P R +++ ++++ SG+ E+ E+ LFP + +E
Sbjct: 121 PQRTAMIVADFIKSGKVSSPAELRYREDLLFP-----------------------NRLIE 157
Query: 393 IEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLF 452
+++G + V+ + + L + + E ++L+ +VL++SA +D L+
Sbjct: 158 EAGSVKIGQPVRRVL-SPQRIEQLKATFSKEKFLLSRKDNSAYMVLEQSATGEDALRGWL 216
Query: 453 QASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIAR 512
A++ +ER+ G+ + T L ++ + ++ F ++ S GW TD +
Sbjct: 217 VAAFASEMERS-GVGSGDTV--------LNVAYERMENVFPMFVAEVKSRGWYTDQFLD- 266
Query: 513 PLPNRIRPGYVEPS 526
NR R Y PS
Sbjct: 267 --GNRSRIAYAIPS 278
>gi|406860777|gb|EKD13834.1| DUF647 domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 501
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 70/245 (28%), Positives = 123/245 (50%), Gaps = 33/245 (13%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSV------------QGVASQISGVLATQALLYA-IGLGKG 171
LP G+P+SVT DYL Y ++ +G+++ + + + A+L + IG+G
Sbjct: 58 LPAGYPNSVTEDYLEYQIYVRAANKPWTQSELISEGLSTSVLKLHSWDAVLESRIGVGDS 117
Query: 172 -AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 229
A PTAA + VL+ +G ++ I+ ++ G + K +RL AD+ +AA L+ L+PA
Sbjct: 118 HASPTAALLLSVLQQSMGRIATIVFAHRLGTALEPECKMYRLAADIFNDAAMILDCLSPA 177
Query: 230 FPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
P V + + + RS + S+++ A FA + N E+ AK +Q V +G+
Sbjct: 178 LPKASRVALLSLSSVLRSMCGVTAGSSKASLSAHFATQGNLGELNAKDSSQETVISLLGM 237
Query: 289 MLGIALANHIGSSMPFALASFSVVTWI--------HMYCNLKSYQSIELRTLNPYRASLV 340
+ G + ++I S +TWI H+ N + +++ ++TLN RA+LV
Sbjct: 238 LAGSVVVSYITS---------KTITWIALILLLATHLGTNYLAVRAVCMQTLNRQRANLV 288
Query: 341 FSEYL 345
S L
Sbjct: 289 LSNIL 293
>gi|449543750|gb|EMD34725.1| hypothetical protein CERSUDRAFT_116920 [Ceriporiopsis subvermispora
B]
Length = 507
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/381 (23%), Positives = 161/381 (42%), Gaps = 42/381 (11%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
R+LF LP G+P++V+ DYL+Y ++ +Q S I+ +LA++A+L G+G+ A T
Sbjct: 50 RQLFSGIFLPAGYPNTVSPDYLHYQIFNGLQAGCSSIASLLASRAVLEGHGVGEASASAT 109
Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-- 232
A + +++D + L I + G K +RL AD+ +AA L++++P F H
Sbjct: 110 NAMLVNIIQDVVSRLITIGSGYYLGTSLFPEAKKYRLLADVYNDAAVVLDVISPLFTHWS 169
Query: 233 ---HFVFIG------------AAAGAGRSAAALIQASTRSCFYAGFAAR----RNFAEVI 273
+ FI +G+ R+ ++ +++ FA + ++
Sbjct: 170 ISLVYPFIRREQGFYLRVLALCLSGSFRALCGMVAGGSKAALTMHFATAGPVPGDIGDLN 229
Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
AK ++ V IG++LG + H+ S A S++ H+ N + + + +RT N
Sbjct: 230 AKDGSKETVLALIGMLLGTFVIKHVDSGFA-TYAVLSMLILTHLAINFVAVRGVVMRTFN 288
Query: 334 PYRASLVFSEYLLSGQ----------APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
+RA+ ++ Y Q P + V + P ++ + L
Sbjct: 289 RHRAAAAWASYRRYEQGIAQYEGAILTPELMAVKERIFADPTLLMHPLRPEKRLGHCHLG 348
Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVN-NKEDAHALFSLYEDEGYIL-TEHGGK-----FCV 436
+ +E HR S+ S V N + + +E+E YIL H + V
Sbjct: 349 ASLSTIFLE-THRPCPPSRTSTVPNLTSAQLNVVLQTFENERYILWLRHDSRSESLQLVV 407
Query: 437 VLKESALPQDMLKSLFQASYL 457
K P D LK A L
Sbjct: 408 CFKTGHQPIDRLKGWIHAQEL 428
>gi|350630832|gb|EHA19204.1| hypothetical protein ASPNIDRAFT_186948 [Aspergillus niger ATCC
1015]
Length = 504
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/241 (27%), Positives = 114/241 (47%), Gaps = 42/241 (17%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
++ V+ LP G+P SVT DY A+L +G+G A PT
Sbjct: 69 QDFLVEVFLPAGYPHSVTDDY---------------------APAVLQGVGVGNADASPT 107
Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-- 232
AA + +L+D G ++ I+ ++ G + K +RL AD+ + A L+ L+P P
Sbjct: 108 AALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMTLDCLSPMIPAGA 167
Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
V + + AG R+ + S+++ A F+ N AEV AK +Q + IG+++G
Sbjct: 168 PRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVGS 227
Query: 293 ALANHIGSSMPFALASFSVVTWI--------HMYCNLKSYQSIELRTLNPYRASLVFSEY 344
+ + + +FS TWI H+ N + +S+++ +LN RA++VFS
Sbjct: 228 FVVSRV--------TNFS-TTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTL 278
Query: 345 L 345
L
Sbjct: 279 L 279
>gi|189196610|ref|XP_001934643.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980522|gb|EDU47148.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 333
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/223 (28%), Positives = 118/223 (52%), Gaps = 6/223 (2%)
Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
S+Q +S I+G+LA++A+L +G+G A PT A + V++ +G ++ I+ ++ G
Sbjct: 42 SLQAFSSSIAGMLASRAILEGVGVGDSTATPTTALLLSVIQTSVGQIATILFAHRLGTAL 101
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYA 261
+ K +RL AD+ + A L+ L+PAFP V + + + R+ + S+++ A
Sbjct: 102 EPECKMYRLAADVFNDTAMVLDCLSPAFPKSIRVAVLSFSSCLRALCGVCAGSSKASLSA 161
Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCN 320
FA + N EV AK +Q V +G++ G + + I S P A + ++ IH+ N
Sbjct: 162 HFAHKGNLGEVNAKDSSQETVISLLGMLAGSLVVSQITS--PLATWGTLILLLTIHLATN 219
Query: 321 LKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
+ +++ + LN RA+++FS G ++V+ E +F
Sbjct: 220 YAAVRAVNMHCLNRQRANILFSNMFQHGLVLSPRDVSQRERVF 262
>gi|356498113|ref|XP_003517898.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
Length = 171
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 69/120 (57%)
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
LK+G+ ++ K++ SN+G D PK WRL AD L + GLE+L+P PH F+ +
Sbjct: 1 LKNGMQHVGKLIGSNWGGTMDFEPKRWRLLADALYDIGTGLEVLSPWCPHLFLEMAGLGN 60
Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
+ A + +TR Y+ FA N +++ AKGEA + IGI +GI LA+ I +SM
Sbjct: 61 FAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASM 120
>gi|452981822|gb|EME81581.1| hypothetical protein MYCFIDRAFT_100197, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 272
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 118/222 (53%), Gaps = 5/222 (2%)
Query: 145 SVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLS-NFGRHF 202
S+Q S I+G+L+++A+L ++G+G A PT+A + +L+ + ++K+ + G+
Sbjct: 34 SIQAFVSSIAGLLSSRAVLLSVGVGDSSASPTSALLLSMLQQSLSRIAKVAFAYRLGKAI 93
Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV-FIGAAAGAGRSAAALIQASTRSCFYA 261
+ K +R ADLL +++F L L+P P F I A + + ++ +ST++ A
Sbjct: 94 EPECKMYRFCADLLHDSSFVLNCLSPMLPKAFRPPILALSSICFAVCDVVASSTKASLSA 153
Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
FA N E+ AK +Q + +G++ G L N I A +++ +H+ N
Sbjct: 154 HFAKWENLGELNAKDSSQETIISLLGMLTGSLLVNWISGHWATWTALILMLS-VHLEANR 212
Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
++ ++I ++TLN RA++V+ L G P EV+ E +F
Sbjct: 213 RAVRAIVMQTLNRQRATVVY-HRLCVGHIPRPAEVSRVERIF 253
>gi|240280131|gb|EER43635.1| DUF647 domain-containing protein [Ajellomyces capsulatus H143]
Length = 521
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 104/433 (24%), Positives = 179/433 (41%), Gaps = 78/433 (18%)
Query: 59 ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
+T+SE + D+ +R ++ KD S++ L+SLL+V
Sbjct: 26 STTSEPTQFRTPDSSGVRHERRRVDIVQPSRKD-------SSSYLASLLNV--------- 69
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLA----TQALLYAIGLGKG-AI 173
LP G+P SV+ DYL+ G AS GVL T + +G+G A
Sbjct: 70 -----FLPVGYPHSVSDDYLD------CLGFAS---GVLKFNCRTAGVPGWVGVGDASAS 115
Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
PT A ++ VL+D +G ++ I+ ++ G + K +RL AD+L ++A + L+PA P
Sbjct: 116 PTVALLHSVLQDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPK 175
Query: 233 HF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV------------------- 272
V + A + R+ + S+++ + FA N E+
Sbjct: 176 RLRVVVLAFSSVLRALCGVAAGSSKASLSSHFAKCGNLGELNAVRFPNVLVIAISGAAQR 235
Query: 273 ------------IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYC 319
+ K +Q V +GI+ G + +HI S PFA + ++ +H+
Sbjct: 236 GFPFLFSDADQKLQKDSSQETVISLLGILCGSVVVSHI--STPFATWTTLLLLLLVHLST 293
Query: 320 NLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQL 379
N + +S+ + TLN R ++VFS G E + E +F + K++ S
Sbjct: 294 NYAAVRSVNMTTLNRQRTNIVFSTLFEKGSVLTPTEASKCERIFERDGILRWKAS--SAT 351
Query: 380 LVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHGGKFCVV 437
L + + + + DV D + L L+E E Y+L K +V
Sbjct: 352 LGYCQIGGSFQELLRGSIHGANSIRDV---GIDIYKLLRLFEKEEYVLWFNPIHKKGTIV 408
Query: 438 LKESALPQDMLKS 450
LK + P L++
Sbjct: 409 LKNNVTPISQLQA 421
>gi|115389162|ref|XP_001212086.1| predicted protein [Aspergillus terreus NIH2624]
gi|114194482|gb|EAU36182.1| predicted protein [Aspergillus terreus NIH2624]
Length = 411
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/160 (35%), Positives = 88/160 (55%), Gaps = 13/160 (8%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPT 175
+ L ++ LP G+P SVT DY Y ++ S+Q +S I+G+L+++A+L +G+G A PT
Sbjct: 54 KTLLIEVFLPAGYPHSVTDDYTAYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNPNASPT 113
Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
+A + +L+D G L+ I+ ++ G K +RL ADL + A L+ L+PA P
Sbjct: 114 SALLLHILQDTSGRLATILFAHRVGTALQPECKSYRLAADLFNDLAMILDCLSPAAPA-- 171
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 274
GA A SAA +++A G A N+A V A
Sbjct: 172 ---GAPRVAVLSAAGVLRA------LCGVAGALNYAAVRA 202
>gi|194383144|dbj|BAG59128.1| unnamed protein product [Homo sapiens]
Length = 156
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 59/159 (37%), Positives = 79/159 (49%), Gaps = 27/159 (16%)
Query: 46 FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFT-----KDAFVVAS--A 98
C Q + AR ++ D + WEV G + L FT +DA V + A
Sbjct: 11 LCSEQFGSGEARGCRAAADGSLQ-----WEVGGWRWWGLSRAFTVKPEGRDAGEVGASGA 65
Query: 99 SNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLA 158
+ LS L +V LP+GFPDSV+ DYL Y LW SVQ AS +SG LA
Sbjct: 66 PSPPLSGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLA 111
Query: 159 TQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLS 196
TQA+L IG+G A +AA W++KD G L +I+ +
Sbjct: 112 TQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150
>gi|392558453|gb|EIW51641.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 450
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 105/465 (22%), Positives = 187/465 (40%), Gaps = 57/465 (12%)
Query: 94 VVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQI 153
+V S+ L S + + W R L LPEG+P +V+ + Y +W +VQ S +
Sbjct: 1 MVLHQSSPYLISQTNAGRHW---RRLLNDVFLPEGYPFTVSPQCVRYQIWNAVQAFCSSL 57
Query: 154 SGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRL 211
+G+ A++A+L G+G A T A +L+D L+ I+ + G K +RL
Sbjct: 58 AGLFASRAVLQGHGVGNASASATDAIFLTILQDVFSRLTTIVSGYYLGTSLYPEAKTYRL 117
Query: 212 FADLLENAAFGLEMLTPAFPH-----HFVFIGAA------------AGAGRSAAALIQAS 254
AD+L +AA + L+P H + F+ A +GA R+ +
Sbjct: 118 LADILNDAAMVTDTLSPHLAHFSFSLSYPFVVPASNSSLRVVALCMSGAFRALCGAVAGG 177
Query: 255 TRSCFYAGFAAR----RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFS 310
+++ FA + ++ AK ++ V +G++ G + ++ ++ + F
Sbjct: 178 SKAALTVHFATAGERPGDVGDLSAKDGSKETVLALLGMLSGSVVLRYVHTAQATYVVLFG 237
Query: 311 VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS----GQAPPVKEVNDEEPLFPAF 366
++ + H+ N + + I +R N RA + + E+ S P V +E +F
Sbjct: 238 LI-FCHLAANFIAVRVIAMRAFNRQRAGIAWCEFRRSFDVVTVVPDYLHVARQERIF--- 293
Query: 367 HFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYI 426
S + +S A+ + S + E L ++ E YI
Sbjct: 294 --------TDSARIPISQHARGTYAHCALGVSFLSLSGREPFSDEQIATLLEVFAHEKYI 345
Query: 427 L--TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSA----DCAPGGR 480
L + + VVLK+ P D LK+ A + A I + A +C R
Sbjct: 346 LWPASNPPQIAVVLKDGHGPHDHLKAWAHAHEI------ANICDRESHAHGFNECLVAIR 399
Query: 481 LEISLDYVQREFNHVKSDSASVGWVT-DGLIARPLPNRIRPGYVE 524
+L+ V R F + VGW T +G++ P + G E
Sbjct: 400 Y--ALENVSRMFPLFLRAAQEVGWKTEEGVVVGGSPITVTIGEAE 442
>gi|121709968|ref|XP_001272600.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
gi|119400750|gb|EAW11174.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
Length = 512
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 69/241 (28%), Positives = 119/241 (49%), Gaps = 22/241 (9%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLAT-QALLYAIGLGKG---AI 173
+L V LP G+P SVT DYL + LAT Q +G+G G A
Sbjct: 67 DLLVDVFLPSGYPHSVTDDYLPRVYFH------------LATPQPTNNPLGVGVGNADAT 114
Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
PT+A + +L+D G ++ I+ ++ G + K +RL AD+ + A L+ L+P P
Sbjct: 115 PTSALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMIPA 174
Query: 233 HF--VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
+ + + AG R+ + S+++ A FA N AEV AK +Q + +G+++
Sbjct: 175 GVSRITVLSTAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLVGMLV 234
Query: 291 GIALANHI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
G + +H+ G + + F + + M N + +++++ +LN RA++VFS L S
Sbjct: 235 GSLVVSHVTGFAATWLSLLFLLAAHLAM--NYAAVRAVQMTSLNRQRANIVFSTLLDSDP 292
Query: 350 A 350
A
Sbjct: 293 A 293
>gi|440799746|gb|ELR20790.1| protein C16orf58 family protein [Acanthamoeba castellanii str.
Neff]
Length = 101
Score = 79.0 bits (193), Expect = 6e-12, Method: Composition-based stats.
Identities = 35/71 (49%), Positives = 50/71 (70%), Gaps = 1/71 (1%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWV 182
+LPEG+P+SV+ DY Y W +VQ + S I+G+L T+ALL +G+G GA P AAA+ W
Sbjct: 21 LLPEGYPESVSGDYATYQTWDTVQALCSSITGLLCTRALLQGVGVGDVGATPGAAAVQWA 80
Query: 183 LKDGIGYLSKI 193
L+DG G + +I
Sbjct: 81 LRDGAGMVGRI 91
>gi|56752623|gb|AAW24525.1| SJCHGC08554 protein [Schistosoma japonicum]
Length = 127
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 63/113 (55%), Gaps = 5/113 (4%)
Query: 87 DFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSV 146
D F + N LS+L ++ W ++ F LP G+P SV+ DYL Y +W ++
Sbjct: 12 DEISRVFTLEKNQNRRLSTLFGLHLSWVTLKKCF----LPVGYPSSVSDDYLEYQIWDTI 67
Query: 147 QGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF 198
Q AS I+G LA+QA+L +G+G A A++ W+ KDG G + +I+ + +
Sbjct: 68 QAFASSITGALASQAVLIGVGVGNSSATILGASLTWMFKDGSGMIGRILFAGY 120
>gi|414586415|tpg|DAA36986.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 163
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 61/101 (60%), Gaps = 1/101 (0%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
D+ + F+ + P G+P SV YL Y+ +R +Q +S + VL+TQ+LL+A GL +
Sbjct: 62 DKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL-RPTP 120
Query: 174 PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFAD 214
A A++W+LKDG+ + K++ S+ G D PK WR+ +
Sbjct: 121 AQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILGE 161
>gi|345563243|gb|EGX46246.1| hypothetical protein AOL_s00110g70 [Arthrobotrys oligospora ATCC
24927]
Length = 445
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/256 (23%), Positives = 116/256 (45%), Gaps = 34/256 (13%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAA 178
+ LP G+P SVT DYL+ L A+G A TAA
Sbjct: 57 IINIFLPAGYPKSVTDDYLDGDL----------------------AVGFEDSNASTTAAV 94
Query: 179 INWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VF 236
+ +++D IG ++ I+ ++ G + K +R AD+ ++A ++ L+P FP V
Sbjct: 95 LLKIVQDSIGRVATILFAHRLGSSLEPECKRFRFAADIFNDSALIVDCLSPVFPKPLRVV 154
Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+ + AG R+ + +++ F+ N AE+ AK +Q V +G+++
Sbjct: 155 LLSTAGTLRALCGVAGGGSKAQLSVHFSKWGNVAELNAKDSSQETVISLMGMLMRADQTL 214
Query: 297 HIGS-SMPFALASFSVVTW--------IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
+ + S+ + + TW +H++ N ++ +++ + TLN RA+ V S + +
Sbjct: 215 RVKAGSLVISSVTSKTATWASMLFLIAVHLWTNYRAVKAVTMVTLNRQRANSVMSTFAET 274
Query: 348 GQAPPVKEVNDEEPLF 363
G K+V +E +F
Sbjct: 275 GVMLTPKDVARQERIF 290
>gi|169604674|ref|XP_001795758.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
gi|111066623|gb|EAT87743.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
Length = 473
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 97/381 (25%), Positives = 161/381 (42%), Gaps = 69/381 (18%)
Query: 117 RELFVQ----FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL--------- 163
R+ +VQ LP G+P S S+Q +S I+G+LA++A+L
Sbjct: 36 RKSYVQQVFDVFLPAGYPQS-----------DSLQAFSSSIAGMLASRAVLEGHLDAEDT 84
Query: 164 ----------YAIGLGK-----------------GAIPTAAAINWVLKDGIGYLSKIMLS 196
YA +G A PTAA + VL++ +G ++ I+ +
Sbjct: 85 LRLQNLVPSVYAASIGDQRHMSSHQSRCVGVGDSSASPTAALLLSVLQESMGRIATILFA 144
Query: 197 N-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQAS 254
+ G + K +RL AD+ + A L+ L+PAFP F V + + + RS + S
Sbjct: 145 HRLGTALEPECKMYRLAADVFNDTAMILDCLSPAFPKPFRVLVLSFSSVLRSLCGVCAGS 204
Query: 255 TRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN-HIGSSMPFALASFSVVT 313
++ A FA + N EV A M + G+ N HI + +A S
Sbjct: 205 AKASLSAHFARKGNLGEVNA--HITHMNPNVDFDIEGLEPGNGHIATWNAILIALLS--- 259
Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFK-IK 372
IH+ N + +++ +R LN RA++VF + G EV+ E +F + +
Sbjct: 260 -IHLATNYAAVKAVSMRCLNRQRANIVFGNLMHHGSVLTPTEVSQRERIFERDGVLRWVD 318
Query: 373 SANKSQLLV-LSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TE 429
+ + A++ +R +LSD+ N D L ++ D+ Y+L T+
Sbjct: 319 DCVIGHCTIGVPLGVLLASMSQRYRRTGSLELSDI--NLPD---LMDIFADQKYVLWFTD 373
Query: 430 HGGKFCVVLKESALPQDMLKS 450
+ VVLKE P D LK+
Sbjct: 374 SEYEAIVVLKEGCTPIDQLKA 394
>gi|302678497|ref|XP_003028931.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
gi|300102620|gb|EFI94028.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
Length = 278
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 119/257 (46%), Gaps = 12/257 (4%)
Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTA 176
++ + LP GFP+SV+ DYL Y ++ ++Q S ++G+L+++ALL G+G A T
Sbjct: 2 DVLAKVFLPAGFPNSVSPDYLRYQIYNAIQAFCSSLAGLLSSRALLEGFGVGDANATATN 61
Query: 177 AAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTP---AFPH 232
A + V +D L+ I+ + G K +R AD+ + A + TP A H
Sbjct: 62 AVLLTVAQDVFSRLTTIIAAYLLGPSLFPEAKTFRFLADVFNDTAIIFDASTPILRASGH 121
Query: 233 HFVFIGA--AAGAGRSAAALIQASTRSCFYAGFA----ARRNFAEVIAKGEAQGMVSKAI 286
+ + A AGA R+ I ++ FA + +V AK ++ + +
Sbjct: 122 ANLRVAALCLAGASRAICGTICGGAKAALTLHFATPLRGTGDIGDVNAKDASKETLLALM 181
Query: 287 GIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
G++ G + +H+ SS A + H+ N + + LRTLN RASL ++ +
Sbjct: 182 GMLAGSFVVHHL-SSTQATHALLLALLAAHLTVNYLGVRGVCLRTLNRQRASLAWTSFRE 240
Query: 347 SGQAPPVKEVNDEEPLF 363
G AP V E +
Sbjct: 241 YGSAPSPTIVAGMEGIL 257
>gi|242208267|ref|XP_002469985.1| predicted protein [Postia placenta Mad-698-R]
gi|242226469|ref|XP_002477622.1| predicted protein [Postia placenta Mad-698-R]
gi|220722555|gb|EED77179.1| predicted protein [Postia placenta Mad-698-R]
gi|220731047|gb|EED84896.1| predicted protein [Postia placenta Mad-698-R]
Length = 226
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 115/224 (51%), Gaps = 11/224 (4%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI--GLGK-GAIPTAAAINW 181
LP GFP +V+ Y ++ ++Q S ++G++A++A+L G+GK GA T A
Sbjct: 6 LPAGFPATVSPG--KYQIYNALQAFCSSLAGLIASRAVLQGFRHGVGKAGASATHAIFLT 63
Query: 182 VLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAA 240
VL+D L+ I+ + G K +RL AD+ +AA L+ L+P V
Sbjct: 64 VLQDIFSRLTTIVAGYYLGTSLFPEAKAYRLMADIFNDAAIILDTLSPHLGSFRVASLCL 123
Query: 241 AGAGRSAAALIQASTRSCFYAGFAARR----NFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
+GA R+ ++ +++ FA + ++ AK ++ V +G++ G A+ +
Sbjct: 124 SGAFRALCGVVAGGSKAALTVHFATTAKGAGDVGDLSAKDGSKETVLALLGMLSGTAVMH 183
Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
++ S+ + F+++ +H++CN + + + R+LN RAS+V
Sbjct: 184 YVHSTRATYVVLFALIA-LHLFCNYLAVRVVVFRSLNRQRASIV 226
>gi|62320358|dbj|BAD94740.1| putative protein [Arabidopsis thaliana]
Length = 295
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 62/259 (23%), Positives = 111/259 (42%), Gaps = 30/259 (11%)
Query: 211 LFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFA 270
++AD + +A ++ T +P F+ + + ++ A ++ + FA N
Sbjct: 1 MYADFIGSAGSFFDLATQLYPSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLG 60
Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIE 328
EV AK E + ++ IG+ GI + + G S PF L +++ + +H++ +S ++
Sbjct: 61 EVAAKEEVWEVAAQLIGLGFGILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQ 120
Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKD 388
T+N RA ++ +++ P + N E + F K +
Sbjct: 121 FNTVNLKRARIIVESHVVHSVVPGYVDCNKRENILLWQRFMKPRII-------------- 166
Query: 389 AAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT----EHGGKFCVVLKESALP 444
V +E L +S V AL +Y E YILT +F V K +A
Sbjct: 167 FGVSLEELSGLEKSVSKV-------KALLKMYTKEKYILTLNKLNKDTEFSVSFKVNATS 219
Query: 445 QDMLKSLFQASYLYWLERN 463
+D+L+ L+QA YWLE N
Sbjct: 220 RDVLRCLWQA---YWLEEN 235
>gi|302417830|ref|XP_003006746.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261354348|gb|EEY16776.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 228
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 63/109 (57%), Gaps = 4/109 (3%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG--KGAIPTAAAINWV 182
LP +PDSV+ DY+ Y ++ S Q S I+G+LA +A+L +G+G G++ A +
Sbjct: 82 LPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHALLLT-C 140
Query: 183 LKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
LKDG+ L+ I + FG + N K +R ADL + AF LE+ P F
Sbjct: 141 LKDGVSRLAMITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELANPFF 189
>gi|388520739|gb|AFK48431.1| unknown [Medicago truncatula]
Length = 236
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 3/106 (2%)
Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-- 171
DE +++PEGFPDSVT Y+ Y WR+++ GV TQ LL ++G+ +
Sbjct: 106 DEFISFVRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLTSVGVSRNRA 165
Query: 172 -AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLL 216
A AINW+L DG G + K++ + G+ FD + K R DLL
Sbjct: 166 TPAAAAVAINWILNDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLL 211
>gi|303288586|ref|XP_003063581.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454649|gb|EEH51954.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 290
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 64/251 (25%), Positives = 108/251 (43%), Gaps = 55/251 (21%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--GAIPTAAAIN 180
LP G+PD+V+SDY + W + VL +Q+LL A+G+G GA+P AAA
Sbjct: 1 MFLPAGYPDTVSSDYARWLRWHLFSNFFRNVLEVLTSQSLLVALGMGSTPGALPLAAATK 60
Query: 181 WVLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
WVLKDG+G + ++ G + +D +PK F+ A
Sbjct: 61 WVLKDGVGSFATLLSGVVGAQRYDEDPK------------------------RCATFVRA 96
Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM------VSKAIGIMLGIA 293
A GR+ +L+ + F F +N ++ AK E QG + I I G++
Sbjct: 97 GALTGRT--SLVNGT----FMQHFGRNQNAGDIRAKLEVQGKWLALCALPTGIFIFRGVS 150
Query: 294 ---------------LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRAS 338
L +H+ S+P ++++ V H + + K+ + + LN +R
Sbjct: 151 GAFEPLPPETLETMTLLDHLTRSLP-VISAYGAVVGGHSFLSWKAASVLRVDVLNRFRLL 209
Query: 339 LVFSEYLLSGQ 349
E++ G+
Sbjct: 210 KAAREFVDGGE 220
>gi|151555803|gb|AAI49327.1| FLJ13868 protein [Bos taurus]
Length = 158
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/74 (51%), Positives = 49/74 (66%), Gaps = 1/74 (1%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
LP+GFPDSV+ DYL Y LW SVQ AS +SG LAT A+L IG+G A +AA W+
Sbjct: 81 FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 140
Query: 183 LKDGIGYLSKIMLS 196
+KD G L +I+ +
Sbjct: 141 VKDSTGMLGRIVFA 154
>gi|342186523|emb|CCC96010.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 596
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 110/248 (44%), Gaps = 25/248 (10%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
F +P G+PDS + Y L S + +A Q++L G + ++ W+
Sbjct: 140 FGMPTGYPDSCAEGFRRYFLLSLCSTFVSSFASSIAYQSIL------NGFLLVSSPQLWM 193
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAA 241
LKD L+ L+N ++ PK W + + +L N + LEM+ P+ PH+ +
Sbjct: 194 LKDLAPALAAAYLANRVVSYENRPKFWFVVSVVLHNTSIVLEMIIPSVVPHNLLTAAVLT 253
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS- 300
R +A+L+ TR+ FA N AE+ K + G+V + LGIA ++ S
Sbjct: 254 SCVRQSASLMFLVTRASALQHFAVSGNLAELTKKFNSFGIVIYTVSTALGIAYTYYVASL 313
Query: 301 -----SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP--- 352
++ F + V++++ M CN I R LN S++ Y+ G P
Sbjct: 314 TAQLVTVLFCCGANLVLSYMSM-CN------IAFRVLNGTTISVILRAYVKEGCLGPGHV 366
Query: 353 --VKEVND 358
+EV+D
Sbjct: 367 LSPREVSD 374
>gi|413942077|gb|AFW74726.1| hypothetical protein ZEAMMB73_469739 [Zea mays]
Length = 425
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 60/266 (22%), Positives = 99/266 (37%), Gaps = 76/266 (28%)
Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL-------- 163
+ D+ + F+ P G+P SV YL Y+ +R +Q +S + VL+TQ +
Sbjct: 67 IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQLVFQNRLLSVL 126
Query: 164 ---------YA----------------------------IGLGKGAIPTAAAINWVLKDG 186
YA LGK + +LKDG
Sbjct: 127 SSLTSAHFNYARQLTAQVGCGSVLAVCCRFATYPDASNCCKLGKHMMTEMLKSKQILKDG 186
Query: 187 IGYLSKIMLSNFGRHFDVNPKGWRLF-------------------------------ADL 215
+ + K++ S+ G D PK WR+ AD+
Sbjct: 187 MQHTRKLICSSMGARMDSEPKSWRILGLTAYVLYGCHCLGSHFTIFPIICLYCSILTADV 246
Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
L + LE+++P P F+ + + + +TR Y+ FA N +++ AK
Sbjct: 247 LYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMVVVAARATRLPIYSSFAKEGNLSDLFAK 306
Query: 276 GEAQGMVSKAIGIMLGIALANHIGSS 301
GEA + +GI GI LA+ + S+
Sbjct: 307 GEAISTLFNVMGIGAGIGLASTVCST 332
>gi|414876406|tpg|DAA53537.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
Length = 295
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 73/309 (23%), Positives = 134/309 (43%), Gaps = 38/309 (12%)
Query: 211 LFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFA 270
++AD + +A E+ T +P +F+ + + ++ A + + FA N
Sbjct: 1 MYADFIGSAGSIFELSTTLYPGYFLPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLG 60
Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIE 328
EV AK E + ++ +G+ +G+ + + G SS + ++ V +H++ +S ++
Sbjct: 61 EVAAKEEVWEVGAQLLGLSIGVLIMDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLK 120
Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKD 388
RT+N R ++ ++ P N+EE + + + + +
Sbjct: 121 FRTINLKRGRILVKSHVAQHTVPGYVACNEEENILTWERYLRPQVS-------------- 166
Query: 389 AAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHGGK---FCVVLKESAL 443
V +E R+ G K SD+VN L LY++E YIL + G + F V KESA
Sbjct: 167 FGVPME-RMLGGDKSSDMVNR------LLKLYKNEKYILFFEQFGSREPTFLVTFKESAT 219
Query: 444 PQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVG 503
+L+SL+QA +WL +N + P LE S+ ++ F S G
Sbjct: 220 SMSVLRSLWQA---HWLHKNQ-----LRQDEVFPW--LEESIVALENGFTDFLEQLGSAG 269
Query: 504 WVTDGLIAR 512
W D +I +
Sbjct: 270 WDQDLIILK 278
>gi|303281718|ref|XP_003060151.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458806|gb|EEH56103.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 814
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 60/115 (52%), Gaps = 2/115 (1%)
Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIP 174
C + LPEG+P SV+SDYL + L + QGV S G+L + ALL G+G A
Sbjct: 229 CLRALRRAFLPEGYPRSVSSDYLAFQLCDAAQGVCSYARGILCSTALLRGAGVGSSTATA 288
Query: 175 TAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
+A +V +D L + + GR D + K WRLFAD + + +E+ +P
Sbjct: 289 ASATAQFVARDLTSMLGGVAFAATRGRAMDADAKRWRLFADAMNDLGMAIELASP 343
>gi|414586416|tpg|DAA36987.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
Length = 155
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 68/130 (52%)
Query: 189 YLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAA 248
+ K++ S+ G D PK WR+ AD+L + LE+++P P F+ + + A
Sbjct: 3 HAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMA 62
Query: 249 ALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALAS 308
+ +TR Y+ FA N +++ AKGEA + +GI GI LA+ + S+ L +
Sbjct: 63 VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIA 122
Query: 309 FSVVTWIHMY 318
+++ +H++
Sbjct: 123 GPLLSAVHIW 132
>gi|301123209|ref|XP_002909331.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100093|gb|EEY58145.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 122
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 64/125 (51%), Gaps = 19/125 (15%)
Query: 160 QALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENA 219
Q+LLY GL GAIPTAAA+N VL+DG+G WR+ + + +
Sbjct: 2 QSLLYDAGLVAGAIPTAAAVNRVLRDGLGQ-------------------WRITSSVALDV 42
Query: 220 AFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
A E+LTP P F+ I + ++ A L ++TR+ F+ F R N A+V AK Q
Sbjct: 43 AVLGEILTPLAPGSFLVIASLTNVAKNVAWLSASATRAGFHNSFVIRENLADVTAKTGPQ 102
Query: 280 GMVSK 284
G+ +
Sbjct: 103 GIAQQ 107
>gi|325088852|gb|EGC42162.1| DUF647 domain-containing protein [Ajellomyces capsulatus H88]
Length = 507
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/322 (23%), Positives = 135/322 (41%), Gaps = 44/322 (13%)
Query: 166 IGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGL 223
+G+G A PT A ++ VL+D +G ++ I+ ++ G + K +RL AD+L ++A
Sbjct: 31 VGVGDASASPTVALLHSVLQDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIF 90
Query: 224 EMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV---------- 272
+ L+PA P V + A + R+ + S+++ + FA N E+
Sbjct: 91 DCLSPALPKRLRVVVLAFSSVLRALCGVAAGSSKASLSSHFAKCGNLGELNAVRFPNVLV 150
Query: 273 ---------------------IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFS 310
+ K +Q V +GI+ G + +HI S PFA +
Sbjct: 151 IAISGAAQRGFPFLFSDADQKLQKDSSQETVISLLGILCGSVVVSHI--STPFATWTTLL 208
Query: 311 VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFK 370
++ +H+ N + +S+ + TLN R ++VFS G E + E +F +
Sbjct: 209 LLLLVHLSTNYAAVRSVNMTTLNRQRTNIVFSTLFEKGSVLTPTEASKCERIFERDGILR 268
Query: 371 IKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--T 428
K++ S L + + + + DV D + L L+E E Y+L
Sbjct: 269 WKAS--SATLGYCQIGGSFQELLRGSIHGANSIRDV---GIDIYKLLRLFEKEEYVLWFN 323
Query: 429 EHGGKFCVVLKESALPQDMLKS 450
K +VLK + P L++
Sbjct: 324 PIHKKGTIVLKNNVTPISQLQA 345
>gi|255945167|ref|XP_002563351.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588086|emb|CAP86157.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 504
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 130/294 (44%), Gaps = 39/294 (13%)
Query: 125 LPEGFPDSVTSDYLNYSLW-----RSVQGVA-----SQISGVLATQALLYAIGLGKG-AI 173
LP G+P SV+ DYL L RS+ A +Q + L T +G+G A
Sbjct: 70 LPSGYPQSVSDDYLPPPLLPSGSSRSISHPAQLHRPTQRAKQL-TPTQHTGVGVGNANAS 128
Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
PT+A + +L+D G ++ I ++ G + K +RL AD+ + A L+ L+P P
Sbjct: 129 PTSALLLHILQDISGRIATICFAHRIGTALEPECKTYRLAADVFNDIAMILDCLSPGVPA 188
Query: 233 --HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
V + + AG R+ + S+++ A FA N AE+ AK +Q V G+++
Sbjct: 189 GPARVAVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQETVISLFGMLV 248
Query: 291 GIALANHIGSSMPFALASFSVVTWIHMY--------CNLKSYQSIELRTLNPYRASLVFS 342
G + +HI SFS TW+ + N + +++++ +LN RA++VFS
Sbjct: 249 GSVVISHI--------TSFST-TWLILLILLALHLSMNYAAVRAVQMTSLNRQRANIVFS 299
Query: 343 EYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHR 396
S + N L P S KS+ VL+ I HR
Sbjct: 300 TLFDSDR-------NLSLDLDPELDTTPSLSKTKSEWTVLTPAQVSKHERIFHR 346
>gi|224010239|ref|XP_002294077.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970094|gb|EED88432.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 354
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/170 (31%), Positives = 85/170 (50%), Gaps = 20/170 (11%)
Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGI 187
+P +V YL Y + S+QG++S + GV++T A+L A G+G A +AA+ W ++DG+
Sbjct: 19 YPHTVAPGYLEYQFYDSLQGLSSYLRGVVSTTAVLSAAGVGDAAATAMSAAMTWAIRDGL 78
Query: 188 GYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
G + ++ S HFD + K +RL AD+L + L+M A P + A
Sbjct: 79 GMVGGLLFSYVASSHFDAHVKEFRLVADVLNDVGLTLDM---ALPVVLSWTRA------- 128
Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
T+ F+ N A+V AK Q + IG+ LG+ L+N
Sbjct: 129 --------TKGNITDHFSISGNRADVSAKESTQETLVSLIGMALGVWLSN 170
>gi|384494915|gb|EIE85406.1| hypothetical protein RO3G_10116 [Rhizopus delemar RA 99-880]
Length = 188
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 54/97 (55%), Gaps = 2/97 (2%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPT 175
++ + LP G+P+SV S Y + W ++ GVL +QA+L ++GLG A
Sbjct: 51 KQNLTEMFLPVGYPESVHSCYKKFHSWLFLETYVGSAVGVLCSQAMLASLGLGTVEATGG 110
Query: 176 AAAINWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRL 211
A AI WVLKDGIG + K+ + + FD +PK W+
Sbjct: 111 AVAIQWVLKDGIGEIGKLFFIKKYASTFDSHPKTWKF 147
>gi|146096657|ref|XP_001467882.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398020910|ref|XP_003863618.1| hypothetical protein, conserved [Leishmania donovani]
gi|134072248|emb|CAM70952.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322501851|emb|CBZ36933.1| hypothetical protein, conserved [Leishmania donovani]
Length = 468
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 54/228 (23%), Positives = 97/228 (42%), Gaps = 8/228 (3%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
F +P+GFPD+ + + + S + + Q+LL LG ++ W+
Sbjct: 33 FGMPKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG------SSPQLWM 86
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
LKD + L ++N ++ PK W + + + N +ML P+ P+H +
Sbjct: 87 LKDLVPALFAAYMANKVVSYESRPKFWFVVSVFINNLTVISDMLIPSVLPNHLLAAAIIT 146
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ ++AL+ TR+ +A N AE+ K + GMVS + LGI +I S
Sbjct: 147 STVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATALGILYCTYIA-S 205
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
L + + +M+ + S I R LN L+ ++L +
Sbjct: 206 FTVQLITVAACCVTNMFLSSMSMMPITFRLLNFDTMKLLLRAFILEKR 253
>gi|157874033|ref|XP_001685512.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128584|emb|CAJ08716.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 468
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 97/225 (43%), Gaps = 8/225 (3%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
F +P+GFPD+ + + + S + + Q+LL LG ++ W+
Sbjct: 33 FGMPKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG------SSPQLWM 86
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
LKD + L ++N ++ PK W + + + N +ML P+ P+H +
Sbjct: 87 LKDLVPALFAAYMANKVVSYESRPKFWFVVSVFVNNLTVISDMLIPSVLPNHLLAAAIIT 146
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ ++AL+ TR+ +A N AE+ K + GMVS + LGI +I +S
Sbjct: 147 STVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATALGILYCTYI-AS 205
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
L + + +M+ + S I R LN L+ ++L
Sbjct: 206 FTVQLITVAACCVTNMFLSSMSMMPITFRLLNVDTMKLLLGAFIL 250
>gi|340059715|emb|CCC54109.1| conserved hypothetical protein, fragment, partial [Trypanosoma
vivax Y486]
Length = 618
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 102/233 (43%), Gaps = 8/233 (3%)
Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLK 184
+P G+P S + Y L S + + Q++L G ++ W+LK
Sbjct: 182 MPSGYPHSCGVGFRRYFLLSLCSSSVSSFASSIGFQSIL------NGFFLASSPQLWMLK 235
Query: 185 DGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAAGA 243
D + L+ L+N ++ PK W + + + N + EM+ P+ P H +F
Sbjct: 236 DLVPALAAAYLANRIVSYENRPKFWFVVSVGMHNLSVIAEMIVPSLVPQHLLFAAVTTSC 295
Query: 244 GRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMP 303
R +A+L+ TR+ FA N AE+ K + G+V + LGIA + + +S+
Sbjct: 296 IRQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGIVIYTVATALGIAFTS-VVTSLT 354
Query: 304 FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
L + + +++ + +S I R LN S++ EY+ +G + V
Sbjct: 355 AQLITVLMCCAVNLIISHQSMCKIAFRMLNGTTISVILHEYIQNGSTCSCRRV 407
>gi|242208074|ref|XP_002469889.1| predicted protein [Postia placenta Mad-698-R]
gi|220731120|gb|EED84968.1| predicted protein [Postia placenta Mad-698-R]
Length = 214
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 112/215 (52%), Gaps = 9/215 (4%)
Query: 138 LNYSLWRSVQGVASQISGVLATQALL--YAIGLGK-GAIPTAAAINWVLKDGIGYLSKIM 194
+ Y ++ ++Q S ++G++A++A+L + G+GK GA T A VL+D L+ I+
Sbjct: 1 VRYQIYNALQAFCSSLAGLIASRAVLQGFRHGVGKAGASATHAIFLTVLQDIFSRLTTIV 60
Query: 195 LSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQA 253
+ G K +RL AD+ +AA L+ L+P V +GA R+ ++
Sbjct: 61 AGYYLGTSLFPEAKAYRLTADIFNDAAIILDTLSPHLGSFRVASLCLSGAFRALCGVVAG 120
Query: 254 STRSCFYAGFAARR----NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASF 309
+++ FA + ++ AK ++ V +G++ G A+ +++ S+ + F
Sbjct: 121 GSKAALTVHFATTAKGAGDVGDLSAKDGSKETVLALLGMLSGTAVMHYVHSTRATYVVLF 180
Query: 310 SVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEY 344
+++ +H++CN + + + R+LN RAS+V++ Y
Sbjct: 181 ALIA-LHLFCNYLAVRVVVFRSLNRQRASIVWAAY 214
>gi|299740308|ref|XP_001838896.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
gi|298404184|gb|EAU82827.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
Length = 768
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/236 (23%), Positives = 110/236 (46%), Gaps = 16/236 (6%)
Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
+ KL D + +F LP G+P+SVT DYL Y + ++Q S ++ +++++A L G+
Sbjct: 320 LQKLRDFGKAVF----LPAGYPNSVTPDYLTYQVLNALQAFCSSLASLISSRATLQGFGV 375
Query: 169 GKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEML 226
G A A + +L D L+ I+ ++ G K +R AD+L + A L+ L
Sbjct: 376 GSASASANDALLLTILLDIYSRLTTIVSAHLLGSSLATEAKKYRFVADILNDVAVVLDTL 435
Query: 227 TPAF-----PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAA----RRNFAEVIAKGE 277
+P F P V + + R+ + +++ FA + + ++ AK
Sbjct: 436 SPVFKAQGYPQLRVVALCLSASCRALCGIAAGGSKAAISVHFATPVKGKGDLGDLNAKDS 495
Query: 278 AQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
++ V G++LG + + ++ F +++ +H+ N + + LRT+N
Sbjct: 496 SKETVLALFGMLLGSLIVPRLQTAWSTYTTLFILLS-LHIAINYVAVTGLVLRTMN 550
>gi|401427109|ref|XP_003878038.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494285|emb|CBZ29584.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 468
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/228 (23%), Positives = 98/228 (42%), Gaps = 8/228 (3%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
F +P+GFP++ + + + S + + Q+LL LG ++ W+
Sbjct: 33 FGMPKGFPETTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG------SSPQLWM 86
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
LKD + L ++N ++ PK W + + + N +ML P+ P+H +
Sbjct: 87 LKDLVPALFAAYMANKVVSYENRPKFWFVVSVFMSNLMVISDMLIPSVLPNHLLAAAIIT 146
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ ++AL+ TR+ +A N AE+ K + GMVS + LGI +I S
Sbjct: 147 STVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATALGILYCTYIA-S 205
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
L + + ++M+ + S I R LN L+ ++L +
Sbjct: 206 FTVQLITVAACCVMNMFLSSMSMMPITFRLLNFDTMKLLLRAFILEKR 253
>gi|429329557|gb|AFZ81316.1| hypothetical protein BEWA_007250 [Babesia equi]
Length = 743
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/242 (23%), Positives = 97/242 (40%), Gaps = 31/242 (12%)
Query: 130 PDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI---------- 179
P T Y Y+ WR + + + +A+ L + + I
Sbjct: 176 PSVATKQYYKYAHWRMFERFSFSVLQTMASNIKSAPTNLQQNKVENTNIILMLKNKISES 235
Query: 180 -------NWVLKDGIGYLSKIM-LSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF- 230
+ KD + + S G FD NPK +RL +L + A ++ + F
Sbjct: 236 KMKSILATMIFKDAFSRIFHFLWFSEIGVGFDNNPKAFRLLGSILCSTATLIDFVCNVFT 295
Query: 231 --PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARR---NFAEVIAKGEAQGMVSKA 285
P + +GA A R L ++++ FY F + N E+ AK EAQ +
Sbjct: 296 FGPK--MIMGACTNAVRQIGLLTMSASQGPFYNSFHVKNTATNIGEITAKLEAQNPICDF 353
Query: 286 IGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSY---QSIELRTLNPYRASLVFS 342
GI LGI L N + PF + + +V+ + N+ +Y +S+ + LNP R +V
Sbjct: 354 SGIALGIYLTNLLNEQ-PFVIQATTVI-LTSLISNVATYMCVKSVSFKNLNPQRCFVVLE 411
Query: 343 EY 344
++
Sbjct: 412 DF 413
>gi|147803314|emb|CAN66551.1| hypothetical protein VITISV_039658 [Vitis vinifera]
Length = 470
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/358 (22%), Positives = 142/358 (39%), Gaps = 75/358 (20%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--------GAIP 174
F+LP GFP + Y+ L Q + A+G+G A
Sbjct: 116 FILPAGFPVFLFYAYME-----------------LKNQKFV-AVGVGSFSATTAAASAAA 157
Query: 175 TAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
+AAAI WV KDGIG + ++ + G KG++ DL T +P +F
Sbjct: 158 SAAAIRWVSKDGIGAVGRLFI---GLSVIKRAKGYKSIFDL----------STQLYPAYF 204
Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
+ + + ++ A ++ + FA N EV AK E + ++ +G+ LGI +
Sbjct: 205 LQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMV 264
Query: 295 ANH--IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
+ +S P ++ + +H++ +S + +++N RA ++ ++L P
Sbjct: 265 LTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPG 324
Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE- 411
+ N +E + F K R+ G L +++ +
Sbjct: 325 YVDCNRKEKILSWQRFLK------------------------PRIIFGVSLEELIGGERP 360
Query: 412 --DAHALFSLYEDEGYILT--EHGGKFCVVL--KESALPQDMLKSLFQASYLYWLERN 463
L LY E YIL + G F V + K A +L+S++Q+ YWL+ N
Sbjct: 361 VSKVRTLLKLYSQEKYILVVKQQGTDFEVFVSFKVGATSLSVLRSIWQS---YWLQEN 415
>gi|71414333|ref|XP_809272.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70873629|gb|EAN87421.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 630
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/249 (25%), Positives = 109/249 (43%), Gaps = 14/249 (5%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTA 176
+ LF+ F +P G+P+S + Y L S + + Q++L LG +
Sbjct: 156 QRLFL-FGMPRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLG------S 208
Query: 177 AAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFV 235
+ W++KD + L+ L+N ++ PK W + L+N + EML P+ P H +
Sbjct: 209 SPQLWMMKDLLPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLL 268
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
R +A+LI +R+ FA N AE+ K + GMV I GI +
Sbjct: 269 LGAIMTSCARQSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-MY 327
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG-----QA 350
+ +S+ LA+ +++ + +S I R LN S+V +Y+ G
Sbjct: 328 TSVVTSLVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGIHRRHI 387
Query: 351 PPVKEVNDE 359
P +EV+D
Sbjct: 388 PTPQEVSDR 396
>gi|71661222|ref|XP_817635.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70882839|gb|EAN95784.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 630
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 64/249 (25%), Positives = 109/249 (43%), Gaps = 14/249 (5%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTA 176
+ LF+ F +P G+P+S + Y L S + + Q++L LG +
Sbjct: 156 QRLFL-FGMPRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLG------S 208
Query: 177 AAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFV 235
+ W++KD + L+ L+N ++ PK W + L+N + EML P+ P H +
Sbjct: 209 SPQLWMMKDLLPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLL 268
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
R +A+LI +R+ FA N AE+ K + GMV I GI +
Sbjct: 269 LGAIMTSCARQSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-MY 327
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG-----QA 350
+ +S+ LA+ +++ + +S I R LN S+V +Y+ G
Sbjct: 328 TSVVTSLVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGIHRRHI 387
Query: 351 PPVKEVNDE 359
P +EV+D
Sbjct: 388 PTPQEVSDR 396
>gi|83767160|dbj|BAE57299.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 165
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 81/144 (56%), Gaps = 4/144 (2%)
Query: 133 VTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLS 191
+TS + +L S+Q +S I+G+L+++A+L +G+G A PTAA + +L+D G +S
Sbjct: 13 ITSVTNSLTLKDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHILQDTSGRIS 72
Query: 192 KIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF--VFIGAAAGAGRSAA 248
I+ ++ G + K +RL AD+ +AA L+ L+P P F V + + AG R+
Sbjct: 73 TILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLSTAGVLRALC 132
Query: 249 ALIQASTRSCFYAGFAARRNFAEV 272
+ S+++ A F+ N AEV
Sbjct: 133 GVAGGSSKASLSAHFSRWGNLAEV 156
>gi|390341857|ref|XP_003725545.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1
[Strongylocentrotus purpuratus]
Length = 311
Score = 65.9 bits (159), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 55/249 (22%), Positives = 102/249 (40%), Gaps = 31/249 (12%)
Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
+ +AA +E+++ FP +FV I + +S + +TR+ A R N A+V AK
Sbjct: 1 MNDAAMCIELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAK 60
Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
+Q + + +G+ + ++ + F + T +H+Y N ++ + + TLN
Sbjct: 61 DGSQETLVNLAALFVGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQN 120
Query: 336 RASLVFSEYLLSGQ---APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
R ++ +YL S Q + P VN EP+ + LS K
Sbjct: 121 RFHILVQDYLHSSQFSMSGP-DAVNSREPV----------------IWRLSRPLK----- 158
Query: 393 IEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFC--VVLKESALPQDMLKS 450
L LG V + D E ++L+ K C +V+ + +D +K+
Sbjct: 159 ----LNLGVPFRTVAKSMRDIEMCLQSNGQEKFLLSLDVKKGCIDIVIHHDSTSEDFIKA 214
Query: 451 LFQASYLYW 459
QA + +
Sbjct: 215 CLQAEIIMY 223
>gi|407850424|gb|EKG04824.1| hypothetical protein TCSYLVIO_004113 [Trypanosoma cruzi]
Length = 631
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 109/249 (43%), Gaps = 14/249 (5%)
Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTA 176
+ LF+ F +P G+P+S + Y L S + + Q++L LG +
Sbjct: 156 QRLFL-FGMPRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLG------S 208
Query: 177 AAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFV 235
+ W++KD + L+ L+N ++ PK W + L+N + EML P+ P H +
Sbjct: 209 SPQLWMMKDLLPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLL 268
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
R +A+LI +R+ FA N AE+ K + GM+ I GI +
Sbjct: 269 LGAIMTSCARQSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMMIYTIFTAFGI-MY 327
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG-----QA 350
+ +S+ LA+ +++ + +S I R LN S+V +Y+ G
Sbjct: 328 TSVVTSLVAQLATVLFCCALNLVLSYQSMSKIAFRILNATTLSVVLRDYVGRGGIHRRHI 387
Query: 351 PPVKEVNDE 359
P +EV+D
Sbjct: 388 PTPQEVSDR 396
>gi|154343241|ref|XP_001567566.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134064898|emb|CAM43006.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 439
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 97/230 (42%), Gaps = 12/230 (5%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
F +P+GFPD+ + + + S + + Q+LL LG ++ W+
Sbjct: 4 FGMPKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG------SSPQLWM 57
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAA 241
LKD L ++N ++ PK W + + L+ N +ML P+ P+H +
Sbjct: 58 LKDLAPALLAAYMANQVVSYENRPKFWFVVSVLMNNIMVMSDMLIPSVVPNHLLAAAIVT 117
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
+ ++AL+ R+ +A N AE+ K + GMVS I LGI +I S
Sbjct: 118 STVKQSSALMYFVARASTLQHYAINNNLAELTKKFNSFGMVSYTIATALGILYCTYIAS- 176
Query: 302 MPFALASFSVVTWI--HMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
F + F+V +M + + I R LN L+ +++ +
Sbjct: 177 --FTMQLFTVAVCCAANMLLSSRMMMPISFRLLNFDTMKLLLRAFIMEKR 224
>gi|326469821|gb|EGD93830.1| hypothetical protein TESG_01362 [Trichophyton tonsurans CBS 112818]
Length = 413
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 136/297 (45%), Gaps = 19/297 (6%)
Query: 166 IGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGL 223
+G+G A PTAA + VL++ +G ++ I+ ++ G + K +RL AD+L ++A L
Sbjct: 40 VGVGDATASPTAALLLNVLQESMGRIATILFAHRLGTSLEPECKMYRLAADILNDSAMVL 99
Query: 224 EMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMV 282
+ L+P FP V + + + R+ + S+++ A FA N E+ A G
Sbjct: 100 DCLSPIFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVG-LNIPP 158
Query: 283 SKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFS 342
S+ G++L ++I S + +A V+ +H+ N + +++++ TLN RA++VFS
Sbjct: 159 SQLTGVVL-----SYITSPLETWIA-LIVLLIVHLGTNHAAVRAVKMTTLNRQRANIVFS 212
Query: 343 EYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSAN-----KSQLLVLSSEAKDAAVEIEHR- 396
E + EE +F + K+ + Q+ + E R
Sbjct: 213 HLFEDNLILTPAETSKEERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRG 272
Query: 397 -LQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLKS 450
L+ S ++ + AL L+ +E YIL + +VLK A Q LK+
Sbjct: 273 ALETSRTTSTTMDASTNLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKA 329
>gi|74026006|ref|XP_829569.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70834955|gb|EAN80457.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 595
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 62/242 (25%), Positives = 105/242 (43%), Gaps = 13/242 (5%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
F +P G+PDS + Y L S + + Q++L G + ++ W+
Sbjct: 164 FGMPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSIL------NGFLLASSPQLWM 217
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
LKD L+ L+N ++ PK W +F+ L N + EM+ P P+H +
Sbjct: 218 LKDLAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVPNHLLAAAILT 277
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
R +A+L+ TR+ FA N AE+ K + GMV + LGIA + + S
Sbjct: 278 SCVRQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALGIAYTSLV-PS 336
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS-----GQAPPVKEV 356
+ LA+ + ++ + S +I R LN S++ Y+ G+ +EV
Sbjct: 337 VTAQLATVLMCCGANLAISHLSMCNIAFRILNETTLSVILRFYMREEGHQRGRVLSPREV 396
Query: 357 ND 358
+D
Sbjct: 397 SD 398
>gi|407411065|gb|EKF33281.1| hypothetical protein MOQ_002854, partial [Trypanosoma cruzi
marinkellei]
Length = 636
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/227 (25%), Positives = 99/227 (43%), Gaps = 8/227 (3%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
F +P G+P+S + Y L S + + Q++L LG ++ W+
Sbjct: 167 FGMPRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLG------SSPQLWM 220
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
+KD + L+ L+N ++ PK W + L+N + EML P+ P H +
Sbjct: 221 MKDLLPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMT 280
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
R +A+LI +R+ FA N AE+ K + GMV I GI + + +S
Sbjct: 281 SCARQSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-MYTSVVTS 339
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
+ LA+ +++ + +S I R LN S+V +Y+ G
Sbjct: 340 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRG 386
>gi|449280556|gb|EMC87826.1| UPF0420 protein [Columba livia]
Length = 111
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 48/67 (71%), Gaps = 1/67 (1%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
L + +LP+GFP+SV+SDYL Y LW ++Q + S ++G +A +++L A+G+G A T A
Sbjct: 18 LVQRVLLPQGFPESVSSDYLGYQLWDALQALCSSLAGAVAARSVLQAVGVGDTNASVTGA 77
Query: 178 AINWVLK 184
A+ W+L+
Sbjct: 78 ALGWLLR 84
>gi|390341859|ref|XP_795645.3| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2
[Strongylocentrotus purpuratus]
Length = 311
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 54/249 (21%), Positives = 101/249 (40%), Gaps = 31/249 (12%)
Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
+ +AA +E+++ FP +FV I + +S + +TR+ A R N A+V AK
Sbjct: 1 MNDAAMCIELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAK 60
Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
+Q + + +G+ + ++ + F + T +H+Y N ++ + + TLN
Sbjct: 61 DGSQETLVNLAALFVGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQN 120
Query: 336 RASLVFSEYLLSGQ---APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
R ++ +YL S Q + P VN EP+ + LS K
Sbjct: 121 RFHILVQDYLHSSQFSMSGP-DAVNSREPV----------------IWRLSRPLK----- 158
Query: 393 IEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFC--VVLKESALPQDMLKS 450
L LG V + D E ++L+ K C + + + +D +K+
Sbjct: 159 ----LNLGVPFRTVAQSMRDLEMCLQSNGQEKFLLSFDVKKGCIDIAIHHDSTSEDFIKA 214
Query: 451 LFQASYLYW 459
QA + +
Sbjct: 215 CLQAEIIMY 223
>gi|261335583|emb|CBH18577.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 595
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 62/242 (25%), Positives = 105/242 (43%), Gaps = 13/242 (5%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
F +P G+PDS + Y L S + + Q++L G + ++ W+
Sbjct: 164 FGMPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSIL------NGFLLASSPQLWM 217
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
LKD L+ L+N ++ PK W +F+ L N + EM+ P P+H +
Sbjct: 218 LKDLAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVPNHLLAAAILT 277
Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
R +A+L+ TR+ FA N AE+ K + GMV + LGIA + + S
Sbjct: 278 SCVRQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALGIAYTSLV-PS 336
Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS-----GQAPPVKEV 356
+ LA+ + ++ + S +I R LN S++ Y+ G+ +EV
Sbjct: 337 VTAQLATVLMCCGANLAISHLSMCNIAFRILNETTLSVILRFYMREEGHQRGRVLSPREV 396
Query: 357 ND 358
+D
Sbjct: 397 SD 398
>gi|397621626|gb|EJK66381.1| hypothetical protein THAOC_12704 [Thalassiosira oceanica]
Length = 480
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 68/133 (51%), Gaps = 3/133 (2%)
Query: 99 SNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLA 158
+ SLSSL S + LF LP G P + YL+Y + VQG+ S + GV++
Sbjct: 37 AGGSLSSL-SPRSVPSALSGLFGDLFLPAGHPSTTAGGYLHYQVCDGVQGLCSYLRGVVS 95
Query: 159 TQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLL 216
T A L A G+G A AA+ W ++DG+G L + S + FD K +RL AD+
Sbjct: 96 TGAALAAAGVGDSDATAMGAAVTWAVRDGLGMLGGLAFSYYASSKFDSRVKEYRLLADVA 155
Query: 217 ENAAFGLEMLTPA 229
+A L+M P+
Sbjct: 156 NDAGMILDMARPS 168
>gi|330932017|ref|XP_003303615.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
gi|311320239|gb|EFQ88270.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
Length = 391
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 152/372 (40%), Gaps = 57/372 (15%)
Query: 167 GLGKG---AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFG 222
G+G G A PT A + V+ G ++ I+ ++ G + K +R AD+ + A
Sbjct: 25 GVGVGDSTATPTTALLLSVIHSSTGRIATILFAHRLGTALEPECKMYRFAADIFNDTAMV 84
Query: 223 LEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
L+ L+PAFP V + + + + + S+++ A FA + N EV AK +Q
Sbjct: 85 LDCLSPAFPKPIRVAVLSFSSCLSALCGVCAGSSKASLSAHFARKGNLGEVNAKDSSQET 144
Query: 282 VSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
V +G++ G + ++I S P A + ++ IH+ N + +++ + LN RA+++
Sbjct: 145 VISLLGMLAGSVVISYITS--PLATWGTLILLLTIHLATNYAAVRAVIMHCLNRQRANIL 202
Query: 341 FSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLG 400
FS G ++V+ E +F + +L S + ++ R +G
Sbjct: 203 FSNMFQHGLVLSPRDVSQRERVF-----------ERGGVLRWSDD------KVLGRCSIG 245
Query: 401 SKLSDVVNNKEDAHA--------------LFSLYEDEGYIL-----TEHGGKFCVVLKES 441
L ++N H L ++ +E YIL E G +VLK +
Sbjct: 246 VPLQRLLNRLGTPHKQTGSLTLKSMEISDLLDVFANEAYILLPASTVEEG---LIVLKRT 302
Query: 442 ALPQDMLK----SLFQASYLYWLERNAGIVATSTSADCAPG-----GRLEISLDYVQREF 492
P D +K +L A LE AG+ A P L SL VQ F
Sbjct: 303 CEPIDQIKAWAHALLLAKRREGLE-AAGLGRKRDDASHGPAMEGLISELRESLKEVQAMF 361
Query: 493 NHVKSDSASVGW 504
+ GW
Sbjct: 362 ARYGDEMRDKGW 373
>gi|425769885|gb|EKV08364.1| hypothetical protein PDIP_68690 [Penicillium digitatum Pd1]
gi|425771531|gb|EKV09972.1| hypothetical protein PDIG_59240 [Penicillium digitatum PHI26]
Length = 406
Score = 62.0 bits (149), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 21/197 (10%)
Query: 166 IGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGL 223
+G+G A PT+A + +L+D G ++ I ++ G + K +RL AD+ + A L
Sbjct: 6 VGVGNASASPTSALLLQILQDISGRIATICFAHRVGTALEPECKTYRLAADVFNDIAMIL 65
Query: 224 EMLTPAFPH--HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
+ L+P P V + + AG R+ + S+++ A FA N AE+ AK +Q
Sbjct: 66 DCLSPGVPAGPARVIVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQET 125
Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMY--------CNLKSYQSIELRTLN 333
V G+++G + +HI SFS TW+ + N + +++ + +LN
Sbjct: 126 VISLFGMLVGSVVISHI--------TSFST-TWLVLLFLLALHLSMNYAAVRAVRMTSLN 176
Query: 334 PYRASLVFSEYLLSGQA 350
RA++VFS L S ++
Sbjct: 177 RQRANIVFSALLASDKS 193
>gi|215259609|gb|ACJ64296.1| conserved hypothetical protein [Culex tarsalis]
Length = 234
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 90/207 (43%), Gaps = 25/207 (12%)
Query: 254 STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
+TRS A R N A+V +K AQ I +G+ L H+ + F +T
Sbjct: 3 ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHVQNHRVL-YGLFLFLT 61
Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKS 373
+H+Y N+K+ Q++ LRT N R + EY SG ++VN K++
Sbjct: 62 LLHIYPNIKAVQAVCLRTFNEARYLIALEEYFKSGSMLSPQQVN------------KLER 109
Query: 374 ANKSQLLVLSSEAKD--AAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHG 431
Q + L++ K +A E+ H + L +V+ A F E +++ E
Sbjct: 110 VTIGQTVSLTARVKMGCSARELAHYYRNCYDLENVI-------ACFD--SREKFLIAETR 160
Query: 432 GKFCVVLKESALPQDMLKSLFQ-ASYL 457
V L + P D++KS F ASYL
Sbjct: 161 NYIGVYLHFTVKPLDIIKSYFYVASYL 187
>gi|159490370|ref|XP_001703152.1| hypothetical protein CHLREDRAFT_195109 [Chlamydomonas reinhardtii]
gi|158270782|gb|EDO96617.1| predicted protein [Chlamydomonas reinhardtii]
Length = 280
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 51/94 (54%), Gaps = 4/94 (4%)
Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAAINWV 182
M P+ F + Y Y WR VQ V TQ+LL A+G+ G+ + AAAINWV
Sbjct: 1 MAPQRF---CVASYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRYSGEAAAAINWV 57
Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLL 216
+KDG G L +++ + +GR D K +RL L+
Sbjct: 58 IKDGAGRLGRLLFARWGRELDCELKQFRLMGYLV 91
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 34/75 (45%), Gaps = 2/75 (2%)
Query: 290 LGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
LG L G + L F ++ ++ + K S+EL +N R + +YL GQ
Sbjct: 65 LGRLLFARWGRELDCELKQFRLMG--YLVASRKEVDSVELPYMNRARLAYAARQYLTGGQ 122
Query: 350 APPVKEVNDEEPLFP 364
P V E N EPL P
Sbjct: 123 VPGVAEANHHEPLLP 137
>gi|326479045|gb|EGE03055.1| DUF647 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 494
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/341 (22%), Positives = 135/341 (39%), Gaps = 66/341 (19%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
+ LP G+P SVT DY+ Y ++ S+ +A+ +L+A LG P
Sbjct: 126 ILNVFLPAGYPHSVTDDYIEYQIYESMGRIAT----------ILFAHRLGTSLEPEC--- 172
Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIG 238
K +RL AD+L ++A L+ L+P FP V +
Sbjct: 173 ---------------------------KMYRLAADILNDSAMVLDCLSPIFPKPVRVGLL 205
Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
+ + R+ + S+++ A FA N E+ A +G + ++I
Sbjct: 206 SLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNA---------------VGSVVVSYI 250
Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
S + +A V+ +H+ N + +++++ TLN RA++VFS E +
Sbjct: 251 TSPLETWIA-LIVLLIVHLGTNHAAVRAVKMTTLNRQRANIVFSHLFEDNLILTPAETSK 309
Query: 359 EEPLFPAFHFFKIKSAN-----KSQLLVLSSEAKDAAVEIEHR--LQLGSKLSDVVNNKE 411
EE +F + K+ + Q+ + E R L+ S ++
Sbjct: 310 EERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRGALETSRTTSTTMDAST 369
Query: 412 DAHALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLKS 450
+ AL L+ +E YIL + +VLK A Q LK+
Sbjct: 370 NLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKA 410
>gi|215259607|gb|ACJ64295.1| conserved hypothetical protein [Culex tarsalis]
Length = 234
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 87/206 (42%), Gaps = 23/206 (11%)
Query: 254 STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
+TRS A R N A+V +K AQ I +G+ L H+ + F VT
Sbjct: 3 ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHVQNQRVL-YGLFLFVT 61
Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKS 373
+H+Y N+K+ +++ LRT N R + EY SG ++VN K++
Sbjct: 62 LLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGSMLSPQQVN------------KLER 109
Query: 374 ANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYED-EGYILTEHGG 432
Q + L++ K +L+ N D + + ++ E +++ E
Sbjct: 110 VTIGQTVSLTARVKMGCS--------ARELAQYYRNCYDLENVIACFDSREKFLIAETRN 161
Query: 433 KFCVVLKESALPQDMLKSLFQ-ASYL 457
V L + P D++KS F ASYL
Sbjct: 162 YVGVYLHFTVKPLDIIKSYFYVASYL 187
>gi|397575094|gb|EJK49531.1| hypothetical protein THAOC_31586, partial [Thalassiosira oceanica]
Length = 194
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 46/77 (59%), Gaps = 1/77 (1%)
Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
V LP +P+SV S Y ++ + +A + VL+TQALL A+G+G + A P AAA
Sbjct: 114 IVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAAA 173
Query: 179 INWVLKDGIGYLSKIML 195
+NW+LKD G L ++
Sbjct: 174 LNWLLKDFAGQLGGVLF 190
>gi|300175334|emb|CBK20645.2| unnamed protein product [Blastocystis hominis]
Length = 290
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 88/206 (42%), Gaps = 21/206 (10%)
Query: 225 MLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSK 284
M+TP FP+ F+ + + G+S + L +T + GFA N +V AK +Q
Sbjct: 1 MITPLFPNCFLALATISNIGKSISFLANGATGAGIRYGFANAMNIGDVTAKEGSQTSAVY 60
Query: 285 AIGIMLGIALANHIG-SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
+G++ G+++++ +G SM AS ++ + +Y KS + + L TLN R+ L +
Sbjct: 61 LLGMLFGVSVSSLLGDQSMSTIFASVICMSCLSLYAMNKSLRCVTLPTLNTQRSELACVD 120
Query: 344 YLLSGQA-PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK 402
Y G P E P H D +I + L S
Sbjct: 121 YCSQGIIYQPEAVAKLERYYLPYLH------------------VADCGTDIRLGMSLHSV 162
Query: 403 LSDVVNNKEDAHALFSLYEDEGYILT 428
LS + E+A + L+ D Y+L+
Sbjct: 163 LS-AYRSLEEADDVIQLFGDCNYMLS 187
>gi|409046959|gb|EKM56438.1| hypothetical protein PHACADRAFT_253558 [Phanerochaete carnosa
HHB-10118-sp]
Length = 464
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 80/373 (21%), Positives = 148/373 (39%), Gaps = 56/373 (15%)
Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINW 181
LP G+P SV+ DY Y + ++QG S + G++A +A+L G+G A T A +
Sbjct: 1 MFLPAGYPASVSGDYTRYQVLNALQGFFSSLGGLVAARAVLEGHGVGSADASATNALLLT 60
Query: 182 VLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH---HFVFI 237
+L+D ++ I+ + G K +R AD+ + L+ L+P H FV++
Sbjct: 61 ILQDIFSRVTAILSGYYLGTALFPEAKTYRFLADIWNDIPLVLDTLSPLLSHVSLDFVWL 120
Query: 238 G---------------------AAAGAGRSAAALIQASTRSCFYAGF------AARRNFA 270
+G RS A+ A +++ F AA + +
Sbjct: 121 AFPPFLLPTLVHRPGPWRGIALCFSGIFRSMCAVAAAGSKAALTLHFSQPDHGAASGDIS 180
Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELR 330
++ AK ++ V +G++ G + I + +V+ H+Y N + +S+ +
Sbjct: 181 DLSAKDGSKETVLALVGMLCGSIIMPLIQTQTQ-TCGLLAVIIVFHIYINYYAVRSVVFK 239
Query: 331 TLNPYRASLVFSEYLLS---GQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAK 387
+ N R +++S + S + +EV E LF N S L ++
Sbjct: 240 SYNRQRTCILWSAFRRSEGEHRCLVPREVARHEHLF----------HNPSTL--FDTDLS 287
Query: 388 DAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDM 447
A + +GS L+ ++ DA +G + H ES L +
Sbjct: 288 TGARRVIGHCDMGSPLASILPCSPDAR--------KGLLARRHQCPRWAGAAESTLVGSL 339
Query: 448 LKSLFQASYLYWL 460
L++ Y+ W
Sbjct: 340 LQAFVDEKYVLWF 352
>gi|134084659|emb|CAK43337.1| unnamed protein product [Aspergillus niger]
Length = 484
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 83/169 (49%), Gaps = 14/169 (8%)
Query: 117 RELFVQFMLPEGFPDSVTSDYL-NYSLWRSV-QGVASQIS------GVLATQALLYAIGL 168
++ V+ LP G+P SVT DY Y R + +G++ + +L + A +G+
Sbjct: 69 QDFLVEVFLPAGYPHSVTDDYAPPYQTRRLIHEGLSPSLQQLDCWPPILESSAARLPLGV 128
Query: 169 GKG---AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLE 224
G G A PTAA + +L+D G ++ I+ ++ G + K +RL AD+ + A L+
Sbjct: 129 GVGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMILD 188
Query: 225 MLTPAFP--HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAE 271
L+P P V + + AG R+ + S+++ A F+ N AE
Sbjct: 189 CLSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAE 237
>gi|3242710|gb|AAC23762.1| hypothetical protein [Arabidopsis thaliana]
Length = 405
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 42/63 (66%), Gaps = 1/63 (1%)
Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
P V+ +Y++Y W+ + V S VLATQA+ AIG+G+ ++ ++AA NW+LKDG+
Sbjct: 124 LPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSLASSAAFNWILKDGL 183
Query: 188 GYL 190
G L
Sbjct: 184 GRL 186
>gi|313238188|emb|CBY13282.1| unnamed protein product [Oikopleura dioica]
Length = 93
Score = 56.6 bits (135), Expect = 3e-05, Method: Composition-based stats.
Identities = 26/70 (37%), Positives = 42/70 (60%), Gaps = 1/70 (1%)
Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
L + LP+ +P+SV+ DY Y +W S Q + S ++G LA ++ IG+G A +A
Sbjct: 21 LICKVFLPKDYPNSVSEDYTTYQVWDSAQALCSSVTGSLAINSVFKGIGVGDSSASSLSA 80
Query: 178 AINWVLKDGI 187
NW+++DGI
Sbjct: 81 TSNWIIRDGI 90
>gi|225560567|gb|EEH08848.1| DUF647 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 472
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 68/311 (21%), Positives = 122/311 (39%), Gaps = 65/311 (20%)
Query: 187 IGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAG 244
+G ++ I+ ++ G + K +RL AD+L ++A + L+PA P V + A +
Sbjct: 1 MGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVL 60
Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEV-------------------------------I 273
R+ + S+++ + FA N E+ +
Sbjct: 61 RALCGVAAGSSKASLSSHFAKWGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKL 120
Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTL 332
K +Q V +G++ G + +HI S PFA + ++ +H+ N + +S+ + TL
Sbjct: 121 QKDSSQETVISLLGMLCGSVVVSHI--STPFATWTTLLLLLLVHLSTNYAAVRSVNMTTL 178
Query: 333 NPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
N RA++VFS G E + E +F + K A AA+
Sbjct: 179 NRQRANIVFSTLFEKGSVLTPTEASKCERIFERDGILRWK-------------ASPAALG 225
Query: 393 IEHRLQLGSKLSDVVNNKE-----------DAHALFSLYEDEGYIL--TEHGGKFCVVLK 439
Q+G +++ D + L L+E E Y+L K +VLK
Sbjct: 226 Y---CQIGGSFQELLRGSTHGANSIRDVGIDIYKLLRLFEKEEYVLWFNPIHKKGTIVLK 282
Query: 440 ESALPQDMLKS 450
+ P LK+
Sbjct: 283 NNVTPISQLKA 293
>gi|449682081|ref|XP_002168725.2| PREDICTED: UPF0420 protein C16orf58 homolog [Hydra magnipapillata]
Length = 237
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 61/133 (45%), Gaps = 13/133 (9%)
Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
+G A GA R+A L QA R N A+V AK +Q + + + G +
Sbjct: 7 IVGVAGGATRAALTLHQAR-----------RNNMADVSAKDGSQETLVNLMALFTGFLIT 55
Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
+ ++ F V T++H+Y N ++ ++ + T+N R ++ Y + G+ KE
Sbjct: 56 PLVVDNLFLTWYFFLVFTFLHLYANYQAVSTVVMETINLPRLHILVKHYFIYGKVLTPKE 115
Query: 356 VNDEEPLF--PAF 366
VN +P+ P F
Sbjct: 116 VNKVDPVLRRPGF 128
>gi|17945727|gb|AAL48912.1| RE31721p [Drosophila melanogaster]
Length = 266
Score = 55.1 bits (131), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 54/243 (22%), Positives = 101/243 (41%), Gaps = 30/243 (12%)
Query: 221 FGLEM-LTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
G+E+ + P +PH I + ++ + +TRS A R N A+V +K +Q
Sbjct: 1 MGIEIYVLPKYPHFSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQ 60
Query: 280 GMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
+ +G+ L + I S F VV +H+Y NLK+ +++ LR+ N R +
Sbjct: 61 ETCVNLVASFVGLYLLSLIKSQAVL-YTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLI 119
Query: 340 VFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL 399
E+ S + ++VN E + V + ++L
Sbjct: 120 ALEEFFRSSRMLSPQQVNAMERV-----------------------TVGQTVSVSLNIKL 156
Query: 400 GSKLSDVVNNKEDAHALFSLYED----EGYILTEHGGKFCVVLKESALPQDMLKSLFQA- 454
G + ++++ + + + ++ E +I+ + V L PQD+LK+ F A
Sbjct: 157 GLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTKKCLGVYLHFETRPQDVLKAYFFAV 216
Query: 455 SYL 457
SYL
Sbjct: 217 SYL 219
>gi|384494916|gb|EIE85407.1| hypothetical protein RO3G_10117 [Rhizopus delemar RA 99-880]
Length = 247
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/113 (24%), Positives = 57/113 (50%), Gaps = 1/113 (0%)
Query: 251 IQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFS 310
I ++ F F+ N +++AK +AQ + +G++ G+ L + I S F +F+
Sbjct: 9 IWLASHMTFTKHFSPNGNIGDIVAKDDAQMSTAHLLGMLSGVGLIS-ISHSPLFLFGAFA 67
Query: 311 VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
V++ I+++ +K + E LN + +L+ EY+ G +++ D E F
Sbjct: 68 VLSPINIWSTVKMLHAAEFEILNQAKLTLLAREYIDQGHVATYEKLRDREIGF 120
>gi|167378255|ref|XP_001734737.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165903647|gb|EDR29104.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 400
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 46/228 (20%), Positives = 95/228 (41%), Gaps = 14/228 (6%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINW 181
+ ++P G+P SV Y + ++ + A+ + V++ A+L G +AA
Sbjct: 47 ELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL---RFNSGH---SAAFFV 100
Query: 182 VLKDGI-GYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEML---TPAFPHHFVFI 237
V ++ + G + I+ +G K WR+ DL+ +E+ TP F V
Sbjct: 101 VFREVMTGMMHLIITERWGTSIVFFAKQWRMRIDLISELLRIIEIAFISTPIFYSMSVTC 160
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
G +++ +I+ + +A N AE+ K + + + + +++G + +
Sbjct: 161 SVIEGVLQASRQVIKTRILN----NYAKGNNVAELTEKVQNLETLFRVVSLLIGWLILHV 216
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
+G + F H CN I +TLN R +++ S ++
Sbjct: 217 LGDFTESGIIVFICALISHCVCNFLLSNVIVFKTLNYERLTILMSYFM 264
>gi|407034154|gb|EKE37096.1| hypothetical protein ENU1_209440 [Entamoeba nuttalli P19]
Length = 400
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 46/228 (20%), Positives = 96/228 (42%), Gaps = 14/228 (6%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINW 181
+ ++P G+P SV Y + ++ + A+ + V++ A+L I G +AA
Sbjct: 47 ELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL-RINPGH-----SAAFFV 100
Query: 182 VLKDGI-GYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEML---TPAFPHHFVFI 237
V ++ + G + I+ +G K WR+ +L+ +E+ TP F V
Sbjct: 101 VFREVMTGMMHLIITERWGTSIVFFAKQWRMRIELISELLRIIEIAFISTPIFYSISVTC 160
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
G +++ +I+ + +A N AE+ K + + + + +++G + +
Sbjct: 161 SVIEGVLQASRQVIKTRILN----NYAKGNNVAELTEKVQNLETLFRVVSLLIGWVILHV 216
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
+G + F H CN I +TLN R +++ S ++
Sbjct: 217 LGDFTESGIIVFICALISHCICNFLLSNVIVFKTLNYERLTILMSYFM 264
>gi|67467345|ref|XP_649789.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56466294|gb|EAL44403.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449707066|gb|EMD46790.1| Hypothetical protein EHI5A_044670 [Entamoeba histolytica KU27]
Length = 400
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 46/228 (20%), Positives = 96/228 (42%), Gaps = 14/228 (6%)
Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINW 181
+ ++P G+P SV Y + ++ + A+ + V++ A+L I G +AA
Sbjct: 47 ELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL-RINPGH-----SAAFFV 100
Query: 182 VLKDGI-GYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEML---TPAFPHHFVFI 237
V ++ + G + I+ +G K WR+ +L+ +E+ TP F V
Sbjct: 101 VFREVMTGMMHLIITERWGTSIVFFAKQWRMRIELISELLRIIEIAFISTPIFYSISVTC 160
Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
G +++ +I+ + +A N AE+ K + + + + +++G + +
Sbjct: 161 SVIEGVLQASRQVIKTRILN----NYAKGNNVAELTEKVQNLETLFRVVSLLIGWLILHV 216
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
+G + F H CN I +TLN R +++ S ++
Sbjct: 217 LGDFTESGIIVFICALISHCICNFLLSNVIVFKTLNYERLTILMSYFM 264
>gi|402580829|gb|EJW74778.1| hypothetical protein WUBG_14315 [Wuchereria bancrofti]
Length = 191
Score = 46.6 bits (109), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 28/119 (23%), Positives = 58/119 (48%)
Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPF 304
R+ + ++TR+ A N +V AK +Q + ++ + L + + F
Sbjct: 3 RTVVGVAGSATRTTVINHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLPVVSKNAVF 62
Query: 305 ALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
F + T+IH+Y N ++ + ++ RTLN +V +Y+ + + V E+N++EP+
Sbjct: 63 VWLLFCLFTFIHLYGNYRAVKMLQFRTLNQSLLRIVVKDYIQTRKIGTVNEMNNKEPIL 121
>gi|332372572|gb|AEE61428.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 45.4 bits (106), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 97/253 (38%), Gaps = 29/253 (11%)
Query: 254 STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
+TR+ A R N AE+ AK Q + IG I L N+ SS + S++
Sbjct: 42 ATRASITHHQAIRDNMAEISAKDGTQETIVNLIGSFGSIFLLNYFSSSRAEWILIISLMV 101
Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKS 373
+H+Y N + +++ T N R +LV Y G +VN E + F
Sbjct: 102 -LHLYTNFLAVKALIFNTFNRERLALVLKSYFTIGSVLNPAKVNVNESVLLGF------G 154
Query: 374 ANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN--KEDAHALFSLYEDEGYILTEHG 431
N ++ + LG + +N+ D L S+Y+D+ Y+L +G
Sbjct: 155 LNVKRMCGFD-------------IVLGESIKKSLNDYAAADIKLLQSVYKDKKYMLLVNG 201
Query: 432 --GKFCVVLKESALPQDMLKSLFQASYL-----YWLERNAGIVATSTSADCAPGGRLEIS 484
K V L++ D++ F A L + + I A P RL
Sbjct: 202 TKKKIYVCLEKGETAADVIAGYFHAVCLGIATSIYNSLDLDIYARRQLHHPTPITRLYTY 261
Query: 485 LDYVQREFNHVKS 497
+ ++ NH ++
Sbjct: 262 MKSYEKSHNHFRN 274
>gi|321459665|gb|EFX70716.1| hypothetical protein DAPPUDRAFT_327911 [Daphnia pulex]
Length = 269
Score = 45.1 bits (105), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 63/142 (44%), Gaps = 19/142 (13%)
Query: 139 NYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN 197
NYS+ ++Q +SG L+T+A+L +G+G + P AA + W++KDG +
Sbjct: 6 NYSI-HAIQAFVGSLSGSLSTRAILEGVGVGSSTSTPLAATLMWLIKDGSTRM------- 57
Query: 198 FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRS 257
GR G +L AD + G I AG +S + +TR+
Sbjct: 58 VGRILFAWCNGTKLDADTAVGGSNG----------TMTSILCVAGVAKSIVGVAGGATRA 107
Query: 258 CFYAGFAARRNFAEVIAKGEAQ 279
A + N A+V AK EA+
Sbjct: 108 ALTQHQARQGNLADVSAKVEAK 129
>gi|399218598|emb|CCF75485.1| unnamed protein product [Babesia microti strain RI]
Length = 635
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 41/183 (22%), Positives = 76/183 (41%), Gaps = 20/183 (10%)
Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-------GAIPTAAAINW 181
+P VTS+Y Y+ WR ++ + +S ++ L I L + G I A I++
Sbjct: 116 YPSGVTSNYYKYANWRLLERFSHSMSQSISNNPLAAKILLNQSTYSTNDGVINIAYKIDF 175
Query: 182 --------VLKDGIGYLSKIM-LSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
+ K+ + + K M + N G FD+NP+ + + L+ + L+ F
Sbjct: 176 IATNLMTQIFKEIVSRIVKTMWVGNIGIGFDMNPRAFCFWGSLICTITNTADFLSTLFNF 235
Query: 233 HFVFIGA-AAGAGRSAAALIQASTRSCFYAGFAARRNFA---EVIAKGEAQGMVSKAIGI 288
++ I R L +++ Y F ++ N + E+ AK EA + +G
Sbjct: 236 KYISITRHITTLFRQIGMLTTSASIGPIYGSFKSQNNVSNIGEITAKMEAVTPLCDLLGS 295
Query: 289 MLG 291
G
Sbjct: 296 ATG 298
>gi|440799745|gb|ELR20789.1| hypothetical protein ACA1_055550 [Acanthamoeba castellanii str.
Neff]
Length = 275
Score = 43.1 bits (100), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 51/101 (50%), Gaps = 3/101 (2%)
Query: 269 FAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIE 328
A+V AK +Q +G++LG+ +A + + F+V T H++ N ++ + +
Sbjct: 1 MADVAAKDGSQETAVTLVGLILGLIIAPLVDENYFLIWLLFAVFTTAHLFANYRAVRGLV 60
Query: 329 LRTLNPYRASLVFSEYLLSGQAP---PVKEVNDEEPLFPAF 366
+ TLN R ++V + YL S PV+ E PL+ AF
Sbjct: 61 MDTLNRQRLAIVVTHYLGSASREVLSPVQASAREWPLWFAF 101
>gi|12654037|gb|AAH00822.1| C16orf58 protein [Homo sapiens]
Length = 231
Score = 42.7 bits (99), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 33/63 (52%)
Query: 304 FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
F+L F +T +H+Y N ++ +++ + TLN R LV YL G+ N EPL+
Sbjct: 22 FSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLW 81
Query: 364 PAF 366
F
Sbjct: 82 TGF 84
>gi|422292653|gb|EKU19955.1| hypothetical protein NGA_2040500, partial [Nannochloropsis gaditana
CCMP526]
Length = 181
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 38/163 (23%), Positives = 71/163 (43%), Gaps = 6/163 (3%)
Query: 268 NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSI 327
N AEV+AKG AQ G+ G+ A S +++++T IH+ N + + +
Sbjct: 20 NLAEVLAKGGAQHTAVSLFGLGFGMWFARVANQSPRRVWTAYTLLTVIHLTANYAAMRVL 79
Query: 328 ELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFF---KIKSA--NKSQLLVL 382
++N R +++ + + + EV E + + K+ +A K +
Sbjct: 80 AFTSINRRRLNVLLTAFRENASVLSPAEVARRETIIVDVGRWRPPKLWTALFQKGKRRAT 139
Query: 383 SSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGY 425
+ AKDA I + ++LG ++ D+ E L L+ E Y
Sbjct: 140 AELAKDAHAPIWN-IRLGVRVKDLTRFAEGLTELQRLFSQEQY 181
>gi|154278545|ref|XP_001540086.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150413671|gb|EDN09054.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 425
Score = 42.0 bits (97), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 92/213 (43%), Gaps = 16/213 (7%)
Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQ 325
++F EV G + ++S +G++ G + +HI S P A + ++ +H+ N + +
Sbjct: 142 QSFCEVGKFGRVKCLIS-LLGMLCGSVVVSHI--STPSATWTTLLLLLLVHLSTNYAAVR 198
Query: 326 SIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSE 385
S+ + TLN RA++VFS G E + E +F + K++ + L
Sbjct: 199 SVNMTTLNRQRANIVFSTLFEKGSVLTPTEASKCERIFERDGILRWKASPAA--LGYCQI 256
Query: 386 AKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHGGKFCVVLKESAL 443
+ + + + DV D + L L+E E Y+L K +VLK +
Sbjct: 257 GGSFQELLRGSIHGANSIRDV---GIDIYKLLRLFEKEEYVLWFNPIHKKGTIVLKNNVT 313
Query: 444 PQDMLK----SLFQASYLYWLER-NAGIVATST 471
P LK +L A L L R A +V ST
Sbjct: 314 PISQLKAWSHALIVAKRLTDLRREEADMVNVST 346
>gi|194746211|ref|XP_001955574.1| GF16181 [Drosophila ananassae]
gi|190628611|gb|EDV44135.1| GF16181 [Drosophila ananassae]
Length = 287
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 23/141 (16%)
Query: 386 AKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT---------EHGGKFCV 436
K VE H++++ ++ ++ +K+ A+ +E+E LT H +F V
Sbjct: 101 VKHPLVEEVHQVEIDDRVVEL--SKQYLPAMACGFENEKLKLTIGDGFDYMKNHKNEFDV 158
Query: 437 VLKESALPQDMLKSLFQASYLYWLERNA----GIVATSTSADCAPGGRLEISLDYVQREF 492
++ +S+ P SLFQ SY Y L ++A GIV C+ GG + LDY+++
Sbjct: 159 IITDSSDPIGPAVSLFQESY-YELMKHALKEDGIV-------CSQGGSFWLDLDYIKKTM 210
Query: 493 NHVKSDSASVGWVTDGLIARP 513
+ K A V + + + P
Sbjct: 211 SGCKEHFAKVAYAVTSVPSYP 231
>gi|395331205|gb|EJF63586.1| hypothetical protein DICSQDRAFT_101426 [Dichomitus squalens
LYAD-421 SS1]
Length = 285
Score = 38.9 bits (89), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 87/192 (45%), Gaps = 23/192 (11%)
Query: 175 TAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTP--AF- 230
T A + VL+D L+ I+ + G K +RL AD+L +AA + L+P AF
Sbjct: 16 TNAILLTVLQDVFSRLTTIVGGYYLGTSLTPEAKTYRLLADVLNDAAIISDTLSPYLAFV 75
Query: 231 ----PHHFVFIGAA----------AGAGRSAAALIQASTRSCFYAGFAAR----RNFAEV 272
+ FV GA+ +GA R+ + +++ FA + ++
Sbjct: 76 TLSTQYPFVRPGASSSLRVVALCLSGAFRALCGAVAGGSKAALTVHFATDGERPGDVGDL 135
Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
AK ++ V +G++ G + +++ S+ F++V + H+ N + + I +R
Sbjct: 136 SAKDGSKETVLALLGMLCGSVVVHYVHSAQATYAVLFALV-FFHLAFNYMAVRVISMRYF 194
Query: 333 NPYRASLVFSEY 344
N RAS+ + +Y
Sbjct: 195 NRQRASIAWQKY 206
>gi|374921923|gb|AFA26139.1| hypothetical protein, partial [Lolium perenne]
Length = 109
Score = 38.5 bits (88), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 37/66 (56%)
Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
I P +ASF+V++ ++ + + +S+ L TLN R ++ ++ +G P +KE N
Sbjct: 6 ISKRNPSLVASFAVLSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFVKTGHVPSLKEGN 65
Query: 358 DEEPLF 363
EE +F
Sbjct: 66 SEETVF 71
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.133 0.402
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,406,292,817
Number of Sequences: 23463169
Number of extensions: 345282346
Number of successful extensions: 1606304
Number of sequences better than 100.0: 543
Number of HSP's better than 100.0 without gapping: 444
Number of HSP's successfully gapped in prelim test: 99
Number of HSP's that attempted gapping in prelim test: 1602993
Number of HSP's gapped (non-prelim): 2078
length of query: 529
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 382
effective length of database: 8,910,109,524
effective search space: 3403661838168
effective search space used: 3403661838168
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)