BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 045134
         (529 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225462223|ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  722 bits (1864), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/500 (71%), Positives = 412/500 (82%), Gaps = 15/500 (3%)

Query: 27  DSSGSFYSLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIP 86
           +++    ++LLFV S+LY F H Q+ TA+++          KE + VWEV+G K  K+IP
Sbjct: 199 ETAHMLRAVLLFVFSVLYSFFHFQLDTALSK---------EKEEEGVWEVRGGKWHKIIP 249

Query: 87  DFTKDAFVVAS----ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSL 142
           D +KD F+V +    A  A  SS L    LW +C+ELF++ MLPEGFP SVTSDYL+Y+L
Sbjct: 250 DSSKDEFLVVTPGIGAVGAPKSSTLP--NLWLQCKELFLRLMLPEGFPHSVTSDYLDYTL 307

Query: 143 WRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHF 202
           WR VQGVASQISGVLATQALLYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +GRHF
Sbjct: 308 WRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHF 367

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAG 262
           DV+PKGWRLFADLLENAA+GLE+LTPAFPH F+ IGA AGAGRSAAALIQASTRSCFYAG
Sbjct: 368 DVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAG 427

Query: 263 FAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLK 322
           FAA+RNFAEVIAKGEAQGMVSK+IGIMLGIALAN IGSS P + ASF+VVT +HM+CNLK
Sbjct: 428 FAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLK 487

Query: 323 SYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVL 382
           SYQSI+LRTLNPYRASLVFSEYLLSGQ P +KEVN+EEPLFP       K   K+Q  VL
Sbjct: 488 SYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVL 547

Query: 383 SSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESA 442
           S+EAKDAA EIE RLQLGSKLS+VV++KED  ALF LY +E YILTEH G+F V+LKES 
Sbjct: 548 STEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESC 607

Query: 443 LPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASV 502
            PQDMLKS+F  +YLYWLERNAGI++   S DC PGGRL+ISL+YVQREFNH+K+DS  V
Sbjct: 608 SPQDMLKSVFHVNYLYWLERNAGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFV 667

Query: 503 GWVTDGLIARPLPNRIRPGY 522
           GW TDGLIARPLPNRIRPG+
Sbjct: 668 GWATDGLIARPLPNRIRPGH 687


>gi|296082804|emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  721 bits (1860), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/494 (72%), Positives = 410/494 (82%), Gaps = 15/494 (3%)

Query: 34  SLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAF 93
           ++LLFV S+LY F H Q+ TA+++          KE + VWEV+G K  K+IPD +KD F
Sbjct: 4   AVLLFVFSVLYSFFHFQLDTALSK---------EKEEEGVWEVRGGKWHKIIPDSSKDEF 54

Query: 94  VVAS----ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGV 149
           +V +    A  A  SS L    LW +C+ELF++ MLPEGFP SVTSDYL+Y+LWR VQGV
Sbjct: 55  LVVTPGIGAVGAPKSSTLP--NLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGV 112

Query: 150 ASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGW 209
           ASQISGVLATQALLYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +GRHFDV+PKGW
Sbjct: 113 ASQISGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGW 172

Query: 210 RLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNF 269
           RLFADLLENAA+GLE+LTPAFPH F+ IGA AGAGRSAAALIQASTRSCFYAGFAA+RNF
Sbjct: 173 RLFADLLENAAYGLEILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNF 232

Query: 270 AEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIEL 329
           AEVIAKGEAQGMVSK+IGIMLGIALAN IGSS P + ASF+VVT +HM+CNLKSYQSI+L
Sbjct: 233 AEVIAKGEAQGMVSKSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQL 292

Query: 330 RTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDA 389
           RTLNPYRASLVFSEYLLSGQ P +KEVN+EEPLFP       K   K+Q  VLS+EAKDA
Sbjct: 293 RTLNPYRASLVFSEYLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDA 352

Query: 390 AVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLK 449
           A EIE RLQLGSKLS+VV++KED  ALF LY +E YILTEH G+F V+LKES  PQDMLK
Sbjct: 353 AAEIERRLQLGSKLSEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLK 412

Query: 450 SLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGL 509
           S+F  +YLYWLERNAGI++   S DC PGGRL+ISL+YVQREFNH+K+DS  VGW TDGL
Sbjct: 413 SVFHVNYLYWLERNAGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGL 472

Query: 510 IARPLPNRIRPGYV 523
           IARPLPNRIRPG++
Sbjct: 473 IARPLPNRIRPGHI 486


>gi|255557849|ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
 gi|223541000|gb|EEF42558.1| conserved hypothetical protein [Ricinus communis]
          Length = 541

 Score =  712 bits (1838), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/493 (75%), Positives = 414/493 (83%), Gaps = 19/493 (3%)

Query: 42  LLYCFCHL---QVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASA 98
           LL CF  L     ++A ART    +++   E D+VW VKGSKR +LIPDF KD F+V   
Sbjct: 46  LLCCFVALWLQSASSAFARTTLKEKEEEGAE-DSVWVVKGSKRIRLIPDFIKDEFLV--- 101

Query: 99  SNASLSSLLSV----------NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQG 148
            N SL S                LW +CR LFV+ MLPEG+P SVTSDYL+YSLWR VQG
Sbjct: 102 -NPSLPSSYDDIISSSWLHFGRTLWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQG 160

Query: 149 VASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKG 208
           VASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKI+LS +GRHFDVNPKG
Sbjct: 161 VASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKG 220

Query: 209 WRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRN 268
           WRLFADLLENAAFGLE+LTPAFPH FVFIGAAAGAGRSAAALIQA+TRSCFYAGFAA+RN
Sbjct: 221 WRLFADLLENAAFGLEILTPAFPHLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRN 280

Query: 269 FAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIE 328
           FAEVIAKGEAQGMVSK IGIMLGI LAN IGSS+P ALASFSVVTWIHM+CNLKSYQSI+
Sbjct: 281 FAEVIAKGEAQGMVSKFIGIMLGIGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQ 340

Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKD 388
           LRTLNPYRASLVFSEYLLSGQAPP+K+VNDEEPLFPA  F   KSA+K  L+VLS EA+D
Sbjct: 341 LRTLNPYRASLVFSEYLLSGQAPPIKDVNDEEPLFPAV-FPHFKSADKPSLVVLSLEARD 399

Query: 389 AAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDML 448
           AA EIE RLQLGSKLSDVVN+KED  ALF+LY+DEGYILTE+ G+FCVVLKES   QDML
Sbjct: 400 AATEIERRLQLGSKLSDVVNSKEDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDML 459

Query: 449 KSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDG 508
           K+LFQ +YLYWLERNAG+ A  TSADC  GGRL++SL+Y+QREF+HV++DS SVGWV DG
Sbjct: 460 KALFQVNYLYWLERNAGLDARGTSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADG 519

Query: 509 LIARPLPNRIRPG 521
           LIARPLPNRI PG
Sbjct: 520 LIARPLPNRIYPG 532


>gi|42565615|ref|NP_190175.2| uncharacterized protein [Arabidopsis thaliana]
 gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis thaliana]
 gi|30794095|gb|AAP40490.1| unknown protein [Arabidopsis thaliana]
 gi|110739240|dbj|BAF01534.1| hypothetical protein [Arabidopsis thaliana]
 gi|332644566|gb|AEE78087.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 608

 Score =  693 bits (1789), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/489 (72%), Positives = 405/489 (82%), Gaps = 4/489 (0%)

Query: 36  LLFVPSLLYCFCH--LQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAF 93
           L F+   L CF H  L  A+AIA+   +S+ +G+   + VWEV+GSKR +L+PDF KD F
Sbjct: 110 LCFLLLGLSCFFHFRLSAASAIAKD-QNSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEF 168

Query: 94  VVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQI 153
           V +  S   LSS L+   L  +CR L  QF+LPEGFP+SVTSDYL+YSLWR VQG+ASQI
Sbjct: 169 V-SEESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQI 227

Query: 154 SGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFA 213
           SGVLATQ+LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFA
Sbjct: 228 SGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFA 287

Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
           DLLENAAFG+EMLTP FP  FV IGAAAGAGRSAAALIQA+TRSCF AGFA++RNFAEVI
Sbjct: 288 DLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVI 347

Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
           AKGEAQGMVSK++GI+LGI +AN IG+S   ALA+F VVT IHMY NLKSYQ I+LRTLN
Sbjct: 348 AKGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLN 407

Query: 334 PYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEI 393
           PYRASLVFSEYL+SGQAP +KEVNDEEPLFP   F  +KS  K Q  VLSSEAK AA +I
Sbjct: 408 PYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFSNMKSPEKLQDFVLSSEAKAAAADI 467

Query: 394 EHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQ 453
           E RLQLGSKLSDV++NKE+A ALF LY +EGYILTEH G+FCV+LKES+ PQDML+SLFQ
Sbjct: 468 EERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHKGRFCVMLKESSTPQDMLRSLFQ 527

Query: 454 ASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARP 513
            +YLYWLE+NAGI   ST +DC PGGRL ISLDYV+REF H K DS SVGWVT+GLIARP
Sbjct: 528 VNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARP 587

Query: 514 LPNRIRPGY 522
           LP RIR G+
Sbjct: 588 LPTRIRLGH 596


>gi|356495287|ref|XP_003516510.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
          Length = 649

 Score =  692 bits (1786), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/499 (72%), Positives = 402/499 (80%), Gaps = 7/499 (1%)

Query: 35  LLLFVPSLLYCFCHL-QVATAIART-ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDA 92
           L L   S L  FCHL     A A+T + S+  D +   + V+EVKG K TKL+PD T D 
Sbjct: 90  LSLLCSSALCFFCHLLHAKLAKAKTLSPSTTADTSLFSEPVYEVKGGKWTKLVPDLTNDV 149

Query: 93  FVVASASNAS-LSSLLSVNKL----WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQ 147
           FV A     S LSSL   ++L    W +C ++F + MLPEGFP+SVTSDYL YSLWR+VQ
Sbjct: 150 FVSAQQGFLSELSSLKVPSQLATFVWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQ 209

Query: 148 GVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPK 207
           GVA Q+SGVLATQ+LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDV+PK
Sbjct: 210 GVACQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPK 269

Query: 208 GWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARR 267
           GWRLFADLLENAAFGLEM TPAFP  FV IGA AGA RSAA+LIQASTRSCF+AGFAA+R
Sbjct: 270 GWRLFADLLENAAFGLEMCTPAFPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQR 329

Query: 268 NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSI 327
           NFAEVIAKGE QGM S+ IGI LGI L N IGSS P  LASF+V+TWIHMYCNLKSYQSI
Sbjct: 330 NFAEVIAKGEVQGMASRFIGIGLGIGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSI 389

Query: 328 ELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAK 387
           +LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPA        ANK+Q +VLSSEAK
Sbjct: 390 QLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAVPILNATFANKAQSIVLSSEAK 449

Query: 388 DAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDM 447
           DAA EIEHRLQLGSKLS++VN+KED  ALF LY++EGYIL+E+ GKFCVVLKE+   QDM
Sbjct: 450 DAAAEIEHRLQLGSKLSEIVNSKEDVLALFGLYKNEGYILSEYMGKFCVVLKENCSQQDM 509

Query: 448 LKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTD 507
           LK+LFQ +YLYWLE+NAGI    T  D  PGGRL ISLDYV+REFNHVK+D   VGWVTD
Sbjct: 510 LKALFQVNYLYWLEKNAGIGGRGTLNDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTD 569

Query: 508 GLIARPLPNRIRPGYVEPS 526
           GLIARPLPNRIR G   PS
Sbjct: 570 GLIARPLPNRIRIGDTPPS 588


>gi|297815746|ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321594|gb|EFH52015.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 613

 Score =  690 bits (1781), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/515 (69%), Positives = 410/515 (79%), Gaps = 3/515 (0%)

Query: 16  WWWHGGNGGDDDSSGSFYSLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEY-DAVW 74
           WW++GG    DDSS +       V  L  CF H +++ A A    S  D       + VW
Sbjct: 97  WWFNGGGDNSDDSSFNLRYFCFLVLGL-SCFFHFRLSAASAIAKASDSDSSGDTDKETVW 155

Query: 75  EVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVT 134
           EV+GSKR +L+PDF KD FV +  S   LSS L+   L  +CR L  QF+LPEGFP+SVT
Sbjct: 156 EVRGSKRKRLVPDFVKDEFV-SEESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVT 214

Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIM 194
           SDYL+YSLWR VQG+ASQ+SGVLATQ+LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKIM
Sbjct: 215 SDYLDYSLWRGVQGIASQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIM 274

Query: 195 LSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQAS 254
           LS +GRHFDV+PKGWRLFADLLENAAFG+EMLTP FP  FV IGAAAGAGRSAAALIQA+
Sbjct: 275 LSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAA 334

Query: 255 TRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTW 314
           TRSCF AGFA++RNFAEVIAKGEAQGMVSK++GI+LGI +AN IG+S   ALA+F VVT 
Sbjct: 335 TRSCFNAGFASQRNFAEVIAKGEAQGMVSKSMGILLGIVVANCIGTSTSLALAAFGVVTT 394

Query: 315 IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSA 374
           IHMY NLKSYQ I+LRTLNPYRASLVFSEYL+SGQAP +KEVNDEEPLFP   F  +KS 
Sbjct: 395 IHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFLNMKSP 454

Query: 375 NKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKF 434
            K Q  VLSSEAK AA +IE RLQLGSKLSDV++NKE+A ALF LY +EGYILTEH G+F
Sbjct: 455 EKLQDFVLSSEAKAAAEDIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRF 514

Query: 435 CVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNH 494
           CV+LKES+ PQDML+SLFQ +YLYWLE+NAGI   ST  DC PGGRL ISLDYV+REF H
Sbjct: 515 CVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYTDCKPGGRLHISLDYVRREFEH 574

Query: 495 VKSDSASVGWVTDGLIARPLPNRIRPGYVEPSVAS 529
            K DS SVGWVT+GLIARPLP RIR G+    ++S
Sbjct: 575 AKEDSQSVGWVTEGLIARPLPTRIRLGHDREPLSS 609


>gi|356540900|ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
          Length = 592

 Score =  686 bits (1770), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/462 (74%), Positives = 378/462 (81%), Gaps = 8/462 (1%)

Query: 71  DAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNK------LWDECRELFVQFM 124
           + V+EVKG K TKL+PD T D FV  SA    LS L S+        +W +C ++F + M
Sbjct: 128 EPVYEVKGGKWTKLVPDPTDDVFV--SAQQGFLSELSSLKPSQLATFVWLKCSDIFTRLM 185

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLK 184
           LPEGFP+SVTSDYL YSLWR+VQGVA Q+SGVLATQ+LLYA+GLGKGAIPTAAAINWVLK
Sbjct: 186 LPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLK 245

Query: 185 DGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAG 244
           DGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEM TPA P  FV IGA AGA 
Sbjct: 246 DGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMSTPACPQFFVLIGAVAGAS 305

Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPF 304
           RSAA+LIQASTRSCF+AGFAA+RNFAEVIAKGE QGM S+ IGI+LGI L N IGSS P 
Sbjct: 306 RSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIVLGIGLGNCIGSSTPL 365

Query: 305 ALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFP 364
            LASF+V+TWIHMYCNLKSYQSI+LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFP
Sbjct: 366 VLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFP 425

Query: 365 AFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEG 424
           A        A+K+Q   LSSEAKDAA EIEHRLQLGSKLS++VN+KED  ALF LY++EG
Sbjct: 426 AVPILNATFASKAQSFALSSEAKDAAAEIEHRLQLGSKLSEIVNSKEDVLALFGLYKNEG 485

Query: 425 YILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEIS 484
           YIL+EH GK+ VVLKE     DMLK+LFQ +YLYWLE+NAGI    T  D  PGGRL IS
Sbjct: 486 YILSEHMGKYSVVLKEKCSQLDMLKALFQVNYLYWLEKNAGIEGRGTLNDSKPGGRLHIS 545

Query: 485 LDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPGYVEPS 526
           LDYV+REFNHVK+D   VGWVTDGLIARPLPNRI  G   PS
Sbjct: 546 LDYVEREFNHVKNDGELVGWVTDGLIARPLPNRICIGDTAPS 587


>gi|357484331|ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
 gi|355513788|gb|AES95411.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
          Length = 630

 Score =  679 bits (1751), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/503 (71%), Positives = 406/503 (80%), Gaps = 20/503 (3%)

Query: 33  YSLL--LFVPSLLYCFCHLQVATAIARTAT-SSEDDGNKEYDAVWEVKGSKRTKLIPDFT 89
           Y+LL  L   S+ +C C L    A+A+T + SSEDD   +   ++EVKG    KL PD  
Sbjct: 87  YTLLFTLLFSSVTFCLCQL----AMAKTRSLSSEDDILTQ--PIYEVKGGNLIKLFPDNL 140

Query: 90  KDAFVVASASNASL-SSLLSVNK------LWDECRELFVQFMLPEGFPDSVTSDYLNYSL 142
           KD F+   ASN  L S L S+N       L+++CRE  V+ MLPEGFP+SVTSDYL YSL
Sbjct: 141 KDIFI---ASNPGLFSELSSLNSSQVPTFLYNKCREFVVRLMLPEGFPNSVTSDYLEYSL 197

Query: 143 WRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHF 202
           WR VQGVA Q+SGVLATQALLYA+GLGKGAIPTAAAINWVLKDGIGYLSKI+LS+FGRHF
Sbjct: 198 WRGVQGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSDFGRHF 257

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAG 262
           DVNPKGWRLFADLLENAAFGLEM TPAFPH FV IGA AGA RSAA+LIQASTRSCF+AG
Sbjct: 258 DVNPKGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAFAGASRSAASLIQASTRSCFFAG 317

Query: 263 FAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLK 322
           FAA+RNFAEVIAKGE QGMVS+ IGI +GI L N IGSS P  LASF VVTW+HMYCNLK
Sbjct: 318 FAAQRNFAEVIAKGEVQGMVSRFIGIGIGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLK 377

Query: 323 SYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANK-SQLLV 381
           SYQSI+LRTLNP+RASLVFSEYLLSGQAPPVKEVN EEPLFPA        ANK +Q +V
Sbjct: 378 SYQSIQLRTLNPHRASLVFSEYLLSGQAPPVKEVNAEEPLFPAVPILNAPFANKETQSIV 437

Query: 382 LSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKES 441
           LSSEAKDAAVEIE RLQLGSKLS+++NNKE+  ALFSLY++EGYIL+EH GKFCVVLKE+
Sbjct: 438 LSSEAKDAAVEIESRLQLGSKLSEIINNKEEVLALFSLYKNEGYILSEHTGKFCVVLKET 497

Query: 442 ALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSAS 501
               DMLK+LFQ +YLYWLE+NAGI    T  DC PGGRL+ISL+Y +REFNHV++D  S
Sbjct: 498 CSQLDMLKALFQVNYLYWLEKNAGIEGRGTLYDCKPGGRLQISLEYAEREFNHVRNDGES 557

Query: 502 VGWVTDGLIARPLPNRIRPGYVE 524
           VGW+TDGLIARPLPNR RPG  E
Sbjct: 558 VGWITDGLIARPLPNRCRPGNTE 580


>gi|7339490|emb|CAB82813.1| putative protein [Arabidopsis thaliana]
          Length = 631

 Score =  674 bits (1739), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/512 (68%), Positives = 403/512 (78%), Gaps = 27/512 (5%)

Query: 36  LLFVPSLLYCFCH--LQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAF 93
           L F+   L CF H  L  A+AIA+   +S+ +G+   + VWEV+GSKR +L+PDF KD F
Sbjct: 110 LCFLLLGLSCFFHFRLSAASAIAKD-QNSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEF 168

Query: 94  VVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQI 153
           V +  S   LSS L+   L  +CR L  QF+LPEGFP+SVTSDYL+YSLWR VQG+ASQI
Sbjct: 169 V-SEESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQI 227

Query: 154 SGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFA 213
           SGVLATQ+LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFA
Sbjct: 228 SGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFA 287

Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
           DLLENAAFG+EMLTP FP  FV IGAAAGAGRSAAALIQA+TRSCF AGFA++RNFAEV 
Sbjct: 288 DLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVY 347

Query: 274 -----------------------AKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFS 310
                                  + GEAQGMVSK++GI+LGI +AN IG+S   ALA+F 
Sbjct: 348 NNFYMALVLITYQQLFVFLNYSGSLGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFG 407

Query: 311 VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFK 370
           VVT IHMY NLKSYQ I+LRTLNPYRASLVFSEYL+SGQAP +KEVNDEEPLFP   F  
Sbjct: 408 VVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFSN 467

Query: 371 IKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEH 430
           +KS  K Q  VLSSEAK AA +IE RLQLGSKLSDV++NKE+A ALF LY +EGYILTEH
Sbjct: 468 MKSPEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEH 527

Query: 431 GGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQR 490
            G+FCV+LKES+ PQDML+SLFQ +YLYWLE+NAGI   ST +DC PGGRL ISLDYV+R
Sbjct: 528 KGRFCVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRR 587

Query: 491 EFNHVKSDSASVGWVTDGLIARPLPNRIRPGY 522
           EF H K DS SVGWVT+GLIARPLP RIR G+
Sbjct: 588 EFEHAKEDSESVGWVTEGLIARPLPTRIRLGH 619


>gi|449455818|ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 611

 Score =  641 bits (1654), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/510 (65%), Positives = 392/510 (76%), Gaps = 11/510 (2%)

Query: 14  GSWWWHGGNGGDDDSSGSFYSLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAV 73
           G  WW         S  +F  L  F  S+L CFC  Q+A A+AR   ++E        ++
Sbjct: 112 GFGWWQYDGDSPPWSDNAF--LAFFFSSVLGCFCLFQLAVALARNNMNTE--------SI 161

Query: 74  WEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSV 133
           WEVKG KR +LI D  +D F VA+   +S  S   VN +W  C ++F + MLPEGFPDSV
Sbjct: 162 WEVKGGKRIRLILDTYRDEFHVATGMPSSSLSFSFVN-VWLRCSDIFTRLMLPEGFPDSV 220

Query: 134 TSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKI 193
           TSDYL YSLWR VQG+ASQ+SGVLATQALLYA+GLGKGAIPTAAA+NWVLKDG GYLSKI
Sbjct: 221 TSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGFGYLSKI 280

Query: 194 MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQA 253
            LS +GRHFDV+PKGWRLFADLLENAA+G+EMLTPAFP HFV IGAAAGAGRSAAALIQA
Sbjct: 281 FLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRSAAALIQA 340

Query: 254 STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
           +TRSCFYAGFAA+RNFAEVIAKGEAQGMVSK+IG+MLGI LAN I SS   AL  FS+VT
Sbjct: 341 ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLALGCFSIVT 400

Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKS 373
            IHM+CNLKSY+SI+LRTLNPYRASLVFSEYLLSG+ P +K+VN+EEPLFPA      K+
Sbjct: 401 LIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAVPLLNRKA 460

Query: 374 ANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGK 433
            + S+  +LS+EAK++A  IE RLQLGSKLSDV   +ED   L SL+  E YIL+EH GK
Sbjct: 461 PDWSRDFLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEHRGK 520

Query: 434 FCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFN 493
           +CV+LKESA P DMLK++F  +YL+WLERNAGI A S S DC PGGRL++SL+YV+REF 
Sbjct: 521 YCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVEREFK 580

Query: 494 HVKSDSASVGWVTDGLIARPLPNRIRPGYV 523
           HVK D    GW TDGLIARPL  RI   +V
Sbjct: 581 HVKYDGELAGWSTDGLIARPLTTRICECHV 610


>gi|38344695|emb|CAE02373.2| OSJNBb0096E05.17 [Oryza sativa Japonica Group]
          Length = 593

 Score =  613 bits (1582), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 321/473 (67%), Positives = 372/473 (78%), Gaps = 21/473 (4%)

Query: 66  GNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS----------------ASNASLSSLLSV 109
           G +E  +VWEV+G  RT L+PD T  ++VVA                 A  A L++L   
Sbjct: 121 GGREGASVWEVRGGARTLLVPDPTGTSYVVAGDGRRKQAEGEDAEKAGAGRAELAALRRQ 180

Query: 110 NKL-WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
            +L W  C  + VQ +LP+G+PDSV+SDYL YSLWR VQG+ASQISGVL+TQALLYA+GL
Sbjct: 181 LELSWRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGL 240

Query: 169 GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
           GKGAIPTAAA+NWVLKDG+GYLSKI+LS FGRHFDVNPKGWRLFADLLEN A+GLE+LTP
Sbjct: 241 GKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTP 300

Query: 229 AFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
            FPH FV IGAAAGAGRSAAALIQA+TRSCFYAGFA +RNFAEVIAKGEAQGMVSK +GI
Sbjct: 301 VFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFLGI 360

Query: 289 MLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
           MLGIALAN IGSS+  AL SF+ VT +HMYCNLKSYQSI+LRTLNPYRASLVFSEYLLSG
Sbjct: 361 MLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 420

Query: 349 QAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVN 408
           Q P VKEVNDEEPLF        +  +K    +LS +AKDAA  I  RLQLGSKLS+++ 
Sbjct: 421 QVPSVKEVNDEEPLFLNLSIGTSRKESK----ILSPQAKDAAEIICRRLQLGSKLSEIIE 476

Query: 409 NKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVA 468
           NKEDA ALF LY++E Y+LTE+ GKFCV+LKE + P+DMLKSLF  +YLYWLE+  GI  
Sbjct: 477 NKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLKSLFHVNYLYWLEKYLGIKP 536

Query: 469 TSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPG 521
           +  ++ C PGGRLE SLDY QREF+HVK D ++ GWV DGLIARPLP RIR G
Sbjct: 537 SDVASACRPGGRLEASLDYTQREFSHVKLDGSNGGWVMDGLIARPLPVRIRIG 589


>gi|414879009|tpg|DAA56140.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
          Length = 593

 Score =  609 bits (1570), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 316/477 (66%), Positives = 375/477 (78%), Gaps = 25/477 (5%)

Query: 60  TSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASN----------ASLSSLLS- 108
           TS +D G+     VWEV+G KRT+L+PD T  ++++A               S+S  L+ 
Sbjct: 123 TSGDDAGD-----VWEVRGGKRTRLMPDPTWTSYLIAGDDGTKREDGDEKVGSISKDLAA 177

Query: 109 ----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLY 164
               + + W    E+ V  +LP+G+P SV+SDYLNYSLWR VQGVASQISGVL+TQALLY
Sbjct: 178 LRRQLGRSWRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLY 237

Query: 165 AIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLE 224
           A+GLGKGAIPTAAA+NWVLKDG+GYLSKIMLS FGRHFDVNPKGWRLFAD LEN A+GLE
Sbjct: 238 AVGLGKGAIPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLE 297

Query: 225 MLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSK 284
           +LTP FPH FV IGAAAGAGRSAA+LIQ++TRSCFYAGFA +RNFAEVIAKGEAQGMVSK
Sbjct: 298 ILTPVFPHLFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSK 357

Query: 285 AIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEY 344
            +GI LGIALANHIGSS+P AL SF+ +T +HMYCNLKSYQSI+LRTLNPYRASLVFSEY
Sbjct: 358 FVGIGLGIALANHIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEY 417

Query: 345 LLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLS 404
           LLSGQ P VKEVNDEEPL     FF +      Q  +LS+EAKDAA  I  RLQ+GSKLS
Sbjct: 418 LLSGQVPSVKEVNDEEPL-----FFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLS 472

Query: 405 DVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNA 464
           +++ +KEDA ALF LY++E Y+L ++ GKFC+VLKE + P+DMLKSLF  +YLYWLER  
Sbjct: 473 EIIESKEDACALFDLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLKSLFHVNYLYWLERYM 532

Query: 465 GIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPG 521
           G   ++ +++C PGGRLE+SLDY QREF+HVK D +  GWV DGLIARPLP RIR G
Sbjct: 533 GFKPSNVASECRPGGRLEVSLDYAQREFSHVKHDGSDGGWVMDGLIARPLPVRIRIG 589


>gi|242055389|ref|XP_002456840.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
 gi|241928815|gb|EES01960.1| hypothetical protein SORBIDRAFT_03g043876 [Sorghum bicolor]
          Length = 569

 Score =  608 bits (1569), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 316/489 (64%), Positives = 381/489 (77%), Gaps = 23/489 (4%)

Query: 52  ATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNAS--------- 102
           A+A+A    +S DD       VWEV+G  RT+L+PD T  ++++A    +          
Sbjct: 89  ASALAEALGNSGDDAG----GVWEVRGGTRTRLVPDSTWTSYLIAGDDGSKREDEDGKVG 144

Query: 103 -----LSSLL-SVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGV 156
                L++L   + + W  C ++ VQ +LP+G+P SV+SDY+NYSLWR VQGVASQISGV
Sbjct: 145 DGPEGLAALRRQLGRSWRRCADVAVQLLLPDGYPHSVSSDYMNYSLWRGVQGVASQISGV 204

Query: 157 LATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLL 216
           L+TQALLYA+GLG+GAIPTAAA+NWVLKDG+GYLSKIMLS FGRHFDVNPKGWRLFAD L
Sbjct: 205 LSTQALLYAVGLGRGAIPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFL 264

Query: 217 ENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKG 276
           EN A+GLE+LTP FPH FV IGAAAGA RSAA+LIQA+TRSCFYAGFA +RNFAEVIAKG
Sbjct: 265 ENMAYGLEILTPVFPHLFVPIGAAAGASRSAASLIQAATRSCFYAGFAVQRNFAEVIAKG 324

Query: 277 EAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
           EAQGMVSK +GI LGIALAN IGSS+P AL SF+ +T +HMYCNLKSYQSI+LRTLNPYR
Sbjct: 325 EAQGMVSKFVGIGLGIALANQIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYR 384

Query: 337 ASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHR 396
           ASLVFSEYLLSGQ P VKEVNDEEPLF  F+     S  +S+  +LS+EAKDAA  I  R
Sbjct: 385 ASLVFSEYLLSGQVPSVKEVNDEEPLF--FNLSLGASCQQSK--ILSAEAKDAADRICRR 440

Query: 397 LQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
           LQ+GSKLS+++ +KEDA ALF LY++E Y+L ++ GKFC+VLKE + P+DMLKSLF  +Y
Sbjct: 441 LQMGSKLSEIIESKEDACALFDLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLKSLFHVNY 500

Query: 457 LYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPN 516
           LYWLER  G   ++ +++C PGGRLE+SLDY QREF+HVK D +  GWV DGLIARPLP 
Sbjct: 501 LYWLERYMGFKPSNVASECRPGGRLEVSLDYAQREFSHVKHDGSDGGWVMDGLIARPLPV 560

Query: 517 RIRPGYVEP 525
           RIR G   P
Sbjct: 561 RIRIGDGAP 569


>gi|357167308|ref|XP_003581100.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
           distachyon]
          Length = 604

 Score =  605 bits (1559), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 320/491 (65%), Positives = 376/491 (76%), Gaps = 23/491 (4%)

Query: 51  VATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS------------- 97
           + +A+A    +S  DG     +VWEV+GS RT L+PD    ++ VA              
Sbjct: 122 IPSALAEALGASGGDGA----SVWEVRGSSRTLLVPDPDGTSYRVAEDEKHKQEEGMEKV 177

Query: 98  -ASNASLSSLL-SVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISG 155
            A    L++L   + + W  C ++ +Q +LP+G+P+SV+SDYL+YSLWR +QGVASQISG
Sbjct: 178 WAGRDDLAALRWQLEQSWRRCTDVALQLLLPDGYPNSVSSDYLHYSLWRGLQGVASQISG 237

Query: 156 VLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADL 215
           VL+TQALLYA+GLGKG IPTAAAINWVLKDG+GYLSKI+LS FGRHFDVNPK WRLFADL
Sbjct: 238 VLSTQALLYAVGLGKGVIPTAAAINWVLKDGLGYLSKILLSKFGRHFDVNPKRWRLFADL 297

Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
           LEN A+GLE+LTP FPH FV IGAAAGA RSAAALIQA+TRSCFYAGFA +RNFAEVIAK
Sbjct: 298 LENTAYGLEILTPVFPHLFVPIGAAAGAARSAAALIQAATRSCFYAGFAVQRNFAEVIAK 357

Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
           GEAQGMVSK +GIMLGIALAN IGSS+P AL SF+ VT +HMYCNLKSYQSI+LRTLNPY
Sbjct: 358 GEAQGMVSKFLGIMLGIALANQIGSSVPLALISFAGVTVVHMYCNLKSYQSIQLRTLNPY 417

Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH 395
           RASLVFSEYLLSGQ   VKEVNDEEPLF          A   Q  +LS+EAKDAA  I  
Sbjct: 418 RASLVFSEYLLSGQVSSVKEVNDEEPLF----LNSSMGAKLKQNKILSAEAKDAADRICR 473

Query: 396 RLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQAS 455
           RLQLGSKLS++++N+EDA ALF LY+DE Y+L E+ GKFCVVLKE + P+DMLKS+F  +
Sbjct: 474 RLQLGSKLSEIIDNEEDACALFDLYKDEQYLLLEYKGKFCVVLKEGSSPEDMLKSVFHVN 533

Query: 456 YLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLP 515
           YLYWLE+  G    + ++ C PGGRLE+SLDY QREF HVK D +  GWV DGLIARPLP
Sbjct: 534 YLYWLEKYMGPKPCNVASACRPGGRLEVSLDYAQREFRHVKQDGSDGGWVMDGLIARPLP 593

Query: 516 NRIRPGYVEPS 526
            RIR G V PS
Sbjct: 594 VRIRVGDVVPS 604


>gi|222628554|gb|EEE60686.1| hypothetical protein OsJ_14164 [Oryza sativa Japonica Group]
          Length = 609

 Score =  603 bits (1554), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 321/489 (65%), Positives = 372/489 (76%), Gaps = 37/489 (7%)

Query: 66  GNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS----------------ASNASLSSLLSV 109
           G +E  +VWEV+G  RT L+PD T  ++VVA                 A  A L++L   
Sbjct: 121 GGREGASVWEVRGGARTLLVPDPTGTSYVVAGDGRRKQAEGEDAEKAGAGRAELAALRRQ 180

Query: 110 NKL-WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
            +L W  C  + VQ +LP+G+PDSV+SDYL YSLWR VQG+ASQISGVL+TQALLYA+GL
Sbjct: 181 LELSWRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGL 240

Query: 169 GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
           GKGAIPTAAA+NWVLKDG+GYLSKI+LS FGRHFDVNPKGWRLFADLLEN A+GLE+LTP
Sbjct: 241 GKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTP 300

Query: 229 AFPHHFVFIGAAAGAGRSAAALIQ----------------ASTRSCFYAGFAARRNFAEV 272
            FPH FV IGAAAGAGRSAAALIQ                A+TRSCFYAGFA +RNFAEV
Sbjct: 301 VFPHLFVPIGAAAGAGRSAAALIQTIFTHFMLENANHMRKAATRSCFYAGFAVQRNFAEV 360

Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
           IAKGEAQGMVSK +GIMLGIALAN IGSS+  AL SF+ VT +HMYCNLKSYQSI+LRTL
Sbjct: 361 IAKGEAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTL 420

Query: 333 NPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
           NPYRASLVFSEYLLSGQ P VKEVNDEEPLF        +  +K    +LS +AKDAA  
Sbjct: 421 NPYRASLVFSEYLLSGQVPSVKEVNDEEPLFLNLSIGTSRKESK----ILSPQAKDAAEI 476

Query: 393 IEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLF 452
           I  RLQLGSKLS+++ NKEDA ALF LY++E Y+LTE+ GKFCV+LKE + P+DMLKSLF
Sbjct: 477 ICRRLQLGSKLSEIIENKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLKSLF 536

Query: 453 QASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIAR 512
             +YLYWLE+  GI  +  ++ C PGGRLE SLDY QREF+HVK D ++ GWV DGLIAR
Sbjct: 537 HVNYLYWLEKYLGIKPSDVASACRPGGRLEASLDYTQREFSHVKLDGSNGGWVMDGLIAR 596

Query: 513 PLPNRIRPG 521
           PLP RIR G
Sbjct: 597 PLPVRIRIG 605


>gi|226491780|ref|NP_001140263.1| uncharacterized protein LOC100272306 [Zea mays]
 gi|194698746|gb|ACF83457.1| unknown [Zea mays]
          Length = 451

 Score =  590 bits (1521), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 296/413 (71%), Positives = 344/413 (83%), Gaps = 5/413 (1%)

Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
           + + W    E+ V  +LP+G+P SV+SDYLNYSLWR VQGVASQISGVL+TQALLYA+GL
Sbjct: 40  LGRSWRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLYAVGL 99

Query: 169 GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
           GKGAIPTAAA+NWVLKDG+GYLSKIMLS FGRHFDVNPKGWRLFAD LEN A+GLE+LTP
Sbjct: 100 GKGAIPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLEILTP 159

Query: 229 AFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
            FPH FV IGAAAGAGRSAA+LIQ++TRSCFYAGFA +RNFAEVIAKGEAQGMVSK +GI
Sbjct: 160 VFPHLFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMVSKFVGI 219

Query: 289 MLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
            LGIALANHIGSS+P AL SF+ +T +HMYCNLKSYQSI+LRTLNPYRASLVFSEYLLSG
Sbjct: 220 GLGIALANHIGSSVPLALISFAGITAVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 279

Query: 349 QAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVN 408
           Q P VKEVNDEEPL     FF +      Q  +LS+EAKDAA  I  RLQ+GSKLS+++ 
Sbjct: 280 QVPSVKEVNDEEPL-----FFNLSLGASRQSKILSAEAKDAADRICRRLQMGSKLSEIIE 334

Query: 409 NKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVA 468
           +KEDA ALF LY++E Y+L ++ GKFC+VLKE + P+DMLKSLF  +YLYWLER  G   
Sbjct: 335 SKEDACALFDLYKNEQYLLMDYKGKFCIVLKEGSSPEDMLKSLFHVNYLYWLERYMGFKP 394

Query: 469 TSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPG 521
           ++ +++C PGGRLE+SLDY QREF+HVK D +  GWV DGLIARPLP RIR G
Sbjct: 395 SNVASECRPGGRLEVSLDYAQREFSHVKHDGSDGGWVMDGLIARPLPVRIRIG 447


>gi|218194530|gb|EEC76957.1| hypothetical protein OsI_15243 [Oryza sativa Indica Group]
          Length = 836

 Score =  516 bits (1328), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 268/363 (73%), Positives = 303/363 (83%), Gaps = 4/363 (1%)

Query: 161 ALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAA 220
           ALLYA+GLGKGAIPTAAA+NWVLKDG+GYLSKI+LS FGRHFDVNPKGWRLFADLLEN A
Sbjct: 476 ALLYAVGLGKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTA 535

Query: 221 FGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQG 280
           +GLE+LTP FPH FV IGAAAGAGRSAAALIQA+TRSCFYAGFA +RNFAEVIAKGEAQG
Sbjct: 536 YGLEILTPVFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAVQRNFAEVIAKGEAQG 595

Query: 281 MVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
           MVSK +GIMLGIALAN IGSS+  AL SF+ VT +HMYCNLKSYQSI+LRTLNPYRASLV
Sbjct: 596 MVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLRTLNPYRASLV 655

Query: 341 FSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLG 400
           FSEYLLSGQ P VKEVNDEEPLF        +  +K    +LS +AKDAA  I  RLQLG
Sbjct: 656 FSEYLLSGQVPSVKEVNDEEPLFLNLSVGTSRKESK----ILSPQAKDAAEIICRRLQLG 711

Query: 401 SKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
           SKLS+++ NKEDA ALF LY++E Y+LTE+ GKFCV+LKE + P+DMLKSLF  +YLYWL
Sbjct: 712 SKLSEIIENKEDACALFDLYKNEQYLLTEYKGKFCVILKEGSSPEDMLKSLFHVNYLYWL 771

Query: 461 ERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRP 520
           E+  GI  +  ++ C PGGRLE SLDY QREF+HVK D ++ GWV DGLIARPLP RIR 
Sbjct: 772 EKYLGIKPSDVASACRPGGRLEASLDYTQREFSHVKLDGSNGGWVMDGLIARPLPVRIRI 831

Query: 521 GYV 523
           G V
Sbjct: 832 GDV 834



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 146/235 (62%), Positives = 172/235 (73%), Gaps = 24/235 (10%)

Query: 66  GNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS----------------ASNASLSSLL-S 108
           G +E  +VWEV+G  RT L+PD T  ++VVA                 A  A L++L   
Sbjct: 246 GGREGASVWEVRGGARTLLVPDPTGTSYVVAGDGRRKQAEGEDAEKAGAGRAELAALRRQ 305

Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
           + + W  C  + VQ +LP+G+PDSV+SDYL YSLWR VQG+ASQISGVL+TQALLYA+GL
Sbjct: 306 LERSWRRCATVAVQLLLPDGYPDSVSSDYLQYSLWRGVQGIASQISGVLSTQALLYAVGL 365

Query: 169 GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
           GKGAIPTAAA+NWVLKDG+GYLSKI+LS FGRHFDVNPKGWRLFADLLEN A+GLE+LTP
Sbjct: 366 GKGAIPTAAAVNWVLKDGLGYLSKILLSKFGRHFDVNPKGWRLFADLLENTAYGLEILTP 425

Query: 229 AFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAAR--RNFAEVIAKGEAQGM 281
            FPH FV IGAAAGAGRSAAALIQ       +A F      +  +VIAKGEAQGM
Sbjct: 426 VFPHLFVPIGAAAGAGRSAAALIQT-----IFAHFMLENANHMRKVIAKGEAQGM 475


>gi|168005499|ref|XP_001755448.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693576|gb|EDQ79928.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 503

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 251/479 (52%), Positives = 330/479 (68%), Gaps = 4/479 (0%)

Query: 49  LQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLS 108
           LQ  +AIA        D  K    +WEVKG +   L+    +D FVV +  N+   +L  
Sbjct: 24  LQQQSAIALLGPREVVD--KARTGIWEVKGGRWKYLVKHPERDEFVVQTVKNSEEEALAY 81

Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
              +     EL  Q +LP+G+P SVT DY  Y+LWR  Q +ASQISGVL TQALLYA+GL
Sbjct: 82  EEDVITRFVELGKQIVLPDGYPQSVTDDYFEYTLWRMGQVIASQISGVLTTQALLYAVGL 141

Query: 169 GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
           GKGAIPTAAA+NWVL+DGIGYLSKI+LS +GRHFDV+PKGWRL +DL+EN ++GLE+LTP
Sbjct: 142 GKGAIPTAAAVNWVLRDGIGYLSKIVLSKYGRHFDVHPKGWRLVSDLIENMSYGLELLTP 201

Query: 229 AFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
           AFPH FV++ AAAGAGRSA+ LIQA+T+SCF AG AA RNFAE+IAKGEAQGMVSK+IGI
Sbjct: 202 AFPHLFVYLAAAAGAGRSASGLIQAATKSCFNAGMAANRNFAEIIAKGEAQGMVSKSIGI 261

Query: 289 MLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
            LGIA++ ++GS+ P  + +F  VT +H++CNLKSYQ+++LRTLNPYRASLV +EY  SG
Sbjct: 262 ALGIAVSAYVGSTGPLLVGTFFGVTALHIFCNLKSYQAVQLRTLNPYRASLVLAEYFRSG 321

Query: 349 QAPPVKEVNDEEPLFPAFHFFKIK-SANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVV 407
               V+EVN EEP+FP F FF  +   +K    +LS E+K  A  IE+ L+ G+     V
Sbjct: 322 NVVSVREVNGEEPIFP-FSFFNFRLRGSKGDGTLLSEESKAVAHSIENHLEFGATFGAAV 380

Query: 408 NNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIV 467
             +E+A AL  +Y+ E Y+L++H  +  VVLKE A P+DML+++ QA YL +L+      
Sbjct: 381 KTREEADALLDIYKSERYLLSQHDDRLLVVLKEGATPRDMLRAMMQAIYLDYLQGTGFTS 440

Query: 468 ATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPGYVEPS 526
             +   D   GG L +S D+V  +F  ++ D A VGW+ +GLIARP PNR+      P+
Sbjct: 441 HRNLMHDSGNGGVLRVSHDFVVNQFEQIRQDIAGVGWICEGLIARPAPNRLLESRPSPT 499


>gi|302816607|ref|XP_002989982.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
 gi|300142293|gb|EFJ08995.1| hypothetical protein SELMODRAFT_130761 [Selaginella moellendorffii]
          Length = 440

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 228/424 (53%), Positives = 291/424 (68%), Gaps = 10/424 (2%)

Query: 96  ASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISG 155
           A A +   S  L V+++   C +     MLP G+P +V+ DY  Y+LWR  Q +ASQISG
Sbjct: 25  AGAVSTGRSRKLGVDEIVSWCLDALKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISG 84

Query: 156 VLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADL 215
           VL TQALLYA+GLGKGAIPTAAAINWVLKDGIGYLSKI+LS +GRHFDV+PKGWRL +DL
Sbjct: 85  VLTTQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDL 144

Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
           +E+ A GLE+LTP FPH FV + A AGAG+SAA LIQA+TRSCFYAGFA + NFAEVIAK
Sbjct: 145 IEDGACGLELLTPTFPHLFVPLTAVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAK 204

Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
           GEAQGMVSK++GI LGI +++ +GSS P  + SF  VT +H+ CNLKSYQ+++L TLNPY
Sbjct: 205 GEAQGMVSKSLGIALGIGISSKVGSSGPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPY 264

Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH 395
           R  L+ +EYL + + P V++VN  EP+F    FF  K A  S  L +S E+K AA +IE 
Sbjct: 265 RTGLIMAEYLTTQRIPSVRDVNSVEPIFTVADFFSSKKAPVSTHLSISCESKCAAADIEK 324

Query: 396 RLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT-EHGGKFCVVLKESALPQDMLKSLFQA 454
            L+LG    + V ++  A AL  LY  E Y+LT +    F VVLK+SA P DML+++ QA
Sbjct: 325 NLKLGVSFGEAVASRGAAQALLDLYSGEEYLLTYDDSHCFKVVLKKSASPHDMLRAVVQA 384

Query: 455 SYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPL 514
           SYL  +         S S D      L++S   +Q+ FN +     + GW T GL+ARP 
Sbjct: 385 SYLRHIHNQ-----QSLSKD----ELLKLSYQSMQQSFNELIQGLQAAGWSTSGLVARPG 435

Query: 515 PNRI 518
            NR+
Sbjct: 436 SNRL 439


>gi|449484579|ref|XP_004156921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 443

 Score =  441 bits (1133), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 228/308 (74%), Positives = 259/308 (84%), Gaps = 1/308 (0%)

Query: 67  NKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLP 126
           N   +++WEVKG KR +LI D  +D F VA+   +S  S   VN +W  C ++F + MLP
Sbjct: 125 NMNTESIWEVKGGKRIRLILDTYRDEFHVATGMPSSSLSFSFVN-VWLRCSDIFTRLMLP 183

Query: 127 EGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDG 186
           EGFPDSVTSDYL YSLWR VQG+ASQ+SGVLATQALLYA+GLGKGAIPTAAA+NWVLKDG
Sbjct: 184 EGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLKDG 243

Query: 187 IGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
            GYLSKI LS +GRHFDV+PKGWRLFADLLENAA+G+EMLTPAFP HFV IGAAAGAGRS
Sbjct: 244 FGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVIGAAAGAGRS 303

Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL 306
           AAALIQA+TRSCFYAGFAA+RNFAEVIAKGEAQGMVSK+IG+MLGI LAN I SS   AL
Sbjct: 304 AAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANRIRSSTSLAL 363

Query: 307 ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
             FS+VT IHM+CNLKSY+SI+LRTLNPYRASLVFSEYLLSG+ P +K+VN+EEPLFPA 
Sbjct: 364 GCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVNNEEPLFPAV 423

Query: 367 HFFKIKSA 374
                K A
Sbjct: 424 PLLNRKLA 431


>gi|302771033|ref|XP_002968935.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
 gi|300163440|gb|EFJ30051.1| hypothetical protein SELMODRAFT_31929 [Selaginella moellendorffii]
          Length = 389

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 220/399 (55%), Positives = 278/399 (69%), Gaps = 14/399 (3%)

Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT 175
           C +     MLP G+P +V+ DY  Y+LWR  Q +ASQISGVL TQALLYA+GLGKGAIPT
Sbjct: 4   CLDALKTTMLPIGYPKTVSDDYTEYTLWRMGQIIASQISGVLTTQALLYAVGLGKGAIPT 63

Query: 176 AAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
           AAAINWVLKDGIGYLSKI+LS +GRHFDV+PKGWRL +DL+E+ A GLE+LTP FPH FV
Sbjct: 64  AAAINWVLKDGIGYLSKIVLSKYGRHFDVHPKGWRLLSDLIEDGACGLELLTPTFPHLFV 123

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
            + A AGAG+SAA LIQA+TRSCFYAGFA + NFAEVIAKGEAQGMVSK++GI LGI ++
Sbjct: 124 PLTAVAGAGKSAAGLIQAATRSCFYAGFATQNNFAEVIAKGEAQGMVSKSLGIALGIGIS 183

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
           + +GSS P  + SF  VT +H+ CNLKSYQ+++L TLNPYR  L+ +EYL + + P V++
Sbjct: 184 SKVGSSGPLLVLSFGTVTAVHVLCNLKSYQAVQLHTLNPYRTGLIMAEYLTTQRIPSVRD 243

Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
           VN  EP+F    FF  K A     L +S E+K AA +IE  L+LG    + V ++  A A
Sbjct: 244 VNSVEPIFTVADFFSSKKAP----LSISCESKCAAADIEKNLKLGVSFGEAVASRGAAQA 299

Query: 416 LFSLYEDEGYILT-EHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSAD 474
           L  LY  E Y+LT +    F VVLK+SA P DML+++ QASYL  +         S S D
Sbjct: 300 LLDLYSGEEYLLTYDDSHCFKVVLKKSASPHDMLRAVVQASYLRHIHNQ-----QSLSKD 354

Query: 475 CAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARP 513
                 L++S   +Q+ FN +     + GW T GL+ARP
Sbjct: 355 ----ELLKLSYQSMQQSFNELIQGLQAAGWSTSGLVARP 389


>gi|147807901|emb|CAN62157.1| hypothetical protein VITISV_025492 [Vitis vinifera]
          Length = 582

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 228/426 (53%), Positives = 280/426 (65%), Gaps = 45/426 (10%)

Query: 27  DSSGSFYSLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIP 86
           +++    ++LLFV S+LY F H Q+  A+++          KE + VWEV+G K  K+IP
Sbjct: 130 ETAHKLRAVLLFVFSVLYSFFHFQLDAALSK---------EKEEEGVWEVRGGKWHKIIP 180

Query: 87  DFTKDAFVVAS----ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSL 142
           D +KD F+V +    A  A  SS L    LW +C+ELF++ MLPEGFP SVTSDYL+Y+L
Sbjct: 181 DSSKDEFLVVTPGIGAVGAPKSSTLP--NLWLQCKELFLRLMLPEGFPHSVTSDYLDYTL 238

Query: 143 WRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHF 202
           WR VQGVASQISGVLATQALLYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +GRHF
Sbjct: 239 WRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHF 298

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ--------AS 254
           DV+PKGWRLFADLLENAA+GLE+LTPAFPH F+ IGA AGAGRSAAALIQ         S
Sbjct: 299 DVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGAVAGAGRSAAALIQLLFHTHNHLS 358

Query: 255 TRSCFYAGFAARRNFA------EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALAS 308
           + +     +    N A      ++   GE  G        + G  L     S++   L  
Sbjct: 359 SFALLLGSYFPLNNPAIGNSKRKMPTHGEGDGG-ENVTAFLGGPRLLXGAVSTLALLLQ- 416

Query: 309 FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHF 368
                        +++  +  +   P R  LVFSEYLLSGQ P +KEVN+EEPLFP    
Sbjct: 417 -------------RNFAEVIAKGEAP-RNGLVFSEYLLSGQVPSIKEVNEEEPLFPVVPL 462

Query: 369 FKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT 428
              K   K+Q  VLS+EAKDAA EIE RLQLGSKLS+VV++KED  ALF LY +E YILT
Sbjct: 463 LNAKPXYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALFDLYRNEAYILT 522

Query: 429 EHGGKF 434
           EH G+F
Sbjct: 523 EHKGRF 528


>gi|414879010|tpg|DAA56141.1| TPA: hypothetical protein ZEAMMB73_060175 [Zea mays]
          Length = 368

 Score =  290 bits (742), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 162/254 (63%), Positives = 193/254 (75%), Gaps = 27/254 (10%)

Query: 60  TSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASN----------ASLSSLLS- 108
           TS +D G+     VWEV+G KRT+L+PD T  ++++A               S+S  L+ 
Sbjct: 123 TSGDDAGD-----VWEVRGGKRTRLMPDPTWTSYLIAGDDGTKREDGDEKVGSISKDLAA 177

Query: 109 ----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLY 164
               + + W    E+ V  +LP+G+P SV+SDYLNYSLWR VQGVASQISGVL+TQALLY
Sbjct: 178 LRRQLGRSWRRYAEVAVHLLLPDGYPHSVSSDYLNYSLWRGVQGVASQISGVLSTQALLY 237

Query: 165 AIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLE 224
           A+GLGKGAIPTAAA+NWVLKDG+GYLSKIMLS FGRHFDVNPKGWRLFAD LEN A+GLE
Sbjct: 238 AVGLGKGAIPTAAAVNWVLKDGLGYLSKIMLSKFGRHFDVNPKGWRLFADFLENMAYGLE 297

Query: 225 MLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSK 284
           +LTP FPH FV IGAAAGAGRSAA+LIQ++TRSCFYAGFA +RNFAEVIAKGEAQGM + 
Sbjct: 298 ILTPVFPHLFVPIGAAAGAGRSAASLIQSATRSCFYAGFAVQRNFAEVIAKGEAQGMFTC 357

Query: 285 AIGIMLGIALANHI 298
                  IA ++HI
Sbjct: 358 -------IATSSHI 364


>gi|452824810|gb|EME31810.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
          Length = 636

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 123/335 (36%), Positives = 192/335 (57%), Gaps = 25/335 (7%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIP-TAAAINWVL 183
           LP+G+P+SVT DYL+YS+WR++Q +A+ + GV AT+ALL+ +GLG+ + P T+AAI+WVL
Sbjct: 280 LPDGYPNSVTKDYLSYSIWRAIQNLAASVMGVFATEALLFGLGLGRKSTPATSAAISWVL 339

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KDG+GY+ K+   +  G  FDV+PK WRL AD +E+A   LE++TP FP HF+ + + A 
Sbjct: 340 KDGLGYIGKVFYGSIAGNQFDVDPKSWRLAADAVEDAGGVLEIITPLFPGHFLLLASVAN 399

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
           A +  AA+   +TR   Y   A   N  ++  KGE+QG+  K +G+ LGI  ++ +G + 
Sbjct: 400 AMKGVAAMTGTATRHAIYKSLALHENQGDIATKGESQGVTCKMVGLGLGIVASSKMGQNY 459

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
              L+++++  ++H++ N KS   ++  TLN  R SL   E+L +G  P   EV+  E +
Sbjct: 460 FALLSAYALGCFVHLFANWKSLSCVQFATLNRQRCSLALQEFLQTGNVPSPYEVSRREKV 519

Query: 363 FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYED 422
                   +    K  L                R+ +G+ L +   N  +     +LY D
Sbjct: 520 --------VVPPWKGYL---------------DRIVVGANLKECFQNGRELREAVALYRD 556

Query: 423 EGYILTEHGGKFCVVLKESALPQDMLKSLFQASYL 457
             YILT+   K  +VL ES  PQD+ ++  QA  L
Sbjct: 557 CRYILTQRDNKILIVLHESCRPQDIFQAFMQAHKL 591


>gi|301096265|ref|XP_002897230.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262107315|gb|EEY65367.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 474

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 121/366 (33%), Positives = 191/366 (52%), Gaps = 17/366 (4%)

Query: 91  DAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVA 150
           D  +      AS +  L+++ L  E +E+F    LP G+PDSV+ DYL +  W ++Q + 
Sbjct: 50  DQLLSTGQGRASNAGCLTLSGLSRELQEMF----LPAGYPDSVSEDYLGFQFWDTLQAMC 105

Query: 151 SQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKG 208
           S + GVLATQ++L ++G+G   A P AAA+ WVL+DG G +  +  + F G  FDVN K 
Sbjct: 106 SYLRGVLATQSVLQSVGVGDDKATPLAAALQWVLRDGSGMIGGLTFAYFVGPKFDVNVKR 165

Query: 209 WRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRN 268
           WRLFAD++ +    L+M+ P FP     +   +   ++   +   +TRS     FA R N
Sbjct: 166 WRLFADVVNDVGLTLDMVAPYFPTLVTEVLCVSSVCKAMCGVAAGATRSSLMTHFAKRDN 225

Query: 269 FAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIE 328
            A+  AK  +Q    K  G+M G+  AN + SS      +F V+T +H+Y N  +   + 
Sbjct: 226 MADCAAKEGSQETAVKLFGLMFGMYFANAVNSSPHAVWMAFLVLTLVHVYANYNAVSCLC 285

Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKD 388
           + T+N  R  ++   +L+SG  P  + V D +        + I+S N+ + +      KD
Sbjct: 286 IPTVNCSRGLILVQRFLMSGNKPGKENVADHQGR------YSIRSVNQEEPIF-----KD 334

Query: 389 AAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDML 448
             + +   L +GS+L   V    D   L  +YEDE Y+LT  G +  V+ +  A P D L
Sbjct: 335 PVLPVTSHLMMGSELHQAVPTSTDLDRLLQIYEDETYLLTVVGDRVHVMFEAEAKPTDEL 394

Query: 449 KSLFQA 454
           ++ FQA
Sbjct: 395 RAFFQA 400


>gi|348671592|gb|EGZ11413.1| hypothetical protein PHYSODRAFT_338124 [Phytophthora sojae]
          Length = 478

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 115/355 (32%), Positives = 187/355 (52%), Gaps = 17/355 (4%)

Query: 102 SLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQA 161
           S +  L+++ L  E +E+F    LP G+PDSV+ DYL++  W ++Q + S + GVLATQ+
Sbjct: 65  SAAGCLTLSGLSRELQEMF----LPAGYPDSVSEDYLSFQFWDTLQAMCSYLRGVLATQS 120

Query: 162 LLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENA 219
           +L ++G+G   A P AAA+ WVL+DG G +  +  +   G  FDVN K WRLFAD++ + 
Sbjct: 121 VLQSVGVGDDKATPLAAALQWVLRDGSGMIGGLTFAYLVGPKFDVNVKRWRLFADVINDV 180

Query: 220 AFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
              L+M+ P FP     +   +   ++   +   +TRS     FA + N A+  AK  +Q
Sbjct: 181 GLTLDMVAPYFPTLVTEVLCVSSVCKTMCGVAAGATRSSLMTHFAKKDNMADCAAKEGSQ 240

Query: 280 GMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
               K  G+M G+  AN + SS      +F ++T IH+Y N  +   + + T+N  R  +
Sbjct: 241 ETAVKLFGLMFGMYFANAVNSSPQAVWVAFLLLTVIHVYANYNAVSCLCIPTVNCSRGLI 300

Query: 340 VFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL 399
           +   +L SG  P  + V D          + I+S N+ + +      KD  + +   L +
Sbjct: 301 LVRRFLASGNKPGKENVADHN------GRYSIRSVNQEEPIF-----KDPVLPVTSHLVM 349

Query: 400 GSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
           G++L   V +  D   L  +Y DE Y+LT  G +  V+ +  A P+D L++ FQA
Sbjct: 350 GAELHQAVPSSHDLDRLLHIYSDEAYLLTVVGDRVLVMFQAEAKPKDELRAFFQA 404


>gi|452823212|gb|EME30224.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
          Length = 849

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 125/387 (32%), Positives = 206/387 (53%), Gaps = 33/387 (8%)

Query: 74  WEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSV 133
           ++V+GSKR  L  D +   F +AS     L    S  K W   R   ++F+    FP+ V
Sbjct: 372 YDVEGSKRRSLEWDESSARFKLASHDEEGLDWSTSW-KTWLTKR---IEFLRQSFFPEGV 427

Query: 134 TSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKI 193
           + DY ++++WR  Q + S   GV  TQALL A+G+  G I  AAA +WVLKDG+G L K+
Sbjct: 428 SQDYYSFTMWRVFQRLVSSTVGVFGTQALLLALGIKAGRIGRAAATSWVLKDGLGRLGKM 487

Query: 194 M-LSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
           +  S  GR FD +PK WR  + LL     GLE++T  FP  F+     A + +  + L  
Sbjct: 488 VWASQMGRDFDADPKRWRFRSALLYALGNGLEIVTQIFPASFLLFATLANSMKQISMLTA 547

Query: 253 ASTRSCFYAGFAAR-RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSV 311
           ++TR+  Y  FA R  N A++ AKGEAQ +V+  +G+ LGI L+  IG+S P  L ++S 
Sbjct: 548 SATRNAMYKNFAGRSENIADITAKGEAQIVVADLLGMALGIQLSKLIGTSRPNVLTAYSF 607

Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKI 371
           ++ + +Y   K  ++++ RTLN  R+S++   ++  G  P   EV+  E +F     + +
Sbjct: 608 LSVLDIYAFYKELRAVQFRTLNYERSSMIVDYFVRRGYVPRPDEVSQRENIFLGPR-YDV 666

Query: 372 KSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEH- 430
           +S                         + + LSD V+  ++ ++L  +++ E +++T   
Sbjct: 667 RS-------------------------MFASLSDAVSCPQELNSLIKIFKGEQFMVTRGW 701

Query: 431 GGKFCVVLKESALPQDMLKSLFQASYL 457
            G++ +VL+E A   D+L++L    +L
Sbjct: 702 NGQYRIVLREHARNGDILRALLTLGFL 728


>gi|449016609|dbj|BAM80011.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 496

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 122/404 (30%), Positives = 195/404 (48%), Gaps = 29/404 (7%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTA 176
           + F +  LP+G+PDSV  DY  Y+ WR        + GV +TQ+LL A+G+G+  A P A
Sbjct: 118 DRFSRMFLPDGYPDSVGPDYAEYTKWRGRAFFFGGMVGVFSTQSLLLAVGVGRPSAAPIA 177

Query: 177 AAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
           AA+ WV++DG+G   +++ S  G  FD   K +RL A  + N +  LE LTP FPH F+ 
Sbjct: 178 AALQWVIRDGLGRAGRMLFSQVGTGFDAETKQYRLAAAFVLNVSCALEALTPLFPHLFLP 237

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           + + A   + A+ +  ASTR   Y  F  R N  ++ AK E  G+    +G  +G+ L+ 
Sbjct: 238 LASLANMAKGASTVAAASTRGAIYRSFMRRENLGDITAKQETVGVAGDLLGTAVGVILSR 297

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL-SGQAPPVKE 355
               S   ++ +F   +  H+       +S+ELRTLN  RA ++   YL    +  P+++
Sbjct: 298 ASSHSHRLSVIAFIGASLGHLISAYYEVKSVELRTLNRQRAHMLIWNYLEDQDRVLPIRD 357

Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
           VN  E L        + + N                     +++G++LSD   + +    
Sbjct: 358 VNRRERLVYRPWLDSLHAPN---------------------VEIGARLSDAAPDADALKF 396

Query: 416 LFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADC 475
           L  LY DE Y++   GG+  +VL+  A  QD LK+ FQ+   + L   +G   +   A  
Sbjct: 397 LLRLYRDEKYMMNYAGGRVRIVLRNDATHQDQLKAYFQSRAFWRLYMTSGERFSPNRA-- 454

Query: 476 APGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIR 519
                +E S     + F     D  + GW  D ++ RPL  R++
Sbjct: 455 ----LIEDSYRATNQLFPSFVRDCRAEGWNVDCVLLRPLARRVQ 494


>gi|449528331|ref|XP_004171158.1| PREDICTED: uncharacterized protein LOC101231720 [Cucumis sativus]
          Length = 157

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 92/153 (60%), Positives = 114/153 (74%)

Query: 371 IKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEH 430
           +K+  + +L +LS+EAK++A  IE RLQLGSKLSDV   +ED   L SL+  E YIL+EH
Sbjct: 4   VKALKEPKLSLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELLSLFNKENYILSEH 63

Query: 431 GGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQR 490
            GK+CV+LKESA P DMLK++F  +YL+WLERNAGI A S S DC PGGRL++SL+YV+R
Sbjct: 64  RGKYCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRPGGRLQMSLEYVER 123

Query: 491 EFNHVKSDSASVGWVTDGLIARPLPNRIRPGYV 523
           EF HVK D    GW TDGLIARPL  RI   +V
Sbjct: 124 EFKHVKYDGELAGWSTDGLIARPLTTRICECHV 156


>gi|449016577|dbj|BAM79979.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 719

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 132/415 (31%), Positives = 212/415 (51%), Gaps = 34/415 (8%)

Query: 108 SVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIG 167
           S   L +  R   +Q  LP G+P +VT DYL ++LWR +Q  A+ + GV +T+ALL  +G
Sbjct: 326 SERSLVERLRNWLIQVYLPHGYPHTVTQDYLAFTLWRLLQNTAASVMGVFSTEALLLGLG 385

Query: 168 LGKGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEML 226
           LGK    TA A+ W+LKDG G+  KI  +   G+ FD++P+ WR+ +DLLE+ A  LE++
Sbjct: 386 LGKNVSGTAQAVQWILKDGFGHAGKIAYAAVAGKQFDMDPRSWRIMSDLLEDLAGVLEII 445

Query: 227 TPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
           TP F   F+ + + A   R+ AA+   +TR   +   A   N  ++  KGE+QG+ +K I
Sbjct: 446 TPIFSGQFLVLASIATTMRAIAAMTGTATRHAIFKSMALAENQGDLATKGESQGVTTKLI 505

Query: 287 GIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
           G+ LGI L+  IG      L ++ +   IH+  N K+ Q +E  TLN  R +L+   + +
Sbjct: 506 GLGLGITLSKRIGQDYTNLLIAYGIAAVIHLAANFKAMQCVEFSTLNRQRMALLLDAF-M 564

Query: 347 SGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDV 406
           S +A   ++ + +  L P       +   K + ++     +D   EI      G+ L  V
Sbjct: 565 SARAQTSRQASRQLTL-PT----PREICAKERFMI--PPWRDYRPEIV----FGASLQRV 613

Query: 407 VNNKEDAHALFSLYEDEGYIL-TEHGGKFCVV-LKESALPQDMLKSLFQASYLYWLERNA 464
            + +   H   +LY +E YI+  E   K  +V L++ A  +D LK+   A   Y L+R+A
Sbjct: 614 KHARHFLHCA-ALYRNERYIIYAEPSTKHVLVFLRDGATVKDQLKAFMNAQR-YLLDRDA 671

Query: 465 GIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIR 519
                            E + ++ QREF+     + + GW T+  +  P   RIR
Sbjct: 672 -----------------ERAYNFTQREFSTFLQATVASGWSTNVSLLNPTSYRIR 709


>gi|357487621|ref|XP_003614098.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
 gi|355515433|gb|AES97056.1| hypothetical protein MTR_5g044720 [Medicago truncatula]
          Length = 749

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 127/400 (31%), Positives = 207/400 (51%), Gaps = 34/400 (8%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
           F+Q  +PEGFP SVT DY+ + +W  +QG+++ I  +L+TQALL AIG+G K A    A 
Sbjct: 363 FLQAFVPEGFPSSVTPDYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGAT 422

Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
             W L+D  G L  I+ + + G + D N K WRL ADL+ +    +++++P FP  FVFI
Sbjct: 423 FQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVFI 482

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
                  RS   +   +TR+     FA + N A++ AK  +Q  V+  IG+ LG+ +A  
Sbjct: 483 VCLGSISRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVA-R 541

Query: 298 IGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
           I +  P A+  SF  +T  HMY N ++ Q + L +LNP R+S++   ++ S Q    K+V
Sbjct: 542 ITTGHPLAIWFSFLSLTLFHMYANYRAVQCLSLNSLNPERSSILLQRFMESRQVLSPKQV 601

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLS--DVVNNKEDAH 414
           +  E + P             QL  +SS+  ++   ++ +++LG+++S  D +  KE   
Sbjct: 602 SPLEHVLPI------------QLTSMSSKKVNS---LDSKVRLGTRISSFDEMEIKEHLL 646

Query: 415 ALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSAD 474
           +L S Y    YIL E  G   V++ + +   D+LKS   A  L     N      S  +D
Sbjct: 647 SLASYYAKAKYILVEKKGIIDVIVHKDSSSADVLKSFIHALVL----ANNVYKGKSLHSD 702

Query: 475 CAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPL 514
                    S  +++ ++        S+GW T+ L++ P+
Sbjct: 703 ---------SQMWMENQYEVFIQKLKSLGWKTERLLSSPI 733


>gi|449018781|dbj|BAM82183.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 631

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 113/340 (33%), Positives = 174/340 (51%), Gaps = 35/340 (10%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLK 184
           LPEGFP+SV   YL YSLWR  Q V S ++ VL+T ALL A+GL   + P AAA +WVLK
Sbjct: 243 LPEGFPESVGPSYLRYSLWRGAQNVVSAMTAVLSTHALLAAVGLS--STPVAAATSWVLK 300

Query: 185 DGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAG 244
           DGIG L K++ +  GR FD +PK +RL +DLL +A   LE+LTP FP +F+ + A     
Sbjct: 301 DGIGQLGKLLTARRGRDFDADPKRYRLSSDLLYDAGLSLEILTPLFPQYFLALAALGNFT 360

Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPF 304
           +S A  +  + R+   + F  R N  ++ AK +AQ +V+   G+ LGI  A  +      
Sbjct: 361 KSVAITVGMACRNSVLSSFVLRGNLGDISAKNDAQNVVTNLTGMGLGIVTARALPPKPSV 420

Query: 305 ALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG--QAPPVKEVNDEEPL 362
            L +F+ +T ++   N +S ++++L  +N  R +++   ++ S       V+ VND E +
Sbjct: 421 RLGAFAALTALYSVLNYQSMKAVKLNVVNRQRGAIIADSFVRSHAEDVLDVETVNDLECI 480

Query: 363 FPAF--HFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLY 420
            P    HF  +                         L+LG  L  V              
Sbjct: 481 IPFVPEHFQWLP------------------------LKLGVSLERV-----GRLPPPQRR 511

Query: 421 EDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
               Y+++ +  +  V L E A P+D+L+ L Q++YL +L
Sbjct: 512 PRARYVMSMNERELRVALHERAAPRDLLQVLLQSAYLRYL 551


>gi|302764074|ref|XP_002965458.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
 gi|300166272|gb|EFJ32878.1| hypothetical protein SELMODRAFT_230646 [Selaginella moellendorffii]
          Length = 430

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 116/397 (29%), Positives = 195/397 (49%), Gaps = 41/397 (10%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVL 183
           ++PEGFPDSV   Y  Y  WR +Q       GV  T++LL+A+G+  GA  +A A+NWV+
Sbjct: 71  VVPEGFPDSVLPCYTPYMQWRCLQYFFGGAMGVFTTRSLLHALGVRNGASSSAIAVNWVV 130

Query: 184 KDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGA 243
           KDG G + K++ +  G+ FD + K  R   D+L   A  LE+ T A P  F+ +   A  
Sbjct: 131 KDGAGRIGKMIFARHGKKFDCHLKQIRFAGDVLMQLAAALELATSATPQFFLPLACIANI 190

Query: 244 GRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMP 303
           G++ AA+   STR+  Y  FA R N  ++ AKGE  G ++  +G  +GI ++ +     P
Sbjct: 191 GKNVAAVASTSTRAPIYKAFARRENIGDITAKGECIGNIADLLGTGMGILMSKNFPVFAP 250

Query: 304 FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
           F+  +F      ++Y + +  ++I+L TLN +R  +    +L +G+ P + E N+ E   
Sbjct: 251 FSALAFG-----YVYSSFREVKAIQLPTLNRHRFGIAVDTFLETGKVPGLVEANERE--- 302

Query: 364 PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDE 423
                 +I    +  L                +L+LG+++SD  +  +D   + SL++ E
Sbjct: 303 ------RIIVGPQWSL---------------EKLELGARVSDAFSAPQDYQYIDSLFKGE 341

Query: 424 GYILT--EHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRL 481
            Y+++      +  VVLKE A   D+L++ F             I    +SA+    G  
Sbjct: 342 NYLVSYNPKRQRTYVVLKERANGDDVLRAAFHGQLF------VNIFDKCSSAE----GAY 391

Query: 482 EISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRI 518
           + S  Y+   + H K D+  +GW+    +  P   R+
Sbjct: 392 QESCKYISPLYEHFKLDAEKLGWIMSETLLSPGNARV 428


>gi|356497988|ref|XP_003517837.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 473

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 124/388 (31%), Positives = 197/388 (50%), Gaps = 28/388 (7%)

Query: 75  EVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVT 134
           E  GS  TKL   FT  A   +S+S     S      +W    + FV    PEGFP SVT
Sbjct: 50  EWNGSSPTKLSKTFTIKA---SSSSVLITRSGARFTHVWRRLLQAFV----PEGFPSSVT 102

Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKI 193
           +DY+ + +W  +QG+++ I  +L+TQALL AIG+G K A    A   W L+D  G L  I
Sbjct: 103 ADYVPFQIWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRDLTGMLGGI 162

Query: 194 MLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
           + + + G + D N K WRL ADL+ +    +++++P FP  FVF+       RS   +  
Sbjct: 163 LFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGSISRSFTGVAS 222

Query: 253 ASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSV 311
            +TR+     FA + N A++ AK  +Q  V+  IG+ LG+ +A  +    P A+  SF  
Sbjct: 223 GATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVAR-LTIGHPLAIWFSFLS 281

Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKI 371
           +T  HMY N ++ + + L +LNP R+S++   ++ +GQ    K+V+ +E + P       
Sbjct: 282 LTVFHMYANYRAVRCLALNSLNPERSSILLQHFMETGQVLSPKQVSSQEHVLPI------ 335

Query: 372 KSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS--LYEDEGYILTE 429
                 Q +  SS+  +    +  ++ LG ++S   N +   H L +   Y    Y+L E
Sbjct: 336 ------QFISWSSKKANC---LHKKVCLGMRISSFDNMEIKEHLLCAAPYYTKAKYLLVE 386

Query: 430 HGGKFCVVLKESALPQDMLKSLFQASYL 457
             G   V++ + +   D+LKS F A  L
Sbjct: 387 RKGIIDVIVHKDSNAADVLKSFFHALVL 414


>gi|320170693|gb|EFW47592.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 710

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 90/237 (37%), Positives = 144/237 (60%), Gaps = 1/237 (0%)

Query: 111 KLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK 170
           +L ++ R   ++  LP+G+P SVT +Y+ YS W++VQ V   ++GVL+TQALLYA+GLG 
Sbjct: 148 RLREDTRHRLIEAFLPKGYPHSVTPNYMGYSRWQAVQSVTGTMTGVLSTQALLYAVGLGA 207

Query: 171 GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPA 229
           GAIP A A+NW++KDG+G L  ++ S F    FD +PK  R +++    A+  LE+LTP 
Sbjct: 208 GAIPLAGALNWIVKDGLGQLGGVVYSTFISSKFDSDPKRHRFWSNAALQASTLLEILTPL 267

Query: 230 FPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
            P  F+F+ + +  G++ + L  +STR+  +     R N  +V  K  +Q + +  IG  
Sbjct: 268 APGMFLFLASVSNIGKNISWLAASSTRAQMHNSLTLRDNLGDVTGKAGSQAIATSLIGTG 327

Query: 290 LGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
           LGIA+A  +G+     L +F  ++ ++M  N +S   + +RTLN  RA  +F  +LL
Sbjct: 328 LGIAIAPFVGTDPIAVLLAFLPLSIVNMVSNYRSNTIVHMRTLNVQRAERLFMHFLL 384


>gi|356522111|ref|XP_003529693.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 488

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 184/366 (50%), Gaps = 35/366 (9%)

Query: 100 NASLSSLLSVNKLWDECRELF---VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGV 156
           N+ L   L+ N   D   E+      +++PEGFPDSVT  Y+ Y  WR+++       GV
Sbjct: 75  NSFLPVSLNSNHPRDPLHEVLSFVTSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGV 134

Query: 157 LATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADL 215
             TQ LL ++G+ +  A P A AINW+LKDG G + K++ +  G+ FD + K  R   DL
Sbjct: 135 FTTQTLLSSVGVCRNRAAPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDL 194

Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
           L     G+E+ T A PH F+ +  AA   ++ AA+   STR+  Y  FA   N  +V AK
Sbjct: 195 LMELGAGVELATAAVPHLFLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAK 254

Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
           GE  G ++  +G  L I +A       P  + +FS+++  ++  + +  +S+ L TLN  
Sbjct: 255 GECVGNIADLLGTGLSILIAKR----NPSLVTTFSLLSCGYILSSYREVKSVVLHTLNCG 310

Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH 395
           R S+    +L++GQ P ++E N  E +F                   S   KD  V    
Sbjct: 311 RFSVAVEHFLMTGQVPTLQEGNMNENIF-------------------SFPWKDRPV---- 347

Query: 396 RLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQ 453
              LGS++ +   +     A+  L++ E YI+T +  K  V  VLK+ A   D+LK+ F 
Sbjct: 348 --VLGSRIKEAFQDPSAYVAIEPLFDRERYIVTYNPSKHKVYAVLKDQAKSDDILKAAFH 405

Query: 454 ASYLYW 459
           A  L++
Sbjct: 406 AHVLFF 411


>gi|452819618|gb|EME26673.1| glutamate N-acetyltransferase isoform 2 [Galdieria sulphuraria]
 gi|452819619|gb|EME26674.1| glutamate N-acetyltransferase isoform 1 [Galdieria sulphuraria]
          Length = 540

 Score =  175 bits (444), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 114/355 (32%), Positives = 175/355 (49%), Gaps = 29/355 (8%)

Query: 107 LSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI 166
           L+   LW++       F LPEGFP+SV   YL YS+WR +Q + S IS VL+TQALL A 
Sbjct: 127 LNRQNLWEQITSSLSTFFLPEGFPESVGDSYLQYSIWRGIQNIISNISQVLSTQALLLAA 186

Query: 167 GLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEM 225
           G+G+  +   AAA  W  KDGIG + ++  +  G  +D +PK WRL +DLL +    LE+
Sbjct: 187 GVGRTHSETLAAATAWAWKDGIGQVGRLFTAVLGNQYDADPKRWRLVSDLLYDIGLSLEI 246

Query: 226 LTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKA 285
           L+PAF  +F+F+ A     +S A  I  S R      F  R N  E+ AK +AQ +V+  
Sbjct: 247 LSPAFSKYFLFMAALGNISKSIAFAIGISCRYSVLKTFVRRENLGEISAKNDAQNVVTGM 306

Query: 286 IGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           +G ++G+ ++  + ++    L  F   T I+ + N KS Q  EL T+N  R +++   YL
Sbjct: 307 LGTLIGVWMSKCLPAAPKVRLGVFLFFTAIYSFFNYKSMQVAELTTINRQRGNILAHSYL 366

Query: 346 LSGQAPPVKEVNDEE---PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK 402
           +    P V   N  E   PLFP                           +  H ++LG  
Sbjct: 367 MFQNVPSVSMSNSYERFVPLFPT------------------------VFQYPH-VRLGVG 401

Query: 403 LSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYL 457
           L+++  + E        +  E YIL+   G+  V L  +    D+L+ L   +++
Sbjct: 402 LTELSISIEKIANYTRQFSKEKYILSLSDGEIRVALHRNHESVDLLQLLLHIAFI 456


>gi|42562046|ref|NP_172832.3| uncharacterized protein [Arabidopsis thaliana]
 gi|28393739|gb|AAO42280.1| unknown protein [Arabidopsis thaliana]
 gi|28973405|gb|AAO64027.1| unknown protein [Arabidopsis thaliana]
 gi|332190945|gb|AEE29066.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 440

 Score =  175 bits (443), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 134/451 (29%), Positives = 219/451 (48%), Gaps = 47/451 (10%)

Query: 75  EVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVT 134
           E  GS  TKL     K A + AS+S +   S    N +W    + FV    PEGFP SVT
Sbjct: 16  EWNGSSSTKLF----KTATITASSSLSIQRSANRFNHVWRRVLQAFV----PEGFPGSVT 67

Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKI 193
            DY+ + LW ++QG+++    +L+TQALL AIG+G K A    A   W L+D  G L  I
Sbjct: 68  PDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGI 127

Query: 194 MLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
           + + + G + D N K WRL ADL+ +    +++L+P FP  F+ +       RS   +  
Sbjct: 128 LFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVAS 187

Query: 253 ASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSV 311
            +TR+     FA + N A++ AK  +Q  ++  +G+ LG+ LA    S  P A+  SF  
Sbjct: 188 GATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLA-RFTSGNPMAIWLSFLS 246

Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKI 371
           +T  HMY N ++ + + L +LN  R+S++ + ++ +GQ    ++V+  E + P +    +
Sbjct: 247 LTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQVSSMEGVLPLWA-TSL 305

Query: 372 KSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF-----SLYEDEGYI 426
           +S N   L                R+QLG ++S +   + D   L      S Y++  Y+
Sbjct: 306 RSTNSKPL--------------HKRVQLGVRVSSL--PRLDMLQLLNGVGASSYKNAKYL 349

Query: 427 LTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLD 486
           L    G   V+L + + P D+LKS   A  L      A ++  STS              
Sbjct: 350 LAHIKGNVSVILHKDSKPADVLKSYIHAIVL------ANLMEKSTSFYSEGEA------- 396

Query: 487 YVQREFNHVKSDSASVGWVTDGLIARPLPNR 517
           ++ + ++ +     S GW T+ L++  +  R
Sbjct: 397 WIDKHYDELLHKLRSGGWKTERLLSPSITWR 427


>gi|160333925|ref|NP_001103923.1| uncharacterized protein LOC555936 [Danio rerio]
 gi|126631351|gb|AAI34047.1| Zgc:162613 protein [Danio rerio]
          Length = 432

 Score =  175 bits (443), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 108/338 (31%), Positives = 166/338 (49%), Gaps = 28/338 (8%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
           +F    LP+G+P+SV+ DYL Y LW +VQ  +S +SG LATQA L  +G+G + A   AA
Sbjct: 47  VFKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQEATVAAA 106

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
            I W+L+DG G L +I+ +   G   D   K WRLFAD+L + A  +E+  P FP  F  
Sbjct: 107 TITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHFPPFFTI 166

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I   AG  +S   +   +TR+      A R N A++ AK  +Q  +    G+++ +AL  
Sbjct: 167 IVCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLVSLALIP 226

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
            +  +       F + T +H++ N ++ +S+ + TLN  R S+V  +YLL GQ     E 
Sbjct: 227 LVTDNPLLTFILFYLFTVLHLFANYRAVRSVVMETLNEARLSIVLHQYLLHGQVLSPAEA 286

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
           N  EP+F +F                        V I    +LG +L D+++   D    
Sbjct: 287 NQREPVFLSFR---------------------RTVPI----KLGVRLGDLIHEPGDLQLA 321

Query: 417 FSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
                 + Y++    G  CV L   A   D +K++ QA
Sbjct: 322 LK-NNSKPYLIGIKNGAVCVCLGSDASVDDEIKAVCQA 358


>gi|301622642|ref|XP_002940632.1| PREDICTED: UPF0420 protein C16orf58 [Xenopus (Silurana) tropicalis]
          Length = 473

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 113/345 (32%), Positives = 173/345 (50%), Gaps = 28/345 (8%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTA 176
           R+ FV  +LP G+PDSV+ DYL Y LW ++Q  +S ++G LATQ+LL   G+G      A
Sbjct: 66  RDFFVSLLLPHGYPDSVSKDYLAYQLWDTIQAFSSSVTGSLATQSLLRGSGVGDSTATVA 125

Query: 177 AAIN-WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           AA   W+L+DG G + +I+ +   G   D + K WRLFADLL + A  +E++ P FP  F
Sbjct: 126 AATITWILRDGTGMVGRILFAWMKGSRLDCDAKRWRLFADLLNDIAIFMEIMAPVFPSCF 185

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
                AAG  +    +   +TR+      A R N A+V AK  +Q  +    G+++ + L
Sbjct: 186 TLTVCAAGVFKCIVGVAGGATRAAITVHQAKRDNMADVCAKDGSQETLVNLAGLLVSLFL 245

Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
              +  SM      F ++T +H+Y N ++ +S+ + TLN  R S+V   YL  G+    +
Sbjct: 246 VPLVSDSMMATYLLFFLLTILHLYANYRAVRSVIMETLNQSRLSIVLHHYLKEGKILDPE 305

Query: 355 EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAK-DAAVEIEHRLQLGSKLSDVVNNKEDA 413
             N  EPL P                 L S+A     V + +   L S + ++   K+D 
Sbjct: 306 AANLREPLLPG----------------LGSQAPISLGVPLSY---LASSVPELEQLKKDN 346

Query: 414 HALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLY 458
           H+L+ L    G+   E  G   VVL E A   DM+++   A  L+
Sbjct: 347 HSLYLL----GW--REDTGSLAVVLHERASSVDMIRACVHAEMLH 385


>gi|225423472|ref|XP_002274189.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
 gi|297738087|emb|CBI27288.3| unnamed protein product [Vitis vinifera]
          Length = 437

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 111/340 (32%), Positives = 176/340 (51%), Gaps = 21/340 (6%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
            +Q  +PEGFP SVT DY  + LW S+QG+++ I  +L+TQALL AIG+G K A    A 
Sbjct: 51  ILQAFVPEGFPSSVTPDYAPFQLWDSLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGAT 110

Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
             W L+D  G L  I+ + + G + D N K WRL ADL+ +    +++++P FP  FVFI
Sbjct: 111 FQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVFI 170

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN- 296
                  RS   +   +TR+     FA + N A++ AK  +Q  V+  +G+  G+ LA  
Sbjct: 171 VCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMVGMAFGMLLARI 230

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
            +G S+     SF  +T  HMY N ++ + + L TLN  R+S++   ++ +GQ    +EV
Sbjct: 231 TMGHSVAIWF-SFLSLTMFHMYANYRAVRCLILTTLNSERSSILIEHFMETGQVLSPQEV 289

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLS--DVVNNKEDAH 414
           +  E + P               L  SS +   A  +  R+ LG ++S  D +   +  H
Sbjct: 290 SKMEHVLP---------------LWASSWSLKNAKSLHTRIHLGVRVSSLDKLEVTDLLH 334

Query: 415 ALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
           +  S Y+   Y+L E  G   V++ + +   D+L+S   A
Sbjct: 335 SAESHYKKAKYLLLERKGIISVIMHKDSTASDVLQSFIHA 374


>gi|410903033|ref|XP_003964998.1| PREDICTED: UPF0420 protein C16orf58 homolog [Takifugu rubripes]
          Length = 442

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 107/345 (31%), Positives = 168/345 (48%), Gaps = 31/345 (8%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
           F    LP+G+P+SV+ DYL Y  W +VQ   S +SG LAT+A L  +G+G + A   AA 
Sbjct: 48  FKSVFLPQGYPESVSGDYLRYQFWDTVQAFFSSLSGTLATRASLRGVGVGNQEATVAAAT 107

Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           + W+LKDG G   +I+ +   G   D   K WRLFAD+L + A  +E+L P FP  F  +
Sbjct: 108 MTWLLKDGTGMCGRILFAWRKGTKLDSEAKKWRLFADILNDVAMFMEILAPYFPAFFTLV 167

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
              +G  +S   +   +TR+      A R N A++ AK  +Q  +    G+++ + L   
Sbjct: 168 MCVSGIFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLVLIRF 227

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           +  +    L  F +   +H++ N K+ +S+ + TLN  R S+V  +YL +GQ     E N
Sbjct: 228 VSDNAALTLGLFFLFAVLHLFANYKAVRSVVMETLNEARLSVVLQQYLENGQILSPLEAN 287

Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
           + EP+F  F                        V I    +LG +L +V+ + E+     
Sbjct: 288 EREPVFMDFK---------------------KVVPI----RLGVRLQEVIQSSEELKFAL 322

Query: 418 SLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLER 462
            L  ++ ++L    G  CV L   A  QD ++ + QA    WL R
Sbjct: 323 KL-SNKPFLLGVKNGGVCVCLGPEASVQDEIRVMCQA---LWLRR 363


>gi|356563749|ref|XP_003550122.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 497

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 129/426 (30%), Positives = 196/426 (46%), Gaps = 54/426 (12%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
           E       +++PEGFPDSV   Y+ Y  WR+++       GV  TQ LL ++G+ +  A 
Sbjct: 101 EVLAFVTSYVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRAA 160

Query: 174 PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
           P A AINW+LKDG G + K++ +  G+ FD + K  R   DLL     G+E+ T A PH 
Sbjct: 161 PGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPHL 220

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
           F+ +  AA   ++ AA+   STR+  Y  FA   N  +V AKGE  G ++  +G  L I 
Sbjct: 221 FLPLACAANVLKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLLGTGLSIL 280

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           +A       P  + +FS+++  ++  + +  +S+ L TLN  R S+    +L +GQ P +
Sbjct: 281 IAKR----NPSLVTTFSLLSCGYILSSYREVKSVVLHTLNCGRFSVAVESFLRTGQVPTL 336

Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
           +E N  E +F                   S   KD  V       LGS++ +   +    
Sbjct: 337 QEGNMNENIF-------------------SFPWKDRPV------VLGSRIKEAFQDPSAY 371

Query: 414 HALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYLY------WLERNAG 465
            A+  L++ E YI+T +  K  V  VLK+ A   D+LK+ F A  L       W E NA 
Sbjct: 372 FAIEPLFDRERYIVTYNPSKHKVYAVLKDQAKSDDILKAAFHAHVLLFNLIKSWNESNAS 431

Query: 466 IVA--------TSTSAD---CAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPL 514
            +         T T AD   C  G     +   V   +   K+++   GW        P 
Sbjct: 432 SLKQREDLSNMTHTVADIEACMAG-----TCKTVADSYGFFKNNAKEQGWTMSESHLNPG 486

Query: 515 PNRIRP 520
             R+ P
Sbjct: 487 RARLHP 492


>gi|432867528|ref|XP_004071227.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oryzias latipes]
          Length = 423

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 107/337 (31%), Positives = 161/337 (47%), Gaps = 28/337 (8%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
           F    LP+G+P+SV+ DYL Y  W ++Q  AS +S  L TQA L  +G+G + A   AA 
Sbjct: 45  FKSVFLPQGYPESVSEDYLQYQFWDTLQAFASTLSWTLVTQASLKGVGVGNQDATVAAAT 104

Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           + W+LKDG G   +I+ +   G   D   K WRLFAD+L + A  +E+L P FP  F  I
Sbjct: 105 VTWILKDGTGMFGRILFAWQKGSRLDSEAKKWRLFADVLNDIAMFIEILAPFFPSFFTLI 164

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
              AG  ++   +   +TR+      A R N A++ AK  +Q  +    G+++ + L   
Sbjct: 165 VCTAGVFKAIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLVVSLILIPL 224

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           I  +   +L  F + T +H++ N K+ +S+ + T N  R S+V  +YL   +     E N
Sbjct: 225 ITDNSSLSLGLFFLFTILHLFANYKAVRSVVMETFNQARLSIVLQQYLKDKRILSPFEAN 284

Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
             EP+F  F                         +I   ++LG KL DVV + ED H L 
Sbjct: 285 RREPVFLEF-------------------------KITTPIKLGVKLQDVVQSPEDFH-LA 318

Query: 418 SLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
                  Y+L    G  CV L   A   D +++  QA
Sbjct: 319 LRNNSMAYLLGVRNGCICVCLGSGASVHDEIRAACQA 355


>gi|449431896|ref|XP_004133736.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
 gi|449478104|ref|XP_004155224.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 507

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 117/406 (28%), Positives = 194/406 (47%), Gaps = 42/406 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           DE       +++PEGFPDSVT  Y+ Y  WR+++       GV  TQ LL ++G+ +  A
Sbjct: 114 DEMMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLNSVGVARNKA 173

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            P A AINW+LKDG G + K++ +  G+ FD + K  R   DLL     G+E+ T A PH
Sbjct: 174 TPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAAPH 233

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +  AA   ++ +A+   STR+  Y  FA   N  +V AKGE  G ++     +LG 
Sbjct: 234 LFLPLACAANVAKNVSAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD----LLGT 289

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+  I    P  + +F +++  +++ + +  +S+ L TLN  R ++    +L +G+ P 
Sbjct: 290 GLSIMISKRNPSLVTTFGLLSCGYIFSSYQEVRSVVLHTLNRARFNVAVETFLKTGRVPS 349

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
           +++ N  E +  +F + K                       E+ + LG +  D   +   
Sbjct: 350 LQKGNMNERIL-SFPWLK-----------------------ENPIVLGPRFKDAFQDAGS 385

Query: 413 AHALFSLYEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAG----- 465
             A+  L++ E Y++T  +  GK   +LK+ A   D++K+ F A  L    R++      
Sbjct: 386 FLAIEPLFDREKYLVTYNQTKGKVYALLKDQAKSDDIIKAAFHAHVLLHFIRSSSGAQNS 445

Query: 466 ----IVATSTSADCAPGGRLEI--SLDYVQREFNHVKSDSASVGWV 505
               + A S+S         EI  S   V   +   KS ++  GWV
Sbjct: 446 NQKQVDAFSSSVPTTTNLEAEIAASCKMVSNSYEIFKSKASEQGWV 491


>gi|157422883|gb|AAI53381.1| Zgc:162613 protein [Danio rerio]
          Length = 432

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 109/338 (32%), Positives = 166/338 (49%), Gaps = 28/338 (8%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
           +F    LP+G+P+SV+ DYL Y LW +VQ  +S +SG LATQA L  +G+G + A   AA
Sbjct: 47  VFKSVFLPQGYPESVSEDYLQYQLWDTVQAFSSSLSGTLATQASLRGVGVGNQEATVAAA 106

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
            I W+L+DG G L +I+ +   G   D   K WRLFAD+L + A  +E+  P FP  F  
Sbjct: 107 TITWLLRDGTGMLGRILFAWLKGSKLDSEAKKWRLFADILNDVAMFIEIAAPHFPPFFTL 166

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I   AG  +S   +   +TR+      A R N A++ AK  +Q  +    G+++ +AL  
Sbjct: 167 IVCIAGIFKSIVGVAGGATRAALTVHQARRNNMADISAKDGSQETLVNLAGLLVSLALIP 226

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
            +  +       F + T +H++ N K+ +S+ + TLN  R S+V  +YLL GQ     E 
Sbjct: 227 LVTDNPLLTFILFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLHQYLLHGQVLSPAEA 286

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
           N  EP+F +F                        V I    +LG +L D+++   D    
Sbjct: 287 NQREPVFLSFR---------------------RTVPI----KLGVRLGDIIHEPGDLQLA 321

Query: 417 FSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
                 + Y++    G  CV L   A   D +K++ QA
Sbjct: 322 LK-NNSKPYLIGIKNGAVCVCLGSDASVDDEIKAVCQA 358


>gi|297844278|ref|XP_002890020.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335862|gb|EFH66279.1| hypothetical protein ARALYDRAFT_888749 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 440

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 139/451 (30%), Positives = 220/451 (48%), Gaps = 47/451 (10%)

Query: 75  EVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVT 134
           E  GS  TKL     K A + ASAS +   S    N +W    + FV    PEGFP SVT
Sbjct: 16  EWNGSSSTKLF----KTATITASASLSIQRSANRFNHVWRRVLQAFV----PEGFPGSVT 67

Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKI 193
            DY+ +  W ++QG+++ I  +L+TQALL AIG+G K A    A   W L+D  G L  I
Sbjct: 68  PDYVGFQFWDTLQGLSTYIKMMLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGI 127

Query: 194 MLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
           + + + G + D N K WRL ADL+ +    +++L+P FP  F+ +       RS   +  
Sbjct: 128 LFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVAS 187

Query: 253 ASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSV 311
            +TR+     FA + N A++ AK  +Q  ++  +G+ LG+ LA    S  P A+  SF  
Sbjct: 188 GATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLAR-FTSGNPMAIWFSFLS 246

Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKI 371
           +T  HMY N ++ + + L +LN  R+S++ + ++ +GQ    ++V+  E + P +    +
Sbjct: 247 LTVFHMYANYRAVRCLVLNSLNFERSSILLTHFMQTGQVLSPEQVSSMEGVLPVWA-TSL 305

Query: 372 KSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF-----SLYEDEGYI 426
           +S N S+LL               R+ LG ++S +   + +   L      S Y++  Y+
Sbjct: 306 RSTN-SKLL-------------HKRVHLGVRVSSL--PRLEMMQLLNGVGASSYKNAKYL 349

Query: 427 LTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLD 486
           L    G   V+L + +   D+LKS   A  L      A ++  STS         E  +D
Sbjct: 350 LAHRKGNVSVILHKDSSAADVLKSYIHAIVL------ANLMEKSTSFYSEG----EAWID 399

Query: 487 YVQREFNHVKSDSASVGWVTDGLIARPLPNR 517
               EF H      S GW T+ L++  +  R
Sbjct: 400 KHYDEFLH---KLRSGGWKTERLLSPSITWR 427


>gi|340372549|ref|XP_003384806.1| PREDICTED: UPF0420 protein C16orf58-like [Amphimedon queenslandica]
          Length = 432

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 99/299 (33%), Positives = 157/299 (52%), Gaps = 8/299 (2%)

Query: 110 NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG 169
            +L D  +E+F    LP+G+P+SV+ DYL Y LW ++Q   S I+G LAT A+L  +G+G
Sbjct: 35  TRLADTLKEVF----LPQGYPNSVSEDYLQYQLWDTLQAFCSYITGTLATHAMLKGVGVG 90

Query: 170 -KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLT 227
              A P AA + W+LKDG+G   +I+ +   G   D N K WRL AD+L + A  +++++
Sbjct: 91  DSSASPVAATLTWILKDGLGMTGRILFAWLRGYDLDCNAKKWRLIADILNDIAICMQLVS 150

Query: 228 PAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIG 287
           P FP  F+ I   A   +S   +   +TR+      A R N A+V AK  +Q  +    G
Sbjct: 151 PFFPSCFLLIACLASITQSVVGVAGGATRAALVQHQARRDNMADVSAKDGSQETLVNLCG 210

Query: 288 IMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
           +++G+ +   I     F  + F   T +H+Y N K+   + + TLN  R  L+     L+
Sbjct: 211 LLIGLIITPLIAGQTVFVWSLFFSFTLLHLYSNYKAVSVVSMETLNCNRLHLLMRNLFLN 270

Query: 348 GQAPPVKEVNDEEP-LFPAFHFFKIK-SANKSQLLVLSSEAKDAAVEIEHRLQLGSKLS 404
           G       VN EEP LF    FF ++  ++ S +L+ SS+   + ++     +   KLS
Sbjct: 271 GTISEPNIVNREEPLLFRQERFFTVEYGSSLSSVLIHSSDYSRSILQFNKGQKFIIKLS 329


>gi|148905902|gb|ABR16113.1| unknown [Picea sitchensis]
          Length = 446

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 114/357 (31%), Positives = 190/357 (53%), Gaps = 25/357 (7%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LPEGFP+SVTSDYL + +W S+QG+++ I  +L+TQALL AIG+G+  A    A   W L
Sbjct: 60  LPEGFPNSVTSDYLPFQIWDSLQGLSTYIRCMLSTQALLSAIGVGETTATVVGATFQWFL 119

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           +D  G L  I+ + + G + D N K WRL ADL+ +    +++++P +P  FV I     
Sbjct: 120 RDFTGMLGGILFTLYQGSNLDSNAKMWRLIADLMNDLGMLMDLVSPLYPASFVIILCLGS 179

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             RS   +   +TR+     FA + N A++ AK  +Q  ++  IG++LG+ LA H+ +  
Sbjct: 180 IARSFTGVASGATRAALTQHFALQHNAADISAKEGSQETMATMIGMLLGMLLA-HVTAGH 238

Query: 303 PFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
           P A+ +SF ++T+ HM+ N ++ + + L +LN  R S++   ++ +GQ    K+V+  E 
Sbjct: 239 PGAMWSSFLLLTFFHMFANYQAVRCLCLNSLNEGRTSVLLKCFITNGQVLNPKDVSSMEY 298

Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSL-- 419
           L P          N  QL +L+      A       +LG+K+S +  ++ D   L  L  
Sbjct: 299 LLP----------NWMQLRMLTKGRPFNAT-----FKLGAKISSL--SQSDMSKLLKLSA 341

Query: 420 --YEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSAD 474
             Y    Y+L    G   V+L + ++  D+L+S      L +   ++  V  + +AD
Sbjct: 342 SCYGKGKYLLLCRNGTIEVILNKDSVSADILQSYIHGLVLSYHMGSSSSVNDANNAD 398


>gi|356502614|ref|XP_003520113.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
           [Glycine max]
          Length = 486

 Score =  168 bits (425), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 119/388 (30%), Positives = 192/388 (49%), Gaps = 37/388 (9%)

Query: 75  EVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVT 134
           E  GS  TKL   FT       + S A  +       +W    + FV    PEGFP SVT
Sbjct: 72  EWNGSSPTKLSKTFT------ITRSGARFT------HVWRRLLQAFV----PEGFPSSVT 115

Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKI 193
           +DY+ + +W  + G+++ I  +L+TQALL AIG+  K A    A   W L+D  G L  I
Sbjct: 116 ADYVPFQIWDLLXGLSTYIRTMLSTQALLIAIGVEEKSATVIGATFQWFLRDLTGMLGGI 175

Query: 194 MLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
           + + + G + D   K WRL ADL+ +    +++++P FP  FVF+       RS   +  
Sbjct: 176 LFTFYQGSNLDSYAKMWRLVADLMNDLGMLMDLISPLFPSAFVFVVCLGSISRSFTGVAS 235

Query: 253 ASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSV 311
            +TR+     FA + N A++ AK  +Q  V+  IG+ LG+ +A  +    P A+  SF  
Sbjct: 236 GATRAALTQHFALQDNAADISAKEGSQETVATMIGMALGMLVAR-LTIGHPLAIWFSFLS 294

Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKI 371
           +T  HMY N ++ Q + L +LNP R+S++F  ++ +GQ    ++V+ +E + P       
Sbjct: 295 LTVFHMYANYRAVQXLTLNSLNPERSSILFQHFMETGQVLSPEQVSSQEHILPI------ 348

Query: 372 KSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS--LYEDEGYILTE 429
                 Q +  SS+  +    +  ++ LG ++S   N +   H L +   Y    Y+L E
Sbjct: 349 ------QFISWSSKKANC---LHKKVNLGMRISSFDNMEIKEHLLSAAPYYTKAKYLLVE 399

Query: 430 HGGKFCVVLKESALPQDMLKSLFQASYL 457
             G   V++ + +   D+LK  F A  L
Sbjct: 400 RKGIIDVIVHKDSNAADVLKLFFHALVL 427


>gi|255569810|ref|XP_002525869.1| conserved hypothetical protein [Ricinus communis]
 gi|223534874|gb|EEF36563.1| conserved hypothetical protein [Ricinus communis]
          Length = 520

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 112/347 (32%), Positives = 177/347 (51%), Gaps = 31/347 (8%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           +E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ K  A
Sbjct: 125 EELMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTKTLLSSVGVSKSTA 184

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
           IP A AINW+LKDG G + K++ +  G+ FD + K  R   DLL     G+E+ T A PH
Sbjct: 185 IPGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGVELATAAVPH 244

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +  AA   ++ AA+   STR+  Y  FA   N  +V AKGE  G V+     +LG 
Sbjct: 245 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVAD----LLGT 300

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+  I    P  + +F++++  +++ + +  +S+ L TLN  R S+    +L +G+ P 
Sbjct: 301 GLSIMISKRNPSLVTTFALLSCGYVFSSYQEVKSVVLHTLNRARFSVAVESFLKTGRVPS 360

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
           ++E N  E +F +F + K                       E  + +GS+  D   +   
Sbjct: 361 LQEGNTRENIF-SFPWLK-----------------------ERPIVIGSRFKDAFQDPSA 396

Query: 413 AHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
             A+  L+E E YI+T +  K  V  +LK+ A   D+LK+ F A  L
Sbjct: 397 FLAIEPLFEKERYIVTYNPTKDKVYALLKDQAKADDILKAAFHAHVL 443


>gi|242076506|ref|XP_002448189.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
 gi|241939372|gb|EES12517.1| hypothetical protein SORBIDRAFT_06g022670 [Sorghum bicolor]
          Length = 439

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 111/410 (27%), Positives = 206/410 (50%), Gaps = 35/410 (8%)

Query: 102 SLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQA 161
           S+  +     + D+  + F+    P G+P SV   YL Y+ +R++Q  +S +  VL+TQ+
Sbjct: 51  SVKVIQDARPIHDKVVDSFLNKFFPSGYPYSVNEGYLTYTRFRALQHFSSAMLHVLSTQS 110

Query: 162 LLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAF 221
           LL+A GL +     A A++W+LKDG+ +  K++ S+ G   D  PK WR+ AD+L +   
Sbjct: 111 LLFAAGL-RPTPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGT 169

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE+++P  P  F+ +       +  A +   +TR   Y+ FA   N +++ AKGEA   
Sbjct: 170 ALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAIST 229

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +   +GI  GI LA+ + S+    L +  +++ +H+Y  ++  ++  + TLNP R +++ 
Sbjct: 230 LFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSAVHIYGVVQEMRATPVNTLNPQRTAMIV 289

Query: 342 SEYLLSGQ-APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLG 400
           ++++ SG+ + P +    E+ LFP                       +  +E    +++G
Sbjct: 290 ADFIKSGKVSSPAELRYREDLLFP-----------------------NRVIEEAGSVKIG 326

Query: 401 SKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
             L  V+ +      L + + DE ++LT+   K  +VL++SA  +D L+    A++   +
Sbjct: 327 QPLRRVL-SPSLVEQLRATFPDEKFLLTQKSNKTYMVLEQSASGEDALRGWLVAAFASEM 385

Query: 461 ERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
           ER+ GIV+  T         L+ + +  +  F    S+  S GW TD  +
Sbjct: 386 ERS-GIVSRDTV--------LKEAYEKTKMVFPSFVSEVRSRGWYTDQFL 426


>gi|357126248|ref|XP_003564800.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
           distachyon]
          Length = 506

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 105/347 (30%), Positives = 175/347 (50%), Gaps = 32/347 (9%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
           +E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ +  +
Sbjct: 111 EEIMAFVRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKV 170

Query: 174 -PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            P A AINW+LKDG G + K++ +  G+ FD + K  R  +DLL     G+E+ T AFP 
Sbjct: 171 TPGAIAINWILKDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGIELATAAFPQ 230

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG 
Sbjct: 231 FFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGT 286

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+ +I    P  + SF++++  ++  + +  +S+ L TLN  R ++    ++ +G  P 
Sbjct: 287 GLSIYITKRNPSLVTSFALLSCGYLMSSYQEVRSVVLNTLNRARFTVAVDSFIKTGYVPS 346

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
           +KE N +E LF   H ++                          + +GS+  +       
Sbjct: 347 LKEGNSQETLF--IHPWR-----------------------HEPVAIGSRFGEAFQEPVS 381

Query: 413 AHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
             A   L+EDE Y++T +  K  V  +LK+ A P D+LK+ F A  L
Sbjct: 382 FVATRPLFEDERYMVTYNPTKDKVYALLKDQAKPDDVLKAAFHAHVL 428


>gi|168022981|ref|XP_001764017.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162684756|gb|EDQ71156.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 437

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/391 (30%), Positives = 191/391 (48%), Gaps = 29/391 (7%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LPEG+P+SVT DYL +  W ++QG+++ I  +L+TQALL  IG+G+  A    A   W +
Sbjct: 54  LPEGYPNSVTPDYLPFQTWDTLQGLSTYIRSMLSTQALLGGIGVGETTATVVGATFLWFM 113

Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           +D  G L  I+ + + G + D + K WRL ADLL +    +++++P FP  FV I     
Sbjct: 114 RDFTGMLGSILFTLHQGSNLDSSAKQWRLTADLLNDVGMLMDLVSPLFPGAFVTILCIGS 173

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             RS   +   +TR+     FA R+N A+V AK  +Q   +  +G++LG+ LA     ++
Sbjct: 174 MARSVTGVASGATRAALTQHFALRKNAADVSAKEGSQETAATMVGMLLGMVLARLTAENV 233

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
                SF ++T  HMY N ++ +S+ L +LN  R S+V S +    + P  +EV D+E L
Sbjct: 234 VALWVSFLLLTAFHMYANYRAVRSLCLTSLNAERTSIVLSAFKEGRKVPTPREVADQEHL 293

Query: 363 FPAFHFFKIKSANKSQLLVLSSEAKDAAV--EIEHRLQLGS-KLSDVVNNKEDAHALFSL 419
            P             +L      A DA +   I   +Q+ S KL D     +   +L   
Sbjct: 294 LPRI----------PRLPWQKQRAGDAWLPKSIRFGVQISSLKLDD---GWKSFLSLVER 340

Query: 420 YEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGG 479
           Y  + Y++   G    VVL + A PQD L     A  L  L+         + AD +   
Sbjct: 341 YSRDPYLMVPKGAIVHVVLHKLATPQDFLHGYVHALCLADLQLR-------SPADQSE-- 391

Query: 480 RLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
             E+++ +++  +          GW TD ++
Sbjct: 392 --ELAISWMKNNYTAFLDSLGKSGWATDRIL 420


>gi|348510102|ref|XP_003442585.1| PREDICTED: UPF0420 protein C16orf58 homolog [Oreochromis niloticus]
          Length = 433

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 165/339 (48%), Gaps = 30/339 (8%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
           +F    LP+G+P+SV+ DYL Y  W ++Q  +S +SG LATQA L   G+G + A   AA
Sbjct: 48  VFKSVFLPQGYPESVSDDYLQYQFWDTMQAFSSSLSGTLATQASLRGFGVGNQEATVAAA 107

Query: 178 AINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
            + W+ KDG G L +I+ +   G   D   K WRLFAD+L + A  +E+L P FP  F F
Sbjct: 108 TVTWLFKDGTGMLGRILFAWRKGSKLDSEAKKWRLFADVLNDIAMFMEILAPYFPACFTF 167

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I   AG  +S   +   +TR+      A R N A++ AK  +Q  +    G+++ + L  
Sbjct: 168 IVCTAGIFKSIVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLILIP 227

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
            +  +    L  F + T +H++ N K+ +S+ + T N  R S+V  +YL   +     E 
Sbjct: 228 LVTDNPVLTLCLFFLFTVLHLFANYKAVRSVVMETFNEARLSIVLQQYLKDKRILSPPEA 287

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
           N  EP+F  F F +                          ++LG +L ++  + ED    
Sbjct: 288 NQREPVF--FEFGQTVP-----------------------MKLGVRLQEIAQSPEDLQ-- 320

Query: 417 FSLYEDE-GYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
            +L E+   Y+L    G  CV L   A   D ++++ QA
Sbjct: 321 LALKENSMPYLLGIRNGCVCVCLGPEASASDEIRAMCQA 359


>gi|357474239|ref|XP_003607404.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
 gi|355508459|gb|AES89601.1| hypothetical protein MTR_4g077650 [Medicago truncatula]
          Length = 490

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 111/349 (31%), Positives = 170/349 (48%), Gaps = 34/349 (9%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-- 171
           DE       +++PEGFP +VT  Y+ Y  WR+++       GV  TQ LL ++G+ +   
Sbjct: 95  DEMMSFITSYVVPEGFPHTVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLSSVGVSRNRA 154

Query: 172 -AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
                A AINW+LKDG G + K++ +  G+ FD + K  R   DLL     G+E+ T A 
Sbjct: 155 TPAAAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFTGDLLMELGAGVELATAAM 214

Query: 231 PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
           PH F+ +  AA   ++  A+   STR+  Y  FA   N  +V AKGE  G ++     +L
Sbjct: 215 PHLFLPLACAANVLKNVGAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD----LL 270

Query: 291 GIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
           G   +  I  S P  + +FS+++  ++  + K  +S+ L TLN  R S+    +L +GQ 
Sbjct: 271 GTGFSIMIAKSNPSLITTFSLLSCGYILSSYKEVKSVVLHTLNSARFSVAVDSFLKTGQV 330

Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           P ++E N  E +F                   S   KD  V       LGS++ D   + 
Sbjct: 331 PTLREGNLNEDIF-------------------SFPWKDRPV------VLGSRIKDAFQSP 365

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               A+  L++ E YI+T +  K  V  VLK+ A   D+LK+ F A  L
Sbjct: 366 SAYVAIEPLFDKERYIVTYNPSKSKVHAVLKDQAKSDDILKAAFHAHVL 414


>gi|168024902|ref|XP_001764974.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683783|gb|EDQ70190.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  166 bits (419), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 116/403 (28%), Positives = 200/403 (49%), Gaps = 37/403 (9%)

Query: 120 FVQ-FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
           FV+ +++PEG+PDSV   Y  Y  WR+VQ        V  T++L++A+G+ +G +  +A 
Sbjct: 35  FVKAYIVPEGYPDSVAPSYTPYMQWRAVQYFFGGAMSVFTTRSLMHALGVSRGTSASSAV 94

Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           A+NWV+KDG G + K++ +  G+ FDV+ K  R    +L     G+E+ T A PH F+ +
Sbjct: 95  AVNWVIKDGAGRIGKMLFARHGKKFDVDLKQLRFKGAMLMQLGAGVELTTMAVPHLFLPL 154

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
             AA   ++ AA+  +STR+  Y  FA R N  +V AKGE    ++  +G  LGI     
Sbjct: 155 ACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECISNIADLLGTGLGI----F 210

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           I    P  +A+F V++  +M  +    +S+ L TLN  R  +V   +L +GQ P +++ N
Sbjct: 211 ISKRSPSLVATFCVLSCGYMISSFNEIKSVRLATLNRARFGVVVQSFLETGQVPSIEDAN 270

Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
             E                  ++ L  + K         L+LGS++    +N  +     
Sbjct: 271 RRE-----------------SVITLPWQEKP--------LELGSRVGQAFSNPREFIRTQ 305

Query: 418 SLYEDEGYILTEHGGK--FCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADC 475
             ++ E Y++T    K    VV+KESA   D++++ FQA  +  + R      T+++ + 
Sbjct: 306 EQFQKENYLVTYRPNKKRAYVVIKESASSDDVVRATFQAHVMLHMLRGLEGTITNSNVED 365

Query: 476 APGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRI 518
           A     + + D     ++  K ++ S GW+    +  P  +R+
Sbjct: 366 AVAESTKRTYDL----YDSFKEEAKSKGWMMGETLLNPGDSRV 404


>gi|357133637|ref|XP_003568430.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Brachypodium
           distachyon]
          Length = 521

 Score =  166 bits (419), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 107/349 (30%), Positives = 172/349 (49%), Gaps = 36/349 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           +E       +++PEGFPDSVT  Y+ Y  WR+++        V  T+ALL ++G+ +  A
Sbjct: 121 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRA 180

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
              A AINW+LKDG G + K++ +  G+ FD + K  R   DLL     G+E+ T AFPH
Sbjct: 181 TSGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPH 240

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG 
Sbjct: 241 LFLPLACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGT 296

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            ++  I    P  +ASF+ ++  ++  + +  +S+ L TLN  R ++    ++ +G  P 
Sbjct: 297 GMSILISKRNPSLVASFAFLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPS 356

Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           +KE N EE +F  P  H                             + +GS+  +     
Sbjct: 357 LKEGNLEETIFNPPWRH---------------------------QPVAIGSRFGEAFQEP 389

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               +   L+EDE YI+T +  K  V  +LK+ A P D+LK+ F A  L
Sbjct: 390 ASFVSTSPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 438


>gi|18402585|ref|NP_565718.1| uncharacterized protein [Arabidopsis thaliana]
 gi|20197897|gb|AAD20664.2| expressed protein [Arabidopsis thaliana]
 gi|21553562|gb|AAM62655.1| unknown [Arabidopsis thaliana]
 gi|109134157|gb|ABG25076.1| At2g31190 [Arabidopsis thaliana]
 gi|330253411|gb|AEC08505.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 433

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 114/396 (28%), Positives = 193/396 (48%), Gaps = 39/396 (9%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           E F+    P G+P SV   YL Y+ +R++Q  +S    VL+TQ+LL+A GL +     A 
Sbjct: 65  ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 123

Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
            ++W+LKDG+ ++ K++ SN G   D  PK WR+ AD+L +   GLE+++P  PH F+ +
Sbjct: 124 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEM 183

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
                  +  A +   +TR   Y+ FA   N +++ AKGEA   +    GI  GI LA+ 
Sbjct: 184 AGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLAST 243

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           I SSM   L   S+++ +H+Y  ++  + + + TLNP R +L+ + +L +G+ P   ++ 
Sbjct: 244 ICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLR 303

Query: 358 DEEPL-FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
            +E L FP                       +  ++    +++G  L   V   E    L
Sbjct: 304 FQEDLMFP-----------------------ERPIQDAGNVKVGRALHKAVKPSE-VQRL 339

Query: 417 FSLYEDEGYILTEHGGKFC-VVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADC 475
             ++ +E ++L+ HG  +  +VL+  A  +D L+    A+Y          V + T    
Sbjct: 340 KQVFVEEKFLLS-HGKSWTDMVLEHDATGEDALRGWLVAAY----------VKSMTKIYN 388

Query: 476 APGG-RLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
            P    L+ + D +   FN   S   + GW TD  +
Sbjct: 389 DPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTDRFL 424


>gi|334184613|ref|NP_001189649.1| uncharacterized protein [Arabidopsis thaliana]
 gi|330253412|gb|AEC08506.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 432

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 114/396 (28%), Positives = 193/396 (48%), Gaps = 39/396 (9%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           E F+    P G+P SV   YL Y+ +R++Q  +S    VL+TQ+LL+A GL +     A 
Sbjct: 64  ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 122

Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
            ++W+LKDG+ ++ K++ SN G   D  PK WR+ AD+L +   GLE+++P  PH F+ +
Sbjct: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEM 182

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
                  +  A +   +TR   Y+ FA   N +++ AKGEA   +    GI  GI LA+ 
Sbjct: 183 AGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLAST 242

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           I SSM   L   S+++ +H+Y  ++  + + + TLNP R +L+ + +L +G+ P   ++ 
Sbjct: 243 ICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANFLKTGKVPSPPDLR 302

Query: 358 DEEPL-FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
            +E L FP                       +  ++    +++G  L   V   E    L
Sbjct: 303 FQEDLMFP-----------------------ERPIQDAGNVKVGRALHKAVKPSE-VQRL 338

Query: 417 FSLYEDEGYILTEHGGKFC-VVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADC 475
             ++ +E ++L+ HG  +  +VL+  A  +D L+    A+Y          V + T    
Sbjct: 339 KQVFVEEKFLLS-HGKSWTDMVLEHDATGEDALRGWLVAAY----------VKSMTKIYN 387

Query: 476 APGG-RLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
            P    L+ + D +   FN   S   + GW TD  +
Sbjct: 388 DPDDIILQDAYDKMNDVFNPFLSQVQAKGWYTDRFL 423


>gi|297604521|ref|NP_001055571.2| Os05g0419200 [Oryza sativa Japonica Group]
 gi|53982666|gb|AAV25645.1| unknown protein [Oryza sativa Japonica Group]
 gi|222631625|gb|EEE63757.1| hypothetical protein OsJ_18576 [Oryza sativa Japonica Group]
 gi|255676375|dbj|BAF17485.2| Os05g0419200 [Oryza sativa Japonica Group]
          Length = 415

 Score =  165 bits (417), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 107/349 (30%), Positives = 170/349 (48%), Gaps = 36/349 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           +E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ +  A
Sbjct: 18  EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRA 77

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
              A AINW+LKDG G + K++ +  G+ FD + K  R   DLL     G+E+ T AFP 
Sbjct: 78  TSGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 137

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG 
Sbjct: 138 LFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGT 193

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+  I    P  + SF+ ++  ++  +    +S+ L TLN  R ++    ++ SG  P 
Sbjct: 194 GLSILISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPS 253

Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           +KE N +E +F  P  H                             + +GS+  +     
Sbjct: 254 LKEGNSQETIFNPPWRH---------------------------QPVAIGSRFGEAFQEP 286

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               A+  L+EDE YI+T +  K  V  +LK+ A P D+LK+ F A  L
Sbjct: 287 ASFVAIRPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 335


>gi|218196816|gb|EEC79243.1| hypothetical protein OsI_19997 [Oryza sativa Indica Group]
          Length = 401

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 107/349 (30%), Positives = 169/349 (48%), Gaps = 36/349 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
            E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ +  A
Sbjct: 4   QEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVAQSRA 63

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
              A AINW+LKDG G + K++ +  G+ FD + K  R   DLL     G+E+ T AFP 
Sbjct: 64  TSGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 123

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG 
Sbjct: 124 LFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGT 179

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+  I    P  + SF+ ++  ++  +    +S+ L TLN  R ++    ++ SG  P 
Sbjct: 180 GLSILISKRNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNTARFTVAVDSFIKSGHVPS 239

Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           +KE N +E +F  P  H                             + +GS+  +     
Sbjct: 240 LKEGNSQETIFNPPWRH---------------------------QPVAIGSRFGEAFQEP 272

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               A+  L+EDE YI+T +  K  V  +LK+ A P D+LK+ F A  L
Sbjct: 273 ASFVAIRPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 321


>gi|302775588|ref|XP_002971211.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
 gi|300161193|gb|EFJ27809.1| hypothetical protein SELMODRAFT_94812 [Selaginella moellendorffii]
          Length = 399

 Score =  164 bits (415), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 98/299 (32%), Positives = 162/299 (54%), Gaps = 17/299 (5%)

Query: 111 KLWDECREL-------FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           +L D+ R L       F+    P G+P SV   YL YS +R++Q  +S I  V +TQ+L+
Sbjct: 27  RLLDDNRPLKHKLVDGFLDQFFPVGYPQSVGEGYLTYSQFRALQHFSSAILSVFSTQSLV 86

Query: 164 YAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGL 223
           +A GL +     A  I+WVLKDG+ ++ K++ S  G   D  PK WR+FAD+L +A   L
Sbjct: 87  FAAGL-RPTPAQATVISWVLKDGMQHIGKLVGSRMGARMDSEPKRWRIFADVLYDAGTAL 145

Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
           EM++PA P +F+ +   A   +  A +   +TR   Y  FA   N +++ AKGEA   ++
Sbjct: 146 EMISPACPQYFLGVAGLANLAKGVAMVTARATRLPLYTAFAKEGNLSDLYAKGEAMSTLA 205

Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
              G+ +GI LA+   +S+   L    +++ IH+Y  ++  +++ L TLN  R +++ ++
Sbjct: 206 NVAGLGVGIRLASSACTSLQGKLLVAPLLSVIHLYSVMEEMRAVPLDTLNAQRTAMLVAD 265

Query: 344 YLLSGQAPPVKEVNDEE----PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQ 398
           YL  G+ P   E+   E    P+ P  H   ++ AN   L + S + +    E++HR Q
Sbjct: 266 YLELGRVPSPAEIRYRERLILPVKPNVHAGSVRIANDI-LQIPSGKIR----ELKHRFQ 319


>gi|168019221|ref|XP_001762143.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686547|gb|EDQ72935.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 390

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 117/403 (29%), Positives = 204/403 (50%), Gaps = 37/403 (9%)

Query: 120 FVQ-FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGA-IPTAA 177
           FV+ +++PE +PDSV   Y  Y  WR+VQ +      V  T++L++A+G+ +GA   +A 
Sbjct: 19  FVKSYIVPENYPDSVAPSYTPYMQWRAVQYLFGGAMSVFTTRSLMHALGVSRGASASSAV 78

Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           A+NWV+KDG G + K++ +  G+ FDV+ K  R    +L     G+E+ T A PH F+ +
Sbjct: 79  AVNWVIKDGAGRIGKMIFARHGKKFDVDLKQLRFKGAMLMQLGAGVELATMAVPHLFLPL 138

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
             AA   ++ AA+  +STR+  Y  FA R N  +V AKGE    ++  +G  LGI     
Sbjct: 139 ACAANVAKNVAAVTSSSTRAPIYKAFARRENIGDVTAKGECISNIADLMGTGLGI----F 194

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           I    P  +A+F V++  +M  +    +S+ L TLN  R  +V   +L +G+ P V++ N
Sbjct: 195 ISKKSPSLVATFCVLSCGYMISSFNEIKSVRLATLNRARFGVVVQSFLETGKVPSVEDAN 254

Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
             E                  ++ L  + K         L+LGS+++   +N ++     
Sbjct: 255 RRE-----------------SVITLPWQEKP--------LELGSRVAQAYSNPKEFLRAR 289

Query: 418 SLYEDEGYILTE--HGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADC 475
             ++ E Y++T      +  VV+KESA   D++++ FQA  +  + R  G+ +T  +++ 
Sbjct: 290 EQFQKEKYLVTYSLKKKRAYVVIKESASSDDVVRATFQAHVMLHILR--GLRSTLITSNV 347

Query: 476 APGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRI 518
             G  +E S       ++  K ++ S GW+    +  P  +R+
Sbjct: 348 --GAIVEESTKLTHDLYDSFKEEAKSQGWMMGETLLSPGDSRL 388


>gi|359486217|ref|XP_003633414.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
 gi|297739449|emb|CBI29631.3| unnamed protein product [Vitis vinifera]
          Length = 504

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 109/347 (31%), Positives = 177/347 (51%), Gaps = 31/347 (8%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           DE       +++PEGFPDSVT  Y+ Y  WR+++       GV  TQ LL ++G+ +  +
Sbjct: 109 DELMSFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNRS 168

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            P A AINW+LKDG G + K++ +  G+ FD + K  RL  DLL     G+E+ T A PH
Sbjct: 169 APGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVPH 228

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +  AA   ++ AA+   STR+  Y  FA   N  +V AKGE  G ++     +LG 
Sbjct: 229 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIAD----LLGT 284

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+  I    P  + +F++++  ++  + +  +S+ L TLN  R S+    +L +G+ P 
Sbjct: 285 GLSIMISKRNPSLVTTFALLSCGYVLSSYQEVKSVVLHTLNRARFSVAIDSFLKTGRVPS 344

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
           ++E N +E +F +F + + K                        + LGS+  D   +   
Sbjct: 345 LQEGNMKENIF-SFPWLEDKP-----------------------IVLGSRFKDAFQDAGS 380

Query: 413 AHALFSLYEDEGYILTEH--GGKFCVVLKESALPQDMLKSLFQASYL 457
             ++  L+E E Y++T +   GK   +LK+ A   D+LK+ F A  L
Sbjct: 381 YLSIEPLFEKERYLVTYNPSKGKVYALLKDQAKSDDILKAAFHAHML 427


>gi|449502048|ref|XP_004161529.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 428

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 104/341 (30%), Positives = 179/341 (52%), Gaps = 32/341 (9%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA-- 177
           F+    P G+P SV+  YL Y+ +R++Q V S    VL+TQ+LL+A GL     PTAA  
Sbjct: 62  FLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLR----PTAAQA 117

Query: 178 -AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
             ++WVLKDG+ ++ K++ SN G   D  PK WR+ AD+L +   GLE+++P  PH F+ 
Sbjct: 118 TVVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFLE 177

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           +       +  A +   +TR   Y+ FA   N +++ AKGEA   +   +GI  G+ LA+
Sbjct: 178 MAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLAS 237

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
            I SS+   L +  +++ +H+YC ++  ++  + TLNP R +++ ++++ +G+ P   ++
Sbjct: 238 TICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPADI 297

Query: 357 NDEEPL-FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
             +E L FP                 L  EA +  V++   L    K S +V  K+    
Sbjct: 298 RYQEDLVFPG---------------RLIEEAGN--VKVGRALHEVIKPSKLVEMKQ---- 336

Query: 416 LFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
              ++  E ++L +      +VL+  A  +D L+    A+Y
Sbjct: 337 ---IFPGEKFVLNQSKKWVDMVLEHDASGEDALRGWLVAAY 374


>gi|449462449|ref|XP_004148953.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 428

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 104/341 (30%), Positives = 179/341 (52%), Gaps = 32/341 (9%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA-- 177
           F+    P G+P SV+  YL Y+ +R++Q V S    VL+TQ+LL+A GL     PTAA  
Sbjct: 62  FLNTFFPSGYPYSVSEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGLR----PTAAQA 117

Query: 178 -AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
             ++WVLKDG+ ++ K++ SN G   D  PK WR+ AD+L +   GLE+++P  PH F+ 
Sbjct: 118 TVVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRVIADVLYDLGAGLEVISPLCPHLFLE 177

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           +       +  A +   +TR   Y+ FA   N +++ AKGEA   +   +GI  G+ LA+
Sbjct: 178 MAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGIGAGLQLAS 237

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
            I SS+   L +  +++ +H+YC ++  ++  + TLNP R +++ ++++ +G+ P   ++
Sbjct: 238 TICSSIQGKLVAAPLLSIVHVYCVVEQMRATPINTLNPQRTAMIVADFVKAGRIPSPADI 297

Query: 357 NDEEPL-FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
             +E L FP                 L  EA +  V++   L    K S +V  K+    
Sbjct: 298 RYQEDLVFPGR---------------LIEEAGN--VKVGRALHEVIKPSKLVEMKQ---- 336

Query: 416 LFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
              ++  E ++L +      +VL+  A  +D L+    A+Y
Sbjct: 337 ---IFPGEKFVLNQSKKWVDMVLEHDASGEDALRGWLVAAY 374


>gi|255567274|ref|XP_002524618.1| conserved hypothetical protein [Ricinus communis]
 gi|223536171|gb|EEF37826.1| conserved hypothetical protein [Ricinus communis]
          Length = 436

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 108/408 (26%), Positives = 204/408 (50%), Gaps = 42/408 (10%)

Query: 111 KLWDECRELFVQFM-------LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           K+ D+ R +F + +        P G+P SV   YL Y+ +R++Q ++S    VL+TQ+LL
Sbjct: 54  KVIDDSRSMFHRVVESFQNKFFPSGYPYSVNEGYLRYTQFRALQHLSSAALSVLSTQSLL 113

Query: 164 YAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGL 223
           +A GL +     A A++WVLKDG+ ++ K++ SN G   D  PK WR+ AD+L +   GL
Sbjct: 114 FAAGL-RPTPAQATAVSWVLKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGL 172

Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
           E+L+P  PH F+ +       +  A +   +TR   Y+ FA   N +++ AKGEA   + 
Sbjct: 173 EVLSPLCPHLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLF 232

Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
             +G+ +GI LA+ + SS+   L    +++ +H++  ++  ++  + TLNP R +++ ++
Sbjct: 233 NVVGLGVGIQLASTVCSSIQGKLVVGPLLSILHVFSVVEEMRAAPVNTLNPQRTAMIVAD 292

Query: 344 YLLSGQ-APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK 402
           Y+ +G+ + P      E+ +FP                          +E    +++G  
Sbjct: 293 YVKAGKISSPADLRYREDLIFPG-----------------------RLIEDAGNVKVGRA 329

Query: 403 LSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLER 462
           L       +  H L  ++ +E ++L        +VL+++A  +D L+    A+Y   +E+
Sbjct: 330 LHKAFTPSK-VHELKDIFPEEKFLLNRGNTWTDMVLEQNASGEDALRGWLVAAYATSMEK 388

Query: 463 NAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
           ++ +        C P   L+ + + +   F+   S+  + GW TD  +
Sbjct: 389 SSHM--------CTP-SVLQDAYEKMNSTFDSFLSELQAKGWHTDRFL 427


>gi|226492898|ref|NP_001145431.1| uncharacterized protein LOC100278802 [Zea mays]
 gi|195656119|gb|ACG47527.1| hypothetical protein [Zea mays]
 gi|224035549|gb|ACN36850.1| unknown [Zea mays]
 gi|413945393|gb|AFW78042.1| hypothetical protein ZEAMMB73_132907 [Zea mays]
          Length = 511

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 106/349 (30%), Positives = 171/349 (48%), Gaps = 36/349 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           +E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ +  A
Sbjct: 115 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRA 174

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
              A AINW+LKDG G + K++ +  G+ FD + K  R   DLL     G+E+ T AFP 
Sbjct: 175 ASGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 234

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG 
Sbjct: 235 LFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGT 290

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            ++  I  S P  + SF+ ++  ++  +    +S+ L TLN  R ++    ++ +G  P 
Sbjct: 291 GMSILISKSNPSLVTSFAFLSCGYLLSSYHEVRSVVLNTLNRARFTVAVDSFIRTGYVPS 350

Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           +K+ N +E +F  P  H                             + +GS+  D     
Sbjct: 351 LKDGNSQETIFNPPWRH---------------------------EPVAIGSRFGDAFQEP 383

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               A+  L+EDE YI+T +  K  V  +LK+ A P D+LK+ F A  L
Sbjct: 384 ASFLAIKPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 432


>gi|222619643|gb|EEE55775.1| hypothetical protein OsJ_04341 [Oryza sativa Japonica Group]
          Length = 508

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 103/349 (29%), Positives = 172/349 (49%), Gaps = 36/349 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
           +E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ +  +
Sbjct: 116 EEVMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKV 175

Query: 174 -PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            P A AINW+LKDG G + K++ +  G+ FD + K  R  +DLL     G+E+ T AFP 
Sbjct: 176 TPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQ 235

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG 
Sbjct: 236 FFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGT 291

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+  I    P  + SF+ ++  +++ +    +S+ L TLN  R ++    ++ +G  P 
Sbjct: 292 GLSIFISKRNPSLVTSFAFLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPS 351

Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           +KE N +E +F  P  H                             + +GS+  +     
Sbjct: 352 LKEGNSQETIFNPPWRH---------------------------EPVAIGSRFGEAFQEP 384

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               A+  L+EDE Y++T +  K  V  +LK+ A   D++K+ F A  L
Sbjct: 385 ASFVAIRPLFEDERYMVTYNPAKDKVYALLKDQAKSDDIIKAAFHAHVL 433


>gi|56784316|dbj|BAD82242.1| unknown protein [Oryza sativa Japonica Group]
 gi|56785239|dbj|BAD82127.1| unknown protein [Oryza sativa Japonica Group]
          Length = 508

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 103/349 (29%), Positives = 172/349 (49%), Gaps = 36/349 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
           +E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ +  +
Sbjct: 116 EEVMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKV 175

Query: 174 -PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            P A AINW+LKDG G + K++ +  G+ FD + K  R  +DLL     G+E+ T AFP 
Sbjct: 176 TPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELATAAFPQ 235

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG 
Sbjct: 236 FFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGT 291

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+  I    P  + SF+ ++  +++ +    +S+ L TLN  R ++    ++ +G  P 
Sbjct: 292 GLSIFISKRNPSLVTSFAFLSCGYLWSSYHEVRSVVLNTLNRARFTVAVDSFIKTGHIPS 351

Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           +KE N +E +F  P  H                             + +GS+  +     
Sbjct: 352 LKEGNSQETIFNPPWRH---------------------------EPVAIGSRFGEAFQEP 384

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               A+  L+EDE Y++T +  K  V  +LK+ A   D++K+ F A  L
Sbjct: 385 ASFVAIRPLFEDERYMVTYNPAKDKVYALLKDQAKSDDIIKAAFHAHVL 433


>gi|449452843|ref|XP_004144168.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
           [Cucumis sativus]
          Length = 437

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 108/350 (30%), Positives = 176/350 (50%), Gaps = 31/350 (8%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPT 175
           R  F  F +PEGFP SVT DY+ + +W S+QG+++ I  +L+TQALL AIG+G + A   
Sbjct: 49  RRFFDAF-VPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVI 107

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
            A   W L+D  G L  I+ + + G + D N K WRL AD + +    +++++P FP  F
Sbjct: 108 GATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLVSPLFPSAF 167

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
           + I       RS   +   +TR+     FA + N A++ AK  +Q  V+  IG+ +G+ L
Sbjct: 168 MLIVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGMLL 227

Query: 295 ANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           A  I    P A+  SF  +T  HMY N K+   + L +LNP R S+    Y+ + +    
Sbjct: 228 A-RITMGCPLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLSP 286

Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
           ++V+  E + PA+                SS +  +   I  R++LG+++S +  +  + 
Sbjct: 287 QQVSMMEHVLPAWA---------------SSWSLRSIKLIHTRVRLGARISSL--HHTEM 329

Query: 414 HALF---------SLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
            A F         S Y+   Y+L E      V++++ A   D+ +S   A
Sbjct: 330 XAFFNQLMNFVKSSGYKGAKYLLVERKETISVIVQKDATANDIFQSFVHA 379


>gi|326527127|dbj|BAK04505.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 114/377 (30%), Positives = 183/377 (48%), Gaps = 29/377 (7%)

Query: 88  FTKDAFVVASASNASLSSLLSVNKL---WDECRELFVQFMLPEGFPDSVTSDYLNYSLWR 144
            ++ A + ASAS+      L+ N+    W       +   +PEGFP SVT DY+ + +W 
Sbjct: 33  LSRTAVLTASASS------LTSNRFGSRWGRIGSRVLGAFVPEGFPGSVTPDYVPFQMWD 86

Query: 145 SVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHF 202
           ++QG+++ I  +L+TQALL AIG+G K A    A   W L+D  G L  I+ + + G + 
Sbjct: 87  TLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFLRDLTGMLGGILFAFYQGSNL 146

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAG 262
           D N K WRL AD + +    +++L+P FP   + I       RS   +   +TR+     
Sbjct: 147 DSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQH 206

Query: 263 FAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLK 322
           FA   N A++ AK  +Q  ++   G+ LG+ LA+           SF  +T  HMY N K
Sbjct: 207 FALANNAADISAKEGSQETLATMSGMGLGMLLAHVTRGHDLVVWVSFLSLTVFHMYANYK 266

Query: 323 SYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVL 382
           + QS+ L TLN  R S++   ++  G+    ++V+ +E + P +                
Sbjct: 267 AVQSLSLSTLNYERTSILLQYFMEHGEVLTPQQVSKQEHILPFW---------------- 310

Query: 383 SSEAKDAAVEIEHRL-QLGSKLSDVVNNKEDAHA-LFSLYEDEGYILTEHGGKFCVVLKE 440
           SS  K   V++ H L  LG+K S + ++     A   S Y +  Y L +  G  C+ + +
Sbjct: 311 SSWRKLLRVKLPHELVNLGAKASMLTHSDMLLIAKTRSYYTNANYFLLDKDGSVCIFIHK 370

Query: 441 SALPQDMLKSLFQASYL 457
            A+  D+LKS      L
Sbjct: 371 QAVATDVLKSFIHGLVL 387


>gi|226502346|ref|NP_001144348.1| uncharacterized protein LOC100277253 [Zea mays]
 gi|195640526|gb|ACG39731.1| hypothetical protein [Zea mays]
          Length = 503

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 103/349 (29%), Positives = 172/349 (49%), Gaps = 36/349 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
           +E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ +  +
Sbjct: 105 EEIMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKV 164

Query: 174 -PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            P A AINW+LKDG G + K++ +  G+ FD + K  R  +DLL     G+E+ T AFP 
Sbjct: 165 TPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELTTAAFPQ 224

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG 
Sbjct: 225 FFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGT 280

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+  I    P  + SF+ ++  ++  + +  +S+ L TLN  R ++    ++ +G  P 
Sbjct: 281 GLSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPS 340

Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           +KE N +E +F  P  H                             + +GS+  +     
Sbjct: 341 LKEGNSQETIFNPPWRH---------------------------EPVAIGSRFGEAFQEP 373

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               A+  L+EDE Y++T +  K  V  +LK+ A   D++K+ F A  L
Sbjct: 374 ASFLAIRPLFEDERYMVTYNPTKDKVYALLKDQAKSDDIIKAAFHAHVL 422


>gi|414879331|tpg|DAA56462.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
 gi|414879332|tpg|DAA56463.1| TPA: hypothetical protein ZEAMMB73_966463 [Zea mays]
          Length = 503

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 103/349 (29%), Positives = 172/349 (49%), Gaps = 36/349 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
           +E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ +  +
Sbjct: 105 EEIMAFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLSSVGVSQSKV 164

Query: 174 -PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            P A AINW+LKDG G + K++ +  G+ FD + K  R  +DLL     G+E+ T AFP 
Sbjct: 165 TPGAIAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSSDLLLEIGAGIELTTAAFPQ 224

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG 
Sbjct: 225 FFLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGT 280

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+  I    P  + SF+ ++  ++  + +  +S+ L TLN  R ++    ++ +G  P 
Sbjct: 281 GLSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPS 340

Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           +KE N +E +F  P  H                             + +GS+  +     
Sbjct: 341 LKEGNSQETIFNPPWRH---------------------------EPVAIGSRFGEAFQEP 373

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               A+  L+EDE Y++T +  K  V  +LK+ A   D++K+ F A  L
Sbjct: 374 ASFLAIRPLFEDERYMVTYNPTKDKVYALLKDQAKSDDIIKAAFHAHVL 422


>gi|302756337|ref|XP_002961592.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
 gi|300170251|gb|EFJ36852.1| hypothetical protein SELMODRAFT_77235 [Selaginella moellendorffii]
          Length = 399

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 97/299 (32%), Positives = 162/299 (54%), Gaps = 17/299 (5%)

Query: 111 KLWDECREL-------FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           +L D+ R L       F+    P G+P SV   YL YS +R++Q  +S I  V +TQ+L+
Sbjct: 27  RLLDDNRPLKHKLVDGFLDQFFPVGYPQSVGEGYLTYSQFRALQHFSSAILSVFSTQSLV 86

Query: 164 YAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGL 223
           +A GL +     A  I+WVLKDG+ ++ K++ S  G   D  PK WR+FAD+L +A   L
Sbjct: 87  FAAGL-RPTPAQATVISWVLKDGMQHIGKLVGSRMGARMDSEPKRWRIFADVLYDAGTAL 145

Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
           E+++PA P +F+ +   A   +  A +   +TR   Y  FA   N +++ AKGEA   ++
Sbjct: 146 EIISPACPQYFLGVAGLANLAKGVAMVTARATRLPLYTAFAKEGNLSDLYAKGEAMSTLA 205

Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
              G+ +GI LA+   +S+   L    +++ IH+Y  ++  +++ L TLN  R +++ ++
Sbjct: 206 NVAGLGVGIRLASSACTSLQGKLLVAPLLSVIHLYSVMEEMRAVPLDTLNAQRTAMLVAD 265

Query: 344 YLLSGQAPPVKEVNDEE----PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQ 398
           YL  G+ P   E+   E    P+ P  H   ++ AN   L + S + +    E++HR Q
Sbjct: 266 YLELGRVPSPAEIRYRERLILPVKPNVHAGSVRIANDI-LQIPSGKIR----ELKHRFQ 319


>gi|348675633|gb|EGZ15451.1| hypothetical protein PHYSODRAFT_546234 [Phytophthora sojae]
          Length = 408

 Score =  162 bits (409), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 101/313 (32%), Positives = 165/313 (52%), Gaps = 5/313 (1%)

Query: 29  SGSFYSLLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDF 88
           +GS +   L  PS+         ++  +    +S+D+G      ++ V+G    +   + 
Sbjct: 16  AGSPHRSALLRPSVCIASTSSSSSSTSSPAPIASQDNGFARGRTIYRVEGDGSQRFHVEK 75

Query: 89  TKDAFVV----ASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWR 144
           T+ A  +    A    ++ +SL +  +      ++     LP+    S+T DY  Y+ W 
Sbjct: 76  TRGALGLWTHKAEGLRSAPTSLAAARRTGSWLYDVLWDLFLPKDARASLTKDYFPYAKWS 135

Query: 145 SVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
            V  VAS  + VL+ Q+LLYAIGLG GAIPTAAA+NWVLKDG+G    ++ ++     +D
Sbjct: 136 FVGSVASCAASVLSMQSLLYAIGLGAGAIPTAAAVNWVLKDGLGQFGGVLFASIVNNRYD 195

Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
            +PK WR+ + +  + +   E+LTP  P  F+ I + A   ++ A L  ++TR+ F+  F
Sbjct: 196 ADPKRWRVASSVALDVSVLGEILTPLAPGSFLAIASLANVAKNVAWLSASATRAGFHNSF 255

Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
           A R N A+V AK  +Q + S   G  LGI ++   GSS     A+F+V++ +H+    KS
Sbjct: 256 AIRENLADVTAKAGSQAIASSIFGTGLGILISQFTGSSTLNVAAAFAVLSTVHLISTYKS 315

Query: 324 YQSIELRTLNPYR 336
              ++LRTLN  R
Sbjct: 316 VDCVQLRTLNCQR 328


>gi|242036507|ref|XP_002465648.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
 gi|241919502|gb|EER92646.1| hypothetical protein SORBIDRAFT_01g042950 [Sorghum bicolor]
          Length = 440

 Score =  162 bits (409), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 107/348 (30%), Positives = 176/348 (50%), Gaps = 32/348 (9%)

Query: 113 WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KG 171
           W +     +   +PEGFP SVT DY+ + +W ++QG+++ I  +L+TQALL AIG+G + 
Sbjct: 52  WGQIGSRMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQS 111

Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
           A    A   W L+D  G L  I+ + + G + D N K WRL AD + +    +++L+P F
Sbjct: 112 ATVIGATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLF 171

Query: 231 PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
           P   + I       RS   +   +TR+     FA  +N A++ AK  +Q  ++  IG+ L
Sbjct: 172 PSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMIGMGL 231

Query: 291 GIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
           G+ LA HI      ++  SF  +T  HMY N K+ QS+ L TLN  RAS++   +   G 
Sbjct: 232 GMLLA-HITRGHALSVWTSFLSLTVFHMYANYKAVQSLSLTTLNYERASILLQYFKECGD 290

Query: 350 APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLGSKLSDVVN 408
               ++V+ +E + P +                S+  K   +++ H R+ LG+K S + +
Sbjct: 291 VLAPQKVSQQEHILPFW----------------STWRKLNKIKLPHERVHLGAKASMLTH 334

Query: 409 N------KEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKS 450
           +      K  +H     YE+  Y+L    G   + + + A P D+L+S
Sbjct: 335 SDMLVIAKTRSH-----YENTNYLLLNKQGSVYIFIHKLATPADVLRS 377


>gi|388508448|gb|AFK42290.1| unknown [Lotus japonicus]
          Length = 414

 Score =  161 bits (408), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 108/368 (29%), Positives = 188/368 (51%), Gaps = 26/368 (7%)

Query: 99  SNASLSSLL--SVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGV 156
           SN  LS +L      L+ +    F+    P G+P SV   YL Y+ +RSVQ V S    V
Sbjct: 25  SNGQLSVMLVDDSRPLYQKVAGSFMNKFFPSGYPYSVNEGYLRYTQFRSVQHVTSAALSV 84

Query: 157 LATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLL 216
           L+TQ+LL+A GL +     A A++W+LKDG+ ++ K++ SN+G   D  PK WRL AD+L
Sbjct: 85  LSTQSLLFAAGL-RPTPAQATAVSWILKDGMQHVGKLICSNWGARMDSEPKRWRLIADVL 143

Query: 217 ENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKG 276
            +   GLE+L+P  PH F+ +       +  A +   +TR   Y+ FA   N +++ AKG
Sbjct: 144 YDLGTGLEVLSPLCPHLFLQMAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKG 203

Query: 277 EAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
           EA   +   IGI +GI LA+ I +S+   +    +++ +H+Y   +  ++  + TLNP R
Sbjct: 204 EAFSTLCNVIGIGVGIQLASTICASVQGKMVVGPLLSILHIYSVSEEMRATPINTLNPRR 263

Query: 337 ASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHR 396
            +++ +++L +G      ++   E L      F+++         L+ +A +        
Sbjct: 264 TAMIVTDFLKAGVVSSPADLRYREDL-----LFRVR---------LTEDAGN-------- 301

Query: 397 LQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
           +++G  L +V+        L  +   E ++L   G    +VL++ A  +D L+    A+Y
Sbjct: 302 VRVGRALHEVIKPSRLLE-LEQVLPGEKFLLNRGGKCVDMVLEQDASGEDALRGWLVAAY 360

Query: 457 LYWLERNA 464
              +E ++
Sbjct: 361 AAQIENSS 368


>gi|224082810|ref|XP_002306848.1| predicted protein [Populus trichocarpa]
 gi|222856297|gb|EEE93844.1| predicted protein [Populus trichocarpa]
          Length = 428

 Score =  161 bits (407), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 111/394 (28%), Positives = 197/394 (50%), Gaps = 35/394 (8%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           E F+    P G+P SV   YL Y+ +R++Q  +S    VL+TQ+LL+A GL +     A 
Sbjct: 60  ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 118

Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           A++W+LKDG+ +  K++ SN G   D  PK WR+ AD+L +   GLE+L+P  PH F+ +
Sbjct: 119 AVSWILKDGMQHAGKLICSNLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCPHLFLEV 178

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
                  +  A +   +TR   Y+ FA   N +++ AKGEA   +   +G+ +GI LA+ 
Sbjct: 179 AGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVLGLGVGIQLAST 238

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-APPVKEV 356
           + SSM     +  +++ +H+ C ++  ++  + TLNP R ++V ++++ +G+ + P    
Sbjct: 239 VCSSMQGKFVAGPLLSIVHVCCVIEEMRATPVNTLNPQRTAMVVADFVKTGKISSPADLR 298

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
             E+ LFP        +    Q L              HR    SKL +          L
Sbjct: 299 YHEDLLFPGRLIENAGNVKVGQAL--------------HRAVRPSKLRE----------L 334

Query: 417 FSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCA 476
             ++  E +IL+       +VL+++A  +D L++   A+Y   +++++     STS    
Sbjct: 335 KEIFPGEKFILSPGNKWTDLVLEQNASGEDALRAWLVAAYASSMKKSS---HESTSVT-- 389

Query: 477 PGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
               L+ + + +   F+   S+  + GW TD  +
Sbjct: 390 ----LQDAYEKMNSVFDPFLSELQAKGWHTDRFL 419


>gi|47217805|emb|CAG07219.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 412

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 88/250 (35%), Positives = 136/250 (54%), Gaps = 2/250 (0%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
           F    LP+G+PDSV+SDYL Y  W ++Q  +S +SG LAT+A L  +G+G + A   AA 
Sbjct: 75  FQSVFLPQGYPDSVSSDYLQYQFWDTLQAFSSSLSGTLATRASLRGVGVGNQQATVAAAT 134

Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
             W+LKDG G   +I+ +   G   D   K WRLFAD+L + A  +++L P FP  F  I
Sbjct: 135 TTWLLKDGTGMCGRILFAWLKGTKLDSEAKKWRLFADILNDIAMFMDILAPYFPAFFTLI 194

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
              +G  +S   +   +TR+      A R N A++ AK  +Q  +    G+++ +AL + 
Sbjct: 195 MCVSGVFKSLVGVAGGATRAALTVHQARRDNMADISAKDGSQETLVNLAGLLVSLALVSF 254

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           +  +    L+ F + T +H++ N K+ +S+ + TLN  R S+V  EYL  G+     E N
Sbjct: 255 VADNAALTLSLFFLFTVLHLFANYKAVRSVVMETLNEARLSIVLQEYLKDGRMLSPLEAN 314

Query: 358 DEEPLFPAFH 367
             EP+   F 
Sbjct: 315 QREPVLMEFK 324


>gi|242090561|ref|XP_002441113.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
 gi|241946398|gb|EES19543.1| hypothetical protein SORBIDRAFT_09g020620 [Sorghum bicolor]
          Length = 517

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 105/349 (30%), Positives = 172/349 (49%), Gaps = 36/349 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           +E       +++PEGFP+SVT  Y+ Y  WR+++       GV  T+ LL ++G+ +  A
Sbjct: 121 EEIMSFIRSYVVPEGFPESVTPSYVPYMTWRAMKHFFGGAMGVFTTRTLLNSVGVSQSRA 180

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
              A AINW+LKDG G + K++ +  G+ FD + K  R   DLL     G+E+ T AFP 
Sbjct: 181 ASGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 240

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG 
Sbjct: 241 LFLPMACIANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGT 296

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            ++  I  S P  + SF+V++  ++  +    +S+ L TLN  R ++    ++ +G  P 
Sbjct: 297 GMSILISKSNPSLVTSFAVLSCGYLLSSYYEVRSVVLNTLNRARFTVAVDSFIRTGYVPS 356

Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           +K+ N +E +F  P  H                             + +GS+  +     
Sbjct: 357 LKDGNSQETIFNPPWRH---------------------------EPVAIGSRFGEAFQEP 389

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               A+  L+EDE YI+T +  K  V  +LK+ A P D+LK+ F A  L
Sbjct: 390 ASFIAIKPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 438


>gi|326490149|dbj|BAJ94148.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 519

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 104/349 (29%), Positives = 171/349 (48%), Gaps = 36/349 (10%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           +E       +++PEGFPDSVT  Y+ Y  WR+++        V  T+ALL ++G+ +  A
Sbjct: 119 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQSRA 178

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
              A AINW+L+DG G + K++ +  G+ FD + K  R   DL+     G+E+ T AFP 
Sbjct: 179 TSGAVAINWILRDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLMMELGAGIELATAAFPQ 238

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++  +G  + I
Sbjct: 239 LFLPLACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIADLLGTGMSI 298

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            ++       P  +ASF+V++  ++  + +  +S+ L TLN  R ++    ++ +G  P 
Sbjct: 299 LMSKR----NPSLVASFAVLSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFIKTGHVPS 354

Query: 353 VKEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           +KE N EE +F  P  H                             + +GS+  +     
Sbjct: 355 LKEGNLEETIFNPPWRH---------------------------QPVAIGSRFGEAFQEP 387

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
               A   L+EDE YI+T +  K  V  +LK+ A   D+LK+ F A  L
Sbjct: 388 ASFVATRPLFEDERYIVTYNPAKDKVYALLKDQAKQDDILKAAFHAHVL 436


>gi|8920582|gb|AAF81304.1|AC027656_21 Contains similarity to CG10338 gene product from Drosophila
           melanogaster gb|AE003660 [Arabidopsis thaliana]
          Length = 438

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 200/406 (49%), Gaps = 40/406 (9%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
           F+ F  P GFP SVT DY+ + LW ++QG+++    +L+TQALL AIG+G K A    A 
Sbjct: 52  FLGFSSP-GFPGSVTPDYVGFQLWDTLQGLSTYTKMMLSTQALLSAIGVGEKSATVIGAT 110

Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
             W L+D  G L  I+ + + G + D N K WRL ADL+ +    +++L+P FP  F+ +
Sbjct: 111 FQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVADLMNDIGMLMDLLSPLFPSAFIVV 170

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
                  RS   +   +TR+     FA + N A++ AK  +Q  ++  +G+ LG+ LA  
Sbjct: 171 VCLGSLSRSFTGVASGATRAALTQHFALQDNAADISAKEGSQETMATMMGMSLGMLLA-R 229

Query: 298 IGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
             S  P A+  SF  +T  HMY N ++ + + L +LN  R+S++ + ++ +GQ    ++V
Sbjct: 230 FTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSLNFERSSILLTHFIQTGQVLSPEQV 289

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
           +  E + P +    ++S N   L                R+QLG ++S +   + D   L
Sbjct: 290 SSMEGVLPLWA-TSLRSTNSKPL--------------HKRVQLGVRVSSL--PRLDMLQL 332

Query: 417 F-----SLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATST 471
                 S Y++  Y+L    G   V+L + + P D+LKS   A  L      A ++  ST
Sbjct: 333 LNGVGASSYKNAKYLLAHIKGNVSVILHKDSKPADVLKSYIHAIVL------ANLMEKST 386

Query: 472 SADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNR 517
           S              ++ + ++ +     S GW T+ L++  +  R
Sbjct: 387 SFYSEGEA-------WIDKHYDELLHKLRSGGWKTERLLSPSITWR 425


>gi|170037702|ref|XP_001846695.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167880979|gb|EDS44362.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 417

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 107/346 (30%), Positives = 166/346 (47%), Gaps = 26/346 (7%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
           +  F    LP G+PDSV+SDYL Y  W +VQ   S ISG L T A+L  +G+G   A P 
Sbjct: 46  KRFFQNLFLPAGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPL 105

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLT-PAFPHH 233
           +A I WVLKDG+G+  +I+ + + G   D++ K WR+ AD+L + A G+++   P +P  
Sbjct: 106 SATITWVLKDGMGHFGRILFAWWKGTELDIDSKKWRIRADILNDIAMGVDLFVLPLYPKV 165

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
             +I  A    ++   +   +TRS      A R N A+V +K  AQ      I   +G+ 
Sbjct: 166 STYILCATSTMKAIVGVAGGATRSALTQHHALRGNLADVASKDSAQETCVNLIASFVGLG 225

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           L  H+ +        F  VT +H+Y N+K+ +++ LRT N  R  +   EY  SG     
Sbjct: 226 LLTHVQNQRVL-YGLFLFVTLLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGTMLSP 284

Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
           ++VN            K++     Q + L++  K      E        L+    N  D 
Sbjct: 285 QQVN------------KLERVTIGQTVSLTARVKIGCSARE--------LAQYYRNCYDL 324

Query: 414 HALFSLYED-EGYILTEHGGKFCVVLKESALPQDMLKSLFQ-ASYL 457
             + + ++  E ++L E      V L  +  P D++KS F  ASYL
Sbjct: 325 ENVIACFDSREKFLLAETRHYVGVYLHFTVKPLDIIKSYFYVASYL 370


>gi|357164706|ref|XP_003580140.1| PREDICTED: UPF0420 protein C16orf58 homolog [Brachypodium
           distachyon]
          Length = 437

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 110/412 (26%), Positives = 204/412 (49%), Gaps = 38/412 (9%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
           D+  + F+    P G+P SV   YL Y+ +R++Q  +S +  VL+TQ+LL+A GL +   
Sbjct: 61  DKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGL-RPTP 119

Query: 174 PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
             A A++W+LKDG+ +  K++ S  G   D  PK WR+FAD+L +    L+ ++P  P  
Sbjct: 120 AQATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRIFADVLYDFGTALDFISPLCPQL 179

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
           F+ +       +  A +   +TR   Y+ FA   N +++ AKGEA   +S  +GI  GI 
Sbjct: 180 FLEVAGLGNFAKGMAVVAARATRLPIYSAFAKEGNLSDLFAKGEAISTLSNVMGIGAGIG 239

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-APP 352
           L++ + S+    L    +++ +H+Y  ++  ++I + +LNP R ++V ++++ SG+ + P
Sbjct: 240 LSSTVCSTTQGKLIIGPLLSAVHLYGVVQEMRAIPINSLNPQRTAMVVADFIKSGKVSSP 299

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
            +    E+ LFP                       +  +E    +++G  L  ++ + + 
Sbjct: 300 AELRYREDLLFP-----------------------NRLIEEAGSVKVGQPLCRIL-SPQR 335

Query: 413 AHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTS 472
              L + Y +E ++L+    K  +VL++SA  +D L+    A++   +E++   V +  S
Sbjct: 336 VEQLRATYPNEKFLLSLKSNKTYMVLEQSATGEDALRGWLVAAFASEMEKSG--VGSRDS 393

Query: 473 ADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPGYVE 524
                   L  +   ++  F    S+  S GW TD  +     NR R  Y +
Sbjct: 394 V-------LNEAYQKMEDVFPMFVSEVRSRGWYTDQFLD---GNRSRIAYAK 435


>gi|449018940|dbj|BAM82342.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 738

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 91/235 (38%), Positives = 134/235 (57%), Gaps = 3/235 (1%)

Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
            P+ VT+DY  ++ WR  Q + S   GV  TQALL A+G+  G  I  AA I+WVLKDG+
Sbjct: 379 IPEDVTADYYEFAKWRVFQRLMSSTVGVFGTQALLLALGIKSGNKISQAATISWVLKDGL 438

Query: 188 GYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
             + K++  S  GR  D +PK +R  + LL +   GLE+ T   P  F+ +   A   + 
Sbjct: 439 SRVGKMLWASQLGRDMDADPKRFRFGSALLYSLGNGLEIATRIVPQSFLLVATLANTAKL 498

Query: 247 AAALIQASTRSCFYAGFAAR-RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFA 305
            + L  ++TR+  Y  FA R  N A++ AKGEAQ  ++   G+ LGI L+  IG+S P  
Sbjct: 499 CSMLTASATRNAMYRSFADRNENIADITAKGEAQITIADLGGMALGIQLSKIIGTSRPKV 558

Query: 306 LASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
           +A++ +++ I +Y   K  + IE RTLN  R+SLV   YL +G  P  K+V+  E
Sbjct: 559 VATYLILSTIDLYAIWKELRVIEFRTLNLQRSSLVIEHYLKTGCVPTPKQVSAAE 613


>gi|224129512|ref|XP_002328735.1| predicted protein [Populus trichocarpa]
 gi|222839033|gb|EEE77384.1| predicted protein [Populus trichocarpa]
          Length = 398

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 110/338 (32%), Positives = 172/338 (50%), Gaps = 31/338 (9%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINW 181
           +++PEGFPDSV   Y+ Y  WR+++       GV  T+ LL ++G+ K  AIP A AINW
Sbjct: 12  YVVPEGFPDSVIPSYVPYMTWRALKHFFGGAMGVFTTKTLLNSVGVSKSQAIPGAVAINW 71

Query: 182 VLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
           +LKDG G + K++ +  G+ FD + K  R   DLL      +E+ T A PH F+ +  AA
Sbjct: 72  ILKDGAGRVGKMLYARQGKKFDYDLKQLRFAGDLLMELGAAVELATAAVPHLFLPLACAA 131

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              ++ AA+   STR+  Y  FA   N  +V AKGE  G V+     +LG  L+  I   
Sbjct: 132 NVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNVAD----LLGTGLSIMISKR 187

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
            P  + +F++++  ++  + +  +S+ L TLN  R S+    +L +GQ P + E N +E 
Sbjct: 188 NPSLVTTFALLSCGYVLSSYQEVKSVVLHTLNGARFSVAVESFLKTGQVPSLHEGNVKEN 247

Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYE 421
           +F  F + K +S                       + LG +  D   +     A+  L+E
Sbjct: 248 IF-NFPWLKDRS-----------------------ISLGPRFKDAFQDPRAYLAIEPLFE 283

Query: 422 DEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
            E YI+T +  K  V  +LK+ A P D++K+ F A  L
Sbjct: 284 RERYIVTYNPSKDKVYALLKDQAKPDDIVKAAFHAHVL 321


>gi|58392188|ref|XP_319167.2| AGAP010020-PA [Anopheles gambiae str. PEST]
 gi|55236239|gb|EAA13864.2| AGAP010020-PA [Anopheles gambiae str. PEST]
          Length = 417

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 105/346 (30%), Positives = 168/346 (48%), Gaps = 26/346 (7%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PT 175
           +  F    LP G+PDSV+SDYL+Y  W ++Q   S ISG L T A+L  +G+G  A  P 
Sbjct: 46  KRFFQHLFLPVGYPDSVSSDYLSYQKWDTLQAFCSTISGTLTTHAILKGVGVGSDAANPL 105

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLT-PAFPHH 233
           +A I WVLKDG G+L +I+ + + G   D++ K WR+ AD+L + A  +++   P +P  
Sbjct: 106 SATITWVLKDGTGHLGRILFAWWKGSELDIDSKKWRIRADILNDVAMAIDLFVLPYYPKA 165

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
             +I  A    ++   +   +TRS      A R N A+V +K  AQ      I   +G+ 
Sbjct: 166 STYILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLF 225

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           L  ++  +       F+ VT IH+Y N+K+ +++ LRT N  R  +   EY  SG     
Sbjct: 226 LLTYL-QNQKVLYGLFAFVTLIHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSP 284

Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
           ++VN            K++     Q + L++  K      E        L++   N  D 
Sbjct: 285 QQVN------------KLERVTVGQTVTLTARVKIGCSVRE--------LTEFYRNCYDL 324

Query: 414 HALFSLYED-EGYILTEHGGKFCVVLKESALPQDMLKSLFQ-ASYL 457
             L + ++  + +++ E      V L  +  P D++KS F  ASYL
Sbjct: 325 ENLIACFDSRDKFLIAETRHYVGVYLHFTVKPLDIIKSYFYVASYL 370


>gi|115459438|ref|NP_001053319.1| Os04g0517300 [Oryza sativa Japonica Group]
 gi|113564890|dbj|BAF15233.1| Os04g0517300 [Oryza sativa Japonica Group]
 gi|116310213|emb|CAH67223.1| OSIGBa0145M07.5 [Oryza sativa Indica Group]
 gi|215697660|dbj|BAG91654.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 110/429 (25%), Positives = 209/429 (48%), Gaps = 38/429 (8%)

Query: 98  ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVL 157
           A   S+  +     + D+  + F+    P G+P SV   YL Y+ +R++Q  +S +  VL
Sbjct: 41  AGRVSVKVIQDSRPIHDKMIDSFLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVL 100

Query: 158 ATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLE 217
           +TQ+LL+A GL +     A A++W+LKDG+ +  K++ S  G   D  PK WR+ AD+L 
Sbjct: 101 STQSLLFAAGL-RPTPAQATAVSWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLY 159

Query: 218 NAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGE 277
           +    LE+++P  P  F+ +       +  A +   +TR   Y+ FA   N +++ AKGE
Sbjct: 160 DFGTALEVISPLCPQLFLEVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGE 219

Query: 278 AQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRA 337
           A   +   +GI  GI LA+ + S+    L +  +++ +H+Y  ++  ++  + TLNP R 
Sbjct: 220 AISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLNPQRT 279

Query: 338 SLVFSEYLLSGQ-APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHR 396
           +++ ++++ SG+ + P +    E+ LFP                       +  +E    
Sbjct: 280 AMIVADFIKSGKVSSPAELRYREDLLFP-----------------------NRLIEEAGS 316

Query: 397 LQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
           +++G  +  V+ + +    L + +  E ++L+       +VL++SA  +D L+    A++
Sbjct: 317 VKIGQPVRRVL-SPQRIEQLKATFSKEKFLLSRKDNSAYMVLEQSATGEDALRGWLVAAF 375

Query: 457 LYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPN 516
              +ER +G+ +  T         L ++ + ++  F    ++  S GW TD  +     N
Sbjct: 376 ASEMER-SGVGSGDTV--------LNVAYERMENVFPMFVAEVKSRGWYTDQFLD---GN 423

Query: 517 RIRPGYVEP 525
           R R  Y  P
Sbjct: 424 RSRIAYANP 432


>gi|355710161|gb|EHH31625.1| hypothetical protein EGK_12728 [Macaca mulatta]
          Length = 468

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNAS--L 103
            C  Q  +  AR   ++ D   +     W   G  R   +    +DA  V +   AS  L
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPRAASPPL 70

Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L
Sbjct: 71  SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116

Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
             IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + A 
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE++ P +P  F    + +   +   ++   +TR+      A R N A+V AK  +Q  
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +    G+++ + +   +     F+L  F  +T +H+Y N ++ +S+ + TLN  R  LV 
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVL 296

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
             YL  G+       N  EPL+  F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321


>gi|302564091|ref|NP_001181525.1| UPF0420 protein C16orf58 [Macaca mulatta]
 gi|380815644|gb|AFE79696.1| hypothetical protein LOC64755 [Macaca mulatta]
          Length = 468

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNAS--L 103
            C  Q  +  AR   ++ D   +     W   G  R   +    +DA  V +   AS  L
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPRAASPPL 70

Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L
Sbjct: 71  SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116

Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
             IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + A 
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE++ P +P  F    + +   +   ++   +TR+      A R N A+V AK  +Q  
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +    G+++ + +   +     F+L  F  +T +H+Y N ++ +S+ + TLN  R  LV 
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVL 296

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
             YL  G+       N  EPL+  F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321


>gi|414586417|tpg|DAA36988.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 439

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 107/400 (26%), Positives = 200/400 (50%), Gaps = 34/400 (8%)

Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
           + D+  + F+    P G+P SV   YL Y+ +R +Q  +S +  VL+TQ+LL+A GL + 
Sbjct: 60  IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL-RP 118

Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
               A A++W+LKDG+ +  K++ S+ G   D  PK WR+ AD+L +    LE+++P  P
Sbjct: 119 TPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCP 178

Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
             F+ +       +  A +   +TR   Y+ FA   N +++ AKGEA   +   +GI  G
Sbjct: 179 QLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAG 238

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-A 350
           I LA+ + S+    L +  +++ +H+Y  ++  ++  + TLNP R +++ ++++ SG+ +
Sbjct: 239 IGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVS 298

Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
            P +    E+ LFP                       +  +E    +++G  L  V + +
Sbjct: 299 SPAELRYKEDLLFP-----------------------NRVIEEAGSVKIGQPLRKVFSPR 335

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATS 470
                L + +  E ++L++   +  +VL+ SA  +D L+    A++   +ER+      +
Sbjct: 336 L-VEQLRATFPGEKFLLSQKSNRTYMVLERSASGEDALRGWLVAAFASEMERSG-----A 389

Query: 471 TSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
            S D A    L  + + ++R F    S+  S GW TD  +
Sbjct: 390 GSRDDAV---LSEAYEKMERVFPTFVSEVRSRGWYTDQFL 426


>gi|402908238|ref|XP_003916859.1| PREDICTED: UPF0420 protein C16orf58 homolog [Papio anubis]
          Length = 468

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNAS--L 103
            C  Q  +  AR   ++ D   +     W   G  R   +    +DA  V +   AS  L
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPRAASPPL 70

Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L
Sbjct: 71  SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116

Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
             IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + A 
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE++ P +P  F    + +   +   ++   +TR+      A R N A+V AK  +Q  
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +    G+++ + +   +     F+L  F  +T +H+Y N ++ +S+ + TLN  R  LV 
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVL 296

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
             YL  G+       N  EPL+  F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321


>gi|332262940|ref|XP_003280516.1| PREDICTED: UPF0420 protein C16orf58 homolog [Nomascus leucogenys]
          Length = 468

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 105/325 (32%), Positives = 159/325 (48%), Gaps = 18/325 (5%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
            C  Q  +  AR   ++ D   +     W   G  R   +    +DA  V +  AS+  L
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPGASSPPL 70

Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L
Sbjct: 71  SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116

Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
             IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + A 
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE++ P +P  F    + +   +   ++   +TR+      A R N A+V AK  +Q  
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +    G+++ + +   +     F+L  F  +T +H+Y N ++ +++ + TLN  R  LV 
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
             YL  G+       N  EPL+  F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321


>gi|225438149|ref|XP_002273202.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
          Length = 420

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 112/395 (28%), Positives = 194/395 (49%), Gaps = 37/395 (9%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           E F+    P G+P SV   YL Y+ +R++Q  +S    VL+TQ+LL+A GL +     A 
Sbjct: 52  ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 110

Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           A++WVLKDG+ ++ K++ SN G   D  PK WR+ AD L +   GLE+L+P  PH F+ +
Sbjct: 111 AVSWVLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFLEM 170

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
                  +  A +   +TR   Y+ FA   N +++ AKGEA   +   +G+  GI LA+ 
Sbjct: 171 AGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLAST 230

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           I SSM   + +  +++ IH+Y  ++  ++  + TLNP R +++ ++++ +G+     ++ 
Sbjct: 231 ICSSMQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPADLR 290

Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
            +E L                            +E    +++G  L  VV   +    L 
Sbjct: 291 YQEDLIFTGRL----------------------IEDAGNVKVGRDLHKVVKPSKLCK-LK 327

Query: 418 SLYEDEGYILTEHGGKFC-VVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCA 476
            ++ +E ++L  HG K+  +VL+  A  +D L+    A+Y+  LE+           D  
Sbjct: 328 GVFPEEKFLLC-HGKKWTDMVLEHDATGEDALRGWLVAAYVANLEKT----------DHE 376

Query: 477 PGGR-LEISLDYVQREFNHVKSDSASVGWVTDGLI 510
           P    L+ + D +   F+   S   + GW TD  +
Sbjct: 377 PSMTVLQEAYDKMNSVFSTFVSKLQAKGWHTDCFL 411


>gi|297744143|emb|CBI37113.3| unnamed protein product [Vitis vinifera]
          Length = 424

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 112/395 (28%), Positives = 194/395 (49%), Gaps = 37/395 (9%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           E F+    P G+P SV   YL Y+ +R++Q  +S    VL+TQ+LL+A GL +     A 
Sbjct: 56  ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 114

Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           A++WVLKDG+ ++ K++ SN G   D  PK WR+ AD L +   GLE+L+P  PH F+ +
Sbjct: 115 AVSWVLKDGMQHVGKLICSNLGARMDSEPKLWRILADALYDLGTGLEVLSPLCPHLFLEM 174

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
                  +  A +   +TR   Y+ FA   N +++ AKGEA   +   +G+  GI LA+ 
Sbjct: 175 AGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVVGMGAGIQLAST 234

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           I SSM   + +  +++ IH+Y  ++  ++  + TLNP R +++ ++++ +G+     ++ 
Sbjct: 235 ICSSMQGKMIAGPLLSVIHVYSVIEEMRAAPVNTLNPQRTAMIVADFIKTGKISSPADLR 294

Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALF 417
            +E L                            +E    +++G  L  VV   +    L 
Sbjct: 295 YQEDLIFTGRL----------------------IEDAGNVKVGRDLHKVVKPSKLCK-LK 331

Query: 418 SLYEDEGYILTEHGGKFC-VVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCA 476
            ++ +E ++L  HG K+  +VL+  A  +D L+    A+Y+  LE+           D  
Sbjct: 332 GVFPEEKFLLC-HGKKWTDMVLEHDATGEDALRGWLVAAYVANLEKT----------DHE 380

Query: 477 PGGR-LEISLDYVQREFNHVKSDSASVGWVTDGLI 510
           P    L+ + D +   F+   S   + GW TD  +
Sbjct: 381 PSMTVLQEAYDKMNSVFSTFVSKLQAKGWHTDCFL 415


>gi|78486556|ref|NP_001030594.1| UPF0420 protein C16orf58 homolog [Rattus norvegicus]
 gi|123781627|sp|Q499P8.1|CP058_RAT RecName: Full=UPF0420 protein C16orf58 homolog
 gi|71682994|gb|AAH99813.1| Similar to cDNA sequence BC017158 [Rattus norvegicus]
          Length = 466

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 87/245 (35%), Positives = 134/245 (54%), Gaps = 2/245 (0%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
           +LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  +G+G   A  +AA   W+
Sbjct: 75  LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
           +KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F    + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              +    +   +TR+      A R N A+V AK  +Q  V    G+++ + +   +   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
           +  +L  F ++T +H+Y N ++ +++ L TLN  R  LV   +L  G+       N  EP
Sbjct: 255 LSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314

Query: 362 LFPAF 366
           L+  F
Sbjct: 315 LWTGF 319


>gi|405974674|gb|EKC39300.1| UPF0420 protein C16orf58-like protein [Crassostrea gigas]
          Length = 444

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 104/355 (29%), Positives = 172/355 (48%), Gaps = 29/355 (8%)

Query: 113 WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KG 171
           W    + F    LP+G+P+SV+SDYL Y +W ++Q  AS I+G LA QALL  +G+G + 
Sbjct: 33  WTSLSQFFTVVFLPQGYPESVSSDYLQYQIWDTIQAFASSITGTLAAQALLKGVGVGDES 92

Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
           A   AA + W+LKDG G + +I+ +   G + D + K WRLFAD+L + A  +E+L P F
Sbjct: 93  ATVMAATLTWILKDGTGMIGRILFAWMQGTNLDCDAKRWRLFADILNDMAIFMEILAPYF 152

Query: 231 PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
             +F  I   AG  +S   +   +TR+      A R N A+V AK  +Q  +     ++ 
Sbjct: 153 KAYFTPIVCTAGVCKSIVGVAGGATRAALTQHQARRNNMADVSAKDGSQETLVNLAALVC 212

Query: 291 GIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
            + L   +        + F + T +H++ N  +  S+ + TLN  R  ++   YL +G+ 
Sbjct: 213 SLTLVPLVTGKDVLIWSLFLLFTLLHLFANYSAVTSVVMETLNQARLHILVHHYLQTGEV 272

Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
             ++EVN +EP+      +K +                     +  + LG+ +  V +  
Sbjct: 273 LSLQEVNYKEPV-----LWKTRR--------------------KLNVHLGTSIGKVFSRA 307

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYLYWLERN 463
                  +LY+D  Y+L  +    CV  +L   +   D L S FQA  + ++  N
Sbjct: 308 SILEESIALYKDSNYLLDINLQTGCVSIILSPLSTTMDHLTSCFQAEVIDYVYDN 362


>gi|327286048|ref|XP_003227743.1| PREDICTED: UPF0420 protein C16orf58-like [Anolis carolinensis]
          Length = 420

 Score =  158 bits (399), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 95/273 (34%), Positives = 151/273 (55%), Gaps = 8/273 (2%)

Query: 97  SASNASLSSLLSVNKLWDE----CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQ 152
           S  +  LS +L  ++L DE     R+ F    LP+G+P+SV+ DYL Y +W +VQ  AS 
Sbjct: 24  SLPDGRLSCVL--DRLEDEGYRSIRDFFTAVFLPQGYPESVSPDYLAYQIWDTVQAFASS 81

Query: 153 ISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWR 210
           I+G LATQA+L  +G+G + +  TAA + W+LKDG G L +I+ + + G   D + K WR
Sbjct: 82  ITGTLATQAVLKGVGVGDETSTVTAATVTWILKDGTGMLGRILFAWSKGSKLDCDAKQWR 141

Query: 211 LFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFA 270
           LFAD+L + A  +E+L PAFP  F  I   +G  +    +   +TR+      A R N A
Sbjct: 142 LFADVLNDVAILMEILAPAFPACFTLIVCISGFFKCIVGVAGGATRAALTMHQARRDNMA 201

Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELR 330
           +V AK  +Q  +    G++  + L   +  ++    + + + T +H+Y N ++ +++ + 
Sbjct: 202 DVSAKDGSQETLVNLAGLLFSLLLIPLVADNLRLTYSLYGLFTILHLYANYRAVRAVCME 261

Query: 331 TLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
           TLN  R  LV   +L SG+ P     N +EPL 
Sbjct: 262 TLNRARLCLVLQSFLKSGEVPGPDVTNPQEPLL 294


>gi|357120354|ref|XP_003561892.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
          Length = 452

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 106/344 (30%), Positives = 168/344 (48%), Gaps = 20/344 (5%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           +PEGFP SVT DY+ + +W ++QG+++ I  +L+TQALL AIG+G K A    A   W L
Sbjct: 72  VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLSAIGVGEKSATVIGATFQWFL 131

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           +D  G L  I+ + + G + D N K WRL AD + +    +++L+P FP   + I     
Sbjct: 132 RDLTGMLGGILFAFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 191

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             RS   +   +TR+     FA  +N A++ AK  +Q  ++   G+ LG+ LA+      
Sbjct: 192 LSRSFTGVASGATRAALTQHFALAKNAADISAKEGSQETLATMSGMGLGMLLAHVTRGHD 251

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
                SF  +T  HMY N K+ QS+ L TLN  R S++   +L  G+    ++V+ +E +
Sbjct: 252 LVVWVSFLSLTIFHMYANYKAVQSLSLVTLNYERTSILLQYFLEDGEVRTPQQVSKQEHI 311

Query: 363 FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRL-QLGSKLSDVVNNKE-DAHALFSLY 420
            P +                SS  +    ++ H L  LG+K S + ++         S Y
Sbjct: 312 LPFW----------------SSWRRLLRTKLPHELVHLGAKASMLTHSDMLLISKTRSYY 355

Query: 421 EDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNA 464
            D  Y L +  G   + + + A   D+LKS      L  L R +
Sbjct: 356 TDANYFLLDKEGSVHIFIHKQAAATDVLKSFVHGLVLARLTRKS 399


>gi|356505015|ref|XP_003521288.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 415

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 85/229 (37%), Positives = 134/229 (58%), Gaps = 1/229 (0%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
           F+    P G+P SV   YL Y+ +R+VQ V S    VL+TQ+LL+A GL +     A A+
Sbjct: 48  FMNKFFPSGYPYSVNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGL-RPTPAQATAV 106

Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
           +W+LKDG+ ++ K++ SN+G   D  PK WRL AD L +   GLE+L+P  PH F+ +  
Sbjct: 107 SWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFLEMAG 166

Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
                +  A +   +TR   Y+ FA   N +++ AKGEA   +   IGI +GI LA+ I 
Sbjct: 167 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTIC 226

Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
           +SM   L +  +++ IH+Y   +  ++  + TLNP R ++V +++L +G
Sbjct: 227 ASMQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAG 275


>gi|281201954|gb|EFA76161.1| DUF647 family protein [Polysphondylium pallidum PN500]
          Length = 430

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 95/304 (31%), Positives = 152/304 (50%), Gaps = 25/304 (8%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           EL     LP G+PDSVTSDY++Y +W + Q ++S +SGVLAT+A+L   G+G  A    +
Sbjct: 45  ELLRSMFLPAGYPDSVTSDYMSYQVWDTAQALSSSLSGVLATRAMLRGYGVGNAAASVGS 104

Query: 178 AI-NWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
           A+  W+L+DG G + +I  +   GR  D N K WR  AD+L +    LEML+P     F+
Sbjct: 105 AVLQWILRDGAGMIGRIYFAWKTGRQLDCNAKTWRFMADILNDIGMTLEMLSPLSNVLFM 164

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
            +       +S   +    T++     FA   N  +V AK  +Q      +G+ LG AL+
Sbjct: 165 PLSVCGIVSKSICGVAGGCTKASLTMHFAKNDNMGDVSAKDGSQETAVGLVGMFLGAALS 224

Query: 296 NHIGSSMPFALASFSVVTWI--------HMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
             I         + +V  W+        H+YCN ++   + L ++N YRA+L++  YL  
Sbjct: 225 ASIPDESE-QTTTATVAIWMLFFILIAFHLYCNYRAVTGVVLTSINRYRATLIYKHYLQY 283

Query: 348 GQAPPVKEV---------NDEEPLFPAFHFFKIKSANKSQLLVLSS----EAKDAAVEIE 394
           G+ P   +V         +D++P++       I + NK Q     S    E+K  ++ I 
Sbjct: 284 GEIPTPIQVSKRENILFRDDKQPIYLGASLLNIINLNKQQQQQQQSKEIRESKTNSI-IR 342

Query: 395 HRLQ 398
           H+L+
Sbjct: 343 HKLE 346


>gi|355756739|gb|EHH60347.1| hypothetical protein EGM_11687 [Macaca fascicularis]
          Length = 468

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 106/325 (32%), Positives = 157/325 (48%), Gaps = 18/325 (5%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNAS--L 103
            C  Q  +  AR   ++ D   +     W   G  R   +     DA  V +   AS  L
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGGDAGEVGAPRAASPPL 70

Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L
Sbjct: 71  SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116

Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
             IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + A 
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE++ P +P  F    + +   +   ++   +TR+      A R N A+V AK  +Q  
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +    G+++ + +   +     F+L  F  +T +H+Y N ++ +S+ + TLN  R  LV 
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVL 296

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
             YL  G+       N  EPL+  F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321


>gi|38568058|emb|CAE05447.3| OSJNBa0073E02.7 [Oryza sativa Japonica Group]
 gi|125549033|gb|EAY94855.1| hypothetical protein OsI_16651 [Oryza sativa Indica Group]
 gi|125590999|gb|EAZ31349.1| hypothetical protein OsJ_15471 [Oryza sativa Japonica Group]
          Length = 380

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 107/407 (26%), Positives = 201/407 (49%), Gaps = 38/407 (9%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
           F+    P G+P SV   YL Y+ +R++Q  +S +  VL+TQ+LL+A GL +     A A+
Sbjct: 5   FLNKFFPSGYPYSVNEGYLTYTKFRALQHFSSAMLHVLSTQSLLFAAGL-RPTPAQATAV 63

Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
           +W+LKDG+ +  K++ S  G   D  PK WR+ AD+L +    LE+++P  P  F+ +  
Sbjct: 64  SWILKDGMQHAGKLICSGMGARMDSEPKSWRILADVLYDFGTALEVISPLCPQLFLEVAG 123

Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
                +  A +   +TR   Y+ FA   N +++ AKGEA   +   +GI  GI LA+ + 
Sbjct: 124 FGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVC 183

Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-APPVKEVND 358
           S+    L +  +++ +H+Y  ++  ++  + TLNP R +++ ++++ SG+ + P +    
Sbjct: 184 STTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAELRYR 243

Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
           E+ LFP                       +  +E    +++G  +  V+ + +    L +
Sbjct: 244 EDLLFP-----------------------NRLIEEAGSVKIGQPVRRVL-SPQRIEQLKA 279

Query: 419 LYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPG 478
            +  E ++L+       +VL++SA  +D L+    A++   +ER+ G+ +  T       
Sbjct: 280 TFSKEKFLLSRKDNSAYMVLEQSATGEDALRGWLVAAFASEMERS-GVGSGDTV------ 332

Query: 479 GRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPGYVEP 525
             L ++ + ++  F    ++  S GW TD  +     NR R  Y  P
Sbjct: 333 --LNVAYERMENVFPMFVAEVKSRGWYTDQFLD---GNRSRIAYANP 374


>gi|312374874|gb|EFR22346.1| hypothetical protein AND_15402 [Anopheles darlingi]
          Length = 417

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 103/346 (29%), Positives = 168/346 (48%), Gaps = 26/346 (7%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT- 175
           +  F    LP G+PDSV+SDYL Y  W +VQ   S ISG L T A+L  +G+G  ++   
Sbjct: 46  KRFFQHLFLPVGYPDSVSSDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDSVTAL 105

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLT-PAFPHH 233
           +A + WVLKDG+G+L +I+ + + G   D++ K WR+ AD+L + A  +++   P +P  
Sbjct: 106 SATVTWVLKDGMGHLGRILFAWWKGTELDIDSKKWRIRADVLNDIAMAIDLFVLPYYPKA 165

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
             +I       ++   +   +TRS      A R N A+V +K  AQ      I   +G+ 
Sbjct: 166 STYILCGTTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLL 225

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           L  ++  +       F+ VT++H+Y N+K+ +++ LRT N  R  +   EY  SG     
Sbjct: 226 LLTYL-QNQRILYGLFAFVTFLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGMMLSP 284

Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
           ++VN            K++     Q + L++  K      E        L++   N  D 
Sbjct: 285 RQVN------------KLERVTVGQTVSLTARVKIGCSVRE--------LTEFYRNCYDL 324

Query: 414 HALFSLYED-EGYILTEHGGKFCVVLKESALPQDMLKSLFQ-ASYL 457
             L + ++  E +++ E      V L  S  P D++KS F  ASYL
Sbjct: 325 ENLIACFDSREKFLIAETRHYVGVYLHFSVKPLDIIKSYFYVASYL 370


>gi|297283910|ref|XP_002802508.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Macaca
           mulatta]
          Length = 429

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 89/244 (36%), Positives = 133/244 (54%), Gaps = 2/244 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  IG+G   A  +AA   W++
Sbjct: 39  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 98

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F    + + 
Sbjct: 99  KDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 158

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   +    
Sbjct: 159 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 218

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
            F+L  F  +T +H+Y N ++ +S+ + TLN  R  LV   YL  G+       N  EPL
Sbjct: 219 GFSLGCFFFLTALHIYANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 278

Query: 363 FPAF 366
           +  F
Sbjct: 279 WTGF 282


>gi|356545865|ref|XP_003541354.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 431

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 108/364 (29%), Positives = 188/364 (51%), Gaps = 32/364 (8%)

Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
           L+    + F+    P G+P SV   YL Y+ +R++Q  AS    VL+TQ+LL+A GL + 
Sbjct: 57  LYHRIVDSFLNKFFPSGYPYSVNEGYLRYTQFRALQHTASAALSVLSTQSLLFAAGL-RP 115

Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
               A A++WVLKDG+ +L  I+ S  G   D  PK WR+ AD+L +   GLE+L+P  P
Sbjct: 116 TPAQATAVSWVLKDGMQHLGNIICSKLGARMDSEPKRWRILADVLYDLGTGLEVLSPLCP 175

Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
             F+ +       +  + +   +TR   Y+ FA   N +++ A+GEA   +   +GI +G
Sbjct: 176 QFFLEMAGIGNLAKGMSIVASRATRLPIYSSFAKEGNLSDLFARGEAISTLFNVVGIGIG 235

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-A 350
           I LA+ + +SM   L    +++ IH+Y   +  ++  + TLNP R +++ +++L +G  +
Sbjct: 236 IQLASTVCASMQGKLVVGPLLSIIHIYSVSEEMRATPVNTLNPQRTAMIVADFLKAGSVS 295

Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAA-VEIEHRLQLGSKLSDVVNN 409
            P      E+ LFP             +L+      +DA  V +   L    K S ++ +
Sbjct: 296 SPADLRYREDLLFP------------RRLI------EDAGNVRVGRDLHKVIKPSRLLES 337

Query: 410 KEDAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYLYWLERNAGIV 467
           K+       ++  E +IL  +G   C+  VL++ A+ +D L+    ASY   + +++  +
Sbjct: 338 KQ-------VFPGEKFIL--NGDNRCIDMVLEQDAIGKDALRGWLVASYAVQIGKSSHEL 388

Query: 468 ATST 471
           +TST
Sbjct: 389 STST 392


>gi|426381987|ref|XP_004057610.1| PREDICTED: UPF0420 protein C16orf58 homolog [Gorilla gorilla
           gorilla]
          Length = 468

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 104/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
            C  Q  +  AR   ++ D   +     W   G  R   +    +DA  V +  A +  L
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPGAPSPPL 70

Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L
Sbjct: 71  SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116

Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
             IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + A 
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE++ P +P  F    + +   +   ++   +TR+      A R N A+V AK  +Q  
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +    G+++ + +   +     F+L  F  +T +H+Y N ++ +++ + TLN  R  LV 
Sbjct: 237 LVNLAGLLVSLLILPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
             YL  G+       N  EPL+  F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321


>gi|255639094|gb|ACU19847.1| unknown [Glycine max]
          Length = 302

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 85/229 (37%), Positives = 134/229 (58%), Gaps = 1/229 (0%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
           F+    P G+P SV   YL Y+ +R+VQ V S    VL+TQ+LL+A GL +     A A+
Sbjct: 48  FMNKFFPSGYPYSVNEGYLRYTQFRAVQHVTSAALSVLSTQSLLFAAGL-RPTPAQATAV 106

Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
           +W+LKDG+ ++ K++ SN+G   D  PK WRL AD L +   GLE+L+P  PH F+ +  
Sbjct: 107 SWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADALYDIGTGLEVLSPLCPHLFLEMAG 166

Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
                +  A +   +TR   Y+ FA   N +++ AKGEA   +   IGI +GI LA+ I 
Sbjct: 167 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTIC 226

Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
           +SM   L +  +++ IH+Y   +  ++  + TLNP R ++V +++L +G
Sbjct: 227 ASMQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAG 275


>gi|383859063|ref|XP_003705017.1| PREDICTED: UPF0420 protein C16orf58-like [Megachile rotundata]
          Length = 464

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 105/373 (28%), Positives = 180/373 (48%), Gaps = 25/373 (6%)

Query: 90  KDAFVVASASNASLSSLLS----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRS 145
           K+ F+V S    S++ LLS        +     +  +  LP+G+PDSV  DY  Y +W +
Sbjct: 15  KERFIVKSKDEQSITELLSDTTRTKPFYSGFVSILKEVFLPQGYPDSVHPDYTAYQIWDT 74

Query: 146 VQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
           VQ  AS I G L T +++  +G+G+  A P AAAI W+LKDG G + +I+ + + G   D
Sbjct: 75  VQAFASTIMGTLTTHSIMQGVGVGEPAATPLAAAITWILKDGTGMIGRIVFAWWNGTDLD 134

Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
              K WRLFAD+L + A GLE+L P +  + + I   + A +S   +   +TR+      
Sbjct: 135 GQCKKWRLFADILNDVAMGLELLLPYYSSYSLAILCVSTAMKSIVGVAGGATRAAVTQHQ 194

Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
           A + N A+V AK  +Q      I  ++GI + + I  +  + +  +  +  +H+Y N  +
Sbjct: 195 ALQNNLADVSAKDGSQETCVNLIASLVGILILS-IFHNGRYIMELYFFLIIVHLYANYSA 253

Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
            +S+ L +LN  R +L+   Y+ +   P  +EVN +E                S LL+  
Sbjct: 254 VKSLCLNSLNEDRLALIVKSYITNEVIPEPEEVNKKE----------------SVLLLTK 297

Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKES 441
                    I+  + L + + + + +  D      L+ D  Y+++         +  K+ 
Sbjct: 298 PTMSIYGFNIKMGVSLAALIKNNIISTSDTELSLKLFLDRKYLISIDVQNKTIFICFKKD 357

Query: 442 ALPQDMLKSLFQA 454
           A P D+L++ F A
Sbjct: 358 AQPCDVLEAYFHA 370


>gi|114662190|ref|XP_001159530.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan troglodytes]
 gi|397472000|ref|XP_003807550.1| PREDICTED: UPF0420 protein C16orf58 homolog [Pan paniscus]
 gi|410212432|gb|JAA03435.1| chromosome 16 open reading frame 58 [Pan troglodytes]
 gi|410264128|gb|JAA20030.1| chromosome 16 open reading frame 58 [Pan troglodytes]
 gi|410292910|gb|JAA25055.1| chromosome 16 open reading frame 58 [Pan troglodytes]
 gi|410340381|gb|JAA39137.1| chromosome 16 open reading frame 58 [Pan troglodytes]
          Length = 468

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 104/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
            C  Q  +  AR   ++ D   +     W   G  R   +    +DA  V +  A +  L
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGAPGAPSPPL 70

Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L
Sbjct: 71  SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116

Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
             IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + A 
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE++ P +P  F    + +   +   ++   +TR+      A R N A+V AK  +Q  
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +    G+++ + +   +     F+L  F  +T +H+Y N ++ +++ + TLN  R  LV 
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
             YL  G+       N  EPL+  F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321


>gi|257900512|ref|NP_073581.2| UPF0420 protein C16orf58 [Homo sapiens]
 gi|143955305|sp|Q96GQ5.2|CP058_HUMAN RecName: Full=UPF0420 protein C16orf58
 gi|62897385|dbj|BAD96633.1| hypothetical protein FLJ13868 variant [Homo sapiens]
 gi|119572504|gb|EAW52119.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
 gi|119572505|gb|EAW52120.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
 gi|119572506|gb|EAW52121.1| chromosome 16 open reading frame 58, isoform CRA_a [Homo sapiens]
          Length = 468

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 104/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
            C  Q  +  AR   ++ D   +     W   G  R   +    +DA  V +  A +  L
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGASGAPSPPL 70

Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L
Sbjct: 71  SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116

Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
             IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + A 
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE++ P +P  F    + +   +   ++   +TR+      A R N A+V AK  +Q  
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +    G+++ + +   +     F+L  F  +T +H+Y N ++ +++ + TLN  R  LV 
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
             YL  G+       N  EPL+  F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321


>gi|10436021|dbj|BAB14730.1| unnamed protein product [Homo sapiens]
          Length = 468

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 104/325 (32%), Positives = 158/325 (48%), Gaps = 18/325 (5%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
            C  Q  +  AR   ++ D   +     W   G  R   +    +DA  V +  A +  L
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGASGAPSPPL 70

Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L
Sbjct: 71  SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116

Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
             IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + A 
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE++ P +P  F    + +   +   ++   +TR+      A R N A+V AK  +Q  
Sbjct: 177 FLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +    G+++ + +   +     F+L  F  +T +H+Y N ++ +++ + TLN  R  LV 
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
             YL  G+       N  EPL+  F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321


>gi|81879541|sp|Q91W34.1|CP058_MOUSE RecName: Full=UPF0420 protein C16orf58 homolog
 gi|16877856|gb|AAH17158.1| CDNA sequence BC017158 [Mus musculus]
          Length = 466

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/245 (35%), Positives = 134/245 (54%), Gaps = 2/245 (0%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
           +LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  +G+G   A  +AA   W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
           +KD  G L +I+L+ + G   D N K WRLFAD+L + A  LE++ P +P  F    + +
Sbjct: 135 VKDSTGMLGRIILAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              +    +   +TR+      A R N A+V AK  +Q  V    G+++ + +   +   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
              +L  F ++T +H+Y N ++ +++ L TLN  R  LV   +L  G+       N  EP
Sbjct: 255 PSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQMEP 314

Query: 362 LFPAF 366
           L+  F
Sbjct: 315 LWTGF 319


>gi|356572258|ref|XP_003554286.1| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max]
          Length = 419

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 82/229 (35%), Positives = 135/229 (58%), Gaps = 1/229 (0%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
           F+    P G+P SV   YL Y+ +R++Q V S    VL+TQ+LL+A GL +     A A+
Sbjct: 52  FMNKFFPSGYPYSVNEGYLRYTQFRALQHVTSAALSVLSTQSLLFAAGL-RPTPAQATAV 110

Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
           +W+LKDG+ ++ K++ SN+G   D  PK WRL AD+L +   GLE+L+P  PH F+ +  
Sbjct: 111 SWILKDGMQHVGKLICSNWGARMDSEPKRWRLLADVLYDIGIGLEVLSPLCPHLFLEMAG 170

Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
                +  A +   +TR   Y+ FA   N +++ AKGEA   +   +GI +GI LA+ I 
Sbjct: 171 LGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVVGIGVGIQLASTIC 230

Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
           +S+   L +  +++ IH+Y   +  ++  + TLNP R ++V +++L +G
Sbjct: 231 ASIQGKLVAGPLLSIIHLYSVSEEMRATPINTLNPRRTAMVVADFLKAG 279


>gi|395842960|ref|XP_003794274.1| PREDICTED: UPF0420 protein C16orf58 homolog [Otolemur garnettii]
          Length = 468

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 108/334 (32%), Positives = 167/334 (50%), Gaps = 24/334 (7%)

Query: 41  SLLYCFCHLQVATAIARTATSSEDDGNKEYDAV----WEVKGSKRTKLIPDFTKDAFVVA 96
           SL+   C  Q  +  AR   ++ D G+ ++DA     W   G+   K  P+   +  V A
Sbjct: 6   SLVAPLCSEQFGSGAARRCHAAAD-GSLQWDARGWRWWGFSGAFTVK--PEGRDEGGVGA 62

Query: 97  -SASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISG 155
             A++  LS L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG
Sbjct: 63  PGAASPPLSGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSG 108

Query: 156 VLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFA 213
            LATQA+L  +G+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFA
Sbjct: 109 SLATQAVLLGLGVGNAKASVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFA 168

Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
           D+L + A  LE++ P +P  F    + +   +   ++   +TR+      A R N A+V 
Sbjct: 169 DILNDVAMFLEIMAPIYPICFTMTVSTSNLAKCIVSVAGGATRAALTMHQARRNNMADVS 228

Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
           AK  +Q  +    G+++ + +   +     F+L  F  +T +H+Y N ++ ++I + TLN
Sbjct: 229 AKDSSQETLVNLAGLLVSLLILPLVSDCPSFSLGCFFFLTALHIYANYRAVRAIVMETLN 288

Query: 334 PYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFH 367
             R  LV   +L  G+       N  EPL+  F 
Sbjct: 289 QGRLRLVLKHFLQRGEVLDPTSANKMEPLWTGFR 322


>gi|449489204|ref|XP_004158245.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 435

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 109/346 (31%), Positives = 175/346 (50%), Gaps = 25/346 (7%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPT 175
           R  F  F +PEGFP SVT DY+ + +W S+QG+++ I  +L+TQALL AIG+G + A   
Sbjct: 49  RRFFDAF-VPEGFPSSVTPDYVPFQIWDSLQGLSTYIRTMLSTQALLSAIGVGEQSATVI 107

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLE-MLTPAFPHH 233
            A   W L+D  G L  I+ + + G + D N K WRL AD + +    ++  ++P FP  
Sbjct: 108 GATFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMYVDGPVSPLFPSA 167

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
           F+ I       RS   +   +TR+     FA + N A++ AK  +Q  V+  IG+ +G+ 
Sbjct: 168 FMLIVCLGSLSRSFTGVASGATRAALTQHFALQNNAADISAKEGSQETVATMIGMAIGML 227

Query: 294 LANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
           LA  I    P A+  SF  +T  HMY N K+   + L +LNP R S+    Y+ + +   
Sbjct: 228 LA-RITMGCPLAIWLSFLSLTMFHMYANYKAVCCLSLTSLNPQRCSIALQHYIQTAEVLS 286

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN--K 410
            ++V+  E + PA+                SS +  +   I  R++LG+++S + +   K
Sbjct: 287 PQQVSMMEHVLPAWA---------------SSWSLRSIKLIHTRVRLGARISSLHHTEIK 331

Query: 411 EDAHALFSLYED--EGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
           E  H   +   D  E Y+L E      V++++ A   D+ +S   A
Sbjct: 332 ELLHLAGANNNDKQEKYLLVERKETISVIVQKDATANDIFQSFVHA 377


>gi|149067630|gb|EDM17182.1| similar to cDNA sequence BC017158, isoform CRA_b [Rattus
           norvegicus]
          Length = 466

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 86/245 (35%), Positives = 133/245 (54%), Gaps = 2/245 (0%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
           +LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  +G+G   A  +AA   W+
Sbjct: 75  LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
           +KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F    + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTMTVSTS 194

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              +    +   +TR+      A R N A+V AK  +Q  V    G+++ + +   +   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
           +  +   F ++T +H+Y N ++ +++ L TLN  R  LV   +L  G+       N  EP
Sbjct: 255 LSLSFGCFILLTALHIYANYRAVRALVLETLNESRLQLVLKHFLQRGEVLEPASANQMEP 314

Query: 362 LFPAF 366
           L+  F
Sbjct: 315 LWTGF 319


>gi|226494744|ref|NP_001146435.1| uncharacterized protein LOC100280018 [Zea mays]
 gi|219887187|gb|ACL53968.1| unknown [Zea mays]
          Length = 439

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 106/400 (26%), Positives = 199/400 (49%), Gaps = 34/400 (8%)

Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
           + D+  + F+    P G+P SV   YL Y+ +R +Q  +S +  VL+TQ+LL+A GL + 
Sbjct: 60  IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL-RP 118

Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
               A A++W+LKDG+ +  K++ S+ G   D  PK W + AD+L +    LE+++P  P
Sbjct: 119 TPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWTILADVLYDLGTALEVVSPLCP 178

Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
             F+ +       +  A +   +TR   Y+ FA   N +++ AKGEA   +   +GI  G
Sbjct: 179 QLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAG 238

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-A 350
           I LA+ + S+    L +  +++ +H+Y  ++  ++  + TLNP R +++ ++++ SG+ +
Sbjct: 239 IGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVS 298

Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
            P +    E+ LFP                       +  +E    +++G  L  V + +
Sbjct: 299 SPAELRYKEDLLFP-----------------------NRVIEEAGSVKIGQPLRKVFSPR 335

Query: 411 EDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATS 470
                L + +  E ++L++   +  +VL+ SA  +D L+    A++   +ER+      +
Sbjct: 336 L-VEQLRATFPGEKFLLSQKSNRTYMVLERSASGEDALRGWLVAAFASEMERSG-----A 389

Query: 471 TSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
            S D A    L  + + ++R F    S+  S GW TD  +
Sbjct: 390 GSRDDAV---LSEAYEKMERVFPTFVSEVRSRGWYTDQFL 426


>gi|168027023|ref|XP_001766030.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682673|gb|EDQ69089.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 397

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 81/244 (33%), Positives = 139/244 (56%), Gaps = 1/244 (0%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
           F+    P G+P+SV   YL+YS +R+VQ  AS +  VL+TQ+LL+A GL +     A  +
Sbjct: 26  FLSRFFPLGYPNSVAEGYLDYSKYRAVQHFASAVMHVLSTQSLLFAAGL-RPTPAQATIV 84

Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
           +WVLKDG+ ++ K++ S  G   D  PK WR+FAD + +   GLE+++P  P HF+ +  
Sbjct: 85  SWVLKDGMQHVGKLLCSRMGARMDSEPKRWRIFADFMYDVGAGLEVISPLCPQHFLPVAG 144

Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
            A   +  A +   +TR   Y+ F    N +++ AKGEA   +S  +G+ +GI LA++  
Sbjct: 145 MANLAKGMALVSARATRLPVYSSFVKEGNLSDLYAKGEAISTLSNVLGLGVGIHLASNFA 204

Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
           S++   L    +++ IH+YC  +  ++  + TLN  R +++ ++++ +G      E+   
Sbjct: 205 STIQGKLIWAPILSAIHLYCVQQEMRAAPINTLNSQRTAMLVADFVKTGSVSSPAELRYR 264

Query: 360 EPLF 363
           E L 
Sbjct: 265 ERLI 268


>gi|197101105|ref|NP_001126133.1| UPF0420 protein C16orf58 homolog [Pongo abelii]
 gi|75041556|sp|Q5R8F6.1|CP058_PONAB RecName: Full=UPF0420 protein C16orf58 homolog
 gi|55730464|emb|CAH91954.1| hypothetical protein [Pongo abelii]
          Length = 468

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 88/244 (36%), Positives = 134/244 (54%), Gaps = 2/244 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  IG+G   A  +AA   W++
Sbjct: 78  LPQGFPDSVSPDYLPYQLWDSVQAFASGLSGSLATQAVLLGIGVGNAKATVSAATATWLV 137

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F    + + 
Sbjct: 138 KDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 197

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +   ++   +TR+      A R N A+V AK  +Q  +   +G+++ + +   +    
Sbjct: 198 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLVGLLVSLLMLPLVSGCP 257

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
            F+L  F  +T +H+Y N ++ +++ + TLN  R  LV   YL  G+       N  EPL
Sbjct: 258 GFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLNPTAANRMEPL 317

Query: 363 FPAF 366
           +  F
Sbjct: 318 WTGF 321


>gi|340711243|ref|XP_003394188.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus terrestris]
          Length = 464

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 178/373 (47%), Gaps = 25/373 (6%)

Query: 90  KDAFVVASASNASLSSLLS----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRS 145
           KD F V S    S++ LLS        +     +  +  LP+G+PDSV  DY +Y +W +
Sbjct: 15  KDRFFVKSKDEQSITELLSDTTRTKSFYSGFISIVKEVFLPQGYPDSVHPDYTSYQIWDT 74

Query: 146 VQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
           VQ  AS I G L T +++  +G+G+  A P AAAI W+LKDG G + +I+ + + G   D
Sbjct: 75  VQAFASTIMGTLTTHSIMQGVGVGEASATPLAAAITWILKDGTGMIGRIVFAWWNGTDLD 134

Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
              K WRLFAD+L + A GLE+  P F  + + I   + A +S   +   +TR+      
Sbjct: 135 GQCKKWRLFADILNDLAMGLELFLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTQHQ 194

Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
           A + N A+V AK  +Q      I   +GI + + I  +  + +  +  +  IH+Y N  +
Sbjct: 195 ALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHNGRYIMELYLFLVMIHLYANYMA 253

Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
            +++ L +LN  R +L+   Y+ +   P  +EVN +E                S LL+  
Sbjct: 254 VKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKE----------------SVLLLTK 297

Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKES 441
                    I+  + L + + + +    D      L+ D  Y+++         + LK+ 
Sbjct: 298 PTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDRKYLISIDVQNKTIFICLKKD 357

Query: 442 ALPQDMLKSLFQA 454
           A P D+L++ F A
Sbjct: 358 AQPYDVLEAYFHA 370


>gi|296233362|ref|XP_002761979.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1 [Callithrix
           jacchus]
          Length = 468

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 110/337 (32%), Positives = 166/337 (49%), Gaps = 31/337 (9%)

Query: 39  VPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFT-----KDAF 93
           +P+LL   C  Q  +  AR   ++ D   +     WEV+G +   L   FT     +DA 
Sbjct: 7   LPALL---CSEQFGSGEARGCRAAADGSLQ-----WEVRGWRWWGLSRAFTVKPEGRDAG 58

Query: 94  VVASASNAS--LSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVAS 151
            V +   AS  LS L +V               LP+GFPDSV+ DYL Y LW SVQ  AS
Sbjct: 59  EVGAPGAASPPLSGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFAS 104

Query: 152 QISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGW 209
            +SG LAT A+L  IG+G   A  +AA   W++KD  G L +I+ + + G   D N K W
Sbjct: 105 SLSGSLATHAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQW 164

Query: 210 RLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNF 269
           RLFAD+L + A  LE++ P +P  F    + +   +   ++   +TR+      A R N 
Sbjct: 165 RLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNM 224

Query: 270 AEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIEL 329
           A+V AK  +Q  +    G+++ + +   +     F+   F  +T +H+Y N ++ +++ +
Sbjct: 225 ADVSAKDSSQETLVNLAGLLVSLLMLPLVSGFPGFSFGCFFFLTALHIYANYRAVRALVM 284

Query: 330 RTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
            TLN  R  LV   YL  G+       N  EPL+  F
Sbjct: 285 ETLNEGRLRLVLKHYLQRGEVLDPTTANRMEPLWTGF 321


>gi|417401458|gb|JAA47615.1| Hypothetical protein [Desmodus rotundus]
          Length = 468

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/249 (34%), Positives = 134/249 (53%), Gaps = 2/249 (0%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
            +   LP+GFPDSV+ DYL+Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA 
Sbjct: 73  LLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVSAAT 132

Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
             W++KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F   
Sbjct: 133 ATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCFTVT 192

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
              +   +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   
Sbjct: 193 ICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           +     F+L  F ++T +H+Y N ++ Q++ + TLN  R  LV   +L  G+       N
Sbjct: 253 VSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTSAN 312

Query: 358 DEEPLFPAF 366
             EPL+  F
Sbjct: 313 QMEPLWTGF 321


>gi|452823170|gb|EME30183.1| glutamate N-acetyltransferase [Galdieria sulphuraria]
          Length = 519

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 174/370 (47%), Gaps = 27/370 (7%)

Query: 91  DAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVA 150
           D F+  S  +  ++  +    +W +  +L     LPE    +V+ DY  ++ +  +Q V 
Sbjct: 137 DYFLNPSNGDLLIAPSIDSRTVWRKILDLLNSSFLPENCKKTVSKDYFWFTTFNFLQNVV 196

Query: 151 SQISGVLATQALLYAIGL-GKGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKG 208
           +   GV++TQ LL A+GL     I  +A++NWVLKDGIG L  I+  +F G  FD +PK 
Sbjct: 197 NCCIGVISTQQLLRAVGLSASQGIGKSASLNWVLKDGIGRLGAIVFGSFIGNRFDADPKR 256

Query: 209 WRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRN 268
           WRL+ D+L     G E+++P  P +F+ + + A   ++ + +++  T +     FA R N
Sbjct: 257 WRLWGDILYAFGIGTEIISPLLPRYFLLVASLANMVKATSYMLRLPTTAAIRRSFAKREN 316

Query: 269 FAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIE 328
           F ++ AK  +Q ++S   G  LGI  +  +G S      ++ V   +    N    + + 
Sbjct: 317 FGDISAKANSQEVLSNLFGTFLGIGFSIILGHSWWQLSVAYFVYFIMFFLLNYLGVKGLH 376

Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVN-DEEPLFPAFHFFKIKSANKSQLLVLSSEAK 387
           LRTLN  R+ L+   Y  S Q   V+EVN  E  + P+   F                  
Sbjct: 377 LRTLNLQRSLLLGWSYWKSSQVSTVEEVNYRENVILPSIMSFG----------------- 419

Query: 388 DAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDM 447
                    ++ G  L D+     +  AL   Y    Y+LT   G+  +V    A  +D 
Sbjct: 420 -------RYIRFGVSLRDIHLTGSELAALLDKYRQAKYLLTVRHGQVYIVFHMEARVKDE 472

Query: 448 LKSLFQASYL 457
           L+++ QA++L
Sbjct: 473 LQAILQATFL 482


>gi|74201602|dbj|BAE28428.1| unnamed protein product [Mus musculus]
          Length = 466

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 87/245 (35%), Positives = 133/245 (54%), Gaps = 2/245 (0%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
           +LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  +G+G   A  +AA   W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
           +KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F    + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              +    +   +TR+      A R N A+V AK  +Q  V    G+++ + +   +   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
              +L  F ++T +H+Y N ++ +++ L TLN  R  LV   +L  G+       N  EP
Sbjct: 255 PSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQMEP 314

Query: 362 LFPAF 366
           L+  F
Sbjct: 315 LWTGF 319


>gi|322787029|gb|EFZ13253.1| hypothetical protein SINV_10096 [Solenopsis invicta]
          Length = 468

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 111/375 (29%), Positives = 183/375 (48%), Gaps = 33/375 (8%)

Query: 89  TKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQG 148
           +KD F  AS +  SL+  L++         +F Q  LP+G+PDSV  DY  Y +W +VQ 
Sbjct: 20  SKDGFS-ASETKLSLTESLAIR---SRITSVFKQIFLPQGYPDSVHPDYTAYQIWDTVQA 75

Query: 149 VASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNP 206
            AS I+G L T +++  IG+G+  A P AAAI W+LK G G +  IM + + G   D   
Sbjct: 76  FASTITGTLTTHSIMKGIGVGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTKLDGQC 135

Query: 207 KGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAAR 266
           K WRLFAD+L++ A G+E+L P F  +   I   +   +S   +   +TR+      A +
Sbjct: 136 KKWRLFADILDDIAKGIELLVPFFSSYSTVILCISTTMKSIVGVAGGATRTALINHQAIQ 195

Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALAN--HIGSSMPFALASFSVVTWIHMYCNLKSY 324
            N A+V AK  +Q      I    GI + +  H G    + L  +  +  +H+Y N  + 
Sbjct: 196 NNVADVSAKDGSQETCVNLIASFFGIFILSFFHDGR---YLLELYLFLVAVHLYANYSAV 252

Query: 325 QSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSS 384
           +++ L TLN  R +++   Y+++ Q P   +VN EE +F                 +L++
Sbjct: 253 KALCLDTLNEDRLAIIVKNYMINEQIPKPGKVNKEESIF-----------------LLTN 295

Query: 385 EAKDA-AVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT---EHGGKFCVVLKE 440
             ++     I+  +   S L   + +  +   L   +E+  YI+T   E    F ++LK+
Sbjct: 296 PGRNVHGFNIKIGVSFTSILKRNIISLTEMEFLLKFFENRKYIITIDIEKKNIF-IILKK 354

Query: 441 SALPQDMLKSLFQAS 455
                ++L+  F AS
Sbjct: 355 DIQSIEILEGYFYAS 369


>gi|225579041|ref|NP_663565.2| UPF0420 protein C16orf58 homolog [Mus musculus]
 gi|74204001|dbj|BAE29003.1| unnamed protein product [Mus musculus]
 gi|148685697|gb|EDL17644.1| cDNA sequence BC017158, isoform CRA_b [Mus musculus]
          Length = 466

 Score =  155 bits (392), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 87/245 (35%), Positives = 133/245 (54%), Gaps = 2/245 (0%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
           +LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  +G+G   A  +AA   W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
           +KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F    + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              +    +   +TR+      A R N A+V AK  +Q  V    G+++ + +   +   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
              +L  F ++T +H+Y N ++ +++ L TLN  R  LV   +L  G+       N  EP
Sbjct: 255 PSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQMEP 314

Query: 362 LFPAF 366
           L+  F
Sbjct: 315 LWTGF 319


>gi|14424560|gb|AAH09308.1| C16orf58 protein [Homo sapiens]
          Length = 468

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 104/325 (32%), Positives = 157/325 (48%), Gaps = 18/325 (5%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVAS--ASNASL 103
            C  Q  +  AR   ++ D   +     W   G  R   +    +DA  V +  A +  L
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQWEVGGWRWWGLSRAFTVKPEGRDAGEVGASGAPSPPL 70

Query: 104 SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L
Sbjct: 71  SGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVL 116

Query: 164 YAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAF 221
             IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + A 
Sbjct: 117 LGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAM 176

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            LE++ P  P  F    + +   +   ++   +TR+      A R N A+V AK  +Q  
Sbjct: 177 FLEIMAPVCPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQET 236

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +    G+++ + +   +     F+L  F  +T +H+Y N ++ +++ + TLN  R  LV 
Sbjct: 237 LVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVL 296

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAF 366
             YL  G+       N  EPL+  F
Sbjct: 297 KHYLQRGEVLDPTAANRMEPLWTGF 321


>gi|308807048|ref|XP_003080835.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116059296|emb|CAL55003.1| putative glutamate/ornithine acetyltransferase (ISS) [Ostreococcus
           tauri]
          Length = 1023

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 126/429 (29%), Positives = 199/429 (46%), Gaps = 51/429 (11%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
            C E     +LPE  P SVTSDY  Y+ +R  Q +AS    V ATQ +L A+GLG    +
Sbjct: 574 RCVERASHALLPE--PTSVTSDYWEYARYRFAQRIASSCISVFATQQMLTAVGLGASRTL 631

Query: 174 PTAAAINWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
           P AAA+NWVLKDG+G L K+ + +NFGR FD + K +R  + ++ +A+  +E++TP FP 
Sbjct: 632 PAAAAVNWVLKDGLGRLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYFPQ 691

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
           HF+ +   A  G+S         R+     F    N AEV AK  AQ +++  IG+ L +
Sbjct: 692 HFLPLATVANIGKSVGITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLALAV 751

Query: 293 ALAN----------HIGSSMP-FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
             A            I  ++P FA    +V+  I +Y  LK+ Q   LRT+N  RA ++ 
Sbjct: 752 GAAKGMSKVATVRPEIRRALPLFAFGPLAVLDLICIYKELKAVQ---LRTINKERAEIIA 808

Query: 342 SEYLLSGQAPPVKEV-NDEEPLFPA-----------FHFFKIKSANKSQLLVLSSEAKDA 389
             Y+   + P    V N E  L PA               ++    KS +  LS + + A
Sbjct: 809 EAYVKEREIPTRARVANAERLLIPARLDKSNLPLKVTSIGEVCPTPKSLIRALSGKGRRA 868

Query: 390 AV-----------EIEHRLQLGSKL---SDVVNNKEDAHALFSLYEDEGYILTEHGGKFC 435
           +              +  L  G ++      VN K+   A F      G +  +  G+  
Sbjct: 869 STPPYILAYEPGASKKRVLVDGPRIGAPKTPVNGKKKMSAKFQFPWQRGRVGLK--GRAL 926

Query: 436 VVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHV 495
           + L E A  +D+L+++ Q +++  L     +     +AD A    L+ S    +R+ +  
Sbjct: 927 LALSEDATSKDVLRAVLQVAHMRALPYRPDL-----TADEAYVWALQESESLAKRDIDDF 981

Query: 496 KSDSASVGW 504
            ++ +  GW
Sbjct: 982 TANLSKKGW 990


>gi|357612137|gb|EHJ67828.1| hypothetical protein KGM_02553 [Danaus plexippus]
          Length = 525

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 110/375 (29%), Positives = 179/375 (47%), Gaps = 29/375 (7%)

Query: 90  KDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGV 149
           K+ + V +A    L  +L VN+   +   LF +  LP+G+P+SV+ DY+ Y +W + Q  
Sbjct: 19  KERYYVKAADQ--LPIVLVVNEKSRDVAGLFAKIFLPQGYPNSVSKDYIFYQIWDTAQAF 76

Query: 150 ASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPK 207
            S I+G+LATQ +   +G+G   A P AA + WV KDG G++ KI+ +   G + D   K
Sbjct: 77  CSTITGILATQEVFRGVGVGDTNASPLAATVTWVFKDGCGHIGKILFAYTHGTYLDAYSK 136

Query: 208 GWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARR 267
            WRL+AD L +AA  +E+  P F ++  F    +   ++   +   +TR       A R 
Sbjct: 137 KWRLYADTLNDAAMCIEIALPLFKNYITFALCVSTCMKAIVGVAGGATRVAMTQHHALRG 196

Query: 268 NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSI 327
           N A+V AK  AQ      I     + L + IG+S+      F ++  +H+  N  + +++
Sbjct: 197 NLADVSAKDSAQETAVNLIASFAALFLISLIGNSVTI----FIILLIMHIVFNYMAVRAV 252

Query: 328 ELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAK 387
            LRTLN  R   V   YL         E+N  EP+     F+++   N   L +      
Sbjct: 253 CLRTLNEPRFLQVIDTYLRKEVIANPCEINRNEPII----FYQL-GPNLLDLKICG---- 303

Query: 388 DAAVEIEHRLQLGSKLSDVVNNKEDA---HALFSLYEDEGYILTEHGGK--FCVVLKESA 442
                    + +G  +S ++N + +A   + +  +Y D+ YI+    G     V  KE A
Sbjct: 304 -------FHIIIGDSISKILNPRTNAVYINKVKDIYNDKKYIIHPDTGNRVMYVFPKEDA 356

Query: 443 LPQDMLKSLFQASYL 457
              DML + FQ+  L
Sbjct: 357 SVDDMLCAYFQSVLL 371


>gi|417401377|gb|JAA47577.1| Hypothetical protein [Desmodus rotundus]
          Length = 464

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 87/249 (34%), Positives = 134/249 (53%), Gaps = 2/249 (0%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
            +   LP+GFPDSV+ DYL+Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA 
Sbjct: 73  LLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVSAAT 132

Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
             W++KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F   
Sbjct: 133 ATWIVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMCFTVT 192

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
              +   +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   
Sbjct: 193 ICISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           +     F+L  F ++T +H+Y N ++ Q++ + TLN  R  LV   +L  G+       N
Sbjct: 253 VSDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTSAN 312

Query: 358 DEEPLFPAF 366
             EPL+  F
Sbjct: 313 QMEPLWTGF 321


>gi|403276814|ref|XP_003930078.1| PREDICTED: UPF0420 protein C16orf58 homolog [Saimiri boliviensis
           boliviensis]
          Length = 468

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 108/334 (32%), Positives = 162/334 (48%), Gaps = 28/334 (8%)

Query: 42  LLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFT-----KDAFVVA 96
           L    C  Q  +  AR    + D   +     WEV+G +   L   FT     +DA  V 
Sbjct: 7   LQTALCSEQFGSGEARGCRVAADGSLQ-----WEVRGWRWWGLSRAFTVKPEGRDAGEVG 61

Query: 97  SASNAS--LSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQIS 154
           +   AS  +S L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +S
Sbjct: 62  APGAASPPVSGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLS 107

Query: 155 GVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLF 212
           G LATQA+L  IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLF
Sbjct: 108 GSLATQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFAWWKGSKLDCNAKQWRLF 167

Query: 213 ADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV 272
           AD+L + A  LE++ P +P  F    + +   +   ++   +TR+      A R N A+V
Sbjct: 168 ADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAALTVHQARRNNMADV 227

Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
            AK  +Q  +    G+++ + +   +     F+   F  +T +H+Y N ++ +++ + TL
Sbjct: 228 SAKDSSQETLVNLAGLLVSLLMLPLVSGFPGFSFGCFFFLTALHIYANYRAVRALVMETL 287

Query: 333 NPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
           N  R  LV   YL  G+       N  EPL+  F
Sbjct: 288 NEGRLRLVLKHYLQRGEVLDPTTANRMEPLWTGF 321


>gi|297826561|ref|XP_002881163.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327002|gb|EFH57422.1| hypothetical protein ARALYDRAFT_482047 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 130/228 (57%), Gaps = 1/228 (0%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           E F+    P G+P SV   YL Y+ +R++Q  +S    VL+TQ+LL+A GL +     A 
Sbjct: 64  ESFLNKFFPSGYPYSVNEGYLRYTQFRALQHFSSAALSVLSTQSLLFAAGL-RPTPAQAT 122

Query: 178 AINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
            ++W+LKDG+ ++ K++ SN G   D  PK WR+ AD+L +   GLE+++P  PH F+ +
Sbjct: 123 VVSWILKDGMQHVGKLICSNLGARMDSEPKRWRILADVLYDLGTGLELVSPLCPHLFLEM 182

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
                  +  A +   +TR   Y+ FA   N +++ AKGEA   +    GI  GI LA+ 
Sbjct: 183 AGLGNFAKGMATVAARATRLPIYSSFAKEGNLSDIFAKGEAISTLFNVAGIGAGIQLAST 242

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           I SSM   L   S+++ +H+Y  ++  + + + TLNP R +L+ + YL
Sbjct: 243 ICSSMEGKLVVGSILSVVHVYSVVEQMRGVPINTLNPQRTALIVANYL 290


>gi|354505561|ref|XP_003514836.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cricetulus griseus]
 gi|344256981|gb|EGW13085.1| UPF0420 protein C16orf58-like [Cricetulus griseus]
          Length = 466

 Score =  154 bits (390), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 87/245 (35%), Positives = 134/245 (54%), Gaps = 2/245 (0%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
           +LP+GFPDSV++DYL Y LW SVQ  AS +SG LATQA+L  +G+G   A  +AA   W+
Sbjct: 75  LLPQGFPDSVSTDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGDAKASVSAATSTWL 134

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
           LKD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F    + +
Sbjct: 135 LKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADVLNDIAMFLEIMAPMYPIFFTMTVSIS 194

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              +    +   +TR+      A R N A+V AK  +Q  V    G+++ + +   +   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
              +L  F ++T +H+Y N ++ +++ L TLN  R  L+   +L  G+       N  EP
Sbjct: 255 PSLSLGCFILLTALHIYANYRAVRALVLETLNESRLQLLLKHFLQRGEVLAPASANQMEP 314

Query: 362 LFPAF 366
           L+  F
Sbjct: 315 LWTGF 319


>gi|348585092|ref|XP_003478306.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cavia porcellus]
          Length = 468

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 99/331 (29%), Positives = 160/331 (48%), Gaps = 30/331 (9%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAIN-WV 182
           +LP+GFPDSV+ DYL Y LW SVQ  AS +S  LAT A++  +G+G      AAA   W+
Sbjct: 77  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSSALATHAVMLGLGVGNAEASLAAATTTWL 136

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
           ++D  G L +I+ + + G   D N K WRLFAD+L +AA  LE+  P  P  F    + +
Sbjct: 137 VRDSTGMLGRIVFAWWKGSKLDCNVKQWRLFADILNDAAMFLEIAAPTLPFCFTVTVSTS 196

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              +   ++   +TR+      A R N A+V AK  +Q  +    G++  + +   +   
Sbjct: 197 NLAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLFSLVMLPLVSGC 256

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
             F+L  F ++T +H+Y N ++ +++ + TLN  R  LV   +L  G+       N  EP
Sbjct: 257 PSFSLGCFFLLTGLHIYANYRAVRALVMETLNESRLQLVLKHFLQRGEILDTTSANQMEP 316

Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYE 421
           L+  F                              L LG  L D++++  +   L   ++
Sbjct: 317 LWTGFW-------------------------PSLSLSLGVPLHDLISSVSELQQLVEGHQ 351

Query: 422 DEGYILT--EHGGKFCVVLKESALPQDMLKS 450
            E Y+L+  +   K  VVL   A P+ +L++
Sbjct: 352 -EPYLLSWNQSRNKVQVVLSHVAGPEAVLRA 381


>gi|345801540|ref|XP_547054.3| PREDICTED: UPF0420 protein C16orf58 homolog [Canis lupus
           familiaris]
          Length = 468

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/251 (35%), Positives = 136/251 (54%), Gaps = 3/251 (1%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTA 176
            L V F LP+GFPDSV+ DYL Y LW SVQ  AS +SG LAT A+L  IG+G   A  +A
Sbjct: 72  RLLVVF-LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAEASVSA 130

Query: 177 AAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
           A   W++KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F 
Sbjct: 131 ATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFT 190

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
                +   +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + + 
Sbjct: 191 MTVCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLML 250

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
             + +   F+L+ F  +T +H+Y N ++ +++ L TLN  R  LV   +L  G+      
Sbjct: 251 PLVSACPSFSLSCFFFLTALHIYANYRAVRALVLETLNEDRLRLVLKHFLQRGEVLDPTS 310

Query: 356 VNDEEPLFPAF 366
            N  EPL+  F
Sbjct: 311 ANQMEPLWTGF 321


>gi|194388018|dbj|BAG65393.1| unnamed protein product [Homo sapiens]
          Length = 422

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 87/244 (35%), Positives = 132/244 (54%), Gaps = 2/244 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  IG+G   A  +AA   W++
Sbjct: 32  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 91

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KD    L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F    + + 
Sbjct: 92  KDSTDMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 151

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   +    
Sbjct: 152 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 211

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
            F+L  F  +T +H+Y N ++ +++ + TLN  R  LV   YL  G+       N  EPL
Sbjct: 212 GFSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPL 271

Query: 363 FPAF 366
           +  F
Sbjct: 272 WTGF 275


>gi|431902902|gb|ELK09111.1| hypothetical protein PAL_GLEAN10000828 [Pteropus alecto]
          Length = 468

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/249 (35%), Positives = 135/249 (54%), Gaps = 2/249 (0%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
            +   LP+GFPDSV+ DYL Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA 
Sbjct: 73  LLTVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNANASVSAAT 132

Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
             W++KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F  I
Sbjct: 133 ATWLVKDSAGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPICFTMI 192

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
              +   +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   
Sbjct: 193 VCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           +     F+L  F ++T +H+Y N ++ +++ + TLN  R  LV   +L SG+       N
Sbjct: 253 VSECPSFSLGCFFLLTALHIYANYRAVRALVIETLNEGRLWLVLKHFLQSGEVLDPTSAN 312

Query: 358 DEEPLFPAF 366
             EPL+  F
Sbjct: 313 KMEPLWTGF 321


>gi|255542032|ref|XP_002512080.1| conserved hypothetical protein [Ricinus communis]
 gi|223549260|gb|EEF50749.1| conserved hypothetical protein [Ricinus communis]
          Length = 380

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 102/335 (30%), Positives = 176/335 (52%), Gaps = 26/335 (7%)

Query: 127 EGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKD 185
           +GFP SVT DY+ + +W  +QG+++ I  +L+TQALL AIG+G K A    A   W L+D
Sbjct: 2   QGFPSSVTPDYVPFQVWDLLQGLSTYIRTMLSTQALLSAIGVGEKSATVIGATFQWFLRD 61

Query: 186 GIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAG 244
             G L  I+ + + G   D N K WRL ADL+ +    +++++P FP  FV +       
Sbjct: 62  LTGMLGGILFTFYQGSDLDSNAKMWRLVADLMNDLGMLMDLVSPLFPSAFVVVVCLGSLS 121

Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPF 304
           RS   +   +TR+     FA + N A++ AK  +Q  V+   G+ LG+ LA  I    P 
Sbjct: 122 RSFTGVASGATRAALTQHFALQSNAADISAKEGSQETVATMTGMALGMLLAR-ITIGHPL 180

Query: 305 AL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
           A+  SF  +T  HMY N ++ + + L +LN  R+S++   ++ +GQ    ++V+  E + 
Sbjct: 181 AIWFSFLSLTMFHMYANYRAVRCLVLTSLNMQRSSILLQHFMETGQVLSPEQVSGMEHIL 240

Query: 364 PAF-HFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLGSKLSDVVNNKEDAHALFSL-- 419
           P + +F++ K                  V++ H R+ LG ++S  ++++E    L S   
Sbjct: 241 PRWTNFWRTKH-----------------VKLLHSRMCLGVRVSS-LDHQELMELLHSAGS 282

Query: 420 YEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 454
           ++   Y+L E  G   +++ +++   D+L+S   A
Sbjct: 283 HKKAKYLLMERKGTINILIHKNSTTSDVLQSFIHA 317


>gi|157113953|ref|XP_001652155.1| hypothetical protein AaeL_AAEL006646 [Aedes aegypti]
 gi|108877519|gb|EAT41744.1| AAEL006646-PA [Aedes aegypti]
          Length = 417

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 105/343 (30%), Positives = 163/343 (47%), Gaps = 24/343 (6%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
            F    LP G+PDSV+ DYL Y  W +VQ   S ISG L T A+L  +G+G   A P +A
Sbjct: 48  FFQNLFLPAGYPDSVSGDYLAYQKWDTVQAFCSTISGTLTTHAILKGVGVGSDVANPLSA 107

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLT-PAFPHHFV 235
            I WVLKDG+G+  +I+ + + G   D++ K WR+ AD+L + A  +++   P +P    
Sbjct: 108 TITWVLKDGMGHFGRIIFAWWKGTELDIDSKKWRIRADVLNDLAMSIDLFVLPYYPKAST 167

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
           FI  A    ++   +   +TRS      A R N A+V +K  AQ      I   +G+ L 
Sbjct: 168 FILCATTTMKAIVGVAGGATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLFLL 227

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
            H+          F  VT +H+Y N+K+ +++ LRT N  R  +   EY  +G     ++
Sbjct: 228 THVQDQRVL-YGLFLFVTLMHIYANIKAVKAVCLRTFNEARYLIALEEYFKTGTMLSPEQ 286

Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
           VN            K++     Q + L++  K      E      ++   V  + E+  A
Sbjct: 287 VN------------KLERVTIGQTVTLTARVKIGCSAREL-----AQYYRVCYDLENLMA 329

Query: 416 LFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQ-ASYL 457
            F     + +++ E      V L  +A P D++KS F  ASYL
Sbjct: 330 CFD--SRDKFLIAETRNYVGVYLHFTAKPLDIIKSYFYVASYL 370


>gi|334332835|ref|XP_001370394.2| PREDICTED: UPF0420 protein C16orf58 homolog [Monodelphis domestica]
          Length = 442

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 90/253 (35%), Positives = 133/253 (52%), Gaps = 2/253 (0%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIP-T 175
           R       LP GFPDSV+ DYL Y LW +VQ  AS ++G LATQA+L  +G+G  A   +
Sbjct: 45  RAALTAVFLPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVLQGVGVGDAAASVS 104

Query: 176 AAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           AA I W+LKDG G L +I  +   G   D   K WRLFAD+L +AA  LE++ P FP  F
Sbjct: 105 AATITWLLKDGTGMLGRIGFAWLMGSRLDCEAKKWRLFADVLNDAAMFLEIVAPGFPSCF 164

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
             I   +   +    +   +TR+      A R N A+V AK  +Q  +    G+++ + L
Sbjct: 165 TLIVCVSSLAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLVSLLL 224

Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
              + +    +LA+F + T +H+Y N ++ +++ + T N  R  LV   +L  G+     
Sbjct: 225 LPLVSNRPSLSLAAFFLFTILHLYANFRAVRAVIMETFNEARLYLVLKHFLQWGKVLTPP 284

Query: 355 EVNDEEPLFPAFH 367
             N  EPL+   H
Sbjct: 285 SANRMEPLWTGCH 297


>gi|302835848|ref|XP_002949485.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
           nagariensis]
 gi|300265312|gb|EFJ49504.1| hypothetical protein VOLCADRAFT_80645 [Volvox carteri f.
           nagariensis]
          Length = 495

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 118/353 (33%), Positives = 172/353 (48%), Gaps = 32/353 (9%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGA 172
           D  R L   ++LP GFPDSV   Y  Y  WR VQ        V  TQ+LL A+G+ G+  
Sbjct: 103 DHFRALIHAYLLPNGFPDSVGPQYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRFQ 162

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
              AAAINWV+KDG G L +++ + +GR  D   K +RL  DLL  A   LE+ T   P 
Sbjct: 163 GEAAAAINWVIKDGAGRLGRLLFARWGRELDCELKQFRLAGDLLMEAGAALELATVYAPP 222

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +   A   ++ AA+  +STR+  Y  FA + N A++ AKGE+   ++  +G + GI
Sbjct: 223 AFLPLACTANLSKNLAAVAASSTRAPIYRTFALQNNLADITAKGESVANLADILGTVAGI 282

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
           AL+      MP    +F V++  ++  + K   S+EL  +N  R +     YL  G  P 
Sbjct: 283 ALSRMKLPRMP----TFCVLSAGYLLSSRKEVDSVELPYMNRARLAYATQRYLSDGFIPG 338

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
           V E N  EPL P          N+++L++       A+VE           S      E 
Sbjct: 339 VAEANHNEPLLP------WGRHNQNRLVL------GASVE-----------SACAGPSEL 375

Query: 413 AHALFSLYEDE-GYILT--EHGGKFCVVLKESALPQDMLK-SLFQASYLYWLE 461
           AHA+    E E  Y++T      K  V+L+E A   D L+ S F   +L+ L+
Sbjct: 376 AHAVQRFPEPEYRYLVTYRPDTKKAYVLLREGATSMDCLQASFFGHVFLHLLD 428


>gi|413956564|gb|AFW89213.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
          Length = 485

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 123/428 (28%), Positives = 203/428 (47%), Gaps = 48/428 (11%)

Query: 59  ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
           A+  E +    +  V E  GS  + L    ++ A + ASAS+ +     S    W+    
Sbjct: 3   ASHPEGEATAGWLTVEEWSGSSASAL----SRTAVLTASASSLTAHRFGS---RWERIGS 55

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
             +   +PEGFP SVT DY+ + +W ++QG+++ I  +L+TQALL AIG+G + A    A
Sbjct: 56  RMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGA 115

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
              W L+D  G L  I+ + + G + D N K WRL AD + +    +++L+P FP   + 
Sbjct: 116 TFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIV 175

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I       RS   +   +TR+     FA   N A++ AK  +Q  ++  +G+ LG+ LA 
Sbjct: 176 IMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLGMLLAQ 235

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
                     ASF  +T  HMY N K+ QS+ L TLN  RAS++   +   G+    ++V
Sbjct: 236 ITRGHALSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVLVPRKV 295

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLGSKLSDVVNN------ 409
           + +E + P++                S+  K   +++ H R+ LG+K S + ++      
Sbjct: 296 SQQEHILPSW----------------SNWRKLNRIKLPHERVHLGAKASMLTHSDMLVIA 339

Query: 410 KEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVAT 469
           K   H     YE+  Y L +  G   V + + A P D+L+S              G+V  
Sbjct: 340 KTRYH-----YENANYFLLDKQGIVYVFIHKEATPADVLRSFVH-----------GLVLA 383

Query: 470 STSADCAP 477
           S++ +  P
Sbjct: 384 SSTQNSKP 391


>gi|212274801|ref|NP_001130544.1| uncharacterized protein LOC100191643 [Zea mays]
 gi|194689438|gb|ACF78803.1| unknown [Zea mays]
 gi|223974155|gb|ACN31265.1| unknown [Zea mays]
 gi|413956565|gb|AFW89214.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
          Length = 442

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 122/428 (28%), Positives = 202/428 (47%), Gaps = 48/428 (11%)

Query: 59  ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
           A+  E +    +  V E  GS  + L    ++ A + ASAS+ +          W+    
Sbjct: 3   ASHPEGEATAGWLTVEEWSGSSASAL----SRTAVLTASASSLTAHRF---GSRWERIGS 55

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
             +   +PEGFP SVT DY+ + +W ++QG+++ I  +L+TQALL AIG+G + A    A
Sbjct: 56  RMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGA 115

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
              W L+D  G L  I+ + + G + D N K WRL AD + +    +++L+P FP   + 
Sbjct: 116 TFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIV 175

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I       RS   +   +TR+     FA   N A++ AK  +Q  ++  +G+ LG+ LA 
Sbjct: 176 IMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLGMLLAQ 235

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
                     ASF  +T  HMY N K+ QS+ L TLN  RAS++   +   G+    ++V
Sbjct: 236 ITRGHALSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVLVPRKV 295

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLGSKLSDVVNN------ 409
           + +E + P++                S+  K   +++ H R+ LG+K S + ++      
Sbjct: 296 SQQEHILPSW----------------SNWRKLNRIKLPHERVHLGAKASMLTHSDMLVIA 339

Query: 410 KEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVAT 469
           K   H     YE+  Y L +  G   V + + A P D+L+S              G+V  
Sbjct: 340 KTRYH-----YENANYFLLDKQGIVYVFIHKEATPADVLRSFVH-----------GLVLA 383

Query: 470 STSADCAP 477
           S++ +  P
Sbjct: 384 SSTQNSKP 391


>gi|291410995|ref|XP_002721793.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
          Length = 468

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 85/244 (34%), Positives = 133/244 (54%), Gaps = 2/244 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFP+SV+ DYL Y LW SVQ  AS +SG LATQA+L  +G+G   A  +AA   W++
Sbjct: 78  LPQGFPESVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGLGVGNAKATVSAATATWLV 137

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P FP  F    + + 
Sbjct: 138 KDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDIAMFLEIMAPVFPVCFTMTVSTSN 197

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   +    
Sbjct: 198 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCP 257

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
             +L  F ++T +H+Y N ++ +++ + TLN  R  LV   +L  G+       N  EPL
Sbjct: 258 SVSLGCFFLLTALHIYANYRAVRALVMETLNESRLRLVLKHFLQQGEVLDPTPANQMEPL 317

Query: 363 FPAF 366
           +  F
Sbjct: 318 WTGF 321


>gi|332030173|gb|EGI69967.1| UPF0420 protein C16orf58-like protein [Acromyrmex echinatior]
          Length = 471

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 103/345 (29%), Positives = 172/345 (49%), Gaps = 29/345 (8%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAA 177
           +F Q  LP+G+P+SV  DY  Y +W +VQ  AS I+G L T +++  IG+G+  A P AA
Sbjct: 52  VFKQIFLPQGYPNSVHPDYTAYQIWDTVQAFASTITGTLTTHSIMKGIGVGESNATPLAA 111

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
           AI W+LK G G +  IM + + G   D   K WRLFAD+L++ A G+E+L P F  + V 
Sbjct: 112 AITWILKSGTGMIGSIMFAWWNGTELDGQCKKWRLFADILDDIAKGIELLVPYFSSYSVI 171

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I   +   +S   +   +TR+      A + N A+V AK  +Q      I    GI + +
Sbjct: 172 ILCISTTMKSIVGVAGGATRTALIHHQAIQNNVADVSAKDGSQETCVNLIASFFGIFILS 231

Query: 297 --HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
             H G    + L  +  +  +H+Y N  + +++ L TLN  R +++  +Y+++ Q P   
Sbjct: 232 LFHDGQ---YVLELYLFLVAVHLYANYSAVKALCLDTLNEDRLAIIVKKYMMNEQIPEPW 288

Query: 355 EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDA-AVEIEHRLQLGSKLSDVVNNKEDA 413
           +VN EE +F                 +L +  +D    +I+  +   + L   + +  + 
Sbjct: 289 KVNKEESVF-----------------LLKNPTRDIHGFDIKIGVSFANILKRNIISFTEM 331

Query: 414 HALFSLYEDEGYIL---TEHGGKFCVVLKESALPQDMLKSLFQAS 455
             L   +E+  YI+    E    F V+LK+     ++L++ F AS
Sbjct: 332 EFLLKFFENRKYIILIDIEKRNIF-VILKKDIQSVEILEAYFYAS 375


>gi|74215416|dbj|BAE41911.1| unnamed protein product [Mus musculus]
          Length = 337

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 88/251 (35%), Positives = 135/251 (53%), Gaps = 5/251 (1%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
           +LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  +G+G   A  +AA   W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
           +KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F    + +
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTMTVSTS 194

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              +    +   +TR+      A R N A+V AK  +Q  V    G+++ + +   +   
Sbjct: 195 NLAKCIVGVAGGATRAALTMHQARRNNMADVSAKDSSQETVVNLAGLLVSLLMLPLVSDC 254

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
              +L  F ++T +H+Y N ++ +++ L TLN  R  LV   +L  G+       N  EP
Sbjct: 255 PSLSLGCFVLLTALHIYANYRAVRALVLETLNESRLQLVLEHFLQRGEVLEPASANQMEP 314

Query: 362 LF---PAFHFF 369
           L+   P+  F 
Sbjct: 315 LWIGDPSLWFL 325


>gi|195643336|gb|ACG41136.1| hypothetical protein [Zea mays]
          Length = 442

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 121/428 (28%), Positives = 201/428 (46%), Gaps = 48/428 (11%)

Query: 59  ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
           A+  E +    +  V E  GS  + L    ++ A + ASAS+ +          W+    
Sbjct: 3   ASHPEGEATAGWLTVEEWSGSSASAL----SRTAVLTASASSLTAHRF---GSRWERIGS 55

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
             +   +PEGFP SVT DY+ + +W ++QG+++ I  +L+TQALL AIG+G + A    A
Sbjct: 56  RMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGA 115

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
              W L+D  G L  I+ + + G + D N K WRL AD + +    +++L+P FP   + 
Sbjct: 116 TFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIV 175

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I       RS   +   +TR+     FA   N  ++ AK  +Q  ++  +G+ LG+ LA 
Sbjct: 176 IMCLGSLSRSFTGVASGATRAALTQHFALANNATDISAKEGSQETLATMLGMGLGMLLAQ 235

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
                     ASF  +T  HMY N K+ QS+ L TLN  RAS++   +   G+    ++V
Sbjct: 236 ITRGHALSVWASFLSLTMFHMYANYKAVQSLSLTTLNYERASILLQYFKECGEVLVPRKV 295

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLGSKLSDVVNN------ 409
           + +E + P++                S+  K   +++ H R+ LG+K S + ++      
Sbjct: 296 SQQEHILPSW----------------SNWRKLNRIKLPHERVHLGAKASMLTHSDMLVIA 339

Query: 410 KEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVAT 469
           K   H     YE+  Y L +  G   V + + A P D+L+S              G+V  
Sbjct: 340 KTRCH-----YENANYFLLDKQGIVYVFIHKEATPADVLRSFVH-----------GLVLA 383

Query: 470 STSADCAP 477
           S++ +  P
Sbjct: 384 SSTQNSKP 391


>gi|410984724|ref|XP_003998676.1| PREDICTED: UPF0420 protein C16orf58 homolog [Felis catus]
          Length = 468

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 86/249 (34%), Positives = 132/249 (53%), Gaps = 2/249 (0%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
            +   LP+GFPDSV+ DYL Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA 
Sbjct: 73  LIAVFLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVSAAT 132

Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
             W++KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F   
Sbjct: 133 ATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTIT 192

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
              +   +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   
Sbjct: 193 ICTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPL 252

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           +     F+L  F  +T +H+Y N ++ +++ + TLN  R  LV   +L  G+       N
Sbjct: 253 VSDCPGFSLGCFFFLTALHIYANYRAVRALVIETLNEGRLRLVLKHFLQRGEVLGPTSAN 312

Query: 358 DEEPLFPAF 366
             EPL+  F
Sbjct: 313 QMEPLWTGF 321


>gi|149725803|ref|XP_001495510.1| PREDICTED: UPF0420 protein C16orf58 homolog [Equus caballus]
          Length = 468

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 104/327 (31%), Positives = 161/327 (49%), Gaps = 22/327 (6%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAV----WEVKGSKRTKLIPDFTKDAFVVASASNA 101
            C  Q  +  AR   ++ D G+ +++A     W   G+   K            A+AS A
Sbjct: 11  LCSEQYGSGAARGCCAAVD-GSLQWEAWGWRWWGFSGAFTAKPEGPDGGGEGAPATAS-A 68

Query: 102 SLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQA 161
            LS LL++               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LAT A
Sbjct: 69  PLSRLLAL--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHA 114

Query: 162 LLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENA 219
           +L  +G+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+L + 
Sbjct: 115 VLLGLGVGDAKASVSAATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKHWRLFADILNDV 174

Query: 220 AFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
           A  LE++ P +P  F  I   +   +   ++   +TR+      A R N A+V AK  +Q
Sbjct: 175 AMFLEIMAPVYPICFTMIVCISNLAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQ 234

Query: 280 GMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
             +    G+++ + +   +     F+L  F + T +H+Y N ++ +++ + TLN  R  L
Sbjct: 235 ETLVNLAGLLVSLLMLPLVSGCPSFSLGCFFLFTALHIYANYRAVRALVIETLNEGRLWL 294

Query: 340 VFSEYLLSGQAPPVKEVNDEEPLFPAF 366
           V   +L  G+       N  EPL+  F
Sbjct: 295 VLKHFLQRGEVLGPTSANRMEPLWTGF 321


>gi|440913323|gb|ELR62787.1| hypothetical protein M91_19503, partial [Bos grunniens mutus]
          Length = 437

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 86/244 (35%), Positives = 129/244 (52%), Gaps = 2/244 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFPDSV+ DYL Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA   W++
Sbjct: 47  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 106

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P  P  F      + 
Sbjct: 107 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTITVCISN 166

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +    +   +TR+      A R N A+V AK  +Q  +    G+++ + +   +  S 
Sbjct: 167 LAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPMVSDSP 226

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
             +L  F  +T +H+Y N ++ +++ + TLN  R  LV   +L  G+       N  EPL
Sbjct: 227 SLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQRGEVLGPTSANQMEPL 286

Query: 363 FPAF 366
           +  F
Sbjct: 287 WTGF 290


>gi|301780856|ref|XP_002925848.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ailuropoda
           melanoleuca]
 gi|281353232|gb|EFB28816.1| hypothetical protein PANDA_015415 [Ailuropoda melanoleuca]
          Length = 468

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 86/244 (35%), Positives = 132/244 (54%), Gaps = 2/244 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFPDSV+ DYL Y LW SVQ  AS +SG LAT A+L  +G+G   A  +AA   W++
Sbjct: 78  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGLGVGNAKASVSAATATWLV 137

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F      + 
Sbjct: 138 KDSTGMLGRILFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPIYPIFFTMTVCTSN 197

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   + +  
Sbjct: 198 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSACP 257

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
            F+L  F  +T +H+Y N ++ +++ L TLN  R  LV   +L  G+       N  EPL
Sbjct: 258 SFSLGCFFFLTALHIYANYRAVRALVLETLNEGRLRLVLKHFLQRGEVLGPTSANQMEPL 317

Query: 363 FPAF 366
           +  F
Sbjct: 318 WTGF 321


>gi|345316269|ref|XP_001510147.2| PREDICTED: UPF0420 protein C16orf58 homolog, partial
           [Ornithorhynchus anatinus]
          Length = 353

 Score =  151 bits (382), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 86/244 (35%), Positives = 132/244 (54%), Gaps = 8/244 (3%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI----PTAAAIN 180
           LP GFPDSV+ DYL+Y LW +VQ  AS ++G LATQA+L  +G+G G +    PT A + 
Sbjct: 4   LPRGFPDSVSPDYLSYQLWDTVQAFASSLTGSLATQAVLRGVGVGDGTMTVDSPTGALL- 62

Query: 181 WVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
             +K G G L +I  +   G   D + K WRLFAD+L +AA  LE++ PAFP +F     
Sbjct: 63  --IKHGTGMLGRIGFAWKKGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPAYFTLTVC 120

Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
            +   +    +   +TR+      A R N A+V AK  +Q  +    G+++ + L   + 
Sbjct: 121 VSSLAKCIVGVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLMLLPLVS 180

Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
                 L  F + T +H+Y N ++ +++ + TLN  R  LV   +L  GQ   +   N++
Sbjct: 181 DRPSLCLPLFLLFTSLHLYANFRAVRAVVMETLNESRLRLVLEHFLREGQVLSLASANEK 240

Query: 360 EPLF 363
           EPL+
Sbjct: 241 EPLW 244


>gi|207080128|ref|NP_001128778.1| DKFZP468D197 protein [Pongo abelii]
 gi|55727564|emb|CAH90537.1| hypothetical protein [Pongo abelii]
          Length = 466

 Score =  151 bits (382), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 89/244 (36%), Positives = 134/244 (54%), Gaps = 4/244 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  IG+G   A  +AA   W++
Sbjct: 78  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLLGIGVGNAKATVSAATATWLV 137

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F    + + 
Sbjct: 138 KDSTGMLGRIVFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSN 197

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +   ++   +TR+      A R N A+V AK  +Q  +   +G+++ + +   + S  
Sbjct: 198 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDSSQETLVNLVGLLVSLLMLPLV-SGC 256

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
           P  L  F  +T +H+Y N ++ +++ + TLN  R  LV   YL  G+       N  EPL
Sbjct: 257 P-GLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLNPTAANRMEPL 315

Query: 363 FPAF 366
           +  F
Sbjct: 316 WTGF 319


>gi|156379270|ref|XP_001631381.1| predicted protein [Nematostella vectensis]
 gi|156218420|gb|EDO39318.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score =  151 bits (382), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 78/241 (32%), Positives = 128/241 (53%), Gaps = 2/241 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           LP+G+P+SV+ DYL Y  W +VQ   S I+G LATQA+L   G+G + A   AA + W+L
Sbjct: 6   LPQGYPESVSKDYLTYQFWDTVQAFCSSITGTLATQAMLKGYGVGNESATALAATMTWIL 65

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           + G G +  I+ +   G + D N K WRLFAD++ + A  +E+++P F  +F  +   + 
Sbjct: 66  RSGTGMVGSILFAWMQGSNLDCNAKKWRLFADIMNDLAILVELISPYFKAYFTLLACLSS 125

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +S   +   STR+      A R N A+V AK  +Q  +   + ++ G+ +   +    
Sbjct: 126 VAKSIVGVAGGSTRAALTQHQARRDNMADVAAKDGSQETLVNLMALLAGLVITPLVTGHT 185

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
                 F + T++H+Y N  +  S+ + TLN  R  +V +E+L +G      EV   EP+
Sbjct: 186 LLTWVLFIIFTFLHLYANYCAVSSVVMETLNTTRLQIVTNEFLRTGHVLSPDEVGKMEPV 245

Query: 363 F 363
            
Sbjct: 246 I 246


>gi|18423070|ref|NP_568713.1| uncharacterized protein [Arabidopsis thaliana]
 gi|16604643|gb|AAL24114.1| unknown protein [Arabidopsis thaliana]
 gi|23296703|gb|AAN13151.1| unknown protein [Arabidopsis thaliana]
 gi|332008476|gb|AED95859.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 497

 Score =  151 bits (381), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 117/433 (27%), Positives = 195/433 (45%), Gaps = 44/433 (10%)

Query: 105 SLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLY 164
           SL S    +DE       +++PEGFP SV   Y+ Y  WR+++       GV  TQ LL 
Sbjct: 93  SLESPQTPFDEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLN 152

Query: 165 AIGLGKGAIPTAAA-INWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGL 223
           ++G  + +  +AA  INW+LKDG G + K++ +  G+ FD + K  R   DLL     G+
Sbjct: 153 SVGASRNSSASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGV 212

Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
           E+ T A PH F+ +  AA   ++ AA+   STR+  Y  FA   N  +V AKGE  G ++
Sbjct: 213 ELATAAVPHLFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 272

Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
             +G    I     I    P  + +F +++  ++  + +  +S+ L TLN  R ++    
Sbjct: 273 DLMGTGFSIL----ISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVES 328

Query: 344 YLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKL 403
           +L +G+ P ++E N +E +F  F +                         +  + LG++ 
Sbjct: 329 FLKTGRVPSLQEGNIQEKIF-TFPWVD-----------------------DRPVMLGARF 364

Query: 404 SDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQASYLYWLE 461
            D   +     A+   ++ E Y++T     GK   +LK  A   D+LK+ F A  L    
Sbjct: 365 KDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALLKHQANSDDILKAAFHAHVLLHFM 424

Query: 462 RNAGIVATSTSADCAPG---------GRLEISLDYVQREFNHVKSDSASVGW-VTDGLIA 511
             +      +     P           R+  S + V   +   KS +A  GW +++ L+ 
Sbjct: 425 NQSKDGNPRSVEQLDPAFAPTEYELESRIAESCEMVSTSYGVFKSRAAEQGWRMSESLLN 484

Query: 512 RPLPNRIRPGYVE 524
              P R R  +V+
Sbjct: 485 ---PGRARLCHVK 494


>gi|395514351|ref|XP_003761381.1| PREDICTED: UPF0420 protein C16orf58 homolog [Sarcophilus harrisii]
          Length = 455

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 88/241 (36%), Positives = 132/241 (54%), Gaps = 2/241 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIP-TAAAINWVL 183
           LP GFPDSV+ DYL Y LW +VQ  AS ++G LATQA+L  +G+G  A   +AA I W+L
Sbjct: 66  LPHGFPDSVSPDYLPYQLWDTVQAFASSLTGSLATQAVLQGVGVGDAAASVSAATITWLL 125

Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KDG G L +I  +   G   D + K WRLFAD+L +AA  LE++ PAFP  F  I   + 
Sbjct: 126 KDGTGMLGRIGFAWLMGSKLDCDAKKWRLFADVLNDAAMFLEIVAPAFPSCFTLIVCVSS 185

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +    +   +TR+      A R N A+V AK  +Q  +    G+++ + L   +    
Sbjct: 186 LAKCIVGVAGGATRAALTMHQARRDNMADVSAKDGSQETLVNLAGLLVSLLLLPLVSERP 245

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
             +LA+F + T +H+Y N ++ +++ + T N  R  LV   ++  G+       N  EPL
Sbjct: 246 RLSLAAFFLFTALHLYANYRAVRAVIMETFNETRLYLVLKHFIQWGEVLTPPSANRMEPL 305

Query: 363 F 363
           +
Sbjct: 306 W 306


>gi|344294431|ref|XP_003418921.1| PREDICTED: UPF0420 protein C16orf58 homolog [Loxodonta africana]
          Length = 468

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 103/331 (31%), Positives = 161/331 (48%), Gaps = 22/331 (6%)

Query: 42  LLYCFCHLQVATAIARTATSSEDDGNKEYDA----VWEVKGSKRTKLIPDFTKDAFVVAS 97
           L    C  +  +  AR   ++ D G+ +++A     W + G+   KL            +
Sbjct: 7   LAVTLCSERFGSGTARGCRAAAD-GSLQWEAGAWRWWGLSGAFSVKLAGRDGGRGGTPGT 65

Query: 98  ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVL 157
           AS+  LS LL+V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG L
Sbjct: 66  ASS-PLSGLLAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGAL 110

Query: 158 ATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADL 215
           ATQA+L  IG+G   A  +AA   W++KD  G L +I+ + + G   D N K WRLFAD+
Sbjct: 111 ATQAVLLGIGVGDAKASVSAATATWLVKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADI 170

Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
           L + A  LE++ P  P  F      +   +   ++   +TR+      A R N A+V AK
Sbjct: 171 LNDVAMFLEIMAPVLPICFTITVCTSNLAKCIVSVAGGATRAALTMHQARRNNMADVSAK 230

Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
             +Q  +    G+++ + +   +      +L  F  +T +H+Y N ++ +++ + TLN  
Sbjct: 231 DSSQETLVNLAGLLVSLLMLPLVSDCPSVSLGCFFFLTALHIYANYRAVRALVMETLNEG 290

Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
           R  L+   +L  G+       N  EPL+  F
Sbjct: 291 RLWLILKHFLQKGEVLSPTSANQMEPLWTGF 321


>gi|307205774|gb|EFN84004.1| UPF0420 protein C16orf58 [Harpegnathos saltator]
          Length = 466

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/366 (30%), Positives = 178/366 (48%), Gaps = 39/366 (10%)

Query: 107 LSVNKLWDECRE------LFVQFM----LPEGFPDSVTSDYLNYSLWRSVQGVASQISGV 156
            SVN++  E  E       FV F+    LP+G+PDSV +DYL Y +W +VQ  AS I G 
Sbjct: 24  FSVNEVKSESTESQDIYFRFVSFVKEVFLPQGYPDSVRADYLAYQIWDTVQAFASTIMGT 83

Query: 157 LATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFAD 214
           L T +++  IG+G+  A P AAAI W+LK G G +  IM + + G   D   K WRLFAD
Sbjct: 84  LTTHSVMQGIGVGESNATPLAAAITWILKSGTGMIGSIMFAWWNGTQLDGQCKKWRLFAD 143

Query: 215 LLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 274
           +L + A GLE++ P    + V I   +   +S   +   +TR+      A R N A+V A
Sbjct: 144 ILNDIAMGLELMVPYLSPYSVPILCTSTIMKSVVGVAGGATRAALTQHQAIRNNLADVSA 203

Query: 275 KGEAQGMVSKAIGIMLGIALAN--HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
           K  +Q      I   +GI + +  H G    + +  +  +  +H+Y N  + +++ L TL
Sbjct: 204 KDSSQETCVNLIASFVGIFVLSLFHDGR---YLIELYLFLVAVHLYANYSAVKALCLDTL 260

Query: 333 NPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKD-AAV 391
           N  R +L+   Y+++ +    +++N EE +F                 +L + +K+    
Sbjct: 261 NEDRLALIIKNYMMNERISEPQKINKEESVF-----------------LLGNPSKEICGF 303

Query: 392 EIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT---EHGGKFCVVLKESALPQDML 448
            I+      S L   V +  +   L   +ED  Y++T   E    F ++LK+   P ++L
Sbjct: 304 NIKIGASFASLLKRNVISSMEMEFLLKFFEDRRYLITIDIETRTIF-IILKKDIKPIEVL 362

Query: 449 KSLFQA 454
           +  F A
Sbjct: 363 EGYFYA 368


>gi|387019749|gb|AFJ51992.1| UPF0420 protein C16orf58-like [Crotalus adamanteus]
          Length = 437

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 88/271 (32%), Positives = 142/271 (52%), Gaps = 13/271 (4%)

Query: 100 NASLSSLLSVNKLW--DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVL 157
           N  L+S    +K W  +    +F    LP+G+P+SV++DYL Y  W ++Q  AS I+G L
Sbjct: 27  NGKLNSKTDASKDWHLNSFHRIFKSIFLPQGYPESVSTDYLAYQFWDTIQAFASSITGTL 86

Query: 158 ATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLL 216
           ATQA+L  +G+G + +   AAA+ W+LKDG G L +I  +          KG  LFAD+L
Sbjct: 87  ATQAVLKGVGVGDETSTVAAAAVTWILKDGTGMLGRIAFAW--------SKG--LFADVL 136

Query: 217 ENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKG 276
            + A  +E++ PAFP  F  I   +G  +    +   +TR+      A R N A+V AK 
Sbjct: 137 NDVAIFMEIMAPAFPACFTLIVCTSGFFKCIVGVAGGATRAALTMHQARRDNMADVSAKD 196

Query: 277 EAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
            +Q  +    G++  + L   +  ++P   A +++ T +H+Y N ++ +++ + TLN  R
Sbjct: 197 GSQETLVNLAGLLFSLFLIPLVVDNLPLTYAFYALFTILHLYANYQAVRAVCMETLNRAR 256

Query: 337 ASLVFSEYLLSGQAPPVKEVNDEEPLFPAFH 367
             LV   YL   +      +N +EPL   F 
Sbjct: 257 LHLVLQHYLKREEVLGPAVINPQEPLLLGFR 287


>gi|328778772|ref|XP_394441.3| PREDICTED: UPF0420 protein C16orf58 homolog [Apis mellifera]
          Length = 464

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 110/373 (29%), Positives = 179/373 (47%), Gaps = 25/373 (6%)

Query: 90  KDAFVVASASNASLSSLLS----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRS 145
           KD F V S    S++ LLS        +     +  +  LP+G+PDSV  DY +Y +W +
Sbjct: 15  KDRFFVKSKDEQSITELLSDTTRTKSFYSGLISIIKEVFLPQGYPDSVHPDYTSYQIWDT 74

Query: 146 VQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
           VQ  AS I G L T +++  +G+G+  A P AAAI W+LKDG G + +IM + + G   D
Sbjct: 75  VQAFASTIMGTLTTHSIMQGVGVGEAAATPLAAAITWILKDGTGMVGRIMFAWWNGTDLD 134

Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
              K WRLFAD+L + A GLE+L P F  + + I   + A +S   +   +TR+      
Sbjct: 135 GQCKKWRLFADILNDLAMGLELLLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTQHQ 194

Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
           A + N A+V AK  +Q      I   +GI + + I  +  + +  +  +  IH+Y N  +
Sbjct: 195 ALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHNGQYIMELYLFLVIIHLYANYSA 253

Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
            +++ L +LN  R +L+   Y+ +   P  +EVN +E                S LL+  
Sbjct: 254 VKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKE----------------SVLLLTK 297

Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKES 441
           S        I+  + L + +   +    D      L+ D  Y+++         + LK+ 
Sbjct: 298 STMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFLDRKYLISIDVQNKTIFICLKKD 357

Query: 442 ALPQDMLKSLFQA 454
           A P D+L++ F A
Sbjct: 358 AQPYDVLEAYFHA 370


>gi|297795749|ref|XP_002865759.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
 gi|297311594|gb|EFH42018.1| EMB1879 [Arabidopsis lyrata subsp. lyrata]
          Length = 499

 Score =  149 bits (376), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 169/356 (47%), Gaps = 31/356 (8%)

Query: 105 SLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLY 164
           SL S    +DE       +++PEGFP SV   Y+ Y  WR+++       GV  TQ LL 
Sbjct: 96  SLESPQTPFDEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTLLN 155

Query: 165 AIGLGKGAIPTAAA-INWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGL 223
           ++G  + +  +AA  INW+LKDG G + K++ +  G+ FD + K  R   DLL     G+
Sbjct: 156 SVGASRNSSASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGAGV 215

Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
           E+ T A PH F+ +  AA   ++ AA+   STR+  Y  FA   N  +V AKGE  G ++
Sbjct: 216 ELATAAVPHLFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIA 275

Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
             +G    I     I    P  + +F +++  ++  + +  +S+ L TLN  R ++    
Sbjct: 276 DLMGTGFSIL----ISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAVES 331

Query: 344 YLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKL 403
           +L +G+ P ++E N +E +F  F +                         +  + LG++ 
Sbjct: 332 FLKTGRVPSLQEGNIQEKIF-TFPWVD-----------------------DRPVMLGARF 367

Query: 404 SDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQASYL 457
            D   +     A+   ++ E Y++T     GK   +LK  A   D+LK+ F A  L
Sbjct: 368 KDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALLKHQANSDDILKAAFHAHVL 423


>gi|8978258|dbj|BAA98149.1| unnamed protein product [Arabidopsis thaliana]
          Length = 407

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 114/435 (26%), Positives = 195/435 (44%), Gaps = 44/435 (10%)

Query: 103 LSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQAL 162
           + S+     L  E       +++PEGFP SV   Y+ Y  WR+++       GV  TQ L
Sbjct: 1   MCSIFVFGDLLQEVGSFLRSYVVPEGFPGSVNESYVPYMTWRALKHFFGGAMGVFTTQTL 60

Query: 163 LYAIGLGKGAIPTAAA-INWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAF 221
           L ++G  + +  +AA  INW+LKDG G + K++ +  G+ FD + K  R   DLL     
Sbjct: 61  LNSVGASRNSSASAAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLLMELGA 120

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
           G+E+ T A PH F+ +  AA   ++ AA+   STR+  Y  FA   N  +V AKGE  G 
Sbjct: 121 GVELATAAVPHLFLPLACAANVVKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGN 180

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           ++     ++G   +  I    P  + +F +++  ++  + +  +S+ L TLN  R ++  
Sbjct: 181 IAD----LMGTGFSILISKRNPSLVTTFGLLSCGYLMSSYQEVRSVVLHTLNRARFTVAV 236

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGS 401
             +L +G+ P ++E N +E +F  F +                         +  + LG+
Sbjct: 237 ESFLKTGRVPSLQEGNIQEKIF-TFPWVD-----------------------DRPVMLGA 272

Query: 402 KLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQASYLYW 459
           +  D   +     A+   ++ E Y++T     GK   +LK  A   D+LK+ F A  L  
Sbjct: 273 RFKDAFQDPSTYMAVKPFFDKERYMVTYSPTKGKVYALLKHQANSDDILKAAFHAHVLLH 332

Query: 460 LERNAGIVATSTSADCAPG---------GRLEISLDYVQREFNHVKSDSASVGW-VTDGL 509
               +      +     P           R+  S + V   +   KS +A  GW +++ L
Sbjct: 333 FMNQSKDGNPRSVEQLDPAFAPTEYELESRIAESCEMVSTSYGVFKSRAAEQGWRMSESL 392

Query: 510 IARPLPNRIRPGYVE 524
           +    P R R  +V+
Sbjct: 393 LN---PGRARLCHVK 404


>gi|345492163|ref|XP_001602603.2| PREDICTED: UPF0420 protein C16orf58 homolog [Nasonia vitripennis]
          Length = 466

 Score =  149 bits (375), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 99/341 (29%), Positives = 165/341 (48%), Gaps = 23/341 (6%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
            F +  LP+GFPDSV  DY+ Y +W ++Q  AS I+G L T +++  +G+G+  A P AA
Sbjct: 49  FFKEVFLPQGFPDSVHKDYVPYQIWDTLQAFASTINGTLTTHSIMRGVGVGESTATPLAA 108

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
           AI W+LKDG G + +I+ + + G + D   K WRLFAD+L +AA  LE+  P       +
Sbjct: 109 AITWILKDGTGMIGRIIFAWWQGNNLDSQCKKWRLFADILNDAAMALEVTVPYISTFSSY 168

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
                   ++   +   +TR+      A + N A+V AK  +QG +    G ++GI +  
Sbjct: 169 TLCLTTGMKAIVGIAGGATRTAIMQHHAIKDNMADVSAKEHSQGTLVNLAGSIVGILILL 228

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
            I       +    V+  +H+  N  +  S+EL TLN  R  L+  +Y+L+     V+++
Sbjct: 229 LINEKFFIYVCLLLVI--VHIVSNYFAVTSLELNTLNEDRLCLIIEDYILNHTMSNVEDI 286

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDA-AVEIEHRLQLGSKLSDVVNNKEDAHA 415
           N  E                S +++L   AK     +IE  +   S L   + N +D   
Sbjct: 287 NSRE----------------SVIILLEKPAKKIFGFDIEIGVSFESVLKMNLINTKDVRF 330

Query: 416 LFSLYEDEGY--ILTEHGGKFCVVLKESALPQDMLKSLFQA 454
           L +L++   Y  ++     K  ++L + A    +LK+ F  
Sbjct: 331 LLALFQRRKYLPVMDLKAKKIHIILSKDADDDIILKAYFHC 371


>gi|412993492|emb|CCO14003.1| predicted protein [Bathycoccus prasinos]
          Length = 584

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 92/250 (36%), Positives = 135/250 (54%), Gaps = 15/250 (6%)

Query: 129 FPD--SVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKD 185
           FP   SVTSDY +Y+ +R  Q +AS    VLATQ +L AIGLG   +IP AAA+NWVLKD
Sbjct: 173 FPQKTSVTSDYWDYAKYRFTQRIASSCMTVLATQQMLAAIGLGASRSIPAAAALNWVLKD 232

Query: 186 GIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAG 244
           G+G L K+ + +NFGR FD + K  R  + L+   A  +E +TP FP HF+ I   A  G
Sbjct: 233 GLGRLGKLSVATNFGRSFDSDVKRLRFSSSLVYTGAVFIETITPFFPKHFLAIATVANVG 292

Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH------- 297
           +S         R      FA   N  E+ AK  AQ +++  IG+ LG+++          
Sbjct: 293 KSIGITTGNVVRPPIQKTFALEENLGEIAAKTSAQQVLADNIGLALGVSVMKFQTKFVSL 352

Query: 298 -IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ-APPVKE 355
            +  ++P AL  F ++  I +    K  +S++LRT+N  RA ++   ++   + A  ++ 
Sbjct: 353 ALQRAVPLAL--FPILASIDLVSIHKQLKSVQLRTVNKERAEIIAEAFVREKKLATQLQV 410

Query: 356 VNDEEPLFPA 365
              E  LFPA
Sbjct: 411 AEKERLLFPA 420


>gi|195115010|ref|XP_002002060.1| GI17175 [Drosophila mojavensis]
 gi|193912635|gb|EDW11502.1| GI17175 [Drosophila mojavensis]
          Length = 417

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 168/337 (49%), Gaps = 24/337 (7%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAINWVL 183
           LP+G+PDSV+ DY  Y +W +VQ   S I G L T A+L  IG+G   +   +A + W+L
Sbjct: 54  LPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNAYSATVTWIL 113

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIGAAA 241
           K+G G++ +I+ + + G   DV+ K WRL AD L + A G+E+ + P +PH    I   +
Sbjct: 114 KEGSGHMGRIIFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCGS 173

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              ++   +   +TR+      A R N A+V +K  +Q      I   +G+ L   I S 
Sbjct: 174 TLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFMGLYLLTMIKSQ 233

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
                  + VVT +H+Y NLK+ +++ LRT N  R  +   E+  S Q    ++VN    
Sbjct: 234 AMLYTVFYIVVT-LHLYANLKAVRAVCLRTFNESRYLITLEEFFRSSQMLTPQQVN---- 288

Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYE 421
                   K++     Q++ +S + +   + ++H +    K SDV+   ED  + F  +E
Sbjct: 289 --------KLERVTLGQMVSVSLKIR-LGLSVKHLID-EYKSSDVI---EDIVSSFDPHE 335

Query: 422 DEGYILTEHGGKFCVVLKESALPQDMLKS-LFQASYL 457
              +I+ E      V L     PQD+LK+  F  SYL
Sbjct: 336 H--FIIAESKKFLGVYLHFDTRPQDVLKAYFFGVSYL 370


>gi|443714847|gb|ELU07084.1| hypothetical protein CAPTEDRAFT_141059, partial [Capitella teleta]
          Length = 266

 Score =  148 bits (373), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 85/255 (33%), Positives = 135/255 (52%), Gaps = 4/255 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           LP+GFP SV+ DY+ Y +W +VQ  AS I+  LATQA+L  +G+G + A   AA + W++
Sbjct: 7   LPKGFPASVSDDYVQYQIWDTVQAFASSITNTLATQAVLKGVGVGDEQATVLAATLTWLM 66

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KDG G L +I+ +   G + D + K WRLFAD+L + +  +++  P F  +F  I   AG
Sbjct: 67  KDGTGMLGRILFTWIQGSYLDCDCKRWRLFADILNDVSILMDICAPFFRVYFTLIVCVAG 126

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             RS   +   +TR+      A R N A+V AK  +Q  +     ++  +AL   +    
Sbjct: 127 VCRSIVGVAGGATRAAVTQHQACRSNMADVSAKDGSQETLVNLAALLCNLALVPMVSDRQ 186

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL--SGQAPPVKEVNDEE 360
                 F+  T +H+Y N K+ +S+++ T N  R  L   E++   +G  P VK VN  E
Sbjct: 187 WLVWILFTCFTTLHIYANFKAVRSLDMPTFNQVRLHLAAQEFISDNTGNFPGVKTVNQRE 246

Query: 361 PLFPAFHFFKIKSAN 375
           P+  +     + S +
Sbjct: 247 PVLCSESLLSVPSTS 261


>gi|198431163|ref|XP_002119233.1| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Ciona
           intestinalis]
          Length = 414

 Score =  148 bits (373), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 168/356 (47%), Gaps = 41/356 (11%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           ++F    LP+G+P+SV+ DYL+Y +W +VQ   S I+G LAT A+L   G+G      A+
Sbjct: 47  QIFKDIFLPKGYPESVSEDYLSYQIWDTVQAFCSSITGTLATHAVLKGSGVGDETANVAS 106

Query: 178 A-INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-HF 234
           A I W+LKDG G L +I  + F G   D N K WRLFAD++ + A  ++++ P  P  +F
Sbjct: 107 ATITWLLKDGTGMLGRIAFAYFKGSSLDCNAKQWRLFADVMNDCAIFIQLVAPVLPKAYF 166

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
             +   AG   S   +    TR+      A   N A+V AK  +Q  +     ++ G+ +
Sbjct: 167 TLVMCLAGLAFSLVGVAGGCTRAALTMHQAKCNNMADVSAKDGSQETLVNLAALLTGLVI 226

Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
              + +++P     F + T++H++ N K+  S+ +   N  R S V   +    +    K
Sbjct: 227 MPIVATNIPLMWLLFLIFTFLHVFANYKAVASVVMELFNRNRLSKVIKVFCDMNKVLTPK 286

Query: 355 EVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
           E N++E +F  P++ +                            L LG KL D+     +
Sbjct: 287 EANEKEAIFFPPSYSW----------------------------LNLGCKLEDLGLTSSE 318

Query: 413 AHALFSLYEDEG--YIL--TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNA 464
            H      +D G  Y++   +  G+  V L  S    D+L++ ++ SYL  + R +
Sbjct: 319 FH---QKVDDGGVRYLIKRNQKTGQIMVSLNHSCTASDILEAFYE-SYLLHINRTS 370


>gi|328872765|gb|EGG21132.1| DUF647 family protein [Dictyostelium fasciculatum]
          Length = 534

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 84/264 (31%), Positives = 138/264 (52%), Gaps = 10/264 (3%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAI 173
           +  ++F +  LP+G+P SVT DY  Y  W  +Q      + VL+T ALL ++G+G   ++
Sbjct: 110 KLTKMFQKLFLPKGYPQSVTHDYSQYMKWMFLQNTLGSTTYVLSTHALLTSVGVGLSTSL 169

Query: 174 PTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
           P AAAI+WVLKDG+G  + ++  S +    D N K ++   D+L N    LEMLTP  P 
Sbjct: 170 PFAAAISWVLKDGLGASALVLFASKYSTSLDYNLKSFKFRGDILHNVGVMLEMLTPFVPG 229

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
           +F+ + + +   +  A LI  +TR+     F+ + N  ++ AK ++Q M S   G+ LG 
Sbjct: 230 YFLPLASVSNLSKGLAGLIYGATRASLNRSFSLQENLGDITAKYQSQSMASYLTGMGLGT 289

Query: 293 ALANHIGSSMPFA-LASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
            +   +      A L +  +++ +HM C++KS +SI L+TLN  R+ ++   +L      
Sbjct: 290 TIGLLMSPMTTTANLCAVFIISLLHMSCSVKSLKSICLKTLNSQRSKIIIDHWLREKIIL 349

Query: 352 PVKEVNDEEPLF-------PAFHF 368
             + VN  E          P  HF
Sbjct: 350 SPEIVNKNENFMFSSSTDTPKIHF 373


>gi|260815171|ref|XP_002602347.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
 gi|229287656|gb|EEN58359.1| hypothetical protein BRAFLDRAFT_234382 [Branchiostoma floridae]
          Length = 244

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 82/240 (34%), Positives = 132/240 (55%), Gaps = 2/240 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           LP+G+P+SV+ DYL+Y +W +VQ   S I+G LATQA+L  +G+G + A    A I W+L
Sbjct: 5   LPQGYPESVSDDYLSYQIWDTVQAFCSSITGTLATQAVLKGVGVGDETATAAGATITWIL 64

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KDG G L +I+ +   G   D + K WRLFAD++ + +  LE++ PAFP  F  I   AG
Sbjct: 65  KDGTGMLGRILFAWLQGTSLDCDAKRWRLFADIVNDFSIFLELVAPAFPLFFTGIVCVAG 124

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
            G++   +   +TR+      A R N A+V AK  +Q  +   + ++ G+ +   +  + 
Sbjct: 125 VGKAIVGVSGGATRAALTMHQARRDNMADVSAKDGSQETLVNLMALLAGLVVTPMVADNP 184

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
                 F V T +H++ N  +   + + TLN  R  ++ S YL +G      + N +EP+
Sbjct: 185 LLTWFLFFVFTCLHLFANYNAVTCVVMETLNQSRLHILVSHYLRTGVVLNPAQANVQEPV 244


>gi|380030107|ref|XP_003698699.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58 homolog
           [Apis florea]
          Length = 464

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 110/373 (29%), Positives = 178/373 (47%), Gaps = 25/373 (6%)

Query: 90  KDAFVVASASNASLSSLLS----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRS 145
           KD F V S    S++ LLS        +     +  +  LP+GFPDSV  DY +Y +W +
Sbjct: 15  KDRFFVKSKDEQSITELLSDTTRTKSFYSGLISIIKEVFLPQGFPDSVHPDYTSYQVWDT 74

Query: 146 VQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
           VQ  AS I G L T +++  +G+G+  A P AAAI W+LKDG G + +IM + + G   D
Sbjct: 75  VQAFASTIMGTLTTHSIMQGVGVGEAAATPLAAAITWILKDGTGMVGRIMFAWWNGTDLD 134

Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
              K WRLFAD+L + A GLE+L P F  + + I   + A +S   +   +TR+      
Sbjct: 135 GQCKKWRLFADILNDLAMGLELLLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTNIK 194

Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
           A + N A+V AK  +Q      I   +GI + + I  +  + +  +  +  IH+Y N  +
Sbjct: 195 ALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHNGQYIMELYLFLVIIHLYANYSA 253

Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
            +++ L +LN  R +L+   Y+ +   P  +EVN +E                S LL+  
Sbjct: 254 VKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKE----------------SVLLLTK 297

Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKES 441
                    I+  + L + +   +    D      L+ D  Y+++         + LK+ 
Sbjct: 298 PTMNICGFNIKIGVSLAALIKKDLILTSDTELSLKLFLDRKYLISIDVQNKTIFICLKKD 357

Query: 442 ALPQDMLKSLFQA 454
           A P D+L++ F A
Sbjct: 358 AQPYDVLEAYFHA 370


>gi|145349487|ref|XP_001419164.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579395|gb|ABO97457.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 269

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 89/261 (34%), Positives = 138/261 (52%), Gaps = 16/261 (6%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
            C E     +LPE  P SVT+DY  Y+ +R  Q +AS    V ATQ +L A+GLG    +
Sbjct: 11  RCVERAQHALLPE--PSSVTNDYWEYAKFRFAQRIASSCISVFATQQMLTAVGLGASRTL 68

Query: 174 PTAAAINWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
           P AAA+NWVLKDG+G L K+ + +NFGR FD + K +R  + ++ +A+  +E++TP FP 
Sbjct: 69  PAAAAVNWVLKDGLGRLGKLSVAANFGRTFDSDVKRFRFTSSVVYDASSFIEIITPYFPQ 128

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
           HF+ +   A  G+S         R+     F    N AEV AK  AQ +++  IG+ L +
Sbjct: 129 HFLPLATVANIGKSVGITTANVVRAPIQRTFILEENLAEVAAKTSAQQVLADNIGLALAV 188

Query: 293 ALAN----------HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFS 342
             A            I  ++P  + +F  +  + + C  K  ++++LRT+N  RA ++  
Sbjct: 189 GAAKAVSKVATVRPEIRRALP--VFAFGPLAALDLICIYKELKAVQLRTINKERAEIIAE 246

Query: 343 EYLLSGQAPPVKEVNDEEPLF 363
            ++   + P    V D E LF
Sbjct: 247 AFVKEREIPSRAHVADAERLF 267


>gi|66818028|ref|XP_642707.1| DUF647 family protein [Dictyostelium discoideum AX4]
 gi|74861464|sp|Q86K80.1|U420_DICDI RecName: Full=UPF0420 protein
 gi|60470803|gb|EAL68775.1| DUF647 family protein [Dictyostelium discoideum AX4]
          Length = 527

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 77/260 (29%), Positives = 138/260 (53%), Gaps = 6/260 (2%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA 178
           +  +  LP G+PDSVT+DY  Y  W S+Q + S I+G LAT+A+L   G+G  +   A+A
Sbjct: 58  VLCELFLPNGYPDSVTTDYFGYQFWDSIQALCSTITGTLATRAILKGYGVGDSSATVASA 117

Query: 179 -INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH-FV 235
              W+++DG+G + +I+ +   G   D N K WR  AD+L N     EM++P F    F+
Sbjct: 118 TTQWLIRDGMGMIGRIVFAWRKGTDLDCNSKKWRYTADILNNIGMAFEMISPLFSSQLFL 177

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
            +       +S   +    T++     FA R N A+V AK  +Q      +G++L + ++
Sbjct: 178 PLSCIGLIAKSICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAVNLVGMLLSVIVS 237

Query: 296 NHI--GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS-GQAPP 352
           + I   +S+      F   T +H++CN ++  +++L+++N YRA L++  ++ + G  P 
Sbjct: 238 SFINDNTSLIVTWLVFLFFTSLHLFCNYRAVSAVQLKSINRYRAYLIYDYFIHNQGSIPS 297

Query: 353 VKEVNDEEPLFPAFHFFKIK 372
             E++  E +  +     I+
Sbjct: 298 PSEISKLENILFSIKELDIR 317


>gi|357133639|ref|XP_003568431.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2 [Brachypodium
           distachyon]
          Length = 508

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 99/348 (28%), Positives = 164/348 (47%), Gaps = 47/348 (13%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
           +E       +++PEGFPDSVT  Y+ Y  WR+++        V  T+ALL ++G+ +   
Sbjct: 121 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMSVFTTRALLNSVGVSQS-- 178

Query: 174 PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
                     +DG G + K++ +  G+ FD + K  R   DLL     G+E+ T AFPH 
Sbjct: 179 ----------RDGAGRVGKMLFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPHL 228

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
           F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG  
Sbjct: 229 FLPLACMANVVKNVAAVTSTSTRTPIYKAYAKGENIGDVTAKGESVGNIAD----LLGTG 284

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           ++  I    P  +ASF+ ++  ++  + +  +S+ L TLN  R ++    ++ +G  P +
Sbjct: 285 MSILISKRNPSLVASFAFLSCGYLLSSYREVRSVVLNTLNTARFTVAVDSFIKTGHVPSL 344

Query: 354 KEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 411
           KE N EE +F  P  H                             + +GS+  +      
Sbjct: 345 KEGNLEETIFNPPWRH---------------------------QPVAIGSRFGEAFQEPA 377

Query: 412 DAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
              +   L+EDE YI+T +  K  V  +LK+ A P D+LK+ F A  L
Sbjct: 378 SFVSTSPLFEDERYIVTYNPTKDKVYALLKDQAKPDDILKAAFHAHVL 425


>gi|449017769|dbj|BAM81171.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 827

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 177/362 (48%), Gaps = 28/362 (7%)

Query: 120 FVQFMLPEGF------PDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGA- 172
            V  +LPEGF      P  V+ DYL Y++   +Q +    + V+ATQ++L AIG+ K A 
Sbjct: 334 LVGSLLPEGFGDKTRFPPVVSDDYLPYTVLHFMQNLVHAAASVIATQSMLIAIGVAKQAT 393

Query: 173 IPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
           I  +AA+ WVLKDG+G ++ I+  S  G  +D +PK WR+  D+L     G E+ T   P
Sbjct: 394 IGVSAAVQWVLKDGLGRIAAIIFGSLLGNRYDADPKRWRMRGDMLYTLGVGTEIATRLVP 453

Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
            +F+ +G  A A +S + +++  T +     F+ R NF ++ AK  +Q ++S  +G  +G
Sbjct: 454 QYFLLVGTMANAIKSVSYMMRLPTAAAIRRHFSLRENFGDLSAKANSQEVLSMLLGTFVG 513

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG--Q 349
           I L+  I +S+    A + +   + +  N  S + + +RTLN  R  +V   Y  SG  +
Sbjct: 514 IGLSYMIDNSLKALFAFYIIYVQMFILFNYWSLRVLRMRTLNLQRTLIVMRRYWNSGGVE 573

Query: 350 APPVKEVNDEEP-LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE-----HRLQLGSKL 403
              V   N  E  L P+    ++  A + +      EA     E E      RL  G  L
Sbjct: 574 TCSVAMANQAENFLLPS----RLGPARRVRFGCRLGEAFRTVGEFEEAIARQRLWRGQYL 629

Query: 404 --SDVVNNKEDAHALFSLYE----DEGYIL--TEHGGKFCVVLKESALPQDMLKSLFQAS 455
             +DV     +A A F+       DE Y+L      G+  VVL   A   D L++L  A+
Sbjct: 630 QQTDVSKRNGNADADFTSVPDDVLDERYLLGVDAKRGRVFVVLHRDAKVADELQALMHAT 689

Query: 456 YL 457
           YL
Sbjct: 690 YL 691


>gi|328869935|gb|EGG18310.1| DUF647 family protein [Dictyostelium fasciculatum]
          Length = 407

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 83/257 (32%), Positives = 139/257 (54%), Gaps = 11/257 (4%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTA 176
           E+     LP+G+P+SVTSDY  Y  W ++Q + S ++GVLAT+A+L   G+G   A   +
Sbjct: 43  EVLRDMFLPKGYPNSVTSDYTGYQTWDTLQALCSSLTGVLATRAILKGYGVGNLNASVAS 102

Query: 177 AAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHF 234
           A+I W+ +D  G L +I+ +  +G   D N K WR  ADLL +    LEM++P F P +F
Sbjct: 103 ASIQWITRDAAGMLGRIIFAWKWGTDLDCNSKTWRFAADLLNDFGMTLEMISPLFGPSYF 162

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
           + I       +S   +    T++     FA R N A+V AK  +Q      +G++LG+ +
Sbjct: 163 LPISCLGLISKSICGVAGGCTKASLTQHFAKRDNLADVSAKDGSQETAIALVGMLLGMVM 222

Query: 295 ANHI---GSSMPFALAS---FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL-- 346
           A  I     S+     +   F + T IH+ CN ++   ++L ++N YRA+L++  +    
Sbjct: 223 AASIPDDDDSIQVYFKTWLFFIIFTVIHLLCNYRAVSLVQLTSINRYRANLIYQHFNENN 282

Query: 347 SGQAPPVKEVNDEEPLF 363
           + Q P  K+++ +E + 
Sbjct: 283 NNQIPTPKQISLKENIL 299


>gi|312082381|ref|XP_003143421.1| hypothetical protein LOAG_07840 [Loa loa]
 gi|307761414|gb|EFO20648.1| hypothetical protein LOAG_07840 [Loa loa]
          Length = 393

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 89/277 (32%), Positives = 147/277 (53%), Gaps = 8/277 (2%)

Query: 89  TKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQG 148
           T DA+   S  NA  + L SV   W   + LF    +P G+P SV+ DY+NY +W ++Q 
Sbjct: 17  TVDAYTTNSKYNA-FTRLESV-AYW---KSLFRDIFMPRGYPQSVSPDYMNYQIWDTIQA 71

Query: 149 VASQISGVLATQALLYAIGLGKGAIPT-AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNP 206
            AS +S  LAT+A+L  +G+G     T AAA+ W+L+DGIG L++I+ + F   + D + 
Sbjct: 72  FASSMSSALATEAILRGVGVGNETASTMAAAVAWLLRDGIGMLTRILFAWFYSPYLDADC 131

Query: 207 KGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAAR 266
           K WRL AD   + AF L+++TP FP+ F+ I   +   R    +   +TR+      A  
Sbjct: 132 KQWRLIADCFNDLAFCLDLITPIFPNLFMPIVCLSSMVRGVVGVAGGATRTAVVNHQAIL 191

Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQS 326
            N  +V AK  +Q  +     +   + L   +  ++   L  F + T+IH+Y N ++ +S
Sbjct: 192 DNVGDVAAKDGSQETLINIFALFCSLLLLPIVNGNICVWLL-FCIFTFIHLYGNYRAVKS 250

Query: 327 IELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
           ++ RTLN     ++   Y  + +   V +VN++E + 
Sbjct: 251 LQFRTLNQSLLQIIVKNYAETKKIDTVSKVNNKESML 287


>gi|42567566|ref|NP_195771.2| uncharacterized protein [Arabidopsis thaliana]
 gi|209863158|gb|ACI88737.1| At5g01510 [Arabidopsis thaliana]
 gi|332002971|gb|AED90354.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 509

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 109/371 (29%), Positives = 175/371 (47%), Gaps = 45/371 (12%)

Query: 110 NKLW--DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIG 167
           N LW  D  R+    F+ P GFP SV+ DYL+Y LW+    +   I  VL T +LL A+G
Sbjct: 107 NILWLPDVVRD----FVFPSGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLLKAVG 162

Query: 168 LGK--------GAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLEN 218
           +G          A  +AAAI WV KDGIG L ++++   FG  FD +PK WR++AD + +
Sbjct: 163 VGSFSGTSAAATAAASAAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGS 222

Query: 219 AAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEA 278
           A    ++ T  +P  F+ + +     ++ A  ++  +       FA   N  EV AK E 
Sbjct: 223 AGSFFDLATQLYPSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEV 282

Query: 279 QGMVSKAIGIMLGIALANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
             + ++ IG+  GI + +  G   S PF L +++ +  +H++   +S   ++  T+N  R
Sbjct: 283 WEVAAQLIGLGFGILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKR 342

Query: 337 ASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHR 396
           A ++   +++    P   + N  E +     F K +                  V +E  
Sbjct: 343 ARIIVESHVVHSVVPGYVDCNKRENILLWQRFMKPRII--------------FGVSLE-- 386

Query: 397 LQLGSKLSDVVNNKEDAHALFSLYEDEGYILT----EHGGKFCVVLKESALPQDMLKSLF 452
                +LS +  +     AL  +Y  E YILT        +F V  K +A  +D+L+ L+
Sbjct: 387 -----ELSGLEKSVSKVKALLKMYTKEKYILTLNKLNKDTEFSVSFKVNATSRDVLRCLW 441

Query: 453 QASYLYWLERN 463
           QA   YWLE N
Sbjct: 442 QA---YWLEEN 449


>gi|195035521|ref|XP_001989226.1| GH11606 [Drosophila grimshawi]
 gi|193905226|gb|EDW04093.1| GH11606 [Drosophila grimshawi]
          Length = 417

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 104/340 (30%), Positives = 165/340 (48%), Gaps = 30/340 (8%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAINWVL 183
           LP+G+PDSV+ DY+ Y +W ++Q   S I G L T A+L  IG+G   I   +A + W+L
Sbjct: 54  LPKGYPDSVSDDYVVYQIWDTLQAFCSTICGTLCTHAILKGIGVGSENINAYSATVTWIL 113

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIGAAA 241
           K+G G++ +I+ + + G   DV+ K WRL AD L + A G+E+ + P +PH    I   +
Sbjct: 114 KEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQILCGS 173

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS- 300
              ++   +   +TRS      A R N A+V +K  +Q      +   +G+ L   I S 
Sbjct: 174 TMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLTMIKSQ 233

Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
           +M + +  F +V  +H+Y NLK+ +SI LRT N  R  +   E+  S      ++VN  E
Sbjct: 234 AMLYTI--FYIVVTLHLYANLKAVRSICLRTFNESRYLIALEEFFRSSHMLTPQQVNKME 291

Query: 361 PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK-LSDVVNNKEDAHALFSL 419
            +                     +  +  +V +  RL L  K L D   +      + S 
Sbjct: 292 RV---------------------TVGQTVSVSLNIRLGLSVKHLIDEYKSSSVIENIVSS 330

Query: 420 YE-DEGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
           ++  E +I+ E      V L     PQD+LK+ F A SYL
Sbjct: 331 FDPHERFIIAESKKHLGVYLHFDTRPQDVLKAYFFAVSYL 370


>gi|330802843|ref|XP_003289422.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
 gi|325080505|gb|EGC34058.1| hypothetical protein DICPUDRAFT_153796 [Dictyostelium purpureum]
          Length = 552

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 82/273 (30%), Positives = 138/273 (50%), Gaps = 6/273 (2%)

Query: 106 LLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYA 165
           LL   +     + +     LP G+PDSVTSDY  Y  W S+Q + S I+G LAT+A+L  
Sbjct: 42  LLETKRTKSSLQNILYDLFLPSGYPDSVTSDYFRYQFWDSIQALCSSITGTLATRAILKG 101

Query: 166 IGLGKGAIPTAAA-INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGL 223
            G+G  +   A+A   WV++   G + +I+ +   G   D N K WR  AD+L +     
Sbjct: 102 YGVGNSSATVASATTQWVIQSATGMIGRIIFAWANGTDLDCNSKKWRFMADILNDIGMTF 161

Query: 224 EMLTPAFPHHFVFIGAAAGA-GRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMV 282
           EM++P F        +  G   +S   +    T++     FA   N A+V AK  +Q   
Sbjct: 162 EMISPLFNEALFLPLSCIGLFAKSICGVAGGCTKASLTQHFAKNDNLADVSAKDGSQETA 221

Query: 283 SKAIGIMLGIALANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
              +G++L + +++ +   +S+      F + T  H+YCN K+  +++L ++N YRA L+
Sbjct: 222 VSLLGMILSVFVSSVLNDDTSILVTWFIFFIFTGFHLYCNYKAVSAVQLNSINKYRAHLI 281

Query: 341 FSEYLLS-GQAPPVKEVNDEEPLFPAFHFFKIK 372
           +  Y+ + G  P  KE+++ E +      FKI+
Sbjct: 282 YEYYIRNQGSIPSPKEISNFENILFNSKDFKIR 314


>gi|426254633|ref|XP_004020981.1| PREDICTED: UPF0420 protein C16orf58 homolog [Ovis aries]
          Length = 462

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 86/244 (35%), Positives = 129/244 (52%), Gaps = 2/244 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFPDSV+ DYL Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA   W++
Sbjct: 72  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 131

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P  P  F      + 
Sbjct: 132 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFTVTVCISN 191

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +    +   +TR+      A R N A+V AK  +Q  +    G+++ + +   +  S 
Sbjct: 192 LAKCLVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPLVSDSP 251

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
             +L  F  +T +H+Y N ++ +++ + TLN  R  LV   +L  G+       N  EPL
Sbjct: 252 SLSLGCFFFLTALHIYANYRAVRALVIETLNEQRLWLVLRHFLQWGEVLGPTSANRMEPL 311

Query: 363 FPAF 366
           +  F
Sbjct: 312 WTGF 315


>gi|297810247|ref|XP_002873007.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318844|gb|EFH49266.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 510

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 172/370 (46%), Gaps = 41/370 (11%)

Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
            N LW    ++   F+ P GFP SV+ DYL+Y LW+    V   I  VL T +LL A+G+
Sbjct: 107 TNTLW--LPDVVKDFVFPTGFPASVSDDYLDYMLWQFPTNVTGWICNVLVTSSLLKAVGV 164

Query: 169 GK--------GAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENA 219
           G          A  +AAAI WV KDGIG L ++++   FG  FD +PK WR++AD + +A
Sbjct: 165 GSFSGTSAAATAAASAAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSA 224

Query: 220 AFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
               ++ T  +P  F+ + +     ++ A  ++  +       FA   N  EV AK E  
Sbjct: 225 GSFFDLATQLYPSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVW 284

Query: 280 GMVSKAIGIMLGIALANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRA 337
            + ++ IG+  GI + +  G   S PF   +++ V  +H++   +S   ++  T+N  RA
Sbjct: 285 EVAAQLIGLGFGILIIDTPGLVKSFPFVSLTWTSVRLVHLWLRYQSLAVLQFNTVNLKRA 344

Query: 338 SLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRL 397
            ++   +++    P   + N  E +     F K +                  V +E   
Sbjct: 345 RIIVESHVVHSVVPGYVDCNKRENILLGQRFMKPRII--------------FGVSLEEVS 390

Query: 398 QLGSKLSDVVNNKEDAHALFSLYEDEGYILT----EHGGKFCVVLKESALPQDMLKSLFQ 453
            L   +S V        AL  +Y  E YILT        +F V  K +A  +D+L+ L+Q
Sbjct: 391 GLEKSVSKV-------KALLKMYTKEKYILTLNKLNSDTEFSVSFKVNATSRDVLRCLWQ 443

Query: 454 ASYLYWLERN 463
           A   YWLE N
Sbjct: 444 A---YWLEEN 450


>gi|195387984|ref|XP_002052672.1| GJ17681 [Drosophila virilis]
 gi|194149129|gb|EDW64827.1| GJ17681 [Drosophila virilis]
          Length = 417

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 103/337 (30%), Positives = 165/337 (48%), Gaps = 24/337 (7%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAINWVL 183
           LP+G+PDSV+ DY  Y +W +VQ   S I G L T A+L  IG+G   +   +A + W+L
Sbjct: 54  LPKGYPDSVSDDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENVNAYSATVTWIL 113

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIGAAA 241
           K+G G+L +I+ + + G   DV+ K WRL AD L + A G+E+ + P +PH    I   +
Sbjct: 114 KEGSGHLGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQILCGS 173

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              ++   +   +TR+      A R N A+V +K  +Q      I   +G+ L   I S 
Sbjct: 174 TMLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLIASFVGLYLLTMIKSQ 233

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
                  + VVT +H+Y NLK+ +++ LRT N  R  +   E+  S      ++VN    
Sbjct: 234 AMLYTVFYIVVT-LHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSHMLTPQQVN---- 288

Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYE 421
                   K++     Q++ +S   +   + ++H +    K S+V+ N      + S   
Sbjct: 289 --------KLERVTVGQMVSVSLNIR-LGLSVKHLID-EYKSSNVIEN-----IVSSFDP 333

Query: 422 DEGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
            E +I+ E      V L     PQD+LK+ F A SYL
Sbjct: 334 HERFIIAESKKYLGVYLHFDTRPQDVLKAYFFAVSYL 370


>gi|189241392|ref|XP_968556.2| PREDICTED: similar to UPF0420 protein C16orf58 homolog [Tribolium
           castaneum]
          Length = 472

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 95/343 (27%), Positives = 161/343 (46%), Gaps = 22/343 (6%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
            F + +LP G+PDSV+ DY  Y LW +VQ   S I+G+  TQA+L  +G+G   A   +A
Sbjct: 51  FFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTTQAILKGVGVGSAEASALSA 110

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
            I W++KDG G + +I+ + + G+  D N K WRLFADLL + A  LE+  P   H    
Sbjct: 111 TITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLNDGAMILELSIPHVTHLSTE 170

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I   +   ++   +   +TR+      A + N AE+ AK  +Q  +   I    GI L N
Sbjct: 171 ILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDGSQETLVNLIASFTGIFLIN 230

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
           +   +    + +F ++ ++H+Y N ++  ++  + LN  R +L+   YL        + +
Sbjct: 231 YFTDATTQWIVTFLLI-FLHLYSNYQAVTALIFKNLNNLRMTLILKSYLAVNAVLGPECI 289

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
           N  E +F  F                +S  K    +I     LG  L     N ++  +L
Sbjct: 290 NKTESVFLGFG---------------TSVEKFCGFKIVLGQALGKTLRRY--NSDELKSL 332

Query: 417 FSLYEDEGYILTE--HGGKFCVVLKESALPQDMLKSLFQASYL 457
             +Y+D+ Y++          V  ++   PQ+++   + A  L
Sbjct: 333 TEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGFYHAVLL 375


>gi|350411769|ref|XP_003489447.1| PREDICTED: UPF0420 protein C16orf58 homolog [Bombus impatiens]
          Length = 464

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 177/373 (47%), Gaps = 25/373 (6%)

Query: 90  KDAFVVASASNASLSSLLS----VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRS 145
           KD F V S    S++ LLS        +     +  +  LP+G+PDSV  DY  Y +W +
Sbjct: 15  KDRFFVKSKDEQSITELLSDTTRTKSFYSGFISIVKEVFLPQGYPDSVHPDYTPYQIWDT 74

Query: 146 VQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFD 203
           VQ  AS I G L T +++  +G+G+  A P AAAI W+LKDG G + +I+ + + G   D
Sbjct: 75  VQAFASTIMGTLTTHSIMQGVGVGEAAATPLAAAITWILKDGTGMIGRIVFAWWNGTDLD 134

Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF 263
              K WRLFAD+L + A GLE+  P F  + + I   + A +S   +   +TR+      
Sbjct: 135 GQCKKWRLFADILNDLAMGLELFLPYFSSYSLGILCISTAMKSIVGVAGGATRAALTQHQ 194

Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
           A + N A+V AK  +Q      I   +GI + + I  +  + +  +  +  IH+Y N  +
Sbjct: 195 ALQNNLADVSAKDGSQETCVNLIASFVGILILS-IFHNGRYIMELYLFLVMIHLYANYMA 253

Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
            +++ L +LN  R +L+   Y+ +   P  +EVN +E                S LL+  
Sbjct: 254 VKALCLNSLNEDRLALIVKSYISNEVIPKPEEVNKKE----------------SVLLLTK 297

Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT--EHGGKFCVVLKES 441
                    I+  + L + + + +    D      L+ D  Y+++         + LK+ 
Sbjct: 298 PTMSICGFNIKIGVSLAALIKEDIILTSDTELALKLFLDRKYLISIDVQNKTIFICLKKD 357

Query: 442 ALPQDMLKSLFQA 454
           A P D+L++ F A
Sbjct: 358 AQPYDVLEAYFHA 370


>gi|270013094|gb|EFA09542.1| hypothetical protein TcasGA2_TC011650 [Tribolium castaneum]
          Length = 450

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 95/343 (27%), Positives = 161/343 (46%), Gaps = 22/343 (6%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
            F + +LP G+PDSV+ DY  Y LW +VQ   S I+G+  TQA+L  +G+G   A   +A
Sbjct: 51  FFREVLLPHGYPDSVSEDYFEYQLWDTVQAFCSTITGIFTTQAILKGVGVGSAEASALSA 110

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
            I W++KDG G + +I+ + + G+  D N K WRLFADLL + A  LE+  P   H    
Sbjct: 111 TITWIMKDGTGMVGRIIFAWWKGKSLDSNCKKWRLFADLLNDGAMILELSIPHVTHLSTE 170

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I   +   ++   +   +TR+      A + N AE+ AK  +Q  +   I    GI L N
Sbjct: 171 ILCLSTLMKAMVGVAGGATRASITNHQAIKGNMAEIAAKDGSQETLVNLIASFTGIFLIN 230

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
           +   +    + +F ++ ++H+Y N ++  ++  + LN  R +L+   YL        + +
Sbjct: 231 YFTDATTQWIVTFLLI-FLHLYSNYQAVTALIFKNLNNLRMTLILKSYLAVNAVLGPECI 289

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
           N  E +F  F                +S  K    +I     LG  L     N ++  +L
Sbjct: 290 NKTESVFLGFG---------------TSVEKFCGFKIVLGQALGKTLRRY--NSDELKSL 332

Query: 417 FSLYEDEGYILTE--HGGKFCVVLKESALPQDMLKSLFQASYL 457
             +Y+D+ Y++          V  ++   PQ+++   + A  L
Sbjct: 333 TEIYKDKKYLIVPDLKTRTIYVTFEKGETPQEVIAGFYHAVLL 375


>gi|307191039|gb|EFN74793.1| UPF0420 protein C16orf58-like protein [Camponotus floridanus]
          Length = 825

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 98/338 (28%), Positives = 160/338 (47%), Gaps = 21/338 (6%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAIN 180
           +  LP+G+PDSV SDY  Y +W ++Q  AS I   LAT +++  IG+G+  A P AAAI 
Sbjct: 406 EIFLPQGYPDSVHSDYTAYQIWDTIQASASTIMNTLATHSIMIGIGVGESNATPLAAAIT 465

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
           W+LK+G G +  IM + + G   D   K WRLFAD+L + A G+E+L   F  + V I  
Sbjct: 466 WILKNGAGMVGSIMFAWWNGTKLDGQCKKWRLFADILNDMAAGMELLVSYFSSYSVVILC 525

Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
            +   +S   +   +TR       A R N A+V AK  +Q M    I   +GI + + + 
Sbjct: 526 ISTIMKSMVGVAGGATRVALTQHQAIRNNLADVAAKDGSQEMCVNLIASFIGIFILS-LF 584

Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
               + +  +  +  +H+Y N  + + + L  LN  R +L+   Y+++ + P  K +N E
Sbjct: 585 HDRQYLIELYLFLVAVHLYANYSAVKVLCLDILNEDRLALIIKYYMINERIPEPKRLNKE 644

Query: 360 EPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSL 419
           E                S  L+ +         I+  +   S L   V +  +   L   
Sbjct: 645 E----------------SVSLLDNPTKSICGFNIKIGVSFASVLRKNVISSMEMEFLLKF 688

Query: 420 YEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQAS 455
           +ED  Y++          V+LK+     ++L++ F AS
Sbjct: 689 FEDRKYLIAIDVKNRNIFVILKKDIQATEILEAYFYAS 726


>gi|66816549|ref|XP_642284.1| DUF647 family protein [Dictyostelium discoideum AX4]
 gi|60470350|gb|EAL68330.1| DUF647 family protein [Dictyostelium discoideum AX4]
          Length = 567

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 91/290 (31%), Positives = 144/290 (49%), Gaps = 50/290 (17%)

Query: 107 LSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYA- 165
           L  NKL+D  ++LF    LP+G+P +VTSDY+ Y  W  +Q V    + VL+T ALL + 
Sbjct: 114 LDDNKLFDSIKKLF----LPKGYPSTVTSDYIGYQKWMFIQNVIGSTTYVLSTHALLTSV 169

Query: 166 IGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGL 223
           IG+    A+P AAAI+WVLKDG+G  + ++ +N +   FD + K ++   DLL N    L
Sbjct: 170 IGISSTSALPFAAAISWVLKDGLGATALVLFANKYSTLFDYDIKKYKFRGDLLHNFGVLL 229

Query: 224 EMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM-- 281
           EM TP F   F+ + + +   +  A LI  STR+     F+ + N  ++ AK ++Q M  
Sbjct: 230 EMFTPFFQQFFLPLASISNLSKGLAGLIYGSTRASLNRSFSLKENIGDITAKYQSQSMAA 289

Query: 282 ------VSKAIGIML---------------GIALANHIGSSMPFALASFSVVTWIHMYCN 320
                 +  +IG++L               G++                    +IH++C 
Sbjct: 290 YLSGMAIGSSIGMILSVFPLTSVSNLSMVFGLS--------------------FIHIFCG 329

Query: 321 LKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFK 370
           + + +SI+L+TLN  R S++   +L S      K VN EE       + K
Sbjct: 330 INAIKSIDLKTLNQQRTSILIDNWLKSNLILTSKIVNQEEKFITNDKYLK 379


>gi|300176775|emb|CBK25344.2| unnamed protein product [Blastocystis hominis]
          Length = 409

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 92/292 (31%), Positives = 145/292 (49%), Gaps = 12/292 (4%)

Query: 88  FTKDAFVVASASNASLSS-LLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSV 146
           F K++  +   S  SL +  L++       +       LP  +P SV   YLNY  W + 
Sbjct: 68  FDKESNRIKKESTTSLRNHTLTLPVFIKTTKLKLTDIFLPRNYPSSVKGKYLNYCSWAAF 127

Query: 147 QGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVN 205
               S   GVL+TQ++L+AIG+G GA+P AAA+NWVLKDG+G L  +  +       D +
Sbjct: 128 GMGLSAAGGVLSTQSMLFAIGMGAGAVPMAAALNWVLKDGLGQLGGMAFTALVNSRLDAD 187

Query: 206 PKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAA 265
            +GWR+ +  L   +  LE++TP FPH F+ +   A  G++ + L  ++TR+    GF  
Sbjct: 188 SRGWRILSAWLLEISTWLEVMTPLFPHSFLLLATLANVGKNISWLAGSATRAGIRYGFVN 247

Query: 266 RRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG-SSMPFALASFSVVTWIHMYCNLKSY 324
             N  ++ AK  +Q +     G  LGI ++N IG   M + L S   ++ + ++   +S 
Sbjct: 248 AHNMGDITAKEGSQTVAITVFGTFLGIVISNLIGHGHMEYVLMSSMCISAVSLFSIYQSL 307

Query: 325 QSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANK 376
           + + L TLN        S  LLS         ND+  L   +H F ++ A K
Sbjct: 308 RCVSLPTLNYQVGRSSVSRCLLSSS-------NDQ--LKAYYHGFLLRKAIK 350


>gi|125986275|ref|XP_001356901.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
 gi|195148693|ref|XP_002015302.1| GL19630 [Drosophila persimilis]
 gi|54645227|gb|EAL33967.1| GA10253 [Drosophila pseudoobscura pseudoobscura]
 gi|194107255|gb|EDW29298.1| GL19630 [Drosophila persimilis]
          Length = 417

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 103/342 (30%), Positives = 160/342 (46%), Gaps = 28/342 (8%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAIN 180
           +  LP+G+PDSV+ DY  Y +W +VQ   S I G L T A+L  IG+G   I   +A + 
Sbjct: 51  KIFLPKGYPDSVSEDYAAYQIWDTVQAFCSTICGTLCTHAILKGIGVGSENINAYSATVT 110

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
           W++K+G G++ +I+ + + G   DV+ K WRL AD L + A G+E+ + P +PH    I 
Sbjct: 111 WIMKEGCGHVGRILFAWWKGCQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             +   ++   +   +TRS      A R N A+V +K  +Q      +   +G+ L   I
Sbjct: 171 CGSTMLKAIVGVAGGATRSALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLTLI 230

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
             S       F VV  +H+Y NLK+ +S+ LRT N  R  +   EY  S +    ++VN 
Sbjct: 231 -KSQAVLYTVFYVVVALHLYANLKAVRSVCLRTFNESRYLIALEEYFRSSRMLTPQQVNA 289

Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK-LSDVVNNKEDAHALF 417
            E +                     +  +  +V +  RL L  K L D          + 
Sbjct: 290 MERV---------------------TLGQTVSVSLNVRLGLSVKNLIDEYKTSSSIENIV 328

Query: 418 SLYE-DEGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
           S ++  E +I+ E      V L      QD+LK+ F A SYL
Sbjct: 329 SSFDPHEHFIIAESKKYLGVYLHFETRQQDVLKAYFFAVSYL 370


>gi|195437813|ref|XP_002066834.1| GK24688 [Drosophila willistoni]
 gi|194162919|gb|EDW77820.1| GK24688 [Drosophila willistoni]
          Length = 417

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 101/342 (29%), Positives = 166/342 (48%), Gaps = 28/342 (8%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAIN 180
           +  LP+G+PDSV+ DY  Y ++ ++Q   S I G L T A+L +IG+G   I   +A + 
Sbjct: 51  KIFLPKGYPDSVSDDYAAYQIYDTIQAFCSTICGTLCTHAILKSIGVGSENINAYSATVT 110

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
           W+LK+G G++ +I+ + + G   DV+ K WRL AD L + A G+E+ + P +PH    I 
Sbjct: 111 WILKEGSGHVGRILFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             +   ++  ++   +TRS      A R N A+V +K  +Q      +   +G+ +   I
Sbjct: 171 CGSTMLKAVVSVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLLASCVGLYVLTLI 230

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
            S        + VVT +H+Y NLK+ +++ LRT N  R  +   E+  S +    ++VN 
Sbjct: 231 KSPAVLYTVFYLVVT-LHLYANLKAVRAVCLRTFNESRYLIALEEFFRSSRMLTPQQVNG 289

Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK-LSDVVNNKEDAHALF 417
            E +                     +  +  +V +  RL L  K L D   +  D   + 
Sbjct: 290 MERV---------------------TLGQTVSVSLNIRLGLSVKNLVDEYKSSSDIENIV 328

Query: 418 SLYE-DEGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
           S ++  E +I+ E      V L     PQD+LK+ F A SYL
Sbjct: 329 SSFDTHERFIIAESKKYLGVYLHFDTRPQDVLKAYFFAVSYL 370


>gi|168013684|ref|XP_001759421.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689351|gb|EDQ75723.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 712

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 94/285 (32%), Positives = 149/285 (52%), Gaps = 20/285 (7%)

Query: 98  ASNASLSSLL--SVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISG 155
           A N S+S  +  SV KL    + +FV        PD V   YL Y  W+ V    S +  
Sbjct: 257 ARNESVSGQVAGSVLKLRQTLQRIFV--------PDQVWPHYLVYLKWKLVHRFLSSVLH 308

Query: 156 VLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFA 213
              TQA+L+A+G+G K  +P AAA+NWV+KDG+G L K++ + NFGR FD + K  R + 
Sbjct: 309 FQCTQAMLWAVGVGAKRRLPAAAALNWVMKDGVGRLGKLLFTANFGRTFDSDLKRVRFWT 368

Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
            +L +++ GLE+LTP FP HF+ +   A   +S A     +T S  +  FA   N A++ 
Sbjct: 369 SVLFSSSVGLEILTPLFPQHFLLLATLANIAKSIAYAAYLATSSAIHRSFALGDNLADIS 428

Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMP-----FALASFSVVTWIHMYCNLKSYQSIE 328
           AKG AQ +V+  +G+   + L+  I  + P       LA + V+    ++   +  Q++ 
Sbjct: 429 AKGMAQTVVADNLGLAAAVCLSQFI-HNFPKIEKFLPLAMYPVLASAELFAIYQQLQAVH 487

Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE--PLFPAFHFFKI 371
           L+TLN  R  ++ + ++   + P  +EV+  E   L   F  F +
Sbjct: 488 LQTLNKERLEIIINIWVKDRRIPSFEEVSKIENVALLDLFRVFGV 532


>gi|147768659|emb|CAN60615.1| hypothetical protein VITISV_003254 [Vitis vinifera]
          Length = 470

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 160/347 (46%), Gaps = 65/347 (18%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           DE       +++PEGFPDSVT  Y+ Y  WR+++       GV  TQ LL ++G+ +  +
Sbjct: 109 DELMSFIRSYVVPEGFPDSVTPSYVPYMSWRALKHFFGGAMGVFTTQTLLNSVGVSRNRS 168

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            P A AINW+LKDG G + K++ +  G+ FD + K  RL  DLL     G+E+ T A PH
Sbjct: 169 APGAVAINWILKDGAGRVGKMLFARQGKKFDYDLKQLRLAGDLLMELGAGVELATAAVPH 228

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            F+ +  AA   ++ AA+   STR+  Y  FA   N  +V AKGE  G ++  +      
Sbjct: 229 LFLPLACAANVAKNVAAVTSTSTRTPIYKAFAKGENIGDVTAKGECVGNIADLV------ 282

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
                                           +S+ L TLN  R S+    +L +G+ P 
Sbjct: 283 --------------------------------KSVVLHTLNRARYSVAIDCFLKTGRVPS 310

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
           ++E N +E +F +F + + K                        + LGS+  D   +   
Sbjct: 311 LQEGNMKENIF-SFPWLEDKP-----------------------IVLGSRFKDAFQDAGS 346

Query: 413 AHALFSLYEDEGYILTEH--GGKFCVVLKESALPQDMLKSLFQASYL 457
             ++  L+E E Y++T +   GK   +LK+ A   D+LK+ F A  L
Sbjct: 347 YLSIEPLFEKERYLVTYNPSKGKVYALLKDQAKSDDILKAAFHAHML 393


>gi|195996945|ref|XP_002108341.1| hypothetical protein TRIADDRAFT_19061 [Trichoplax adhaerens]
 gi|190589117|gb|EDV29139.1| hypothetical protein TRIADDRAFT_19061, partial [Trichoplax
           adhaerens]
          Length = 272

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 85/273 (31%), Positives = 139/273 (50%), Gaps = 15/273 (5%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           LP+G+P+SV+ DYL Y LW ++Q   S I+G LATQA+L  +G+G + A   AAA+ W+L
Sbjct: 4   LPQGYPESVSGDYLTYQLWDTLQAFCSSITGTLATQAILKGVGVGDEKATVAAAAMTWML 63

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPK------GWRLFADLLENAAFGLEMLTPAFPHHFVF 236
           KDG G + +I+ +   G   D + K       +RLFAD L + A  +EM+ P FPH F  
Sbjct: 64  KDGTGMIGRILFAWLRGTSLDCDAKKMLIFISYRLFADFLNDGALFIEMIAPNFPHLFTL 123

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           +   A   +S   +   +TR+      A R N A+V AK  +Q  +     +++G+ L  
Sbjct: 124 LLCVASVAKSIVGVAGGATRAALTMHQARRNNMADVSAKDGSQETLVNLSALIVGLFLLP 183

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
            +  +       F + T +H+Y N K+  S+ + TLN  R  ++   +   G      EV
Sbjct: 184 AVTGNARIIWTLFFMFTMLHLYANYKAVSSVVMETLNRERLHILAYNFFRYGSILSPAEV 243

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDA 389
              E +      +K++       ++L +E K++
Sbjct: 244 CQRESVIIMKDQYKLR-------MILGAEFKES 269


>gi|226505832|ref|NP_001140509.1| uncharacterized protein LOC100272571 [Zea mays]
 gi|194699768|gb|ACF83968.1| unknown [Zea mays]
          Length = 491

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 96/348 (27%), Positives = 163/348 (46%), Gaps = 47/348 (13%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
           +E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ +   
Sbjct: 106 EEIMSFIRSYVVPEGFPDSVTPSYVPYMNWRALKHFFGGAMGVFTTRTLLSSVGVSQS-- 163

Query: 174 PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
                     KDG G + K++ +  G+ FD + K  R  +DLL     G+E+ T AFP  
Sbjct: 164 ----------KDGAGRVGKMLFARQGKKFDNDLKQLRFSSDLLLEIGAGIELTTAAFPQF 213

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
           F+ +   A   ++ AA+   STR+  Y  +A   N  +V AKGE+ G ++     +LG  
Sbjct: 214 FLPMACVANVVKNVAAVTSTSTRTPIYKAYARGENIGDVTAKGESVGNIAD----LLGTG 269

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           L+  I    P  + SF+ ++  ++  + +  +S+ L TLN  R ++    ++ +G  P +
Sbjct: 270 LSIFISKRNPSLVTSFAFLSCGYLLSSYQEVRSVVLNTLNRARFTVAVDSFIKTGHIPSL 329

Query: 354 KEVNDEEPLF--PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 411
           KE N +E +F  P  H                             + +GS+  +      
Sbjct: 330 KEGNSQETIFNPPWRH---------------------------ETVAIGSRFGEAFQEPA 362

Query: 412 DAHALFSLYEDEGYILTEHGGKFCV--VLKESALPQDMLKSLFQASYL 457
              A+  L+EDE Y++T +  K  V  +LK+ A   D++K+ F A  L
Sbjct: 363 SFLAIRPLFEDERYMVTYNPTKDKVYALLKDQAKSDDIIKAAFHAHVL 410


>gi|398388772|ref|XP_003847847.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
 gi|339467721|gb|EGP82823.1| hypothetical protein MYCGRDRAFT_63982 [Zymoseptoria tritici IPO323]
          Length = 457

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 104/353 (29%), Positives = 182/353 (51%), Gaps = 37/353 (10%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           LP GFP SVT DY  Y ++ S+Q  +S I+ +L+++A+L +IG+G   A PTAA +  VL
Sbjct: 58  LPAGFPHSVTEDYTEYQIYDSLQAFSSSIAAMLSSRAVLSSIGVGDTKASPTAALLLSVL 117

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAA 241
           ++ +G L+ I+ ++ FG   +   K +RL AD+L ++AF L++L+P FP    + I + +
Sbjct: 118 QESVGRLATILFAHRFGTSLEPECKMYRLMADVLNDSAFILDVLSPIFPKPIRIVILSFS 177

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              RS   +   S+++   A FA   N  E+ AK  +Q  V   +G++ G  + + I + 
Sbjct: 178 SILRSLCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLMGMLAGSLVISWITTP 237

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
           +    A   +++ IH+  N ++ +++++RTLN  RA+LVF         P +KE++  E 
Sbjct: 238 LATWTALIGLLS-IHLETNRRAVRAVKMRTLNRQRATLVFHHLQRQQTVPSIKEISSVE- 295

Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAA------------VEIEHRLQLGSKLSDVVNN 409
                H F+          VL +  +D              +E     Q  +K S +   
Sbjct: 296 -----HIFEWDG-------VLRTSTRDIMGYCDIGTPFLRLLEAVSESQTSTKASHI--Q 341

Query: 410 KEDAHALFSLYEDEGYIL------TEHGGKFCVVLKESALPQDMLKSLFQASY 456
           ++    + SLY    YIL      T+   +F ++LK+ A P+D+L + +QA +
Sbjct: 342 QQTLSQILSLYNSSRYILWHDRRSTKDVPRFNIILKKGAEPKDLLIAWWQALF 394


>gi|170582654|ref|XP_001896227.1| cDNA sequence BC017158 [Brugia malayi]
 gi|158596613|gb|EDP34929.1| cDNA sequence BC017158, putative [Brugia malayi]
          Length = 389

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 80/247 (32%), Positives = 138/247 (55%), Gaps = 2/247 (0%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
           L     +P G+P SV+ DY+NY +W ++Q  AS +S  L+T+A+L  +G+G K A   AA
Sbjct: 37  LLRDIFMPRGYPQSVSPDYMNYQMWDTIQAFASSMSSALSTEAILRGVGVGNKAASTMAA 96

Query: 178 AINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
           A+ W+LKDGIG L++I+ +  +  + D + K WRL AD   + AF L+++TP FP  F+ 
Sbjct: 97  AVAWLLKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAFCLDLITPIFPSLFMP 156

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I   +   R+   +  ++TR+      A   N  +V AK  +Q  +     ++  + L  
Sbjct: 157 IICLSSMIRAVVGVAGSATRTTVTNHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLP 216

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
            +  ++      FS+ T+IH+Y N ++ +S++ RTLN     +V  +Y+ + +   V E+
Sbjct: 217 VVSRNIVSVWLLFSLFTFIHLYGNYRAVKSLQFRTLNQSLLRIVVKDYIETRKISTVSEM 276

Query: 357 NDEEPLF 363
           N +EP+ 
Sbjct: 277 NSKEPIL 283


>gi|242056489|ref|XP_002457390.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
 gi|241929365|gb|EES02510.1| hypothetical protein SORBIDRAFT_03g006560 [Sorghum bicolor]
          Length = 624

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/395 (29%), Positives = 190/395 (48%), Gaps = 47/395 (11%)

Query: 82  TKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYS 141
           +KL     K    V +  + +  SL+    +WD        F+LP GFP SV+ DYL+Y 
Sbjct: 206 SKLQVHLEKHVSPVNAVEDENADSLIP-QAIWD--------FVLPAGFPGSVSVDYLDYM 256

Query: 142 LWRSVQGVASQISGVLATQALLYAIGL----GKGAIPTAAAINWVLKDGIGYLSKIML-S 196
           L +    V   I  VL T +LL A+G+    G  A  +AAAI WV KDGIG   ++++  
Sbjct: 257 LLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAASAAAIRWVSKDGIGAFGRLLIGG 316

Query: 197 NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTR 256
            FG  FD +PK WR++AD++ +A    E++TP +P +F+ + +     ++ A   +  + 
Sbjct: 317 RFGALFDDDPKKWRMYADIIGSAGSIFELMTPLYPGYFLPLASLGNLAKAVARGFKDPSF 376

Query: 257 SCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG---SSMPFALASFSVVT 313
                 FA   N  EV AK E   + ++ +G+  G+ + +  G   S +  AL   S V 
Sbjct: 377 RVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSAGVLIMDTAGVKSSYLTLALTWLS-VR 435

Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKS 373
            +H++   +S   ++ RT+N  R  ++   ++     P     N+EE +     F + + 
Sbjct: 436 LLHLWLRYQSLSVLKFRTINLKRGRILVRSHVAQHTVPGYVACNEEENILTWERFLRPQI 495

Query: 374 ANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHG 431
           +                V +E R+  G + SD+V+       L  LY++E YIL   + G
Sbjct: 496 S--------------FGVPME-RMLGGDESSDMVDR------LLKLYKNEKYILFFEQFG 534

Query: 432 GK---FCVVLKESALPQDMLKSLFQASYLYWLERN 463
            +   F V  KESA    +L+SL+QA   +WL +N
Sbjct: 535 SRDPTFLVTFKESATSMSVLRSLWQA---HWLHKN 566


>gi|255081626|ref|XP_002508035.1| predicted protein [Micromonas sp. RCC299]
 gi|226523311|gb|ACO69293.1| predicted protein [Micromonas sp. RCC299]
          Length = 665

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 86/247 (34%), Positives = 133/247 (53%), Gaps = 14/247 (5%)

Query: 129 FP--DSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGA-IPTAAAINWVLKD 185
           FP  +SVT DY  Y+ +R +Q +AS    V ATQ +L AIG+G    +P AAAINWVLKD
Sbjct: 199 FPARESVTEDYWEYAKFRFLQRMASSCITVFATQQMLAAIGMGASRRLPAAAAINWVLKD 258

Query: 186 GIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAG 244
           G+G L K+ + +NFGR FD + K +R  + ++ +A+  +EM+TP FP  F+ +   A  G
Sbjct: 259 GLGRLGKLGVAANFGREFDSDVKRFRFTSSVVYDASSLVEMITPFFPKKFLALATLANIG 318

Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS--- 301
           +S         R+     F    N AEV AK  AQ +V+  +G+ + +      G     
Sbjct: 319 KSVGITTANVVRAPIQRSFVLEENLAEVAAKTSAQQVVADNLGLAVAVLATGLTGKVVND 378

Query: 302 -----MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
                +PF   +F  +  + +YC  +  ++++L T+N  RA ++   ++  G  P  K V
Sbjct: 379 RARLIIPF--VAFIPLATMDLYCIYRELKAVQLTTINKERAEIICDHWINVGTVPGTKYV 436

Query: 357 NDEEPLF 363
           +D E LF
Sbjct: 437 SDAERLF 443


>gi|357127303|ref|XP_003565322.1| PREDICTED: UPF0420 protein C16orf58-like [Brachypodium distachyon]
          Length = 486

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 104/353 (29%), Positives = 171/353 (48%), Gaps = 36/353 (10%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG----KGAIPTAAA 178
           F+LP GFP SV+ DYL Y LW+    V   I   L T +LL A+G+G      A  +AAA
Sbjct: 98  FVLPAGFPGSVSDDYLQYMLWQFPTNVTGWICHTLVTSSLLKAVGVGSFTGTSAAASAAA 157

Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           I WV KDGIG   ++++   FG  FD +PK WR++AD + +     E+ TP +P +F+ +
Sbjct: 158 IRWVSKDGIGAFGRLLIGGRFGTLFDNDPKKWRMYADFIGSVGSIFELTTPLYPGYFLPL 217

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
            +     ++     +  +       FA   N  E+ AK E  G+ ++ +G+ +G+ + + 
Sbjct: 218 ASLGNLAKAVGRGFRDPSNRVIQNHFAKSGNLGEIAAKEEVWGVGAQLLGLSIGVLILDT 277

Query: 298 IG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
            G  SS P    ++  V  +H++   +S   ++ RT+N  RA ++   ++     P    
Sbjct: 278 SGVQSSYPTLTLTWLGVRLLHLWFRYQSLIVLKFRTVNLKRARILVRSHVAHHTVPGYVA 337

Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
            N+EE +     F + + +                V +E R+  G + SD+V+       
Sbjct: 338 CNEEENILTWERFLQPQIS--------------FGVPME-RMLGGGEPSDMVSR------ 376

Query: 416 LFSLYEDEGYIL-----TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERN 463
           L  LY++E YIL           F V  KE+A    +L+SL+QA   +WL +N
Sbjct: 377 LLKLYKNEMYILFVDQSESKEPAFVVTFKEAATSMSVLRSLWQA---HWLHKN 426


>gi|307105811|gb|EFN54059.1| hypothetical protein CHLNCDRAFT_8919, partial [Chlorella
           variabilis]
          Length = 325

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/346 (28%), Positives = 162/346 (46%), Gaps = 31/346 (8%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTA 176
           +L   ++LP+ FP SV   Y +Y   RSVQ        V  T++LL ++G+  K +   A
Sbjct: 5   DLLADYLLPQDFPHSVAPQYSDYMRARSVQYFFGGAMSVFTTRSLLASLGVANKHSSEAA 64

Query: 177 AAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
           AAINWV+KDG G L + + + +GR  D   K +RL  D+L      LE+ T   P  F+ 
Sbjct: 65  AAINWVVKDGAGRLGRFLFARWGRELDCELKQFRLMGDVLMETGAALELSTVLMPRMFLP 124

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           +   A   ++ AA+  +STR+  Y  FA + N A+V AKGE+   ++  +G   GIALA 
Sbjct: 125 LACTANLAKNLAAVTASSTRAPIYRTFAKQNNLADVTAKGESVANLADVVGTAFGIALAK 184

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
              +++P  L +F+ ++  ++  + +   S+ L  LN  R S     +  +G+ P   E 
Sbjct: 185 ---ANLPV-LPTFAALSVGYLIASRREVDSVVLPYLNRARLSYTTRAFYSTGRVPETLEG 240

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHAL 416
           N  EPL P                   S+  +       R+ LG+ + +     +  H  
Sbjct: 241 NYREPLMP------------------WSDPHNG------RVVLGATVEEACAGPQQLHDA 276

Query: 417 FSLYEDEGYILTEHGG--KFCVVLKESALPQDMLKSLFQASYLYWL 460
            + +    Y LT      K   +LK+ A P+ + ++   A  L W+
Sbjct: 277 LAAFSGRQYALTYRPDTRKCYALLKQGASPRSVQQAAMDAHALLWM 322


>gi|351711359|gb|EHB14278.1| hypothetical protein GW7_05117 [Heterocephalus glaber]
          Length = 469

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 78/241 (32%), Positives = 129/241 (53%), Gaps = 2/241 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI-NWVL 183
           LP+GFP+SV+ DYL Y  W SVQ  AS +   LAT +++  +G+G      AAA   W++
Sbjct: 79  LPQGFPESVSPDYLPYQQWDSVQAFASSLCSALATHSVMLGLGVGNAEASLAAATATWLV 138

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           +D  G L +I+ + + G   D N K WRLFAD+L +AA  LE++ P     F  I + + 
Sbjct: 139 RDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDAAMLLELVAPVLSVSFTMIVSISN 198

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   + +  
Sbjct: 199 LAKCIVSVAGGATRAALTVHQARRNNMADVSAKDGSQETLVNLAGLLVSLIMLPLMSNCP 258

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
             +L  F ++T +H+Y N ++ +++ + TLN  R  LV   +L SG+       N  EPL
Sbjct: 259 RLSLGCFFLLTALHIYANYRAVRTLVMETLNESRLRLVLKHFLQSGEVLDTTSANQMEPL 318

Query: 363 F 363
           +
Sbjct: 319 W 319


>gi|299115992|emb|CBN75993.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 479

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 77/224 (34%), Positives = 129/224 (57%), Gaps = 1/224 (0%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
            +  +LP GFP SV   Y+ Y+ W+     AS  +GV++TQALLYA+GLG GA+P AA +
Sbjct: 1   MLGLVLPSGFPASVHPPYMAYAGWQFTGMAASAAAGVMSTQALLYAMGLGAGALPLAATL 60

Query: 180 NWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
           NWV+KDG+G L  +  S+     FD NPK WR+ A +  +A+  LE+L+P  P +F+ I 
Sbjct: 61  NWVIKDGLGQLGGVAFSSLVSTRFDANPKLWRVVAAVSLDASMVLELLSPLAPAYFLPIA 120

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
           + A  G++ + L  +++R+  +   + + N A+V AK  AQ +++   G   G+ L+  +
Sbjct: 121 SVANIGKNVSFLAASASRAAIHNVLSLKGNLADVTAKSGAQTILACMAGTGAGVTLSASL 180

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFS 342
           GS     + +   ++ +H+  N  S + + + T +  R  LV +
Sbjct: 181 GSEFHAIVPACLCLSVVHLGANHMSMRRLGIPTFDHQRLELVVN 224


>gi|414586418|tpg|DAA36989.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 297

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 72/234 (30%), Positives = 132/234 (56%), Gaps = 1/234 (0%)

Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
           + D+  + F+    P G+P SV   YL Y+ +R +Q  +S +  VL+TQ+LL+A GL + 
Sbjct: 60  IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL-RP 118

Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
               A A++W+LKDG+ +  K++ S+ G   D  PK WR+ AD+L +    LE+++P  P
Sbjct: 119 TPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCP 178

Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
             F+ +       +  A +   +TR   Y+ FA   N +++ AKGEA   +   +GI  G
Sbjct: 179 QLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAG 238

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           I LA+ + S+    L +  +++ +H+Y  ++  ++  + TLNP R +++ ++++
Sbjct: 239 IGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFI 292


>gi|242019564|ref|XP_002430230.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212515330|gb|EEB17492.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 527

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 77/248 (31%), Positives = 126/248 (50%), Gaps = 3/248 (1%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
           + L     LP+G+P SV+SDYL Y +W   Q   S I G L  +A++  IG+G   A P 
Sbjct: 37  KSLIKDIFLPKGYPYSVSSDYLEYQIWDFCQAFCSSICGTLTVRAVMKGIGVGSNVATPM 96

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           +AA+ W++KDG G +  IM +   G   D + K WRLFAD+L + A  +E+  P    + 
Sbjct: 97  SAALTWIIKDGTGMIGSIMFAWLKGSKLDSDCKKWRLFADILNDIAMFIELSIPIIMDYS 156

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
           +++  AA   ++      +STR+      A + N A+V AK  +Q + +  +G + GI L
Sbjct: 157 MYMLCAATIFKALVGTAGSSTRAAIITHQAVKGNTADVSAKDGSQELFTNLLGFIFGIFL 216

Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
             ++  S    +      T  H+  N K+ + +++   N  R  L    YL +   P ++
Sbjct: 217 LKNVNESNQVWIV-MCTATIFHLIFNYKAVRCLKINVFNGERLRLTLLSYLPTSCVPTIE 275

Query: 355 EVNDEEPL 362
           +VN  EP+
Sbjct: 276 QVNRREPV 283


>gi|452005321|gb|EMD97777.1| hypothetical protein COCHEDRAFT_1165025 [Cochliobolus
           heterostrophus C5]
          Length = 421

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 99/343 (28%), Positives = 171/343 (49%), Gaps = 13/343 (3%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
            +   LP GFP SVT DY+ Y ++ S+Q  +S I+G+LA++A+L  +G+G   A PTAA 
Sbjct: 1   MIDVFLPSGFPQSVTEDYVEYQIFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTAAL 60

Query: 179 INWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VF 236
           +  VL++ +G ++ I  ++ FG   +   K +RL AD+L ++   L+ L+P FP    V 
Sbjct: 61  LLSVLQESVGRMATIAFADRFGTALEPECKMYRLTADILNDSGMILDCLSPGFPKPLRVA 120

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           + + A   RS   +   S+++   A FA   N  EV AK  +Q  V   +G++ G  + +
Sbjct: 121 VLSFASCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFVVS 180

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
           HI S +     +  ++  IH+  N  + +++ +++LN  RA+++FS  L  G     KEV
Sbjct: 181 HITSDIA-TWCTLILLLVIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLSPKEV 239

Query: 357 NDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE----- 411
           +  E +F      +           + +   +    I  R +L   L+   +  +     
Sbjct: 240 SARERIFERGGVLRWADDEVLGKCRIGTSLSELLGRIGRRDKLSGSLAFCSSGADGESEV 299

Query: 412 DAHALFSLYEDEGYILTEHGG----KFCVVLKESALPQDMLKS 450
           D   L S++E E Y+L   GG    +  +VLK    P D L++
Sbjct: 300 DVFDLLSIFEAEDYVLWASGGPAAYEAVIVLKAGCGPIDQLRA 342


>gi|326427989|gb|EGD73559.1| hypothetical protein PTSG_05267 [Salpingoeca sp. ATCC 50818]
          Length = 404

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 81/251 (32%), Positives = 132/251 (52%), Gaps = 6/251 (2%)

Query: 103 LSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQAL 162
           LSSL S        R       LP  +  SV+  +L+Y  W+ V       SGVL+ Q+L
Sbjct: 104 LSSLTS-----QPLRTRLEHVFLPRDYRASVSPKFLDYCKWQFVHMTTMTASGVLSMQSL 158

Query: 163 LYAIGLGKGAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAF 221
           LYA+G+G G++P AAAINW+LKDG+G L  ++  S FG  FD +PK  R  A L    + 
Sbjct: 159 LYAVGVGAGSVPLAAAINWILKDGLGQLGGMLYGSIFGTRFDDDPKRQRFNAVLSLQVSG 218

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
            ++++TP FPHHF+ + + +  G++ + L  ++TR+     F    N  +V AK  +Q +
Sbjct: 219 IVDIITPLFPHHFLLLASVSNFGKNVSYLASSATRAQMNLSFTRTSNLGDVTAKMTSQSI 278

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
            +   G  LGI ++   G+     +A++  ++ + +  N  S + + L+T N  RA ++ 
Sbjct: 279 AASVFGTGLGILVSKVTGTEAALLMAAYVPLSLVSIGGNYYSSKFVSLKTFNVQRAEILA 338

Query: 342 SEYLLSGQAPP 352
             Y  +    P
Sbjct: 339 RHYFTTKVCTP 349


>gi|293336742|ref|NP_001169416.1| hypothetical protein [Zea mays]
 gi|224029221|gb|ACN33686.1| unknown [Zea mays]
 gi|414876408|tpg|DAA53539.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
          Length = 481

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 116/405 (28%), Positives = 192/405 (47%), Gaps = 43/405 (10%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL----GKGAIPT 175
            + F+LP GFP SV++DYL+Y L +    V   I  VL T +LL A+G+    G  A  +
Sbjct: 91  ILDFVLPAGFPGSVSADYLDYMLLQFPTNVTGWICHVLVTSSLLKAVGVGSFTGTSAAAS 150

Query: 176 AAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           AAAI WV KDGIG   ++++   FG  FD +PK WR++AD + +A    E+ T  +P +F
Sbjct: 151 AAAIRWVSKDGIGAFGRLLIGGRFGALFDDDPKKWRMYADFIGSAGSIFELSTTLYPGYF 210

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
           + + +     ++ A   +  +       FA   N  EV AK E   + ++ +G+ +G+ +
Sbjct: 211 LPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVLI 270

Query: 295 ANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            +  G  SS    + ++  V  +H++   +S   ++ RT+N  R  ++   ++     P 
Sbjct: 271 MDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLKFRTINLKRGRILVKSHVAQHTVPG 330

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKED 412
               N+EE +     + + + +                V +E R+  G K SD+VN    
Sbjct: 331 YVACNEEENILTWERYLRPQVS--------------FGVPME-RMLGGDKSSDMVNR--- 372

Query: 413 AHALFSLYEDEGYIL--TEHGGK---FCVVLKESALPQDMLKSLFQASYLYWLERNAGIV 467
              L  LY++E YIL   + G +   F V  KESA    +L+SL+QA   +WL +N    
Sbjct: 373 ---LLKLYKNEKYILFFEQFGSREPTFLVTFKESATSMSVLRSLWQA---HWLHKN---- 422

Query: 468 ATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIAR 512
                 +  P   LE S+  ++  F        S GW  D +I +
Sbjct: 423 -QLRQDEVFPW--LEESIVALENGFTDFLEQLGSAGWDQDLIILK 464


>gi|321455034|gb|EFX66180.1| hypothetical protein DAPPUDRAFT_302922 [Daphnia pulex]
          Length = 315

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 92/321 (28%), Positives = 151/321 (47%), Gaps = 28/321 (8%)

Query: 62  SEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFV 121
           +E DG K Y        S++   + D T++  V        + S L  N  +     L  
Sbjct: 9   TEKDGEKGY--------SRQYVKVTDTTENGVVC-------IDSDLDSNGKFRGITHLLS 53

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
              LP+G+P SV+ DYL Y +W ++Q  AS +SG L+T+A+L  +G+G   + P AA + 
Sbjct: 54  SVFLPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAATLM 113

Query: 181 WVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTP-------AFPH 232
           W++KDG G + +I+ +   G   D + K WR FADLL + A  LE+  P           
Sbjct: 114 WLIKDGTGMVGRILFAWCNGTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGGSNG 173

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
               I   AG  +S   +   +TR+      A + N A+V AK  +Q  +     ++  +
Sbjct: 174 TMTSILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALITSL 233

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L   +  +       F + T +H++ NLK+ +++ + TLN  R  ++  ++  +     
Sbjct: 234 WLLPILDGATGLTWLMFILFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTKYVAS 293

Query: 353 VKEVNDEEP----LFPAFHFF 369
           VKEVN  EP    L P  +F 
Sbjct: 294 VKEVNRLEPVIIGLIPTGNFI 314


>gi|417401281|gb|JAA47531.1| Hypothetical protein [Desmodus rotundus]
          Length = 457

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 83/248 (33%), Positives = 130/248 (52%), Gaps = 11/248 (4%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
            +   LP+GFPDSV+ DYL+Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA 
Sbjct: 73  LLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVSAAT 132

Query: 179 INWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
             W++KD  G L +I+ + +        KG  LFAD+L + A  LE++ P +P  F    
Sbjct: 133 ATWIVKDSTGMLGRIIFAWW--------KG--LFADILNDVAMFLEIMAPIYPMCFTVTI 182

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             +   +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   +
Sbjct: 183 CISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLV 242

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
                F+L  F ++T +H+Y N ++ Q++ + TLN  R  LV   +L  G+       N 
Sbjct: 243 SDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTSANQ 302

Query: 359 EEPLFPAF 366
            EPL+  F
Sbjct: 303 MEPLWTGF 310


>gi|302766892|ref|XP_002966866.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
 gi|300164857|gb|EFJ31465.1| hypothetical protein SELMODRAFT_64357 [Selaginella moellendorffii]
          Length = 265

 Score =  138 bits (348), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 77/242 (31%), Positives = 133/242 (54%), Gaps = 2/242 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA-INWVL 183
           LPEG+P SV+SDYL +  W ++QG+++ I  +L+TQALL  IG+G        A   W L
Sbjct: 24  LPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWFL 83

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           +D  G +  I+ + + G + D   K WRL AD + +    +++++P FP  F+F      
Sbjct: 84  RDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFMNDIGMLMDLVSPLFPRAFMFFLCLGS 143

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
           A R+   +   +TR+     FA ++N A+V AK  +Q   +  +G+++G+ LA    +++
Sbjct: 144 AARAVTGVAGGATRAALTQHFARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANNI 203

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
                SF  +T  H+Y N K+  ++ L ++N  R ++V   +L  G+AP  +E + +E  
Sbjct: 204 VLMWTSFMSLTAFHVYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQESA 263

Query: 363 FP 364
            P
Sbjct: 264 LP 265


>gi|451846811|gb|EMD60120.1| hypothetical protein COCSADRAFT_151456 [Cochliobolus sativus
           ND90Pr]
          Length = 482

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 111/402 (27%), Positives = 194/402 (48%), Gaps = 13/402 (3%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTA 176
           +  +   LP GFP SVT DY+ Y ++ S+Q  +S I+G+LA++A+L  +G+G   A PTA
Sbjct: 59  QYIIDVFLPSGFPQSVTEDYIEYQVFDSLQAFSSSIAGMLASRAVLQGVGVGDSNATPTA 118

Query: 177 AAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF- 234
           A +  VL++ +G ++ I  ++ FG   +   K +RL AD+L ++A  L+ L+P FP    
Sbjct: 119 ALLLSVLQESVGRMATIAFADRFGTALEPECKMYRLTADILNDSAMILDCLSPGFPKPLR 178

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
           V + + A   RS   +   S+++   A FA   N  EV AK  +Q  V   +G++ G  +
Sbjct: 179 VAVLSFASCLRSLCGVCAGSSKASLSAHFATAGNLGEVNAKDSSQETVISLLGMLAGSFV 238

Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
            +HI S +     +  ++  IH+  N  + +++ +++LN  RA+++FS  L  G     K
Sbjct: 239 VSHITSDLA-TWCTLILLLAIHLATNYAAVRAVSMQSLNRQRANILFSHILQHGIVLNPK 297

Query: 355 EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH 414
           EV+  E +F      +           + +   +    I  R +L   L+   +  +D  
Sbjct: 298 EVSARERIFERGGVLRWADDEVLGKCRIGTSLSELLGRIGRRDKLSGSLAFCSSGGDDGS 357

Query: 415 A-----LFSLYEDEGYILTEHGG----KFCVVLKESALPQDMLKSLFQASYLYWLERNAG 465
                 L S++E E Y+L   GG    +  +VLK    P D L++   A  +    +  G
Sbjct: 358 EVDVFDLLSIFEAEDYVLWATGGPAAYEAVIVLKAGCGPIDQLRAWAHALLVAEKRQRGG 417

Query: 466 IVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTD 507
               + +      G L+ +L  V+R F+    +  + GW  D
Sbjct: 418 DSGGTRAKHGLVVGDLKDTLYEVRRMFDKYGDEMRAKGWNLD 459


>gi|339250912|ref|XP_003374441.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316969248|gb|EFV53376.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 340

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/247 (29%), Positives = 128/247 (51%), Gaps = 4/247 (1%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAIN 180
           +  LP+G+PDSVT DY +Y  W S+Q   S ++   A  A+  A+G+G + A   AA + 
Sbjct: 53  KIFLPQGYPDSVTEDYFDYQCWDSLQAFCSSVTNAFALHAVFKAVGVGNESASLLAATVV 112

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
           W+L++G+G +S+IM + F   H D + K WR  AD+L + +  L+++ P     FV++  
Sbjct: 113 WLLRNGVGMMSQIMFAWFCAGHLDSDCKSWRFVADVLNDCSIFLDLIAPLSESTFVYVAC 172

Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
            A   RS   +   +TR+      + R N A+V AK  +Q  +   I +   +AL   + 
Sbjct: 173 LANVCRSIVGIAGGATRAAITQHQSRRHNMADVAAKDSSQETMVNLIALGFNLALLPLVA 232

Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP--VKEVN 357
             +      F + T++H+Y N ++ ++++  TLN  R  ++   ++         V+ VN
Sbjct: 233 HHVLLVWTLFLMFTFMHIYANYRAIRALKFETLNEKRFRMLVQSFIKDNTNAKMGVEIVN 292

Query: 358 DEEPLFP 364
            EE L P
Sbjct: 293 YEESLVP 299


>gi|417401214|gb|JAA47499.1| Hypothetical protein [Desmodus rotundus]
          Length = 453

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 83/248 (33%), Positives = 130/248 (52%), Gaps = 11/248 (4%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
            +   LP+GFPDSV+ DYL+Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA 
Sbjct: 73  LLAMFLPQGFPDSVSPDYLSYQLWDSVQAFASSLSGSLATHAVLLGIGVGNAKASVSAAT 132

Query: 179 INWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
             W++KD  G L +I+ + +        KG  LFAD+L + A  LE++ P +P  F    
Sbjct: 133 ATWIVKDSTGMLGRIIFAWW--------KG--LFADILNDVAMFLEIMAPIYPMCFTVTI 182

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             +   +   ++   +TR+      A R N A+V AK  +Q  +    G+++ + +   +
Sbjct: 183 CISNLAKCVVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLV 242

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
                F+L  F ++T +H+Y N ++ Q++ + TLN  R  LV   +L  G+       N 
Sbjct: 243 SDCPSFSLGCFFLLTALHVYANYRAVQALVIETLNEGRLWLVLKHFLQRGEVLDPTSANQ 302

Query: 359 EEPLFPAF 366
            EPL+  F
Sbjct: 303 MEPLWTGF 310


>gi|330805147|ref|XP_003290548.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
 gi|325079335|gb|EGC32940.1| hypothetical protein DICPUDRAFT_155062 [Dictyostelium purpureum]
          Length = 493

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/241 (31%), Positives = 133/241 (55%), Gaps = 3/241 (1%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI-GLG-KGAIPTAAAIN 180
           + LP+G+P+SV+ DY++Y  W   Q V   ++ VL+T ALL  + G+    A+P AAAI+
Sbjct: 99  YFLPKGYPNSVSPDYIHYYKWLFCQNVVGSVTYVLSTHALLTTVVGMSVASALPFAAAIS 158

Query: 181 WVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
           WVLKDG+G  + +  +N F    D + K ++   D+L N    LEM TP    +F+ + +
Sbjct: 159 WVLKDGLGAFALVFFANKFSTLLDFDLKKYKFRGDILHNLGVLLEMCTPFIQGYFLPLAS 218

Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIG 299
            +   +  A LI  STR+     F+ + N  ++ AK ++Q M +  +G+ +G      + 
Sbjct: 219 ISNLSKGLAGLIYGSTRASLNKSFSIKDNIGDITAKYQSQSMAAYLMGMGIGSTFGLLLS 278

Query: 300 SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
            S    ++    ++++HM+   K+ +SI L++LN  R S++   +LL+ +    K VN+ 
Sbjct: 279 GSTFSGISIVFSLSFVHMFLGYKAVKSINLKSLNKQRLSIIIDNWLLNSEILNSKIVNNN 338

Query: 360 E 360
           E
Sbjct: 339 E 339


>gi|256077542|ref|XP_002575062.1| hypothetical protein [Schistosoma mansoni]
          Length = 410

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/261 (29%), Positives = 133/261 (50%), Gaps = 6/261 (2%)

Query: 87  DFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSV 146
           D     F +  + +  +SSL  ++  W   ++ F    LP G+P SV+ DYL Y +W ++
Sbjct: 12  DEVSRVFTLEKSQSRQISSLFDLHLSWVTLKKFF----LPVGYPSSVSDDYLEYQIWDTI 67

Query: 147 QGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDV 204
           Q  AS I+G LA+QA+L  +G+G   A   +A++ W+ KDG G + +I+ + + G   D 
Sbjct: 68  QAFASSITGALASQAVLIGVGVGDSSATILSASLTWMFKDGSGMIGRIIFAGYHGIKLDC 127

Query: 205 NPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFA 264
           + K WR  AD+L + A  LE+++P F + F  +   A   +S   +  ++TR+      A
Sbjct: 128 DCKFWRFVADILNDCALFLEIISPLFNYLFTPLLCLANVLKSLVGVAGSATRAAIVQHQA 187

Query: 265 ARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSY 324
              N A+V AK  +Q  +S  +  +L   L   +  +       F   T IH+Y N ++ 
Sbjct: 188 INNNLADVSAKDGSQETLSNLMAWLLNFILLYMVTGNQFLIWFCFICCTSIHLYSNYRAV 247

Query: 325 QSIELRTLNPYRASLVFSEYL 345
           + ++LRT N  R  L   ++ 
Sbjct: 248 KCLKLRTFNRTRFHLAIQQWF 268


>gi|358335880|dbj|GAA28290.2| UPF0420 protein C16orf58 [Clonorchis sinensis]
          Length = 428

 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 88/307 (28%), Positives = 153/307 (49%), Gaps = 20/307 (6%)

Query: 126 PEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLK 184
           P G+P+SV+ DYL Y +W +VQ +AS ++G LA Q++L  +G+G   A    A+++W+LK
Sbjct: 21  PSGYPNSVSPDYLEYQIWDTVQALASSVTGALAAQSVLIGVGVGDVKATILGASLSWMLK 80

Query: 185 DGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGA 243
           DG G + KI+   + G + D + K WRL AD++ + A  LE+ +  FP  F FI   A  
Sbjct: 81  DGSGMIGKIVFVGWQGSNLDCDCKRWRLLADIINDCALFLELASSFFPSVFTFIICLANL 140

Query: 244 GRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMP 303
            +S  ++   +TR+      A   N A+V AK  +Q  +S  + +   +++   +  +  
Sbjct: 141 CKSVVSVAGGATRAAVTLHQALDNNVADVAAKDGSQETLSNLLALFFNLSVVYFVTGNWS 200

Query: 304 FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG--QAPPVKEVNDEEP 361
                F ++T+IH+Y N ++ Q + L   N  R  +   ++        P V +      
Sbjct: 201 IIWTGFVILTFIHVYANYRAVQCLRLSNFNRNRFHIAVQQWFRQRLFHDPSVSDPT---- 256

Query: 362 LFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE-DAHALFSLY 420
              A  F  I+  N  + ++L   A DA      R+ LG  L  + N ++ +   L +++
Sbjct: 257 ---ATPFPSIEWVNSREPILL--PASDA------RIHLGCSLQTLPNKEQYNLRDLINVF 305

Query: 421 EDEGYIL 427
             E Y+L
Sbjct: 306 SQEKYLL 312


>gi|427789495|gb|JAA60199.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 464

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 87/296 (29%), Positives = 139/296 (46%), Gaps = 14/296 (4%)

Query: 111 KLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG- 169
           + W   R  F +  LP G+PDSV+ DY  Y +W S+Q  AS I+G LATQ++L  +G+G 
Sbjct: 36  RKWSSLRAFFSEIFLPVGYPDSVSRDYTEYQIWDSIQAFASSITGTLATQSVLSGVGVGD 95

Query: 170 KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
           +GA   AA   W+L+DG G   +I+ + F G + D + K WRLFAD+L + A  LE++  
Sbjct: 96  QGASVLAATTTWILRDGTGMTGRILFAWFQGSNLDYDSKKWRLFADILNDVAIFLELMCQ 155

Query: 229 AFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
               +   +   +   +S   +   +TR+      A   N A+V AK  +Q  +      
Sbjct: 156 YMKGYVTAVLCVSSVAKSIVGVAGGATRAALTQHQARNNNMADVSAKDGSQETLVNLAAF 215

Query: 289 MLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL-LS 347
           +  + L   +  +       F   T +H++ N ++   + + T N  R ++V   +L  +
Sbjct: 216 LFSLLLLRIVAGNPWLLYGIFISFTLLHIFANYRAVSCVVMETFNRSRYAIVVRRFLETA 275

Query: 348 GQAPPVKEVNDEEPL-------FPAFHF----FKIKSANKSQLLVLSSEAKDAAVE 392
           G   PV  VN  E +       F + +F    F I   NK     +   A D  VE
Sbjct: 276 GDIAPVTWVNARESVWIQCGKPFVSINFGVPLFSIAPTNKELQCRVEDAAGDRKVE 331


>gi|449461779|ref|XP_004148619.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
 gi|449518467|ref|XP_004166263.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 495

 Score =  135 bits (341), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 114/416 (27%), Positives = 187/416 (44%), Gaps = 54/416 (12%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA 178
           L   F+LP GFP+SV+ DYL Y + +    V   I   L T +LL A+G+G  +  T AA
Sbjct: 113 LIKDFILPTGFPESVSDDYLQYMIRQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTTTAA 172

Query: 179 ----INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
               I WV KDGIG + ++ +   FG  FD +PK WR++AD + +A    ++ TP +P +
Sbjct: 173 SAVAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLATPLYPSY 232

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
           F+ + +     ++ A  ++  +       FA   N  E+ AK E   +V++ +G+ +GI 
Sbjct: 233 FLPLASLGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEIAAKEEVWEVVAQLLGLAIGIL 292

Query: 294 LANHIGSSMPFALASFSVV--TW-----IHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
           + +      P  + S+SV+  TW     +H++   +S   +   T+N  RA ++   ++L
Sbjct: 293 ILD-----TPGLVNSYSVLSATWLSMRLLHLWLRYQSLAVLHFNTINLKRARILARAHVL 347

Query: 347 SGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDV 406
             + P   E N EE +     F              +  +    V +E    +GS+ S  
Sbjct: 348 HNKVPGTVECNSEESILVWERF--------------TRPSIIFGVPLEE--MMGSERSST 391

Query: 407 VNNKEDAHALFSLYEDEGYILT----EHGGKFCVVLKESALPQDMLKSLFQASYLYWLER 462
           V        L  LY +E YIL     +   K  V  K  A    +L+S++Q    YWL +
Sbjct: 392 V------MKLLKLYANEKYILMLDSQDKDLKVFVSFKVGASSMTVLRSIWQT---YWLNK 442

Query: 463 NAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRI 518
           +                +L  SL  ++ +FN         GW T  L +  +PN I
Sbjct: 443 HLDTTENVIH-------QLAQSLSEMEDKFNEFVQLLEGAGWDTHQL-SLKVPNNI 490


>gi|357506015|ref|XP_003623296.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
 gi|355498311|gb|AES79514.1| hypothetical protein MTR_7g068310 [Medicago truncatula]
          Length = 492

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 104/356 (29%), Positives = 169/356 (47%), Gaps = 43/356 (12%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA---- 178
           F+LP GFP SV+ DYL Y L +    V   I   + T +LL A+G+G  +  TAAA    
Sbjct: 110 FILPAGFPGSVSDDYLQYMLLQFPTNVTGWICHTIVTSSLLKAVGVGSFSGTTAAASAAA 169

Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           I WV KDGIG + ++ +   FG  FD +PK WR++AD + +A    ++ TP +P +F+ +
Sbjct: 170 IRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLTTPLYPGYFLPL 229

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
            +     ++ A  ++  +     + FA   N  E+ AK E   +V++ +G+ LGI + + 
Sbjct: 230 ASLGNLTKAIARGLKDPSSRVIQSHFAISANLGEIAAKEEVWEVVAQLVGLGLGILILDT 289

Query: 298 IGSSMPFALASFS--VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
            G    + + S +  +V  +H++   +S   ++  T+N  RA  +   ++L    P   +
Sbjct: 290 PGLVKSYTVLSSTWLIVRLLHLWLRYESLSVLQFNTINLKRARTLVKSHVLHSTVPGCMD 349

Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH- 414
            N EE                  +LV S   K   +        GS L D ++  E +H 
Sbjct: 350 CNKEE-----------------NILVWSQFMKPIII-------FGSPL-DKMDGMERSHF 384

Query: 415 ---ALFSLYEDEGYILTEHGG----KFCVVLKESALPQDMLKSLFQASYLYWLERN 463
              AL  LY +E YIL  +      KF V  K  A    +L+S++Q    +WL  N
Sbjct: 385 MVEALLKLYANEKYILVVNQQLDDLKFYVSFKVGATSVSVLRSIWQT---FWLSEN 437


>gi|194880075|ref|XP_001974360.1| GG21126 [Drosophila erecta]
 gi|190657547|gb|EDV54760.1| GG21126 [Drosophila erecta]
          Length = 417

 Score =  135 bits (340), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 102/340 (30%), Positives = 164/340 (48%), Gaps = 24/340 (7%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT-AAAIN 180
           +  LP+G+PDSV+ DY  Y +W + Q   S I G L T A+L  IG+G   I   +A   
Sbjct: 51  KVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATAT 110

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
           W+LK+G G++ +I+ + + G   DV+ K WRL AD L + A G+E+ + P +PH    I 
Sbjct: 111 WILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             +   ++   +   +TRS      A R N A+V +K  +Q      +   +G+ L + I
Sbjct: 171 CCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
             S       F VV  +H+Y NLK+ +++ LR+ N  R  +   E+  S +    ++VN 
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNA 289

Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
            E +         ++ + S  + L    K+   E +      S + D+V++  D H    
Sbjct: 290 MERVTVG------QTVSVSLNIKLGLSVKNLIDEYKS----SSVIEDIVSSF-DPH---- 334

Query: 419 LYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
               E +I+ E      V L     PQD+LK+ F A SYL
Sbjct: 335 ----EHFIIAETKKCLGVYLHFETRPQDVLKAYFFAVSYL 370


>gi|195484210|ref|XP_002090597.1| GE13200 [Drosophila yakuba]
 gi|194176698|gb|EDW90309.1| GE13200 [Drosophila yakuba]
          Length = 417

 Score =  135 bits (340), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 96/344 (27%), Positives = 161/344 (46%), Gaps = 32/344 (9%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT-AAAIN 180
           +  LP+G+PDSV+ DY  Y +W + Q   S I G L T A+L  IG+G   I   +A   
Sbjct: 51  KVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATAT 110

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
           W+LK+G G++ +I+ + + G   DV+ K WRL AD L + A G+E+ + P +PH    I 
Sbjct: 111 WILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             +   ++   +   +TRS      A R N A+V +K  +Q      +   +G+ L + I
Sbjct: 171 CCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
             S       F VV  +H+Y NLK+ +++ LR+ N  R  +   E+  S +    ++VN 
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNA 289

Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
            E +                            V +   ++LG  + ++++  + +  + +
Sbjct: 290 MERV-----------------------TVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIEN 326

Query: 419 LYED----EGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
           +       E +I+ E      V L     PQD+LK+ F A SYL
Sbjct: 327 IVSSFDPHEHFIIAETKKCLGVYLHFETRPQDVLKAYFFAVSYL 370


>gi|219119716|ref|XP_002180612.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408085|gb|EEC48020.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 408

 Score =  134 bits (338), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 107/383 (27%), Positives = 167/383 (43%), Gaps = 34/383 (8%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL---GKGAIPTA 176
            +Q  LP  FP SV   Y  +  +  +  V+     VL+TQ LL AIG+      A   A
Sbjct: 26  LIQHFLPAQFPASVAPGYAQFCGFCFLASVSGSAGMVLSTQTLLLAIGVVGNAHSASVMA 85

Query: 177 AAINWVLKDGIGYLSKIML-SNFG--RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
            A+NWVLKDGIG L  ++  S  G  R FD NPK WR+   L  ++A  L++L+P  P  
Sbjct: 86  GALNWVLKDGIGQLGGVIFASRMGETRRFDANPKKWRMMGALCLDSASLLDILSPFAPSA 145

Query: 234 FVF-IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
            V  +   A A ++   L   ++R+  +       N  +V AK  +Q + +  +G  +GI
Sbjct: 146 VVLPLACFANALKNIGYLTVGASRAALHQALTRAGNLGDVTAKAGSQSIAAGLLGTGVGI 205

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            L+  +G      +  F  ++ IH  CN  + Q ++L   N +R  LV   YL  G+   
Sbjct: 206 GLSAAMGHDAGNFVLGFCGLSLIHQLCNFAALQHVDLHHFNRHRLYLVLDHYLRHGKVLS 265

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVV-NNKE 411
            +E+  +E  FP                 L  E   A+      L +GS L+ +      
Sbjct: 266 PREIATKESFFP-----------------LLREDDTAS-----WLSIGSALTAISPGGPN 303

Query: 412 DAHALFSLYEDEGYIL-TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATS 470
           +   L     DE YI+   + GK  VV  + A  +D++K +F A +   L +  G+    
Sbjct: 304 ELTQLLKACPDESYIINCTNDGKIHVVFLQDAQGEDLVKGMFHACF---LRKKTGLSIDG 360

Query: 471 TSADCAPGGRLEISLDYVQREFN 493
             A  A     ++   +   E N
Sbjct: 361 VEAVVATHEETKVRFPHFLEELN 383


>gi|194758785|ref|XP_001961639.1| GF15067 [Drosophila ananassae]
 gi|190615336|gb|EDV30860.1| GF15067 [Drosophila ananassae]
          Length = 417

 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 103/342 (30%), Positives = 164/342 (47%), Gaps = 28/342 (8%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAIN 180
           +  LP+G+PDSV+ DY  Y +W + Q   S I G L T A+L  IG+G   I   +A + 
Sbjct: 51  KIFLPKGYPDSVSDDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAYSATVT 110

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
           W+LK+G G++ +I  + + G   DV+ K WRL AD L + A G+E+ + P +PH    I 
Sbjct: 111 WILKEGSGHVGRIFFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             +   ++   +   +TR+      A R N A+V +K  +Q      +   +G+ L + I
Sbjct: 171 CGSTLLKAIVGVAGGATRAALTQHHAVRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
             S       F VV  +H+Y NLK+ +++ LRT N  R  +   E+  S +    ++VN 
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYSNLKAVRAVCLRTFNESRYLIALEEFFRSSRMLSPQQVNA 289

Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
            E +         ++ + S  + L    K+   E +      S + D + +  D H    
Sbjct: 290 MERVTIG------QTVSVSLNIKLGLSVKNLIDEYK-----SSSVIDNIVSSFDPH---- 334

Query: 419 LYEDEGYILTEHGGKFC--VVLKESALPQDMLKSLFQA-SYL 457
               E +I+ E  GK C  V L     P D+LK+ F A SYL
Sbjct: 335 ----EHFIIAE--GKKCLGVYLHFDTRPLDVLKAYFFAVSYL 370


>gi|195579948|ref|XP_002079818.1| GD24150 [Drosophila simulans]
 gi|194191827|gb|EDX05403.1| GD24150 [Drosophila simulans]
          Length = 417

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 95/344 (27%), Positives = 161/344 (46%), Gaps = 32/344 (9%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT-AAAIN 180
           +  LP+G+PDSV+ DY  Y +W + Q   S I G L T A+L  IG+G   I   +A   
Sbjct: 51  KVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATAT 110

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
           W+LK+G G++ +I+ + + G   DV+ K WRL AD L + A G+E+ + P +PH    I 
Sbjct: 111 WILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             +   ++   +   +TRS      A R N A+V +K  +Q      +   +G+ L + I
Sbjct: 171 CCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
             S       F VV  +H+Y NLK+ +++ LR+ N  R  +   E+  S +    ++VN 
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNA 289

Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
            E +                            V +   ++LG  + ++++  + +  + +
Sbjct: 290 MERV-----------------------TVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIEN 326

Query: 419 LYED----EGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
           +       E +I+ +      V L     PQD+LK+ F A SYL
Sbjct: 327 IVSSFDPHEHFIIAQTKKCLGVYLHFETRPQDVLKAYFFAVSYL 370


>gi|24585018|ref|NP_609897.2| CG10338 [Drosophila melanogaster]
 gi|22946770|gb|AAF53692.2| CG10338 [Drosophila melanogaster]
 gi|384475974|gb|AFH89820.1| FI20169p1 [Drosophila melanogaster]
          Length = 417

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 95/344 (27%), Positives = 161/344 (46%), Gaps = 32/344 (9%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT-AAAIN 180
           +  LP+G+PDSV+ DY  Y +W + Q   S I G L T A+L  IG+G   I   +A   
Sbjct: 51  KVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATAT 110

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
           W+LK+G G++ +I+ + + G   DV+ K WRL AD L + A G+E+ + P +PH    I 
Sbjct: 111 WILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHFSTQIL 170

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             +   ++   +   +TRS      A R N A+V +K  +Q      +   +G+ L + I
Sbjct: 171 CCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
             S       F VV  +H+Y NLK+ +++ LR+ N  R  +   E+  S +    ++VN 
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNA 289

Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
            E +                            V +   ++LG  + ++++  + +  + +
Sbjct: 290 MERV-----------------------TVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIEN 326

Query: 419 LYED----EGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
           +       E +I+ +      V L     PQD+LK+ F A SYL
Sbjct: 327 IVSSFDPHEHFIIAQTKKCLGVYLHFETRPQDVLKAYFFAVSYL 370


>gi|307106356|gb|EFN54602.1| hypothetical protein CHLNCDRAFT_135084 [Chlorella variabilis]
          Length = 486

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 76/242 (31%), Positives = 125/242 (51%), Gaps = 14/242 (5%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
           F    LPEGFP SV+SDYL Y LW S+QG++S + G+L++QA+L  +G            
Sbjct: 65  FAACFLPEGFPHSVSSDYLPYQLWDSIQGLSSYLRGMLSSQAMLAGVG------------ 112

Query: 180 NWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
            + LKD  G L  I+  S  G   D   K WRL AD++ +    LE+ +P  P  F+ + 
Sbjct: 113 -FFLKDFTGMLGGILFASTQGSGLDCYAKQWRLLADVMNDIGLALELASPLLPGAFLLLA 171

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
                 R+   +  ++TR      FA +RN A++ AK  +Q      +G++LG+A     
Sbjct: 172 CLGSLARAVTGVAGSATRMALTQHFALQRNAADIAAKEGSQETAVTLVGMVLGMAFIRAA 231

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
               P   ++F  +T +H++ N+++ + + + +LN  R  L+   YL +G+A    ++  
Sbjct: 232 HGLEPLIWSAFWALTALHVWANVRAMRCLRIASLNQARLGLLLRHYLRTGEALTPGQMAA 291

Query: 359 EE 360
           EE
Sbjct: 292 EE 293


>gi|195344882|ref|XP_002039005.1| GM17287 [Drosophila sechellia]
 gi|194134135|gb|EDW55651.1| GM17287 [Drosophila sechellia]
          Length = 417

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 95/344 (27%), Positives = 161/344 (46%), Gaps = 32/344 (9%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT-AAAIN 180
           +  LP+G+PDSV+ DY  Y +W + Q   S I G L T A+L  IG+G   I   +A   
Sbjct: 51  KVFLPKGYPDSVSEDYAAYQIWDTAQAFCSTICGTLCTHAILKGIGVGSENINAFSATAT 110

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEM-LTPAFPHHFVFIG 238
           W+LK+G G++ +I+ + + G   DV+ K WRL AD L + A G+E+ + P +PH    I 
Sbjct: 111 WILKEGSGHVGRIVFAWWQGSQLDVDSKKWRLRADFLNDLAMGIEIYVLPKYPHLSTQIL 170

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             +   ++   +   +TRS      A R N A+V +K  +Q      +   +G+ L + I
Sbjct: 171 CCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQETCVNLVASFVGLYLLSLI 230

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
             S       F VV  +H+Y NLK+ +++ LR+ N  R  +   E+  S +    ++VN 
Sbjct: 231 -KSQAVLYTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLIALEEFFRSSRMLSPQQVNA 289

Query: 359 EEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFS 418
            E +                            V +   ++LG  + ++++  + +  + +
Sbjct: 290 MERV-----------------------TVGQTVSVSLNIKLGLSVKNLIDEYKSSSVIEN 326

Query: 419 LYED----EGYILTEHGGKFCVVLKESALPQDMLKSLFQA-SYL 457
           +       E +I+ +      V L     PQD+LK+ F A SYL
Sbjct: 327 IVSSFDPHEHFIIAQTKKCLGVYLHFETRPQDVLKAYFFAVSYL 370


>gi|222624448|gb|EEE58580.1| hypothetical protein OsJ_09902 [Oryza sativa Japonica Group]
          Length = 479

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 166/361 (45%), Gaps = 48/361 (13%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           +PEGFP SVT DY+ + +W ++QG+++ I  +L+TQALL AIG+G K A    A   W L
Sbjct: 71  VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 130

Query: 184 KDGIGYLSKIMLSNF-----------------------------GRHFDVNPKGWRLFAD 214
           +D  G L  I+ + +                             G + D N K WRL AD
Sbjct: 131 RDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 190

Query: 215 LLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 274
            + +    +++L+P FP   + I       RS   +   +TR+     FA   N A++ A
Sbjct: 191 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 250

Query: 275 KGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNP 334
           K  +Q  ++  +G+ LG+ LA+           SF  +T  HMY N K+ QS+ L TLN 
Sbjct: 251 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 310

Query: 335 YRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE 394
            R+S++   ++ +G+    ++V+ +E + P +                SS  K   +++ 
Sbjct: 311 ERSSILLQYFMDNGEVLTPQQVSKQEHILPYW----------------SSWRKFLRIKLP 354

Query: 395 H-RLQLGSKLSDVVN-NKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLF 452
           H  + LG+K S + + +          Y +E Y L +  G   + + + A   D+L S  
Sbjct: 355 HEHVHLGAKASRLTHPDLLQIAKTRPYYRNENYFLLDKEGSVHIFIHKQAAATDILMSFI 414

Query: 453 Q 453
            
Sbjct: 415 H 415


>gi|452823886|gb|EME30893.1| hypothetical protein Gasu_19060 [Galdieria sulphuraria]
          Length = 346

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 77/251 (30%), Positives = 133/251 (52%), Gaps = 8/251 (3%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
           + +  +  LP G+P+SVT DY N+  W S+QG+ S +  V+ T++ L  +G+G   A PT
Sbjct: 61  KNVLRRLFLPSGYPESVTDDYANFEKWDSLQGLCSYLRQVMVTRSTLSGLGVGNNFAHPT 120

Query: 176 AAAINWVLKDGIGYLSKIMLSNFGR--HFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
           AAA+  + ++G G +  ++ + +G+   FD + K WRLFAD++ N A  +E   P FP  
Sbjct: 121 AAALMLLWREGFGVVGGLLFT-WGKSSRFDKDSKSWRLFADVINNVALTMEWTAPLFPTP 179

Query: 234 FVF-IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
             F + ++A    S   +   +T + F A FA R N  ++ +K    G     +G++L  
Sbjct: 180 VFFALHSSANIFHSLCGIAGGATCTAFQAHFAKRNNIGDICSKHGNIGRAVSLVGLLLST 239

Query: 293 AL---ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
                ++ + +S+PF+   F  +T  H+Y N+ + +S+ L  LN  R  +V   Y+   Q
Sbjct: 240 LCFWSSSLLENSLPFSFGVFCFLTVFHIYFNILALRSLHLAVLNKERGYIVAEHYICHKQ 299

Query: 350 APPVKEVNDEE 360
              V  V+ +E
Sbjct: 300 LLDVSSVSKKE 310


>gi|218192328|gb|EEC74755.1| hypothetical protein OsI_10514 [Oryza sativa Indica Group]
          Length = 479

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 101/365 (27%), Positives = 167/365 (45%), Gaps = 48/365 (13%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           +PEGFP SVT DY+ + +W ++QG+++ I  +L+TQALL AIG+G K A    A   W L
Sbjct: 71  VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 130

Query: 184 KDGIGYLSKIMLSNF-----------------------------GRHFDVNPKGWRLFAD 214
           +D  G L  I+ + +                             G + D N K WRL AD
Sbjct: 131 RDLTGMLGGILFTFYQVSNSLANSNRQCHRIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 190

Query: 215 LLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 274
            + +    +++L+P FP   + I       RS   +   +TR+     FA   N A++ A
Sbjct: 191 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 250

Query: 275 KGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNP 334
           K  +Q  ++  +G+ LG+ LA+           SF  +T  HMY N K+ QS+ L TLN 
Sbjct: 251 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 310

Query: 335 YRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE 394
            R+S++   ++ +G+    ++V+ +E + P +                SS  K   +++ 
Sbjct: 311 ERSSIMLQYFMDNGEVLTPQQVSKQEHILPFW----------------SSWRKFLRIKLP 354

Query: 395 H-RLQLGSKLSDVVN-NKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLF 452
           H  + LG+K S + + +          Y +E Y L +  G   + + + A   D+L S  
Sbjct: 355 HEHVHLGAKASRLTHPDLLQIAKTRPYYRNENYFLLDKEGSVHIFIHKQAAATDILMSFI 414

Query: 453 QASYL 457
               L
Sbjct: 415 HGLVL 419


>gi|384252100|gb|EIE25577.1| DUF647-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 286

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 67/229 (29%), Positives = 121/229 (52%), Gaps = 2/229 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAAAINWVL 183
           LP+G+PD+VTSDY+ + +W S+Q + S + G++ + A+L  IG+GK A  P  A   + +
Sbjct: 50  LPDGYPDAVTSDYVGFQIWDSIQALCSYVRGMICSNAILTGIGVGKQATSPLVAVFQFFV 109

Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           +D  G    ++ +   G   D   K WRLFAD+  N  + L++ +P FP  F+ +     
Sbjct: 110 RDLAGMCGGVLFAFTQGSQLDAYAKQWRLFADIFNNVGYALDLASPLFPDQFLLLACLGS 169

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             R+   +   +TR+     F+   N  ++ AK  +Q   +  IG++LG+A       S 
Sbjct: 170 LARAVTGVAGGATRAALTQHFSRADNAGDISAKEGSQETATTLIGMILGMAFLQAASDSP 229

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
               A F ++T++H+Y N+++ ++++L +LN  R  ++      S   P
Sbjct: 230 VAVWAVFLLLTFLHIYANVRAVRALQLTSLNCSRLDILLKHSSASKVTP 278


>gi|321459639|gb|EFX70690.1| hypothetical protein DAPPUDRAFT_60988 [Daphnia pulex]
          Length = 264

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 79/258 (30%), Positives = 130/258 (50%), Gaps = 13/258 (5%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+G+P SV+ DYL Y +W ++Q  AS +SG L+T+A+L  +G+G   + P AA + W++
Sbjct: 6   LPQGYPSSVSDDYLEYQIWDTIQAFASSLSGSLSTRAILEGVGVGSSTSTPLAATLMWLI 65

Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTP-------AFPHHFV 235
           KDG G + +I+ +   G   D + K WR FADLL + A  LE+  P              
Sbjct: 66  KDGTGMVGRILFAWCNGTKLDADSKKWRFFADLLNDCALSLELCAPYAVAAVGGSNGTMT 125

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
            I   AG  +S   +   +TR+      A + N A+V AK  +Q  +     ++  + L 
Sbjct: 126 SILCVAGVAKSIVGVAGGATRAALTQHQARQGNLADVSAKDGSQETLVNLAALITSLWLL 185

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
             +  +       F + T +H++ NLK+ +++ + TLN  R  ++  ++  +     VKE
Sbjct: 186 PILDGATGLTWLMFFLFTLLHVFANLKAVKAVTMETLNRARYMIILKQFASTKYVASVKE 245

Query: 356 VNDEEP----LFPAFHFF 369
           VN  EP    L P  +F 
Sbjct: 246 VNRLEPVIMGLIPTGNFI 263


>gi|449018513|dbj|BAM81915.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 600

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 118/453 (26%), Positives = 200/453 (44%), Gaps = 55/453 (12%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL----G 169
           +  + L     LP  +P SVT+ Y  Y+ W  +    +  + VL T ALL + G+     
Sbjct: 148 EPLQRLLASVFLPTNYPHSVTASYAPYAFWHFLHNTLTACNSVLGTTALLVSAGVVADPS 207

Query: 170 KGA------------IPTAAA----INWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLF 212
           + A            +PTAA+     +W+LKDG+G+L ++   S +   FD + K  R+ 
Sbjct: 208 RAASTALTRDPALEHLPTAASASAVTSWILKDGVGHLVRLAFGSRYAHRFDGDLKRLRVL 267

Query: 213 ADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV 272
           AD+L +   GLE+++ +FP  F+ + + A A +  A ++  STRS  Y   A R N  +V
Sbjct: 268 ADVLWHLGTGLELVSRSFPDSFIGLASGAYALKGMAHVVFNSTRSTIYRTVATRSNIGDV 327

Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPF-ALASFSVVTWIHMYCNLKSYQSIELRT 331
            AKG+AQ + ++ +G+ +G++L      +  F   A F +V    MY  ++S + + L  
Sbjct: 328 TAKGDAQSIAAELLGMSIGLSLYRGFAENNLFWTWAIFGIVVAGQMYSRIQSMRVLVLPL 387

Query: 332 LNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIK-SANKSQLLVLSSEAKDAA 390
           LN  RA L+   Y+ S  A          P+    +   ++ +     LLVLS E     
Sbjct: 388 LNLQRAVLLVDAYVASRLAAAATASEAIRPIRKQEYVSNVRENGTMKTLLVLSPEEVAQR 447

Query: 391 VEIEHRLQLGSKLSDVV---------NNKEDAHALFSLYEDEGYIL--------TEHGGK 433
               H     S +  VV          +  D   L  ++E E +++         +   +
Sbjct: 448 ERFLHWQSHWSTVPQVVPDVRMHWFSESASDVGTLLRVFETEAFVVGRCRRQHPAQSKEQ 507

Query: 434 FCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFN 493
              V+K  A  ++ L++L  A +L    +       +T  D     RLE++ D+ +    
Sbjct: 508 VGFVMKTGAGAREALRALIAAIFLVRQPQ-------ATERDALLFSRLELTEDFFR---- 556

Query: 494 HVKSDSASVGWVTDGLIARPLPNRIRPGYVEPS 526
               ++AS GW  D +      NR+    VEPS
Sbjct: 557 ----EAASRGWQLDCVHGVGQRNRLLCNAVEPS 585


>gi|186478458|ref|NP_001117280.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332190946|gb|AEE29067.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 353

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 177/370 (47%), Gaps = 39/370 (10%)

Query: 156 VLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFA 213
           +L+TQALL AIG+G K A    A   W L+D  G L  I+ + + G + D N K WRL A
Sbjct: 2   MLSTQALLSAIGVGEKSATVIGATFQWFLRDFTGMLGGILFTFYQGSNLDSNAKMWRLVA 61

Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
           DL+ +    +++L+P FP  F+ +       RS   +   +TR+     FA + N A++ 
Sbjct: 62  DLMNDIGMLMDLLSPLFPSAFIVVVCLGSLSRSFTGVASGATRAALTQHFALQDNAADIS 121

Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTL 332
           AK  +Q  ++  +G+ LG+ LA    S  P A+  SF  +T  HMY N ++ + + L +L
Sbjct: 122 AKEGSQETMATMMGMSLGMLLAR-FTSGNPMAIWLSFLSLTVFHMYANYRAVRCLVLNSL 180

Query: 333 NPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
           N  R+S++ + ++ +GQ    ++V+  E + P +    ++S N   L             
Sbjct: 181 NFERSSILLTHFIQTGQVLSPEQVSSMEGVLPLWA-TSLRSTNSKPL------------- 226

Query: 393 IEHRLQLGSKLSDVVNNKEDAHALF-----SLYEDEGYILTEHGGKFCVVLKESALPQDM 447
              R+QLG ++S +   + D   L      S Y++  Y+L    G   V+L + + P D+
Sbjct: 227 -HKRVQLGVRVSSL--PRLDMLQLLNGVGASSYKNAKYLLAHIKGNVSVILHKDSKPADV 283

Query: 448 LKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTD 507
           LKS   A  L      A ++  STS              ++ + ++ +     S GW T+
Sbjct: 284 LKSYIHAIVL------ANLMEKSTSFYSEGEA-------WIDKHYDELLHKLRSGGWKTE 330

Query: 508 GLIARPLPNR 517
            L++  +  R
Sbjct: 331 RLLSPSITWR 340


>gi|328703155|ref|XP_001949028.2| PREDICTED: UPF0420 protein C16orf58 homolog [Acyrthosiphon pisum]
          Length = 457

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 102/396 (25%), Positives = 178/396 (44%), Gaps = 43/396 (10%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
           +  LP GFP+SV+ DY  Y +W ++Q   S IS +LAT A+L  +G+G   A   AA+I 
Sbjct: 54  EVFLPRGFPNSVSKDYTEYQIWDTIQAFCSTISNILATNAILKGVGVGDAQATALAASIT 113

Query: 181 WVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEML--TPAFPHHFVFI 237
           ++LKDG G + +I  +   G   D   K WRL AD L + A  +E+L   P      + I
Sbjct: 114 YILKDGAGMIGRITFAWCKGSELDTQCKKWRLRADALNDLAIFIELLLSIPWIRQFSLII 173

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
            + +   +S  ++   +TR+      A R N  +V AK  +Q      I  ++G+ +   
Sbjct: 174 LSFSSCAKSIVSVAGGATRAALTQHQAIRDNMGDVSAKDGSQETCINLIAFLIGLIMLPI 233

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           + + +      + VVT +H++ N K+ +S+ +   N  R  L    YL    +   +  +
Sbjct: 234 VENRILLIWMIYLVVTSLHLFANYKAVKSLNINVFNSARYDLTLKYYL----SNDTQNHD 289

Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE----DA 413
            ++P +           NK +   L  E          ++QLG+ + +++        D 
Sbjct: 290 VQKPDY----------INKREACFLGDEKL-----CSFKIQLGTSIHELLYTNTLTTWDI 334

Query: 414 HALFSLYEDEGYILT--EHGGKFCVVLKESALPQDMLKSLFQASYLYWL----------- 460
                LY+D  YIL    H     VVL ++   +++LK+ F A+ L  L           
Sbjct: 335 IDHIELYKDYLYILIVDTHRDIIRVVLDKNINTENILKAYFHANILGHLICPKNKFSLIK 394

Query: 461 ---ERNAGIVATSTSADCAPGGRLEISLDYVQREFN 493
               R+    +T+T   C     +++S ++V + F+
Sbjct: 395 LNSLRSMKYTSTNTQFCCTHSEYVQLSCEFVNKNFD 430


>gi|356523306|ref|XP_003530281.1| PREDICTED: UPF0420 protein-like [Glycine max]
          Length = 480

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 104/360 (28%), Positives = 167/360 (46%), Gaps = 41/360 (11%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           E+   F+LP GFP SV+ DYL+Y L +    V   I   L T +LL A+G+G     TAA
Sbjct: 92  EIIKDFVLPAGFPGSVSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKAVGIGSFTGTTAA 151

Query: 178 A----INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
           A    I WV KDGIG + ++ +   FG  FD +PK WR++AD + +A    ++ T  +P 
Sbjct: 152 ASAAAIRWVSKDGIGAVGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLTTQLYPA 211

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
           +F+ + +     ++ A  ++  +       FA   N  EV AK E   +V++ +G+ LGI
Sbjct: 212 YFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVVAQLVGLSLGI 271

Query: 293 ALANHIGSSMPFALASFSVVTW-----IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
            + +  G    + + S   +TW     +H++   +S   ++  T+N  RA ++   ++L 
Sbjct: 272 LILDTPGLVKSYGVIS---LTWLSMRLLHLWLRYESLSVLQFNTINIKRARILVKSHVLH 328

Query: 348 GQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVV 407
              P   + N EE +     F K K      +  L  E  D  VE  + +          
Sbjct: 329 STVPGCTDCNREENILAWSQFMKPKI-----IFGLPLEKMD-GVERSYFM---------- 372

Query: 408 NNKEDAHALFSLYEDEGYILTEHGG----KFCVVLKESALPQDMLKSLFQASYLYWLERN 463
                  AL  LY  E YIL  +      +F V  K  A    +L+S++Q+   +WL  N
Sbjct: 373 -----VEALIKLYASEKYILMVNQQTEDLRFYVSFKVGATNVSVLRSVWQS---FWLSEN 424


>gi|298709448|emb|CBJ31354.1| putative glutamate/ornithine acetyltransferase (ISS) [Ectocarpus
           siliculosus]
          Length = 789

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 83/278 (29%), Positives = 133/278 (47%), Gaps = 38/278 (13%)

Query: 112 LWDECRELFV---------------------QFMLPEGF-PDSVTSDYLNYSLWRSVQGV 149
           +WDE +E FV                     Q +L   F P+ VT DY +++ WR  Q  
Sbjct: 147 VWDESKERFVEADSLRDRSNSQRVRGAVGGLQLLLRHWFIPEDVTPDYYHFTFWRMFQRF 206

Query: 150 ASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIM-LSNFGRHFDVNPKG 208
            +    V  TQALL A+G+    +  AAA++WVLKD +G   +I+  S  GR FD + K 
Sbjct: 207 VAGTINVFGTQALLLALGIKAKRLGAAAAMSWVLKDALGKFGRILWASKMGRRFDSDAKR 266

Query: 209 WRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRN 268
           WR  + LL  A  GLE++T  FP  F+ + A A + +  + L  ++TR+  Y  FA   N
Sbjct: 267 WRFRSSLLYAAGNGLEIVTYVFPASFLVLAAMANSFKQMSMLTSSATRNTIYRSFARGEN 326

Query: 269 FAEVIAK---------------GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
             ++ AK               GEAQ  V   +G++ GI L   +G+S    +A++ +++
Sbjct: 327 IGDITAKACPPAAAMPNAKKHEGEAQIAVVDVLGMLSGIGLCTLVGTSRQTIVAAYVILS 386

Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
            + +       +++    LN  R  L+  +++ SG  P
Sbjct: 387 LVDISAIYNEIRAVCFSVLNHERTHLLVRDFVASGSDP 424


>gi|449017626|dbj|BAM81028.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 518

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 83/266 (31%), Positives = 134/266 (50%), Gaps = 7/266 (2%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPT 175
           + L    +LP  +P +V+SDY  +  W  ++    + + VL TQ++L A+G+G   A+P 
Sbjct: 92  QRLVQTLLLPADYPSAVSSDYDRFWRWNILRHTLLEAAEVLGTQSMLLALGVGGASALPL 151

Query: 176 AAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           AAA  W+LKDG+GY +K+ L +      D +PK +R + D +       EMLT AFP  F
Sbjct: 152 AAAWKWILKDGLGYFAKVALATQLAPRVDNDPKRFRFYGDFVMALGTLFEMLTLAFPPWF 211

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
           + + +     R AA +        F   F+ R N  +V +K E+Q +V +  GI  GIA+
Sbjct: 212 LALASTGNLLRKAADVATGPAYRVFLYHFSIRNNSGDVSSKSESQVVVGRLSGIGAGIAV 271

Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG--QAPP 352
           +          + ++ +++  H+Y   +S   +ELRTLN  R   V + Y  S   + P 
Sbjct: 272 SAVTSHDASALVMAYGLLSAGHLYATYQSVIRLELRTLNRTRLEYVLNVYTASSGKRVPS 331

Query: 353 VKEVNDEEP---LFPAFHFFKIKSAN 375
           + E N  E    L P  H ++I+  N
Sbjct: 332 IAEANAAEQKSFLRPWSHVYRIRFVN 357


>gi|327350378|gb|EGE79235.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 497

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 179/358 (50%), Gaps = 31/358 (8%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP G+P SV+ DYL Y ++ S+Q  +S I+G+LA++A+L  +G+G   A PT A ++ VL
Sbjct: 74  LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 133

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAA 241
            D +G ++ I+ ++  G   +   K +RL AD+L ++A   + L+PAFP H  + + A +
Sbjct: 134 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 193

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              R+   +   S+++   + FA   N  E+ AK  +Q  V   +G++ G  + + +  S
Sbjct: 194 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRV--S 251

Query: 302 MPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
            PFA  A+   +  +H+  N  + +S+ + TLN  RA++VFS     G+    K+ +  E
Sbjct: 252 TPFATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCE 311

Query: 361 PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE---HRLQLGS--KLSDVVNNKEDAHA 415
            +F      + K          +S A   +  I      L  GS  +++ + +   D   
Sbjct: 312 RIFERDGVLRWK----------ASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRR 361

Query: 416 LFSLYEDEGYIL----TEHGGKFCVVLKESALPQDMLK----SLFQASYLYWLERNAG 465
           L  L+ +E YIL    ++ GG   +VLK S  P   LK    +L  A  +  L   AG
Sbjct: 362 LLHLFREEEYILWFNSSDKGGT--IVLKNSVTPLSQLKAWSHALMVAKRIADLREGAG 417


>gi|239610850|gb|EEQ87837.1| DUF647 domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 498

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 179/358 (50%), Gaps = 31/358 (8%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP G+P SV+ DYL Y ++ S+Q  +S I+G+LA++A+L  +G+G   A PT A ++ VL
Sbjct: 75  LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 134

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAA 241
            D +G ++ I+ ++  G   +   K +RL AD+L ++A   + L+PAFP H  + + A +
Sbjct: 135 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 194

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              R+   +   S+++   + FA   N  E+ AK  +Q  V   +G++ G  + + +  S
Sbjct: 195 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRV--S 252

Query: 302 MPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
            PFA  A+   +  +H+  N  + +S+ + TLN  RA++VFS     G+    K+ +  E
Sbjct: 253 TPFATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCE 312

Query: 361 PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE---HRLQLGS--KLSDVVNNKEDAHA 415
            +F      + K          +S A   +  I      L  GS  +++ + +   D   
Sbjct: 313 RIFERDGVLRWK----------ASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRR 362

Query: 416 LFSLYEDEGYIL----TEHGGKFCVVLKESALPQDMLK----SLFQASYLYWLERNAG 465
           L  L+ +E YIL    ++ GG   +VLK S  P   LK    +L  A  +  L   AG
Sbjct: 363 LLHLFREEEYILWFNSSDKGGT--IVLKNSVTPLSQLKAWSHALMVAKRMADLREGAG 418


>gi|261206350|ref|XP_002627912.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239592971|gb|EEQ75552.1| DUF647 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 497

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 179/358 (50%), Gaps = 31/358 (8%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP G+P SV+ DYL Y ++ S+Q  +S I+G+LA++A+L  +G+G   A PT A ++ VL
Sbjct: 74  LPVGYPHSVSDDYLEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVALLHSVL 133

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAA 241
            D +G ++ I+ ++  G   +   K +RL AD+L ++A   + L+PAFP H  + + A +
Sbjct: 134 HDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSAMVFDCLSPAFPKHLRIVVLACS 193

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              R+   +   S+++   + FA   N  E+ AK  +Q  V   +G++ G  + + +  S
Sbjct: 194 SVLRALCGVAAGSSKASLSSHFARWGNLGELNAKDSSQETVISLLGMLCGSLVVSRV--S 251

Query: 302 MPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
            PFA  A+   +  +H+  N  + +S+ + TLN  RA++VFS     G+    K+ +  E
Sbjct: 252 TPFATWATLLFLLLVHLSTNYAAVRSVNMTTLNRQRANIVFSTLFEDGRVLTPKQASKCE 311

Query: 361 PLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE---HRLQLGS--KLSDVVNNKEDAHA 415
            +F      + K          +S A   +  I      L  GS  +++ + +   D   
Sbjct: 312 RIFERDGVLRWK----------ASAATLGSCRIGVSFQELLCGSRGRVNSIRDINIDIRR 361

Query: 416 LFSLYEDEGYIL----TEHGGKFCVVLKESALPQDMLK----SLFQASYLYWLERNAG 465
           L  L+ +E YIL    ++ GG   +VLK S  P   LK    +L  A  +  L   AG
Sbjct: 362 LLHLFREEEYILWFNSSDKGGT--IVLKNSVTPLSQLKAWSHALMVAKRIADLREGAG 417


>gi|324514354|gb|ADY45840.1| Unknown [Ascaris suum]
          Length = 394

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 76/249 (30%), Positives = 126/249 (50%), Gaps = 2/249 (0%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPT 175
           +  FV   LP G+P SV++DY++Y +W +VQ  AS ++  LAT+A+L   G+G + A   
Sbjct: 41  KAFFVDVFLPRGYPHSVSADYMSYQIWDTVQAFASSMTSALATEAVLRGAGVGDENASAL 100

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           AA + W+++DG+G LS+I  +       D + K WRL AD L + AF LE+L  A P  F
Sbjct: 101 AATLTWLIRDGLGMLSRIGFAWLKSAQLDADCKRWRLVADFLNDLAFSLELLAAALPSLF 160

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
             +   +   RS  A+   +TR+      A   N A+V AK  +Q  +     ++  + L
Sbjct: 161 TLLVCLSSLARSIVAVAGGATRTTVVQHQARCNNVADVSAKDGSQETLVNVTALLCSLIL 220

Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
              I          + + T +H++ N  +  ++   TLN     ++ + ++ +G    VK
Sbjct: 221 LPLISGRTVLIWMFYILFTGVHLFANYAAVTALHFETLNQKLLRIITNHFVETGDVFSVK 280

Query: 355 EVNDEEPLF 363
             N+ EPL 
Sbjct: 281 VANELEPLL 289


>gi|225452488|ref|XP_002278851.1| PREDICTED: UPF0420 protein C16orf58 homolog [Vitis vinifera]
          Length = 514

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 111/419 (26%), Positives = 189/419 (45%), Gaps = 37/419 (8%)

Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
           +N L+  C      F +P+     V+ +Y++Y  W+ +  V S    VLATQA+  AIG+
Sbjct: 102 LNNLFALCSSAVRDFFIPQ----KVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAIGI 157

Query: 169 G-KGAIPTAAAINWVLKDGIGYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEML 226
           G   ++P+AAA+NWVLKDG+G L + I  ++    FD N K  R    +L + + G+E+L
Sbjct: 158 GYSRSLPSAAALNWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELL 217

Query: 227 TPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
           TPAFP++F+ +   A   +        +T S  +  FA   N  EV AK + Q +    +
Sbjct: 218 TPAFPNYFLLLATIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCFDNL 277

Query: 287 GIMLGIAL------ANHIGSSMPFALAS-FSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
           G+ML   L         + + +PF +   FS +    +Y  LK    + L+TL   R  +
Sbjct: 278 GLMLAAVLNILCKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLK---HVHLQTLTKDRLEI 334

Query: 340 VFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL 399
           + + ++  G  P   EV+ EE +  +F + K +     ++  L  + +   + +     L
Sbjct: 335 ILNMWIQLGYVPSPAEVSKEEGI--SFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSL 392

Query: 400 GSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY--- 456
            S+  D      +A    +  E +G +L          L+E A   D++  L QA Y   
Sbjct: 393 SSE--DFYFICMEACRRLTRPEQQGILLC---------LREGASTTDVISGLLQACYVRK 441

Query: 457 --LYWLERNAGIVATSTSADCAPGGRLEISLD---YVQREFNHVKSDSASVGWVTDGLI 510
             L    R   IV    ++D       ++  D     QR+ N +  +   +GW    ++
Sbjct: 442 KLLLNRTRWKNIVEDCDTSDTFVREWFKVVEDSKQSAQRDLNLLNEEMLRLGWTVKNIL 500


>gi|219129733|ref|XP_002185036.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403531|gb|EEC43483.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 627

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 102/344 (29%), Positives = 164/344 (47%), Gaps = 27/344 (7%)

Query: 25  DDDSSGSFYSLLLF---VPSLLYCFCHLQVATAIARTATSSEDD---GNKE---YDAVWE 75
           D DS+    S L+F     + L C    Q +     +AT SE D   G +    YD + E
Sbjct: 91  DRDSALVSASDLVFNGSTSAALTCSPEEQTSRTSQSSATFSEVDVLYGRRAVLVYDPLQE 150

Query: 76  --VKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSV 133
             VK S++ + + D TK   V   A  +SL+  ++   L      L + F+     P  V
Sbjct: 151 RYVKVSEKNR-VADSTKQESVALRARRSSLARFITTKIL----PRLSLAFL-----PSGV 200

Query: 134 TSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT---AAAINWVLKDGIGYL 190
           T+DY  +  WR +Q   +    V  TQ+LL A+ +   A      +AA+NWVLKD +G +
Sbjct: 201 TNDYYRFVRWRILQRFVNANLNVFGTQSLLLALRIKSSASQLGALSAALNWVLKDALGKI 260

Query: 191 SKIM-LSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAA 249
            +++  S  GR FD + K WR  +  +  A  GLE++T  FP  F+     A   +  + 
Sbjct: 261 VRMLWASRMGRRFDSDAKRWRFRSSFVFAAGNGLEIITYVFPSLFLLWATLANCCKQISM 320

Query: 250 LIQASTRSCFYAGF--AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALA 307
           L  +STR+  Y  F   +R N  ++ AKGEAQ  +   +GI  G+ L+  +G+S+   LA
Sbjct: 321 LTSSSTRTSIYNSFRDGSRENIGDITAKGEAQIAIVDLLGIASGVTLSRTVGTSIRAVLA 380

Query: 308 SFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
            +  +  I + C     +++  R +N  R   V +++  + Q P
Sbjct: 381 VYVTLQAIEIVCVYHQLRAVTYRVMNFERMISVVADFCQARQGP 424


>gi|296087683|emb|CBI34939.3| unnamed protein product [Vitis vinifera]
          Length = 559

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 112/419 (26%), Positives = 189/419 (45%), Gaps = 37/419 (8%)

Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
           +N L+  C      F +P+     V+ +Y++Y  W+ +  V S    VLATQA+  AIG+
Sbjct: 147 LNNLFALCSSAVRDFFIPQ----KVSGNYMDYVKWKFLHRVFSSALQVLATQAMFRAIGI 202

Query: 169 G-KGAIPTAAAINWVLKDGIGYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEML 226
           G   ++P+AAA+NWVLKDG+G L + I  ++    FD N K  R    +L + + G+E+L
Sbjct: 203 GYSRSLPSAAALNWVLKDGLGRLCRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELL 262

Query: 227 TPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
           TPAFP++F+ +   A   +        +T S  +  FA   N  EV AK + Q +    +
Sbjct: 263 TPAFPNYFLLLATIANIAKQVTVACYLATGSAVHRSFAIADNLGEVSAKAQIQMVCFDNL 322

Query: 287 GIMLGIAL------ANHIGSSMPFALAS-FSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
           G+ML   L         + + +PF +   FS +    +Y  LK    + L+TL   R  +
Sbjct: 323 GLMLAAVLNILCKNNQRLLAGLPFVVYPIFSAIDLFGIYQGLK---HVHLQTLTKDRLEI 379

Query: 340 VFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL 399
           + + ++  G  P   EV+ EE +  +F + K +     ++  L  + +   + +     L
Sbjct: 380 ILNMWIQLGYVPSPAEVSKEEGI--SFPWSKGRELWPIRIGCLDPKGRLPKLSMLMMQSL 437

Query: 400 GSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY--- 456
            S+  D      +A    +  E +G +L          L+E A   D++  L QA Y   
Sbjct: 438 SSE--DFYFICMEACRRLTRPEQQGILLC---------LREGASTTDVISGLLQACYVRK 486

Query: 457 --LYWLERNAGIVATSTSADCAPG---GRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
             L    R   IV    ++D         +E S    QR+ N +  +   +GW    ++
Sbjct: 487 KLLLNRTRWKNIVEDCDTSDTFVREWFKVVEDSKQSAQRDLNLLNEEMLRLGWTVKNIL 545


>gi|255562731|ref|XP_002522371.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
           communis]
 gi|223538449|gb|EEF40055.1| mRNA, complete cds, clone: rafl24-09-C04, putative [Ricinus
           communis]
          Length = 523

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 102/341 (29%), Positives = 159/341 (46%), Gaps = 30/341 (8%)

Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGI 187
            P  VT +Y++Y  W+ +  V S    VLATQA+  A+G+G   ++P+AAA+NWVLKDG+
Sbjct: 131 IPKQVTENYMDYVKWKLLHRVFSSALQVLATQAMFRAVGIGYSRSLPSAAALNWVLKDGL 190

Query: 188 GYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
           G LS+ I  ++    FD N K  R    +L +   GLE+LTPAFP  F+ +   A   + 
Sbjct: 191 GRLSRCIYTASLASAFDTNLKRVRFSTSILFSLGIGLELLTPAFPKQFLLLATIANIAKQ 250

Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA------NHIGS 300
            +     +T S  +  FA   N  EV AK + Q +    +G+ML   L         + +
Sbjct: 251 ISLACHLATGSAIHRSFAVADNLGEVSAKAQIQTVCFDNLGLMLAALLNMLFKNNQRLLA 310

Query: 301 SMPFALAS-FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
            +PF +   FS +    +Y  LK    + L+TL   R  ++   ++  G+ P   EV+ +
Sbjct: 311 GLPFVVYPIFSAIDLFGIYQGLK---HVHLQTLTKDRLEMILDSWIALGRVPTPAEVSKK 367

Query: 360 EPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVN---NKEDAHAL 416
           E     F F + K      + +    AKD            SKLS +     + ED + +
Sbjct: 368 E----GFEFLRTKGKELWPIRIGCLNAKDQI----------SKLSVIAMQSLSSEDYYFI 413

Query: 417 FSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYL 457
                  G   T+  G   + L+E A   D++  L QA Y+
Sbjct: 414 CVENSYRGLTRTKQQG-ILLSLREGAGTADVIMGLLQACYI 453


>gi|403368796|gb|EJY84239.1| hypothetical protein OXYTRI_18022 [Oxytricha trifallax]
          Length = 504

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 122/436 (27%), Positives = 207/436 (47%), Gaps = 22/436 (5%)

Query: 94  VVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQI 153
           V   + N  +S  L  NK++    E F    LP G+P+SV   YL +S++ ++  ++   
Sbjct: 71  VTVISKNDRISWGLVKNKIYIGLFETF----LPVGYPNSVAEGYLKFSIFNNLSALSITA 126

Query: 154 SGVLATQALLYAIG-LGKGAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRL 211
              L+ Q+L  AIG     A   AAA  WVLKDGIG L  I+  S +GR+FD + K WR 
Sbjct: 127 MSFLSAQSLFVAIGSTMTQANLAAAAYTWVLKDGIGQLGAILFASRYGRNFDEDIKKWRF 186

Query: 212 FADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAE 271
            A +  N +  +E+LT  FP+HF+ + + A  G++   L+ A++R+     FA R N  +
Sbjct: 187 MAMIALNISIYIEILTLRFPNHFLALASIANIGKNICFLLSAASRASINVQFAKRNNIGD 246

Query: 272 VIAKGEAQGMVSKAIGIMLGIALANHIG-SSMPFALASFSVVTWIHMYCNLKSYQSIELR 330
           +  K  +Q   S   G+ +G+ L+  I  SS+      F ++T +++  + ++ + I+  
Sbjct: 247 ISGKSVSQFTASTLCGVGIGLMLSKLIDISSISQLYPVFMILTILNIATSYQASKVIDEV 306

Query: 331 TLNPYRASLVFSEYLLSGQAP--PVKEVNDEE----PLFPAFHFFK-IKSANKSQLLVLS 383
            LN  RA ++F+EY  S +     V+E N +E    P +   ++ K IK  +++   VL+
Sbjct: 307 YLNNQRAFIIFNEYFKSNKKNFLTVEEGNIQEIFYMPNYLNMNYCKFIKFGHRTIGKVLA 366

Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKES-- 441
           + +K         LQL     + V + +    L + Y  +    T +G KF + L  S  
Sbjct: 367 T-SKPHYYTKSVILQLEKNDRNFVYHVK----LTNWYTRKLNFFTNNGRKFEIHLNMSKN 421

Query: 442 ALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSAS 501
           A   D++K+ + A  L     N   V     A+          +++ +  FN ++     
Sbjct: 422 ATNFDIMKAYYFARLLDEQLFNFQKVQ-RIDANIIEQSIKTAEIEFQETSFNEIELKLKQ 480

Query: 502 VGWVTDGLIARPLPNR 517
            GW  D +   P  NR
Sbjct: 481 HGWSLDYIYLDPRKNR 496


>gi|453080329|gb|EMF08380.1| DUF647-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 464

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 187/367 (50%), Gaps = 37/367 (10%)

Query: 107 LSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI 166
           +S+  LW +  ++F    LP GFP SVT DYL Y ++ S+Q  +S I+ +L+++A+L  +
Sbjct: 45  ISIASLWHKTVDVF----LPVGFPQSVTDDYLEYQIYDSLQAFSSSIAAMLSSRAVLEGV 100

Query: 167 GLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLE 224
           G+G   A  T A +  +L+D +G ++ IM ++ FG   +   K +RL AD+L +  F L+
Sbjct: 101 GVGDATANATTALLLSILQDSVGRITTIMFAHRFGMSLEPECKMYRLLADILNDTGFVLD 160

Query: 225 MLTPAFPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
            L+P FP    V I + +   RS   +   S ++   A FA + N  E+ AK  +Q  V 
Sbjct: 161 CLSPIFPKPTRVVILSFSSILRSLCGVAAGSAKASLSAHFAKQGNLGELNAKDSSQETVI 220

Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
              G++ G  +   + + +    A   +++ IH+  N ++ ++++++TLN  RA+L++  
Sbjct: 221 SLAGMLFGSWVITWVKTPVATWTALILLLS-IHLETNRRAVRAVKMQTLNRQRATLLY-H 278

Query: 344 YLLSGQAPPVKEVNDEEPLFPAFHFFKIKS----------ANKSQLLVLSSEAKDAAVEI 393
           +L  G  P   EV D+E +F      + +           A  S+LL        AAV  
Sbjct: 279 HLRKGHLPTPIEVADQELIFERDGILRSREGRAIGHCSVGAPMSRLL--------AAVGQ 330

Query: 394 EHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL----TEHGGKFCV--VLKESALPQDM 447
           +H+    +  S  V++++  + +  LY D   I+    +   GK  V  VLK++A  + +
Sbjct: 331 QHK----TTKSIRVDDQQHLNRILELYSDAQCIVWCDPSSRPGKASVFIVLKKAADSKAL 386

Query: 448 LKSLFQA 454
             + +QA
Sbjct: 387 TLAWWQA 393


>gi|241595406|ref|XP_002404468.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215502351|gb|EEC11845.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 472

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 159/376 (42%), Gaps = 55/376 (14%)

Query: 111 KLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG- 169
           + W   R +F +  LP GFPD+V+ DY  Y +W S+Q  AS I+G LATQA+L  +G+G 
Sbjct: 36  RKWSSLRAVFSEIFLPSGFPDTVSRDYKEYQIWDSIQAFASSITGTLATQAVLTGVGVGD 95

Query: 170 KGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKG--------------------W 209
           + A   AA   W+LKDG G + +I+ +   R+  +                         
Sbjct: 96  QTASVLAATTTWILKDGTGMIGRILFALVSRYLGMCCATHNKYCAHSNDSTTLNHADNYC 155

Query: 210 RLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNF 269
            LFAD+L +AA  LE+++     +   +   +   +S   +   +TR+      A   N 
Sbjct: 156 SLFADILNDAAILLELMSQHLKGYITLVLCISSVAKSVVGVAGGATRAALTQHQARSNNM 215

Query: 270 AEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIEL 329
           A+V AK  +Q  +      +  + L   +  S       F++ T +H++ N ++   + +
Sbjct: 216 ADVSAKDGSQETLVNLAAFLCSLLLLRLLAGSQILVYGIFTIFTLLHIFANFRAVSCVVM 275

Query: 330 RTLNPYRASLVFSEYLL-SGQAPPVKEVNDEEP--LFPAFHFFKIKSANKSQLLVLSSEA 386
            T N  R SL+   +L  +G    V++VN +E   L    HF  +               
Sbjct: 276 ETFNRSRYSLLVKHFLEEAGDIASVEKVNKQESVWLRCGRHFVNV--------------- 320

Query: 387 KDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL-----TEHGGKFCVVLKES 441
                       LGS LS V    +D  +     +   Y+L     +       VVL E 
Sbjct: 321 -----------NLGSPLSSVAATHDDLCSRAPKEKQARYLLKVVRDSSRHYDINVVLHEQ 369

Query: 442 ALPQDMLKSLFQASYL 457
             P+D ++++F +  L
Sbjct: 370 CTPRDQMEAMFHSFLL 385


>gi|356566391|ref|XP_003551415.1| PREDICTED: uncharacterized protein LOC100783455 [Glycine max]
          Length = 525

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 79/251 (31%), Positives = 128/251 (50%), Gaps = 10/251 (3%)

Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIP 174
           C  +   F +P G    VT +Y+ Y  W+ +  V S    VLATQA+  A+G+G   ++P
Sbjct: 119 CGSVVWDFFIPRG----VTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSCSLP 174

Query: 175 TAAAINWVLKDGIGYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
           +AAA+NWVLKDG+G LS+ I  ++    FD N K  R    +L  A+ GLE+LTPAFP  
Sbjct: 175 SAAALNWVLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAFPRC 234

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG-- 291
           F+ +   A   +  +     +TRS  +  FA   N  E+ AK + Q +    +G+ML   
Sbjct: 235 FLLLATIANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLMLAAL 294

Query: 292 --IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
             + + NH      F    +     + ++   +  +++ L+TL   R  ++ S ++  G 
Sbjct: 295 VNLWIENHRRQQAGFHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIECGY 354

Query: 350 APPVKEVNDEE 360
            P   EV+++E
Sbjct: 355 VPSPAEVSEKE 365


>gi|350646547|emb|CCD58759.1| hypothetical protein Smp_142110 [Schistosoma mansoni]
          Length = 423

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 78/274 (28%), Positives = 133/274 (48%), Gaps = 19/274 (6%)

Query: 87  DFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSV 146
           D     F +  + +  +SSL  ++  W   ++ F    LP G+P SV+ DYL Y +W ++
Sbjct: 12  DEVSRVFTLEKSQSRQISSLFDLHLSWVTLKKFF----LPVGYPSSVSDDYLEYQIWDTI 67

Query: 147 QG-------------VASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSK 192
           Q               AS I+G LA+QA+L  +G+G   A   +A++ W+ KDG G + +
Sbjct: 68  QAGIFFITKISLFSAFASSITGALASQAVLIGVGVGDSSATILSASLTWMFKDGSGMIGR 127

Query: 193 IMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALI 251
           I+ + + G   D + K WR  AD+L + A  LE+++P F + F  +   A   +S   + 
Sbjct: 128 IIFAGYHGIKLDCDCKFWRFVADILNDCALFLEIISPLFNYLFTPLLCLANVLKSLVGVA 187

Query: 252 QASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSV 311
            ++TR+      A   N A+V AK  +Q  +S  +  +L   L   +  +       F  
Sbjct: 188 GSATRAAIVQHQAINNNLADVSAKDGSQETLSNLMAWLLNFILLYMVTGNQFLIWFCFIC 247

Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
            T IH+Y N ++ + ++LRT N  R  L   ++ 
Sbjct: 248 CTSIHLYSNYRAVKCLKLRTFNRTRFHLAIQQWF 281


>gi|167517110|ref|XP_001742896.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163779520|gb|EDQ93134.1| predicted protein [Monosiga brevicollis MX1]
          Length = 272

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 73/193 (37%), Positives = 109/193 (56%), Gaps = 1/193 (0%)

Query: 154 SGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLF 212
           SGVL+ QALLYA GLG G++P AAAINWVLKDG+G L  ++    FG  FD +PK  R +
Sbjct: 4   SGVLSMQALLYAAGLGAGSVPLAAAINWVLKDGLGQLGGVLYGTLFGPRFDRDPKRQRFW 63

Query: 213 ADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV 272
           + +    A   E+L P  PH F+ + +AA   ++ + L  ++TR+  +A F  R N  ++
Sbjct: 64  SLVALQGANLTELLIPLVPHLFLPLASAANIAKNISFLASSATRAQMHASFVRRANLGDI 123

Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
            AK  AQ + +  +G  LGI ++ H GS +   + +F  +    +Y N  S + + L TL
Sbjct: 124 TAKIGAQSIGASVVGTALGIFVSTHTGSDVTNIIPAFVPLALACLYFNHLSSKHVVLSTL 183

Query: 333 NPYRASLVFSEYL 345
           N  R  L    +L
Sbjct: 184 NEQRLELAMHPWL 196


>gi|225440682|ref|XP_002274737.1| PREDICTED: UPF0420 protein-like [Vitis vinifera]
          Length = 503

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 97/359 (27%), Positives = 165/359 (45%), Gaps = 45/359 (12%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--------GAIP 174
           F+LP GFP SV+ DYL Y L +    V + I   L T +LL A+G+G          A  
Sbjct: 117 FILPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKAVGVGSFSATTAAASAAA 176

Query: 175 TAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH 233
           +AAAI WV KDGIG + ++ +   FG  FD +PK WR++ADL+ +A    ++ T  +P +
Sbjct: 177 SAAAIRWVSKDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLYPAY 236

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
           F+ + +     ++ A  ++  +       FA   N  EV AK E   + ++ +G+ LGI 
Sbjct: 237 FLQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIM 296

Query: 294 LANH--IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
           +     + +S P    ++  +  +H++   +S   +  +++N  RA ++   ++L    P
Sbjct: 297 VLTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVP 356

Query: 352 PVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 411
              + N +E +     F K                         R+  G  L +++  + 
Sbjct: 357 GYVDCNRKEKILSWQRFLK------------------------PRIIFGVSLEELIGGER 392

Query: 412 ---DAHALFSLYEDEGYILT--EHGGKFCVVL--KESALPQDMLKSLFQASYLYWLERN 463
                  L  LY  E YIL   + G  F V +  K  A    +L+S++Q+   YWL+ N
Sbjct: 393 PVSKVRTLLKLYSQEKYILVVKQQGTDFEVFVSFKVGATSLSVLRSIWQS---YWLQEN 448


>gi|326428268|gb|EGD73838.1| hypothetical protein PTSG_05532 [Salpingoeca sp. ATCC 50818]
          Length = 376

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 72/244 (29%), Positives = 127/244 (52%), Gaps = 2/244 (0%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI-NW 181
            +LPEG+P SV+SDY +Y +W + Q  +S + G ++T A+L  +G+G      AAA    
Sbjct: 49  LVLPEGYPASVSSDYASYQIWDTAQAFSSSVMGTISTYAVLKGVGVGDAQATVAAATYTN 108

Query: 182 VLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAA 240
           +L+D  G L+ I  + +     D N K WRL AD++ + A  +E+++P  P  F+ +   
Sbjct: 109 LLRDATGMLAGIWFAWYSSTDMDRNAKQWRLVADVVNDMAMLVELVSPLIPAAFLPLMCM 168

Query: 241 AGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS 300
           A   R+   +   +TR+      A   N A+V AK  +Q  +     +++ +A+      
Sbjct: 169 ASFLRAIVGVAGGATRAALVRHQARLNNMADVSAKDGSQETLVNLAALVVSLAITPLAAE 228

Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
           S   A A F  + ++H+YCN ++ +++E+   N +R     + ++ + + P V   N  E
Sbjct: 229 SRVMAWAMFLPLVFMHLYCNYRAVRALEMPAFNAHRLHHACAVFMRTNRVPSVCAANQAE 288

Query: 361 PLFP 364
           PLFP
Sbjct: 289 PLFP 292


>gi|356524511|ref|XP_003530872.1| PREDICTED: uncharacterized protein LOC100783222 [Glycine max]
          Length = 513

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 77/244 (31%), Positives = 125/244 (51%), Gaps = 10/244 (4%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINW 181
           F +P G    VT +Y+ Y  W+ +  V S    VLATQA+  A+G+G   ++P+AAA+NW
Sbjct: 115 FFIPRG----VTGNYVEYVKWKLLHRVFSSALQVLATQAMFTAMGVGFSNSLPSAAALNW 170

Query: 182 VLKDGIGYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAA 240
           VLKDG+G LS+ I  ++    FD N K  R    +L  A+ GLE+LTPAFP  F+ +   
Sbjct: 171 VLKDGLGRLSRCIYTASLASAFDTNLKRVRFTTSVLFVASIGLELLTPAFPRCFLLLATI 230

Query: 241 AGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG----IALAN 296
           A   +  +     +TRS  +  FA   N  E+ AK + Q +    +G+ML     + + N
Sbjct: 231 ANISKQISLACYLATRSAVHQSFAIGDNLGEISAKAQIQTVCFDILGLMLAALVNLWIEN 290

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
           H           +     + ++   +  +++ L+TL   R  ++ S ++  G  P   EV
Sbjct: 291 HRRQQAGLHYFIYPFFAAMDLFGIYQGLKTVHLQTLTKDRLEIILSTWIECGYVPSPAEV 350

Query: 357 NDEE 360
           +++E
Sbjct: 351 SEKE 354


>gi|298708275|emb|CBJ48338.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 284

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 113/210 (53%), Gaps = 2/210 (0%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           LP G+P SV  +Y+ Y  W +VQ + S + GVL   A+L+A G+G + A   AAA+ WVL
Sbjct: 72  LPTGYPSSVRKEYMEYQTWDTVQALCSYLRGVLCLTAMLHASGVGDENASAIAAAMVWVL 131

Query: 184 KDGIGYLSKIMLSNFGR-HFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           +DG G +  ++ S     H D N K WRLFAD+  +    L+++ P F  +F  + + A 
Sbjct: 132 RDGFGMIGSLLFSYAASSHMDSNIKEWRLFADIANDVGLTLDLVAPVFRGNFAVVSSLAT 191

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             ++   +    T++   + FA ++N A++ AK  AQ       G++LG+ +A+++  + 
Sbjct: 192 VFKALCGVAAGCTKTSITSHFALKQNMADISAKENAQETAVNVCGMLLGVKVASYLSDAP 251

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTL 332
              +  F   + +H+Y N +   SI L  L
Sbjct: 252 TGRVLIFVFFSLLHVYANYRYMPSIHLLPL 281


>gi|414586414|tpg|DAA36985.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
          Length = 288

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 116/207 (56%), Gaps = 1/207 (0%)

Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
           + D+  + F+  + P G+P SV   YL Y+ +R +Q  +S +  VL+TQ+LL+A GL + 
Sbjct: 60  IHDKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL-RP 118

Query: 172 AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
               A A++W+LKDG+ +  K++ S+ G   D  PK WR+ AD+L +    LE+++P  P
Sbjct: 119 TPAQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCP 178

Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
             F+ +       +  A +   +TR   Y+ FA   N +++ AKGEA   +   +GI  G
Sbjct: 179 QLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAG 238

Query: 292 IALANHIGSSMPFALASFSVVTWIHMY 318
           I LA+ + S+    L +  +++ +H++
Sbjct: 239 IGLASTVCSTTQGKLIAGPLLSAVHIW 265


>gi|440638734|gb|ELR08653.1| hypothetical protein GMDG_03339 [Geomyces destructans 20631-21]
          Length = 714

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 72/221 (32%), Positives = 126/221 (57%), Gaps = 4/221 (1%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
           F++  LP G+P+SVT DY+ Y ++ S+Q  +S I+G++A++A+L  +G+G   A PT A 
Sbjct: 88  FMEIFLPAGYPNSVTEDYMEYQIYDSLQAFSSSIAGLIASRAVLEGVGVGDATASPTTAL 147

Query: 179 INWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VF 236
           +  VL+D  G ++ I   +  G   +   K WRL AD+  +AA  L+ L+PAFP    V 
Sbjct: 148 LLSVLQDSTGRIATIFFAARLGLSLEPECKRWRLVADIFNDAAMILDCLSPAFPKPLRVS 207

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           + +A+ A R+   +   S ++   A FA   N  E+ AK  +Q  +   +G+++G AL  
Sbjct: 208 LLSASAALRALCGVAAGSAKASLSAHFAKSGNLGELNAKDSSQETIISLLGMLVGTALIP 267

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRA 337
           ++ + +    A   +++ IH+  N  + +++ +RTLN  RA
Sbjct: 268 YLTTPLQTWTALLLLLS-IHLTTNYFAVRAVSMRTLNRQRA 307


>gi|357505287|ref|XP_003622932.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
 gi|355497947|gb|AES79150.1| hypothetical protein MTR_7g058500 [Medicago truncatula]
          Length = 590

 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 175/396 (44%), Gaps = 31/396 (7%)

Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGI 187
            P  V  +Y+++  W+ +  V S    VLATQA+  AIG+G   ++P+AAA+NWVLKDG+
Sbjct: 116 IPREVNENYVDFVKWKLLHRVFSSALQVLATQAMFTAIGVGYSSSLPSAAALNWVLKDGL 175

Query: 188 GYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
           G LSK I  ++    FD N K  R    +L   + GLE+LTP FP  F+ +   A   + 
Sbjct: 176 GRLSKCIYTASLASAFDTNLKRVRFTTSVLFVGSIGLELLTPTFPRFFLLLATIANIAKQ 235

Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG----IALANHIGSSM 302
            +     +TRS  +  FA   N  E+ AK + Q +    +G+ML     + + NH     
Sbjct: 236 ISLACYLATRSAVHQSFAKADNLGEISAKAQIQTVCFDILGLMLAALVNMWIGNHQRPQA 295

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
                 + +  ++ ++   +  + + L+TL   R  ++   ++  G  P   +V+++E  
Sbjct: 296 GLHFFVYPLFAFMDLFGIYQGLKHVHLKTLTKDRLEIILKTWIEFGYVPSPADVSEKE-- 353

Query: 363 FPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYED 422
               +F  +K  +   + +     KD   E         K    + +++       ++E 
Sbjct: 354 --VINFMGVKGKSLWPVRIGCISPKDQIPE------WSMKTIQSITDEDYYFVCMEIFEG 405

Query: 423 EGYILTEHGGKFCVVL--KESALPQDMLKSLFQASY-----LYW-LERNAGIVATSTSAD 474
                 +  G+  ++L  +E A  + ++  L QA Y     L W + R     + S   D
Sbjct: 406 -----LKRTGQLRILLSIREGAESKHIIMGLLQACYIRRTLLKWNISRGENNASNSALED 460

Query: 475 CAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
            +    +E    Y +R+   +      +GWV   ++
Sbjct: 461 WSV--IVEDGKRYAERDVTDLIEQMVGMGWVVKNIL 494


>gi|298706870|emb|CBJ25834.1| Unknown protein [Ectocarpus siliculosus]
          Length = 673

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 81/245 (33%), Positives = 132/245 (53%), Gaps = 12/245 (4%)

Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIP 174
            R   +   LP G+P SV  +Y+ Y  W  +Q ++S + GVLATQA+L  +G+G+ G+ P
Sbjct: 196 VRGGLMNTFLPAGYPGSVRPEYVRYRCWDILQDLSSHLRGVLATQAVLEGMGVGRAGSTP 255

Query: 175 TAAAINWVLKDGIGYLSKIMLSNFGR-HFDVNPKGWRLFADLLENAAFGLEMLT--PAFP 231
            AA + W+ +DG   +  ++ ++    +F VN K WRLFAD   +    LEML   PAF 
Sbjct: 256 LAATLQWMARDGASMIGGLLFTSVASANFGVNIKTWRLFADTSNDIGLALEMLAPLPAFR 315

Query: 232 H--HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
               F+ +   A   ++A  +   +T +     +A   N A++ AK  AQ  V+  +G+ 
Sbjct: 316 EGGRFLRLICIASVFKAACGVAGGATGAAITEHWAIDNNIADIGAKNGAQHTVASLLGLG 375

Query: 290 LGIALANHI----GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           L +  A  +     +S       ++V+T +H+  N  + +++ LR+LNP RASL+ +EYL
Sbjct: 376 LSVWFARAVSVGGSASGSRVWGWYAVLTLVHLLANYAAMRTLALRSLNPTRASLLVTEYL 435

Query: 346 LSGQA 350
             GQA
Sbjct: 436 --GQA 438


>gi|307111229|gb|EFN59464.1| hypothetical protein CHLNCDRAFT_138052 [Chlorella variabilis]
          Length = 657

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/339 (26%), Positives = 151/339 (44%), Gaps = 50/339 (14%)

Query: 160 QALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLE 217
           ++LL A+G+G + ++P AA INWVLKDG+G L ++ + + FG  FD + K +R    ++ 
Sbjct: 78  ESLLQAVGVGARRSLPAAATINWVLKDGLGRLGRLTVATRFGESFDSDLKRFRYATSIIY 137

Query: 218 NAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGE 277
             +  LE LTP  P HF+ + + A  G+S       +T+  F+  F  R N A++ AK +
Sbjct: 138 AVSLSLEFLTPLAPQHFLVMASLANVGKSIGLTTFIATQPAFHRSFCLRENLADISAKTQ 197

Query: 278 AQGMVSKAIGIMLGIA---LANHI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
           AQ MV   IG+   +    L  H   +     L  F ++    ++      +SI LRTLN
Sbjct: 198 AQQMVMDNIGLAAAVGLTYLCRHTEAARRALPLVMFPLLAAGDLWAIYSELRSIHLRTLN 257

Query: 334 PYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEI 393
             RA ++   +L  G+ P  ++V++EE      H        +   + L+  + D AV+ 
Sbjct: 258 KERAEIIAQHWLREGRVPSPRQVSEEERFVLPPHI-------EVGRMPLTISSLDQAVQS 310

Query: 394 EHRLQLGSKLSDVVNNKEDAHAL--------------------------------FSLYE 421
              LQL  +      ++ D + L                                     
Sbjct: 311 SQDLQLLEQ-----QDRGDHYFLTFTSPGGSSSSPSSSSSSSDGSGPAGRGRRGLLPPPL 365

Query: 422 DEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
              + L  H G   V L++ A   D+++++ QA+YL  L
Sbjct: 366 PLPWQLGRHAGSVRVCLRQDAAEPDIVQAVLQAAYLRQL 404


>gi|355736030|gb|AES11869.1| hypothetical protein [Mustela putorius furo]
          Length = 218

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 66/166 (39%), Positives = 94/166 (56%), Gaps = 2/166 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFPDSV+ DYL Y LW SVQ  AS ++G LAT A+L  IG+G   A  +AA   W++
Sbjct: 53  LPQGFPDSVSPDYLPYQLWDSVQAFASSLAGSLATHAVLLGIGVGNAEASVSAATATWLV 112

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F  I   + 
Sbjct: 113 KDSAGMLGRIIFAWWKGTKLDCNAKQWRLFADILNDVAMFLEIMAPIYPMFFTMIVCTSN 172

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
             +   ++   +TR+      A R N A+V AK  +Q  +    G+
Sbjct: 173 LAKCIVSVAGGATRAALTMHQARRNNMADVSAKDSSQETLVNLAGL 218


>gi|156059886|ref|XP_001595866.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980]
 gi|154701742|gb|EDO01481.1| hypothetical protein SS1G_03956 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 477

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 181/356 (50%), Gaps = 35/356 (9%)

Query: 125 LPEGFPDSVTSDY---------LNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIP 174
           LP G+PDSVT DY         L Y L   +   AS I+G+LA++A+L  +G+G   A  
Sbjct: 57  LPVGYPDSVTEDYTACVPCPIHLLYLLCEKL-AFASTIAGLLASRAVLQGLGVGDSTASA 115

Query: 175 TAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH- 232
           T A +  VL++  G ++ I+ ++  G   +   K +RL AD+  + A  L+ ++PAFP  
Sbjct: 116 TGAVLLNVLQESAGRIATILFAHRLGSALEPECKKYRLMADIFNDGAMILDCISPAFPKV 175

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
             VF+ +A+   +S   +   S+++   A FA + N AE+ AK  +Q  +   +G+++G 
Sbjct: 176 PRVFLLSASSVCKSLCGVAAGSSKASLSAHFAKQGNLAELNAKDASQETLISLLGMLVGT 235

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA-- 350
            + + I S     +A  S+++ IH+  N  + +S+ +RTLN  RA+LV S+++ +  A  
Sbjct: 236 FVVSRISSQTATWIALISLLS-IHLGTNYLAVRSVTMRTLNRQRANLVISDFVSNINAEK 294

Query: 351 -----PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSD 405
                P  K+++ +E +F      +  +    Q +VL        V +   L   S    
Sbjct: 295 TRFNLPTPKDISRKERIFE-----RDGAIRHIQGVVLG--YCKVGVSLREILSSISSSPT 347

Query: 406 VVNN-KEDAH----ALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLKSLFQA 454
           V  + KE+      +LF +Y+++GY++  +     F V+LKE+  P   L + F A
Sbjct: 348 VSGSYKEETSSIITSLFKIYKNQGYVMWYSRRQNTFLVILKENTAPSVQLDAWFHA 403


>gi|401421841|ref|XP_003875409.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322491646|emb|CBZ26919.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 675

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 79/272 (29%), Positives = 142/272 (52%), Gaps = 10/272 (3%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
           +  LP+G+P SVT D L ++ W  VQ +A  ++  L+T+A+L  +G+G+  A  T++ ++
Sbjct: 70  RIFLPDGYPSSVTEDLLPFAFWAHVQVLAKNVTATLSTRAVLLGVGVGESKADLTSSTLS 129

Query: 181 WVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH---FVF 236
           W+++DG   +  ++ ++   +  +   K W+L  D + ++A  LE+ +P  P     F  
Sbjct: 130 WMMQDGARMIGSVVFASVIPQGLECRAKTWQLVTDFIHDSAHLLELCSPWLPGGQAIFRV 189

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           +  AA   ++ A++    TR+ F   FA R N A++ AK   +G V   IG+ LGI +A 
Sbjct: 190 VLVAASVIKALASVCGNGTRAPFTQHFALRNNGADIAAKAATRGHVGGLIGLALGIVIAY 249

Query: 297 HI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
            +  +S    LA F++ + +H++ N +  + ++LR LN  R      +Y L      +  
Sbjct: 250 IVPATSRRLNLAVFALCSAVHIFANYRGVRGVQLRHLNAPRLEWCLEQYYLFKAKVLIAS 309

Query: 356 VNDEEPL--FPAFHFFKIKSANKSQ-LLVLSS 384
             D  PL   P       +SAN ++ LL+L S
Sbjct: 310 KKDSGPLSVMPILD-VSPRSANAAERLLILPS 340


>gi|452842814|gb|EME44750.1| hypothetical protein DOTSEDRAFT_130464 [Dothistroma septosporum
           NZE10]
          Length = 447

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 107/376 (28%), Positives = 187/376 (49%), Gaps = 22/376 (5%)

Query: 90  KDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGV 149
           + A  V S   ++  S + V  LW +  ++F    +P GFP+SVT DYL Y ++ S+Q  
Sbjct: 17  EKATYVQSPEGSTERSRVDVISLWQQAFDVF----MPVGFPNSVTHDYLEYQIYDSLQAF 72

Query: 150 ASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPK 207
           +S I+ +LA++A+L ++G+G   A PTAA +  + +   G L+ I+ ++  G       K
Sbjct: 73  SSSIASLLASRAVLSSVGVGDADASPTAAILLSIFQTSAGRLATILFAHRLGTSLGPECK 132

Query: 208 GWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAAR 266
            +RL AD+L +  F L+M++ AFP    V I + +   RS   +   S ++   A FA  
Sbjct: 133 MYRLLADILNDTGFLLDMISVAFPKPISVIILSFSSILRSLCGVAAGSAKASLSAHFARW 192

Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQS 326
            N AE+ AK  +Q  V   +G+  G  + ++I      A  +  V+  IH+  N ++ ++
Sbjct: 193 GNLAELNAKDSSQETVISLLGMWAGSVVISYITEPTA-AWTTLIVLLAIHLETNRRAVRA 251

Query: 327 IELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEA 386
           + ++TLN  RA+L+F  +L  G  P   E+  +E +F      +    N  +  +L   A
Sbjct: 252 VTMQTLNRQRATLMF-HHLQRGHVPTPVEIASQERIFERDGVLR---GNDDR--ILGHCA 305

Query: 387 KDAAVEIEHRLQLGSKLSDVV--NNKEDAHALFSLYEDEGYILTEH------GGKFCVVL 438
             A+ E   +   G++ S     +      ++ ++YE   YIL +            +VL
Sbjct: 306 VGASTERLLKAVSGAQTSTKAFPSQGPAMQSILTVYEKSPYILWQDRRSLPGRSSLFIVL 365

Query: 439 KESALPQDMLKSLFQA 454
           K+ A P +ML + +QA
Sbjct: 366 KKGATPPEMLTAWWQA 381


>gi|259484769|tpe|CBF81274.1| TPA: DUF647 domain protein (AFU_orthologue; AFUA_6G06850)
           [Aspergillus nidulans FGSC A4]
          Length = 564

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 75/235 (31%), Positives = 132/235 (56%), Gaps = 5/235 (2%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
           R++ ++  LP G+P SV+ DY  Y ++ S+Q  +S I+G+LA++A+L  +G+G   A PT
Sbjct: 66  RDVLIEVFLPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPT 125

Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           +A +  +L+D  G ++ I+ ++  G   +   K +R  AD+  + A  L+  +P  P  F
Sbjct: 126 SALLLHILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGF 185

Query: 235 --VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
             V + +AAG  R+   +   S+++   A FA   N AEV AK  +Q  V   IG+++G 
Sbjct: 186 NRVTVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAKDSSQETVISLIGMLVGS 245

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
            + +HI +S      +   +  +H+  N  + +S+++ TLN  RA++VFS  L S
Sbjct: 246 FVVSHI-TSFTATWLTLVFLLSMHLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 299


>gi|108706828|gb|ABF94623.1| expressed protein [Oryza sativa Japonica Group]
          Length = 497

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/255 (31%), Positives = 126/255 (49%), Gaps = 30/255 (11%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           +PEGFP SVT DY+ + +W ++QG+++ I  +L+TQALL AIG+G K A    A   W L
Sbjct: 148 VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEKSATVIGATFQWFL 207

Query: 184 KDGIGYLSKIM-----------------------------LSNFGRHFDVNPKGWRLFAD 214
           +D  G L  I+                             ++  G + D N K WRL AD
Sbjct: 208 RDLTGMLGGILFTFYQVSNSLANSNRQCHLIAIDSLFTDEITCLGSNLDSNAKMWRLVAD 267

Query: 215 LLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 274
            + +    +++L+P FP   + I       RS   +   +TR+     FA   N A++ A
Sbjct: 268 FMNDLGMLMDLLSPLFPSSLIVIMCLGSLSRSFTGVASGATRAALTQHFALANNAADISA 327

Query: 275 KGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNP 334
           K  +Q  ++  +G+ LG+ LA+           SF  +T  HMY N K+ QS+ L TLN 
Sbjct: 328 KEGSQETLATMLGMGLGMLLAHVTRGHALGVWVSFLSLTIFHMYANYKAVQSLSLTTLNY 387

Query: 335 YRASLVFSEYLLSGQ 349
            R+S++   ++ +G+
Sbjct: 388 ERSSILLQYFMDNGE 402


>gi|302849057|ref|XP_002956059.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
           nagariensis]
 gi|300258564|gb|EFJ42799.1| hypothetical protein VOLCADRAFT_119273 [Volvox carteri f.
           nagariensis]
          Length = 208

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 73/201 (36%), Positives = 106/201 (52%), Gaps = 11/201 (5%)

Query: 98  ASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVL 157
           A  AS S L S+    D  R+LF    LP GFPDSVT DYL Y LW     V   +S  L
Sbjct: 6   APGASPSPLSSIP---DSIRDLF----LPPGFPDSVTPDYLPYQLWSLPTHVTGHLSHSL 58

Query: 158 ATQALLYAIGLGKGAIPT---AAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFA 213
           AT +LL A+G+      T   +A+I W++KDG+G L + ++ S F   FD +P+ WRL A
Sbjct: 59  ATSSLLTAVGVSTSPAATVALSASIKWIIKDGVGALGRFIVGSRFSAEFDEDPRRWRLVA 118

Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
           +LL  A  GLE+ T  +P  F+ + +A   G++    +           FA  +N   V 
Sbjct: 119 ELLSTAGLGLEVATSLYPQSFMLLASAGKFGQALGKGMGKPVFRVIQTHFARSQNVGAVA 178

Query: 274 AKGEAQGMVSKAIGIMLGIAL 294
           AK E   + ++ +G+M  +A+
Sbjct: 179 AKEEVWEVAAQMVGLMASVAV 199


>gi|356566668|ref|XP_003551552.1| PREDICTED: uncharacterized protein LOC100787355 [Glycine max]
          Length = 1041

 Score =  119 bits (297), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 93/317 (29%), Positives = 142/317 (44%), Gaps = 46/317 (14%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG----KGAIPTAAA 178
           F+LP GFP SV+ DYL+Y L +    V   I   L T +LL A+G+G      A  +A+A
Sbjct: 654 FILPAGFPGSVSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKAVGIGSFSGTSATASASA 713

Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           I WV KDGIG + ++ L   FG  FD +PK WR++AD + +A     + T  +P +F+ +
Sbjct: 714 IRWVSKDGIGAVGRLCLGGRFGSLFDDDPKQWRMYADFIGSAGSIFYLTTQVYPDYFLPL 773

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
            +     ++ A  ++  +       FA   N  EV AK E   +V++ IG+ LGI + + 
Sbjct: 774 ASLGNLTKAVARGLKDPSFCVIQNHFAISGNLGEVAAKEEIWEVVAQLIGLALGILILD- 832

Query: 298 IGSSMPFALASFSVVT-------WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
                P  + S+ V++       ++H++   KS   ++  T+N  R        +L    
Sbjct: 833 ----TPSLVKSYGVLSLPWLGMQFLHLWLRYKSLSVLQFNTINLKR--------VLHSTV 880

Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
           P     N EE +     F K K                    I   L L  K+  V  + 
Sbjct: 881 PGCTYCNREENILTRSQFMKPK--------------------INFGLPL-EKIDGVERSH 919

Query: 411 EDAHALFSLYEDEGYIL 427
               AL  LY  E YIL
Sbjct: 920 FMVEALLKLYASEKYIL 936


>gi|317143185|ref|XP_001819302.2| hypothetical protein AOR_1_114154 [Aspergillus oryzae RIB40]
          Length = 510

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 78/260 (30%), Positives = 138/260 (53%), Gaps = 5/260 (1%)

Query: 121 VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAI 179
           V+  LP G+P SV+ DY+ Y ++ S+Q  +S I+G+L+++A+L  +G+G   A PTAA +
Sbjct: 72  VEVFLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALL 131

Query: 180 NWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF--VF 236
             +L+D  G +S I+ ++  G   +   K +RL AD+  +AA  L+ L+P  P  F  V 
Sbjct: 132 LHILQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVT 191

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           + + AG  R+   +   S+++   A F+   N AEV AK  +Q  +   IG+++G  + +
Sbjct: 192 VLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVGSFVVS 251

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
            + +S      S  ++  +H+  N  + +S+++ +LN  RA++VFS  L S     +   
Sbjct: 252 RV-TSYTATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNSDTDLDIANF 310

Query: 357 NDEEPLFPAFHFFKIKSANK 376
           N      P     K   + K
Sbjct: 311 NPTHETHPTPKHSKATQSQK 330


>gi|403362828|gb|EJY81150.1| DUF647 family protein [Oxytricha trifallax]
          Length = 507

 Score =  118 bits (296), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 78/260 (30%), Positives = 133/260 (51%), Gaps = 5/260 (1%)

Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIG- 167
           VN+L         +  LP G+P +V+S YL ++++ ++        G L+ Q+L  AIG 
Sbjct: 73  VNQLNKNIYNGLFETFLPAGYPHTVSSSYLKFTIYSNISAFMYTAMGFLSAQSLFVAIGS 132

Query: 168 LGKGAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEML 226
               A   AAA  WVLKDG+G L  I+  S +GR+FDV+ K WR  A    N +  +E+L
Sbjct: 133 TMTQANLAAAAYTWVLKDGLGQLGGILFASRYGRNFDVDIKKWRFMAIFALNISMYIEIL 192

Query: 227 TPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
           T +FP +F+ + + A  G++   L+ +++R+      A   N  ++  K  +Q   S  I
Sbjct: 193 TLSFPQYFLALASLANVGKNICFLLASASRASINLQLAKNNNIGDISGKSVSQFTASTLI 252

Query: 287 GIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           G   G+ ++  I  +  + L   F V+T +++  +  S + I+   LN  RA L+F+E+ 
Sbjct: 253 GGACGLGISKIIDITSIYQLYPVFFVLTVMNIAASYYSAKVIDELYLNNQRAFLLFNEFF 312

Query: 346 LSGQA--PPVKEVNDEEPLF 363
            S +      + +ND E  +
Sbjct: 313 KSNEKLIHTAEYINDIEAFY 332


>gi|449460628|ref|XP_004148047.1| PREDICTED: UPF0420 protein C16orf58-like [Cucumis sativus]
          Length = 523

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 71/240 (29%), Positives = 125/240 (52%), Gaps = 10/240 (4%)

Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
            P +V+  Y+ Y  W+ +  V S    V+ATQA+  AIG+G   ++ +AAA+NWVLKDG+
Sbjct: 125 IPKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAIGVGHSRSLASAAALNWVLKDGL 184

Query: 188 GYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
           G LS+ + + +    FD N K  R    +L + + G+E+LTPAFP +F+ + + A   + 
Sbjct: 185 GRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVELLTPAFPQYFLLLASIANIVKQ 244

Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL------ANHIGS 300
            +     ST S  +  FA   N  EV AK + Q +    +G++L   L         + +
Sbjct: 245 ISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFDNLGLVLAAFLNFLSKNDRRLQA 304

Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
           ++PF +  + +   + ++   +  + + L+TL   R  ++ S ++  G  P   EV++ E
Sbjct: 305 ALPFVV--YPIFAAMDLFGTYQGLKHVHLQTLTKDRLEIILSNWIEQGYVPTPAEVSERE 362


>gi|449509553|ref|XP_004163622.1| PREDICTED: LOW QUALITY PROTEIN: UPF0420 protein C16orf58-like
           [Cucumis sativus]
          Length = 523

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 71/240 (29%), Positives = 125/240 (52%), Gaps = 10/240 (4%)

Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
            P +V+  Y+ Y  W+ +  V S    V+ATQA+  AIG+G   ++ +AAA+NWVLKDG+
Sbjct: 125 IPKNVSEHYVIYVKWKLLHRVFSSALQVIATQAMFRAIGVGHSRSLASAAALNWVLKDGL 184

Query: 188 GYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
           G LS+ + + +    FD N K  R    +L + + G+E+LTPAFP +F+ + + A   + 
Sbjct: 185 GRLSRCLFTASIASAFDTNLKRVRFSTAVLFSLSIGVELLTPAFPQYFLLLASIANIVKQ 244

Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL------ANHIGS 300
            +     ST S  +  FA   N  EV AK + Q +    +G++L   L         + +
Sbjct: 245 ISLGCYLSTASAVHRSFAVADNLGEVSAKAQIQSVCFDNLGLVLAAFLNFLSKNDRRLQA 304

Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
           ++PF +  + +   + ++   +  + + L+TL   R  ++ S ++  G  P   EV++ E
Sbjct: 305 ALPFVV--YPIFAAMDLFGTYQGLKHVHLQTLTKDRLEIILSNWIEQGYVPTPAEVSERE 362


>gi|225683629|gb|EEH21913.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
           Pb03]
          Length = 509

 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 106/398 (26%), Positives = 187/398 (46%), Gaps = 37/398 (9%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLW---------RSVQGVASQISGVLATQALLYAIGLG 169
           L + F LP G+P SV+ DY++              S+Q  +S I+G+LA++A+L  +G+G
Sbjct: 68  LLIVF-LPAGYPHSVSDDYMDLLFLALENDRKYKDSLQAFSSSIAGLLASRAVLQGVGVG 126

Query: 170 KG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLT 227
              A PT A ++ VL++ IG ++ I+ ++  G   +   K +R  AD+L ++A  ++ L+
Sbjct: 127 DASASPTVALLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLS 186

Query: 228 PAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
           PAFP H  V +   +   R+   +   S+++   + FA   N  E+ AK  +Q  V   +
Sbjct: 187 PAFPKHLRVVLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNAKDSSQETVISLM 246

Query: 287 GIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
           G++ G  + +HI + +   LA    +  IH+  N  + +++ + TLN  RA++VFS    
Sbjct: 247 GMLCGSLVVSHISTPLTTGLA-LIFLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFE 305

Query: 347 SGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDV 406
            G+A   K+ +  E +F      + K+++ +        +          L  GS  S  
Sbjct: 306 EGRALTPKQTSKCERIFERDGILRWKASSTTLGFCRIGNSF-------QELLCGSNGSYR 358

Query: 407 VNNKEDA----HALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLK----SLFQASY 456
            N+  DA      L  L+E E YIL       +  +VLK    P   LK    +L  A +
Sbjct: 359 ANSIRDAPIDIPRLLRLFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAWSHALLVAKH 418

Query: 457 LYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNH 494
           L     +A         + +P  + E+ L+   R+ N+
Sbjct: 419 LTKTTEDA------EKNNNSPIKKGEVKLERQDRKKNY 450


>gi|340967040|gb|EGS22547.1| diphosphomevalonate decarboxylase-like protein [Chaetomium
           thermophilum var. thermophilum DSM 1495]
          Length = 425

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 78/250 (31%), Positives = 131/250 (52%), Gaps = 20/250 (8%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           LP GFP +VT DYL+Y L+ S+Q  AS I+ +LA +A+L  +G+G   + PT A I  + 
Sbjct: 45  LPAGFPHTVTEDYLSYQLYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 104

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH-FVFIGAAA 241
            D I  ++ I+ ++  GR  +   K +R  AD+  ++A  L++LTPA P+   + +  +A
Sbjct: 105 GDTISRIATILFAHRMGRAIEPECKFYRFLADIFNDSAQFLDLLTPALPYFPKLSVIVSA 164

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
           G  RS   +   ++++   A FA   N AE+ AK  +Q  V   +G++ G         S
Sbjct: 165 GVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLAG---------S 215

Query: 302 MPFALASFSVVTW--------IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           +   L     V W        +H+  N ++ +S+++R+LN  RA++VF E+L  G     
Sbjct: 216 LVVRLVEDKQVVWFLMILLAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGTVLTP 275

Query: 354 KEVNDEEPLF 363
            +V   E + 
Sbjct: 276 DQVAKRESIL 285


>gi|295659580|ref|XP_002790348.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226281800|gb|EEH37366.1| DUF647 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 485

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 100/385 (25%), Positives = 183/385 (47%), Gaps = 35/385 (9%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
           L + F LP G+P SV+ DY+ Y ++ S+Q  +S I+G+LA++A+L  +G+G   A PT A
Sbjct: 68  LLIVF-LPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVA 126

Query: 178 AINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-V 235
            ++ VL++ +G ++ I+ ++  G   +   K +RL AD+L ++A  +E L+PAFP H  V
Sbjct: 127 LLHSVLQESMGRIATILFAHRLGTSLEPECKLYRLAADVLNDSAMVVECLSPAFPKHLRV 186

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
            +   +   R+   +   S+++   + FA   N  E+ A G                 + 
Sbjct: 187 VLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGS---------------LVV 231

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
           +HI + +   L +   +  IH+  N  + +++ + TLN  RA++VFS     G+A   K+
Sbjct: 232 SHISTPLTTGL-TLIFLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTPKQ 290

Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
            +  E +F      + K+++ +           A  E+     +  + + + +   D   
Sbjct: 291 TSKCERIFERDGILRWKASSTTLGFC---RIGIAFQELLRSSNVSCRANSIRDIPIDIPR 347

Query: 416 LFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLK----SLFQASYLYWLERNAGIVAT 469
           L  L+E E YIL       K  +VLK +  P   LK    +L  A +L     +A     
Sbjct: 348 LLRLFEKEEYILWFNPVSKKGTIVLKNNVKPVSQLKAWSHALLVAKHLMKTTEDA----- 402

Query: 470 STSADCAPGGRLEISLDYVQREFNH 494
               + +P  + E+ L+   R+ N+
Sbjct: 403 -EKNNNSPITKGEVKLERQNRKKNY 426


>gi|320164887|gb|EFW41786.1| GTP-binding protein [Capsaspora owczarzaki ATCC 30864]
          Length = 436

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 74/231 (32%), Positives = 120/231 (51%), Gaps = 3/231 (1%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTA 176
           +L     LP+G+PDSV++DY +Y  W ++Q   S I+G LAT A+L  +G+G + A  TA
Sbjct: 46  QLLKAVFLPDGYPDSVSADYASYQRWDTLQAFCSSINGSLATLAVLKGVGVGDETATATA 105

Query: 177 AAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
           AA++W+L+DG+G L +I  +   G   D N K WRL AD+  + A  +E+L+P     F+
Sbjct: 106 AAVSWMLRDGVGMLGRIFFAWQKGPALDANAKRWRLVADIFNDLAMFVELLSPLVGEWFL 165

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
                    RS   +   +TR+      A R N  +V AK  +Q  +     +++ + L 
Sbjct: 166 AFACLGSVLRSIVGVAGGATRAAITQHQARRNNHGDVSAKDGSQETLVNLAALLVSLWLL 225

Query: 296 NHIGSSMP-FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
             +GS  P      F   T +H++ N +  ++    T N  R  +V S+Y+
Sbjct: 226 PALGSDHPTLVWLLFCSFTSLHLFANYRGVRATVFDTFNRERLFIVASDYM 276


>gi|290999547|ref|XP_002682341.1| predicted protein [Naegleria gruberi]
 gi|284095968|gb|EFC49597.1| predicted protein [Naegleria gruberi]
          Length = 261

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/255 (30%), Positives = 129/255 (50%), Gaps = 15/255 (5%)

Query: 121 VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIG------LGKGAIP 174
           +   LP+ +P SV+S+Y+NY  W+++Q  A   SGVL  Q+LL+A G      LG     
Sbjct: 2   ISVFLPKNYPLSVSSNYINYCKWQALQYTAGSFSGVLTMQSLLHAAGITSQLALGGNEYF 61

Query: 175 TA---AAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
            A    A+ WV+KDG+G L  I+  S    +FD +PK WR+  +    A+  LE+ TP  
Sbjct: 62  LAFLGGALAWVIKDGLGQLGGILFASKVNTNFDADPKFWRMGGEYALVASALLEVTTPIT 121

Query: 231 -PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
            P  F+   + A  G++ +    ++TR+     FA   N A+V AK  +Q +    IG  
Sbjct: 122 GPSWFIAQASLANIGKNVSCFAASATRAAMNQSFAKSDNLADVTAKATSQALACSLIGTT 181

Query: 290 LGIALANHIGSSMPFALAS----FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           LGI +++ +     F  A     F V++ I ++   K+  +++L+ LN  R  +  S+YL
Sbjct: 182 LGIFVSSALSLGTQFVFAKVFPIFCVLSVIQLFSLYKAVSNVKLKVLNKQRFLIACSQYL 241

Query: 346 LSGQAPPVKEVNDEE 360
                    +V++ E
Sbjct: 242 KDRAVLDPSKVSERE 256


>gi|281211285|gb|EFA85450.1| DUF647 family protein [Polysphondylium pallidum PN500]
          Length = 427

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 60/174 (34%), Positives = 97/174 (55%), Gaps = 2/174 (1%)

Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 171
           L ++  E F    LP G+P+SV+ DY +Y  W  VQ     ++ +L+T ALL ++G+G  
Sbjct: 98  LSEKLYETFKTLFLPSGYPESVSQDYASYQRWIFVQNTLGSVTYMLSTHALLTSVGVGLS 157

Query: 172 A-IPTAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 229
           A +P +AAI+WVLKDG+G  + ++  S +    D + K ++   D L N    LE  TP 
Sbjct: 158 ASLPFSAAISWVLKDGLGASALVLFASKYSTSLDFDLKRFKFRGDFLHNFGVFLETCTPF 217

Query: 230 FPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
            P +F+ + + +   +  A LI  STR+  +  FA + N  ++ AK ++Q M S
Sbjct: 218 LPGYFLLMASISNLAKGIAGLIYGSTRASLHKSFALKENIGDITAKYQSQAMSS 271


>gi|328773288|gb|EGF83325.1| hypothetical protein BATDEDRAFT_85860 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 434

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 97/352 (27%), Positives = 161/352 (45%), Gaps = 36/352 (10%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           + F    LP G+P SV + Y    +W   + VA     VL  QA+L +IG    A  TAA
Sbjct: 34  QWFRNMFLPVGYPSSVHACYKKVHMWLFAENVAGSAISVLTAQAMLTSIGASTAAHETAA 93

Query: 178 ---AINWVLKDGIGYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PH 232
              A++WVLKDGIG L K +M+  F   FD +PK W+L+ +    +   +++ T    PH
Sbjct: 94  LAIAVDWVLKDGIGELGKMLMIQRFAHQFDTHPKRWKLYGEACSISGALMQLATCITNPH 153

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARR--NFAEVIAKGEAQGMVSKAIGIML 290
           HF+F  +     RS    I A+T + F    A     N  +++AK ++Q  ++  +G++ 
Sbjct: 154 HFLFFASIGVGLRSMHYSIWAATHTTFTRNMATHNGVNVGDIVAKADSQLSLAHLLGMVS 213

Query: 291 GIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
           GI +     S  P AL A FSV++ I + C     +      L+ YR  L+  E++    
Sbjct: 214 GIGMLA--VSFQPAALFAWFSVLSAIQIVCTGLLLREARFEVLDQYRLVLIPREFMSEKN 271

Query: 350 APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN 409
           A  +  ++        F  +  +  +K Q++                L +G  + D   +
Sbjct: 272 ALRIPTLDQ----VVRFENWLNEGLSKGQVVT--------------DLSIGVTVDDAFKH 313

Query: 410 KEDAHALFSLYEDEGYIL-------TEHGGKFCVVLKESALPQDMLKSLFQA 454
              +HAL  ++++  Y+L       T    K  VVL +S  P  ++++ F A
Sbjct: 314 SRLSHAL-QIFKNGKYLLGFKLADSTHSQIKVYVVLHQSIQPVHVIQAAFHA 364


>gi|255082448|ref|XP_002504210.1| predicted protein [Micromonas sp. RCC299]
 gi|226519478|gb|ACO65468.1| predicted protein [Micromonas sp. RCC299]
          Length = 226

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 69/226 (30%), Positives = 113/226 (50%), Gaps = 4/226 (1%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI-NW 181
             LPE +P+SV+ DY  +  W + QG++S + G L T+ALL  +G+G      A+A   +
Sbjct: 1   MFLPEDYPNSVSDDYAAFQAWDTAQGLSSYVRGSLTTKALLEGVGVGATGATAASAAAQF 60

Query: 182 VLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH--FVFIG 238
           + +D  G +  ++ +   GRH D   K WRLFADL  N    +E+  P       F+ + 
Sbjct: 61  IARDMCGMVGGLVFTLAKGRHLDAEAKQWRLFADLANNVGMAMELAAPLANDRWWFLALA 120

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
            A    RS       +TR+     FA  RN A++ AK  +Q      +G+ LG+A+    
Sbjct: 121 CAGSVARSLCGCAAGATRAALTQHFARARNAADIAAKEGSQETAVTLVGMSLGVAVTTST 180

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEY 344
                +  A F  +T  H+YCN+++ +S+ + +LN  R   + + Y
Sbjct: 181 DGKTAWQWAVFVALTVAHVYCNVRAVRSLVIDSLNRERVVRLLARY 226


>gi|238488016|ref|XP_002375246.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
 gi|220700125|gb|EED56464.1| DUF647 domain protein [Aspergillus flavus NRRL3357]
          Length = 513

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 82/268 (30%), Positives = 138/268 (51%), Gaps = 18/268 (6%)

Query: 121 VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAI 179
           V+  LP G+P SV+ DY+ Y ++ S+Q  +S I+G+L+++A+L  +G+G   A PTAA +
Sbjct: 72  VEVFLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALL 131

Query: 180 NWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF--VF 236
             +L+D  G +S I+ ++  G   +   K +RL AD+  +AA  L+ L+P  P  F  V 
Sbjct: 132 LHILQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVT 191

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           + + AG  R+   +   S+++   A F+   N AEV AK  +Q  +   IG+     L  
Sbjct: 192 VLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGM-----LVR 246

Query: 297 HIGSSMPFALASFSVVTWI--------HMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
            +GS +   + S++  TWI        H+  N  + +S+++ +LN  RA++VFS  L S 
Sbjct: 247 FVGSFVVSRVTSYT-ATWISLVMLLTMHLSLNYAAVRSVQMTSLNRQRANIVFSTLLNSD 305

Query: 349 QAPPVKEVNDEEPLFPAFHFFKIKSANK 376
               +   N      P     K   + K
Sbjct: 306 TDLDIANFNPTHETHPTPKHSKATQSQK 333


>gi|392564112|gb|EIW57290.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 516

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 101/383 (26%), Positives = 172/383 (44%), Gaps = 32/383 (8%)

Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
            N L       F Q +LP  +P SV   YL + + +  +   + I+ VL  QALL ++G+
Sbjct: 100 TNSLTGRISSWFGQMVLPTNYPHSVHRSYLPFHILQFFETTVATITSVLCNQALLTSVGV 159

Query: 169 -GKGAIPTAAAINWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLE-- 224
             +G++  A A+ W++KDG G ++K+  +  F  +FD +PK +  F +L+     GL+  
Sbjct: 160 SAEGSVFGAVAVQWIIKDGAGEVAKLFFIRRFSPYFDSHPKTFTFFGELMGCLGSGLQIA 219

Query: 225 -MLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVS 283
            +L    P +F+   A     +     I  +T   F   F+ + N  +V AK E+Q  ++
Sbjct: 220 TLLIAPSPGNFLLCAAGGNIFKIVGNAIWFTTHIKFMRYFSEQGNTGDVAAKDESQSSIA 279

Query: 284 KAIGIMLGIALANHIGSSMPFALASFSVVTWIH--MYCNLKSYQSIELRTLNPYRASLVF 341
           + +G   GI+L   +  + P+  A F+    +H  M   +    + EL TL   R S + 
Sbjct: 280 QLVGYASGISLLA-VSHTAPYLYAIFAFAVPVHLAMTAYMMRVATFELLTLP--RVSCLA 336

Query: 342 SEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH-RLQLG 400
            EY+  GQ   + E++          F++ KS     L     +  D+  E+E  R Q+ 
Sbjct: 337 QEYVTRGQVSSLGELDAAHRTGLFGEFYRDKSDRWLTLAPRVGDVLDSGTELERVRWQIC 396

Query: 401 SKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
           +                 ++ED+ Y+L   G    V     A   DML +++ A+    L
Sbjct: 397 A----------------GVFEDDRYLLFPRGRNISVFFHPDAANDDMLCAIYHAA----L 436

Query: 461 ERNAGIVATSTSADCAPG-GRLE 482
            R+A I   ++S D  P  G LE
Sbjct: 437 VRDALINREASSTDVPPAEGLLE 459


>gi|297602390|ref|NP_001052390.2| Os04g0290800 [Oryza sativa Japonica Group]
 gi|255675293|dbj|BAF14304.2| Os04g0290800, partial [Oryza sativa Japonica Group]
          Length = 107

 Score =  116 bits (290), Expect = 4e-23,   Method: Composition-based stats.
 Identities = 57/72 (79%), Positives = 63/72 (87%)

Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELR 330
           +VIAKGEAQGMVSK +GIMLGIALAN IGSS+  AL SF+ VT +HMYCNLKSYQSI+LR
Sbjct: 29  QVIAKGEAQGMVSKFLGIMLGIALANRIGSSVSLALISFAGVTAVHMYCNLKSYQSIQLR 88

Query: 331 TLNPYRASLVFS 342
           TLNPYRAS V S
Sbjct: 89  TLNPYRASNVLS 100


>gi|320593995|gb|EFX06398.1| duf647 domain containing protein [Grosmannia clavigera kw1407]
          Length = 458

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 77/250 (30%), Positives = 136/250 (54%), Gaps = 5/250 (2%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PT 175
           + +F  F LP G+P SVT DYL Y  + S+Q  +S I+ +LA +A+L  +G+G  +  PT
Sbjct: 41  KAIFYAF-LPAGYPQSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPT 99

Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP-HH 233
           AA +  +++D    ++ I+ ++  G+  +   K +R  ADL  ++A  L++LTP  P + 
Sbjct: 100 AALVLKIIQDTFSRMATILFAHRMGQAIEPECKTYRFMADLFNDSALMLDLLTPVLPLYP 159

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
            V I  A+   RS   +  +++++   A FA   N AE+ AK  +Q  V   +G++ G A
Sbjct: 160 KVCIMVASSILRSLCGIAASASKASLSAHFARTGNLAELNAKEASQETVISLLGMLAGTA 219

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           +  H+       L    ++  +H+  N +  + +++ TLN  RA++VF EYL +G+    
Sbjct: 220 VV-HVVQDKTAVLYWMVMLLMVHLVMNYRGVRCVKMLTLNRQRATIVFREYLETGKILSP 278

Query: 354 KEVNDEEPLF 363
            EV + E + 
Sbjct: 279 GEVAERESVL 288


>gi|327294507|ref|XP_003231949.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
 gi|326465894|gb|EGD91347.1| hypothetical protein TERG_07567 [Trichophyton rubrum CBS 118892]
          Length = 483

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 73/247 (29%), Positives = 138/247 (55%), Gaps = 4/247 (1%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAA 178
            +   LP G+P SVT DY+ Y ++ S+Q  +S I+G+L+++A+L  +G+G   A PTAA 
Sbjct: 58  ILNVFLPAGYPHSVTDDYIEYQIYDSLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAAL 117

Query: 179 INWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VF 236
           +  VL++ +G ++ I+ ++  G   +   K +RL AD+L ++A  L+ L+P FP    V 
Sbjct: 118 LLNVLQESMGRIATILFAHRLGTSLEPECKLYRLAADILNDSAMVLDCLSPIFPKPVRVG 177

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           + + +   R+   +   S+++   A FA   N  E+ AK  +Q  V    G+++G  + +
Sbjct: 178 LLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISLAGMLVGSVVVS 237

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
           +I S +   +A   ++  +H+  N  + +++++ TLN  RA++VFS      +     E 
Sbjct: 238 YITSPLETWIA-LIILLIVHLGTNHAAVRAVKMTTLNRQRANIVFSYLFEDDRVLTPAEA 296

Query: 357 NDEEPLF 363
           ++EE +F
Sbjct: 297 SEEERIF 303


>gi|302755442|ref|XP_002961145.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
 gi|300172084|gb|EFJ38684.1| hypothetical protein SELMODRAFT_402795 [Selaginella moellendorffii]
          Length = 343

 Score =  115 bits (288), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 79/242 (32%), Positives = 124/242 (51%), Gaps = 30/242 (12%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA-INWVL 183
           LPEG+P SV+SDYL +  W ++QG+++ I  +L+TQALL  IG+G        A   W L
Sbjct: 55  LPEGYPSSVSSDYLAFQTWDTLQGLSTYIRSMLSTQALLSGIGVGAATATAIGATFQWFL 114

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           +D  G +  I+ + + G + D   K WRL AD + + A                 G A G
Sbjct: 115 RDLTGMVGGIVFTLYQGSNLDNRAKQWRLAADFMNDIA-----------------GVAGG 157

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
           A R  AAL Q          FA ++N A+V AK  +Q   +  +G+++G+ LA    +++
Sbjct: 158 ATR--AALTQH---------FARKQNAADVSAKEGSQETAATLVGMIVGMFLARLTANNI 206

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPL 362
                SF  +T  HMY N K+  ++ L ++N  R ++V   +L  G+AP  +E + +E  
Sbjct: 207 FLMWTSFMSLTAFHMYANYKAVCALCLTSINAERMAIVLQSFLKDGKAPSPEEASAQESA 266

Query: 363 FP 364
            P
Sbjct: 267 LP 268


>gi|116205700|ref|XP_001228659.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
 gi|88182740|gb|EAQ90208.1| hypothetical protein CHGG_02143 [Chaetomium globosum CBS 148.51]
          Length = 448

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 76/243 (31%), Positives = 131/243 (53%), Gaps = 6/243 (2%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           LP G+P SVT DYL Y  + S+Q  AS I+ +LA +A+L  +G+G   + PT A I  + 
Sbjct: 45  LPAGYPHSVTDDYLPYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 104

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
            D I  ++ I+ ++  G+  +   K +R  AD+  ++A  L++LTPA P +F  +G    
Sbjct: 105 GDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALP-YFPKLGVIVS 163

Query: 243 AG--RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS 300
           AG  RS   +   ++++   A FA   N AE+ AK  +Q  V   +G+++G +L   +  
Sbjct: 164 AGVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVG-SLVVKMVE 222

Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
                     V+  +H+  N  + +++++R+LN  RA+LVF E+L  G     ++V+  E
Sbjct: 223 DKQVVWMLMVVLAGVHLAMNYHAVRAVKMRSLNRQRATLVFREWLDRGTVLTPEQVSQRE 282

Query: 361 PLF 363
            + 
Sbjct: 283 SIL 285


>gi|393217163|gb|EJD02652.1| DUF647-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 455

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 114/410 (27%), Positives = 184/410 (44%), Gaps = 35/410 (8%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAIN 180
           +  LP GFPDSVT DY+ Y ++ S+Q  +S I+G+L+++A+L   G+G + A  T A + 
Sbjct: 48  EIFLPSGFPDSVTPDYIGYQIFDSLQAFSSSIAGLLSSRAVLEGFGVGNENASSTHAVLL 107

Query: 181 WVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG- 238
            VL+D  G L+ I+ +  FG       K +RL AD+  ++AF L+ L+PA P        
Sbjct: 108 TVLQDATGRLATILFAWKFGPALAPEAKMYRLAADIFNDSAFVLDCLSPALPTQLRIAAL 167

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
             AGA R+   +     ++     FA   N  E+ AK  +Q  V    G++ G  +  HI
Sbjct: 168 CLAGALRAICGVCGGGAKAALSVHFARTGNVGELNAKDASQETVIGLFGMLCGSFVMAHI 227

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL-LSGQAPPVKEVN 357
            S     +A   ++T  H+  N  + +++ L TL+  R ++ +  Y   S       EV 
Sbjct: 228 TSRRTTWIALIFLLT-AHLVTNYLAVRAVTLTTLSRQRTNIAYGVYCEQSNNVLTPSEVA 286

Query: 358 DEEPLFPAFHFFKIKSANKSQL---LVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH 414
             E        F++  A +  L   L+ +     +  E+  R + GS  S         +
Sbjct: 287 RRE------RIFEVPGAIRDSLQGFLLGTCTICTSPEELLRRSRQGSMAS-------WKN 333

Query: 415 ALFSLYEDEGYIL----TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAG--IVA 468
           AL + +E E Y+L     +   K  +  K  A P+D LK+   A  L    R AG  +  
Sbjct: 334 ALVA-FEHEAYVLLGIRADFPCKILICFKHGAKPEDYLKAWVNAVELC---RIAGSRLQV 389

Query: 469 TSTSADCAPG--GRLEISLDYVQREFNHVKSDSASVGW--VTDGLIARPL 514
              S +  PG    L+ SL+    +F      +   GW   TD +   P+
Sbjct: 390 GQESNNVTPGSFSALQKSLECTNTKFLDFLDKANKAGWDLKTDAICTMPV 439


>gi|157869108|ref|XP_001683106.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68223989|emb|CAJ04889.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 678

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 77/272 (28%), Positives = 141/272 (51%), Gaps = 10/272 (3%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
           + +LP+G+P SVT D L ++LW   Q +A  ++  L+T+A+L  +G+G+  A  T+A ++
Sbjct: 70  RILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADLTSATLS 129

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP---HHFVF 236
           W+++DG   +  ++ +    +  +   K W+L AD   + A  LE+  P  P     F  
Sbjct: 130 WMMQDGTRMIGSVVFAPLIPQGLECRAKTWQLVADFTTDIANLLELCAPWLPGGQTTFRV 189

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           +  AA   ++  ++ +  TR+ F   FA R N A++ AK   +  V   IG++LG+A+A 
Sbjct: 190 VLVAASVVKALGSVCRNCTRASFTRHFALRNNAADIAAKASTRSSVGSFIGLVLGVAVAY 249

Query: 297 HI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
            +  +S    L  F++ + +H++ + +S + ++L+ LN  R      +Y L      +  
Sbjct: 250 IVPATSRSLNLVVFALGSVLHIFASYRSVRGVQLKHLNAPRLEWCLVQYSLFEAKVLIAS 309

Query: 356 VNDEEPL--FPAFHFFKIKSANKSQ-LLVLSS 384
             D  PL   P       +SAN ++ LL+L S
Sbjct: 310 KKDSGPLSVMPILD-VSPRSANAAERLLILPS 340


>gi|400593658|gb|EJP61584.1| DUF647 domain protein [Beauveria bassiana ARSEF 2860]
          Length = 467

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 91/361 (25%), Positives = 169/361 (46%), Gaps = 31/361 (8%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
           E +++     LP GFP SV+SDYL Y  + S+Q   S I+ +LA +ALL  +G+G   + 
Sbjct: 66  EWKKILSDSFLPVGFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSS 125

Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-P 231
            T A +  +L+D    ++ I+ ++ FG   + + K +R  ADL  + AF LE+ +P   P
Sbjct: 126 ATFALLLTILRDATSRVATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLSP 185

Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
              V + +   A R+   +   ++++     FA   N AE+ AK  +Q      IG+++G
Sbjct: 186 LAKVVVLSVGEALRALCGVSAGASKAALSMHFAKHDNLAELNAKEASQETAVGLIGLLVG 245

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
             +   +       ++   ++ + H++ N    +S+++ TLN  RA+++F EYL +G+  
Sbjct: 246 TIVVKVVEDHRS-VVSLMILLVFGHLWMNYLGVRSVQMATLNRQRATILFQEYLRTGKVL 304

Query: 352 PVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 411
              EV   E       +  + S+      V+S+       +IE     G  +S  V    
Sbjct: 305 SPAEVAKRE-------YIIVWSS------VVSNRHGQEVAKIEMAKSFGHAMSSSVGR-- 349

Query: 412 DAHALFSLYEDEGYILT------EHGGKFCVVLKESALPQDMLKSLFQASYLYW-LERNA 464
                  + + + Y +T            C++L +  LP+  + + F+A  + W ++R  
Sbjct: 350 -----VKILDGDKYSITATPAVPHKKTTLCILLWDDVLPKQAIMAWFEAVEMAWVMDRQT 404

Query: 465 G 465
           G
Sbjct: 405 G 405


>gi|226292974|gb|EEH48394.1| DUF647 domain-containing protein [Paracoccidioides brasiliensis
           Pb18]
          Length = 495

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 103/389 (26%), Positives = 181/389 (46%), Gaps = 43/389 (11%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
           L + F LP G+P SV+ DY+ Y ++ S+Q  +S I+G+LA++A+L  +G+G   A PT A
Sbjct: 68  LLIVF-LPAGYPHSVSDDYMEYQIYDSLQAFSSSIAGLLASRAVLQGVGVGDASASPTVA 126

Query: 178 AINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-V 235
            ++ VL++ IG ++ I+ ++  G   +   K +R  AD+L ++A  ++ L+PAFP H  V
Sbjct: 127 LLHSVLQESIGRIATILFAHRLGTSLEPECKLYRFAADVLNDSAMVVDCLSPAFPKHLRV 186

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
            +   +   R+   +   S+++   + FA   N  E+ A G                 + 
Sbjct: 187 VLLTFSSILRALCGVAAGSSKASLSSHFAKWGNLGELNACGS---------------LVV 231

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
           +HI + +   LA    +  IH+  N  + +++ + TLN  RA++VFS     G+A   K+
Sbjct: 232 SHISTPLTTGLA-LIFLLLIHLSTNYAAVRAVNMTTLNRQRANIVFSTLFEEGRALTPKQ 290

Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA-- 413
            +  E +F      + K+++ +        +          L  GS  S   N+  DA  
Sbjct: 291 TSKCERIFERDGILRWKASSTTLGFCRIGNSF-------QELLCGSNGSYRANSIRDAPI 343

Query: 414 --HALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLK----SLFQASYLYWLERNAG 465
               L  L+E E YIL       +  +VLK    P   LK    +L  A +L     +A 
Sbjct: 344 DIPRLLRLFEKEEYILWFNPVSRRGTIVLKNDVRPVSQLKAWSHALLVAKHLTKTTEDA- 402

Query: 466 IVATSTSADCAPGGRLEISLDYVQREFNH 494
                   + +P  + E+ L+   R+ N+
Sbjct: 403 -----EKNNNSPIKKGEVKLERQDRKKNY 426


>gi|413956563|gb|AFW89212.1| hypothetical protein ZEAMMB73_413911 [Zea mays]
          Length = 263

 Score =  112 bits (281), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 82/262 (31%), Positives = 130/262 (49%), Gaps = 9/262 (3%)

Query: 59  ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
           A+  E +    +  V E  GS  + L    ++ A + ASAS+ +          W+    
Sbjct: 3   ASHPEGEATAGWLTVEEWSGSSASAL----SRTAVLTASASSLTAHRF---GSRWERIGS 55

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
             +   +PEGFP SVT DY+ + +W ++QG+++ I  +L+TQALL AIG+G + A    A
Sbjct: 56  RMLGAFVPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGA 115

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
              W L+D  G L  I+ + + G + D N K WRL AD + +    +++L+P FP   + 
Sbjct: 116 TFQWFLRDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIV 175

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           I       RS   +   +TR+     FA   N A++ AK  +Q  ++  +G+ LG+ LA 
Sbjct: 176 IMCLGSLSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLGMLLAQ 235

Query: 297 HIGSSMPFALASFSVVTWIHMY 318
                     ASF  +T  HMY
Sbjct: 236 ITRGHALSVWASFLSLTMFHMY 257


>gi|302881823|ref|XP_003039822.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
           77-13-4]
 gi|256720689|gb|EEU34109.1| hypothetical protein NECHADRAFT_49969 [Nectria haematococca mpVI
           77-13-4]
          Length = 444

 Score =  112 bits (281), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 71/251 (28%), Positives = 127/251 (50%), Gaps = 4/251 (1%)

Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIP 174
            R++ +   LP G+P SV+ DYL Y  + S+Q   S I+ +LA +ALL  +G+G   +  
Sbjct: 50  MRKILMDAFLPVGYPHSVSDDYLAYQFFDSLQAFFSTITALLANRALLQGLGVGDANSSA 109

Query: 175 TAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PH 232
           T A +  VLKD +  ++ I+ +  FG   + + K +R  ADL  + AF +E+ +P F P+
Sbjct: 110 TFAMLLTVLKDAMSRVATIIFAQQFGLRIEPDAKRYRFLADLFNDTAFFMELYSPYFGPY 169

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
             +    +  A R+   +   ++++     FA   N AE+ AK  +Q      +G+++G 
Sbjct: 170 GKILALTSGEALRALCGVAGGASKAALSVHFAKHDNLAELNAKEASQETAVGLVGLLVGT 229

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            +  H+       +   +V+   H++ N    +S+ +  LN  RA++++ EYL SG    
Sbjct: 230 FVVKHVEDHTS-VVYLMTVLVLAHLWMNYLGVRSVCMNVLNRQRATILYKEYLKSGNVLT 288

Query: 353 VKEVNDEEPLF 363
             EV   E + 
Sbjct: 289 PNEVARRESIL 299


>gi|429848918|gb|ELA24353.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 440

 Score =  112 bits (281), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 75/250 (30%), Positives = 129/250 (51%), Gaps = 9/250 (3%)

Query: 99  SNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLA 158
           S  ++S LL++N      R++F     P G+PDSV+ DYL Y L+ S+Q   S I+ +LA
Sbjct: 41  SPTTISRLLTMN-----IRQVFSDAFFPIGYPDSVSKDYLGYQLYDSLQAFFSTITSLLA 95

Query: 159 TQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLL 216
            +A+L  +G+G   +  T A +  +LKDGI  L+ I  +  FG   +   K +R  AD+ 
Sbjct: 96  NRAILQGLGVGDADSSATYALLLTILKDGISRLATIGFAYRFGLVIEPECKKYRFLADIF 155

Query: 217 ENAAFGLEMLTPAFPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
            ++AF L++ +P F     V     A A R+   +   ++++     FA R N +E+ AK
Sbjct: 156 NDSAFFLDLFSPLFGSWTKVAALVVAEALRAMCGVAAGASKAALSKHFALRGNLSELNAK 215

Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
             +Q      IG+++G  +  H+  S     A   ++ ++H+  N    + ++L   N  
Sbjct: 216 ESSQETAVGLIGLIVGSFVVRHV-ESREAVFALMIILVFVHLGMNYLGVRCVQLNNFNQQ 274

Query: 336 RASLVFSEYL 345
           RA++ F EY+
Sbjct: 275 RATIFFEEYM 284


>gi|449301217|gb|EMC97228.1| hypothetical protein BAUCODRAFT_147370 [Baudoinia compniacensis
           UAMH 10762]
          Length = 454

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 75/254 (29%), Positives = 137/254 (53%), Gaps = 9/254 (3%)

Query: 113 WDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG- 171
           W    ++F    LP GFP SVT DY+ Y ++ S+Q  +S I G+L+++A+L ++G+G   
Sbjct: 45  WQRLLDIF----LPAGFPHSVTVDYVEYQIYDSLQAFSSNIVGLLSSRAVLLSVGVGDAD 100

Query: 172 AIPTAAAINWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
           A PT A +  + ++ +G +  I  ++ +G   +   K +RL AD+  + AF L+ L+PA 
Sbjct: 101 ASPTYALLLHIAQESLGRIGTIAFVALYGTALEPECKMYRLLADVFNDFAFVLDCLSPAI 160

Query: 231 PHHFVFIGAAAGAGRSAAALIQA-STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
           P +   +  +  +   A   + A S ++   A FA   N  E+ AK  +Q  V   +G+ 
Sbjct: 161 PKYLRVVTLSFSSLLKALCGVAAGSAKASLSAHFAQWGNLGELNAKDSSQETVISLLGMW 220

Query: 290 LGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
           +G  + + + +       +  ++  IH+  N ++ +++ +RTLN  RA+LVF  +L  G 
Sbjct: 221 VGSLVLSWVRTPQA-TWTTLILLLAIHLETNRRAVRAVVMRTLNRQRATLVF-HHLQRGH 278

Query: 350 APPVKEVNDEEPLF 363
            P  +EV+ +E +F
Sbjct: 279 VPTPEEVSKQERIF 292


>gi|342881518|gb|EGU82408.1| hypothetical protein FOXB_07090 [Fusarium oxysporum Fo5176]
          Length = 448

 Score =  112 bits (280), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 73/251 (29%), Positives = 130/251 (51%), Gaps = 8/251 (3%)

Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIP 174
            RE F    +P G+P SV+SDYLNY  + S+Q   S I+ +LA +ALL  +G+G   +  
Sbjct: 51  LREAF----MPVGYPHSVSSDYLNYQFFDSLQAFFSTITSLLANRALLQGLGVGDANSSA 106

Query: 175 TAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PH 232
           T A +  VLKD +  ++ I+ ++ FG   + + K +R  ADL  + AF LE+ +P F P+
Sbjct: 107 TFAMLLTVLKDAMSRIATIVFAHQFGLRIEPDAKRFRFLADLFNDTAFFLELYSPYFGPY 166

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
             +    +    R+   +   ++++     FA   N +E+ AK  +Q      IG+++G 
Sbjct: 167 GKILALTSGETLRALCGVAAGASKAALSVHFAKHDNLSELNAKEASQETAVGLIGLLVGT 226

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
            +  ++       L   +V+ + H++ N    +++ +  LN  RA+++F EYL +G    
Sbjct: 227 IVVKYVEDHHS-VLFLMTVLVFAHLWMNYLGVRAVCMDNLNQQRATILFEEYLKTGNVLS 285

Query: 353 VKEVNDEEPLF 363
            +EV   E + 
Sbjct: 286 PEEVAHRENIL 296


>gi|380480425|emb|CCF42443.1| hypothetical protein CH063_12442 [Colletotrichum higginsianum]
          Length = 484

 Score =  112 bits (280), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 131/257 (50%), Gaps = 8/257 (3%)

Query: 110 NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG 169
            K W    E+F    LP+G+P SVT DY  Y ++ SVQ  A  I+G+++++A+   +G+G
Sbjct: 44  KKPWLRLLEIF----LPDGYPHSVTEDYAAYQIYDSVQAFAGSIAGMISSRAVWEGLGVG 99

Query: 170 KG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLT 227
              A PT A +  V+++  G  + I  ++ FG   +   K WRL AD+L + A  L+ L+
Sbjct: 100 DSLASPTGAMLVQVIRESTGRFATIAFAHLFGTSIEAECKAWRLAADVLCDVAMVLDCLS 159

Query: 228 PAFPHHFVFIGAA-AGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
           P FPH   F+    +    SA+ +   +++S     FA   N  E+ AK  +Q      I
Sbjct: 160 PFFPHGVRFLVLCFSSLLYSASGVAGNASKSSLSGHFAKWNNLGELNAKDASQETAISLI 219

Query: 287 GIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
           G++ G  + + + +      A+   +  +H++ N K  ++++ R+LN  RA++ FS    
Sbjct: 220 GMISGTFVVSLL-TDQRATWATLIFLLALHLFLNWKGVRAVKSRSLNRQRANIAFSALFS 278

Query: 347 SGQAPPVKEVNDEEPLF 363
            GQ      V + E +F
Sbjct: 279 KGQVLTPHHVAERERIF 295


>gi|367028787|ref|XP_003663677.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
           42464]
 gi|347010947|gb|AEO58432.1| hypothetical protein MYCTH_103025 [Myceliophthora thermophila ATCC
           42464]
          Length = 448

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 131/242 (54%), Gaps = 4/242 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           LP G+P +VT DYL Y  + S+Q  AS I+ +LA +A+L  +G+G   + PT A I  + 
Sbjct: 45  LPAGYPHTVTGDYLAYQTFDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 104

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-HFVFIGAAA 241
            D I  ++ I+ ++  G+  +   K +R  AD+  ++A  L++LTPA P+   + +  +A
Sbjct: 105 GDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDSAQFLDLLTPALPYLPKLGVIVSA 164

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
           G  RS   +   ++++   A FA   N AE+ AK  +Q  V   +G+++G +L   +   
Sbjct: 165 GVLRSLCGVAANASKASLSAHFALTGNLAELNAKEASQETVVSLLGMLVG-SLVVRMVED 223

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
                    ++  +H+  N ++ +S+++R+LN  RA++VF E+L  G     ++V   E 
Sbjct: 224 KQVVWMLMVILAGVHLTMNYRAVRSVKMRSLNRQRATIVFREWLDHGVVLTPEQVAQRES 283

Query: 362 LF 363
           + 
Sbjct: 284 IL 285


>gi|297825247|ref|XP_002880506.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326345|gb|EFH56765.1| hypothetical protein ARALYDRAFT_481217 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 519

 Score =  112 bits (279), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 75/241 (31%), Positives = 125/241 (51%), Gaps = 12/241 (4%)

Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
            P  V+ +Y++Y  W+ +  V S    VLATQ +  AIG+G+  ++ ++AA NW+LKDG+
Sbjct: 123 LPKQVSDNYISYVKWKFLHRVFSSALQVLATQVMFRAIGIGQSRSLASSAAFNWILKDGL 182

Query: 188 GYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
           G LS+ I  ++    FD N K  R    +L + + G+E++TP FP +F+ + + A   + 
Sbjct: 183 GRLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQYFLLLASIANIAKQ 242

Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA------NHIGS 300
            +     +T S  +  FA   N  EV AK + Q +    +G++L + L         + +
Sbjct: 243 ISLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAVLLNMLFQNNQRLQA 302

Query: 301 SMPFALAS-FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
           S+PF L   FS    + +Y  LK    I L+TL   R  ++   ++   Q P   EV++E
Sbjct: 303 SLPFVLYPIFSTFDLLGIYQGLK---HINLQTLTKDRLEIILERWIEFRQVPSPAEVSEE 359

Query: 360 E 360
           E
Sbjct: 360 E 360


>gi|79561141|ref|NP_179928.2| uncharacterized protein [Arabidopsis thaliana]
 gi|51970902|dbj|BAD44143.1| unnamed protein product [Arabidopsis thaliana]
 gi|330252365|gb|AEC07459.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 520

 Score =  112 bits (279), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 73/241 (30%), Positives = 122/241 (50%), Gaps = 12/241 (4%)

Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
            P  V+ +Y++Y  W+ +  V S    VLATQA+  AIG+G+  ++ ++AA NW+LKDG+
Sbjct: 124 LPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSLASSAAFNWILKDGL 183

Query: 188 GYLSK-IMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
           G LS+ I  ++    FD N K  R    +L + + G+E++TP FP +F+ + + A   + 
Sbjct: 184 GRLSRCIYTASLASAFDTNLKRVRFSTSVLFSLSIGVELMTPVFPQYFLLLASIANIAKQ 243

Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMV------SKAIGIMLGIALANHIGS 300
            +     +T S  +  FA   N  EV AK + Q +         A+ + +       + +
Sbjct: 244 ISLSCYLATGSAVHRSFAVADNLGEVSAKAQIQTVCFDNLGLLLAVLLNMLFQHNQRLQA 303

Query: 301 SMPFALAS-FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDE 359
            +PF L   FS    + +Y  LK    I L+TL   R  ++   ++   Q P   EV++E
Sbjct: 304 CLPFVLYPIFSTFDLLGIYQGLK---HINLQTLTKDRLEIILERWIEFRQVPSPAEVSEE 360

Query: 360 E 360
           E
Sbjct: 361 E 361


>gi|449690512|ref|XP_002160836.2| PREDICTED: UPF0420 protein-like, partial [Hydra magnipapillata]
          Length = 172

 Score =  111 bits (278), Expect = 9e-22,   Method: Composition-based stats.
 Identities = 67/174 (38%), Positives = 97/174 (55%), Gaps = 11/174 (6%)

Query: 74  WEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSV 133
           +E  GS   K++  + +D F   + +       LSV   +   + LF    LP+GFP+SV
Sbjct: 8   YESYGSNEEKIV--YYEDGFCKKTKN-------LSVIGFYTTLKLLFQNVFLPQGFPESV 58

Query: 134 TSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSK 192
           + DYL Y LW ++Q   S I+G+LATQA+L A G+G   A  TAA I W+LKDG G + +
Sbjct: 59  SEDYLRYQLWDTLQAFCSSITGMLATQAMLKAYGVGDNTATVTAATITWMLKDGAGMIGR 118

Query: 193 IMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGR 245
           I+ +   G   D + K WRL AD+L N A  L+++ P F  +FV I   +   R
Sbjct: 119 ILFAWKKGTQLDCDAKRWRLVADVLNNCAIMLDLVAPLFKQYFVLIACLSSVAR 172


>gi|317037751|ref|XP_001399065.2| hypothetical protein ANI_1_1452164 [Aspergillus niger CBS 513.88]
          Length = 525

 Score =  111 bits (278), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 130/242 (53%), Gaps = 21/242 (8%)

Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIP 174
            ++  V+  LP G+P SVT DY  Y L+ S+Q  +S I+G+L+++A+L  +G+G   A P
Sbjct: 68  VQDFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASP 127

Query: 175 TAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP-- 231
           TAA +  +L+D  G ++ I+ ++  G   +   K +RL AD+  + A  L+ L+P  P  
Sbjct: 128 TAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMILDCLSPMIPAG 187

Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
              V + + AG  R+   +   S+++   A F+   N AEV AK  +Q  +   IG+++G
Sbjct: 188 APRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVG 247

Query: 292 IALANHIGSSMPFALASFSVVTWI--------HMYCNLKSYQSIELRTLNPYRASLVFSE 343
             + + +         +FS  TWI        H+  N  + +S+++ +LN  RA++VFS 
Sbjct: 248 SFVVSRV--------TNFS-TTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFST 298

Query: 344 YL 345
            L
Sbjct: 299 LL 300


>gi|219127360|ref|XP_002183905.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404628|gb|EEC44574.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 891

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 77/273 (28%), Positives = 129/273 (47%), Gaps = 30/273 (10%)

Query: 99  SNASLSSLLSVNKLWDECRELFVQF----MLPEGFPDSVTSDYLNYSLWRSVQGVASQIS 154
           S+ +  S LS    W    +  ++F     +P G+P SV   YL+Y ++ S+QG++S + 
Sbjct: 409 SSHTFYSNLSSISFWQHLSQTPIRFATELFMPVGYPYSVREGYLSYQIYDSIQGLSSYLR 468

Query: 155 GVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLF 212
            VL +  +L A G+G   A    AA+ W LKDG+G +  ++ S     H D   K +RLF
Sbjct: 469 AVLCSAQVLQAAGVGDANATAIMAALTWALKDGLGMIGGLIFSYGVASHLDAYVKEFRLF 528

Query: 213 ADLLENAAFGLEMLTPAFPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAE 271
           AD++ +    L+ML P  P    ++I +AA   +    +   +T+S     FA + N A+
Sbjct: 529 ADVINDVGMLLDMLAPYVPRADLIWISSAATLCKVMCGIAAGATKSSITQHFALQGNMAD 588

Query: 272 VIAKGEAQGMVSKAIGIMLGIALANHIG-----------------SSMP------FALAS 308
           + AK   Q  +   IG++ G+ LA ++                  S +P           
Sbjct: 589 LNAKEATQETLVNLIGMVCGVTLARYLSQLEAGNEEGDTDRIGYRSWIPCISHDSIQWMV 648

Query: 309 FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           F+++T +H++ N K    + L TLN  R  +V 
Sbjct: 649 FTILTLLHIWSNWKGVSLLRLETLNRERTKVVL 681


>gi|171678219|ref|XP_001904059.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937179|emb|CAP61836.1| unnamed protein product [Podospora anserina S mat+]
          Length = 448

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 73/242 (30%), Positives = 131/242 (54%), Gaps = 4/242 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP G+P +VT+DYL Y  + S+Q  AS I+ +LA++A+L  +G+G   A PT A I  + 
Sbjct: 47  LPAGYPHTVTTDYLPYQTYDSLQAFASSITSLLASRAVLEGLGVGSSEASPTGALILKIT 106

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-HFVFIGAAA 241
            D I  ++ I+ ++  G+  +   K +R  AD+  +AA  L++LTPA P+   + +  +A
Sbjct: 107 GDTISRIATILFAHRMGQAIEPECKFYRFLADIFNDAAQFLDLLTPALPYLPKLGVIVSA 166

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
           G  RS   +   ++++   A FA   N AE+ AK  +Q  V   +G+++G +L   +   
Sbjct: 167 GVLRSLCGVAANASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLVG-SLVVRLVED 225

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
                    ++   H+  N ++ +++ + +LN  RA++VF E+L SG      +V+  E 
Sbjct: 226 KHKVWGLMIILAGCHLAMNYRAVRAVRMTSLNRQRATIVFREWLESGTVLNPAQVSQRES 285

Query: 362 LF 363
           + 
Sbjct: 286 IL 287


>gi|67539168|ref|XP_663358.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
 gi|40743657|gb|EAA62847.1| hypothetical protein AN5754.2 [Aspergillus nidulans FGSC A4]
          Length = 559

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 73/235 (31%), Positives = 130/235 (55%), Gaps = 10/235 (4%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
           R++ ++  LP G+P SV+ DY  Y ++ S+Q  +S I+G+LA++A+L  +G+G   A PT
Sbjct: 66  RDVLIEVFLPAGYPHSVSDDYTAYQIFDSLQAFSSSIAGLLASRAVLQGVGVGNANASPT 125

Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           +A +  +L+D  G ++ I+ ++  G   +   K +R  AD+  + A  L+  +P  P  F
Sbjct: 126 SALLLHILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCFSPMVPAGF 185

Query: 235 --VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
             V + +AAG  R+   +   S+++   A FA   N AEV A+      V   IG+++G 
Sbjct: 186 NRVTVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLAEVNAE-----TVISLIGMLVGS 240

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
            + +HI +S      +   +  +H+  N  + +S+++ TLN  RA++VFS  L S
Sbjct: 241 FVVSHI-TSFTATWLTLVFLLSMHLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 294


>gi|367049005|ref|XP_003654882.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
 gi|347002145|gb|AEO68546.1| hypothetical protein THITE_131994 [Thielavia terrestris NRRL 8126]
          Length = 451

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 127/242 (52%), Gaps = 4/242 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           LP G+P +VT DYL Y  + S+Q  AS I+ +LA +A+L  +G+G   + PT A I  + 
Sbjct: 46  LPAGYPHTVTDDYLAYQTYDSLQAFASSITSLLANRAVLEGLGVGDSSSSPTGALILKIT 105

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH-FVFIGAAA 241
            D I  ++ I+ ++  G+  +   K +R  ADL  ++A  L++LTPA P+   + +  +A
Sbjct: 106 GDTISRIATILFAHRMGQAIEPECKFYRFLADLFNDSAQFLDLLTPALPYFPKLAVIVSA 165

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
           G  RS   +   ++++   A FA   N AE+ AK  +Q  V   +G++ G  +   +   
Sbjct: 166 GVLRSLCGVAAGASKASLSAHFAVTGNLAELNAKEASQETVVSLLGMLAGSLVVRLVEDK 225

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
               L   ++   +H+  N ++ + + +R+LN  RA++VF E+L  G      +V   E 
Sbjct: 226 QLVWLLMVALAA-VHLIMNYRAVRCVRMRSLNRQRATIVFREWLDHGAVLTPAQVARRES 284

Query: 362 LF 363
           + 
Sbjct: 285 IL 286


>gi|115434452|ref|NP_001041984.1| Os01g0141600 [Oryza sativa Japonica Group]
 gi|54290177|dbj|BAD61065.1| unknown protein [Oryza sativa Japonica Group]
 gi|113531515|dbj|BAF03898.1| Os01g0141600 [Oryza sativa Japonica Group]
          Length = 323

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 70/218 (32%), Positives = 114/218 (52%), Gaps = 7/218 (3%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL----GKGAIPTAAA 178
           F+LP GFP SV+ DYL Y LW+    V   I   L T  LL A+G+    G  A  +AAA
Sbjct: 106 FVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAASAAA 165

Query: 179 INWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           I WV KDGIG   ++++   FG  FD +PK WR++AD + +A    E++TP +P +F+ +
Sbjct: 166 IRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYFLPL 225

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
            +     ++     +  +       FA   N  EV AK E   + ++ +G+ +G+ + + 
Sbjct: 226 ASLGNLAKAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLGLSIGVFIMDT 285

Query: 298 --IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
             I SS P    ++  V  +H++   +S   ++ RT+ 
Sbjct: 286 SGIKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTVR 323


>gi|7327810|emb|CAB82267.1| putative protein [Arabidopsis thaliana]
          Length = 447

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 78/292 (26%), Positives = 134/292 (45%), Gaps = 31/292 (10%)

Query: 179 INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           I WV KDGIG L ++++   FG  FD +PK WR++AD + +A    ++ T  +P  F+ +
Sbjct: 120 IRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFLLL 179

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
            +     ++ A  ++  +       FA   N  EV AK E   + ++ IG+  GI + + 
Sbjct: 180 ASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGILIIDT 239

Query: 298 IG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
            G   S PF L +++ +  +H++   +S   ++  T+N  RA ++   +++    P   +
Sbjct: 240 PGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQFNTVNLKRARIIVESHVVHSVVPGYVD 299

Query: 356 VNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA 415
            N  E +     F K +                  V +E       +LS +  +     A
Sbjct: 300 CNKRENILLWQRFMKPRII--------------FGVSLE-------ELSGLEKSVSKVKA 338

Query: 416 LFSLYEDEGYILT----EHGGKFCVVLKESALPQDMLKSLFQASYLYWLERN 463
           L  +Y  E YILT        +F V  K +A  +D+L+ L+QA   YWLE N
Sbjct: 339 LLKMYTKEKYILTLNKLNKDTEFSVSFKVNATSRDVLRCLWQA---YWLEEN 387


>gi|380471002|emb|CCF47486.1| hypothetical protein CH063_04176 [Colletotrichum higginsianum]
          Length = 478

 Score =  110 bits (274), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 73/249 (29%), Positives = 128/249 (51%), Gaps = 6/249 (2%)

Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK- 170
           L+    ++     LP G+PDSVT DY+ Y  + S+Q   S I+G+L+ +A+L  +G+G  
Sbjct: 56  LFTNMNQILFDAFLPIGYPDSVTPDYIGYQAYDSLQAFFSTITGLLSNRAILQGLGVGDP 115

Query: 171 GAIPTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 229
            +  T A +  +LKDGI  ++ I  +  FG   +   K +R  AD+  ++AF L++ +P 
Sbjct: 116 NSSATYALLLTILKDGISRVATIAFAYRFGLVIEPECKRYRFLADIFNDSAFFLDLFSPY 175

Query: 230 F-PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
           F P   V     A A R+   +   ++++     FA R N +E+ AK  +Q      +G+
Sbjct: 176 FDPWTKVAAIVLAEALRAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAIGLVGL 235

Query: 289 MLGIALANHI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
           ++G  +   + G    F L  F V  ++H+  N    + ++L TLN  RA+++F  Y  +
Sbjct: 236 LVGSIVVRFVEGREAVFVLMVFLV--FVHLAMNYFGVRCVQLDTLNQQRATILFDHYAQT 293

Query: 348 GQAPPVKEV 356
            +    K+V
Sbjct: 294 KRVLTPKQV 302


>gi|149067629|gb|EDM17181.1| similar to cDNA sequence BC017158, isoform CRA_a [Rattus
           norvegicus]
          Length = 246

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 55/115 (47%), Positives = 74/115 (64%), Gaps = 2/115 (1%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
           +LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  +G+G   A  +AA   W+
Sbjct: 75  LLPQGFPDSVSPDYLQYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
           +KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F  
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDTAMFLEIMAPMYPIFFTM 189


>gi|432112420|gb|ELK35212.1| hypothetical protein MDA_GLEAN10003470, partial [Myotis davidii]
          Length = 638

 Score =  109 bits (273), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 70/225 (31%), Positives = 114/225 (50%), Gaps = 2/225 (0%)

Query: 144 RSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAIN-WVLKDGIGYLSKIMLSNF-GRH 201
           + V   AS +SG LAT A+L  IG+G  +   +AA   W+LKD  G L +I+ + + G  
Sbjct: 267 KPVWAFASSLSGSLATHAVLLGIGVGNASASVSAATATWLLKDSTGMLGRIIFAWWKGSK 326

Query: 202 FDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYA 261
            D N K WRLFAD+L + A  LE++ P  P  F      +   +   ++   +TR+    
Sbjct: 327 LDCNAKQWRLFADILNDTAMFLEIMAPLLPICFTMTICISNLAKCIVSVAGGATRAALTM 386

Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
             A R N A+V AK  +Q  +    G+++ + +   +     F+L  F ++T +H+Y N 
Sbjct: 387 HQARRNNMADVSAKDGSQETLVNLAGLLVSLLMLPLVSDCPSFSLGCFFLLTALHVYANY 446

Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
           ++ +++ + TLN  R  LV   +L  G+       N  EPL+  F
Sbjct: 447 RAVRALVIETLNEGRLWLVLKHFLQRGEVLDPTSANQMEPLWTGF 491


>gi|159467675|ref|XP_001692017.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278744|gb|EDP04507.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 216

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 105/192 (54%), Gaps = 14/192 (7%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT- 175
           RELF    LP G+P +VT DYL+Y LW     +   +S  L T +LL A+G+  G   T 
Sbjct: 2   RELF----LPPGYPQTVTPDYLSYQLWSVPTHITGHLSHALVTSSLLSAVGVSAGPAATV 57

Query: 176 --AAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
             +A+I W++KDG+G L ++++ S F   FD +P+ WR+ A+LL     GLE+ T  +PH
Sbjct: 58  ALSASIKWIVKDGVGALGRLIVGSRFSAEFDEDPRRWRMVAELLSTCGMGLEVATVLYPH 117

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCF---YAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
           +F+ +   A +G+ A AL +   +  F      FA  +N   V AK E   + ++  G+ 
Sbjct: 118 YFLLL---ACSGKFAQALGKGMGKPVFRVIQTHFARAQNVGAVAAKEEVWEVTAQMAGLA 174

Query: 290 LGIALANHIGSS 301
             +A+   + +S
Sbjct: 175 ASVAVLRALEAS 186


>gi|358373469|dbj|GAA90067.1| DUF647 domain protein [Aspergillus kawachii IFO 4308]
          Length = 527

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 83/263 (31%), Positives = 141/263 (53%), Gaps = 26/263 (9%)

Query: 95  VASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQIS 154
           VA AS+ +L++  S + L     +  V+  LP G+P SVT DY  Y L+ S+Q  +S I+
Sbjct: 52  VAHASSTTLAAWSSKSVL-----DFLVEVFLPAGYPHSVTDDYAPYQLFDSLQAFSSSIA 106

Query: 155 GVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLF 212
           G+L+++A+L  +G+G   A PTAA +  +L+D  G ++ I+ ++  G   +   K +RL 
Sbjct: 107 GLLSSRAVLQGVGVGNADASPTAALLLHILQDSSGRIATILFAHRVGTALEPECKMYRLA 166

Query: 213 ADLLENAAFGLEMLTPAFP--HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFA 270
           AD+  + A  L+ L+P  P     V + + AG  R+   +   S+++   A F+   N A
Sbjct: 167 ADVFNDLAMILDCLSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLA 226

Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWI--------HMYCNLK 322
           EV AK  +Q  +   IG+++G  + + +         SFS  TWI        H+  N  
Sbjct: 227 EVNAKDSSQETIISLIGMLVGSFVVSRV--------TSFS-TTWICLLMLLALHLSLNYA 277

Query: 323 SYQSIELRTLNPYRASLVFSEYL 345
           + +S+++ +LN  RA++VFS  L
Sbjct: 278 AVRSVQMTSLNRQRANIVFSTLL 300


>gi|428170130|gb|EKX39058.1| hypothetical protein GUITHDRAFT_114933 [Guillardia theta CCMP2712]
          Length = 405

 Score =  108 bits (271), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 85/310 (27%), Positives = 140/310 (45%), Gaps = 62/310 (20%)

Query: 121 VQFMLPEGF-PDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
           V F + + F P  V+ DY ++++WR +Q + S    V  TQ+LL A+G+    +  AAA 
Sbjct: 90  VAFKVKQSFLPRGVSEDYYSFTIWRILQRIVSSTVSVFGTQSLLLALGVKTNKVGVAAAS 149

Query: 180 NWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
            WV  +  G   K++ +  +GR FD + K WR                            
Sbjct: 150 GWVFANAFGKFGKMIFAARWGRDFDSDAKRWR---------------------------- 181

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAAR-RNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
                         ++TR  FY  FA R  N  ++ AKG+AQ  V+  +G+++GI LA  
Sbjct: 182 --------------SATRGTFYKQFAGRNENLGDITAKGDAQVAVADLVGLVVGIELARL 227

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
            G++      S+ V++ +     LKS  +I  R LN  RA+++ S ++  G      +++
Sbjct: 228 AGTNALCVWGSYVVLSALDFIFILKSLDTIVFRFLNLERATMLASAFVSEGIIKSPADIS 287

Query: 358 DEEPLFPAFHFFKIKSANK----------SQLLVLSSEAKD-----AAVEIEH-RLQLGS 401
             EP+       K++   K          SQ +VL   AKD     + + +EH RL L S
Sbjct: 288 RSEPVLSRPR-HKLRETFKGYWASTFRSISQCVVLKQSAKDLDILQSLLALEHFRLHLSS 346

Query: 402 KLSDVVNNKE 411
           + SD ++ +E
Sbjct: 347 RQSDSLDFEE 356


>gi|408393145|gb|EKJ72412.1| hypothetical protein FPSE_07436 [Fusarium pseudograminearum CS3096]
          Length = 427

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 120/242 (49%), Gaps = 4/242 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           +P G+P SV+SDYLNY  + S+Q   S I+ +LA +ALL  +G+G   +  T A +  VL
Sbjct: 70  MPVGYPHSVSSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVL 129

Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAA 241
           KD I  ++ I  +  FG   + + K +R  ADL  + AF +E+ +P   P+  +      
Sbjct: 130 KDAISRVATIAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTG 189

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
            A R+   +   ++++     FA   N AE+ AK  +Q      IG+ +G  +  ++   
Sbjct: 190 EALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTIVVKYVEDH 249

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
                    V+   H++ N    + + +  LN  RA+++F EYL +GQ    +EV   E 
Sbjct: 250 NTVVFLMI-VLVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGQIMTPEEVAKRES 308

Query: 362 LF 363
           + 
Sbjct: 309 IL 310


>gi|297740212|emb|CBI30394.3| unnamed protein product [Vitis vinifera]
          Length = 462

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 87/356 (24%), Positives = 153/356 (42%), Gaps = 42/356 (11%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYA-----IGLGKGAIPTAA 177
           F+LP GFP SV+ DYL Y L +    V + I   L T +LL A                 
Sbjct: 79  FILPAGFPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKASCWSWFFFSNHCCCFCC 138

Query: 178 AINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
                 +DGIG + ++ +   FG  FD +PK WR++ADL+ +A    ++ T  +P +F+ 
Sbjct: 139 CFCCCHQDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLYPAYFLQ 198

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           + +     ++ A  ++  +       FA   N  EV AK E   + ++ +G+ LGI +  
Sbjct: 199 LASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMVLT 258

Query: 297 H--IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
              + +S P    ++  +  +H++   +S   +  +++N  RA ++   ++L    P   
Sbjct: 259 SPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPGYV 318

Query: 355 EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE--- 411
           + N +E +     F K                         R+  G  L +++  +    
Sbjct: 319 DCNRKEKILSWQRFLK------------------------PRIIFGVSLEELIGGERPVS 354

Query: 412 DAHALFSLYEDEGYILT--EHGGKFCVVL--KESALPQDMLKSLFQASYLYWLERN 463
               L  LY  E YIL   + G  F V +  K  A    +L+S++Q+   YWL+ N
Sbjct: 355 KVRTLLKLYSQEKYILVVKQQGTDFEVFVSFKVGATSLSVLRSIWQS---YWLQEN 407


>gi|303280291|ref|XP_003059438.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459274|gb|EEH56570.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 568

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 79/253 (31%), Positives = 125/253 (49%), Gaps = 16/253 (6%)

Query: 113 WDECRELFVQ-FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK- 170
           WD  +   ++   LP G+PD+V+SDY  +  W     +   +  VL +Q+LL A+G+G  
Sbjct: 138 WDPRKSPRLRAVFLPAGYPDTVSSDYATFIRWHLASLMFRNVLEVLTSQSLLVALGMGST 197

Query: 171 -GAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
            GA+P AAA  WVLKDG+G  + ++   F G+ +D +PK W    + LE+ A  LE++TP
Sbjct: 198 PGALPLAAATKWVLKDGVGSFATLLAGVFGGQKYDEDPKRWWAVTNSLEDVARVLELVTP 257

Query: 229 AFPHHFVFIGAAAGAGRSAAALIQAS-TRSCFYAGFAARRNFAEVIAKGEAQGM-----V 282
           A+P  F+ + A+A   R+ A   + S     F       +NF +V AK EA G+     V
Sbjct: 258 AYPALFLPLAASATFVRAGALTGRGSLINGTFMQHLGRNQNFGDVRAKMEALGIAIFRGV 317

Query: 283 SKAIGIMLGIALANHIGSSMP------FALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
           S A G       A    ++M           ++  V   H++C  K+   +    LN +R
Sbjct: 318 SGAFGADAAAMAAEGGETAMSELTRYGGVFGAYGAVLSSHLFCCWKAADVLRFDALNRHR 377

Query: 337 ASLVFSEYLLSGQ 349
            +    EY+  G+
Sbjct: 378 LTRAAEEYVRGGR 390


>gi|148685696|gb|EDL17643.1| cDNA sequence BC017158, isoform CRA_a [Mus musculus]
          Length = 226

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 55/115 (47%), Positives = 74/115 (64%), Gaps = 2/115 (1%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
           +LP+GFPDSV+ DYL Y LW SVQ  AS +SG LATQA+L  +G+G   A  +AA   W+
Sbjct: 75  LLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATQAVLQGLGVGNAKASVSAATSTWL 134

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
           +KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P +P  F  
Sbjct: 135 VKDSTGMLGRIIFAWWKGSKLDCNAKQWRLFADILNDVAMFLEIMAPMYPIFFTM 189


>gi|223974077|gb|ACN31226.1| unknown [Zea mays]
          Length = 207

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 69/196 (35%), Positives = 106/196 (54%), Gaps = 2/196 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVL 183
           +PEGFP SVT DY+ + +W ++QG+++ I  +L+TQALL AIG+G + A    A   W L
Sbjct: 6   VPEGFPGSVTPDYVPFQMWDTLQGLSTYIRAMLSTQALLGAIGVGEQSATVIGATFQWFL 65

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           +D  G L  I+ + + G + D N K WRL AD + +    +++L+P FP   + I     
Sbjct: 66  RDLTGMLGGILFTFYQGSNLDSNAKMWRLVADFMNDLGMLMDLLSPLFPSSLIVIMCLGS 125

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             RS   +   +TR+     FA   N A++ AK  +Q  ++  +G+ LG+ LA       
Sbjct: 126 LSRSFTGVASGATRAALTQHFALANNAADISAKEGSQETLATMLGMGLGMLLAQITRGHA 185

Query: 303 PFALASFSVVTWIHMY 318
               ASF  +T  HMY
Sbjct: 186 LSVWASFLSLTMFHMY 201


>gi|346978906|gb|EGY22358.1| hypothetical protein VDAG_03796 [Verticillium dahliae VdLs.17]
          Length = 455

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 81/286 (28%), Positives = 141/286 (49%), Gaps = 28/286 (9%)

Query: 100 NASLSSLLSVNKLWDECRELFV----------------QFM----LPEGFPDSVTSDYLN 139
           +A ++S  ++  + +  R+LF                 QF+    LP  +PDSV+ DY+ 
Sbjct: 37  DAQVTSYRTIGTILEPLRQLFSPANAVHRVESLTRGTPQFLRDAFLPVDYPDSVSPDYIK 96

Query: 140 YSLWRSVQGVASQISGVLATQALLYAIGLG--KGAIPTAAAINWVLKDGIGYLSKIMLS- 196
           Y ++ S Q   S I+G+LA +A+L  +G+G   G++  A  +   LKDG+  L+ I  + 
Sbjct: 97  YQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHALLLT-CLKDGVSRLATITFAY 155

Query: 197 NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-HFVFIGAAAGAGRSAAALIQAST 255
            FG   + N K +R  ADL  + AF LE+  P F     + I  +A + R+   +   ++
Sbjct: 156 RFGSVIEANAKKYRFLADLFNDTAFFLELANPFFAGWAKILILVSAESLRALCGIAAGAS 215

Query: 256 RSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS-MPFALASFSVVTW 314
           ++     FA R N AE+  K  +Q      IG++ G A+ +++ +    F L  F V  +
Sbjct: 216 KAALSTHFARRNNLAELNTKEASQETAIGLIGLLAGTAIIHYVQNRGAVFFLTIFLV--F 273

Query: 315 IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
           IH+  N     +++L T+N  RA++V   YL + +     E + +E
Sbjct: 274 IHLGMNYLGVCALQLDTVNRQRATIVMQHYLRTQEVLTPAEASKKE 319


>gi|358397603|gb|EHK46971.1| hypothetical protein TRIATDRAFT_291240 [Trichoderma atroviride IMI
           206040]
          Length = 468

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 78/273 (28%), Positives = 136/273 (49%), Gaps = 10/273 (3%)

Query: 91  DAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVA 150
           D  ++   SN+  S      + + E R+  +   LP GFP SV+ DYL Y  + S+Q   
Sbjct: 34  DGHILPKTSNSLFS------RSFREWRKAIMDSFLPVGFPHSVSKDYLAYQTFDSLQAFF 87

Query: 151 SQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKG 208
           S IS +LA +ALL  +G+G   +  T A +  ++KD +  ++ I+ ++ FG   + + K 
Sbjct: 88  STISSLLANRALLQGLGVGDASSSATFALLLTIMKDAMSRIATIVFAHRFGLRIEPDAKR 147

Query: 209 WRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG-AGRSAAALIQASTRSCFYAGFAARR 267
           +R  AD+  ++AF LE+ +P        I  + G A R+   +   ++++     FA   
Sbjct: 148 YRFLADVFNDSAFFLELYSPYLGSWGKVIALSTGQALRALCGVAAGASKAALSVHFAKHD 207

Query: 268 NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSI 327
           N AE+ AK  +Q      IG+++G  +   I       +    +V   H+  N    +S+
Sbjct: 208 NLAELNAKEASQETAVGLIGLLVGTLVVKMIEDHSSVVILMILLVI-AHLGTNYLGVRSV 266

Query: 328 ELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
            + TLN  RA++VF E+L SG+    ++V + E
Sbjct: 267 CMATLNRQRATIVFEEFLRSGKVLTPEQVAERE 299


>gi|125568971|gb|EAZ10486.1| hypothetical protein OsJ_00318 [Oryza sativa Japonica Group]
          Length = 459

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 74/228 (32%), Positives = 117/228 (51%), Gaps = 17/228 (7%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL----GKGAIPTAAA 178
           F+LP GFP SV+ DYL Y LW+    V   I   L T  LL A+G+    G  A  +AAA
Sbjct: 123 FVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAASAAA 182

Query: 179 INWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           I WV KDGIG   ++++   FG  FD +PK WR++AD + +A    E++TP +P +F+ +
Sbjct: 183 IRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYFLPL 242

Query: 238 GAAAGAGRS---AAALIQASTR-------SCFYAGFAARRNFAEVIAKGEAQGMVSKAIG 287
            +     +    +  L QA  R             FA   N  EV AK E   + ++ +G
Sbjct: 243 ASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAKEEVWEVGAQLLG 302

Query: 288 IMLGIALANH--IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
           + +G+ + +   I SS P    ++  V  +H++   +S   ++ RT++
Sbjct: 303 LSIGVFIMDTSGIKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTVD 350


>gi|414586420|tpg|DAA36991.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 306

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 151/320 (47%), Gaps = 33/320 (10%)

Query: 192 KIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALI 251
           K++ S+ G   D  PK WR+ AD+L +    LE+++P  P  F+ +       +  A + 
Sbjct: 6   KLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVA 65

Query: 252 QASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSV 311
             +TR   Y+ FA   N +++ AKGEA   +   +GI  GI LA+ + S+    L +  +
Sbjct: 66  ARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPL 125

Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN-DEEPLFPAFHFFK 370
           ++ +H+Y  ++  ++  + TLNP R +++ ++++ SG+     E+   E+ LFP      
Sbjct: 126 LSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFIKSGKVSSPAELRYKEDLLFP------ 179

Query: 371 IKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEH 430
                            +  +E    +++G  L  V + +     L + +  E ++L++ 
Sbjct: 180 -----------------NRVIEEAGSVKIGQPLRKVFSPRL-VEQLRATFPGEKFLLSQK 221

Query: 431 GGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQR 490
             +  +VL+ SA  +D L+    A++   +ER+      + S D A    L  + + ++R
Sbjct: 222 SNRTYMVLERSASGEDALRGWLVAAFASEMERSG-----AGSRDDAV---LSEAYEKMER 273

Query: 491 EFNHVKSDSASVGWVTDGLI 510
            F    S+  S GW TD  +
Sbjct: 274 VFPTFVSEVRSRGWYTDQFL 293


>gi|387198702|gb|AFJ68865.1| hypothetical protein NGATSA_2057300, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 277

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 79/136 (58%), Gaps = 2/136 (1%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPT 175
           + L     LP GFP SV  +YL Y  W  VQ +++ + G+LATQA+L  +G+G+ GA P 
Sbjct: 138 KTLLHTVFLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGILATQAILEGVGVGREGATPL 197

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           AA + W+ +DG   +S ++ ++F   +F VN K WRLFAD + +    L+ML P FP +F
Sbjct: 198 AATLQWITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFADFMVDIGITLDMLAPLFPKNF 257

Query: 235 VFIGAAAGAGRSAAAL 250
           +     A   +S   +
Sbjct: 258 LLCICLASVCKSLCGI 273


>gi|422292654|gb|EKU19956.1| hypothetical protein NGA_2057300, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 274

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 79/136 (58%), Gaps = 2/136 (1%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPT 175
           + L     LP GFP SV  +YL Y  W  VQ +++ + G+LATQA+L  +G+G+ GA P 
Sbjct: 135 KTLLHTVFLPVGFPSSVAPEYLRYQQWNVVQDLSTYLRGILATQAILEGVGVGREGATPL 194

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           AA + W+ +DG   +S ++ ++F   +F VN K WRLFAD + +    L+ML P FP +F
Sbjct: 195 AATLQWITRDGASMMSGLVFTSFLSTNFGVNAKSWRLFADFMVDIGITLDMLAPLFPKNF 254

Query: 235 VFIGAAAGAGRSAAAL 250
           +     A   +S   +
Sbjct: 255 LLCICLASVCKSLCGI 270


>gi|255574377|ref|XP_002528102.1| conserved hypothetical protein [Ricinus communis]
 gi|223532491|gb|EEF34281.1| conserved hypothetical protein [Ricinus communis]
          Length = 485

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 159/357 (44%), Gaps = 43/357 (12%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAA---- 178
           F+LP GFP SV+ DY  Y L +    V   I   L T +LL A+G+G     TAAA    
Sbjct: 101 FILPAGFPGSVSDDYFQYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFTGSTAAAAASA 160

Query: 179 --INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
             I WV KDGIG L ++ +   FG  FD +PK WR++AD + +A    +++T  +P +F+
Sbjct: 161 AAIRWVSKDGIGALGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLITQVYPAYFL 220

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
            + +     ++ A  ++  +       FA   N  EV AK E   + ++ +G+ LGI + 
Sbjct: 221 PLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVGAQLLGLALGILIL 280

Query: 296 NHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           +  G   S P   +++  +  +H++    S   ++  T+N  RA L+   ++L       
Sbjct: 281 DTPGLVKSFPALASTWLSMRLLHLWLRYLSLSVLQFDTINLKRARLLVKSHILHSVVLGC 340

Query: 354 KEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA 413
            + N EE +     F K                         ++  G  L ++V  +  A
Sbjct: 341 VDCNREENILLWETFTK------------------------PQITFGVPLQEMVGGERSA 376

Query: 414 ---HALFSLYEDEGYILT--EHGGKFCVVL--KESALPQDMLKSLFQASYLYWLERN 463
                L  LY  E Y+L   +  G F V +  K  A     L+S++Q    YWL  N
Sbjct: 377 AQVKTLLKLYTKENYLLVVNQQKGDFEVFVSFKVGATSMSALRSVWQT---YWLHEN 430


>gi|412993086|emb|CCO16619.1| predicted protein [Bathycoccus prasinos]
          Length = 454

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 83/270 (30%), Positives = 131/270 (48%), Gaps = 24/270 (8%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG----K 170
           + R  F +  LP+GFP++V+ DYL +  W+ +      +  VL+ Q+LL A+G+      
Sbjct: 31  QFRAQFKRIFLPDGFPNTVSRDYLPWLKWQMLSLFFRDVLEVLSAQSLLVAVGMDVNQAN 90

Query: 171 GAIPTAAAINWVLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPA 229
            A P A A  WVLKDG G L+ + L     + FD NPK +   A+ LE+ +  +E+LTP 
Sbjct: 91  AAAPVAGAAKWVLKDGTGGLATLALGALNTKKFDENPKFFWSTANSLEDVSRAMELLTPL 150

Query: 230 FPHHFVFIGAAAGAGRSAAALIQAS-TRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
           FP HF+ I   A   RSAA   ++S     F   F+   N  ++  K E QG   +   +
Sbjct: 151 FPSHFLLIAGTATFVRSAALTGRSSLINGTFMTHFSRNENAGDIRTKLEVQGRWLQIAAL 210

Query: 289 MLGIAL-------------ANHIGSSMP---FALASFSVVTWIHMYCNLKSYQSIELRTL 332
            +GIAL              N +G  M      +A++  + + H +C   + +S+   TL
Sbjct: 211 PVGIALFRAVTNGLELDFATNSLGQEMKGVTVGVATYGSMLFAHSFCCYMAAKSLNFETL 270

Query: 333 NPYRASLVFSEYLLSG--QAPPVKEVNDEE 360
           +  RA +    ++ S       V+EV+D E
Sbjct: 271 SRKRALIQAKTFVESNGKNVLSVEEVSDVE 300


>gi|302847291|ref|XP_002955180.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
           nagariensis]
 gi|300259472|gb|EFJ43699.1| hypothetical protein VOLCADRAFT_121395 [Volvox carteri f.
           nagariensis]
          Length = 1055

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 93/323 (28%), Positives = 147/323 (45%), Gaps = 22/323 (6%)

Query: 57  RTATSSEDDGNKEYDAVWE---VKGSKRTKLIPDFTKDAFVVASASNASLS-SLLSVNKL 112
           R A+    +G+    A  E   V   K  +L+P +  D   + + S A+L  S ++  +L
Sbjct: 217 REASERHSNGHPSAPASLERLPVAVKKGDQLLP-YVWDGRRITAMSEAALQDSDVTTGRL 275

Query: 113 WDECRELFVQFMLPEGF---PDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG 169
            +    L VQ  L E F   P  VT DY  +  WR  Q   S      +  ALL A GLG
Sbjct: 276 RNWL--LGVQRQLKEAFLPDPRDVTPDYWEWLRWRLTQRFFSSTMQNFSFSALLMATGLG 333

Query: 170 KGAIPTAAA-INWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLT 227
                 A+A INW+LKDG+  + ++ +S +FG+ FD + K  R    L+  A    E  T
Sbjct: 334 AKKAFAASAAINWLLKDGVSRIVRMSVSTSFGQTFDADLKRMRFITSLIFTACMAGEFAT 393

Query: 228 PAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSK--A 285
           P +P HFV + + +  GR+       +T+  F A  A   N A++ +K +AQ MV    A
Sbjct: 394 PFYPQHFVALASISSVGRAVGLSAFVATQPAFQAALATGGNLADLTSKNQAQHMVMDMLA 453

Query: 286 IGIMLGIALANHIGSSMP-----FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
           +G+  G+     +   MP          + +     + C     ++++LRTLN  RA ++
Sbjct: 454 LGVSAGLTW---LCRGMPRGGLLLPAVMYPLCAAGDLTCIWHELKAVQLRTLNRERAEML 510

Query: 341 FSEYLLSGQAPPVKEVNDEEPLF 363
              ++  G  P   E++  E L 
Sbjct: 511 IERWMRRGAVPDAAEISAAENLI 533


>gi|358380832|gb|EHK18509.1| hypothetical protein TRIVIDRAFT_14438, partial [Trichoderma virens
           Gv29-8]
          Length = 451

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/251 (29%), Positives = 130/251 (51%), Gaps = 4/251 (1%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
           E R++     LP GFP SV+ DYL Y ++ S+Q   S IS +LA +ALL  +G+G   + 
Sbjct: 65  EWRKVLFDSFLPVGFPHSVSKDYLAYQMFDSLQAFFSTISSLLANRALLQGLGVGDASSS 124

Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            T A +  +LKD +  ++ I+ ++ FG   + + K +R  AD+  ++AF LE+ +P    
Sbjct: 125 ATFALLLTILKDAMSRIATIVFAHRFGLRIEPDAKRYRFLADIFNDSAFFLELYSPYLGS 184

Query: 233 HFVFIGAAAG-AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
               I  + G A R+   +   ++++     FA + N AE+ AK  +Q      IG+++G
Sbjct: 185 WGKVIALSTGEALRALCGVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVG 244

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
             +   I       +    +V + H+  N    +S+ + TLN  RA+++FSEYL +    
Sbjct: 245 TLVVKTIEDHSSVVILMILLV-FAHLGMNYLGVRSVCMTTLNRQRATILFSEYLKNRVVL 303

Query: 352 PVKEVNDEEPL 362
             ++V + E +
Sbjct: 304 TPEQVAERESI 314


>gi|224090990|ref|XP_002309136.1| predicted protein [Populus trichocarpa]
 gi|222855112|gb|EEE92659.1| predicted protein [Populus trichocarpa]
          Length = 500

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 105/412 (25%), Positives = 185/412 (44%), Gaps = 30/412 (7%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIG----LGKGAI 173
           ++   F+LP GFP SV+ DYL Y + +    +   I   L T +LL A+G     G  A 
Sbjct: 100 DILKDFILPAGFPGSVSDDYLQYMVLQFPTNITGWICHTLVTSSLLKAVGAGSFTGTDAA 159

Query: 174 PTAAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            +AAAI WV KDGIG L ++ +   FG  FD +PK WR++AD + +A    ++ T  +P 
Sbjct: 160 ASAAAIRWVSKDGIGALGRLFIGGRFGDLFDDDPKQWRMYADFIGSAGSIFDLTTQVYPA 219

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
           +F+ + +     ++ A  ++  +       FA   N  EV AK E   + ++ +G+ LGI
Sbjct: 220 YFLPLASLGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEVAAKEEVWEVGAQLLGLALGI 279

Query: 293 ALAN--HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
            + +   +  S P   +++  +  +H++   +S   +   T+N  RA ++   ++L  + 
Sbjct: 280 LILDTPDLVKSFPLLTSTWMSMRLLHLWLRYQSLSVLRFDTINLKRARILIKSHILQSRV 339

Query: 351 PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNK 410
               + N +E +     F + +      L  +    +  A +I+H + +  K+       
Sbjct: 340 LGCVDCNQDENILSWETFTRPRIIFGVPLEEMIGGERSVA-KIKHGVVIKIKM------- 391

Query: 411 EDAHALFSLYEDEGYILTEHGGK----FCVVLKESALPQDMLKSLFQASYLYWLERNAGI 466
                L  LY  E + L  +  K      V  K  A    +L+S++Q    YWL  N   
Sbjct: 392 -----LLKLYAKEKHFLVVNQQKTDFEVLVSFKVGATSMSVLRSVWQT---YWLHENWKS 443

Query: 467 VATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRI 518
               +S   +   +L  SL  ++ +F        +VGW T   I   +P +I
Sbjct: 444 FGNRSSD--SDYDQLAQSLLEMEVKFEDFMQQLDAVGWDTSQ-INLKVPRKI 492


>gi|46137815|ref|XP_390599.1| hypothetical protein FG10423.1 [Gibberella zeae PH-1]
          Length = 425

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 120/242 (49%), Gaps = 4/242 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           +P G+P SV+SDYLNY  + S+Q   S I+ +LA +ALL  +G+G   +  T A +  VL
Sbjct: 70  MPVGYPHSVSSDYLNYQFFDSMQAFFSTITSLLANRALLQGLGVGDANSSATFAMLLTVL 129

Query: 184 KDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAA 241
           KD I  ++ I  +  FG   + + K +R  ADL  + AF +E+ +P   P+  +      
Sbjct: 130 KDAISRVATIAFAQQFGLRIEPDAKRFRFLADLFNDTAFFMELYSPYLGPYGKILALTTG 189

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
            A R+   +   ++++     FA   N AE+ AK  +Q      IG+ +G  +  ++   
Sbjct: 190 EALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAIGLIGLAVGTLVVKYVEDH 249

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
                    V+   H++ N    + + +  LN  RA+++F EYL +G+    +EV   E 
Sbjct: 250 NTVVFLMI-VLVLAHLWMNYLGVRCVCMDNLNRQRATILFDEYLKTGKIMTPEEVATRES 308

Query: 362 LF 363
           + 
Sbjct: 309 IL 310


>gi|310789554|gb|EFQ25087.1| hypothetical protein GLRG_00231 [Glomerella graminicola M1.001]
          Length = 484

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 171/361 (47%), Gaps = 18/361 (4%)

Query: 110 NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG 169
            K+W +  E+F    LP+G+P SV+ DY  Y ++ S+Q  A  I+G+++++A+   +G+G
Sbjct: 59  KKIWLQLLEIF----LPDGYPHSVSDDYAAYQIYDSIQAFAGSIAGMISSRAVWEGLGVG 114

Query: 170 KG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLT 227
              A PT A +  V+++  G  + I  ++ FG   +   K +RL +D+L + A  L+ L+
Sbjct: 115 DSLASPTGAMLIQVIRESTGRFATITFAHLFGTSIEAECKAYRLASDILCDVAMVLDCLS 174

Query: 228 PAFPHHFVFIGAA-AGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAI 286
           P FP    F+    +    SA+ +   +++S     FA   N  E+ AK  +Q      +
Sbjct: 175 PFFPRSIRFLVLCFSSLLYSASGVAGNASKSSLSGHFAKWNNLGELNAKDASQETAISLM 234

Query: 287 GIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
           G++ G  + + +       +A  S++  +H++ N K  ++++ R+LN  RA++ FS    
Sbjct: 235 GMITGTFMVSLLTGPKATWVALISLLA-LHLFLNWKGVRAVKSRSLNRQRANIAFSALFS 293

Query: 347 SGQAPPVKEVNDEEPLFP-----AFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL-- 399
             Q      V++ E +F       F +   K     +  V   E   +    + R++   
Sbjct: 294 KDQVLTPVIVSEREFIFEERWGSVFRWNSGKVLGHCEFGVSLQEILQSLASGQERMKTFH 353

Query: 400 --GSKLSDVVN-NKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASY 456
               +L++++N   E+ + L+    +  Y       K  VVLK+   P+  LK+ F    
Sbjct: 354 LSTLELANLLNIYDEEEYVLWCEASETVYCDAASRVKVMVVLKDGVTPKSQLKAWFHGLL 413

Query: 457 L 457
           L
Sbjct: 414 L 414


>gi|336273098|ref|XP_003351304.1| hypothetical protein SMAC_03608 [Sordaria macrospora k-hell]
 gi|380092824|emb|CCC09577.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 498

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 78/250 (31%), Positives = 129/250 (51%), Gaps = 6/250 (2%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTA 176
           +  +   LP G+P SVT DYL Y  + S+Q   S I+ +LA +A+L  +G+G   + PT 
Sbjct: 39  QALLHAFLPAGYPHSVTPDYLPYQTYDSLQAFFSSITSLLANRAVLEGLGVGDSSSSPTG 98

Query: 177 AAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
           A I  +  D I  ++ I+ ++  G   +   K +R  ADLL ++A  L++LTPA P +F 
Sbjct: 99  ALILKIAGDTISRIATILFAHRMGTAIEPECKFYRFLADLLNDSAQLLDLLTPALP-YFP 157

Query: 236 FIGAAAGAG--RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
            +G    AG  RS   +   ++++   A FA R N AE+ AK  +Q  V   +GI++G +
Sbjct: 158 KLGVIVSAGVLRSLCGVAANASKASLSAHFALRGNLAELNAKEASQETVVSLLGILVG-S 216

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           L   +        A    +  +H+  N  + +++  RTLN  RA++V   +L  G+    
Sbjct: 217 LVVKVVQDRGVVWALMVGLLGVHLLMNYWAVRAVRCRTLNRQRATVVVRGWLEWGRVLGP 276

Query: 354 KEVNDEEPLF 363
           +EV   E + 
Sbjct: 277 EEVARMESIL 286


>gi|323455525|gb|EGB11393.1| hypothetical protein AURANDRAFT_11851, partial [Aureococcus
           anophagefferens]
          Length = 248

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 78/251 (31%), Positives = 127/251 (50%), Gaps = 18/251 (7%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP GFP +VT  Y  Y+  +      S   GVL+T+ LL A+G+G G A P AAA NW +
Sbjct: 2   LPAGFPTTVTPGYGTYAALQFAAYACSSACGVLSTRVLLAAVGVGDGAAAPLAAAANWAI 61

Query: 184 KDGIGYLSKIML-SNFGRHFDVNPKGWRL-------FADLLENAAFGLEMLTPAFPHHFV 235
           KDG+G L  +   + +    D  P+ WRL        A L+E AA       PAFP +FV
Sbjct: 62  KDGLGMLGGVAFAATWSNSLDARPRQWRLRSSAALDLASLVELAAL------PAFPQYFV 115

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAAR---RNFAEVIAKGEAQGMVSKAIGIMLGI 292
            I   A   ++ + L  +++R+  +   +AR    N A++ AK  +Q +V+   G+ LG+
Sbjct: 116 PIAGLANVAKNVSYLAASASRAAIHQSLSARSDASNLADLTAKTGSQTIVASLCGLALGV 175

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
           A +N  G+       +++ ++  H+ C   S + +   TL+  R + +   +  +G  P 
Sbjct: 176 AASNACGAEASDVWPAWAALSACHLGCTYASLKYVNTTTLDDARLAALVDAFRETGACPT 235

Query: 353 VKEVNDEEPLF 363
             +V++ E LF
Sbjct: 236 PADVSERESLF 246


>gi|389639940|ref|XP_003717603.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
 gi|351643422|gb|EHA51284.1| hypothetical protein MGG_09749 [Magnaporthe oryzae 70-15]
          Length = 479

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 71/250 (28%), Positives = 130/250 (52%), Gaps = 5/250 (2%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PT 175
           + +F  F LP GFP +VT DYL+Y ++ S+Q  +S I+ +LA +A+L  +G+G  +  PT
Sbjct: 43  KAIFYAF-LPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPT 101

Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAA-FGLEMLTPAFPHH 233
           AA +  +++D I  L+ IM ++  G   +   K +R  AD+  +AA     ++       
Sbjct: 102 AALVLKIIQDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLP 161

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
            V +   +G  RS   +   ++++   A FA   N AE+ AK  +Q  V   +G++ G A
Sbjct: 162 KVSVLVTSGCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTA 221

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPV 353
           +  H+            V+  +H+  N ++ +++ + TLN  RA++VF E+L +G+    
Sbjct: 222 VV-HMVEDKTAVWCWMVVLICMHLMMNYRAVRNVRMLTLNRQRATIVFREFLETGKVLTP 280

Query: 354 KEVNDEEPLF 363
            + +  E + 
Sbjct: 281 AQCSARESIL 290


>gi|145347876|ref|XP_001418386.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578615|gb|ABO96679.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 362

 Score =  106 bits (264), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 69/239 (28%), Positives = 122/239 (51%), Gaps = 21/239 (8%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--GA 172
           E R +F    LP GFP + ++DY+ +  W+ +  +   +  +++ Q+LL A+G+G   GA
Sbjct: 3   ELRRIF----LPSGFPATTSADYVEWLRWQLLSLLFRDVLEIMSAQSLLVALGMGNTPGA 58

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
           +P  AA  WVLKDG+G  + ++  +FG + +D +PK W   ++ LE+ A  +E++TPA P
Sbjct: 59  LPLTAAAKWVLKDGVGSFATLIAGSFGGQRYDEDPKRWWGLSNTLEDVARAIELVTPAAP 118

Query: 232 HHFVFIGAAAGAGRSAAALIQAS-TRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
             F+ + A+A   R+AA   + S     F   F    N  ++ AK E QG     I +  
Sbjct: 119 GLFLPLAASATFVRAAALTGRGSLLNGTFMQHFGRNNNLGDIRAKLEVQGRWLALIALPA 178

Query: 291 GIALANHIG-------------SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
           GI +   +               +   A  ++  V  +H++   ++ +S++  TLN +R
Sbjct: 179 GIQVFQLVSAAAADLAADGDEFGAFVAAFGAYGGVIGVHVFACWQAAKSLKFDTLNRFR 237


>gi|440465939|gb|ELQ35234.1| hypothetical protein OOU_Y34scaffold00720g28 [Magnaporthe oryzae
           Y34]
 gi|440485195|gb|ELQ65173.1| hypothetical protein OOW_P131scaffold00517g6 [Magnaporthe oryzae
           P131]
          Length = 431

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 71/248 (28%), Positives = 129/248 (52%), Gaps = 5/248 (2%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PTAA 177
           +F  F LP GFP +VT DYL+Y ++ S+Q  +S I+ +LA +A+L  +G+G  +  PTAA
Sbjct: 36  IFYAF-LPAGFPHTVTEDYLHYQIYDSLQAFSSSIASLLANRAVLEGLGVGDASQSPTAA 94

Query: 178 AINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAA-FGLEMLTPAFPHHFV 235
            +  +++D I  L+ IM ++  G   +   K +R  AD+  +AA     ++        V
Sbjct: 95  LVLKIIQDTISRLATIMFAHRMGTAIEPECKSYRFLADIFNDAAQILDLLVPVLPLLPKV 154

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
            +   +G  RS   +   ++++   A FA   N AE+ AK  +Q  V   +G++ G A+ 
Sbjct: 155 SVLVTSGCLRSLCGVAAGASKASLSAHFAKTGNLAELNAKEASQETVVSLVGMLTGTAVV 214

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
            H+            V+  +H+  N ++ +++ + TLN  RA++VF E+L +G+     +
Sbjct: 215 -HMVEDKTAVWCWMVVLICMHLMMNYRAVRNVRMLTLNRQRATIVFREFLETGKVLTPAQ 273

Query: 356 VNDEEPLF 363
            +  E + 
Sbjct: 274 CSARESIL 281


>gi|398015008|ref|XP_003860694.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322498916|emb|CBZ33989.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 675

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 78/272 (28%), Positives = 139/272 (51%), Gaps = 10/272 (3%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
           + +LP+G+P SVT D L ++LW   Q +A  ++  L+T+A+L  +G+G+  A  T++ ++
Sbjct: 70  RILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADLTSSTLS 129

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP---HHFVF 236
           W+++DG   +  I+ ++   +  +   K W+L AD   + A  LE+  P  P     F  
Sbjct: 130 WMMQDGTRMIGSIVFASLIPQGLECRAKTWQLVADFTTDIANLLELCAPWLPGCQTIFRV 189

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           +  AA A ++   +  + TR+ F   FA R N A++ AK   +  V   IG+ LG A+A 
Sbjct: 190 VLVAASAIKALGGVCDSGTRASFTQHFALRNNAADIAAKAATRSSVGSFIGLALGTAVAY 249

Query: 297 HI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
            +  +S    LA+F++ +   ++ + +  + ++LR LN  R      +Y L      +  
Sbjct: 250 SVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPRLEWCLEQYSLFKAKVLIAS 309

Query: 356 VNDEEPL--FPAFHFFKIKSANKSQ-LLVLSS 384
             D  PL   P       +SAN ++ LL+L S
Sbjct: 310 RKDSGPLSVMPILD-VSPRSANAAERLLILPS 340


>gi|339898153|ref|XP_001465441.2| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|321399425|emb|CAM67862.2| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 675

 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 78/272 (28%), Positives = 139/272 (51%), Gaps = 10/272 (3%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAIN 180
           + +LP+G+P SVT D L ++LW   Q +A  ++  L+T+A+L  +G+G+  A  T++ ++
Sbjct: 70  RILLPDGYPSSVTEDLLPFALWDLAQVLAKNVTATLSTRAVLLGVGVGESKADLTSSTLS 129

Query: 181 WVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP---HHFVF 236
           W+++DG   +  I+ ++   +  +   K W+L AD   + A  LE+  P  P     F  
Sbjct: 130 WMMQDGTRMIGSIVFASLIPQGLECRAKTWQLVADFTTDIANLLELCAPWLPGCQTIFRV 189

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           +  AA A ++   +  + TR+ F   FA R N A++ AK   +  V   IG+ LG A+A 
Sbjct: 190 VLVAASAIKALGGVCDSGTRASFAQHFALRNNAADIAAKAATRSSVGSFIGLALGTAVAY 249

Query: 297 HI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
            +  +S    LA+F++ +   ++ + +  + ++LR LN  R      +Y L      +  
Sbjct: 250 SVPATSRSLNLAAFALCSAFRIFAHYRGVRGVQLRHLNAPRLEWCLEQYSLFKAKVLIAS 309

Query: 356 VNDEEPL--FPAFHFFKIKSANKSQ-LLVLSS 384
             D  PL   P       +SAN ++ LL+L S
Sbjct: 310 RKDSGPLSVMPILD-VSPRSANAAERLLILPS 340


>gi|397563898|gb|EJK43986.1| hypothetical protein THAOC_37516 [Thalassiosira oceanica]
          Length = 542

 Score =  105 bits (262), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 79/291 (27%), Positives = 134/291 (46%), Gaps = 46/291 (15%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
           FV   LP  +P+SV S Y  ++ +     +A   + VL+TQALL A+G+G + A P AAA
Sbjct: 111 FVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAAA 170

Query: 179 INWVLKDGIGYLSKIML-------------------------SNFGR---------HFDV 204
           +NW+LKD  G L  ++                          +N GR           D 
Sbjct: 171 LNWLLKDFAGQLGGVLFVSQLGIDHEFWKRKINRLFRENNQGNNTGRPKKGNYQRGTADT 230

Query: 205 NPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAAGAGRSAAALIQASTRSCF---- 259
           NPK WR+ A L  + +  LE+ TP   P +F+   + A  G++   L  +++R+      
Sbjct: 231 NPKRWRMVAALALDLSTLLEICTPLLGPQYFLPCASVANVGKNIGWLAASASRAAINQSL 290

Query: 260 ----YAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFA--LASFSVVT 313
                   ++  N  +V AK  +Q +V+  +G  LGI  +  + +       ++ F V++
Sbjct: 291 SAGGSYSSSSGSNLGDVTAKSGSQAIVASLVGTALGIFFSKTLYADHGTVGVMSGFIVLS 350

Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFP 364
            +H     K+  ++ LR+L+ +R  +V SE+  +   P   +V  +E  FP
Sbjct: 351 AVHQVATYKALMAVPLRSLDRHRLHIVLSEFSRTSSVPSPSQVATKEKFFP 401


>gi|154292218|ref|XP_001546685.1| hypothetical protein BC1G_14192 [Botryotinia fuckeliana B05.10]
          Length = 449

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 165/337 (48%), Gaps = 53/337 (15%)

Query: 150 ASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPK 207
           AS I+G+LA++A+L  +G+G   A  T A +  +L++  G +S I+ ++  G   +   K
Sbjct: 60  ASTIAGLLASRAVLQGLGVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPECK 119

Query: 208 GWRLFADLLENAAFGLEMLTPAFPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAAR 266
            +RL AD+  +AA  L+ ++PAFP    VF+ +A+   +S   +   S+++   A FA  
Sbjct: 120 KYRLMADIFNDAAMILDCISPAFPKIPRVFLLSASSVCKSLCGVAAGSSKASLSAHFAKM 179

Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQS 326
            N AE+ AK  +Q  +   +G+++G  + + I S +   +A  ++++ IH+  N  + +S
Sbjct: 180 GNLAELNAKDASQETLISLLGMLVGTFVVSKISSQVATWIALLALLS-IHLGTNYMAVRS 238

Query: 327 IELRTLNPYRASLVFSEYLLSGQ-------APPVKEVNDEEPLFPAFHFFKIKSANKSQL 379
           + +RTLN  RA+LV S  L S +        P  ++++ +E +F                
Sbjct: 239 VTMRTLNRQRANLVISSLLSSPEHEKNKISLPSPRDISLQERIFE--------------- 283

Query: 380 LVLSSEAKDAAVEIEH-----RLQLGSKLSDVVNN-------------KEDAHALFSLYE 421
                  +D A+   H       ++G  L D++++             +    +L S+Y+
Sbjct: 284 -------RDGAIRNTHGTILAYCKVGVSLQDLLSSIATPTTAGSYAEPESILTSLLSIYK 336

Query: 422 DEGYIL--TEHGGKFCVVLKESALPQDMLKSLFQASY 456
            +GY++  +     F V+LKE   P+  L + F A +
Sbjct: 337 TQGYLMWYSRSRNTFLVILKEETAPKVQLDAWFHAIF 373


>gi|322707810|gb|EFY99388.1| DUF647 domain-containing protein [Metarhizium anisopliae ARSEF 23]
          Length = 462

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 67/238 (28%), Positives = 123/238 (51%), Gaps = 4/238 (1%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
           E +       LP GFP SV++DYL Y  + S+Q   S I+ +LA +ALL  +G+G   + 
Sbjct: 70  EWKRFLSDAFLPIGFPHSVSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSS 129

Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            T A +  +L D    ++ I+ ++ FG   + + K +R  AD+  ++AF LE+ +P F  
Sbjct: 130 ATFAMLITILTDATSRIATIVFAHRFGLRIEPDAKRFRFLADIFNDSAFFLELYSPYFGD 189

Query: 233 HFVFIGAAAG-AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
           +   +  + G A R+   +   ++++     FA   N AE+ AK  +Q      IG+++G
Sbjct: 190 YGKILALSVGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLVVG 249

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
             +   +           ++V + H++ N    +S+ + TLN  RA+++F EY+ +G+
Sbjct: 250 TLVVKLVQDHQSVVYLMVALV-FAHLWMNYLGVRSVCMTTLNRQRATILFREYMTTGR 306


>gi|340516831|gb|EGR47078.1| predicted protein [Trichoderma reesei QM6a]
          Length = 448

 Score =  105 bits (261), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 74/252 (29%), Positives = 128/252 (50%), Gaps = 4/252 (1%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
           E R+      LP GFP SV+ DYL+Y  + S+Q   S IS +LA +ALL  +G+G   + 
Sbjct: 65  EWRKALFDSFLPVGFPHSVSKDYLSYQFFDSLQAFFSTISSLLANRALLQGLGVGDASSS 124

Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            T A +  +LKD +  ++ I+ ++ FG   + + K +R  ADL  + AF LE+ +P    
Sbjct: 125 ATFALLLTILKDAMSRIATIVFAHRFGLRIEPDAKRYRFLADLFNDTAFFLELYSPYLGS 184

Query: 233 HFVFIGAAAG-AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
               +  + G A R+   +   ++++     FA + N AE+ AK  +Q      IG+++G
Sbjct: 185 WGKVVALSTGEALRALCGVAAGASKAALSVHFARQDNLAELNAKEASQETAVGLIGLLVG 244

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
             +   + +     L    +V   H+  N    +S+ + T N  RA+++F EYL SG+  
Sbjct: 245 TLVVKTVENHSAVVLLMIVLVL-AHLGMNYLGVRSVCMNTFNRQRATILFEEYLKSGKVL 303

Query: 352 PVKEVNDEEPLF 363
             ++V + E + 
Sbjct: 304 SPEQVANRESII 315


>gi|296473307|tpg|DAA15422.1| TPA: hypothetical protein LOC767862 [Bos taurus]
          Length = 232

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 55/113 (48%), Positives = 71/113 (62%), Gaps = 2/113 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFPDSV+ DYL Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA   W++
Sbjct: 72  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 131

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
           KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P  P  F 
Sbjct: 132 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFT 184


>gi|116004169|ref|NP_001070441.1| UPF0420 protein C16orf58 homolog [Bos taurus]
 gi|61554841|gb|AAX46623.1| hypothetical protein FLJ13868 [Bos taurus]
          Length = 232

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 55/113 (48%), Positives = 71/113 (62%), Gaps = 2/113 (1%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP+GFPDSV+ DYL Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA   W++
Sbjct: 72  LPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWLV 131

Query: 184 KDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV 235
           KD  G L +I+ + + G   D N K WRLFAD+L + A  LE++ P  P  F 
Sbjct: 132 KDSTGMLGRIVFAWWMGSKMDCNAKQWRLFADILNDIAMFLEIMAPILPFCFT 184


>gi|302768148|ref|XP_002967494.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
 gi|300165485|gb|EFJ32093.1| hypothetical protein SELMODRAFT_86604 [Selaginella moellendorffii]
          Length = 412

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 175/389 (44%), Gaps = 24/389 (6%)

Query: 132 SVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK---GAIPTAAAINWVLKDGIG 188
           SV+ DYL Y +W+    +   I   L T +LL A+GL      A   +AAI WV KDG+G
Sbjct: 21  SVSQDYLQYMVWQLPTNITGWICRALITSSLLQAVGLSTEPGSAAAASAAIKWVTKDGLG 80

Query: 189 YLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSA 247
            + ++ +   FG  FD +PK WR++A+++ +     E+ TP  P +F+ + +     ++ 
Sbjct: 81  AVGRLFIGGRFGGVFDQDPKQWRMYAEVIGSFGGLFELATPLAPDYFLLLASLGHLTKAV 140

Query: 248 AALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH--IGSSMPFA 305
           A  ++  +       FA   N  +V AK E   +  + IG+ LG+AL +   + +S    
Sbjct: 141 ARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVALLSTPVVATSYSQL 200

Query: 306 LASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPA 365
             S+ ++   H++   ++  ++EL T+N  R +++  E++       V E N +E L   
Sbjct: 201 ALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHIQGRPLSGVSECNKKENLILP 260

Query: 366 FHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL--GSKLSDVVNNKEDAHALFSLYEDE 423
           +     +      L  + +  K   +EIE   +L    K   +V +K +   L  +    
Sbjct: 261 YTMLMPQIRLGCSLREVITLGK-TPLEIEEAFELYNSEKYVLIVFSKSNMGILLKVLSQ- 318

Query: 424 GYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEI 483
            ++ +      C V ++      +L+S++QA  L+           S S D      L+ 
Sbjct: 319 -FLSSCSCNIKCRVCQDGCENTTLLRSIYQAHVLF----------HSKSFDTFT---LKQ 364

Query: 484 SLDYVQREFNHVKSDSASVGWVTDGLIAR 512
           SL  +Q+ F     +  + GW    L+ +
Sbjct: 365 SLTSMQKSFASFLDELVAAGWNPSKLVVK 393


>gi|395332415|gb|EJF64794.1| hypothetical protein DICSQDRAFT_80493 [Dichomitus squalens LYAD-421
           SS1]
          Length = 506

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 182/411 (44%), Gaps = 40/411 (9%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAAIN 180
           Q  LP  +P SV S Y ++   ++++   + I+ VL  QALL ++G+  +G++  A A+ 
Sbjct: 118 QMFLPTNYPQSVHSSYASFHTLQALETTMATITSVLCNQALLTSVGMSAEGSVFGAVAVQ 177

Query: 181 WVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF---PHHFVF 236
           W++KDG G ++K+  + +F  +FD +PK + LF ++L     GL++ T      P +F+ 
Sbjct: 178 WIIKDGAGEVAKLFFIRHFSTYFDSHPKTFTLFGEVLGCLGSGLQIATVLIAPSPLNFLL 237

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
             A     +     I ++T   F   F+ + N  +V AK E+Q  V++  G   GI+L  
Sbjct: 238 CAAGGNIFKLVGNAIWSTTHIKFIRYFSMQGNDGDVAAKDESQASVAQLAGYAAGISLLT 297

Query: 297 HIGSSMPFALASFSVVTWIH--MYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVK 354
               +  +  A F     +H  M   +    + EL TL   R S +   Y+  G     K
Sbjct: 298 -FSHAPAYLYAIFFAAVPLHLTMTAYMMRVATFELLTLP--RISHLAQTYVNDGVVGSQK 354

Query: 355 EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH 414
           +++ E        F+K    N  + L L+    DA           +  SDV  ++    
Sbjct: 355 DLDREHATGLFGEFYK---NNGDRWLTLAPRVGDAL----------NTSSDV--DRSTWQ 399

Query: 415 ALFSLYEDEGYIL----TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATS 470
               +++D+ Y+L    +  G    V+    A   DML+S+  A+ +  L         S
Sbjct: 400 VCSQVFQDDRYLLLPCDSPRGPLISVLFHPDATTDDMLQSILHAASVRSL--------LS 451

Query: 471 TSADC-APGGRLEISLDYV--QREFNHVKSDSASVGWVTDGLIARPLPNRI 518
             A C  P  R  +S   V  Q++F+  K      GW TD L    L +R+
Sbjct: 452 RDASCPTPSLRTVLSETRVQAQQDFSAFKQALHERGWRTDELCFADLGHRV 502


>gi|402074762|gb|EJT70271.1| hypothetical protein GGTG_12444 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 438

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 78/252 (30%), Positives = 132/252 (52%), Gaps = 9/252 (3%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI-PT 175
           + +F  F LP GFP SVT DYL Y  + S+Q  +S I+ +LA +A+L  +G+G  +  PT
Sbjct: 42  KAIFYAF-LPAGFPHSVTDDYLAYQTYDSLQAFSSSIASLLANRAVLEGLGVGNASQSPT 100

Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPA---FP 231
           AA +  +L+D    L+ I+ ++  G   +   K +R  ADL  +A+  L++L P     P
Sbjct: 101 AALVLQILQDTFSRLATILFAHRMGTAIEPECKTYRFMADLFNDASQLLDLLVPVLPFLP 160

Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
              V + AA    RS   +   ++++   A FA   N AE+ AK  +Q  V   +G+++G
Sbjct: 161 KVSVMVSAA--ILRSLCGVAAGASKASLSAHFAKMGNLAELNAKEASQETVVSLVGMLVG 218

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
            A+  H+            V+  IH+  N ++ + +++ TLN  RA++VF E+L +G+  
Sbjct: 219 TAVV-HMVEDKRAVWCWMIVLIAIHLMMNYRAVRCVKMLTLNRQRATIVFREFLETGKVL 277

Query: 352 PVKEVNDEEPLF 363
              +V   E + 
Sbjct: 278 TPGQVAARESIL 289


>gi|302753580|ref|XP_002960214.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
 gi|300171153|gb|EFJ37753.1| hypothetical protein SELMODRAFT_75120 [Selaginella moellendorffii]
          Length = 381

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/399 (24%), Positives = 175/399 (43%), Gaps = 56/399 (14%)

Query: 132 SVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK---GAIPTAAAINWVLKDGIG 188
           SV+ DYL Y LW+    +   I   L T +LL A+GL      A   +AAI WV KDG+G
Sbjct: 2   SVSQDYLQYMLWQLPTNITGWICRALITSSLLQAVGLSTEPGSAAAASAAIKWVTKDGLG 61

Query: 189 YLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSA 247
            + ++ +   FG  FD +PK WRL+A+++ +     E+ TP  P +F+ + +     ++ 
Sbjct: 62  AVGRLFIGGRFGGVFDEDPKQWRLYAEVIGSFGGLFELATPLAPDYFLLLASLGHLTKAV 121

Query: 248 AALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH--IGSSMPFA 305
           A  ++  +       FA   N  +V AK E   +  + IG+ LG+ L +   + +S    
Sbjct: 122 ARGLKDPSFRVIQNHFAISENVGDVAAKEEVWEVAGQLIGLSLGVVLLSTPVVATSYSQL 181

Query: 306 LASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPA 365
             S+ ++   H++   ++  ++EL T+N  R +++  E++   Q  P+  V +       
Sbjct: 182 ALSWMIIRSFHLWLRYQTLAALELPTMNFKRLNVLIMEHI---QGRPLSGVGE------- 231

Query: 366 FHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVN---NKEDAHALFSLYED 422
                    NK + L+L          +  +++LG  L +V+       +    F LY  
Sbjct: 232 --------CNKKENLILPYTM------LMPQIRLGCSLREVITLGKTPLEIEEAFELYNS 277

Query: 423 EGYIL---------TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSA 473
           E Y+L              KF V  ++      +L+S++QA  L+           S S 
Sbjct: 278 EKYVLIVLSQFLSSCSCNIKFRVC-QDGCENTTLLRSIYQAHVLF----------HSKSF 326

Query: 474 DCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIAR 512
           D      L+ SL  +Q+ F     +  + GW    L+ +
Sbjct: 327 DTF---TLKQSLTSMQKSFASFLDELVAAGWNPSKLVVK 362


>gi|308804794|ref|XP_003079709.1| unnamed protein product [Ostreococcus tauri]
 gi|116058166|emb|CAL53355.1| unnamed protein product [Ostreococcus tauri]
          Length = 499

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 78/270 (28%), Positives = 136/270 (50%), Gaps = 26/270 (9%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--GA 172
           E R +F    LP GFP +V++DY ++  W     +   +  +++ Q+LL A+G+G   GA
Sbjct: 115 ELRAVF----LPSGFPQTVSADYADWLRWHLTSLLFRDVIEIMSAQSLLVALGVGSTPGA 170

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 231
           +P  AA  WVLKDG+G  + ++   FG R +D +PK W   ++ LE+ A  +E++TPA P
Sbjct: 171 LPLTAAAKWVLKDGVGSFATLLAGAFGGRWYDEDPKRWWAVSNALEDVARAIELVTPAAP 230

Query: 232 HHFVFIGAAAGAGRSAAALIQAS-TRSCFYAGFAARRNFAEVIAKGEAQG----MVSKAI 286
             F+ + A+A   R+AA   + S     F   F    N  ++ AK E QG    +V+  +
Sbjct: 231 GLFLPLAASATFVRAAALTGRGSLMNGTFMQHFGRNNNLGDIRAKLEVQGRWLALVALPV 290

Query: 287 GIMLGIALANHIGSSM----------PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYR 336
           GI +   L N   S++            A   + +V   H++   ++ +S++  +LN +R
Sbjct: 291 GIQV-FQLVNAQASALHAEGDELGAAAAAFGGYGLVIGAHIFSCWQAARSLKFDSLNGFR 349

Query: 337 ASLVFSEYLLSG---QAPPVKEVNDEEPLF 363
              +   Y+ S    + P   E+ + E ++
Sbjct: 350 LLRLAEAYVASDGKIELPNCVEIGEIEGVY 379


>gi|402582701|gb|EJW76646.1| hypothetical protein WUBG_12446, partial [Wuchereria bancrofti]
          Length = 184

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 53/143 (37%), Positives = 89/143 (62%), Gaps = 2/143 (1%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPT 175
           + L     +P G+P SV+ DY+NY +W ++Q  AS +S  L+T+A+L  +G+G K A   
Sbjct: 36  KLLLRDIFMPRGYPQSVSPDYMNYQIWDTIQAFASSMSSALSTEAILRGVGVGNKAASTM 95

Query: 176 AAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           AAA+ W+LKDGIG L++I+ +  +  + D + K WRL AD   + A+ L+++TP FP+ F
Sbjct: 96  AAAVAWLLKDGIGMLARILFAWLYSPYLDADCKQWRLIADCFNDLAYCLDLITPIFPNLF 155

Query: 235 VFIGAAAGAGRSAAALIQASTRS 257
           + I   +   R+   +  ++TR+
Sbjct: 156 MPIICLSSMIRAVVGVAGSATRT 178


>gi|322700393|gb|EFY92148.1| DUF647 domain-containing protein [Metarhizium acridum CQMa 102]
          Length = 466

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 66/237 (27%), Positives = 119/237 (50%), Gaps = 4/237 (1%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
           E +       LP GFP SV++DYL Y  + S+Q   S I+ +LA +ALL  +G+G   + 
Sbjct: 70  EWKRFLSDAFLPIGFPHSVSNDYLAYQTYDSLQAFFSTITSLLANRALLQGLGVGDADSS 129

Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            T A +  +L D    ++ I+ ++ FG   + + K +R  AD+  + AF LE+ +P F  
Sbjct: 130 ATFAMLITILTDATSRIATIVFAHRFGLRIEPDAKRFRFLADIFNDTAFFLELYSPYFGD 189

Query: 233 HFVFIGAAAG-AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
           +   +  + G A R+   +   ++++     FA   N AE+ AK  +Q      IG+++G
Sbjct: 190 YGKILALSVGQALRALCGVAAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVG 249

Query: 292 IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
             +   +           ++V   H++ N    +S+ + T N  RA+++F EY+ +G
Sbjct: 250 TLVVKLVQDHQSVVYLMVTLVL-AHLWMNYLGVRSVCMTTFNRQRATILFREYMTTG 305


>gi|310800465|gb|EFQ35358.1| hypothetical protein GLRG_10502 [Glomerella graminicola M1.001]
          Length = 416

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 77/252 (30%), Positives = 131/252 (51%), Gaps = 7/252 (2%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPT 175
           R++     LP G+PDSVT DYL Y  + S+Q   S I+G+L+ +A+L  +G+G   +  T
Sbjct: 2   RQIIYDAFLPIGYPDSVTPDYLGYQTFDSLQAFFSTITGLLSNRAILQGLGVGDPNSSAT 61

Query: 176 AAAINWVLKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHH 233
            A +  +LKDGI  ++ I+ +  FG   +   K +R  AD+  ++AF L + +P F P  
Sbjct: 62  YALLLTILKDGISRVATIVFAYRFGLVIEPECKRYRFLADIFNDSAFFLNLFSPYFDPWT 121

Query: 234 FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIA 293
            V     A A  +   +   ++++     FA R N +E+ AK  +Q      +G+++G  
Sbjct: 122 KVGALVLAEALGAMCGVAAGASKAALSKHFARRNNLSELNAKEASQETAVGLVGLLVGSV 181

Query: 294 LANHI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA-P 351
           +   + G    FAL +F V  ++H+  N    + ++L TLN  RA+++F  Y  + Q   
Sbjct: 182 VVRFVEGREAVFALMTFLV--FVHLGMNYLGVRCVQLATLNQQRATILFEHYARTKQVLT 239

Query: 352 PVKEVNDEEPLF 363
           P +  + E  LF
Sbjct: 240 PAQVASRENILF 251


>gi|384249834|gb|EIE23315.1| hypothetical protein COCSUDRAFT_15701 [Coccomyxa subellipsoidea
           C-169]
          Length = 265

 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 75/238 (31%), Positives = 118/238 (49%), Gaps = 12/238 (5%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG---AIPTAAAI 179
           F LP G+P++VT DYL + LW     +   +S  LAT +LL A+G+G     A   AAAI
Sbjct: 8   FFLPRGYPETVTPDYLRFQLWAIPAHITGWMSTSLATSSLLKAVGVGNSAVGATAAAAAI 67

Query: 180 NWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
            W+ KDGIG   ++++    G  FD +PK WR+ A+        LE+ T   P +F+ + 
Sbjct: 68  KWITKDGIGAAGRVLVGGRLGNVFDEDPKRWRMVAEAFLTVGLALEIATAFSPANFIVL- 126

Query: 239 AAAGAGRSAAALIQASTRSCF---YAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
             AGAG  + A+ +  T  CF      FAA  N  +V AK E   + ++ +G+   + L 
Sbjct: 127 --AGAGNLSRAVGRGMTNPCFRIIQTHFAAASNVGDVAAKEEVWEVSAQLLGLAGSVGLL 184

Query: 296 NHI-GSSMPF-ALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 351
             I  +  P   L  ++ V  +H     KS   ++  ++N  RA L+ + ++     P
Sbjct: 185 KAIEATGKPENVLGVWAAVQVLHSLLRYKSLAVLQFPSVNQKRACLLAAAHVAGNPLP 242


>gi|255074355|ref|XP_002500852.1| predicted protein [Micromonas sp. RCC299]
 gi|226516115|gb|ACO62110.1| predicted protein [Micromonas sp. RCC299]
          Length = 561

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 73/231 (31%), Positives = 114/231 (49%), Gaps = 17/231 (7%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--GAIPTAAAINWV 182
           LP G+PD+V+ DY  +  W     +   +  V+  Q+LL A+G+G   GA+P  AA  WV
Sbjct: 127 LPAGYPDTVSPDYAPFIRWHLGSLMFRNVLEVITAQSLLVALGMGSTPGALPLTAATKWV 186

Query: 183 LKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAA 241
           LKDGIG L+ +   +  G+  D +PK W + ++  E+ A  +E++TPA P  F+ + A A
Sbjct: 187 LKDGIGSLATLAAGSLGGQKCDEDPKRWWMVSNAFEDVARVIELVTPAAPALFLPLAATA 246

Query: 242 GAGRSAAALIQAS-TRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS 300
              R+AA   + S         FA   N A+V A+ E QG     + + +GI +   +  
Sbjct: 247 TFVRTAALTGRGSLVNGSLMQHFARNENTADVRARLEVQGRWLALVALPVGIGIFRTVSQ 306

Query: 301 S-----------MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
           S           +  A+A +  V   H+ C   + + +   TLN  RA LV
Sbjct: 307 SFASDPEEPARDVAIAVALYGGVVCAHLVCIWNAAKVLRFETLN--RARLV 355


>gi|346318934|gb|EGX88536.1| DUF647 domain-containing protein [Cordyceps militaris CM01]
          Length = 467

 Score =  101 bits (252), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 76/253 (30%), Positives = 126/253 (49%), Gaps = 12/253 (4%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AI 173
           E +++     LP GFP SV+SDYL Y  + S+Q   S I+ +LA +ALL  +G+G   + 
Sbjct: 66  EWKKILSDSFLPVGFPHSVSSDYLAYQTFDSLQAFFSTITSLLANRALLQGLGVGDANSS 125

Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-P 231
            T A +  +L+D    ++ I  ++ FG   + + K +R  ADL  ++AF LE+ +P   P
Sbjct: 126 ATFALLLTILRDATSRVATIAFAHRFGLRIEPDAKRYRFLADLFNDSAFFLELYSPYLSP 185

Query: 232 HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 291
              V   +   A R+   +   ++++     FA   N AE+ AK  +Q      IG+++G
Sbjct: 186 WGKVAALSIGEALRALCGVSAGASKAALSVHFAKHDNLAELNAKEASQETAVGLIGLLVG 245

Query: 292 IALA----NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
             +     +H        L  F      H++ N     S+E+ TLN  RA+++F EYL +
Sbjct: 246 TVVVKVVEDHRSVVFLMILLVFG-----HLWMNYLGVCSVEMTTLNRQRATILFQEYLQT 300

Query: 348 GQAPPVKEVNDEE 360
           G+     EV+  E
Sbjct: 301 GKVLSPAEVSKRE 313


>gi|429856893|gb|ELA31783.1| duf647 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 500

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 80/308 (25%), Positives = 151/308 (49%), Gaps = 24/308 (7%)

Query: 69  EYDAVWEVKGSKRTKLIP----------DFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
           EYD      G+ +TK I           D    A  V + S+  L ++    K W +  +
Sbjct: 26  EYDRA----GNLKTKFIESAPRAGPARVDILSCAMHVINGSSVQLPTIR--KKPWRKVLD 79

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
           +F    LP+G+P SVT DY  Y ++ S Q  A  I+G+++++A+   +G+G   A P  A
Sbjct: 80  IF----LPDGYPHSVTDDYAAYQIYDSFQAFAGTIAGMISSRAVWEGMGVGDSMASPIGA 135

Query: 178 AINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
            +  V+++ +G L  I  ++  G   +   K +R+ +D+L + AF L+ L+P FP    F
Sbjct: 136 MMIQVIRESMGRLGTIAFAHLMGTSIEAECKAYRMASDVLCSIAFILDCLSPLFPRSIRF 195

Query: 237 IGAAAGAGRSAAALIQA-STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
           +     +   AA+ +   +++S     FA   N  E+ AK  +Q      +G++ G  + 
Sbjct: 196 LILCCSSILFAASGVAGNASKSSLSGHFAKWNNLGELNAKDASQETAISLMGMLTGTLVV 255

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
           + + +        F ++  I+++ N K  ++++ R+LN  RA++ FS  L   Q     +
Sbjct: 256 SCLTTPTTTWTTLFILIA-IYLFLNWKGVRAVKSRSLNRQRANIAFSALLSKDQVLTPGK 314

Query: 356 VNDEEPLF 363
           ++++E +F
Sbjct: 315 ISEKELIF 322


>gi|224013726|ref|XP_002296527.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968879|gb|EED87223.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 719

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 106/463 (22%), Positives = 180/463 (38%), Gaps = 124/463 (26%)

Query: 94  VVASASNASLSSLLSVNKLWDECREL---------FVQFMLPEGFPDSVTSDYLNYSLWR 144
           V + +S  + S L ++      CR            +   LP  +P SV   Y  Y+ + 
Sbjct: 204 VPSKSSGGTQSPLSTIRSFISSCRTKDMINIQNLPLIHHFLPANYPQSVCPSYATYASYC 263

Query: 145 SVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLS------- 196
            +  +A   + VL+TQALL A+G+G + A P AAA+NWV+KDG+G L  ++ +       
Sbjct: 264 FLGSIAGSSAMVLSTQALLVAVGVGTQSAAPMAAALNWVMKDGVGQLGGVVFASQLGKGG 323

Query: 197 ----------------------------NFGR-HFDVNPKGWRLFADLLENAAFGLEMLT 227
                                       NF R   D NPK WR+ A L  + +  LE+ T
Sbjct: 324 VDVDYWKVKLNKLMGSSSSKSTETSARGNFQRGTADSNPKRWRMVAALALDLSTLLEICT 383

Query: 228 PAF-PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF----------------------- 263
           P   P  F+   + A  G++   L  +++R+  +                          
Sbjct: 384 PWMGPQWFLPCASIANIGKNVGFLAASASRAAVHQSLCTGGSFLPPVPDNTTEKDESYDN 443

Query: 264 --------AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFA--LASFSVVT 313
                   ++  N  +V AK  +Q +V+  +G  LGI L+    S    A  LA F +++
Sbjct: 444 PKKGKNVTSSSNNLGDVTAKSGSQAIVASLLGTALGIWLSRTFCSDYGTAGILAGFVILS 503

Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYL------LSGQAPPVKEVNDEEPLFPAFH 367
            +H  C  ++ +++ LR+++ +R  ++ + Y+      +        +V +EE   P   
Sbjct: 504 AVHQVCTYRALKAVPLRSVDRHRLHIILNAYITADGRSVQSMVLTPSQVAEEESFLPMM- 562

Query: 368 FFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA-HALFSLYED---- 422
                             A D +V   H L +G  L D+  +  D   AL    +D    
Sbjct: 563 ------------------APDNSV---HWLSVGDSLIDICPSGLDELEALLIRRDDISQY 601

Query: 423 EGYILTEH-----------GGKFCVVLKESALPQDMLKSLFQA 454
           E YI+  H            G   +   + A   D+L+ +F A
Sbjct: 602 EKYIMKVHLADTNATTAAVDGIVQLTFLDGATDNDLLQGMFHA 644


>gi|224094527|ref|XP_002310175.1| predicted protein [Populus trichocarpa]
 gi|222853078|gb|EEE90625.1| predicted protein [Populus trichocarpa]
          Length = 333

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 70/233 (30%), Positives = 114/233 (48%), Gaps = 15/233 (6%)

Query: 138 LNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSK-IML 195
           + Y  W+ +  V S    VLATQA+  AIG+G   ++P+AAA+NWVLKDG+G LS+ I  
Sbjct: 1   MGYVKWKFLHRVFSSALQVLATQAMFRAIGIGYSRSLPSAAALNWVLKDGLGRLSRCIYT 60

Query: 196 SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQAST 255
           ++    FD N K  R    +L + + G+E+LTP FP +F+ +   A   +  +     +T
Sbjct: 61  ASLASAFDTNLKRVRFTTSVLFSLSIGIELLTPTFPQYFLLLATLANIAKQISLACYLAT 120

Query: 256 RSCFYAGFAARRNFAEVIAKGEAQGMVSKAIG----IMLGIALANHIGSSMPFALASFS- 310
            S  +  FA   N  EV AK + Q +    +G     +L +   N+   S+ F   SF  
Sbjct: 121 GSAVHRSFAIADNIGEVSAKAQIQTVSFDNLGLMLAALLNMLFKNNQRFSLSFPTMSFCF 180

Query: 311 --------VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
                   + + I ++   +  Q + L+TL   R  ++ + ++  G  P   E
Sbjct: 181 CLPFFVYPIFSAIDLFGIYQGLQHVHLQTLTKDRLEIILNSWIDFGHVPSPAE 233


>gi|303278700|ref|XP_003058643.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459803|gb|EEH57098.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 679

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/303 (27%), Positives = 124/303 (40%), Gaps = 71/303 (23%)

Query: 131 DSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGY 189
           ++VT DY  Y+ +R  Q VAS    V ATQ +L A+GLG    +P AAA+NWVLKDG+G 
Sbjct: 179 NAVTPDYWAYAKFRFAQRVASSCITVFATQQMLQAVGLGATRRLPAAAAVNWVLKDGLGR 238

Query: 190 LSKI-MLSNFGRHF-------DVNPKGWRLFADLLEN----------------AAFGLEM 225
           L K+ + +NFGR F        + P+ WR     L                   AF L +
Sbjct: 239 LGKLSVATNFGREFGAFYTLVPIRPR-WRGERRSLRTFPGGSLAHHTVSIPALGAFQLHL 297

Query: 226 L----TPAF-----------------------------------PHHFVFIGAAAGAGRS 246
                TP F                                   P  F+ +   A  G+S
Sbjct: 298 TPFDSTPTFACMKRPSDSDVKRFRFTSSVVYDCASLIEMITPFYPSKFLLLATIANVGKS 357

Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS------ 300
                    R+     FA   N AE+ AK  AQ +++  IG+ L +      G       
Sbjct: 358 VGITTANVVRAPIQMSFALEENLAEIAAKTSAQQVLADNIGLALAVVATGWTGKIANPRL 417

Query: 301 SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEE 360
            +   L +F+ +  + +YC  +  +S+ LRT+N  R  ++ + ++ +G  P    V   E
Sbjct: 418 RLAVPLLAFAPLASVDLYCIYRELKSVRLRTINKERGEIIAAGFVETGTIPSAVSVAAAE 477

Query: 361 PLF 363
            LF
Sbjct: 478 RLF 480


>gi|378729487|gb|EHY55946.1| hypothetical protein HMPREF1120_04055 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 485

 Score =  100 bits (248), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 73/268 (27%), Positives = 141/268 (52%), Gaps = 15/268 (5%)

Query: 102 SLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQA 161
           S SS  +   + ++C   F    LP G+P +VT DY  Y ++ S+Q  AS I+G+LA++A
Sbjct: 40  SYSSPRTGTSILNKCMSPF----LPAGYPSTVTPDYTPYQIYDSLQAFASTIAGLLASRA 95

Query: 162 LLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENA 219
           +   +G+G + A      + ++ ++ +G ++ I+ ++ F +  +   K +R FAD++ + 
Sbjct: 96  VFVGMGVGSEEASVLTTMLLYIAQETVGRVATILFAHQFSQRIEAEVKFYRFFADIVNDV 155

Query: 220 AFGLEMLTPAFP-HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEA 278
           AF L+ ++PA P    V     + A R+   +   S+++   + FA   N  E+ AK  +
Sbjct: 156 AFVLDCMSPAMPILGRVVTLCLSNACRAVCGVAGGSSKAILSSHFAKAGNIGELNAKDGS 215

Query: 279 QGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT---WIHMYCNLKSYQSIELRTLNPY 335
           Q  V   +G+ +G  + + +  +    LA++S +     +H++ N K+ +S+ L + N  
Sbjct: 216 QETVISLLGMWVGGVVVSKVHGT----LATWSWLVPLLALHLWANWKAVRSVRLNSFNTN 271

Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLF 363
           RA LVF    L G+   +++V  EE + 
Sbjct: 272 RA-LVFFRAALEGRTLDMEQVAQEETVL 298


>gi|396461797|ref|XP_003835510.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
 gi|312212061|emb|CBX92145.1| hypothetical protein LEMA_P048510.1 [Leptosphaeria maculans JN3]
          Length = 543

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 87/308 (28%), Positives = 146/308 (47%), Gaps = 39/308 (12%)

Query: 163 LYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAA 220
           L  +G+G   A PTAA +  VL+D +G  + I+ ++  G  F+   K +R  AD+L +  
Sbjct: 176 LAGVGVGDANASPTAALLLSVLQDSMGRTATILFADRLGTAFEPECKMYRFTADILNDTG 235

Query: 221 FGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
             LE L+PAFP    V + + +   RS   +   S ++   A FA + N  EV AK  +Q
Sbjct: 236 MVLECLSPAFPKPIRVCVLSCSSVLRSLCGVCAGSAKASLSAHFARKGNLGEVNAKDSSQ 295

Query: 280 GMVSKAIGIMLGIALANHIGSSMPFALASFSVVTW---IHMYCNLKSYQSIELRTLNPYR 336
             V   +G++ G  + + I S M    A++S +     IH+  N  + +++ +R LN  R
Sbjct: 296 ETVISLLGMLAGSVVVSWISSPM----ATWSTLIGLLAIHLATNYAAVRAVSMRCLNRQR 351

Query: 337 ASLVFSEYLLSGQAPPVKEVNDEEPLFP-----------AFHFFKIKSANKSQLLVLS-S 384
           A+++FS  L  GQ     EV+  E +F            A  + +I  + ++ L  +   
Sbjct: 352 ANILFSRLLEHGQLLSPLEVSRRERVFEHDGVLRWSDDMAIGYCRIGVSLQTLLSHMGLR 411

Query: 385 EAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHGGKFCVVLKESA 442
            A+  ++ +      G+++S+          L  ++ DE YIL      G F +VLKE  
Sbjct: 412 NARTGSLHLR-----GAEISE----------LLHVFVDEAYILWPASSAGHFLIVLKEGC 456

Query: 443 LPQDMLKS 450
            P D LK+
Sbjct: 457 TPMDQLKA 464



 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 29/39 (74%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL 163
           LP G+P SVT DY+ Y ++ S+Q  +S I+ +LA++A+L
Sbjct: 58  LPAGYPHSVTEDYIQYQIYDSLQAFSSSIASMLASRAVL 96


>gi|315056527|ref|XP_003177638.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311339484|gb|EFQ98686.1| DUF647 domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 495

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 92/355 (25%), Positives = 171/355 (48%), Gaps = 25/355 (7%)

Query: 120 FVQFMLPEGFPDSVTSDYL-NYSLWRSVQG--VASQISGVLAT--------QALLYAIGL 168
            +   LP G+P SVT DY+ + SL+R  +   + S ISG+ ++          +  ++G+
Sbjct: 58  ILNVFLPVGYPHSVTDDYIESSSLFRDYEKKELTSLISGLTSSILKLNCWSPVIQSSVGV 117

Query: 169 GKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEML 226
           G   A PTAA +  VL++ +G ++ I+ ++  G   +   K +RL ADLL ++A  L+ +
Sbjct: 118 GDATASPTAALLLNVLQESMGRIATILFAHRLGTSLEPECKMYRLAADLLNDSAIVLDCM 177

Query: 227 TPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKA 285
           +PAFP    V + + +   R+   +   S+++   A FA   N  E+ AK  +Q  V   
Sbjct: 178 SPAFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAKDSSQETVISL 237

Query: 286 IGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
            G+++G  + ++I S +   +A   ++  +H+  N  + +++++ TLN  RA++VFS   
Sbjct: 238 FGMLVGSVVVSYISSPLATWIA-LILLLIVHLGTNHAAVRAVKMTTLNRQRANIVFSYLF 296

Query: 346 LSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS------SEAKDAAVEIEHRLQL 399
              +     + ++EE +F      + K+     L +                ++    Q 
Sbjct: 297 EDDRVLTPAQASEEERIFETDGVLRWKAKATGTLGICQIGVSLEQLLLLLPEQVNTNAQG 356

Query: 400 GSKLSD--VVNNKEDAHALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLKS 450
            S+  D  +V       AL  L+++E YIL  +    +  +VLK  A PQ  LK+
Sbjct: 357 SSQKPDTTMVGASTGLSALLELFKEEEYILYFSPAARRGAIVLKAGATPQAQLKA 411


>gi|307111734|gb|EFN59968.1| hypothetical protein CHLNCDRAFT_133075 [Chlorella variabilis]
          Length = 434

 Score = 99.4 bits (246), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 93/364 (25%), Positives = 158/364 (43%), Gaps = 75/364 (20%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIP---TAAAI 179
           F LP G+P SVT DYL Y L      +   +S  LAT +++ A+G+  G      T+AA+
Sbjct: 38  FFLPTGWPHSVTPDYLRYQLASVPAHITGWMSHSLATSSMITALGVDAGPATIAATSAAV 97

Query: 180 NWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHH----- 233
            W+ KDG+G   ++++       FD +PK WR+ A+ +      +E+ T  +P       
Sbjct: 98  KWLTKDGLGAAGRLIVGGRLSSVFDEDPKRWRMVAEGVTTVGLAMEIATQIYPGGLRSKF 157

Query: 234 ------FVFIGAAAGAGRSAAALIQASTRSCF---YAGFAARRNFAEVIAKGEAQGMVSK 284
                 + +  A AG G    ++ +     CF      F+A  N  +V AK E   + ++
Sbjct: 158 RVVSVVWSYFIALAGTGTLCKSMAKGMGTPCFRIIQTHFSATNNVGDVAAKEEVWEVAAQ 217

Query: 285 AIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELR----TLNPYRASLV 340
            +G+                   + SVV          + Q++E      T+ P RA+L+
Sbjct: 218 LLGL-------------------AASVV----------ALQALEAAGSPDTVVPKRAALM 248

Query: 341 FSEYLLSGQAPPVKEVNDEEPLF-------PAFHFFKIKSANKSQLLVLSSEAKDAAVEI 393
            S ++ +G  P V++VN  E +        P+ H+         Q L             
Sbjct: 249 ASSHVRTGLVPSVEQVNLSEQVLLPPTMCQPSVHY----GCTLDQALGSGGGGFGGGAAS 304

Query: 394 EHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQ 453
             R Q   +L+++V           LY+ E Y+LT H G   V++ E A   D+L++++Q
Sbjct: 305 GGRGQ---QLAELVR----------LYDGEQYMLTWHRGAAWVLVLEGAQRADLLRAMWQ 351

Query: 454 ASYL 457
           A++L
Sbjct: 352 AAWL 355


>gi|392585079|gb|EIW74420.1| DUF647-domain-containing protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 515

 Score = 99.0 bits (245), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 168/384 (43%), Gaps = 56/384 (14%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           D  R + V   LP G+P SVT DYL Y  + ++Q   S ++G++A++  L  +G+G   A
Sbjct: 53  DRIRRVLVDLFLPAGYPASVTPDYLPYHAYNALQAFFSSLAGLIASRGALQGVGVGDASA 112

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTP--- 228
             T+A +  VL+D  G ++ I  +  FG       K +RL ADL+ +A+  L+ L P   
Sbjct: 113 TATSALLLTVLQDVFGRITTISAAYVFGTSLYPETKTFRLVADLVNDASIVLDTLCPRLS 172

Query: 229 -AFPHHFV-------------FIGAAAGAGRSA-----AALIQASTRSCFYAGFAARRNF 269
            +F H F+               G  AG  ++A     A  + AST +   A  A   + 
Sbjct: 173 LSFLHTFLLAISLCASAALRALCGLVAGGSKAALTNHFACTVPASTAAGGEAVVAG--DV 230

Query: 270 AEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIEL 329
            E+ AK  ++  V    G++LG  L   + S     L    ++   H+  N  + +S+ L
Sbjct: 231 GELNAKDASRETVVGLFGMLLGTLLVPRLTSVRATWLTLVFLIAG-HLLANYLAVRSVSL 289

Query: 330 RTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFH----------FFKIKSANKSQL 379
           RTLN  RA ++++ +     A  ++E  D + + P+                + + + ++
Sbjct: 290 RTLNRQRAGILWTAW-----AARMREREDADDVSPSISSLAKAGGQLALTPAEVSRRERI 344

Query: 380 LVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSL-------YEDEGYIL----- 427
           L L    +DA        +LG+ +S VV      +   +L       +  E Y+L     
Sbjct: 345 LALPGTLRDARGHSIGYCELGASVSRVVARGGAENVRRALQSSSDAKFAKERYMLFALPR 404

Query: 428 --TEHGGKFCVVLKESALPQDMLK 449
              E   +  V  KE A P+D LK
Sbjct: 405 RRVESKVELTVCFKEGATPRDQLK 428


>gi|444725757|gb|ELW66311.1| hypothetical protein TREES_T100000484 [Tupaia chinensis]
          Length = 406

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 66/224 (29%), Positives = 110/224 (49%), Gaps = 11/224 (4%)

Query: 144 RSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFGRHF 202
           R     AS +SG LATQA+L  IG+G   A  +AA   W++KD  G L +I+ + +    
Sbjct: 46  RPFAAFASSLSGSLATQAVLLGIGVGNARASVSAATATWLVKDSTGMLGRIVFAWW---- 101

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAG 262
               KG  LFAD+L + A  LE++ P +P  F      +   +   ++   +TR+     
Sbjct: 102 ----KG--LFADILNDVAMFLEIMAPIYPIFFTMTICTSNLAKCVVSVAGGATRAALTMH 155

Query: 263 FAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLK 322
            A R N A+V AK  +Q  +    G+++ + +   +      +L  F ++T +H+Y N +
Sbjct: 156 QARRNNMADVAAKDGSQETLVNLAGLLVSLLMLPLVSDFPSVSLGCFFLLTALHIYANYR 215

Query: 323 SYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
           + +++ + TLN  R  LV   +L +G        N  EPL+  F
Sbjct: 216 AVRALVMETLNEGRLRLVLKHFLRTGNVLDPTSANQMEPLWTGF 259


>gi|449542984|gb|EMD33961.1| hypothetical protein CERSUDRAFT_117486 [Ceriporiopsis subvermispora
           B]
          Length = 505

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 108/409 (26%), Positives = 175/409 (42%), Gaps = 59/409 (14%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAA 178
           F Q  LP  +P SV   Y+ + + + V+G    +  VL  QALL ++G+  +G+I  A A
Sbjct: 114 FRQMFLPTNYPQSVHRSYMPFHVLQFVEGTLGTLVSVLCNQALLTSVGVSAEGSIFGAVA 173

Query: 179 INWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEM---LTPAFPHHF 234
           + W++KDG G ++K+  +  F  +FD +PK + L  ++L     GL+M   L    P  F
Sbjct: 174 VQWIIKDGAGEVAKLFFIRKFSPYFDSHPKTFTLSGEVLVALGSGLQMATLLVNPTPLSF 233

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
           +   A   A +     I  +T   F   F+ + N  +V AK E+Q  V++  G   GI L
Sbjct: 234 LICAAGGNAFKLIGYAIWFTTHIKFIRYFSQQGNTGDVAAKDESQASVAQLAGYAAGIGL 293

Query: 295 ------ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
                  +++ +    A+ +  V T + M C      + EL T+   R SL+   Y   G
Sbjct: 294 LTLSHDPSYLYTIFALAVPTHLVATTLMMRCA-----TFELLTIP--RLSLLAETYSRKG 346

Query: 349 QAPPVK--EVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDV 406
              P+   E      +F  F+        K Q  ++S               L  ++++V
Sbjct: 347 DVMPLSRLEAKRATGMFGEFY-------GKGQGKLVS---------------LAPRVAEV 384

Query: 407 VNNKEDAHALFSL----YEDEGYIL----TEHGGKFCVVLKESALPQDMLKSLFQASYLY 458
           V ++E   A ++L      DE Y+L            V L+  A   DML+++  A+ L 
Sbjct: 385 VKDEEGERARWALCTEVCRDERYLLYPPAAPQANPASVFLRPDASSDDMLRAVLHAARLR 444

Query: 459 WLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTD 507
            L   A  + T   A       L+ S ++ Q  F   K      GW TD
Sbjct: 445 SL--LADDIVTLREA-------LKDSHEWTQEHFMTFKGALDDKGWRTD 484


>gi|302837684|ref|XP_002950401.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
           nagariensis]
 gi|300264406|gb|EFJ48602.1| hypothetical protein VOLCADRAFT_90737 [Volvox carteri f.
           nagariensis]
          Length = 500

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 117/243 (48%), Gaps = 19/243 (7%)

Query: 139 NYSLWRSVQGVASQISGVLATQALLYAIGLGKGA-IPTAAAINWVLKDGIGYLSKIMLSN 197
            +  W ++Q ++S + GVL +QA+L  +G+G+ A  P AA   + ++D  G L  I+ S 
Sbjct: 3   GFQTWDTIQALSSYVRGVLTSQAILRGVGVGQEASTPLAAVFTFFVRDLAGMLGGILFSY 62

Query: 198 F-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-HFVFIGAAAGAGRSAAALIQAST 255
             G  FD   K WRLFAD+  +    +E+ +P FP   F+ I       RS   +   +T
Sbjct: 63  LEGSSFDACAKQWRLFADITNDLGMTVELASPLFPRVMFLPIACLGSVARSLTGVAGGAT 122

Query: 256 RSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM-----PFALASFS 310
           R+     FA R N A+V AK ++Q   +  +G++ G+A+   + +       P   A F 
Sbjct: 123 RAALTQHFARRGNAADVSAKEQSQETATTILGMVTGMAVTRLLAAGEGPAGGPTGAAGF- 181

Query: 311 VVTWIHMYCNLK---------SYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEP 361
           V+ W+ ++  L          + + + L +LN  R  L+   YL  G+    + V+  E 
Sbjct: 182 VIAWL-VFLGLTAVHVVANVAAMRVLLLTSLNTPRLELLVDRYLQDGRVLSPRAVSQLED 240

Query: 362 LFP 364
           L P
Sbjct: 241 LTP 243


>gi|296411034|ref|XP_002835240.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295628015|emb|CAZ79361.1| unnamed protein product [Tuber melanosporum]
          Length = 413

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 98/398 (24%), Positives = 178/398 (44%), Gaps = 36/398 (9%)

Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
           S+Q  +S I+G+L+++A+L   G+G   A  T A +  + +D +  L+ I+ ++ FG   
Sbjct: 6   SLQAFSSSIAGLLSSRAVLEGFGVGDANASATGALLLTIAQDCVSRLATILFAHRFGPAL 65

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYA 261
           +   K +RL AD+  ++A  L+ L+PAFP    + +   +G+ R+A  +    +++    
Sbjct: 66  EAETKRYRLAADVFNDSAMILDCLSPAFPRSVRILMLCCSGSLRAACGVAGGGSKASLSV 125

Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
            FA   N  E+ AK  +Q  V   +G++ G  + +HI S+    + S +++  +H+  N 
Sbjct: 126 HFARSGNVGELNAKDSSQETVIGLMGMLAGSFVVSHISSTWATWI-SLTLLQAVHLGTNF 184

Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLV 381
           K+  ++ LRTLN  RA++V S +  +G+    ++    E +F            K  +L 
Sbjct: 185 KAVTAVALRTLNRQRANIVCSAFFDTGEILTPRQAARHERIF-----------EKDGVLR 233

Query: 382 LSSEAKDAAVEIEHRLQ-LGSKLSDVVNNKEDAHA-------LFSLYEDEGYIL--TEHG 431
                     EI  +L  L   L      K++          L S++++E Y+L      
Sbjct: 234 WGGSGVLGNAEIGVKLTALIRSLRQSPERKKEGEGEGVSPVELASVFQNEKYLLWYDVQR 293

Query: 432 GKFCVVLKESALPQDMLKSLFQASYL------YWLERNAGIVATSTSADCAPGGRLEISL 485
               + LKE   P++ LKS   A  L      +W +  A    ++  +  A    +  SL
Sbjct: 294 RLVTICLKEKVSPEEQLKSWVHALVLAREIRHHWPKEGAD--NSNRPSQQALLDTVRRSL 351

Query: 486 DYVQREFNHVKSDSASVGWVTDGLIARPLPN---RIRP 520
           D +   F  +       GW  D  +   +P     IRP
Sbjct: 352 DDIGMSFTPMLGRLKGAGWDLDAAVLETVPGYRVSIRP 389


>gi|170118219|ref|XP_001890291.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164634725|gb|EDQ99048.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 471

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/368 (25%), Positives = 169/368 (45%), Gaps = 47/368 (12%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTA 176
           +L  +  LP G+PDSV+ DYL Y +  ++Q   + + G+L+++A+L   G+G   A  T 
Sbjct: 47  DLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLVGLLSSRAILEGFGVGDPSATATN 106

Query: 177 AAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-----F 230
           A +  VL+D    L+ I+ ++  G     + K +RL AD+L +AA  L+ ++P      F
Sbjct: 107 ALLLTVLQDVFSRLTTIVAAHILGSSLAPDVKKYRLLADMLNDAAVILDTISPRLDTLFF 166

Query: 231 PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAA----RRNFAEVIAKGEAQGMVSKAI 286
           P   V     +   RS  A+    +++     FA       +  ++ AK  ++  V   +
Sbjct: 167 PGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKETVLALL 226

Query: 287 GIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           G++LG  +  H+  + P+ +  +  V+  +H+  N    + + LRTLN  R  L +S Y+
Sbjct: 227 GMLLGTLIVPHL--TTPWTIYTTLLVLVGLHLTINYIGVRGLVLRTLNGQRTWLAWSAYM 284

Query: 346 LS--GQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKL 403
            +  G+AP   EV   E +F     F+     +S +L+              R  +GS  
Sbjct: 285 RTQPGRAPTPDEVASLERIFERPGAFR---DPRSGVLM-------------GRCTIGSSF 328

Query: 404 SDVVNNKEDAHALFSLYEDEGYIL--------------TEHGGKFCVVLKESALPQDMLK 449
           S++++       L  L+++E Y+L               E   +  + LKE    +D LK
Sbjct: 329 SEILSGPIPPQ-LLELFKEERYLLWFDRRCLLRPESPSVEGFLRLHICLKEGYNTEDQLK 387

Query: 450 SLFQASYL 457
           +   A+ L
Sbjct: 388 AWIHAAEL 395


>gi|164423593|ref|XP_001728067.1| hypothetical protein NCU11380 [Neurospora crassa OR74A]
 gi|157070159|gb|EDO64976.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 357

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 121/242 (50%), Gaps = 26/242 (10%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP G+P SVT DYL Y  + S+Q   S IS +LA +A+L  +G+G   + PTAA I  + 
Sbjct: 44  LPAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLEGLGVGDASSSPTAALILKIT 103

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF----PHHFVFIG 238
            D I   + I+ ++  G   +   K +R  AD+L +AA  L++L P        + + +G
Sbjct: 104 SDTISRFATILFAHRLGTAIEPECKFFRFLADVLNDAAQLLDLLMPVMMMGETRYGLKLG 163

Query: 239 AAAGAG--RSAAALIQASTRSCFYAGFAARR-NFAEVIAKGEAQGMVSKAIGIMLGIALA 295
              G+G  RS   +   ++++   A F  +R +  E+ AK  +Q  V   +G+++G    
Sbjct: 164 VVVGSGVLRSWCGVAAGASKASLSAHFVTQRSSLGELNAKEASQETVVSLLGMLVG---- 219

Query: 296 NHIGSSMPFALASFSVVTW--------IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
                S+   +    +V W        +H+  N +  + + LRTLN  RA++V   +L +
Sbjct: 220 -----SLVVKVVQDRLVVWVLMVGLLGVHLLMNYRGVRCVRLRTLNRQRATMVVRSWLET 274

Query: 348 GQ 349
           G+
Sbjct: 275 GR 276


>gi|218187487|gb|EEC69914.1| hypothetical protein OsI_00335 [Oryza sativa Indica Group]
          Length = 386

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 71/220 (32%), Positives = 107/220 (48%), Gaps = 21/220 (9%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL----GKGAIPTAAA 178
           F+LP GFP SV+ DYL Y LW+    V   I   L T  LL A+G+    G  A  +AAA
Sbjct: 70  FVLPAGFPGSVSDDYLEYMLWQLPTNVTGWICHTLVTSTLLKAVGVGSFTGTSAAASAAA 129

Query: 179 INWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           I WV KDGIG   ++++   FG  FD +PK WR++AD + +A    E++TP +P +F+ +
Sbjct: 130 IRWVSKDGIGAFGRLLIGGRFGTLFDDDPKKWRMYADFIGSAGSIFELITPLYPGYFLPL 189

Query: 238 GAAAGAGRS---AAALIQASTRSCFYAGFAARRN-FAEVIAKGEAQGMVSKAIGIMLGIA 293
            +     +    +  L QA  R      F   +N FAE    GE     +          
Sbjct: 190 ASLGNLAKKIIDSGLLKQAVGRGFRDPSFRVIQNHFAESGNLGEVAAKDTSG-------- 241

Query: 294 LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
               I SS P    ++  V  +H++   +S   ++ RT++
Sbjct: 242 ----IKSSYPTLALTWLGVRLLHLWFRYQSLSVLKFRTVD 277


>gi|397615116|gb|EJK63231.1| hypothetical protein THAOC_16128 [Thalassiosira oceanica]
          Length = 621

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 80/272 (29%), Positives = 118/272 (43%), Gaps = 45/272 (16%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI-GLGKGAI 173
           +CR+   +  +PEG    VT  Y  +  WR +Q   +    V+ TQ+LL  + G+ +  +
Sbjct: 146 KCRQSLHKNFVPEG----VTPSYYRFMRWRILQRFMNANVHVIGTQSLLMGLRGMQRSTV 201

Query: 174 PTAAA-----------INWVLKDGIGYLSKIM-LSNFGRHFDVNPKGWRLFADLLENAAF 221
            +AAA            NWVLKD +G L ++   S  GR FD + K WR  A  +     
Sbjct: 202 VSAAAKGGAALGAVAATNWVLKDTLGKLVRMAWASKMGRKFDSDAKRWRYRACFIYALGN 261

Query: 222 GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGF------------------ 263
           GLE+ T   P  F+ +   A + +  + L  ++TR+  Y  F                  
Sbjct: 262 GLEVSTYLHPRFFLVLAMLANSCKQVSMLTSSATRNALYNSFRRMDDDVDTVGTGSNSTL 321

Query: 264 -----AARR-----NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
                  RR     N  ++ AKGEAQ  V   +GI  GI L+  IG S+   L  +SV+ 
Sbjct: 322 KSSINGGRRRGVVENIGDITAKGEAQIAVVDLLGIASGICLSRAIGVSVQNVLTVWSVLQ 381

Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
              +YC     +SI  +TLN  R   V    L
Sbjct: 382 LGEIYCMYHEIRSIVYQTLNFERMYTVLGRLL 413


>gi|428168470|gb|EKX37414.1| hypothetical protein GUITHDRAFT_89666 [Guillardia theta CCMP2712]
          Length = 331

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 54/165 (32%), Positives = 92/165 (55%), Gaps = 3/165 (1%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPT- 175
           R ++  F LP G+P S+  +Y  Y  +  +Q ++S + G+L+TQ +L  +G+GK  + + 
Sbjct: 124 RIMYATF-LPSGYPASLPDEYTKYQAFNLLQDLSSNLRGILSTQKILEGMGVGKAGVTSL 182

Query: 176 AAAINWVLKDGIGYLSKIMLSNFGRH-FDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           AA + W+ +DG   L  ++ + F    F ++ K WRLFADL+ + A  L+ML PA+P  F
Sbjct: 183 AATMQWMARDGSSMLGGLLFTAFASSSFGIDIKRWRLFADLINDVALLLDMLAPAYPDLF 242

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
           + I   +   ++   +   +  +     +AAR N A+V AK  AQ
Sbjct: 243 LPIICLSSVFKAMCGVAAGACNTAIVQHWAARGNIADVGAKTGAQ 287


>gi|413942078|gb|AFW74727.1| hypothetical protein ZEAMMB73_469739, partial [Zea mays]
          Length = 199

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 50/132 (37%), Positives = 79/132 (59%), Gaps = 6/132 (4%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
           D+  + F+    P G+P SV   YL Y+ +R +Q  +S +  VL+TQ+LL+A+GL     
Sbjct: 69  DKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAVGLR--PT 126

Query: 174 PT-AAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
           PT A A++W+LKDG+ +  K++ S+ G   D  PK WR+ AD+L +    LE+++P  P 
Sbjct: 127 PTQATAVSWILKDGMQHTRKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQ 186

Query: 233 HFVFIGAAAGAG 244
            F+ +   AG G
Sbjct: 187 LFLEV---AGLG 195


>gi|403418386|emb|CCM05086.1| predicted protein [Fibroporia radiculosa]
          Length = 497

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 98/394 (24%), Positives = 178/394 (45%), Gaps = 65/394 (16%)

Query: 107 LSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI 166
            S+ +   + R L     LP G+P SV+ DYL Y ++ ++Q   S ++ ++A++A+L   
Sbjct: 34  FSIKQKRTDWRRLLSNVFLPAGYPASVSPDYLQYQIYNALQAFCSSLASLIASRAVLEGH 93

Query: 167 GLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLE 224
           G+G   A  T A    VL+D    L+ I+   + G       K +RL AD+  +AA  L+
Sbjct: 94  GVGNADASATHAIFLTVLQDIFSRLTTIVSGYYLGTSLFPEAKTYRLLADIFNDAAIVLD 153

Query: 225 MLTP-------AFPHHFVFIGAA----------AGAGRSAAALIQASTRSCFYAGFAA-- 265
            L+P       +  + FV  G+           +GA R+   ++   +++     FA+  
Sbjct: 154 TLSPHLDGVVLSLRYPFVLSGSGSSLRVSALCLSGASRALCGMVAGGSKAALTVHFASPG 213

Query: 266 --RRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
               +  ++ AK  ++  V   +G++ G A+ +++ S+    L  F V+T++H++ N  +
Sbjct: 214 QGTGDVGDLNAKDGSKETVLALLGMLSGTAVMHYVHSARATYLVLF-VLTFLHLFGNYLA 272

Query: 324 YQSIELRTLNPYRASLVFSEYL------LSGQAPPV---KEVNDEEPLFPAFHFFKIKSA 374
            + + LR+LN  RA++V+  Y       + G+ P V   + V   E +F          A
Sbjct: 273 VRVVVLRSLNRQRANIVWMAYRAHIKGPMGGKRPLVLTPEAVAARERIF----------A 322

Query: 375 NKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDA-------------HALFSLYE 421
           N S L   SS A           ++GS  S + + ++ A             H +  ++ 
Sbjct: 323 NPSALHDTSSPAH----PFLGICKVGSSFSSLASRQQSAPRGALAAPSPPQIHTILEIFS 378

Query: 422 DEGYIL--TEHGGKFC---VVLKESALPQDMLKS 450
           +E Y+L      G  C   V LK+   P+D LK+
Sbjct: 379 EEPYVLWFGPTSGPSCRLHVFLKDGHKPRDHLKA 412


>gi|409048331|gb|EKM57809.1| hypothetical protein PHACADRAFT_251666 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 532

 Score = 95.5 bits (236), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 103/440 (23%), Positives = 178/440 (40%), Gaps = 54/440 (12%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAAIN 180
           Q  LP  +P SV   YL + L + ++ +   + GVL  QALL ++G+  +G+I  A A+ 
Sbjct: 112 QMFLPTNYPQSVHPSYLKFHLLQFMECIVGTVVGVLCNQALLVSVGVSAEGSILGAVAVQ 171

Query: 181 WVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLE---MLTPAFPHHFVF 236
           WV+KDG G ++K+ ++  F  +FD +PK +      +     GL+   ++ P    +F+ 
Sbjct: 172 WVIKDGAGEVAKLWVIRRFSPYFDSHPKTFTFTGATMGLLGSGLQIAALVVPPTTLNFLL 231

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
             A     +     +  +T   F   FA + N  +V AK E+QG V    G   G+ L  
Sbjct: 232 CAAGGNIFKCVGGAVWLTTHIKFVRFFARQGNMGDVAAKSESQGSVGILSGYAAGVGLLT 291

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG-------Q 349
            +  S P+    F++   IH+       ++     L   R S +  E+           +
Sbjct: 292 -VSHSAPYLYTIFALAIPIHLLITQWMLRAATFELLTLPRLSFLAREFASRASRPEAERE 350

Query: 350 APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN 409
            P + E+  +  +     +FK K              +D  V+I   L      +D +  
Sbjct: 351 LPTLGELERQRAIGVFGEYFKTK--------------EDQYVQIAPDLDAVVSPTD-LGA 395

Query: 410 KEDAHALFSLYEDEGYIL-----TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNA 464
           +    A    + D+ Y+L      +   K  V+L  SA   D ++S+  A++   L+R+ 
Sbjct: 396 RTRWDACVEAFNDQPYLLYPYLMQQSECKIAVLLHPSATVDDRIQSVLHAAF---LQRHL 452

Query: 465 GIVATSTSADCAP------GGRLEISLDYVQR----EFNHVKSDSASVGWVTDGLIARPL 514
             V     A+  P         L+ +LD  +R         K+      W TD +    L
Sbjct: 453 HNVMQIDPAESTPQELLDNDKVLKAALDDAKRLVALYLREFKTQLVEKEWRTDEIAVPEL 512

Query: 515 PNRI--------RPGYVEPS 526
             R+        +P  VE S
Sbjct: 513 GRRVLWGTAAIQKPKRVEKS 532


>gi|70991459|ref|XP_750578.1| DUF647 domain protein [Aspergillus fumigatus Af293]
 gi|66848211|gb|EAL88540.1| DUF647 domain protein [Aspergillus fumigatus Af293]
          Length = 542

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/236 (29%), Positives = 117/236 (49%), Gaps = 26/236 (11%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
           L V   LP G+P+SVT DYL                      A+L  +G+G   A PT+A
Sbjct: 69  LLVDVFLPSGYPNSVTDDYL---------------------PAVLQGVGVGNANASPTSA 107

Query: 178 AINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
            +  +L+D  G ++ I+ ++  G   +   K +RL AD+  + A  L+ L+P  P     
Sbjct: 108 LLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVLDCLSPMIPAGVGR 167

Query: 237 IG--AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
           +G  +AAG  R+   +   S+++   A FA   N AEV AK  +Q  +   +G+++G  +
Sbjct: 168 VGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLLGMLVGSFI 227

Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
            +H+ +S      S   +  +H+  N  + +S+++ +LN  RA++VFS  L S  A
Sbjct: 228 VSHV-TSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRANIVFSTLLESDPA 282


>gi|159124134|gb|EDP49252.1| DUF647 domain protein [Aspergillus fumigatus A1163]
          Length = 542

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/236 (29%), Positives = 117/236 (49%), Gaps = 26/236 (11%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAA 177
           L V   LP G+P+SVT DYL                      A+L  +G+G   A PT+A
Sbjct: 69  LLVDVFLPSGYPNSVTDDYL---------------------PAVLQGVGVGNANASPTSA 107

Query: 178 AINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVF 236
            +  +L+D  G ++ I+ ++  G   +   K +RL AD+  + A  L+ L+P  P     
Sbjct: 108 LLLHILQDSSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMVLDCLSPMIPAGVGR 167

Query: 237 IG--AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
           +G  +AAG  R+   +   S+++   A FA   N AEV AK  +Q  +   +G+++G  +
Sbjct: 168 VGVLSAAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLLGMLVGSFI 227

Query: 295 ANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
            +H+ +S      S   +  +H+  N  + +S+++ +LN  RA++VFS  L S  A
Sbjct: 228 VSHV-TSFAATWTSLVFLLTVHLAMNYAAVRSVQMTSLNRQRANIVFSTLLESDPA 282


>gi|413949258|gb|AFW81907.1| hypothetical protein ZEAMMB73_025152 [Zea mays]
          Length = 253

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 49/132 (37%), Positives = 72/132 (54%), Gaps = 1/132 (0%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-A 172
           +E       +++PEGFPDSVT  Y+ Y  WR+++       GV  T+ LL ++G+ +  A
Sbjct: 116 EEIMSFIRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTRTLLNSVGVSQSRA 175

Query: 173 IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
              A AINW+LKDG G + K+  +  G+ FD + K  R   DLL     G+E+ T AFP 
Sbjct: 176 ASGAVAINWILKDGAGRVGKMFFARQGKKFDYDLKQLRFSGDLLMELGAGIELATAAFPQ 235

Query: 233 HFVFIGAAAGAG 244
            F+ +   A  G
Sbjct: 236 FFLPMACIANVG 247


>gi|224002080|ref|XP_002290712.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974134|gb|EED92464.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 656

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 75/282 (26%), Positives = 122/282 (43%), Gaps = 45/282 (15%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI-GLGK-G 171
           D   + + QF+     P+ VT  Y  +  WR +Q   +    V+ TQ+LL  + G+ + G
Sbjct: 139 DTLVQKWTQFLHSAFVPEGVTPSYYRFMRWRILQRFINANVHVIGTQSLLMGLRGMTRSG 198

Query: 172 AIPTAAA--------------INWVLKDGIGYLSKIM-LSNFGRHFDVNPKGWRLFADLL 216
           ++  AA+               NWVLKD +G + +++  S  GR FD + K WR  + L+
Sbjct: 199 SVGEAASFAAKEGAALGAAAATNWVLKDTLGKIVRMVWASKMGRKFDPDAKRWRFRSALI 258

Query: 217 ENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFA------------ 264
                GLE+ T   P +F+ +   A + +  + L  ++TR+  Y  F             
Sbjct: 259 YALGNGLEVSTYLHPQYFLLLAMLANSCKQMSMLTSSATRNALYNSFKRVGTTSSTVSTS 318

Query: 265 ---------ARRNFA-------EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALAS 308
                     RR          ++ AKGEAQ  V   +GI  GI L+  +G S+    A 
Sbjct: 319 NNASGSSAIVRRGGNGGIENIGDITAKGEAQIAVVDLLGIASGICLSRAVGVSVQNVFAV 378

Query: 309 FSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
           + V+    ++C     +SI  RT N  R  +V  + +L   A
Sbjct: 379 WVVLQIGEIFCMYHEMRSIVYRTFNFERMYMVLGKLMLDDAA 420


>gi|407259493|gb|AFT91377.1| hypothetical protein [Emericella rugulosa]
          Length = 549

 Score = 94.4 bits (233), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 70/235 (29%), Positives = 119/235 (50%), Gaps = 21/235 (8%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
           R++ +   LP G+P SV+ DY                +  LA  A+   +G+G   A PT
Sbjct: 66  RDVLIDVFLPAGYPHSVSDDY----------------TPALAQPAIHAGVGVGNANASPT 109

Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           +A +  +L+D  G ++ I+ ++  G   +   K +R  AD+  + A  L+ L+P  P  F
Sbjct: 110 SALLLHILQDTSGRIATILFAHRVGTALEPECKMYRFAADIFNDLAMLLDCLSPMIPAGF 169

Query: 235 --VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
             V + +AAG  R+   +   S+++   A FA   N A+V AK  +Q  V   IG+++G 
Sbjct: 170 NRVTVLSAAGVLRALCGVAGGSSKASLSAHFARWGNLADVNAKDSSQETVISLIGMLVGS 229

Query: 293 ALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
            + +HI S     L    +V+ +H+  N  + +S+++ TLN  RA++VFS  L S
Sbjct: 230 FVVSHITSFTATWLTLVFLVS-MHLSLNYAAVRSVQMTTLNRQRANIVFSTLLSS 283


>gi|397627073|gb|EJK68340.1| hypothetical protein THAOC_10484 [Thalassiosira oceanica]
          Length = 478

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 66/223 (29%), Positives = 115/223 (51%), Gaps = 13/223 (5%)

Query: 135 SDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKI 193
           + YL Y    ++Q + + +  VLATQ +L  +G+G+ GA   AA ++++++DG G ++ +
Sbjct: 91  TGYLEYVCMDNLQDLTTALRSVLATQRVLEGVGVGREGATALAATLDFLIRDGCGMIASL 150

Query: 194 ML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQ 252
           +L S F   F  + K W+LFAD   +A   LE++ P+ P  + F+    GA    A    
Sbjct: 151 VLTSGFSSTFRRDIKRWKLFADAAVDAGITLEVVAPSLPSRW-FLPVLCGANVCKALCGV 209

Query: 253 ASTRSC--FYAGFAAR-----RNFAEVIAKGEAQGMVSKAIGIMLGIALANHI---GSSM 302
           A+   C      +AAR        +EV AK  AQ  +  A G++L    A  +   G++ 
Sbjct: 210 AAGAVCGPVQLHWAARLGLGDEGLSEVAAKAGAQRTIVGATGLVLSGLFARWMGGRGANR 269

Query: 303 PFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           P  +  ++V+T +H+  N +S + + L  LN  R  L+  ++L
Sbjct: 270 PVWVGMYAVLTALHLLANYRSMRIVRLNWLNRERMDLILEDFL 312


>gi|392871321|gb|EAS33216.2| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
          Length = 407

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 160/321 (49%), Gaps = 32/321 (9%)

Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
           S+Q  +S I+G+L+++A+L  +G+G   A PT+A +  +L+D +G ++ I+ ++  G   
Sbjct: 21  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 80

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYA 261
           +   K +RL AD+  +++  ++ L+P FP    V + + +   R+   +   S+++   A
Sbjct: 81  EPECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 140

Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
            FA   N AE+ AK  +Q  V   +G+++G  + ++I S +   +A  S++  +H+  N 
Sbjct: 141 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLI-VHITTNY 199

Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLV 381
            + +++ + +LN  RA++VFS      +    K+ + +E +F      + K+++ +    
Sbjct: 200 LAVRAVSMTSLNRQRANIVFSTLFDENRVLTAKQASKQERVFERDGVLRWKASSDTLGFC 259

Query: 382 LSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH----------ALFSLYEDEGYIL-TEH 430
                           Q+G    DV+++   +           A+  ++E E YIL  + 
Sbjct: 260 ----------------QIGVPFKDVIDHVSSSSPAERTEATIVAILRVFEQEEYILWFDP 303

Query: 431 GGKFCVV-LKESALPQDMLKS 450
             K  V+ LK +A P   LK+
Sbjct: 304 ARKRGVIGLKTNASPTSQLKA 324


>gi|320038116|gb|EFW20052.1| hypothetical protein CPSG_03227 [Coccidioides posadasii str.
           Silveira]
          Length = 478

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 160/321 (49%), Gaps = 32/321 (9%)

Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
           S+Q  +S I+G+L+++A+L  +G+G   A PT+A +  +L+D +G ++ I+ ++  G   
Sbjct: 92  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 151

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYA 261
           +   K +RL AD+  +++  ++ L+P FP    V + + +   R+   +   S+++   A
Sbjct: 152 EPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 211

Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
            FA   N AE+ AK  +Q  V   +G+++G  + ++I S +   +A  S++  +H+  N 
Sbjct: 212 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLI-VHITTNY 270

Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLV 381
            + +++ + +LN  RA++VFS      +    K+ + +E +F      + K+++ +    
Sbjct: 271 LAVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGFC 330

Query: 382 LSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH----------ALFSLYEDEGYIL-TEH 430
                           Q+G    DV+++   +           A+  ++E E YIL  + 
Sbjct: 331 ----------------QIGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYILWFDP 374

Query: 431 GGKFCVV-LKESALPQDMLKS 450
             K  V+ LK +A P   LK+
Sbjct: 375 ARKRGVIGLKTNASPTSQLKA 395


>gi|303316636|ref|XP_003068320.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240108001|gb|EER26175.1| hypothetical protein CPC735_003440 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 478

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 160/321 (49%), Gaps = 32/321 (9%)

Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
           S+Q  +S I+G+L+++A+L  +G+G   A PT+A +  +L+D +G ++ I+ ++  G   
Sbjct: 92  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSL 151

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYA 261
           +   K +RL AD+  +++  ++ L+P FP    V + + +   R+   +   S+++   A
Sbjct: 152 EPECKMYRLAADIFNDSSMIMDCLSPTFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSA 211

Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
            FA   N AE+ AK  +Q  V   +G+++G  + ++I S +   +A  S++  +H+  N 
Sbjct: 212 HFAKWGNLAELNAKDSSQETVISLMGMLVGSLVVSYITSPLATWVALISLLI-VHITTNY 270

Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLV 381
            + +++ + +LN  RA++VFS      +    K+ + +E +F      + K+++ +    
Sbjct: 271 LAVRAVSMTSLNRQRANIVFSTLFDENRVLTPKQASKQERVFERDGVLRWKASSDTLGFC 330

Query: 382 LSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH----------ALFSLYEDEGYIL-TEH 430
                           Q+G    DV+++   +           A+  ++E E YIL  + 
Sbjct: 331 ----------------QIGVPFKDVIDHISSSSPAERTEATVVAILRVFEQEEYILWFDP 374

Query: 431 GGKFCVV-LKESALPQDMLKS 450
             K  V+ LK +A P   LK+
Sbjct: 375 ARKRGVIGLKTNASPTSQLKA 395


>gi|346467075|gb|AEO33382.1| hypothetical protein [Amblyomma maculatum]
          Length = 388

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/213 (28%), Positives = 103/213 (48%), Gaps = 3/213 (1%)

Query: 154 SGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRL 211
           +G LA Q++L  +G+G +GA   AA   W+L+DG G   +I+ + F G + D + K WRL
Sbjct: 1   TGTLAIQSVLTGVGVGDQGASVLAATTTWILRDGAGMTGRILFAWFQGSNLDYDSKKWRL 60

Query: 212 FADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAE 271
           FAD+L +AA  LE++   F  +   I   +   +S   +   +TR+      A   N A+
Sbjct: 61  FADILNDAAIFLELMCQYFKGYVTPILCISSVAKSVVGVAGGATRAALTQHQARSNNMAD 120

Query: 272 VIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRT 331
           V AK  +Q  +      +  + L   +  +       F + T +H++ N ++   + + T
Sbjct: 121 VSAKDGSQETLVNLAAFLFSLLLLQLVAGNTWLVSTIFILFTLLHIFANYRAVSCVVMET 180

Query: 332 LNPYRASLVFSEYLLSGQA-PPVKEVNDEEPLF 363
            N  R S+V   +L  G    PV  VN  E ++
Sbjct: 181 FNRSRYSVVVRHFLQKGGGIAPVGWVNARESVW 213


>gi|159474944|ref|XP_001695583.1| hypothetical protein CHLREDRAFT_191958 [Chlamydomonas reinhardtii]
 gi|158275594|gb|EDP01370.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 487

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 81/135 (60%), Gaps = 1/135 (0%)

Query: 146 VQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIML-SNFGRHFDV 204
           +  +A   + VLA+  +LYA+GLG GAIPTA A+NWVLKDG+G L  ++       +FDV
Sbjct: 143 LNNIAVTANSVLASTFMLYAVGLGAGAIPTAGALNWVLKDGMGQLGTLVFGKTIAHNFDV 202

Query: 205 NPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFA 264
           + K W   + +L  AA  LEMLT   P HF+ +G+ A   +  A +   STRS F+  FA
Sbjct: 203 HSKTWFFLSAVLLQAAAALEMLTVLVPGHFLLMGSLANMLKGLAWMAAGSTRSVFHLSFA 262

Query: 265 ARRNFAEVIAKGEAQ 279
              N A+V AKG +Q
Sbjct: 263 RDNNIADVTAKGTSQ 277


>gi|119467992|ref|XP_001257802.1| hypothetical protein NFIA_052500 [Neosartorya fischeri NRRL 181]
 gi|119405954|gb|EAW15905.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 515

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 63/210 (30%), Positives = 114/210 (54%), Gaps = 5/210 (2%)

Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
           S+Q   S I+G+L+++A+L  +G+G   A PT+A +  +L+D  G ++ I+ ++  G   
Sbjct: 86  SLQAFCSSIAGLLSSRAVLQGVGVGNANASPTSALLLHILQDSSGRIATILFAHRVGTAL 145

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG--AAAGAGRSAAALIQASTRSCFY 260
           +   K +RL AD+  + A  L+ L+P  P     +G  +AAG  R+   +   S+++   
Sbjct: 146 EPECKMYRLAADVFNDVAMVLDCLSPMIPAGVGRVGVLSAAGVLRALCGVAGGSSKASLS 205

Query: 261 AGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCN 320
           A FA   N AEV AK  +Q  +   IG+++G  + +H+ +      AS   +  +H+  N
Sbjct: 206 AHFARGGNLAEVNAKDSSQETIISLIGMLVGSFIVSHV-TCFAATWASLLFLLTVHLGMN 264

Query: 321 LKSYQSIELRTLNPYRASLVFSEYLLSGQA 350
             + +S+++ +LN  RA++V S  L S  A
Sbjct: 265 YAAVRSVQMTSLNRQRANIVLSTLLESDLA 294


>gi|452984686|gb|EME84443.1| hypothetical protein MYCFIDRAFT_187449 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 435

 Score = 92.0 bits (227), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 90/357 (25%), Positives = 171/357 (47%), Gaps = 60/357 (16%)

Query: 121 VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAI 179
           V   LP GFP SV+ DYL Y ++ S+Q  +S I+ +L+++A+L ++G+G   A PTAA I
Sbjct: 44  VDVFLPVGFPHSVSEDYLEYQIYDSLQAFSSSIASLLSSRAVLSSVGVGDSTASPTAAII 103

Query: 180 NWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIG 238
             +L+D +G L+ I+ ++ FG   +   K +RL AD +      L  +  +        G
Sbjct: 104 LSILQDSVGRLATILFAHSFGTSLEPECKMYRLLADPIRVVVLSLSSVLRS------LCG 157

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
            AAG           S+++   A FA + N  E+ AK  +Q  V   +G+M G  + + I
Sbjct: 158 VAAG-----------SSKASLSAHFAKQGNLGELNAKDSSQETVISLVGMMAGSLVISWI 206

Query: 299 GSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
             + P A   +  ++  IH+  N ++ +++++RTL   RA+L++ + +   + P    + 
Sbjct: 207 --TTPIATWTALILLLSIHLETNRRAVRAVKMRTLTRQRAALIYYQ-VQQNKTPTPDFIA 263

Query: 358 DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHA-- 415
            +E +F            +  +L      ++A  E+     +G  L+ ++N+  + H+  
Sbjct: 264 SQERIF-----------ERDGVL------RNAKAEVLGFCSVGVPLARLLNSIGEQHSQT 306

Query: 416 ------------LFSLYEDEGYIL------TEHGGKFCVVLKESALPQDMLKSLFQA 454
                       +  LY+   +IL              +V+K+SA+ +D++ + +QA
Sbjct: 307 KSSTMASEKFTRILHLYQTSAFILWSDRHSRSRRNSLFIVVKKSAVSRDLILAWWQA 363


>gi|347842141|emb|CCD56713.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 376

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 79/322 (24%), Positives = 153/322 (47%), Gaps = 53/322 (16%)

Query: 165 AIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFG 222
           ++G+G   A  T A +  +L++  G +S I+ ++  G   +   K +RL AD+  +AA  
Sbjct: 2   SLGVGDSTASATHAVLLSILQESAGRISTILFAHRLGSALEPECKKYRLMADIFNDAAMI 61

Query: 223 LEMLTPAFPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
           L+ ++PAFP    VF+ +A+   +S   +   S+++   A FA   N AE+ AK  +Q  
Sbjct: 62  LDCISPAFPKIPRVFLLSASSVCKSLCGVAAGSSKASLSAHFAKMGNLAELNAKDASQET 121

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 341
           +   +G+++G  + + I S +   +A  ++++ IH+  N  + +S+ +RTLN  RA+LV 
Sbjct: 122 LISLLGMLVGTFVVSKISSQVATWIALLALLS-IHLGTNYMAVRSVTMRTLNRQRANLVI 180

Query: 342 SEYLLSGQ-------APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE 394
           S  L S +        P  ++++ +E +F                       +D A+   
Sbjct: 181 SSLLSSPEHEKNKISLPSPRDISLQERIFE----------------------RDGAIRNT 218

Query: 395 H-----RLQLGSKLSDVVNN-------------KEDAHALFSLYEDEGYIL--TEHGGKF 434
           H       ++G  L D++++             +    +L S+Y+ +GY++  +     F
Sbjct: 219 HGTILAYCKVGVSLQDLLSSIATPTTAGSYAEPESILTSLLSIYKTQGYLMWYSRSRNTF 278

Query: 435 CVVLKESALPQDMLKSLFQASY 456
            V+LKE   P+  L + F A +
Sbjct: 279 LVILKEETAPKFQLDAWFHAIF 300


>gi|170102118|ref|XP_001882275.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642647|gb|EDR06902.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 285

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 73/263 (27%), Positives = 132/263 (50%), Gaps = 22/263 (8%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAA 177
           +L  +  LP G+PDSV+ DYL Y +  ++Q   + ++G+L+++A+L   G+G    P+A 
Sbjct: 6   DLISKVFLPAGYPDSVSPDYLRYQILNALQAFCNSLAGLLSSRAILEGFGVGD---PSAT 62

Query: 178 AIN----WVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPA--- 229
           A N     VL+D    L+ I+ ++  G     + K +RL AD+L +AA  L+ ++P    
Sbjct: 63  ATNALLLTVLQDVFSRLTTIVAAHILGSSLAPDAKKYRLLADMLNDAAVILDTISPRLDT 122

Query: 230 --FPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAA----RRNFAEVIAKGEAQGMVS 283
             FP   V     +   RS  A+    +++     FA       +  ++ AK  ++  V 
Sbjct: 123 LFFPGLRVAALCLSALFRSLCAISAGGSKASISLHFATPLKGTGDVGDLNAKDASKETVL 182

Query: 284 KAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFS 342
             +G++LG  +  H+  + P+ + A+  V+  +H+  N    + + LRTLN  R  + +S
Sbjct: 183 ALLGMLLGTLIVPHL--TTPWTIYATLLVLVGLHLTINYIGVRGLVLRTLNGQRTWIAWS 240

Query: 343 EYLLSG--QAPPVKEVNDEEPLF 363
            Y+ +   +AP   EV   E +F
Sbjct: 241 AYMRTQPRRAPTPVEVASLERIF 263


>gi|426194386|gb|EKV44317.1| hypothetical protein AGABI2DRAFT_120451 [Agaricus bisporus var.
           bisporus H97]
          Length = 462

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/416 (23%), Positives = 175/416 (42%), Gaps = 49/416 (11%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAI 173
           + +EL  +  LP G+P++VT DYL Y +  + Q   + ++ +LA++A L  IG+G   A 
Sbjct: 46  DTKELLTKIFLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPSAT 105

Query: 174 PTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            T A +  VL+D  G ++ I+ + + G       K +R  AD+L + A  +++  P F  
Sbjct: 106 STGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVF-S 164

Query: 233 HFVFIGA------AAGAGRSAAALIQASTRSCFYAGFAARR----NFAEVIAKGEAQGMV 282
            F+F GA       + A R+   ++    ++     FA  R    +  ++ AK  ++  V
Sbjct: 165 AFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKETV 224

Query: 283 SKAIGIMLGIALANHIGS---SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
              +G++LG  L  HI S   + P  L   SV    H+  N  + + + +R+ N +RA++
Sbjct: 225 LALLGMLLGSVLIPHITSPWVTYPVLLLLISV----HLVINYIAVRGLAIRSPNRHRATI 280

Query: 340 VFSEYLLSGQ-APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQ 398
            +  Y    + AP  + V   E +F      +    + +  LVL             +  
Sbjct: 281 AWLWYRSKDECAPSPQAVASHELIFERPENIR----DPTTSLVLG------------KCH 324

Query: 399 LGSKLSDVVNNKEDAHALFSLYEDEGYIL-------TEHGGKFCVVLKESALPQDMLKSL 451
           +GS   DV+     +     ++  + YI+       T H     +  KE     D LK  
Sbjct: 325 VGSAPLDVMRGPALSKEYLDIFHADKYIVCFNPSHETGHLPIIHICFKEGFTRADELK-- 382

Query: 452 FQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTD 507
              S+++ +E    I              +  +   V+ +++H        GW  D
Sbjct: 383 ---SWIHAVEICRSIALNQVPVQKGALEHIRSTFQVVKEKYDHFVDHMKKAGWNFD 435


>gi|393227224|gb|EJD34913.1| DUF647-domain-containing protein [Auricularia delicata TFB-10046
           SS5]
          Length = 377

 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 79/325 (24%), Positives = 144/325 (44%), Gaps = 18/325 (5%)

Query: 142 LWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLS-NFG 199
           ++ S+Q   S+I+G+LA +A+L  +G+G  GA  T A +  +++D  G ++ I+ +   G
Sbjct: 1   MFDSLQAFFSEIAGLLAARAVLTGLGVGDAGATATGAVLLKIIQDTCGRVATILSAWKLG 60

Query: 200 RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSC 258
              +   K WRL ADL  +A   LE L+P  PH   + +   A   RS   +     ++ 
Sbjct: 61  PMLEPECKQWRLAADLFNDAGIILECLSPTLPHTMKISLLCVARVVRSLCGVSAGGAKAA 120

Query: 259 FYAGFAA-RRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHM 317
               F     + A++ AK  +Q  V   IG+++G  +   I +S      +   +  +H+
Sbjct: 121 LSQHFVGPGGSIADINAKESSQETVVALIGMLVGSMIVPRI-TSFEGTWTALLFLIGVHL 179

Query: 318 YCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKS 377
           Y N +  +++ + TLN  RA++V  E +  G+    ++V   E +F       +      
Sbjct: 180 YANWRGVRAVAMLTLNRQRATIVLGELIQRGRVLSPEQVRQRESVFCRSSRLVLDGVTLG 239

Query: 378 QLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL---TEHGGKF 434
           Q  +         V ++H L   +++     +      L   +  E YIL       GK 
Sbjct: 240 QCHI--------GVSLQHLLAQIARVHGPDRSGSALAGLMDRFASERYILHCCCRRKGKV 291

Query: 435 CVV--LKESALPQDMLKSLFQASYL 457
            ++  LK     +DML++   A  L
Sbjct: 292 SIIICLKRGVESRDMLRAWTHALVL 316


>gi|336467636|gb|EGO55800.1| hypothetical protein NEUTE1DRAFT_103211 [Neurospora tetrasperma
           FGSC 2508]
 gi|350287710|gb|EGZ68946.1| hypothetical protein NEUTE2DRAFT_72230 [Neurospora tetrasperma FGSC
           2509]
          Length = 361

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 73/247 (29%), Positives = 120/247 (48%), Gaps = 30/247 (12%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVL 183
           LP G+P SVT DYL Y  + S+Q   S IS +LA +A+L  +G+G   + PT A I  + 
Sbjct: 44  LPAGYPHSVTPDYLPYQTYDSLQAFFSSISSLLANRAVLEGLGVGDASSSPTGALILKIA 103

Query: 184 KDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAA 241
            D I  ++ I+ ++  G   +   K +R  AD L +AA  L++L PA    + + +G   
Sbjct: 104 SDTISRVATILFAHRLGTAIEPECKFFRFLADCLNDAAQLLDLLVPAMMERYGLKLGVIV 163

Query: 242 GAG--RSAAALIQASTRSCFYAGF--------AARRNFAEVIAKGEAQGMVSKAIGIMLG 291
           GAG  RS   +  +++++   A F            N AE+ AK  +Q  V   +G+++G
Sbjct: 164 GAGVLRSWCGVAASASKASLSAHFIVVASQQKGGGNNLAELNAKEASQETVVSLLGMLVG 223

Query: 292 IALANHIGSSMPFALASFSVVTW--------IHMYCNLKSYQSIELRTLNPYRASLVFSE 343
                    S+   +    +V W        +H+  N    + +  RTLN  RA++V  E
Sbjct: 224 ---------SLVVQVVQDRLVVWVLMVGLLGVHLLMNYWGVRCVRCRTLNRQRATMVVRE 274

Query: 344 YLLSGQA 350
           +L +G+ 
Sbjct: 275 WLETGRV 281


>gi|67972176|dbj|BAE02430.1| unnamed protein product [Macaca fascicularis]
          Length = 326

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 50/168 (29%), Positives = 82/168 (48%)

Query: 199 GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSC 258
           G   D N K WRLFAD+L + A  LE++ P +P  F    + +   +   ++   +TR+ 
Sbjct: 12  GSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAA 71

Query: 259 FYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMY 318
                A R N A+V AK  +Q  +    G+++ + +   +     F+L  F  +T +H+Y
Sbjct: 72  LTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIY 131

Query: 319 CNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAF 366
            N ++ +S+ + TLN  R  LV   YL  G+       N  EPL+  F
Sbjct: 132 ANYRAVRSLVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLWTGF 179


>gi|391863517|gb|EIT72825.1| hypothetical protein Ao3042_00909 [Aspergillus oryzae 3.042]
          Length = 236

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 91/156 (58%), Gaps = 4/156 (2%)

Query: 121 VQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAI 179
           V+  LP G+P SV+ DY+ Y ++ S+Q  +S I+G+L+++A+L  +G+G   A PTAA +
Sbjct: 72  VEVFLPAGYPHSVSDDYVPYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALL 131

Query: 180 NWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF--VF 236
             +L+D  G +S I+ ++  G   +   K +RL AD+  +AA  L+ L+P  P  F  V 
Sbjct: 132 LHILQDTSGRISTILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVT 191

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV 272
           + + AG  R+   +   S+++   A F+   N AEV
Sbjct: 192 VLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEV 227


>gi|409076043|gb|EKM76417.1| hypothetical protein AGABI1DRAFT_131245 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 466

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 71/265 (26%), Positives = 129/265 (48%), Gaps = 21/265 (7%)

Query: 115 ECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAI 173
           + +EL  +  LP G+P++VT DYL Y +  + Q   + ++ +LA++A L  IG+G   A 
Sbjct: 46  DTKELLTKIFLPAGYPNTVTPDYLLYQILNAAQAFCNSLASLLASRAALEGIGVGDPSAT 105

Query: 174 PTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
            T A +  VL+D  G ++ I+ + + G       K +R  AD+L + A  +++  P F  
Sbjct: 106 STGAMLISVLQDIFGRIATIIGAYYLGTRLVPEAKKYRFLADVLNDIAVVVDVFNPVF-S 164

Query: 233 HFVFIGA------AAGAGRSAAALIQASTRSCFYAGFAARR----NFAEVIAKGEAQGMV 282
            F+F GA       + A R+   ++    ++     FA  R    +  ++ AK  ++  V
Sbjct: 165 AFMFPGARVIGLCTSAALRAVCGVLAGGAKAAITLHFATPRGGVGDVGDLNAKDSSKETV 224

Query: 283 SKAIGIMLGIALANHIGS---SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
              +G++LG  +  HI S   + P  L   SV    H+  N  + + + +R+ N +RA++
Sbjct: 225 LALLGMLLGSVMIPHITSPWVTYPVLLLLISV----HLVINYIAVRGLAIRSPNRHRATI 280

Query: 340 VFSEYLLSGQ-APPVKEVNDEEPLF 363
            +  Y    + AP  + V   E +F
Sbjct: 281 AWLWYRSKDECAPSPQAVASRELIF 305


>gi|403412621|emb|CCL99321.1| predicted protein [Fibroporia radiculosa]
          Length = 4586

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 92/414 (22%), Positives = 174/414 (42%), Gaps = 54/414 (13%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAAIN 180
           Q  LP  +P SV   YL +   +  +     +  VL  QALL ++G+  +G+I  A A+ 
Sbjct: 115 QMFLPTNYPQSVHPSYLRFHALQFFETTFGTVVSVLCNQALLTSVGVSAEGSIFGAVAVQ 174

Query: 181 WVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRLFADLLENAAFGLEM----LTPAFPHHFV 235
           W++KDG G ++K+  +  F  +FD +PK + LF + +      L+M    +TP  P +F+
Sbjct: 175 WIIKDGAGEVAKLFFIRRFSPYFDSHPKSFTLFGEGIVAIGSCLQMATLLITPT-PGNFL 233

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
              A     +     +  +T   +   F+ + N  +V AK E+Q  +++  G   GI L 
Sbjct: 234 LCAAGGNIFKLVGYAVWFTTHIKWVRYFSLQGNVGDVAAKDESQTSIAQLCGYAAGIGLL 293

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEY------LLSGQ 349
             +  S  +  + F  +T +H+   +   +  +   L   R SL+  EY        +G+
Sbjct: 294 T-VSHSPAYLYSLFFALTPLHLTATMFMMRDAKFEALTLPRLSLLAREYAREDGEAATGK 352

Query: 350 APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN 409
              ++E+ +         FFK K+                    +  ++L  ++++V+  
Sbjct: 353 VNTLREIEESRQTGAFGEFFKRKN--------------------DRFVELAPRVTEVIGG 392

Query: 410 KEDAHALF-----SLYEDEGYIL----TEHGGKFCVVLKESALPQDMLKSLFQASYLYWL 460
             D   +      S ++ E Y+L    +       V     A   D L+S+  A+     
Sbjct: 393 DVDREPVLWEMCTSEFQTEKYLLYPSASPSEKPITVFYHPDATSDDTLRSILHAARF--- 449

Query: 461 ERNAGIVATSTSAD---CAPGGRLEISL----DYVQREFNHVKSDSASVGWVTD 507
            R+  + +  +S D    +  G L+++L     + ++ F   K +    GW TD
Sbjct: 450 -RHDLLESRLSSLDSPWSSRTGELQVALASSHAWTRQHFGAFKQELDDKGWRTD 502


>gi|219129806|ref|XP_002185071.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403566|gb|EEC43518.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 500

 Score = 88.6 bits (218), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 127/259 (49%), Gaps = 7/259 (2%)

Query: 96  ASASNASLSSLLSVNK-LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQIS 154
           +S+  + L+S L+V + L    +       LP G+P      YL YS+W  VQ +++Q+ 
Sbjct: 56  SSSGRSELNSRLNVGRVLLRNVQSGLRSTFLPSGYPARTPPGYLRYSVWSWVQDLSTQLR 115

Query: 155 GVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLF 212
            VLATQ +L  +G+G+ GA   +A +N++++DG G  + +  ++     F  + K WR+F
Sbjct: 116 SVLATQRVLEGVGVGREGATALSALLNFLVRDGCGMAASLAFTSAAASRFQSDVKRWRIF 175

Query: 213 ADLLENAAFGLEMLTPAFPHH-FVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAE 271
           AD++ +    LE+   A P   F+ + +   A ++   +   +        +A   + ++
Sbjct: 176 ADVMVDIGITLEVAASACPPTLFLPMISLGNACKAMCGVAAGACGGAINLHWAKGSDISD 235

Query: 272 VIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL--ASFSVVTWIHMYCNLKSYQSIEL 329
           + AK  AQ  V+ ++G++     A  + S++P       ++ +T +H+  NL+  + I  
Sbjct: 236 IQAKFGAQHTVTGSLGLVFAAFFAKSV-STVPGGRLWILYAALTALHVMANLRCMRIIHF 294

Query: 330 RTLNPYRASLVFSEYLLSG 348
              N  R  LV      SG
Sbjct: 295 DVFNTTRLRLVLRALFESG 313


>gi|294462406|gb|ADE76751.1| unknown [Picea sitchensis]
          Length = 78

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 55/77 (71%)

Query: 447 MLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVT 506
           ML+++FQ  +LYWLER+ G+ +     DC  GG+L +S D++ REF++V+  +A+VGW  
Sbjct: 1   MLRAMFQVCHLYWLERHLGVESKKIINDCMAGGKLALSHDFMVREFDNVQKKAATVGWYP 60

Query: 507 DGLIARPLPNRIRPGYV 523
           +GL+ARPLP R+   Y+
Sbjct: 61  EGLVARPLPFRVHLNYI 77


>gi|414586419|tpg|DAA36990.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 311

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 70/299 (23%), Positives = 140/299 (46%), Gaps = 33/299 (11%)

Query: 213 ADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV 272
           AD+L +    LE+++P  P  F+ +       +  A +   +TR   Y+ FA   N +++
Sbjct: 32  ADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVAARATRLPIYSSFAKEGNLSDL 91

Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
            AKGEA   +   +GI  GI LA+ + S+    L +  +++ +H+Y  ++  ++  + TL
Sbjct: 92  FAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSAVHIYGVIQEMRATPVNTL 151

Query: 333 NPYRASLVFSEYLLSGQ-APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAV 391
           NP R +++ ++++ SG+ + P +    E+ LFP                       +  +
Sbjct: 152 NPQRTAMIVADFIKSGKVSSPAELRYKEDLLFP-----------------------NRVI 188

Query: 392 EIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSL 451
           E    +++G  L  V + +     L + +  E ++L++   +  +VL+ SA  +D L+  
Sbjct: 189 EEAGSVKIGQPLRKVFSPRL-VEQLRATFPGEKFLLSQKSNRTYMVLERSASGEDALRGW 247

Query: 452 FQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLI 510
             A++   +ER+      + S D A    L  + + ++R F    S+  S GW TD  +
Sbjct: 248 LVAAFASEMERSG-----AGSRDDAV---LSEAYEKMERVFPTFVSEVRSRGWYTDQFL 298


>gi|414586421|tpg|DAA36992.1| TPA: hypothetical protein ZEAMMB73_408601 [Zea mays]
          Length = 164

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 84/154 (54%)

Query: 192 KIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALI 251
           K++ S+ G   D  PK WR+ AD+L +    LE+++P  P  F+ +       +  A + 
Sbjct: 6   KLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMAVVA 65

Query: 252 QASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSV 311
             +TR   Y+ FA   N +++ AKGEA   +   +GI  GI LA+ + S+    L +  +
Sbjct: 66  ARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPL 125

Query: 312 VTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           ++ +H+Y  ++  ++  + TLNP R +++ ++++
Sbjct: 126 LSAVHIYGVIQEMRATPVNTLNPQRTAMIVADFI 159


>gi|167526040|ref|XP_001747354.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774189|gb|EDQ87821.1| predicted protein [Monosiga brevicollis MX1]
          Length = 283

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 51/186 (27%), Positives = 87/186 (46%), Gaps = 1/186 (0%)

Query: 179 INWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFI 237
           +  +L+D  G +  I  +   G   D N K WRL AD+L + A  LE+++P  P +F+F+
Sbjct: 1   MTQILRDATGMIGSISFAWIQGSDLDNNAKQWRLVADVLNDTAMLLELISPLLPAYFLFL 60

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
              A   R+   +   +TR+      A   N  +V AK  +Q  +   + +++ + L   
Sbjct: 61  VCVASLARAIVGVAGGATRAALIQHQARNNNMGDVAAKDGSQETLVNLVALLVNLILVPL 120

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           +  +     + F + T  H++ N K+ +S+ +  LN  RA L+   Y    Q    K  N
Sbjct: 121 VSQNAQVVWSLFGLSTCCHIFFNYKAVRSLCMTRLNGRRAHLIVEHYTRHQQILSPKIAN 180

Query: 358 DEEPLF 363
             EPLF
Sbjct: 181 QREPLF 186


>gi|119187997|ref|XP_001244605.1| hypothetical protein CIMG_04046 [Coccidioides immitis RS]
          Length = 472

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 85/379 (22%), Positives = 169/379 (44%), Gaps = 70/379 (18%)

Query: 110 NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLW-----------------------RSV 146
            + W       +   LP G+P SV+ DYL Y ++                        S+
Sbjct: 43  RRTWSSYFTAILDAFLPVGYPHSVSGDYLEYQVYVRPIFLKLHWSFTDVANNRLFVQDSL 102

Query: 147 QGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDV 204
           Q  +S I+G+L+++A+L  +G+G   A PT+A +  +L+D +G ++ I+ ++  G   + 
Sbjct: 103 QAFSSSIAGLLSSRAVLQGVGVGDATASPTSALLLSILQDSMGRIATILFAHQLGTSLEP 162

Query: 205 NPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGF 263
             K +RL AD+  +++  ++ L+P FP    V + + +   R+   +   S+++   A F
Sbjct: 163 ECKMYRLAADIFNDSSMIMDCLSPMFPKSIRVGLLSLSSVLRALCGVAAGSSKASLSAHF 222

Query: 264 AARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKS 323
           A   N AE+ A               +G  + ++I S +   +A  S++  +H+  N  +
Sbjct: 223 AKWGNLAELNA---------------VGSLVVSYITSPLATWVALISLLI-VHITTNYLA 266

Query: 324 YQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
            +++ + +LN  RA++VFS      +    K+ + +E +F      + K+++ +      
Sbjct: 267 VRAVSMTSLNRQRANIVFSTLFDENRVLTAKQASKQERVFERDGVLRWKASSDTLGFC-- 324

Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVNNKEDAH----------ALFSLYEDEGYIL-TEHGG 432
                         Q+G    DV+++   +           A+  ++E E YIL  +   
Sbjct: 325 --------------QIGVPFKDVIDHVSSSSPAERTEATIVAILRVFEQEEYILWFDPAR 370

Query: 433 KFCVV-LKESALPQDMLKS 450
           K  V+ LK +A P   LK+
Sbjct: 371 KRGVIGLKTNASPTSQLKA 389


>gi|258576451|ref|XP_002542407.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902673|gb|EEP77074.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 442

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 158/332 (47%), Gaps = 40/332 (12%)

Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFD 203
           S+Q  +S I+G+L+++A+L  +G+G   A PTAA +  VL++ +G ++ I+ ++      
Sbjct: 40  SLQAFSSSIAGLLSSRAVLQGVGVGDATASPTAALLLNVLQESMGRIATILFAH-----R 94

Query: 204 VNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAG 262
           +  K +RL AD+  ++A  ++ L+P FP    V + + +   R+   +   S+++   A 
Sbjct: 95  LECKMYRLAADIFNDSAIVMDCLSPMFPKPMRVGVLSLSSVLRAMCGVAAGSSKASLSAH 154

Query: 263 FAARRNFAEVIAKGEAQGMVSKAIGIML-----------GIALANHIGSSMPFALASFSV 311
           FA   N AE+ AK  +Q  V   +G+++           G+ L   +G S+  +  + ++
Sbjct: 155 FARWGNLAELNAKDSSQETVISLMGMLVRPILGFLESISGLILTQQVG-SLVVSRVTSTL 213

Query: 312 VTW--------IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
            TW        IH+  N  + +++ + TLN  RA++VFS      +     +   +E +F
Sbjct: 214 ATWAALILLLTIHITTNYFAVRAVNMTTLNRQRANIVFSTIFDENRTLTPSQAAHQERIF 273

Query: 364 PAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL-GSKLSDVVNNKEDAHALFSLYED 422
                 + K++  S        +    V  ++ +QL  S  S   + +    AL  L+E 
Sbjct: 274 ERDGVLRWKASPASL------GSCRIGVSFQYMVQLIASSESCTSDGQSIVAALLHLFER 327

Query: 423 EGYIL----TEHGGKFCVVLKESALPQDMLKS 450
           E YIL    T   G   +VLK +A     LK+
Sbjct: 328 EEYILWFNPTRKRG--AIVLKTNATAASQLKA 357


>gi|356510851|ref|XP_003524147.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
          Length = 158

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/120 (38%), Positives = 70/120 (58%)

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           LKDG+ ++ K++ SN+G   D  PK WRL AD L +   GLE+L+P  PH F+ +     
Sbjct: 1   LKDGMQHVGKLICSNWGGTMDSEPKRWRLLADALYDIGTGLEVLSPRCPHLFLEMAGLGN 60

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +  + +   +TR   Y+ FA   N ++++AKGEA   +   IGI +GI LA+ I +SM
Sbjct: 61  FAKGMSVVAARATRLPIYSSFAKEGNLSDLLAKGEAFSTLFNVIGIGVGIQLASTICASM 120


>gi|390599626|gb|EIN09022.1| DUF647-domain-containing protein [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 461

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 66/257 (25%), Positives = 125/257 (48%), Gaps = 12/257 (4%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
           + L     LP G+P SV+ DYL Y  + + Q   S ++ ++A++A+L   G+G   A  T
Sbjct: 30  KHLLTDVFLPAGYPSSVSPDYLRYQTYNAFQAFCSSLANLIASRAVLEGFGVGNASASAT 89

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-- 232
            A +  VL+D  G L+ +    + G       K +RL AD+  +A+  L++  P +    
Sbjct: 90  HAMLLTVLQDVFGRLTTVAGGYYLGTALSPEAKTYRLLADVFNDASIVLDVFQPFYASVP 149

Query: 233 ---HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 289
                V    ++G  R+   +    +++     FA+  +  ++ AK  ++  V   +G++
Sbjct: 150 VAGARVLALCSSGCLRALCGVCAGGSKAALAVHFASAGDLGDLNAKDGSRETVIGLVGLL 209

Query: 290 LGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL-- 346
            G  L ++IG+  P A  A+   +   H+  N  + +++ +RTL   RAS+ +  ++   
Sbjct: 210 SGTILLSYIGT--PGATYAALCALLGAHLALNYLAVRAVAMRTLTRQRASIAWRVWMERG 267

Query: 347 SGQAPPVKEVNDEEPLF 363
           +G AP  +EV + E L 
Sbjct: 268 AGAAPTPEEVAEREGLL 284


>gi|440794597|gb|ELR15757.1| hypothetical protein ACA1_379390, partial [Acanthamoeba castellanii
           str. Neff]
          Length = 449

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 61/226 (26%), Positives = 104/226 (46%), Gaps = 33/226 (14%)

Query: 88  FTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQ 147
           F + A +VA   N      L V+ +    + LF    LP G+P SV  DY+++  W   +
Sbjct: 48  FEESAAIVAREKNQHERKPLRVDLM---LKSLF----LPTGYPASVRQDYIHFQKWEVAK 100

Query: 148 GVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNP 206
           G+ S                       +A  +  ++  G+G L  ++++  FG+ FD   
Sbjct: 101 GIVS-----------------------SAGFVGHLV--GLGCLGMMLVAGAFGKFFDTET 135

Query: 207 KGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAAR 266
           K  R  AD +  A   LE+ TP FP +F+ + + A + +  A L   +T++    GFA R
Sbjct: 136 KRIRWAADCIHVAGVALELATPLFPAYFLPLASLANSAKGIAGLTTGATKAAINQGFALR 195

Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVV 312
            N  ++ AKG +QG+V+  +G+ +GI      G  +    A + V+
Sbjct: 196 DNLGDITAKGHSQGIVAYLMGMGIGIGTTYVTGGVLGALFAVYGVL 241


>gi|194382298|dbj|BAG58904.1| unnamed protein product [Homo sapiens]
          Length = 194

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 81/165 (49%)

Query: 199 GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSC 258
           G   D N K WRLFAD+L + A  LE++ P +P  F    + +   +   ++   +TR+ 
Sbjct: 12  GSKLDCNAKQWRLFADILNDVAMFLEIMAPVYPICFTMTVSTSNLAKCIVSVAGGATRAA 71

Query: 259 FYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMY 318
                A R N A+V AK  +Q  +    G+++ + +   +     F+L  F  +T +H+Y
Sbjct: 72  LTVHQARRNNMADVSAKDSSQETLVNLAGLLVSLLMLPLVSGCPGFSLGCFFFLTALHIY 131

Query: 319 CNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
            N ++ +++ + TLN  R  LV   YL  G+       N  EPL+
Sbjct: 132 ANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLW 176


>gi|296826902|ref|XP_002851048.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
 gi|238838602|gb|EEQ28264.1| DUF647 domain-containing protein [Arthroderma otae CBS 113480]
          Length = 479

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 103/434 (23%), Positives = 184/434 (42%), Gaps = 73/434 (16%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLW---------------------RSVQGVASQISGVLA 158
            +   LP G+P SVT DYL Y ++                      S+Q  +S I+G+L+
Sbjct: 57  ILNVFLPSGYPHSVTDDYLEYQIYVSQQPFPSSDRKEKKLTVSYQDSLQAFSSSIAGLLS 116

Query: 159 TQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLE 217
           ++A+L  +G+G   A PTAA +  VL++ +G ++ I                 LFA  L 
Sbjct: 117 SRAVLQGVGVGDATASPTAALLLNVLQESMGRIATI-----------------LFAHRLG 159

Query: 218 NAAFGLEMLTPAFPHHFVFIG--AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
            +      L+P FP   V +G  + +   R+   +   S+++   A FA   N  E+ AK
Sbjct: 160 TSLEPEYCLSPVFPKP-VRVGFLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAK 218

Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
             +Q  V   +G+++G  +  ++ S +   +A   ++  IH+  N  + +++ + TLN  
Sbjct: 219 DSSQETVISLVGMLVGSVVVAYVSSPLATWVALLFLLA-IHLATNHAAVRAVNMTTLNRQ 277

Query: 336 RASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEH 395
           RA++VFS      +     +V+ EE +F      + K+       VL +     ++E   
Sbjct: 278 RANIVFSSLFDDDRVLTPAQVSKEERIFERDGVLRWKAEATG---VLGTCRVGVSLEQLL 334

Query: 396 RLQLGSKLSDVVNNKEDAHA----------LFSLYEDEGYIL----TEHGGKFCVVLKES 441
            L   S+ +D       + A          L  L+++E Y+L     +H     ++LK  
Sbjct: 335 LLLPHSQTNDASKTSRRSEAGADFGVSLPVLVDLFKEEEYLLYFSPRDHAA--VIILKAG 392

Query: 442 ALPQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQRE--------FN 493
              Q  LK+    S+   + R     +TS   +   GG+ E+S+  + R+        F+
Sbjct: 393 VTAQAQLKAW---SHALLVSRRLAGRSTSACINEKQGGQNELSVLSILRDTLREHSKAFD 449

Query: 494 HVKSDSASVGWVTD 507
                    GW TD
Sbjct: 450 DRVERLKQAGWETD 463


>gi|149392593|gb|ABR26099.1| unknown [Oryza sativa Indica Group]
          Length = 282

 Score = 85.5 bits (210), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 71/314 (22%), Positives = 144/314 (45%), Gaps = 37/314 (11%)

Query: 214 DLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVI 273
           D+L +    LE+++P  P  F+ +       +  A +   +TR   Y+ FA   N +++ 
Sbjct: 1   DVLYDFGTALEVISPLCPQLFLEVAGFGNFAKGMAVVAARATRLPIYSSFAKEGNLSDLF 60

Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
           AKGEA   +   +GI  GI LA+ + S+    L +  +++ +H+Y  ++  ++  + TLN
Sbjct: 61  AKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIAGPLLSVVHIYGVVQEMRATPVNTLN 120

Query: 334 PYRASLVFSEYLLSGQAPPVKEVN-DEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
           P R +++ ++++ SG+     E+   E+ LFP                       +  +E
Sbjct: 121 PQRTAMIVADFIKSGKVSSPAELRYREDLLFP-----------------------NRLIE 157

Query: 393 IEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLF 452
               +++G  +  V+ + +    L + +  E ++L+       +VL++SA  +D L+   
Sbjct: 158 EAGSVKIGQPVRRVL-SPQRIEQLKATFSKEKFLLSRKDNSAYMVLEQSATGEDALRGWL 216

Query: 453 QASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIAR 512
            A++   +ER+ G+ +  T         L ++ + ++  F    ++  S GW TD  +  
Sbjct: 217 VAAFASEMERS-GVGSGDTV--------LNVAYERMENVFPMFVAEVKSRGWYTDQFLD- 266

Query: 513 PLPNRIRPGYVEPS 526
              NR R  Y  PS
Sbjct: 267 --GNRSRIAYAIPS 278


>gi|406860777|gb|EKD13834.1| DUF647 domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 501

 Score = 85.1 bits (209), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 70/245 (28%), Positives = 123/245 (50%), Gaps = 33/245 (13%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSV------------QGVASQISGVLATQALLYA-IGLGKG 171
           LP G+P+SVT DYL Y ++               +G+++ +  + +  A+L + IG+G  
Sbjct: 58  LPAGYPNSVTEDYLEYQIYVRAANKPWTQSELISEGLSTSVLKLHSWDAVLESRIGVGDS 117

Query: 172 -AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 229
            A PTAA +  VL+  +G ++ I+ ++  G   +   K +RL AD+  +AA  L+ L+PA
Sbjct: 118 HASPTAALLLSVLQQSMGRIATIVFAHRLGTALEPECKMYRLAADIFNDAAMILDCLSPA 177

Query: 230 FPH-HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGI 288
            P    V + + +   RS   +   S+++   A FA + N  E+ AK  +Q  V   +G+
Sbjct: 178 LPKASRVALLSLSSVLRSMCGVTAGSSKASLSAHFATQGNLGELNAKDSSQETVISLLGM 237

Query: 289 MLGIALANHIGSSMPFALASFSVVTWI--------HMYCNLKSYQSIELRTLNPYRASLV 340
           + G  + ++I S           +TWI        H+  N  + +++ ++TLN  RA+LV
Sbjct: 238 LAGSVVVSYITS---------KTITWIALILLLATHLGTNYLAVRAVCMQTLNRQRANLV 288

Query: 341 FSEYL 345
            S  L
Sbjct: 289 LSNIL 293


>gi|449543750|gb|EMD34725.1| hypothetical protein CERSUDRAFT_116920 [Ceriporiopsis subvermispora
           B]
          Length = 507

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 90/381 (23%), Positives = 161/381 (42%), Gaps = 42/381 (11%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
           R+LF    LP G+P++V+ DYL+Y ++  +Q   S I+ +LA++A+L   G+G+  A  T
Sbjct: 50  RQLFSGIFLPAGYPNTVSPDYLHYQIFNGLQAGCSSIASLLASRAVLEGHGVGEASASAT 109

Query: 176 AAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-- 232
            A +  +++D +  L  I    + G       K +RL AD+  +AA  L++++P F H  
Sbjct: 110 NAMLVNIIQDVVSRLITIGSGYYLGTSLFPEAKKYRLLADVYNDAAVVLDVISPLFTHWS 169

Query: 233 ---HFVFIG------------AAAGAGRSAAALIQASTRSCFYAGFAAR----RNFAEVI 273
               + FI               +G+ R+   ++   +++     FA       +  ++ 
Sbjct: 170 ISLVYPFIRREQGFYLRVLALCLSGSFRALCGMVAGGSKAALTMHFATAGPVPGDIGDLN 229

Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
           AK  ++  V   IG++LG  +  H+ S      A  S++   H+  N  + + + +RT N
Sbjct: 230 AKDGSKETVLALIGMLLGTFVIKHVDSGFA-TYAVLSMLILTHLAINFVAVRGVVMRTFN 288

Query: 334 PYRASLVFSEYLLSGQ----------APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLS 383
            +RA+  ++ Y    Q           P +  V +     P      ++   +     L 
Sbjct: 289 RHRAAAAWASYRRYEQGIAQYEGAILTPELMAVKERIFADPTLLMHPLRPEKRLGHCHLG 348

Query: 384 SEAKDAAVEIEHRLQLGSKLSDVVN-NKEDAHALFSLYEDEGYIL-TEHGGK-----FCV 436
           +      +E  HR    S+ S V N      + +   +E+E YIL   H  +       V
Sbjct: 349 ASLSTIFLE-THRPCPPSRTSTVPNLTSAQLNVVLQTFENERYILWLRHDSRSESLQLVV 407

Query: 437 VLKESALPQDMLKSLFQASYL 457
             K    P D LK    A  L
Sbjct: 408 CFKTGHQPIDRLKGWIHAQEL 428


>gi|350630832|gb|EHA19204.1| hypothetical protein ASPNIDRAFT_186948 [Aspergillus niger ATCC
           1015]
          Length = 504

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 67/241 (27%), Positives = 114/241 (47%), Gaps = 42/241 (17%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPT 175
           ++  V+  LP G+P SVT DY                       A+L  +G+G   A PT
Sbjct: 69  QDFLVEVFLPAGYPHSVTDDY---------------------APAVLQGVGVGNADASPT 107

Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH-- 232
           AA +  +L+D  G ++ I+ ++  G   +   K +RL AD+  + A  L+ L+P  P   
Sbjct: 108 AALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMTLDCLSPMIPAGA 167

Query: 233 HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGI 292
             V + + AG  R+   +   S+++   A F+   N AEV AK  +Q  +   IG+++G 
Sbjct: 168 PRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAEVNAKDSSQETIISLIGMLVGS 227

Query: 293 ALANHIGSSMPFALASFSVVTWI--------HMYCNLKSYQSIELRTLNPYRASLVFSEY 344
            + + +         +FS  TWI        H+  N  + +S+++ +LN  RA++VFS  
Sbjct: 228 FVVSRV--------TNFS-TTWICLLMLQALHLSLNYAAVRSVQMTSLNRQRANIVFSTL 278

Query: 345 L 345
           L
Sbjct: 279 L 279


>gi|189196610|ref|XP_001934643.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187980522|gb|EDU47148.1| hypothetical protein PTRG_04310 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 333

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 63/223 (28%), Positives = 118/223 (52%), Gaps = 6/223 (2%)

Query: 145 SVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHF 202
           S+Q  +S I+G+LA++A+L  +G+G   A PT A +  V++  +G ++ I+ ++  G   
Sbjct: 42  SLQAFSSSIAGMLASRAILEGVGVGDSTATPTTALLLSVIQTSVGQIATILFAHRLGTAL 101

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYA 261
           +   K +RL AD+  + A  L+ L+PAFP    V + + +   R+   +   S+++   A
Sbjct: 102 EPECKMYRLAADVFNDTAMVLDCLSPAFPKSIRVAVLSFSSCLRALCGVCAGSSKASLSA 161

Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCN 320
            FA + N  EV AK  +Q  V   +G++ G  + + I S  P A   +  ++  IH+  N
Sbjct: 162 HFAHKGNLGEVNAKDSSQETVISLLGMLAGSLVVSQITS--PLATWGTLILLLTIHLATN 219

Query: 321 LKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
             + +++ +  LN  RA+++FS     G     ++V+  E +F
Sbjct: 220 YAAVRAVNMHCLNRQRANILFSNMFQHGLVLSPRDVSQRERVF 262


>gi|356498113|ref|XP_003517898.1| PREDICTED: UPF0420 protein-like, partial [Glycine max]
          Length = 171

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 46/120 (38%), Positives = 69/120 (57%)

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAG 242
           LK+G+ ++ K++ SN+G   D  PK WRL AD L +   GLE+L+P  PH F+ +     
Sbjct: 1   LKNGMQHVGKLIGSNWGGTMDFEPKRWRLLADALYDIGTGLEVLSPWCPHLFLEMAGLGN 60

Query: 243 AGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSM 302
             +  A +   +TR   Y+ FA   N +++ AKGEA   +   IGI +GI LA+ I +SM
Sbjct: 61  FAKGMAVVAARATRLPIYSSFAKEGNLSDLFAKGEAFSTLFNVIGIGVGIQLASTICASM 120


>gi|452981822|gb|EME81581.1| hypothetical protein MYCFIDRAFT_100197, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 272

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 63/222 (28%), Positives = 118/222 (53%), Gaps = 5/222 (2%)

Query: 145 SVQGVASQISGVLATQALLYAIGLG-KGAIPTAAAINWVLKDGIGYLSKIMLS-NFGRHF 202
           S+Q   S I+G+L+++A+L ++G+G   A PT+A +  +L+  +  ++K+  +   G+  
Sbjct: 34  SIQAFVSSIAGLLSSRAVLLSVGVGDSSASPTSALLLSMLQQSLSRIAKVAFAYRLGKAI 93

Query: 203 DVNPKGWRLFADLLENAAFGLEMLTPAFPHHFV-FIGAAAGAGRSAAALIQASTRSCFYA 261
           +   K +R  ADLL +++F L  L+P  P  F   I A +    +   ++ +ST++   A
Sbjct: 94  EPECKMYRFCADLLHDSSFVLNCLSPMLPKAFRPPILALSSICFAVCDVVASSTKASLSA 153

Query: 262 GFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNL 321
            FA   N  E+ AK  +Q  +   +G++ G  L N I        A   +++ +H+  N 
Sbjct: 154 HFAKWENLGELNAKDSSQETIISLLGMLTGSLLVNWISGHWATWTALILMLS-VHLEANR 212

Query: 322 KSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
           ++ ++I ++TLN  RA++V+   L  G  P   EV+  E +F
Sbjct: 213 RAVRAIVMQTLNRQRATVVY-HRLCVGHIPRPAEVSRVERIF 253


>gi|240280131|gb|EER43635.1| DUF647 domain-containing protein [Ajellomyces capsulatus H143]
          Length = 521

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 104/433 (24%), Positives = 179/433 (41%), Gaps = 78/433 (18%)

Query: 59  ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNASLSSLLSVNKLWDECRE 118
           +T+SE    +  D+       +R  ++    KD       S++ L+SLL+V         
Sbjct: 26  STTSEPTQFRTPDSSGVRHERRRVDIVQPSRKD-------SSSYLASLLNV--------- 69

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLA----TQALLYAIGLGKG-AI 173
                 LP G+P SV+ DYL+        G AS   GVL     T  +   +G+G   A 
Sbjct: 70  -----FLPVGYPHSVSDDYLD------CLGFAS---GVLKFNCRTAGVPGWVGVGDASAS 115

Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
           PT A ++ VL+D +G ++ I+ ++  G   +   K +RL AD+L ++A   + L+PA P 
Sbjct: 116 PTVALLHSVLQDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPK 175

Query: 233 HF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV------------------- 272
              V + A +   R+   +   S+++   + FA   N  E+                   
Sbjct: 176 RLRVVVLAFSSVLRALCGVAAGSSKASLSSHFAKCGNLGELNAVRFPNVLVIAISGAAQR 235

Query: 273 ------------IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYC 319
                       + K  +Q  V   +GI+ G  + +HI  S PFA   +  ++  +H+  
Sbjct: 236 GFPFLFSDADQKLQKDSSQETVISLLGILCGSVVVSHI--STPFATWTTLLLLLLVHLST 293

Query: 320 NLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQL 379
           N  + +S+ + TLN  R ++VFS     G      E +  E +F      + K++  S  
Sbjct: 294 NYAAVRSVNMTTLNRQRTNIVFSTLFEKGSVLTPTEASKCERIFERDGILRWKAS--SAT 351

Query: 380 LVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHGGKFCVV 437
           L            +   +   + + DV     D + L  L+E E Y+L       K  +V
Sbjct: 352 LGYCQIGGSFQELLRGSIHGANSIRDV---GIDIYKLLRLFEKEEYVLWFNPIHKKGTIV 408

Query: 438 LKESALPQDMLKS 450
           LK +  P   L++
Sbjct: 409 LKNNVTPISQLQA 421


>gi|115389162|ref|XP_001212086.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114194482|gb|EAU36182.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 411

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/160 (35%), Positives = 88/160 (55%), Gaps = 13/160 (8%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPT 175
           + L ++  LP G+P SVT DY  Y ++ S+Q  +S I+G+L+++A+L  +G+G   A PT
Sbjct: 54  KTLLIEVFLPAGYPHSVTDDYTAYQIFDSLQAFSSSIAGLLSSRAVLQGVGVGNPNASPT 113

Query: 176 AAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           +A +  +L+D  G L+ I+ ++  G       K +RL ADL  + A  L+ L+PA P   
Sbjct: 114 SALLLHILQDTSGRLATILFAHRVGTALQPECKSYRLAADLFNDLAMILDCLSPAAPA-- 171

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 274
              GA   A  SAA +++A        G A   N+A V A
Sbjct: 172 ---GAPRVAVLSAAGVLRA------LCGVAGALNYAAVRA 202


>gi|194383144|dbj|BAG59128.1| unnamed protein product [Homo sapiens]
          Length = 156

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 59/159 (37%), Positives = 79/159 (49%), Gaps = 27/159 (16%)

Query: 46  FCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFT-----KDAFVVAS--A 98
            C  Q  +  AR   ++ D   +     WEV G +   L   FT     +DA  V +  A
Sbjct: 11  LCSEQFGSGEARGCRAAADGSLQ-----WEVGGWRWWGLSRAFTVKPEGRDAGEVGASGA 65

Query: 99  SNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLA 158
            +  LS L +V               LP+GFPDSV+ DYL Y LW SVQ  AS +SG LA
Sbjct: 66  PSPPLSGLQAV--------------FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLA 111

Query: 159 TQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLS 196
           TQA+L  IG+G   A  +AA   W++KD  G L +I+ +
Sbjct: 112 TQAVLLGIGVGNAKATVSAATATWLVKDSTGMLGRIVFA 150


>gi|392558453|gb|EIW51641.1| DUF647-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 450

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 105/465 (22%), Positives = 187/465 (40%), Gaps = 57/465 (12%)

Query: 94  VVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQI 153
           +V   S+  L S  +  + W   R L     LPEG+P +V+   + Y +W +VQ   S +
Sbjct: 1   MVLHQSSPYLISQTNAGRHW---RRLLNDVFLPEGYPFTVSPQCVRYQIWNAVQAFCSSL 57

Query: 154 SGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRL 211
           +G+ A++A+L   G+G   A  T A    +L+D    L+ I+   + G       K +RL
Sbjct: 58  AGLFASRAVLQGHGVGNASASATDAIFLTILQDVFSRLTTIVSGYYLGTSLYPEAKTYRL 117

Query: 212 FADLLENAAFGLEMLTPAFPH-----HFVFIGAA------------AGAGRSAAALIQAS 254
            AD+L +AA   + L+P   H      + F+  A            +GA R+    +   
Sbjct: 118 LADILNDAAMVTDTLSPHLAHFSFSLSYPFVVPASNSSLRVVALCMSGAFRALCGAVAGG 177

Query: 255 TRSCFYAGFAAR----RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFS 310
           +++     FA       +  ++ AK  ++  V   +G++ G  +  ++ ++    +  F 
Sbjct: 178 SKAALTVHFATAGERPGDVGDLSAKDGSKETVLALLGMLSGSVVLRYVHTAQATYVVLFG 237

Query: 311 VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS----GQAPPVKEVNDEEPLFPAF 366
           ++ + H+  N  + + I +R  N  RA + + E+  S       P    V  +E +F   
Sbjct: 238 LI-FCHLAANFIAVRVIAMRAFNRQRAGIAWCEFRRSFDVVTVVPDYLHVARQERIF--- 293

Query: 367 HFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYI 426
                     S  + +S  A+         +   S       + E    L  ++  E YI
Sbjct: 294 --------TDSARIPISQHARGTYAHCALGVSFLSLSGREPFSDEQIATLLEVFAHEKYI 345

Query: 427 L--TEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATSTSA----DCAPGGR 480
           L    +  +  VVLK+   P D LK+   A  +      A I    + A    +C    R
Sbjct: 346 LWPASNPPQIAVVLKDGHGPHDHLKAWAHAHEI------ANICDRESHAHGFNECLVAIR 399

Query: 481 LEISLDYVQREFNHVKSDSASVGWVT-DGLIARPLPNRIRPGYVE 524
              +L+ V R F      +  VGW T +G++    P  +  G  E
Sbjct: 400 Y--ALENVSRMFPLFLRAAQEVGWKTEEGVVVGGSPITVTIGEAE 442


>gi|121709968|ref|XP_001272600.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
 gi|119400750|gb|EAW11174.1| DUF647 domain protein [Aspergillus clavatus NRRL 1]
          Length = 512

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 69/241 (28%), Positives = 119/241 (49%), Gaps = 22/241 (9%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLAT-QALLYAIGLGKG---AI 173
           +L V   LP G+P SVT DYL    +             LAT Q     +G+G G   A 
Sbjct: 67  DLLVDVFLPSGYPHSVTDDYLPRVYFH------------LATPQPTNNPLGVGVGNADAT 114

Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
           PT+A +  +L+D  G ++ I+ ++  G   +   K +RL AD+  + A  L+ L+P  P 
Sbjct: 115 PTSALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADVFNDLAMILDCLSPMIPA 174

Query: 233 HF--VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
               + + + AG  R+   +   S+++   A FA   N AEV AK  +Q  +   +G+++
Sbjct: 175 GVSRITVLSTAGVLRALCGVAGGSSKASLSAHFARGGNLAEVNAKDSSQETIISLVGMLV 234

Query: 291 GIALANHI-GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
           G  + +H+ G +  +    F +   + M  N  + +++++ +LN  RA++VFS  L S  
Sbjct: 235 GSLVVSHVTGFAATWLSLLFLLAAHLAM--NYAAVRAVQMTSLNRQRANIVFSTLLDSDP 292

Query: 350 A 350
           A
Sbjct: 293 A 293


>gi|440799746|gb|ELR20790.1| protein C16orf58 family protein [Acanthamoeba castellanii str.
           Neff]
          Length = 101

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 35/71 (49%), Positives = 50/71 (70%), Gaps = 1/71 (1%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWV 182
           +LPEG+P+SV+ DY  Y  W +VQ + S I+G+L T+ALL  +G+G  GA P AAA+ W 
Sbjct: 21  LLPEGYPESVSGDYATYQTWDTVQALCSSITGLLCTRALLQGVGVGDVGATPGAAAVQWA 80

Query: 183 LKDGIGYLSKI 193
           L+DG G + +I
Sbjct: 81  LRDGAGMVGRI 91


>gi|56752623|gb|AAW24525.1| SJCHGC08554 protein [Schistosoma japonicum]
          Length = 127

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 63/113 (55%), Gaps = 5/113 (4%)

Query: 87  DFTKDAFVVASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSV 146
           D     F +    N  LS+L  ++  W   ++ F    LP G+P SV+ DYL Y +W ++
Sbjct: 12  DEISRVFTLEKNQNRRLSTLFGLHLSWVTLKKCF----LPVGYPSSVSDDYLEYQIWDTI 67

Query: 147 QGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGIGYLSKIMLSNF 198
           Q  AS I+G LA+QA+L  +G+G   A    A++ W+ KDG G + +I+ + +
Sbjct: 68  QAFASSITGALASQAVLIGVGVGNSSATILGASLTWMFKDGSGMIGRILFAGY 120


>gi|414586415|tpg|DAA36986.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
          Length = 163

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 61/101 (60%), Gaps = 1/101 (0%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAI 173
           D+  + F+  + P G+P SV   YL Y+ +R +Q  +S +  VL+TQ+LL+A GL +   
Sbjct: 62  DKLVDSFLNKIFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQSLLFAAGL-RPTP 120

Query: 174 PTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFAD 214
             A A++W+LKDG+ +  K++ S+ G   D  PK WR+  +
Sbjct: 121 AQATAVSWILKDGMQHAGKLICSSMGARMDSEPKSWRILGE 161


>gi|345563243|gb|EGX46246.1| hypothetical protein AOL_s00110g70 [Arthrobotrys oligospora ATCC
           24927]
          Length = 445

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 61/256 (23%), Positives = 116/256 (45%), Gaps = 34/256 (13%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAA 178
            +   LP G+P SVT DYL+  L                      A+G     A  TAA 
Sbjct: 57  IINIFLPAGYPKSVTDDYLDGDL----------------------AVGFEDSNASTTAAV 94

Query: 179 INWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VF 236
           +  +++D IG ++ I+ ++  G   +   K +R  AD+  ++A  ++ L+P FP    V 
Sbjct: 95  LLKIVQDSIGRVATILFAHRLGSSLEPECKRFRFAADIFNDSALIVDCLSPVFPKPLRVV 154

Query: 237 IGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           + + AG  R+   +    +++     F+   N AE+ AK  +Q  V   +G+++      
Sbjct: 155 LLSTAGTLRALCGVAGGGSKAQLSVHFSKWGNVAELNAKDSSQETVISLMGMLMRADQTL 214

Query: 297 HIGS-SMPFALASFSVVTW--------IHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS 347
            + + S+  +  +    TW        +H++ N ++ +++ + TLN  RA+ V S +  +
Sbjct: 215 RVKAGSLVISSVTSKTATWASMLFLIAVHLWTNYRAVKAVTMVTLNRQRANSVMSTFAET 274

Query: 348 GQAPPVKEVNDEEPLF 363
           G     K+V  +E +F
Sbjct: 275 GVMLTPKDVARQERIF 290


>gi|169604674|ref|XP_001795758.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
 gi|111066623|gb|EAT87743.1| hypothetical protein SNOG_05352 [Phaeosphaeria nodorum SN15]
          Length = 473

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 97/381 (25%), Positives = 161/381 (42%), Gaps = 69/381 (18%)

Query: 117 RELFVQ----FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL--------- 163
           R+ +VQ      LP G+P S            S+Q  +S I+G+LA++A+L         
Sbjct: 36  RKSYVQQVFDVFLPAGYPQS-----------DSLQAFSSSIAGMLASRAVLEGHLDAEDT 84

Query: 164 ----------YAIGLGK-----------------GAIPTAAAINWVLKDGIGYLSKIMLS 196
                     YA  +G                   A PTAA +  VL++ +G ++ I+ +
Sbjct: 85  LRLQNLVPSVYAASIGDQRHMSSHQSRCVGVGDSSASPTAALLLSVLQESMGRIATILFA 144

Query: 197 N-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAGRSAAALIQAS 254
           +  G   +   K +RL AD+  + A  L+ L+PAFP  F V + + +   RS   +   S
Sbjct: 145 HRLGTALEPECKMYRLAADVFNDTAMILDCLSPAFPKPFRVLVLSFSSVLRSLCGVCAGS 204

Query: 255 TRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN-HIGSSMPFALASFSVVT 313
            ++   A FA + N  EV A      M       + G+   N HI +     +A  S   
Sbjct: 205 AKASLSAHFARKGNLGEVNA--HITHMNPNVDFDIEGLEPGNGHIATWNAILIALLS--- 259

Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFK-IK 372
            IH+  N  + +++ +R LN  RA++VF   +  G      EV+  E +F      + + 
Sbjct: 260 -IHLATNYAAVKAVSMRCLNRQRANIVFGNLMHHGSVLTPTEVSQRERIFERDGVLRWVD 318

Query: 373 SANKSQLLV-LSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TE 429
                   + +      A++   +R     +LSD+  N  D   L  ++ D+ Y+L  T+
Sbjct: 319 DCVIGHCTIGVPLGVLLASMSQRYRRTGSLELSDI--NLPD---LMDIFADQKYVLWFTD 373

Query: 430 HGGKFCVVLKESALPQDMLKS 450
              +  VVLKE   P D LK+
Sbjct: 374 SEYEAIVVLKEGCTPIDQLKA 394


>gi|302678497|ref|XP_003028931.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
 gi|300102620|gb|EFI94028.1| hypothetical protein SCHCODRAFT_40050 [Schizophyllum commune H4-8]
          Length = 278

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 70/257 (27%), Positives = 119/257 (46%), Gaps = 12/257 (4%)

Query: 118 ELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTA 176
           ++  +  LP GFP+SV+ DYL Y ++ ++Q   S ++G+L+++ALL   G+G   A  T 
Sbjct: 2   DVLAKVFLPAGFPNSVSPDYLRYQIYNAIQAFCSSLAGLLSSRALLEGFGVGDANATATN 61

Query: 177 AAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTP---AFPH 232
           A +  V +D    L+ I+ +   G       K +R  AD+  + A   +  TP   A  H
Sbjct: 62  AVLLTVAQDVFSRLTTIIAAYLLGPSLFPEAKTFRFLADVFNDTAIIFDASTPILRASGH 121

Query: 233 HFVFIGA--AAGAGRSAAALIQASTRSCFYAGFA----ARRNFAEVIAKGEAQGMVSKAI 286
             + + A   AGA R+    I    ++     FA       +  +V AK  ++  +   +
Sbjct: 122 ANLRVAALCLAGASRAICGTICGGAKAALTLHFATPLRGTGDIGDVNAKDASKETLLALM 181

Query: 287 GIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
           G++ G  + +H+ SS     A    +   H+  N    + + LRTLN  RASL ++ +  
Sbjct: 182 GMLAGSFVVHHL-SSTQATHALLLALLAAHLTVNYLGVRGVCLRTLNRQRASLAWTSFRE 240

Query: 347 SGQAPPVKEVNDEEPLF 363
            G AP    V   E + 
Sbjct: 241 YGSAPSPTIVAGMEGIL 257


>gi|242208267|ref|XP_002469985.1| predicted protein [Postia placenta Mad-698-R]
 gi|242226469|ref|XP_002477622.1| predicted protein [Postia placenta Mad-698-R]
 gi|220722555|gb|EED77179.1| predicted protein [Postia placenta Mad-698-R]
 gi|220731047|gb|EED84896.1| predicted protein [Postia placenta Mad-698-R]
          Length = 226

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 60/224 (26%), Positives = 115/224 (51%), Gaps = 11/224 (4%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAI--GLGK-GAIPTAAAINW 181
           LP GFP +V+     Y ++ ++Q   S ++G++A++A+L     G+GK GA  T A    
Sbjct: 6   LPAGFPATVSPG--KYQIYNALQAFCSSLAGLIASRAVLQGFRHGVGKAGASATHAIFLT 63

Query: 182 VLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAA 240
           VL+D    L+ I+   + G       K +RL AD+  +AA  L+ L+P      V     
Sbjct: 64  VLQDIFSRLTTIVAGYYLGTSLFPEAKAYRLMADIFNDAAIILDTLSPHLGSFRVASLCL 123

Query: 241 AGAGRSAAALIQASTRSCFYAGFAARR----NFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
           +GA R+   ++   +++     FA       +  ++ AK  ++  V   +G++ G A+ +
Sbjct: 124 SGAFRALCGVVAGGSKAALTVHFATTAKGAGDVGDLSAKDGSKETVLALLGMLSGTAVMH 183

Query: 297 HIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
           ++ S+    +  F+++  +H++CN  + + +  R+LN  RAS+V
Sbjct: 184 YVHSTRATYVVLFALIA-LHLFCNYLAVRVVVFRSLNRQRASIV 226


>gi|62320358|dbj|BAD94740.1| putative protein [Arabidopsis thaliana]
          Length = 295

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 62/259 (23%), Positives = 111/259 (42%), Gaps = 30/259 (11%)

Query: 211 LFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFA 270
           ++AD + +A    ++ T  +P  F+ + +     ++ A  ++  +       FA   N  
Sbjct: 1   MYADFIGSAGSFFDLATQLYPSQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLG 60

Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIE 328
           EV AK E   + ++ IG+  GI + +  G   S PF L +++ +  +H++   +S   ++
Sbjct: 61  EVAAKEEVWEVAAQLIGLGFGILIIDTPGLVKSFPFVLLTWTSIRLVHLWLRYQSLAVLQ 120

Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKD 388
             T+N  RA ++   +++    P   + N  E +     F K +                
Sbjct: 121 FNTVNLKRARIIVESHVVHSVVPGYVDCNKRENILLWQRFMKPRII-------------- 166

Query: 389 AAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT----EHGGKFCVVLKESALP 444
             V +E    L   +S V        AL  +Y  E YILT        +F V  K +A  
Sbjct: 167 FGVSLEELSGLEKSVSKV-------KALLKMYTKEKYILTLNKLNKDTEFSVSFKVNATS 219

Query: 445 QDMLKSLFQASYLYWLERN 463
           +D+L+ L+QA   YWLE N
Sbjct: 220 RDVLRCLWQA---YWLEEN 235


>gi|302417830|ref|XP_003006746.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261354348|gb|EEY16776.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 228

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 42/109 (38%), Positives = 63/109 (57%), Gaps = 4/109 (3%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG--KGAIPTAAAINWV 182
           LP  +PDSV+ DY+ Y ++ S Q   S I+G+LA +A+L  +G+G   G++  A  +   
Sbjct: 82  LPVDYPDSVSPDYIKYQMYDSFQAFFSTITGMLANRAILEGLGVGDASGSVNHALLLT-C 140

Query: 183 LKDGIGYLSKIMLS-NFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF 230
           LKDG+  L+ I  +  FG   + N K +R  ADL  + AF LE+  P F
Sbjct: 141 LKDGVSRLAMITFAYRFGSVIEANAKKYRFLADLFNDTAFFLELANPFF 189


>gi|388520739|gb|AFK48431.1| unknown [Medicago truncatula]
          Length = 236

 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 3/106 (2%)

Query: 114 DECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-- 171
           DE       +++PEGFPDSVT  Y+ Y  WR+++       GV  TQ LL ++G+ +   
Sbjct: 106 DEFISFVRSYVVPEGFPDSVTPSYVPYMTWRALKHFFGGAMGVFTTQTLLTSVGVSRNRA 165

Query: 172 -AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLL 216
                A AINW+L DG G + K++ +  G+ FD + K  R   DLL
Sbjct: 166 TPAAAAVAINWILNDGAGRVGKMLFARQGKKFDYDLKQLRFAGDLL 211


>gi|303288586|ref|XP_003063581.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454649|gb|EEH51954.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 290

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 64/251 (25%), Positives = 108/251 (43%), Gaps = 55/251 (21%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--GAIPTAAAIN 180
             LP G+PD+V+SDY  +  W         +  VL +Q+LL A+G+G   GA+P AAA  
Sbjct: 1   MFLPAGYPDTVSSDYARWLRWHLFSNFFRNVLEVLTSQSLLVALGMGSTPGALPLAAATK 60

Query: 181 WVLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGA 239
           WVLKDG+G  + ++    G + +D +PK                            F+ A
Sbjct: 61  WVLKDGVGSFATLLSGVVGAQRYDEDPK------------------------RCATFVRA 96

Query: 240 AAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM------VSKAIGIMLGIA 293
            A  GR+  +L+  +    F   F   +N  ++ AK E QG       +   I I  G++
Sbjct: 97  GALTGRT--SLVNGT----FMQHFGRNQNAGDIRAKLEVQGKWLALCALPTGIFIFRGVS 150

Query: 294 ---------------LANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRAS 338
                          L +H+  S+P  ++++  V   H + + K+   + +  LN +R  
Sbjct: 151 GAFEPLPPETLETMTLLDHLTRSLP-VISAYGAVVGGHSFLSWKAASVLRVDVLNRFRLL 209

Query: 339 LVFSEYLLSGQ 349
               E++  G+
Sbjct: 210 KAAREFVDGGE 220


>gi|151555803|gb|AAI49327.1| FLJ13868 protein [Bos taurus]
          Length = 158

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 38/74 (51%), Positives = 49/74 (66%), Gaps = 1/74 (1%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWV 182
            LP+GFPDSV+ DYL Y LW SVQ  AS +SG LAT A+L  IG+G   A  +AA   W+
Sbjct: 81  FLPQGFPDSVSPDYLPYQLWDSVQAFASSLSGSLATHAVLLGIGVGDAKASVSAATATWL 140

Query: 183 LKDGIGYLSKIMLS 196
           +KD  G L +I+ +
Sbjct: 141 VKDSTGMLGRIVFA 154


>gi|342186523|emb|CCC96010.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 596

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 110/248 (44%), Gaps = 25/248 (10%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
           F +P G+PDS    +  Y L        S  +  +A Q++L       G +  ++   W+
Sbjct: 140 FGMPTGYPDSCAEGFRRYFLLSLCSTFVSSFASSIAYQSIL------NGFLLVSSPQLWM 193

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAA 241
           LKD    L+   L+N    ++  PK W + + +L N +  LEM+ P+  PH+ +      
Sbjct: 194 LKDLAPALAAAYLANRVVSYENRPKFWFVVSVVLHNTSIVLEMIIPSVVPHNLLTAAVLT 253

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGS- 300
              R +A+L+   TR+     FA   N AE+  K  + G+V   +   LGIA   ++ S 
Sbjct: 254 SCVRQSASLMFLVTRASALQHFAVSGNLAELTKKFNSFGIVIYTVSTALGIAYTYYVASL 313

Query: 301 -----SMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP--- 352
                ++ F   +  V++++ M CN      I  R LN    S++   Y+  G   P   
Sbjct: 314 TAQLVTVLFCCGANLVLSYMSM-CN------IAFRVLNGTTISVILRAYVKEGCLGPGHV 366

Query: 353 --VKEVND 358
              +EV+D
Sbjct: 367 LSPREVSD 374


>gi|413942077|gb|AFW74726.1| hypothetical protein ZEAMMB73_469739 [Zea mays]
          Length = 425

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 60/266 (22%), Positives = 99/266 (37%), Gaps = 76/266 (28%)

Query: 112 LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALL-------- 163
           + D+  + F+    P G+P SV   YL Y+ +R +Q  +S +  VL+TQ +         
Sbjct: 67  IHDKLVDSFLNKFFPSGYPYSVNEGYLTYTKFRVLQHFSSAMLHVLSTQLVFQNRLLSVL 126

Query: 164 ---------YA----------------------------IGLGKGAIPTAAAINWVLKDG 186
                    YA                              LGK  +        +LKDG
Sbjct: 127 SSLTSAHFNYARQLTAQVGCGSVLAVCCRFATYPDASNCCKLGKHMMTEMLKSKQILKDG 186

Query: 187 IGYLSKIMLSNFGRHFDVNPKGWRLF-------------------------------ADL 215
           + +  K++ S+ G   D  PK WR+                                AD+
Sbjct: 187 MQHTRKLICSSMGARMDSEPKSWRILGLTAYVLYGCHCLGSHFTIFPIICLYCSILTADV 246

Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
           L +    LE+++P  P  F+ +       +    +   +TR   Y+ FA   N +++ AK
Sbjct: 247 LYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMVVVAARATRLPIYSSFAKEGNLSDLFAK 306

Query: 276 GEAQGMVSKAIGIMLGIALANHIGSS 301
           GEA   +   +GI  GI LA+ + S+
Sbjct: 307 GEAISTLFNVMGIGAGIGLASTVCST 332


>gi|414876406|tpg|DAA53537.1| TPA: hypothetical protein ZEAMMB73_477539 [Zea mays]
          Length = 295

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 73/309 (23%), Positives = 134/309 (43%), Gaps = 38/309 (12%)

Query: 211 LFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFA 270
           ++AD + +A    E+ T  +P +F+ + +     ++ A   +  +       FA   N  
Sbjct: 1   MYADFIGSAGSIFELSTTLYPGYFLPLASLGNLAKAIARGFKDPSFRVIQNHFAESGNLG 60

Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIG--SSMPFALASFSVVTWIHMYCNLKSYQSIE 328
           EV AK E   + ++ +G+ +G+ + +  G  SS    + ++  V  +H++   +S   ++
Sbjct: 61  EVAAKEEVWEVGAQLLGLSIGVLIMDKTGVKSSYLTLVLTWLSVRLLHLWLRYQSISVLK 120

Query: 329 LRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKD 388
            RT+N  R  ++   ++     P     N+EE +     + + + +              
Sbjct: 121 FRTINLKRGRILVKSHVAQHTVPGYVACNEEENILTWERYLRPQVS-------------- 166

Query: 389 AAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHGGK---FCVVLKESAL 443
             V +E R+  G K SD+VN       L  LY++E YIL   + G +   F V  KESA 
Sbjct: 167 FGVPME-RMLGGDKSSDMVNR------LLKLYKNEKYILFFEQFGSREPTFLVTFKESAT 219

Query: 444 PQDMLKSLFQASYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVG 503
              +L+SL+QA   +WL +N          +  P   LE S+  ++  F        S G
Sbjct: 220 SMSVLRSLWQA---HWLHKNQ-----LRQDEVFPW--LEESIVALENGFTDFLEQLGSAG 269

Query: 504 WVTDGLIAR 512
           W  D +I +
Sbjct: 270 WDQDLIILK 278


>gi|303281718|ref|XP_003060151.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458806|gb|EEH56103.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 814

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 60/115 (52%), Gaps = 2/115 (1%)

Query: 116 CRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIP 174
           C     +  LPEG+P SV+SDYL + L  + QGV S   G+L + ALL   G+G   A  
Sbjct: 229 CLRALRRAFLPEGYPRSVSSDYLAFQLCDAAQGVCSYARGILCSTALLRGAGVGSSTATA 288

Query: 175 TAAAINWVLKDGIGYLSKIML-SNFGRHFDVNPKGWRLFADLLENAAFGLEMLTP 228
            +A   +V +D    L  +   +  GR  D + K WRLFAD + +    +E+ +P
Sbjct: 289 ASATAQFVARDLTSMLGGVAFAATRGRAMDADAKRWRLFADAMNDLGMAIELASP 343


>gi|414586416|tpg|DAA36987.1| TPA: hypothetical protein ZEAMMB73_386837 [Zea mays]
          Length = 155

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 37/130 (28%), Positives = 68/130 (52%)

Query: 189 YLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAA 248
           +  K++ S+ G   D  PK WR+ AD+L +    LE+++P  P  F+ +       +  A
Sbjct: 3   HAGKLICSSMGARMDSEPKSWRILADVLYDLGTALEVVSPLCPQLFLEVAGLGNFAKGMA 62

Query: 249 ALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALAS 308
            +   +TR   Y+ FA   N +++ AKGEA   +   +GI  GI LA+ + S+    L +
Sbjct: 63  VVAARATRLPIYSSFAKEGNLSDLFAKGEAISTLFNVMGIGAGIGLASTVCSTTQGKLIA 122

Query: 309 FSVVTWIHMY 318
             +++ +H++
Sbjct: 123 GPLLSAVHIW 132


>gi|301123209|ref|XP_002909331.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262100093|gb|EEY58145.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 122

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/125 (35%), Positives = 64/125 (51%), Gaps = 19/125 (15%)

Query: 160 QALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENA 219
           Q+LLY  GL  GAIPTAAA+N VL+DG+G                    WR+ + +  + 
Sbjct: 2   QSLLYDAGLVAGAIPTAAAVNRVLRDGLGQ-------------------WRITSSVALDV 42

Query: 220 AFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
           A   E+LTP  P  F+ I +     ++ A L  ++TR+ F+  F  R N A+V AK   Q
Sbjct: 43  AVLGEILTPLAPGSFLVIASLTNVAKNVAWLSASATRAGFHNSFVIRENLADVTAKTGPQ 102

Query: 280 GMVSK 284
           G+  +
Sbjct: 103 GIAQQ 107


>gi|325088852|gb|EGC42162.1| DUF647 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 507

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 75/322 (23%), Positives = 135/322 (41%), Gaps = 44/322 (13%)

Query: 166 IGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGL 223
           +G+G   A PT A ++ VL+D +G ++ I+ ++  G   +   K +RL AD+L ++A   
Sbjct: 31  VGVGDASASPTVALLHSVLQDSMGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIF 90

Query: 224 EMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEV---------- 272
           + L+PA P    V + A +   R+   +   S+++   + FA   N  E+          
Sbjct: 91  DCLSPALPKRLRVVVLAFSSVLRALCGVAAGSSKASLSSHFAKCGNLGELNAVRFPNVLV 150

Query: 273 ---------------------IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFS 310
                                + K  +Q  V   +GI+ G  + +HI  S PFA   +  
Sbjct: 151 IAISGAAQRGFPFLFSDADQKLQKDSSQETVISLLGILCGSVVVSHI--STPFATWTTLL 208

Query: 311 VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFK 370
           ++  +H+  N  + +S+ + TLN  R ++VFS     G      E +  E +F      +
Sbjct: 209 LLLLVHLSTNYAAVRSVNMTTLNRQRTNIVFSTLFEKGSVLTPTEASKCERIFERDGILR 268

Query: 371 IKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--T 428
            K++  S  L            +   +   + + DV     D + L  L+E E Y+L   
Sbjct: 269 WKAS--SATLGYCQIGGSFQELLRGSIHGANSIRDV---GIDIYKLLRLFEKEEYVLWFN 323

Query: 429 EHGGKFCVVLKESALPQDMLKS 450
               K  +VLK +  P   L++
Sbjct: 324 PIHKKGTIVLKNNVTPISQLQA 345


>gi|255945167|ref|XP_002563351.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211588086|emb|CAP86157.1| Pc20g08280 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 504

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 81/294 (27%), Positives = 130/294 (44%), Gaps = 39/294 (13%)

Query: 125 LPEGFPDSVTSDYLNYSLW-----RSVQGVA-----SQISGVLATQALLYAIGLGKG-AI 173
           LP G+P SV+ DYL   L      RS+   A     +Q +  L T      +G+G   A 
Sbjct: 70  LPSGYPQSVSDDYLPPPLLPSGSSRSISHPAQLHRPTQRAKQL-TPTQHTGVGVGNANAS 128

Query: 174 PTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
           PT+A +  +L+D  G ++ I  ++  G   +   K +RL AD+  + A  L+ L+P  P 
Sbjct: 129 PTSALLLHILQDISGRIATICFAHRIGTALEPECKTYRLAADVFNDIAMILDCLSPGVPA 188

Query: 233 --HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIML 290
               V + + AG  R+   +   S+++   A FA   N AE+ AK  +Q  V    G+++
Sbjct: 189 GPARVAVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQETVISLFGMLV 248

Query: 291 GIALANHIGSSMPFALASFSVVTWIHMY--------CNLKSYQSIELRTLNPYRASLVFS 342
           G  + +HI         SFS  TW+ +          N  + +++++ +LN  RA++VFS
Sbjct: 249 GSVVISHI--------TSFST-TWLILLILLALHLSMNYAAVRAVQMTSLNRQRANIVFS 299

Query: 343 EYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHR 396
               S +       N    L P        S  KS+  VL+         I HR
Sbjct: 300 TLFDSDR-------NLSLDLDPELDTTPSLSKTKSEWTVLTPAQVSKHERIFHR 346


>gi|224010239|ref|XP_002294077.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970094|gb|EED88432.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 354

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 53/170 (31%), Positives = 85/170 (50%), Gaps = 20/170 (11%)

Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPTAAAINWVLKDGI 187
           +P +V   YL Y  + S+QG++S + GV++T A+L A G+G   A   +AA+ W ++DG+
Sbjct: 19  YPHTVAPGYLEYQFYDSLQGLSSYLRGVVSTTAVLSAAGVGDAAATAMSAAMTWAIRDGL 78

Query: 188 GYLSKIMLSNFG-RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRS 246
           G +  ++ S     HFD + K +RL AD+L +    L+M   A P    +  A       
Sbjct: 79  GMVGGLLFSYVASSHFDAHVKEFRLVADVLNDVGLTLDM---ALPVVLSWTRA------- 128

Query: 247 AAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALAN 296
                   T+      F+   N A+V AK   Q  +   IG+ LG+ L+N
Sbjct: 129 --------TKGNITDHFSISGNRADVSAKESTQETLVSLIGMALGVWLSN 170


>gi|384494915|gb|EIE85406.1| hypothetical protein RO3G_10116 [Rhizopus delemar RA 99-880]
          Length = 188

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 54/97 (55%), Gaps = 2/97 (2%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-GAIPT 175
           ++   +  LP G+P+SV S Y  +  W  ++       GVL +QA+L ++GLG   A   
Sbjct: 51  KQNLTEMFLPVGYPESVHSCYKKFHSWLFLETYVGSAVGVLCSQAMLASLGLGTVEATGG 110

Query: 176 AAAINWVLKDGIGYLSKI-MLSNFGRHFDVNPKGWRL 211
           A AI WVLKDGIG + K+  +  +   FD +PK W+ 
Sbjct: 111 AVAIQWVLKDGIGEIGKLFFIKKYASTFDSHPKTWKF 147


>gi|146096657|ref|XP_001467882.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398020910|ref|XP_003863618.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134072248|emb|CAM70952.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322501851|emb|CBZ36933.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 468

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 54/228 (23%), Positives = 97/228 (42%), Gaps = 8/228 (3%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
           F +P+GFPD+  + +  +          S  +  +  Q+LL    LG      ++   W+
Sbjct: 33  FGMPKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG------SSPQLWM 86

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
           LKD +  L    ++N    ++  PK W + +  + N     +ML P+  P+H +      
Sbjct: 87  LKDLVPALFAAYMANKVVSYESRPKFWFVVSVFINNLTVISDMLIPSVLPNHLLAAAIIT 146

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              + ++AL+   TR+     +A   N AE+  K  + GMVS  +   LGI    +I  S
Sbjct: 147 STVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATALGILYCTYIA-S 205

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
               L + +     +M+ +  S   I  R LN     L+   ++L  +
Sbjct: 206 FTVQLITVAACCVTNMFLSSMSMMPITFRLLNFDTMKLLLRAFILEKR 253


>gi|157874033|ref|XP_001685512.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68128584|emb|CAJ08716.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 468

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 54/225 (24%), Positives = 97/225 (43%), Gaps = 8/225 (3%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
           F +P+GFPD+  + +  +          S  +  +  Q+LL    LG      ++   W+
Sbjct: 33  FGMPKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG------SSPQLWM 86

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
           LKD +  L    ++N    ++  PK W + +  + N     +ML P+  P+H +      
Sbjct: 87  LKDLVPALFAAYMANKVVSYESRPKFWFVVSVFVNNLTVISDMLIPSVLPNHLLAAAIIT 146

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              + ++AL+   TR+     +A   N AE+  K  + GMVS  +   LGI    +I +S
Sbjct: 147 STVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATALGILYCTYI-AS 205

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLL 346
               L + +     +M+ +  S   I  R LN     L+   ++L
Sbjct: 206 FTVQLITVAACCVTNMFLSSMSMMPITFRLLNVDTMKLLLGAFIL 250


>gi|340059715|emb|CCC54109.1| conserved hypothetical protein, fragment, partial [Trypanosoma
           vivax Y486]
          Length = 618

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 56/233 (24%), Positives = 102/233 (43%), Gaps = 8/233 (3%)

Query: 125 LPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWVLK 184
           +P G+P S    +  Y L        S  +  +  Q++L       G    ++   W+LK
Sbjct: 182 MPSGYPHSCGVGFRRYFLLSLCSSSVSSFASSIGFQSIL------NGFFLASSPQLWMLK 235

Query: 185 DGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAAGA 243
           D +  L+   L+N    ++  PK W + +  + N +   EM+ P+  P H +F       
Sbjct: 236 DLVPALAAAYLANRIVSYENRPKFWFVVSVGMHNLSVIAEMIVPSLVPQHLLFAAVTTSC 295

Query: 244 GRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMP 303
            R +A+L+   TR+     FA   N AE+  K  + G+V   +   LGIA  + + +S+ 
Sbjct: 296 IRQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGIVIYTVATALGIAFTS-VVTSLT 354

Query: 304 FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEV 356
             L +  +   +++  + +S   I  R LN    S++  EY+ +G     + V
Sbjct: 355 AQLITVLMCCAVNLIISHQSMCKIAFRMLNGTTISVILHEYIQNGSTCSCRRV 407


>gi|242208074|ref|XP_002469889.1| predicted protein [Postia placenta Mad-698-R]
 gi|220731120|gb|EED84968.1| predicted protein [Postia placenta Mad-698-R]
          Length = 214

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 112/215 (52%), Gaps = 9/215 (4%)

Query: 138 LNYSLWRSVQGVASQISGVLATQALL--YAIGLGK-GAIPTAAAINWVLKDGIGYLSKIM 194
           + Y ++ ++Q   S ++G++A++A+L  +  G+GK GA  T A    VL+D    L+ I+
Sbjct: 1   VRYQIYNALQAFCSSLAGLIASRAVLQGFRHGVGKAGASATHAIFLTVLQDIFSRLTTIV 60

Query: 195 LSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQA 253
              + G       K +RL AD+  +AA  L+ L+P      V     +GA R+   ++  
Sbjct: 61  AGYYLGTSLFPEAKAYRLTADIFNDAAIILDTLSPHLGSFRVASLCLSGAFRALCGVVAG 120

Query: 254 STRSCFYAGFAARR----NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASF 309
            +++     FA       +  ++ AK  ++  V   +G++ G A+ +++ S+    +  F
Sbjct: 121 GSKAALTVHFATTAKGAGDVGDLSAKDGSKETVLALLGMLSGTAVMHYVHSTRATYVVLF 180

Query: 310 SVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEY 344
           +++  +H++CN  + + +  R+LN  RAS+V++ Y
Sbjct: 181 ALIA-LHLFCNYLAVRVVVFRSLNRQRASIVWAAY 214


>gi|299740308|ref|XP_001838896.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
 gi|298404184|gb|EAU82827.2| aldo-keto reductase [Coprinopsis cinerea okayama7#130]
          Length = 768

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/236 (23%), Positives = 110/236 (46%), Gaps = 16/236 (6%)

Query: 109 VNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL 168
           + KL D  + +F    LP G+P+SVT DYL Y +  ++Q   S ++ +++++A L   G+
Sbjct: 320 LQKLRDFGKAVF----LPAGYPNSVTPDYLTYQVLNALQAFCSSLASLISSRATLQGFGV 375

Query: 169 GKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEML 226
           G   A    A +  +L D    L+ I+ ++  G       K +R  AD+L + A  L+ L
Sbjct: 376 GSASASANDALLLTILLDIYSRLTTIVSAHLLGSSLATEAKKYRFVADILNDVAVVLDTL 435

Query: 227 TPAF-----PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAA----RRNFAEVIAKGE 277
           +P F     P   V     + + R+   +    +++     FA     + +  ++ AK  
Sbjct: 436 SPVFKAQGYPQLRVVALCLSASCRALCGIAAGGSKAAISVHFATPVKGKGDLGDLNAKDS 495

Query: 278 AQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLN 333
           ++  V    G++LG  +   + ++       F +++ +H+  N  +   + LRT+N
Sbjct: 496 SKETVLALFGMLLGSLIVPRLQTAWSTYTTLFILLS-LHIAINYVAVTGLVLRTMN 550


>gi|401427109|ref|XP_003878038.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322494285|emb|CBZ29584.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 468

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/228 (23%), Positives = 98/228 (42%), Gaps = 8/228 (3%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
           F +P+GFP++  + +  +          S  +  +  Q+LL    LG      ++   W+
Sbjct: 33  FGMPKGFPETTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG------SSPQLWM 86

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
           LKD +  L    ++N    ++  PK W + +  + N     +ML P+  P+H +      
Sbjct: 87  LKDLVPALFAAYMANKVVSYENRPKFWFVVSVFMSNLMVISDMLIPSVLPNHLLAAAIIT 146

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              + ++AL+   TR+     +A   N AE+  K  + GMVS  +   LGI    +I  S
Sbjct: 147 STVKQSSALMYFVTRASALQHYAINNNLAELTKKFNSFGMVSYTVATALGILYCTYIA-S 205

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
               L + +    ++M+ +  S   I  R LN     L+   ++L  +
Sbjct: 206 FTVQLITVAACCVMNMFLSSMSMMPITFRLLNFDTMKLLLRAFILEKR 253


>gi|429329557|gb|AFZ81316.1| hypothetical protein BEWA_007250 [Babesia equi]
          Length = 743

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/242 (23%), Positives = 97/242 (40%), Gaps = 31/242 (12%)

Query: 130 PDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI---------- 179
           P   T  Y  Y+ WR  +  +  +   +A+        L +  +     I          
Sbjct: 176 PSVATKQYYKYAHWRMFERFSFSVLQTMASNIKSAPTNLQQNKVENTNIILMLKNKISES 235

Query: 180 -------NWVLKDGIGYLSKIM-LSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF- 230
                    + KD    +   +  S  G  FD NPK +RL   +L + A  ++ +   F 
Sbjct: 236 KMKSILATMIFKDAFSRIFHFLWFSEIGVGFDNNPKAFRLLGSILCSTATLIDFVCNVFT 295

Query: 231 --PHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARR---NFAEVIAKGEAQGMVSKA 285
             P   + +GA   A R    L  ++++  FY  F  +    N  E+ AK EAQ  +   
Sbjct: 296 FGPK--MIMGACTNAVRQIGLLTMSASQGPFYNSFHVKNTATNIGEITAKLEAQNPICDF 353

Query: 286 IGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSY---QSIELRTLNPYRASLVFS 342
            GI LGI L N +    PF + + +V+    +  N+ +Y   +S+  + LNP R  +V  
Sbjct: 354 SGIALGIYLTNLLNEQ-PFVIQATTVI-LTSLISNVATYMCVKSVSFKNLNPQRCFVVLE 411

Query: 343 EY 344
           ++
Sbjct: 412 DF 413


>gi|147803314|emb|CAN66551.1| hypothetical protein VITISV_039658 [Vitis vinifera]
          Length = 470

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 80/358 (22%), Positives = 142/358 (39%), Gaps = 75/358 (20%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK--------GAIP 174
           F+LP GFP  +   Y+                  L  Q  + A+G+G          A  
Sbjct: 116 FILPAGFPVFLFYAYME-----------------LKNQKFV-AVGVGSFSATTAAASAAA 157

Query: 175 TAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF 234
           +AAAI WV KDGIG + ++ +   G       KG++   DL           T  +P +F
Sbjct: 158 SAAAIRWVSKDGIGAVGRLFI---GLSVIKRAKGYKSIFDL----------STQLYPAYF 204

Query: 235 VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIAL 294
           + + +     ++ A  ++  +       FA   N  EV AK E   + ++ +G+ LGI +
Sbjct: 205 LQLASLGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLLGLALGIMV 264

Query: 295 ANH--IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPP 352
                + +S P    ++  +  +H++   +S   +  +++N  RA ++   ++L    P 
Sbjct: 265 LTSPLVATSYPVLAFTWMSMRLLHLWLRYQSLSVLRFKSINLKRARILVKSHVLHSIVPG 324

Query: 353 VKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE- 411
             + N +E +     F K                         R+  G  L +++  +  
Sbjct: 325 YVDCNRKEKILSWQRFLK------------------------PRIIFGVSLEELIGGERP 360

Query: 412 --DAHALFSLYEDEGYILT--EHGGKFCVVL--KESALPQDMLKSLFQASYLYWLERN 463
                 L  LY  E YIL   + G  F V +  K  A    +L+S++Q+   YWL+ N
Sbjct: 361 VSKVRTLLKLYSQEKYILVVKQQGTDFEVFVSFKVGATSLSVLRSIWQS---YWLQEN 415


>gi|71414333|ref|XP_809272.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70873629|gb|EAN87421.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 630

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 64/249 (25%), Positives = 109/249 (43%), Gaps = 14/249 (5%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTA 176
           + LF+ F +P G+P+S    +  Y L        S  +  +  Q++L    LG      +
Sbjct: 156 QRLFL-FGMPRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLG------S 208

Query: 177 AAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFV 235
           +   W++KD +  L+   L+N    ++  PK W   +  L+N +   EML P+  P H +
Sbjct: 209 SPQLWMMKDLLPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLL 268

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
                    R +A+LI   +R+     FA   N AE+  K  + GMV   I    GI + 
Sbjct: 269 LGAIMTSCARQSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-MY 327

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG-----QA 350
             + +S+   LA+      +++  + +S   I  R LN    S+V  +Y+  G       
Sbjct: 328 TSVVTSLVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGIHRRHI 387

Query: 351 PPVKEVNDE 359
           P  +EV+D 
Sbjct: 388 PTPQEVSDR 396


>gi|71661222|ref|XP_817635.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70882839|gb|EAN95784.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 630

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 64/249 (25%), Positives = 109/249 (43%), Gaps = 14/249 (5%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTA 176
           + LF+ F +P G+P+S    +  Y L        S  +  +  Q++L    LG      +
Sbjct: 156 QRLFL-FGMPRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLG------S 208

Query: 177 AAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFV 235
           +   W++KD +  L+   L+N    ++  PK W   +  L+N +   EML P+  P H +
Sbjct: 209 SPQLWMMKDLLPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLL 268

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
                    R +A+LI   +R+     FA   N AE+  K  + GMV   I    GI + 
Sbjct: 269 LGAIMTSCARQSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-MY 327

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG-----QA 350
             + +S+   LA+      +++  + +S   I  R LN    S+V  +Y+  G       
Sbjct: 328 TSVVTSLVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRGGIHRRHI 387

Query: 351 PPVKEVNDE 359
           P  +EV+D 
Sbjct: 388 PTPQEVSDR 396


>gi|83767160|dbj|BAE57299.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 165

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 47/144 (32%), Positives = 81/144 (56%), Gaps = 4/144 (2%)

Query: 133 VTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLS 191
           +TS   + +L  S+Q  +S I+G+L+++A+L  +G+G   A PTAA +  +L+D  G +S
Sbjct: 13  ITSVTNSLTLKDSLQAFSSSIAGLLSSRAVLQGVGVGNADASPTAALLLHILQDTSGRIS 72

Query: 192 KIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF--VFIGAAAGAGRSAA 248
            I+ ++  G   +   K +RL AD+  +AA  L+ L+P  P  F  V + + AG  R+  
Sbjct: 73  TILFAHRVGTALEPECKMYRLAADVFNDAAMILDCLSPMIPAGFGRVTVLSTAGVLRALC 132

Query: 249 ALIQASTRSCFYAGFAARRNFAEV 272
            +   S+++   A F+   N AEV
Sbjct: 133 GVAGGSSKASLSAHFSRWGNLAEV 156


>gi|390341857|ref|XP_003725545.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform 1
           [Strongylocentrotus purpuratus]
          Length = 311

 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 55/249 (22%), Positives = 102/249 (40%), Gaps = 31/249 (12%)

Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
           + +AA  +E+++  FP +FV I   +   +S   +   +TR+      A R N A+V AK
Sbjct: 1   MNDAAMCIELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAK 60

Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
             +Q  +     + +G+ +      ++    + F + T +H+Y N ++   + + TLN  
Sbjct: 61  DGSQETLVNLAALFVGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQN 120

Query: 336 RASLVFSEYLLSGQ---APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
           R  ++  +YL S Q   + P   VN  EP+                +  LS   K     
Sbjct: 121 RFHILVQDYLHSSQFSMSGP-DAVNSREPV----------------IWRLSRPLK----- 158

Query: 393 IEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFC--VVLKESALPQDMLKS 450
               L LG     V  +  D          E ++L+    K C  +V+   +  +D +K+
Sbjct: 159 ----LNLGVPFRTVAKSMRDIEMCLQSNGQEKFLLSLDVKKGCIDIVIHHDSTSEDFIKA 214

Query: 451 LFQASYLYW 459
             QA  + +
Sbjct: 215 CLQAEIIMY 223


>gi|407850424|gb|EKG04824.1| hypothetical protein TCSYLVIO_004113 [Trypanosoma cruzi]
          Length = 631

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 63/249 (25%), Positives = 109/249 (43%), Gaps = 14/249 (5%)

Query: 117 RELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTA 176
           + LF+ F +P G+P+S    +  Y L        S  +  +  Q++L    LG      +
Sbjct: 156 QRLFL-FGMPRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLG------S 208

Query: 177 AAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFV 235
           +   W++KD +  L+   L+N    ++  PK W   +  L+N +   EML P+  P H +
Sbjct: 209 SPQLWMMKDLLPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLL 268

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
                    R +A+LI   +R+     FA   N AE+  K  + GM+   I    GI + 
Sbjct: 269 LGAIMTSCARQSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMMIYTIFTAFGI-MY 327

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG-----QA 350
             + +S+   LA+      +++  + +S   I  R LN    S+V  +Y+  G       
Sbjct: 328 TSVVTSLVAQLATVLFCCALNLVLSYQSMSKIAFRILNATTLSVVLRDYVGRGGIHRRHI 387

Query: 351 PPVKEVNDE 359
           P  +EV+D 
Sbjct: 388 PTPQEVSDR 396


>gi|154343241|ref|XP_001567566.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134064898|emb|CAM43006.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 439

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 55/230 (23%), Positives = 97/230 (42%), Gaps = 12/230 (5%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
           F +P+GFPD+  + +  +          S  +  +  Q+LL    LG      ++   W+
Sbjct: 4   FGMPKGFPDTTAAGFQRFFYLSLTSSFVSNFASSIGYQSLLSGFFLG------SSPQLWM 57

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAF-PHHFVFIGAAA 241
           LKD    L    ++N    ++  PK W + + L+ N     +ML P+  P+H +      
Sbjct: 58  LKDLAPALLAAYMANQVVSYENRPKFWFVVSVLMNNIMVMSDMLIPSVVPNHLLAAAIVT 117

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              + ++AL+    R+     +A   N AE+  K  + GMVS  I   LGI    +I S 
Sbjct: 118 STVKQSSALMYFVARASTLQHYAINNNLAELTKKFNSFGMVSYTIATALGILYCTYIAS- 176

Query: 302 MPFALASFSVVTWI--HMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
             F +  F+V      +M  + +    I  R LN     L+   +++  +
Sbjct: 177 --FTMQLFTVAVCCAANMLLSSRMMMPISFRLLNFDTMKLLLRAFIMEKR 224


>gi|326469821|gb|EGD93830.1| hypothetical protein TESG_01362 [Trichophyton tonsurans CBS 112818]
          Length = 413

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 75/297 (25%), Positives = 136/297 (45%), Gaps = 19/297 (6%)

Query: 166 IGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGL 223
           +G+G   A PTAA +  VL++ +G ++ I+ ++  G   +   K +RL AD+L ++A  L
Sbjct: 40  VGVGDATASPTAALLLNVLQESMGRIATILFAHRLGTSLEPECKMYRLAADILNDSAMVL 99

Query: 224 EMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMV 282
           + L+P FP    V + + +   R+   +   S+++   A FA   N  E+ A G      
Sbjct: 100 DCLSPIFPKPVRVGLLSLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNAVG-LNIPP 158

Query: 283 SKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFS 342
           S+  G++L     ++I S +   +A   V+  +H+  N  + +++++ TLN  RA++VFS
Sbjct: 159 SQLTGVVL-----SYITSPLETWIA-LIVLLIVHLGTNHAAVRAVKMTTLNRQRANIVFS 212

Query: 343 EYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSAN-----KSQLLVLSSEAKDAAVEIEHR- 396
                       E + EE +F      + K+ +       Q+     +      E   R 
Sbjct: 213 HLFEDNLILTPAETSKEERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRG 272

Query: 397 -LQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLKS 450
            L+     S  ++   +  AL  L+ +E YIL       +  +VLK  A  Q  LK+
Sbjct: 273 ALETSRTTSTTMDASTNLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKA 329


>gi|74026006|ref|XP_829569.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70834955|gb|EAN80457.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 595

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 62/242 (25%), Positives = 105/242 (43%), Gaps = 13/242 (5%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
           F +P G+PDS    +  Y L        S  +  +  Q++L       G +  ++   W+
Sbjct: 164 FGMPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSIL------NGFLLASSPQLWM 217

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
           LKD    L+   L+N    ++  PK W +F+  L N +   EM+ P   P+H +      
Sbjct: 218 LKDLAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVPNHLLAAAILT 277

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              R +A+L+   TR+     FA   N AE+  K  + GMV   +   LGIA  + +  S
Sbjct: 278 SCVRQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALGIAYTSLV-PS 336

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS-----GQAPPVKEV 356
           +   LA+  +    ++  +  S  +I  R LN    S++   Y+       G+    +EV
Sbjct: 337 VTAQLATVLMCCGANLAISHLSMCNIAFRILNETTLSVILRFYMREEGHQRGRVLSPREV 396

Query: 357 ND 358
           +D
Sbjct: 397 SD 398


>gi|407411065|gb|EKF33281.1| hypothetical protein MOQ_002854, partial [Trypanosoma cruzi
           marinkellei]
          Length = 636

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 58/227 (25%), Positives = 99/227 (43%), Gaps = 8/227 (3%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
           F +P G+P+S    +  Y L        S  +  +  Q++L    LG      ++   W+
Sbjct: 167 FGMPRGYPESCEVGFRRYFLLSLCSSSVSSFASSIGFQSILNGFFLG------SSPQLWM 220

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
           +KD +  L+   L+N    ++  PK W   +  L+N +   EML P+  P H +      
Sbjct: 221 MKDLLPALAAAYLANRIVSYENRPKFWFFVSVGLQNLSVVAEMLVPSLLPQHLLLGAIMT 280

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              R +A+LI   +R+     FA   N AE+  K  + GMV   I    GI +   + +S
Sbjct: 281 SCARQSASLIFLVSRAAALQHFAVSNNLAELTKKLNSFGMVIYTIFTAFGI-MYTSVVTS 339

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSG 348
           +   LA+      +++  + +S   I  R LN    S+V  +Y+  G
Sbjct: 340 LVAQLATVLFCCALNLVLSYQSMSKIAFRILNVTTLSVVLRDYVGRG 386


>gi|449280556|gb|EMC87826.1| UPF0420 protein [Columba livia]
          Length = 111

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 48/67 (71%), Gaps = 1/67 (1%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
           L  + +LP+GFP+SV+SDYL Y LW ++Q + S ++G +A +++L A+G+G   A  T A
Sbjct: 18  LVQRVLLPQGFPESVSSDYLGYQLWDALQALCSSLAGAVAARSVLQAVGVGDTNASVTGA 77

Query: 178 AINWVLK 184
           A+ W+L+
Sbjct: 78  ALGWLLR 84


>gi|390341859|ref|XP_795645.3| PREDICTED: UPF0420 protein C16orf58 homolog isoform 2
           [Strongylocentrotus purpuratus]
          Length = 311

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 54/249 (21%), Positives = 101/249 (40%), Gaps = 31/249 (12%)

Query: 216 LENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAK 275
           + +AA  +E+++  FP +FV I   +   +S   +   +TR+      A R N A+V AK
Sbjct: 1   MNDAAMCIELISVFFPQYFVIIACVSSLFKSMVGVAGGATRAALTMHQARRNNMADVSAK 60

Query: 276 GEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPY 335
             +Q  +     + +G+ +      ++    + F + T +H+Y N ++   + + TLN  
Sbjct: 61  DGSQETLVNLAALFVGLIITPIASKNILLTWSLFFLCTCLHLYANYRAVTCVVMETLNQN 120

Query: 336 RASLVFSEYLLSGQ---APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
           R  ++  +YL S Q   + P   VN  EP+                +  LS   K     
Sbjct: 121 RFHILVQDYLHSSQFSMSGP-DAVNSREPV----------------IWRLSRPLK----- 158

Query: 393 IEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFC--VVLKESALPQDMLKS 450
               L LG     V  +  D          E ++L+    K C  + +   +  +D +K+
Sbjct: 159 ----LNLGVPFRTVAQSMRDLEMCLQSNGQEKFLLSFDVKKGCIDIAIHHDSTSEDFIKA 214

Query: 451 LFQASYLYW 459
             QA  + +
Sbjct: 215 CLQAEIIMY 223


>gi|261335583|emb|CBH18577.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 595

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 62/242 (25%), Positives = 105/242 (43%), Gaps = 13/242 (5%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINWV 182
           F +P G+PDS    +  Y L        S  +  +  Q++L       G +  ++   W+
Sbjct: 164 FGMPAGYPDSCGEGFRRYFLLSLCSSFVSSFASSIGYQSIL------NGFLLASSPQLWM 217

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA-FPHHFVFIGAAA 241
           LKD    L+   L+N    ++  PK W +F+  L N +   EM+ P   P+H +      
Sbjct: 218 LKDLAPALAAAYLANRVISYENRPKFWFVFSVALHNLSVIAEMIIPTTVPNHLLAAAILT 277

Query: 242 GAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSS 301
              R +A+L+   TR+     FA   N AE+  K  + GMV   +   LGIA  + +  S
Sbjct: 278 SCVRQSASLMFLVTRASALQHFAISNNLAELTKKFNSFGMVIYTVSTALGIAYTSLV-PS 336

Query: 302 MPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLS-----GQAPPVKEV 356
           +   LA+  +    ++  +  S  +I  R LN    S++   Y+       G+    +EV
Sbjct: 337 VTAQLATVLMCCGANLAISHLSMCNIAFRILNETTLSVILRFYMREEGHQRGRVLSPREV 396

Query: 357 ND 358
           +D
Sbjct: 397 SD 398


>gi|397621626|gb|EJK66381.1| hypothetical protein THAOC_12704 [Thalassiosira oceanica]
          Length = 480

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 47/133 (35%), Positives = 68/133 (51%), Gaps = 3/133 (2%)

Query: 99  SNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLA 158
           +  SLSSL S   +      LF    LP G P +    YL+Y +   VQG+ S + GV++
Sbjct: 37  AGGSLSSL-SPRSVPSALSGLFGDLFLPAGHPSTTAGGYLHYQVCDGVQGLCSYLRGVVS 95

Query: 159 TQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSNFG-RHFDVNPKGWRLFADLL 216
           T A L A G+G   A    AA+ W ++DG+G L  +  S +    FD   K +RL AD+ 
Sbjct: 96  TGAALAAAGVGDSDATAMGAAVTWAVRDGLGMLGGLAFSYYASSKFDSRVKEYRLLADVA 155

Query: 217 ENAAFGLEMLTPA 229
            +A   L+M  P+
Sbjct: 156 NDAGMILDMARPS 168


>gi|330932017|ref|XP_003303615.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
 gi|311320239|gb|EFQ88270.1| hypothetical protein PTT_15911 [Pyrenophora teres f. teres 0-1]
          Length = 391

 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 89/372 (23%), Positives = 152/372 (40%), Gaps = 57/372 (15%)

Query: 167 GLGKG---AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFG 222
           G+G G   A PT A +  V+    G ++ I+ ++  G   +   K +R  AD+  + A  
Sbjct: 25  GVGVGDSTATPTTALLLSVIHSSTGRIATILFAHRLGTALEPECKMYRFAADIFNDTAMV 84

Query: 223 LEMLTPAFPHHF-VFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
           L+ L+PAFP    V + + +    +   +   S+++   A FA + N  EV AK  +Q  
Sbjct: 85  LDCLSPAFPKPIRVAVLSFSSCLSALCGVCAGSSKASLSAHFARKGNLGEVNAKDSSQET 144

Query: 282 VSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLV 340
           V   +G++ G  + ++I S  P A   +  ++  IH+  N  + +++ +  LN  RA+++
Sbjct: 145 VISLLGMLAGSVVISYITS--PLATWGTLILLLTIHLATNYAAVRAVIMHCLNRQRANIL 202

Query: 341 FSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLG 400
           FS     G     ++V+  E +F            +  +L  S +      ++  R  +G
Sbjct: 203 FSNMFQHGLVLSPRDVSQRERVF-----------ERGGVLRWSDD------KVLGRCSIG 245

Query: 401 SKLSDVVNNKEDAHA--------------LFSLYEDEGYIL-----TEHGGKFCVVLKES 441
             L  ++N     H               L  ++ +E YIL      E G    +VLK +
Sbjct: 246 VPLQRLLNRLGTPHKQTGSLTLKSMEISDLLDVFANEAYILLPASTVEEG---LIVLKRT 302

Query: 442 ALPQDMLK----SLFQASYLYWLERNAGIVATSTSADCAPG-----GRLEISLDYVQREF 492
             P D +K    +L  A     LE  AG+      A   P        L  SL  VQ  F
Sbjct: 303 CEPIDQIKAWAHALLLAKRREGLE-AAGLGRKRDDASHGPAMEGLISELRESLKEVQAMF 361

Query: 493 NHVKSDSASVGW 504
                +    GW
Sbjct: 362 ARYGDEMRDKGW 373


>gi|425769885|gb|EKV08364.1| hypothetical protein PDIP_68690 [Penicillium digitatum Pd1]
 gi|425771531|gb|EKV09972.1| hypothetical protein PDIG_59240 [Penicillium digitatum PHI26]
          Length = 406

 Score = 62.0 bits (149), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 97/197 (49%), Gaps = 21/197 (10%)

Query: 166 IGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGL 223
           +G+G   A PT+A +  +L+D  G ++ I  ++  G   +   K +RL AD+  + A  L
Sbjct: 6   VGVGNASASPTSALLLQILQDISGRIATICFAHRVGTALEPECKTYRLAADVFNDIAMIL 65

Query: 224 EMLTPAFPH--HFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 281
           + L+P  P     V + + AG  R+   +   S+++   A FA   N AE+ AK  +Q  
Sbjct: 66  DCLSPGVPAGPARVIVLSTAGVLRALCGVAGGSSKASLSAHFAKWGNLAELNAKDSSQET 125

Query: 282 VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMY--------CNLKSYQSIELRTLN 333
           V    G+++G  + +HI         SFS  TW+ +          N  + +++ + +LN
Sbjct: 126 VISLFGMLVGSVVISHI--------TSFST-TWLVLLFLLALHLSMNYAAVRAVRMTSLN 176

Query: 334 PYRASLVFSEYLLSGQA 350
             RA++VFS  L S ++
Sbjct: 177 RQRANIVFSALLASDKS 193


>gi|215259609|gb|ACJ64296.1| conserved hypothetical protein [Culex tarsalis]
          Length = 234

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 90/207 (43%), Gaps = 25/207 (12%)

Query: 254 STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
           +TRS      A R N A+V +K  AQ      I   +G+ L  H+ +        F  +T
Sbjct: 3   ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHVQNHRVL-YGLFLFLT 61

Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKS 373
            +H+Y N+K+ Q++ LRT N  R  +   EY  SG     ++VN            K++ 
Sbjct: 62  LLHIYPNIKAVQAVCLRTFNEARYLIALEEYFKSGSMLSPQQVN------------KLER 109

Query: 374 ANKSQLLVLSSEAKD--AAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHG 431
               Q + L++  K   +A E+ H  +    L +V+       A F     E +++ E  
Sbjct: 110 VTIGQTVSLTARVKMGCSARELAHYYRNCYDLENVI-------ACFD--SREKFLIAETR 160

Query: 432 GKFCVVLKESALPQDMLKSLFQ-ASYL 457
               V L  +  P D++KS F  ASYL
Sbjct: 161 NYIGVYLHFTVKPLDIIKSYFYVASYL 187


>gi|159490370|ref|XP_001703152.1| hypothetical protein CHLREDRAFT_195109 [Chlamydomonas reinhardtii]
 gi|158270782|gb|EDO96617.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 280

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/94 (38%), Positives = 51/94 (54%), Gaps = 4/94 (4%)

Query: 124 MLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGL-GKGAIPTAAAINWV 182
           M P+ F     + Y  Y  WR VQ        V  TQ+LL A+G+ G+ +   AAAINWV
Sbjct: 1   MAPQRF---CVASYAPYMAWRGVQYFFGGAISVFTTQSLLGALGVAGRYSGEAAAAINWV 57

Query: 183 LKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLL 216
           +KDG G L +++ + +GR  D   K +RL   L+
Sbjct: 58  IKDGAGRLGRLLFARWGRELDCELKQFRLMGYLV 91



 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 34/75 (45%), Gaps = 2/75 (2%)

Query: 290 LGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 349
           LG  L    G  +   L  F ++   ++  + K   S+EL  +N  R +    +YL  GQ
Sbjct: 65  LGRLLFARWGRELDCELKQFRLMG--YLVASRKEVDSVELPYMNRARLAYAARQYLTGGQ 122

Query: 350 APPVKEVNDEEPLFP 364
            P V E N  EPL P
Sbjct: 123 VPGVAEANHHEPLLP 137


>gi|326479045|gb|EGE03055.1| DUF647 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 494

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/341 (22%), Positives = 135/341 (39%), Gaps = 66/341 (19%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAI 179
            +   LP G+P SVT DY+ Y ++ S+  +A+          +L+A  LG    P     
Sbjct: 126 ILNVFLPAGYPHSVTDDYIEYQIYESMGRIAT----------ILFAHRLGTSLEPEC--- 172

Query: 180 NWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIG 238
                                      K +RL AD+L ++A  L+ L+P FP    V + 
Sbjct: 173 ---------------------------KMYRLAADILNDSAMVLDCLSPIFPKPVRVGLL 205

Query: 239 AAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHI 298
           + +   R+   +   S+++   A FA   N  E+ A               +G  + ++I
Sbjct: 206 SLSSVLRALCGVAAGSSKASLSAHFARWGNLGELNA---------------VGSVVVSYI 250

Query: 299 GSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVND 358
            S +   +A   V+  +H+  N  + +++++ TLN  RA++VFS            E + 
Sbjct: 251 TSPLETWIA-LIVLLIVHLGTNHAAVRAVKMTTLNRQRANIVFSHLFEDNLILTPAETSK 309

Query: 359 EEPLFPAFHFFKIKSAN-----KSQLLVLSSEAKDAAVEIEHR--LQLGSKLSDVVNNKE 411
           EE +F      + K+ +       Q+     +      E   R  L+     S  ++   
Sbjct: 310 EERIFERDGVLRWKAESSGILGTCQIGTSLEQLLLLLPEQADRGALETSRTTSTTMDAST 369

Query: 412 DAHALFSLYEDEGYIL--TEHGGKFCVVLKESALPQDMLKS 450
           +  AL  L+ +E YIL       +  +VLK  A  Q  LK+
Sbjct: 370 NLTALLELFREEEYILYFGPLSRRGAIVLKAGATAQAQLKA 410


>gi|215259607|gb|ACJ64295.1| conserved hypothetical protein [Culex tarsalis]
          Length = 234

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 53/206 (25%), Positives = 87/206 (42%), Gaps = 23/206 (11%)

Query: 254 STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
           +TRS      A R N A+V +K  AQ      I   +G+ L  H+ +        F  VT
Sbjct: 3   ATRSALTQHHAIRGNLADVASKDSAQETCVNLIASFVGLGLLTHVQNQRVL-YGLFLFVT 61

Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKS 373
            +H+Y N+K+ +++ LRT N  R  +   EY  SG     ++VN            K++ 
Sbjct: 62  LLHIYANIKAVKAVCLRTFNEARYLIALEEYFKSGSMLSPQQVN------------KLER 109

Query: 374 ANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYED-EGYILTEHGG 432
               Q + L++  K              +L+    N  D   + + ++  E +++ E   
Sbjct: 110 VTIGQTVSLTARVKMGCS--------ARELAQYYRNCYDLENVIACFDSREKFLIAETRN 161

Query: 433 KFCVVLKESALPQDMLKSLFQ-ASYL 457
              V L  +  P D++KS F  ASYL
Sbjct: 162 YVGVYLHFTVKPLDIIKSYFYVASYL 187


>gi|397575094|gb|EJK49531.1| hypothetical protein THAOC_31586, partial [Thalassiosira oceanica]
          Length = 194

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 46/77 (59%), Gaps = 1/77 (1%)

Query: 120 FVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAAA 178
            V   LP  +P+SV S Y  ++ +     +A   + VL+TQALL A+G+G + A P AAA
Sbjct: 114 IVHHFLPAKYPESVCSSYSTFASYCMCANIAGSAAMVLSTQALLVAVGVGSQSAAPMAAA 173

Query: 179 INWVLKDGIGYLSKIML 195
           +NW+LKD  G L  ++ 
Sbjct: 174 LNWLLKDFAGQLGGVLF 190


>gi|300175334|emb|CBK20645.2| unnamed protein product [Blastocystis hominis]
          Length = 290

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 88/206 (42%), Gaps = 21/206 (10%)

Query: 225 MLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSK 284
           M+TP FP+ F+ +   +  G+S + L   +T +    GFA   N  +V AK  +Q     
Sbjct: 1   MITPLFPNCFLALATISNIGKSISFLANGATGAGIRYGFANAMNIGDVTAKEGSQTSAVY 60

Query: 285 AIGIMLGIALANHIG-SSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSE 343
            +G++ G+++++ +G  SM    AS   ++ + +Y   KS + + L TLN  R+ L   +
Sbjct: 61  LLGMLFGVSVSSLLGDQSMSTIFASVICMSCLSLYAMNKSLRCVTLPTLNTQRSELACVD 120

Query: 344 YLLSGQA-PPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSK 402
           Y   G    P      E    P  H                    D   +I   + L S 
Sbjct: 121 YCSQGIIYQPEAVAKLERYYLPYLH------------------VADCGTDIRLGMSLHSV 162

Query: 403 LSDVVNNKEDAHALFSLYEDEGYILT 428
           LS    + E+A  +  L+ D  Y+L+
Sbjct: 163 LS-AYRSLEEADDVIQLFGDCNYMLS 187


>gi|409046959|gb|EKM56438.1| hypothetical protein PHACADRAFT_253558 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 464

 Score = 58.9 bits (141), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 80/373 (21%), Positives = 148/373 (39%), Gaps = 56/373 (15%)

Query: 123 FMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINW 181
             LP G+P SV+ DY  Y +  ++QG  S + G++A +A+L   G+G   A  T A +  
Sbjct: 1   MFLPAGYPASVSGDYTRYQVLNALQGFFSSLGGLVAARAVLEGHGVGSADASATNALLLT 60

Query: 182 VLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH---HFVFI 237
           +L+D    ++ I+   + G       K +R  AD+  +    L+ L+P   H    FV++
Sbjct: 61  ILQDIFSRVTAILSGYYLGTALFPEAKTYRFLADIWNDIPLVLDTLSPLLSHVSLDFVWL 120

Query: 238 G---------------------AAAGAGRSAAALIQASTRSCFYAGF------AARRNFA 270
                                   +G  RS  A+  A +++     F      AA  + +
Sbjct: 121 AFPPFLLPTLVHRPGPWRGIALCFSGIFRSMCAVAAAGSKAALTLHFSQPDHGAASGDIS 180

Query: 271 EVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELR 330
           ++ AK  ++  V   +G++ G  +   I +         +V+   H+Y N  + +S+  +
Sbjct: 181 DLSAKDGSKETVLALVGMLCGSIIMPLIQTQTQ-TCGLLAVIIVFHIYINYYAVRSVVFK 239

Query: 331 TLNPYRASLVFSEYLLS---GQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAK 387
           + N  R  +++S +  S    +    +EV   E LF           N S L    ++  
Sbjct: 240 SYNRQRTCILWSAFRRSEGEHRCLVPREVARHEHLF----------HNPSTL--FDTDLS 287

Query: 388 DAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDM 447
             A  +     +GS L+ ++    DA         +G +   H         ES L   +
Sbjct: 288 TGARRVIGHCDMGSPLASILPCSPDAR--------KGLLARRHQCPRWAGAAESTLVGSL 339

Query: 448 LKSLFQASYLYWL 460
           L++     Y+ W 
Sbjct: 340 LQAFVDEKYVLWF 352


>gi|134084659|emb|CAK43337.1| unnamed protein product [Aspergillus niger]
          Length = 484

 Score = 58.5 bits (140), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 83/169 (49%), Gaps = 14/169 (8%)

Query: 117 RELFVQFMLPEGFPDSVTSDYL-NYSLWRSV-QGVASQIS------GVLATQALLYAIGL 168
           ++  V+  LP G+P SVT DY   Y   R + +G++  +        +L + A    +G+
Sbjct: 69  QDFLVEVFLPAGYPHSVTDDYAPPYQTRRLIHEGLSPSLQQLDCWPPILESSAARLPLGV 128

Query: 169 GKG---AIPTAAAINWVLKDGIGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLE 224
           G G   A PTAA +  +L+D  G ++ I+ ++  G   +   K +RL AD+  + A  L+
Sbjct: 129 GVGNADASPTAALLLHILQDTSGRIATILFAHRVGTALEPECKMYRLAADIFNDLAMILD 188

Query: 225 MLTPAFP--HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAE 271
            L+P  P     V + + AG  R+   +   S+++   A F+   N AE
Sbjct: 189 CLSPMIPAGAPRVTVLSTAGVLRALCGVAGGSSKASLSAHFSRWGNLAE 237


>gi|3242710|gb|AAC23762.1| hypothetical protein [Arabidopsis thaliana]
          Length = 405

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 42/63 (66%), Gaps = 1/63 (1%)

Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGI 187
            P  V+ +Y++Y  W+ +  V S    VLATQA+  AIG+G+  ++ ++AA NW+LKDG+
Sbjct: 124 LPKQVSDNYISYVKWKFLHRVFSSALQVLATQAMFRAIGIGQSRSLASSAAFNWILKDGL 183

Query: 188 GYL 190
           G L
Sbjct: 184 GRL 186


>gi|313238188|emb|CBY13282.1| unnamed protein product [Oikopleura dioica]
          Length = 93

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 26/70 (37%), Positives = 42/70 (60%), Gaps = 1/70 (1%)

Query: 119 LFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG-KGAIPTAA 177
           L  +  LP+ +P+SV+ DY  Y +W S Q + S ++G LA  ++   IG+G   A   +A
Sbjct: 21  LICKVFLPKDYPNSVSEDYTTYQVWDSAQALCSSVTGSLAINSVFKGIGVGDSSASSLSA 80

Query: 178 AINWVLKDGI 187
             NW+++DGI
Sbjct: 81  TSNWIIRDGI 90


>gi|225560567|gb|EEH08848.1| DUF647 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 472

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 68/311 (21%), Positives = 122/311 (39%), Gaps = 65/311 (20%)

Query: 187 IGYLSKIMLSN-FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHF-VFIGAAAGAG 244
           +G ++ I+ ++  G   +   K +RL AD+L ++A   + L+PA P    V + A +   
Sbjct: 1   MGRIATILFAHRLGTSLEPECKMYRLAADVLNDSALIFDCLSPALPKRLRVVVLAFSSVL 60

Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEV-------------------------------I 273
           R+   +   S+++   + FA   N  E+                               +
Sbjct: 61  RALCGVAAGSSKASLSSHFAKWGNLGELNAVRFPNVLVIAISGAAQRGFPFLFSDADQKL 120

Query: 274 AKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQSIELRTL 332
            K  +Q  V   +G++ G  + +HI  S PFA   +  ++  +H+  N  + +S+ + TL
Sbjct: 121 QKDSSQETVISLLGMLCGSVVVSHI--STPFATWTTLLLLLLVHLSTNYAAVRSVNMTTL 178

Query: 333 NPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVE 392
           N  RA++VFS     G      E +  E +F      + K             A  AA+ 
Sbjct: 179 NRQRANIVFSTLFEKGSVLTPTEASKCERIFERDGILRWK-------------ASPAALG 225

Query: 393 IEHRLQLGSKLSDVVNNKE-----------DAHALFSLYEDEGYIL--TEHGGKFCVVLK 439
                Q+G    +++               D + L  L+E E Y+L       K  +VLK
Sbjct: 226 Y---CQIGGSFQELLRGSTHGANSIRDVGIDIYKLLRLFEKEEYVLWFNPIHKKGTIVLK 282

Query: 440 ESALPQDMLKS 450
            +  P   LK+
Sbjct: 283 NNVTPISQLKA 293


>gi|449682081|ref|XP_002168725.2| PREDICTED: UPF0420 protein C16orf58 homolog [Hydra magnipapillata]
          Length = 237

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 35/133 (26%), Positives = 61/133 (45%), Gaps = 13/133 (9%)

Query: 236 FIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALA 295
            +G A GA R+A  L QA            R N A+V AK  +Q  +   + +  G  + 
Sbjct: 7   IVGVAGGATRAALTLHQAR-----------RNNMADVSAKDGSQETLVNLMALFTGFLIT 55

Query: 296 NHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKE 355
             +  ++      F V T++H+Y N ++  ++ + T+N  R  ++   Y + G+    KE
Sbjct: 56  PLVVDNLFLTWYFFLVFTFLHLYANYQAVSTVVMETINLPRLHILVKHYFIYGKVLTPKE 115

Query: 356 VNDEEPLF--PAF 366
           VN  +P+   P F
Sbjct: 116 VNKVDPVLRRPGF 128


>gi|17945727|gb|AAL48912.1| RE31721p [Drosophila melanogaster]
          Length = 266

 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 54/243 (22%), Positives = 101/243 (41%), Gaps = 30/243 (12%)

Query: 221 FGLEM-LTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 279
            G+E+ + P +PH    I   +   ++   +   +TRS      A R N A+V +K  +Q
Sbjct: 1   MGIEIYVLPKYPHFSTQILCCSTLLKAIVGVAGGATRSALTQHHALRGNLADVASKDSSQ 60

Query: 280 GMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 339
                 +   +G+ L + I S        F VV  +H+Y NLK+ +++ LR+ N  R  +
Sbjct: 61  ETCVNLVASFVGLYLLSLIKSQAVL-YTIFYVVVSLHLYANLKAVRAVCLRSFNESRYLI 119

Query: 340 VFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL 399
              E+  S +    ++VN  E +                            V +   ++L
Sbjct: 120 ALEEFFRSSRMLSPQQVNAMERV-----------------------TVGQTVSVSLNIKL 156

Query: 400 GSKLSDVVNNKEDAHALFSLYED----EGYILTEHGGKFCVVLKESALPQDMLKSLFQA- 454
           G  + ++++  + +  + ++       E +I+ +      V L     PQD+LK+ F A 
Sbjct: 157 GLSVKNLIDEYKSSSVIENIVSSFDPHEHFIIAQTKKCLGVYLHFETRPQDVLKAYFFAV 216

Query: 455 SYL 457
           SYL
Sbjct: 217 SYL 219


>gi|384494916|gb|EIE85407.1| hypothetical protein RO3G_10117 [Rhizopus delemar RA 99-880]
          Length = 247

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/113 (24%), Positives = 57/113 (50%), Gaps = 1/113 (0%)

Query: 251 IQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFS 310
           I  ++   F   F+   N  +++AK +AQ   +  +G++ G+ L + I  S  F   +F+
Sbjct: 9   IWLASHMTFTKHFSPNGNIGDIVAKDDAQMSTAHLLGMLSGVGLIS-ISHSPLFLFGAFA 67

Query: 311 VVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
           V++ I+++  +K   + E   LN  + +L+  EY+  G     +++ D E  F
Sbjct: 68  VLSPINIWSTVKMLHAAEFEILNQAKLTLLAREYIDQGHVATYEKLRDREIGF 120


>gi|167378255|ref|XP_001734737.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165903647|gb|EDR29104.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 400

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 46/228 (20%), Positives = 95/228 (41%), Gaps = 14/228 (6%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINW 181
           + ++P G+P SV   Y  + ++  +   A+ +  V++  A+L       G    +AA   
Sbjct: 47  ELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL---RFNSGH---SAAFFV 100

Query: 182 VLKDGI-GYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEML---TPAFPHHFVFI 237
           V ++ + G +  I+   +G       K WR+  DL+      +E+    TP F    V  
Sbjct: 101 VFREVMTGMMHLIITERWGTSIVFFAKQWRMRIDLISELLRIIEIAFISTPIFYSMSVTC 160

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
               G  +++  +I+    +     +A   N AE+  K +    + + + +++G  + + 
Sbjct: 161 SVIEGVLQASRQVIKTRILN----NYAKGNNVAELTEKVQNLETLFRVVSLLIGWLILHV 216

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           +G      +  F      H  CN      I  +TLN  R +++ S ++
Sbjct: 217 LGDFTESGIIVFICALISHCVCNFLLSNVIVFKTLNYERLTILMSYFM 264


>gi|407034154|gb|EKE37096.1| hypothetical protein ENU1_209440 [Entamoeba nuttalli P19]
          Length = 400

 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 46/228 (20%), Positives = 96/228 (42%), Gaps = 14/228 (6%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINW 181
           + ++P G+P SV   Y  + ++  +   A+ +  V++  A+L  I  G      +AA   
Sbjct: 47  ELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL-RINPGH-----SAAFFV 100

Query: 182 VLKDGI-GYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEML---TPAFPHHFVFI 237
           V ++ + G +  I+   +G       K WR+  +L+      +E+    TP F    V  
Sbjct: 101 VFREVMTGMMHLIITERWGTSIVFFAKQWRMRIELISELLRIIEIAFISTPIFYSISVTC 160

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
               G  +++  +I+    +     +A   N AE+  K +    + + + +++G  + + 
Sbjct: 161 SVIEGVLQASRQVIKTRILN----NYAKGNNVAELTEKVQNLETLFRVVSLLIGWVILHV 216

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           +G      +  F      H  CN      I  +TLN  R +++ S ++
Sbjct: 217 LGDFTESGIIVFICALISHCICNFLLSNVIVFKTLNYERLTILMSYFM 264


>gi|67467345|ref|XP_649789.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56466294|gb|EAL44403.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449707066|gb|EMD46790.1| Hypothetical protein EHI5A_044670 [Entamoeba histolytica KU27]
          Length = 400

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 46/228 (20%), Positives = 96/228 (42%), Gaps = 14/228 (6%)

Query: 122 QFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKGAIPTAAAINW 181
           + ++P G+P SV   Y  + ++  +   A+ +  V++  A+L  I  G      +AA   
Sbjct: 47  ELIVPVGYPTSVERGYFRFVIFSVIMEAANMMMLVMSVSAIL-RINPGH-----SAAFFV 100

Query: 182 VLKDGI-GYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEML---TPAFPHHFVFI 237
           V ++ + G +  I+   +G       K WR+  +L+      +E+    TP F    V  
Sbjct: 101 VFREVMTGMMHLIITERWGTSIVFFAKQWRMRIELISELLRIIEIAFISTPIFYSISVTC 160

Query: 238 GAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANH 297
               G  +++  +I+    +     +A   N AE+  K +    + + + +++G  + + 
Sbjct: 161 SVIEGVLQASRQVIKTRILN----NYAKGNNVAELTEKVQNLETLFRVVSLLIGWLILHV 216

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYL 345
           +G      +  F      H  CN      I  +TLN  R +++ S ++
Sbjct: 217 LGDFTESGIIVFICALISHCICNFLLSNVIVFKTLNYERLTILMSYFM 264


>gi|402580829|gb|EJW74778.1| hypothetical protein WUBG_14315 [Wuchereria bancrofti]
          Length = 191

 Score = 46.6 bits (109), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 28/119 (23%), Positives = 58/119 (48%)

Query: 245 RSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPF 304
           R+   +  ++TR+      A   N  +V AK  +Q  +     ++  + L   +  +  F
Sbjct: 3   RTVVGVAGSATRTTVINHQAILDNVGDVAAKDGSQETLINVFALLCSLLLLPVVSKNAVF 62

Query: 305 ALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
               F + T+IH+Y N ++ + ++ RTLN     +V  +Y+ + +   V E+N++EP+ 
Sbjct: 63  VWLLFCLFTFIHLYGNYRAVKMLQFRTLNQSLLRIVVKDYIQTRKIGTVNEMNNKEPIL 121


>gi|332372572|gb|AEE61428.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score = 45.4 bits (106), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 58/253 (22%), Positives = 97/253 (38%), Gaps = 29/253 (11%)

Query: 254 STRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVT 313
           +TR+      A R N AE+ AK   Q  +   IG    I L N+  SS    +   S++ 
Sbjct: 42  ATRASITHHQAIRDNMAEISAKDGTQETIVNLIGSFGSIFLLNYFSSSRAEWILIISLMV 101

Query: 314 WIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKS 373
            +H+Y N  + +++   T N  R +LV   Y   G      +VN  E +   F       
Sbjct: 102 -LHLYTNFLAVKALIFNTFNRERLALVLKSYFTIGSVLNPAKVNVNESVLLGF------G 154

Query: 374 ANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN--KEDAHALFSLYEDEGYILTEHG 431
            N  ++                 + LG  +   +N+    D   L S+Y+D+ Y+L  +G
Sbjct: 155 LNVKRMCGFD-------------IVLGESIKKSLNDYAAADIKLLQSVYKDKKYMLLVNG 201

Query: 432 --GKFCVVLKESALPQDMLKSLFQASYL-----YWLERNAGIVATSTSADCAPGGRLEIS 484
              K  V L++     D++   F A  L      +   +  I A        P  RL   
Sbjct: 202 TKKKIYVCLEKGETAADVIAGYFHAVCLGIATSIYNSLDLDIYARRQLHHPTPITRLYTY 261

Query: 485 LDYVQREFNHVKS 497
           +   ++  NH ++
Sbjct: 262 MKSYEKSHNHFRN 274


>gi|321459665|gb|EFX70716.1| hypothetical protein DAPPUDRAFT_327911 [Daphnia pulex]
          Length = 269

 Score = 45.1 bits (105), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 40/142 (28%), Positives = 63/142 (44%), Gaps = 19/142 (13%)

Query: 139 NYSLWRSVQGVASQISGVLATQALLYAIGLGKG-AIPTAAAINWVLKDGIGYLSKIMLSN 197
           NYS+  ++Q     +SG L+T+A+L  +G+G   + P AA + W++KDG   +       
Sbjct: 6   NYSI-HAIQAFVGSLSGSLSTRAILEGVGVGSSTSTPLAATLMWLIKDGSTRM------- 57

Query: 198 FGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRS 257
            GR       G +L AD     + G              I   AG  +S   +   +TR+
Sbjct: 58  VGRILFAWCNGTKLDADTAVGGSNG----------TMTSILCVAGVAKSIVGVAGGATRA 107

Query: 258 CFYAGFAARRNFAEVIAKGEAQ 279
                 A + N A+V AK EA+
Sbjct: 108 ALTQHQARQGNLADVSAKVEAK 129


>gi|399218598|emb|CCF75485.1| unnamed protein product [Babesia microti strain RI]
          Length = 635

 Score = 43.5 bits (101), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 41/183 (22%), Positives = 76/183 (41%), Gaps = 20/183 (10%)

Query: 129 FPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGK-------GAIPTAAAINW 181
           +P  VTS+Y  Y+ WR ++  +  +S  ++   L   I L +       G I  A  I++
Sbjct: 116 YPSGVTSNYYKYANWRLLERFSHSMSQSISNNPLAAKILLNQSTYSTNDGVINIAYKIDF 175

Query: 182 --------VLKDGIGYLSKIM-LSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFPH 232
                   + K+ +  + K M + N G  FD+NP+ +  +  L+       + L+  F  
Sbjct: 176 IATNLMTQIFKEIVSRIVKTMWVGNIGIGFDMNPRAFCFWGSLICTITNTADFLSTLFNF 235

Query: 233 HFVFIGA-AAGAGRSAAALIQASTRSCFYAGFAARRNFA---EVIAKGEAQGMVSKAIGI 288
            ++ I        R    L  +++    Y  F ++ N +   E+ AK EA   +   +G 
Sbjct: 236 KYISITRHITTLFRQIGMLTTSASIGPIYGSFKSQNNVSNIGEITAKMEAVTPLCDLLGS 295

Query: 289 MLG 291
             G
Sbjct: 296 ATG 298


>gi|440799745|gb|ELR20789.1| hypothetical protein ACA1_055550 [Acanthamoeba castellanii str.
           Neff]
          Length = 275

 Score = 43.1 bits (100), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 51/101 (50%), Gaps = 3/101 (2%)

Query: 269 FAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIE 328
            A+V AK  +Q      +G++LG+ +A  +  +       F+V T  H++ N ++ + + 
Sbjct: 1   MADVAAKDGSQETAVTLVGLILGLIIAPLVDENYFLIWLLFAVFTTAHLFANYRAVRGLV 60

Query: 329 LRTLNPYRASLVFSEYLLSGQAP---PVKEVNDEEPLFPAF 366
           + TLN  R ++V + YL S       PV+    E PL+ AF
Sbjct: 61  MDTLNRQRLAIVVTHYLGSASREVLSPVQASAREWPLWFAF 101


>gi|12654037|gb|AAH00822.1| C16orf58 protein [Homo sapiens]
          Length = 231

 Score = 42.7 bits (99), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 33/63 (52%)

Query: 304 FALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 363
           F+L  F  +T +H+Y N ++ +++ + TLN  R  LV   YL  G+       N  EPL+
Sbjct: 22  FSLGCFFFLTALHIYANYRAVRALVMETLNEGRLRLVLKHYLQRGEVLDPTAANRMEPLW 81

Query: 364 PAF 366
             F
Sbjct: 82  TGF 84


>gi|422292653|gb|EKU19955.1| hypothetical protein NGA_2040500, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 181

 Score = 42.0 bits (97), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 38/163 (23%), Positives = 71/163 (43%), Gaps = 6/163 (3%)

Query: 268 NFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSI 327
           N AEV+AKG AQ       G+  G+  A     S      +++++T IH+  N  + + +
Sbjct: 20  NLAEVLAKGGAQHTAVSLFGLGFGMWFARVANQSPRRVWTAYTLLTVIHLTANYAAMRVL 79

Query: 328 ELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFF---KIKSA--NKSQLLVL 382
              ++N  R +++ + +  +       EV   E +      +   K+ +A   K +    
Sbjct: 80  AFTSINRRRLNVLLTAFRENASVLSPAEVARRETIIVDVGRWRPPKLWTALFQKGKRRAT 139

Query: 383 SSEAKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGY 425
           +  AKDA   I + ++LG ++ D+    E    L  L+  E Y
Sbjct: 140 AELAKDAHAPIWN-IRLGVRVKDLTRFAEGLTELQRLFSQEQY 181


>gi|154278545|ref|XP_001540086.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150413671|gb|EDN09054.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 425

 Score = 42.0 bits (97), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 92/213 (43%), Gaps = 16/213 (7%)

Query: 267 RNFAEVIAKGEAQGMVSKAIGIMLGIALANHIGSSMPFAL-ASFSVVTWIHMYCNLKSYQ 325
           ++F EV   G  + ++S  +G++ G  + +HI  S P A   +  ++  +H+  N  + +
Sbjct: 142 QSFCEVGKFGRVKCLIS-LLGMLCGSVVVSHI--STPSATWTTLLLLLLVHLSTNYAAVR 198

Query: 326 SIELRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSE 385
           S+ + TLN  RA++VFS     G      E +  E +F      + K++  +  L     
Sbjct: 199 SVNMTTLNRQRANIVFSTLFEKGSVLTPTEASKCERIFERDGILRWKASPAA--LGYCQI 256

Query: 386 AKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYIL--TEHGGKFCVVLKESAL 443
                  +   +   + + DV     D + L  L+E E Y+L       K  +VLK +  
Sbjct: 257 GGSFQELLRGSIHGANSIRDV---GIDIYKLLRLFEKEEYVLWFNPIHKKGTIVLKNNVT 313

Query: 444 PQDMLK----SLFQASYLYWLER-NAGIVATST 471
           P   LK    +L  A  L  L R  A +V  ST
Sbjct: 314 PISQLKAWSHALIVAKRLTDLRREEADMVNVST 346


>gi|194746211|ref|XP_001955574.1| GF16181 [Drosophila ananassae]
 gi|190628611|gb|EDV44135.1| GF16181 [Drosophila ananassae]
          Length = 287

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 23/141 (16%)

Query: 386 AKDAAVEIEHRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILT---------EHGGKFCV 436
            K   VE  H++++  ++ ++  +K+   A+   +E+E   LT          H  +F V
Sbjct: 101 VKHPLVEEVHQVEIDDRVVEL--SKQYLPAMACGFENEKLKLTIGDGFDYMKNHKNEFDV 158

Query: 437 VLKESALPQDMLKSLFQASYLYWLERNA----GIVATSTSADCAPGGRLEISLDYVQREF 492
           ++ +S+ P     SLFQ SY Y L ++A    GIV       C+ GG   + LDY+++  
Sbjct: 159 IITDSSDPIGPAVSLFQESY-YELMKHALKEDGIV-------CSQGGSFWLDLDYIKKTM 210

Query: 493 NHVKSDSASVGWVTDGLIARP 513
           +  K   A V +    + + P
Sbjct: 211 SGCKEHFAKVAYAVTSVPSYP 231


>gi|395331205|gb|EJF63586.1| hypothetical protein DICSQDRAFT_101426 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 285

 Score = 38.9 bits (89), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 46/192 (23%), Positives = 87/192 (45%), Gaps = 23/192 (11%)

Query: 175 TAAAINWVLKDGIGYLSKIMLSNF-GRHFDVNPKGWRLFADLLENAAFGLEMLTP--AF- 230
           T A +  VL+D    L+ I+   + G       K +RL AD+L +AA   + L+P  AF 
Sbjct: 16  TNAILLTVLQDVFSRLTTIVGGYYLGTSLTPEAKTYRLLADVLNDAAIISDTLSPYLAFV 75

Query: 231 ----PHHFVFIGAA----------AGAGRSAAALIQASTRSCFYAGFAAR----RNFAEV 272
                + FV  GA+          +GA R+    +   +++     FA       +  ++
Sbjct: 76  TLSTQYPFVRPGASSSLRVVALCLSGAFRALCGAVAGGSKAALTVHFATDGERPGDVGDL 135

Query: 273 IAKGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTL 332
            AK  ++  V   +G++ G  + +++ S+       F++V + H+  N  + + I +R  
Sbjct: 136 SAKDGSKETVLALLGMLCGSVVVHYVHSAQATYAVLFALV-FFHLAFNYMAVRVISMRYF 194

Query: 333 NPYRASLVFSEY 344
           N  RAS+ + +Y
Sbjct: 195 NRQRASIAWQKY 206


>gi|374921923|gb|AFA26139.1| hypothetical protein, partial [Lolium perenne]
          Length = 109

 Score = 38.5 bits (88), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 20/66 (30%), Positives = 37/66 (56%)

Query: 298 IGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAPPVKEVN 357
           I    P  +ASF+V++  ++  + +  +S+ L TLN  R ++    ++ +G  P +KE N
Sbjct: 6   ISKRNPSLVASFAVLSCGYLLSSYQEVRSVVLNTLNTARFTVAVDSFVKTGHVPSLKEGN 65

Query: 358 DEEPLF 363
            EE +F
Sbjct: 66  SEETVF 71


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.133    0.402 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,406,292,817
Number of Sequences: 23463169
Number of extensions: 345282346
Number of successful extensions: 1606304
Number of sequences better than 100.0: 543
Number of HSP's better than 100.0 without gapping: 444
Number of HSP's successfully gapped in prelim test: 99
Number of HSP's that attempted gapping in prelim test: 1602993
Number of HSP's gapped (non-prelim): 2078
length of query: 529
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 382
effective length of database: 8,910,109,524
effective search space: 3403661838168
effective search space used: 3403661838168
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)