BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 015194
         (411 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224068755|ref|XP_002302817.1| predicted protein [Populus trichocarpa]
 gi|118484264|gb|ABK94012.1| unknown [Populus trichocarpa]
 gi|222844543|gb|EEE82090.1| predicted protein [Populus trichocarpa]
          Length = 411

 Score =  706 bits (1823), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/410 (80%), Positives = 378/410 (92%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           +++RIRSRDGLERV++D  ++TV++LKT+IQ+QL+IP+ SQTLSTNQ+LLLAKSPSDLL+
Sbjct: 2   VMVRIRSRDGLERVSIDNTNITVSQLKTLIQNQLQIPIRSQTLSTNQSLLLAKSPSDLLK 61

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
           FTDM+NPD PLSSL+ISHGS++FL+YDGER + GP+  PAGSFGRKMT+DDLIAKQMRVT
Sbjct: 62  FTDMSNPDTPLSSLSISHGSLIFLAYDGERTIAGPAVRPAGSFGRKMTIDDLIAKQMRVT 121

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQENPHC+SVSFDRDCA+AFQ YVNETLAFAVKRGGFMYGTV E+ +VEV+FIYEPPQQG
Sbjct: 122 RQENPHCDSVSFDRDCANAFQHYVNETLAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQG 181

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           TEEVL +LRD +EEKLV+AI A LGM++VGFIFTQTIMQ+KKDYTLS+REVLQA E HAE
Sbjct: 182 TEEVLMLLRDSDEEKLVEAITACLGMRRVGFIFTQTIMQDKKDYTLSHREVLQAAELHAE 241

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
             ++EWVTAVVKLEVSE+G A +HFEAFQMSDMC+RLFKEGWFETEI E  DPKLSKMKK
Sbjct: 242 SELKEWVTAVVKLEVSEDGGADVHFEAFQMSDMCIRLFKEGWFETEIGEDADPKLSKMKK 301

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           DVVVG KDVKEVDNDFFLVVVKILDH GPLSSTFPIENR TQVTMRAL+SHL+R+ +LP 
Sbjct: 302 DVVVGTKDVKEVDNDFFLVVVKILDHLGPLSSTFPIENRITQVTMRALRSHLDRAKNLPF 361

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
           VKRISDFHLLLFLA++LDLNSDVPALA+CV AQTAVPEGY++LI SMA T
Sbjct: 362 VKRISDFHLLLFLAKYLDLNSDVPALAECVLAQTAVPEGYQILIESMATT 411


>gi|356505489|ref|XP_003521523.1| PREDICTED: NPL4-like protein 1-like [Glycine max]
          Length = 413

 Score =  672 bits (1735), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/412 (78%), Positives = 363/412 (88%), Gaps = 2/412 (0%)

Query: 1   MLLRIRSRDGLERVTVDGAH-VTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDGLERV+V+  H  TV+ LK II++QL IPV +QTLSTNQNLLLAKS  DL 
Sbjct: 1   MMLRIRSRDGLERVSVENPHSTTVSYLKRIIETQLGIPVHNQTLSTNQNLLLAKSLEDLH 60

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRV 119
           +FTDM+NPD PLSSLN+ HGS+VFL+Y+GER V GP+FNPAGSFGRKMTMDDLIAKQMRV
Sbjct: 61  RFTDMSNPDAPLSSLNLGHGSMVFLAYEGERRVAGPAFNPAGSFGRKMTMDDLIAKQMRV 120

Query: 120 TRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQ 179
           TRQENPHCE VSFDRDCA+AFQ YVN+TLAFAVKRGGFMYGTV E+ +VEV+FIYEPPQQ
Sbjct: 121 TRQENPHCELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQ 180

Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHA 239
           G+E+ L   RD EEEK V+AIA GLGM KVGFIFTQTI Q+KKDYTLSNREVLQAVE+HA
Sbjct: 181 GSEDNLVFFRDTEEEKFVEAIAVGLGMTKVGFIFTQTITQDKKDYTLSNREVLQAVEYHA 240

Query: 240 ECNMEEWVTAVVKLEVSEE-GSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKM 298
           E  ++EWVTAVVKLE +EE G A +HFEAFQMSD+CVRLFKEGWFETEI E DDPKLSKM
Sbjct: 241 ESGLKEWVTAVVKLEANEEMGGADVHFEAFQMSDVCVRLFKEGWFETEIKEDDDPKLSKM 300

Query: 299 KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSL 358
           KKDVVVG KD +EVDNDFFLVVVKI DHQGPLSS+FPIENR TQV M+ALK+HL+R+ +L
Sbjct: 301 KKDVVVGVKDTREVDNDFFLVVVKIADHQGPLSSSFPIENRNTQVPMKALKNHLDRTKNL 360

Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
           P VKRISDFHLLL LAR LDLN+DVPAL +CVQ QT+VPEGY++LI SMA T
Sbjct: 361 PFVKRISDFHLLLVLARVLDLNADVPALTECVQTQTSVPEGYQILIESMAST 412


>gi|356572673|ref|XP_003554491.1| PREDICTED: NPL4-like protein 1-like [Glycine max]
          Length = 413

 Score =  670 bits (1728), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/412 (78%), Positives = 362/412 (87%), Gaps = 2/412 (0%)

Query: 1   MLLRIRSRDGLERVTVDGA-HVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDGLERV+V+     TV++LK IIQ+QL IPV +QTLSTNQNLLLAKS  DLL
Sbjct: 1   MMLRIRSRDGLERVSVENPLATTVSDLKRIIQTQLGIPVHNQTLSTNQNLLLAKSLEDLL 60

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRV 119
           +FTDM+N D  LSSLN+ HGSIVFL+Y+GER V GP+FNPAGSFGRKMTMDDLIAKQMRV
Sbjct: 61  RFTDMSNLDASLSSLNLGHGSIVFLAYEGERRVAGPAFNPAGSFGRKMTMDDLIAKQMRV 120

Query: 120 TRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQ 179
           TRQENPHCE VSFDRDCA+AFQ YVN+TLAFAVKRGGFMYGTV E  +VEV+FIYEPPQQ
Sbjct: 121 TRQENPHCELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGTVSEVGKVEVDFIYEPPQQ 180

Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHA 239
           G+E+ L   RD EEEK V+AIA GLGM+KVGFIFTQTI Q+KKDYTLSNREVLQAVE+HA
Sbjct: 181 GSEDNLVFFRDPEEEKFVEAIAVGLGMRKVGFIFTQTITQDKKDYTLSNREVLQAVEYHA 240

Query: 240 ECNMEEWVTAVVKLEVSEE-GSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKM 298
           E  ++EW TAVVKLEV+EE G A +HFEAFQMSD+CVRLFKEGWFETEI E DDPKLSKM
Sbjct: 241 ESGLKEWATAVVKLEVNEEMGGADVHFEAFQMSDVCVRLFKEGWFETEIKEDDDPKLSKM 300

Query: 299 KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSL 358
           KKDVVVG KD +EVDNDFFLVVVKI DHQGPLSS+FPIENR TQVTM+ALK+HL RS +L
Sbjct: 301 KKDVVVGVKDTREVDNDFFLVVVKIADHQGPLSSSFPIENRNTQVTMKALKNHLERSKNL 360

Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
           P VKRISDFHLLL LAR LDLN+DVPAL +CVQ QT+VPEGY++LI SMA T
Sbjct: 361 PFVKRISDFHLLLVLARVLDLNADVPALTECVQTQTSVPEGYQILIESMAST 412


>gi|449456429|ref|XP_004145952.1| PREDICTED: NPL4-like protein 1-like [Cucumis sativus]
          Length = 411

 Score =  657 bits (1695), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/408 (76%), Positives = 360/408 (88%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M+LRIRSRDGLERV V+  H+T+A+LK IIQSQL+IP+ +QTLSTNQN+LLAK+  DL +
Sbjct: 1   MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQLKIPIHNQTLSTNQNILLAKTQDDLSK 60

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
           FTDM+NP+  LSSLN+SHGSIVFL+Y+GER V GP+ +PAGSFGRKMTMDDLIAKQMR+T
Sbjct: 61  FTDMSNPNTYLSSLNLSHGSIVFLAYEGERTVAGPTVHPAGSFGRKMTMDDLIAKQMRIT 120

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQENPHCE VSFDRDCA+AFQ YVNETLAFAVKRGG MYGTV  + +VEV+FIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           TE+ L   RD +EE+LV+AIA GLGM+KVGFIFTQTI Q+KKDYTLSNREVLQA +FH+E
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
             ++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKE WFET+I E  DPKLSKMKK
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKECWFETDIGEDFDPKLSKMKK 300

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           DVVVG KD ++VDNDFFLVVVKILDHQGPLS+TFPIENR   VTM+ALK+HL+RS  LP 
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKILDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLPF 360

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           VKRISDFHLLL LAR LD++SDVPALA+CVQ QT VPEGYK+LI SMA
Sbjct: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTGVPEGYKILIESMA 408


>gi|449497385|ref|XP_004160387.1| PREDICTED: LOW QUALITY PROTEIN: NPL4-like protein 1-like [Cucumis
           sativus]
          Length = 411

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/408 (76%), Positives = 360/408 (88%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M+LRIRSRDGLERV V+  H+T+A+LK IIQSQL+IP+ +QTLSTNQN+LLAK+  DL +
Sbjct: 1   MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQLKIPIHNQTLSTNQNILLAKTQDDLSK 60

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
           FTDM+NP+  LSSLN+SHGSIVFL+Y+GER V GP+ +PAGSFGRKMTMDDLIAKQMR+T
Sbjct: 61  FTDMSNPNTYLSSLNLSHGSIVFLAYEGERTVAGPTVHPAGSFGRKMTMDDLIAKQMRIT 120

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQENPHCE VSFDRDCA+AFQ YVNETLAFAVKRGG MYGTV  + +VEV+FIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGXMYGTVSPEGKVEVDFIYEPPQQG 180

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           TE+ L   RD +EE+LV+AIA GLGM+KVGFIFTQTI Q+KKDYTLSNREVLQA +FH+E
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
             ++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKE WFET+I E  DPKLSKMKK
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKECWFETDIGEDFDPKLSKMKK 300

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           DVVVG KD ++VDNDFFLVVVKILDHQGPLS+TFPIENR   VTM+ALK+HL+RS  LP 
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKILDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLPF 360

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           VKRISDFHLLL LAR LD++SDVPALA+CVQ QT VPEGYK+LI SMA
Sbjct: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTGVPEGYKILIESMA 408


>gi|225436827|ref|XP_002270748.1| PREDICTED: NPL4-like protein 2 [Vitis vinifera]
          Length = 411

 Score =  655 bits (1691), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 306/408 (75%), Positives = 360/408 (88%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           ML+RIRSRDGLER+T+D  H T+++LKT+IQ  L IP+ SQTLSTNQNLLLAK+PSD+ +
Sbjct: 1   MLIRIRSRDGLERLTLDNPHATISQLKTLIQQHLHIPISSQTLSTNQNLLLAKTPSDIAR 60

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
           FTDM++P  P+++L ++HGSIV+L++D +R V GP+F+PAGSFGR+MTMDDLIAKQMRVT
Sbjct: 61  FTDMSDPQTPIAALGVTHGSIVYLAHDTQRTVSGPTFSPAGSFGRRMTMDDLIAKQMRVT 120

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQE PHCESVSFDRD A+AFQ YVNETL FAVKRGGFMYGTV +D  V V+FIYEPPQQG
Sbjct: 121 RQETPHCESVSFDRDAANAFQHYVNETLVFAVKRGGFMYGTVADDGAVRVDFIYEPPQQG 180

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           TEE L ++RD +EE+LVDAIA GLGM++VGFIFTQTI QNKKDYT+SN EVLQA E H E
Sbjct: 181 TEENLILMRDTDEERLVDAIAMGLGMRRVGFIFTQTISQNKKDYTMSNSEVLQAAELHGE 240

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
             ++EWVTAVVKL V+E+  A +HFEAFQMSDMCVRLFKEGWFET+I E  DPKLS+MKK
Sbjct: 241 SGLKEWVTAVVKLTVNEDDGADVHFEAFQMSDMCVRLFKEGWFETDIGEEVDPKLSRMKK 300

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           DVVVG KD +EVDNDFFLV+VKILDHQGPLSSTFPIENR TQVTMRALK+HL+R+ +LP 
Sbjct: 301 DVVVGVKDTREVDNDFFLVLVKILDHQGPLSSTFPIENRITQVTMRALKNHLDRAKNLPF 360

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           VKRISDFHLLL L+RFLD+NSDVPALA+CVQ Q+AVPEGY+LLI S+A
Sbjct: 361 VKRISDFHLLLLLSRFLDINSDVPALAECVQTQSAVPEGYQLLIESLA 408


>gi|358348088|ref|XP_003638081.1| NPL4-like protein [Medicago truncatula]
 gi|355504016|gb|AES85219.1| NPL4-like protein [Medicago truncatula]
          Length = 414

 Score =  650 bits (1678), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/411 (75%), Positives = 360/411 (87%), Gaps = 3/411 (0%)

Query: 1   MLLRIRSRDGLERVTVDGAH-VTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDGLERVT+D  +  TV++LKTII+ QLRIP  +QTLSTNQNLLLAKS  DLL
Sbjct: 1   MMLRIRSRDGLERVTLDNPNNKTVSDLKTIIEKQLRIPTHNQTLSTNQNLLLAKSLQDLL 60

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPS-FNPAGSFGRKMTMDDLIAKQMR 118
            FTDM+NP+ PL++LN+SHGSI+FL+Y+GER V+GP+ F+PAGSFG+KMTMDDLIAKQMR
Sbjct: 61  LFTDMSNPNTPLTALNLSHGSIIFLAYEGERTVQGPAAFSPAGSFGKKMTMDDLIAKQMR 120

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQ 178
           V+RQENPHCE VSFDRDCA+AFQ YVNE LAFAVKRGGFMYGTV E+ +VEVNFIYEPPQ
Sbjct: 121 VSRQENPHCELVSFDRDCANAFQHYVNENLAFAVKRGGFMYGTVTEEGKVEVNFIYEPPQ 180

Query: 179 QGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFH 238
           QG E+ L   RD EEEK V+AIAAGLGMK+VGFIFTQ++ Q+KKDYTLS REVLQA E+H
Sbjct: 181 QGLEDNLLFFRDPEEEKCVEAIAAGLGMKRVGFIFTQSVSQDKKDYTLSYREVLQAAEYH 240

Query: 239 AECNMEEWVTAVVKLEVSEE-GSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSK 297
           AE  ++EWVTAVVKLEV+EE G A +HFEAFQMSD+C+RLFKEGWF+TE+ E  DPKLS 
Sbjct: 241 AESGLKEWVTAVVKLEVNEEMGGADVHFEAFQMSDVCIRLFKEGWFDTEVKEDHDPKLSI 300

Query: 298 MKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPS 357
           MKKDVVVG KD KEVDNDFFLVVVKI DHQGPLSSTFP+ENR TQVT  ALK+HL R+ +
Sbjct: 301 MKKDVVVGVKDTKEVDNDFFLVVVKISDHQGPLSSTFPVENRNTQVTAMALKNHLERTKN 360

Query: 358 LPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           LP VKRISDFHLLL LAR LD+++DVPAL +CVQ Q+AVPEGY++LI SMA
Sbjct: 361 LPFVKRISDFHLLLVLARVLDVSADVPALTECVQTQSAVPEGYQILIESMA 411


>gi|15229360|ref|NP_191859.1| nuclear protein localization protein 4-like protein [Arabidopsis
           thaliana]
 gi|75181052|sp|Q9LYC2.1|NPL41_ARATH RecName: Full=NPL4-like protein 1
 gi|7573429|emb|CAB87745.1| putative protein [Arabidopsis thaliana]
 gi|14334782|gb|AAK59569.1| unknown protein [Arabidopsis thaliana]
 gi|15810647|gb|AAL07248.1| unknown protein [Arabidopsis thaliana]
 gi|110735118|gb|ABG89129.1| NPL4a [synthetic construct]
 gi|332646901|gb|AEE80422.1| nuclear protein localization protein 4-like protein [Arabidopsis
           thaliana]
          Length = 413

 Score =  647 bits (1670), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 302/410 (73%), Positives = 362/410 (88%), Gaps = 1/410 (0%)

Query: 2   LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQF 61
           +LR+RSRDGLERV+VDG H+TV++LKT+IQ QL+IP+ +QTLSTN+NLLLAKSPSD L F
Sbjct: 3   MLRVRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAKSPSDFLAF 62

Query: 62  TDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-PSFNPAGSFGRKMTMDDLIAKQMRVT 120
           TDMA+P+  +SSLN++HGS+V+L+Y+GER +RG P+  PAGSFGRKMT++DLIA+QMRV 
Sbjct: 63  TDMADPNLRISSLNLAHGSMVYLAYEGERTIRGGPAVTPAGSFGRKMTVEDLIARQMRVG 122

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQE  HC+SVSFDRDCA+AFQ +VNE+LAFAVKRGGFMYG V ED +VEVNFIYEPPQQG
Sbjct: 123 RQEKAHCDSVSFDRDCANAFQHFVNESLAFAVKRGGFMYGNVSEDGQVEVNFIYEPPQQG 182

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
            E+ L ++RD EEEK VDAIA GLGM++VGFIF QT+ Q+KK+YTLSN EVL A + HAE
Sbjct: 183 MEDNLILMRDSEEEKRVDAIALGLGMRRVGFIFNQTVTQDKKEYTLSNVEVLLAAQLHAE 242

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
             ++EWVTAVVKLE++E+G A +HFE FQMSDMCVRLFKEGWFETEI   DDPKLSK+KK
Sbjct: 243 SELKEWVTAVVKLEINEDGGADVHFEPFQMSDMCVRLFKEGWFETEIGPEDDPKLSKLKK 302

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           +VVVG KDVKEVDNDFFLV+VKILDHQGPLS TFPIENR TQ TMRALK+H+ R+ SLP 
Sbjct: 303 EVVVGVKDVKEVDNDFFLVLVKILDHQGPLSCTFPIENRNTQTTMRALKTHMERARSLPF 362

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
           VKRISDFHLLLF+A+FLD++SDVPALA+CV+ Q+ VPEGY+LLI+SMA T
Sbjct: 363 VKRISDFHLLLFVAQFLDVSSDVPALAECVRLQSHVPEGYELLIDSMANT 412


>gi|18407495|ref|NP_566118.1| NPL4-like protein 2 [Arabidopsis thaliana]
 gi|75100586|sp|O82264.1|NPL42_ARATH RecName: Full=NPL4-like protein 2
 gi|3738309|gb|AAC63651.1| expressed protein [Arabidopsis thaliana]
 gi|20197558|gb|AAM15128.1| expressed protein [Arabidopsis thaliana]
 gi|110735120|gb|ABG89130.1| NPL4b [synthetic construct]
 gi|330255825|gb|AEC10919.1| NPL4-like protein 2 [Arabidopsis thaliana]
          Length = 413

 Score =  645 bits (1665), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 300/411 (72%), Positives = 364/411 (88%), Gaps = 1/411 (0%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M+LRIRSRDGLERVT +GAH+TV++LKT+I  QL+IP+  QTLSTN++LLLAK+P+DLL 
Sbjct: 2   MMLRIRSRDGLERVTAEGAHITVSQLKTLIADQLQIPLHKQTLSTNRDLLLAKTPADLLA 61

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-PSFNPAGSFGRKMTMDDLIAKQMRV 119
           FTD+ +P+ PLSSLN+ HGS+++L+YDGER++ G P   PAGSFGRKMT+DDLIA+QMRV
Sbjct: 62  FTDLTDPNLPLSSLNLGHGSMLYLAYDGERSIPGAPPVTPAGSFGRKMTVDDLIARQMRV 121

Query: 120 TRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQ 179
           TRQE  HC+SVSFDRD A+AFQ YVNE+LAFAVKRGGFMYGTV E+ +VEV+FIYEPPQQ
Sbjct: 122 TRQETSHCDSVSFDRDAANAFQHYVNESLAFAVKRGGFMYGTVTEEGQVEVDFIYEPPQQ 181

Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHA 239
           GTE  L ++RD +EEK VDAIA GLGM++VGFIF QT++Q+K +YTLSN EVLQA E HA
Sbjct: 182 GTEANLILMRDADEEKRVDAIAMGLGMRRVGFIFNQTVVQDKTEYTLSNAEVLQAAELHA 241

Query: 240 ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK 299
           E  ++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKE WFETEI   DDPKLSKMK
Sbjct: 242 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEEWFETEIMPDDDPKLSKMK 301

Query: 300 KDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLP 359
           K+VVVG KD+KEVDNDFFLV+V+ILDHQGPLSSTFPIENR+++ TMRALK+HL+R+ SLP
Sbjct: 302 KEVVVGVKDLKEVDNDFFLVLVRILDHQGPLSSTFPIENRSSRATMRALKTHLDRAKSLP 361

Query: 360 LVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
           LVK++SDFHLLLF+A+FLD++SDVPALA+CV+ Q+ VPEGY LLI SMA T
Sbjct: 362 LVKKMSDFHLLLFVAQFLDVSSDVPALAECVRLQSPVPEGYALLIESMANT 412


>gi|297817634|ref|XP_002876700.1| NPL4 family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322538|gb|EFH52959.1| NPL4 family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 413

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 300/410 (73%), Positives = 362/410 (88%), Gaps = 1/410 (0%)

Query: 2   LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQF 61
           +LRIRSRDGLERV+VDG H+TV++LKT+IQ QL+IP+ +QTLSTN+NLLLAK+P+D L F
Sbjct: 3   MLRIRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAKTPADFLGF 62

Query: 62  TDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-PSFNPAGSFGRKMTMDDLIAKQMRVT 120
           TDMA+P+  +SSLN++HGS++FL+Y+GER +RG P+  PAGSFGRKMT++DLIA+QMRV 
Sbjct: 63  TDMADPNLRISSLNLAHGSMLFLAYEGERTIRGGPAVTPAGSFGRKMTVEDLIARQMRVG 122

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQE  HC+SVSFDRDCA+AFQ YVNE+LAFAVKRGGFMYG V E+ +VEVNFIYEPPQQG
Sbjct: 123 RQEKSHCDSVSFDRDCANAFQHYVNESLAFAVKRGGFMYGNVSEEGQVEVNFIYEPPQQG 182

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
            E+ L ++RD EEEK VDAIA GLGM++VGFIF QT+ Q+KK+YTLSN EVL A + HAE
Sbjct: 183 MEDNLILMRDSEEEKRVDAIALGLGMRRVGFIFNQTVTQDKKEYTLSNVEVLLAAQLHAE 242

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
             ++EWVTAVVKLE++E+G+A +HFEAFQMSDMCVRLFKEGWFETEI   DDPKLSK+KK
Sbjct: 243 SELKEWVTAVVKLEINEDGAADVHFEAFQMSDMCVRLFKEGWFETEIGSEDDPKLSKLKK 302

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           +VVVG KDVKEVDNDFFLV+VKILDHQ PLS TFPIENR  Q TMRALK+H+ R+ SLP 
Sbjct: 303 EVVVGVKDVKEVDNDFFLVLVKILDHQVPLSCTFPIENRNIQTTMRALKTHMERARSLPF 362

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
           VKRISDFHLLLF+A+FLD++SDVPALA+CV+ Q+ VPEGY+LLI+SMA T
Sbjct: 363 VKRISDFHLLLFVAQFLDVSSDVPALAECVRLQSHVPEGYELLIDSMANT 412


>gi|297824909|ref|XP_002880337.1| NPL4 family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326176|gb|EFH56596.1| NPL4 family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 412

 Score =  639 bits (1649), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 299/411 (72%), Positives = 363/411 (88%), Gaps = 1/411 (0%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M+LRIRSRDGLERVT +GA++TV++LKT+I+ QL+IP+  QTLSTN++LLLAK+P+DLL 
Sbjct: 1   MMLRIRSRDGLERVTAEGANITVSQLKTLIEDQLQIPLHKQTLSTNRDLLLAKTPADLLA 60

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-PSFNPAGSFGRKMTMDDLIAKQMRV 119
           FTD+ +P+ PLSSLN+ HGS++FL+YDGER++ G P   PAGSFGRKMT+DDLIA+QMRV
Sbjct: 61  FTDLTDPNLPLSSLNLGHGSMLFLAYDGERSIPGAPPVTPAGSFGRKMTVDDLIARQMRV 120

Query: 120 TRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQ 179
           TRQE  HC+SVSFDRDCA+AFQ YVN++LAFAVKRGGFMYGTV E  +VEV+FIYEPPQQ
Sbjct: 121 TRQETSHCDSVSFDRDCANAFQHYVNDSLAFAVKRGGFMYGTVTEGGQVEVDFIYEPPQQ 180

Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHA 239
           GTE  L ++RD +EEK VDAIA GLG++KVGFIF QT++Q+K  YTLSN EVLQA E HA
Sbjct: 181 GTEANLILMRDADEEKRVDAIAMGLGVRKVGFIFNQTVVQDKTGYTLSNAEVLQAAELHA 240

Query: 240 ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK 299
           E  ++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKE WFETEI   DDPKLSKMK
Sbjct: 241 EGELQEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEEWFETEIMPEDDPKLSKMK 300

Query: 300 KDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLP 359
           K+VVVG KD+KEVDNDFFLV+V+ILDHQGPLSSTFPIENR+++ TMR LK+HL+R+ SLP
Sbjct: 301 KEVVVGVKDLKEVDNDFFLVLVRILDHQGPLSSTFPIENRSSRATMRDLKTHLDRAKSLP 360

Query: 360 LVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
           LVK++SDFHLLLF+A+FLD++SDVPALA+CV+ Q+ VPEGY LLI SMA T
Sbjct: 361 LVKKMSDFHLLLFVAQFLDVSSDVPALAECVRLQSPVPEGYALLIESMANT 411


>gi|255559370|ref|XP_002520705.1| nuclear protein localization, putative [Ricinus communis]
 gi|223540090|gb|EEF41667.1| nuclear protein localization, putative [Ricinus communis]
          Length = 390

 Score =  622 bits (1605), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 300/410 (73%), Positives = 347/410 (84%), Gaps = 20/410 (4%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M++RIRSRDGLERV+V+  ++TV++LKT+IQ+QL IP+++QTLSTNQNLLLAKSPSD+L+
Sbjct: 1   MMIRIRSRDGLERVSVENPNITVSQLKTLIQNQLNIPIQNQTLSTNQNLLLAKSPSDVLK 60

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
           FTDM+NP+ PLSSLNI+HGSI+FL+YDGER + GP+ NPAGSFGRKMTMDDLIAKQMR+T
Sbjct: 61  FTDMSNPNTPLSSLNITHGSILFLTYDGERTIAGPAVNPAGSFGRKMTMDDLIAKQMRIT 120

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQENPHCESVSFDRDCA+AFQ      +   VK              +    IYEPPQQG
Sbjct: 121 RQENPHCESVSFDRDCANAFQH-----IPMMVK--------------LRFIIIYEPPQQG 161

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           TEE L +LRD EEEKLVDAIA GLGM++VGFIFTQTI Q+KKDYTLSNREVLQA E HAE
Sbjct: 162 TEENLMLLRDPEEEKLVDAIAVGLGMRRVGFIFTQTITQDKKDYTLSNREVLQAAELHAE 221

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
             ++EWVTA+VKLEV+E+G A +HFEAFQMSDMC+RLFKE WFE EIAE  DPKLSKMKK
Sbjct: 222 SELKEWVTAIVKLEVNEDGGADVHFEAFQMSDMCIRLFKEDWFEKEIAEDSDPKLSKMKK 281

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           DVVVGGKDV+EVDNDFFLVVVKILDH GPLSS+FPIENR  QVT RALKSHL+R  +L  
Sbjct: 282 DVVVGGKDVREVDNDFFLVVVKILDHLGPLSSSFPIENRINQVTKRALKSHLDRLKNLSF 341

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
           VKRISDFHLLLFL RFLD+ SDVPALA+CV  QTAVPEGY+L+I S+A +
Sbjct: 342 VKRISDFHLLLFLGRFLDV-SDVPALAECVLTQTAVPEGYQLIIESVANS 390


>gi|147777089|emb|CAN67841.1| hypothetical protein VITISV_016664 [Vitis vinifera]
          Length = 1616

 Score =  597 bits (1538), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 286/398 (71%), Positives = 334/398 (83%), Gaps = 16/398 (4%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           ML+RIRSRDGLER+T+D  H T+++LKT+IQ  L IP+ SQTLSTNQNLLLAK+PSD+ +
Sbjct: 1   MLIRIRSRDGLERLTLDNPHATISQLKTLIQQHLHIPISSQTLSTNQNLLLAKTPSDIAR 60

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
           FTDM++P  P+++L ++HGSIV+L++D +R V GP+F+PAGSFGR+MTMDDLIAKQMRVT
Sbjct: 61  FTDMSDPQTPIAALGVTHGSIVYLAHDTQRTVSGPTFSPAGSFGRRMTMDDLIAKQMRVT 120

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQE PHCESVSFDRD A+AFQ YVNETL FAVKRGGFMYGTV +D  V V+FIYEPPQQG
Sbjct: 121 RQETPHCESVSFDRDAANAFQHYVNETLVFAVKRGGFMYGTVADDGAVRVDFIYEPPQQG 180

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           TEE L ++RD +EE+LVDAIA GLGM +VGFIFTQTI QNKKDYT+SN EVLQA E H E
Sbjct: 181 TEENLILMRDTDEERLVDAIAMGLGMXRVGFIFTQTISQNKKDYTMSNSEVLQAAELHGE 240

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
             ++EWVTAVVKL   E G                RLFKEGWFET+I E  DPKLS+MKK
Sbjct: 241 SGLKEWVTAVVKL-YGERGR---------------RLFKEGWFETDIGEEVDPKLSRMKK 284

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           DVVVG KD +EVDNDFFLV+VKILDHQGPLSSTFPIENR TQVTMRALK+HL+R+ +LP 
Sbjct: 285 DVVVGVKDTREVDNDFFLVLVKILDHQGPLSSTFPIENRITQVTMRALKNHLDRAKNLPF 344

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPE 398
           VKRISDFHLLL L+RFLD+NSDVPALA+CVQ Q+AVPE
Sbjct: 345 VKRISDFHLLLLLSRFLDINSDVPALAECVQTQSAVPE 382


>gi|294461152|gb|ADE76140.1| unknown [Picea sitchensis]
          Length = 413

 Score =  574 bits (1480), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 275/410 (67%), Positives = 341/410 (83%), Gaps = 4/410 (0%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           ML+R+RSRDGLERV VD    TV E K++IQSQL++P+ SQ LSTNQNLLLAK  ++  Q
Sbjct: 1   MLIRVRSRDGLERVKVDRPSATVGEFKSLIQSQLQVPIPSQILSTNQNLLLAKG-NNTQQ 59

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGP--SFNPAGSFGRKMTMDDLIAKQMR 118
           F DM +P+ PLSSL I HG IVFLSY+GER++ GP  + NP+GSFG+KMTMDDLI++Q+R
Sbjct: 60  FKDMEDPNIPLSSLGIGHGHIVFLSYEGERSIAGPQINVNPSGSFGKKMTMDDLISRQVR 119

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQ 178
           + RQ+ PHC S SFDRD A+ FQQYVNE LAFA+KRGGFMYGTV E   V VNFIYE PQ
Sbjct: 120 IERQDKPHCGSASFDRDAANGFQQYVNENLAFAIKRGGFMYGTVGEGGDVAVNFIYEHPQ 179

Query: 179 QGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFH 238
           QGTE+ L++ RD++EEK V+AIA GLGM++VGFIFTQTI Q KKDYTLSN E+ QAVE H
Sbjct: 180 QGTEDSLFLFRDQDEEKRVEAIAIGLGMRRVGFIFTQTITQGKKDYTLSNAEIRQAVELH 239

Query: 239 AECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKM 298
           AE  ++EWVTA+VKLEV+EEG A +HFEAFQ+SD CV+LFKEGWF  +  +  D K++++
Sbjct: 240 AESELKEWVTALVKLEVNEEGGADVHFEAFQLSDQCVKLFKEGWFVDDCTDL-DLKITRL 298

Query: 299 KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSL 358
           KKDVV+ GKD ++VDNDFFLV+VKILDHQGPLSSTFPIENR + VT+RALK+HL+R+ +L
Sbjct: 299 KKDVVILGKDTRDVDNDFFLVLVKILDHQGPLSSTFPIENRASPVTLRALKNHLDRTKNL 358

Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
             VKRISDFHLLL LA++LD+N+DVP LA CVQ Q +VPEGY+++I+S+A
Sbjct: 359 SFVKRISDFHLLLLLAKYLDVNADVPQLAACVQVQASVPEGYQIIIDSLA 408


>gi|224131268|ref|XP_002321042.1| predicted protein [Populus trichocarpa]
 gi|222861815|gb|EEE99357.1| predicted protein [Populus trichocarpa]
          Length = 327

 Score =  552 bits (1422), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 255/326 (78%), Positives = 294/326 (90%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M++RIRSRDGLERV++D  ++T+++ KT+IQ+QL+IP+ +QTLSTNQNLLLAKSP DLLQ
Sbjct: 1   MMIRIRSRDGLERVSIDNPNITISQFKTLIQNQLQIPIHNQTLSTNQNLLLAKSPPDLLQ 60

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
           FTDM+NP   LSSLNISHGSI+FL+Y+G R + GP+ +PAGSFGRKMTMDDLIAKQMRV 
Sbjct: 61  FTDMSNPSTLLSSLNISHGSIIFLAYEGHRTIAGPAVHPAGSFGRKMTMDDLIAKQMRVG 120

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQENPHC+SVSFDRDCA+AFQ YVNETLAFAVKRGGFMYGTV E+ +VEVNFIYE PQQG
Sbjct: 121 RQENPHCDSVSFDRDCANAFQHYVNETLAFAVKRGGFMYGTVSEEGKVEVNFIYELPQQG 180

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           TEE+L +LRD +EEK+VDAIAAGLGM++VGFIF QTIM +KKDYTLSNREVLQA E HAE
Sbjct: 181 TEEILMLLRDSDEEKIVDAIAAGLGMRRVGFIFNQTIMHDKKDYTLSNREVLQAAELHAE 240

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
             ++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKEGWFETEI E  DPKLS MKK
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETEIGEDVDPKLSIMKK 300

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDH 326
           DVVVG KDVKEVDNDFFLVV+KILDH
Sbjct: 301 DVVVGSKDVKEVDNDFFLVVIKILDH 326


>gi|42571281|ref|NP_973714.1| NPL4-like protein 2 [Arabidopsis thaliana]
 gi|94442443|gb|ABF19009.1| At2g47970 [Arabidopsis thaliana]
 gi|330255824|gb|AEC10918.1| NPL4-like protein 2 [Arabidopsis thaliana]
          Length = 354

 Score =  514 bits (1324), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 242/342 (70%), Positives = 293/342 (85%), Gaps = 2/342 (0%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M+LRIRSRDGLERVT +GAH+TV++LKT+I  QL+IP+  QTLSTN++LLLAK+P+DLL 
Sbjct: 2   MMLRIRSRDGLERVTAEGAHITVSQLKTLIADQLQIPLHKQTLSTNRDLLLAKTPADLLA 61

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-PSFNPAGSFGRKMTMDDLIAKQMRV 119
           FTD+ +P+ PLSSLN+ HGS+++L+YDGER++ G P   PAGSFGRKMT+DDLIA+QMRV
Sbjct: 62  FTDLTDPNLPLSSLNLGHGSMLYLAYDGERSIPGAPPVTPAGSFGRKMTVDDLIARQMRV 121

Query: 120 TRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQ 179
           TRQE  HC+SVSFDRD A+AFQ YVNE+LAFAVKRGGFMYGTV E+ +VEV+FIYEPPQQ
Sbjct: 122 TRQETSHCDSVSFDRDAANAFQHYVNESLAFAVKRGGFMYGTVTEEGQVEVDFIYEPPQQ 181

Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHA 239
           GTE  L ++RD +EEK VDAIA GLGM++VGFIF QT++Q+K +YTLSN EVLQA E HA
Sbjct: 182 GTEANLILMRDADEEKRVDAIAMGLGMRRVGFIFNQTVVQDKTEYTLSNAEVLQAAELHA 241

Query: 240 ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK 299
           E  ++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKE WFETEI   DDPKLSKMK
Sbjct: 242 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEEWFETEIMPDDDPKLSKMK 301

Query: 300 KDVVVGGKDVKEVDNDFFLVVVKILDHQ-GPLSSTFPIENRT 340
           K+VVVG KD+KEVDNDFFLV+V+ILDHQ     ST P+   +
Sbjct: 302 KEVVVGVKDLKEVDNDFFLVLVRILDHQVRVFLSTLPLHQSS 343


>gi|168021566|ref|XP_001763312.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685447|gb|EDQ71842.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 409

 Score =  497 bits (1280), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 238/410 (58%), Positives = 315/410 (76%), Gaps = 2/410 (0%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           ML+R+RSRDGLERVT+   + TV +L+  I   L +PV SQ LS NQ LLL+K P +  +
Sbjct: 1   MLIRLRSRDGLERVTL-ANNATVGDLQAAIAENLNVPVSSQLLSRNQALLLSKDPVNGKE 59

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
           F DM +    LSSL I+HGS+++L Y G R V+GP   P G+FGRKMT++DLIA+Q+R+ 
Sbjct: 60  FADMQDSGATLSSLGIAHGSLLYLFYSGNRVVKGPQVTPFGAFGRKMTVEDLIARQIRIE 119

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQE  HC ++SFDR+ A+AFQ YV+E+LAF++KRGG MYG+V E   ++V+ IYEPPQQG
Sbjct: 120 RQEKSHCSALSFDREAANAFQLYVHESLAFSIKRGGIMYGSVNEAGEIKVDVIYEPPQQG 179

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           +EE L ++RD +EEK  D IAAGLG++KVGFIFTQTI QN+  YTLSN E+ Q  EFHAE
Sbjct: 180 SEEGLLLMRDMDEEKRADVIAAGLGLRKVGFIFTQTITQNQGGYTLSNAEIRQTAEFHAE 239

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
            + E+W TA+VKLEV+E+G A +HFEAFQ+SD CV+LFK+G FE  + +  DPKL++MKK
Sbjct: 240 SDFEQWATAIVKLEVNEDGGADVHFEAFQLSDQCVKLFKDGLFEENVPDL-DPKLTRMKK 298

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           +V+V  KD  +VDNDFFLV VKILDH+GP+++TFPIENR    T++ALK+HL R  + P 
Sbjct: 299 EVIVLQKDTFDVDNDFFLVPVKILDHEGPMTTTFPIENRAPLATLQALKTHLERHRNKPY 358

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
           VKRI+DFHLLL L+ FLD+N+DVP L + ++ +  VPEGY+L+I SMA +
Sbjct: 359 VKRIADFHLLLLLSNFLDINTDVPQLTEAIRTEGPVPEGYQLIIESMASS 408


>gi|302814770|ref|XP_002989068.1| hypothetical protein SELMODRAFT_129073 [Selaginella moellendorffii]
 gi|300143169|gb|EFJ09862.1| hypothetical protein SELMODRAFT_129073 [Selaginella moellendorffii]
          Length = 411

 Score =  490 bits (1261), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 236/408 (57%), Positives = 308/408 (75%), Gaps = 7/408 (1%)

Query: 2   LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDL-LQ 60
           ++RIRSRDGLERV+V     TV EL+  I+ +L++PV +Q +S NQ+LL+AK PS   L 
Sbjct: 5   VVRIRSRDGLERVSVH-PKATVLELQMAIERELQVPVSAQYVSRNQSLLMAKDPSGAALM 63

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
             DM +P   LSSL I HGS+VFL Y G+R V GP   P+G+FG+KMTMDDLIA+Q RV 
Sbjct: 64  AADMRDPSAALSSLGIGHGSVVFLFYSGKREVAGPKVTPSGAFGKKMTMDDLIARQTRVE 123

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQ++ HC S+SFDR  A+AFQ YV+ETLAF VKRGGFMYG+V E     V+FIYEPPQQG
Sbjct: 124 RQQHAHCNSLSFDRAAANAFQHYVHETLAFGVKRGGFMYGSVTEQGDALVHFIYEPPQQG 183

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           +E+ L +LRD +EE+ VD IAAGLGM+++GFIFT TI Q K DYTLSN E+ QA + HAE
Sbjct: 184 SEDGLLLLRDPDEERRVDGIAAGLGMRRLGFIFTHTISQGKADYTLSNAEIRQAAQLHAE 243

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
            N + W TA+VKLE + +    + FE+FQ+SD C++L+KEGWF  + AE  +PKLS+M K
Sbjct: 244 SNFDAWTTALVKLEANGD----VQFESFQLSDQCIKLWKEGWF-VQDAEDVNPKLSRMNK 298

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           +VV+ GKD ++VDN  FLV VKILDHQGPLS  FPIENR T  +  +L+SHL+ + S P 
Sbjct: 299 EVVILGKDTRDVDNVLFLVPVKILDHQGPLSCNFPIENRLTLPSFESLRSHLDSNKSRPY 358

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           V+RISDFHLLLFL++ LD+++D+P L + +  Q +VP+GY+LLI+S+A
Sbjct: 359 VQRISDFHLLLFLSKSLDIDTDIPRLTEAISTQGSVPDGYQLLIDSLA 406


>gi|302824858|ref|XP_002994068.1| hypothetical protein SELMODRAFT_187743 [Selaginella moellendorffii]
 gi|300138074|gb|EFJ04855.1| hypothetical protein SELMODRAFT_187743 [Selaginella moellendorffii]
          Length = 411

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 235/408 (57%), Positives = 307/408 (75%), Gaps = 7/408 (1%)

Query: 2   LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDL-LQ 60
           ++RIRSRDGLERV+V     TV EL+  I+ +L++PV +Q +S NQ+LL+AK PS   L 
Sbjct: 5   VVRIRSRDGLERVSVH-PKATVLELQMAIERELQVPVSAQYVSRNQSLLMAKDPSGAALM 63

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
             DM +P   LSSL I HGS+VFL Y G+R V GP   P+G+FG+KMTMDDLIA+Q RV 
Sbjct: 64  AADMRDPSAALSSLGIGHGSVVFLFYSGKREVAGPKVTPSGAFGKKMTMDDLIARQTRVE 123

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQ++ HC S+SFDR  A+AFQ YV+ETLAF VKRGGFMYG+V E     V+FIYEPPQQG
Sbjct: 124 RQQHAHCNSLSFDRAAANAFQHYVHETLAFGVKRGGFMYGSVTEQGDALVHFIYEPPQQG 183

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           +E+ L +LRD +EE+  D IAAGLGM+++GFIFT TI Q K DYTLSN E+ QA + HAE
Sbjct: 184 SEDGLLLLRDPDEERRADGIAAGLGMRRLGFIFTHTISQGKADYTLSNAEIRQAAQLHAE 243

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
            N + W TA+VKLE + +    + FE+FQ+SD C++L+KEGWF  + AE  +PKLS+M K
Sbjct: 244 SNFDAWTTALVKLEANGD----VQFESFQLSDQCIKLWKEGWF-VQDAEDVNPKLSRMNK 298

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
           +VV+ GKD ++VDN  FLV VKILDHQGPLS  FPIENR T  +  +L+SHL+ + S P 
Sbjct: 299 EVVILGKDTRDVDNVLFLVPVKILDHQGPLSCNFPIENRLTLPSFESLRSHLDSNKSRPY 358

Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           V+RISDFHLLLFL++ LD+++D+P L + +  Q +VP+GY+LLI+S+A
Sbjct: 359 VQRISDFHLLLFLSKSLDIDTDIPRLTEAISTQGSVPDGYQLLIDSLA 406


>gi|115436652|ref|NP_001043084.1| Os01g0377700 [Oryza sativa Japonica Group]
 gi|75167666|sp|Q9AS33.1|NPL4_ORYSJ RecName: Full=NPL4-like protein
 gi|13365982|dbj|BAB39260.1| NPL4 -like protein [Oryza sativa Japonica Group]
 gi|113532615|dbj|BAF04998.1| Os01g0377700 [Oryza sativa Japonica Group]
 gi|125526006|gb|EAY74120.1| hypothetical protein OsI_02003 [Oryza sativa Indica Group]
          Length = 415

 Score =  479 bits (1234), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 247/417 (59%), Positives = 307/417 (73%), Gaps = 13/417 (3%)

Query: 1   MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDG +R+TV D A  TV +L+ +I +++ +PV  Q LS +  LLL  S S  L
Sbjct: 1   MILRIRSRDGTDRITVPDPAAATVGDLQRLIAARVTVPVPLQRLSLDPALLLPSSASAAL 60

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGP----SFNPAGSFGRKMTMDDLIAK 115
                A     LSSL +S+GS V+LSY  +     P    + + AGSFG+KMTMDDLIA+
Sbjct: 61  LADPAAP----LSSLRLSNGSFVYLSYPPDARSSQPPPPKALSAAGSFGKKMTMDDLIAR 116

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLED-KRVEVNFIY 174
           Q+RVTRQE P C + SFDRD A+AFQ +V E+LAFA KR GF+YG V  D K V V+FIY
Sbjct: 117 QIRVTRQEAPLCAAASFDRDSANAFQLHVAESLAFATKRAGFLYGRVDADTKEVFVDFIY 176

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
           EPPQ GTE+V+ ++RD +EE  VDAIA GLGM++VG +FTQ + +   D   YT+SNREV
Sbjct: 177 EPPQVGTEDVVQLMRDAQEEARVDAIAHGLGMRRVGLVFTQAVGRKTSDTGEYTMSNREV 236

Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
           LQA E  AE  + EWVTA+VKLEV ++GS  +HFEAFQMS++CV+LFK+G  ETEI + D
Sbjct: 237 LQATELQAEGGIPEWVTAIVKLEVGDDGSGDVHFEAFQMSEICVKLFKDGVLETEIGDKD 296

Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
           DP+LSKM+K+VV GGKD  EVDNDFFLV VKI DHQGPLS+ FPIENR   V M ALKSH
Sbjct: 297 DPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQGPLSTGFPIENRGNPVAMSALKSH 356

Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           L+R+  LP VKRISDFHLLL +A FLD+ +DVPAL  CV+ Q+ VPEGY+LLI S+A
Sbjct: 357 LDRAKHLPFVKRISDFHLLLLVAAFLDIKADVPALTACVKNQSVVPEGYQLLIESLA 413


>gi|242059373|ref|XP_002458832.1| hypothetical protein SORBIDRAFT_03g041070 [Sorghum bicolor]
 gi|241930807|gb|EES03952.1| hypothetical protein SORBIDRAFT_03g041070 [Sorghum bicolor]
          Length = 417

 Score =  479 bits (1233), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 239/417 (57%), Positives = 313/417 (75%), Gaps = 12/417 (2%)

Query: 1   MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDG +R+TV D A  TVA+L+ +I+S L +PV  Q LS +  LLL  +PS  +
Sbjct: 1   MILRIRSRDGTDRITVPDSASATVADLQRLIESHLTVPVPLQRLSLDPALLL-PNPSAAV 59

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGE-RNVRGP---SFNPAGSFGRKMTMDDLIAK 115
               +A+P  PL+SL +++G+ V+L+Y  + R+ R P   + + AGSFG+KMTMDDLIA+
Sbjct: 60  PL--LADPATPLASLRLANGAFVYLAYPPDARSARPPPPKALSVAGSFGKKMTMDDLIAR 117

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
           Q+RVTRQEN  C + SFDRD A+AFQ YV E+LAFAVKR GF+YG V  E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAESLAFAVKRAGFLYGRVDAETKEVFVDFIY 177

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
           EPPQQG+E+V++++RD EEE  VD IA GLGM++VG +FTQ + +   +   YT+SNREV
Sbjct: 178 EPPQQGSEDVVHLMRDPEEEARVDTIAEGLGMRRVGLVFTQAVGRKASETGEYTMSNREV 237

Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
           +QA +  AE  + EWVTA+VKLEV ++G+  +HFEAFQMS++CV+LFK+G  ETE+ + D
Sbjct: 238 VQAAQLQAEGGIPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297

Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
           DP+LSKM+K+VV GGKD  EVDNDFFLV VKI DHQGPLS  FPIENR + + + AL+SH
Sbjct: 298 DPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQGPLSVGFPIENRGSPLGISALRSH 357

Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           L R   L  V+RISDFHLLL LA FLD+ +D+P +A CV+ Q+ VPEGY+ LI S A
Sbjct: 358 LERMKHLTFVRRISDFHLLLKLATFLDVKADMPTIAACVKTQSRVPEGYQFLIESFA 414


>gi|212722814|ref|NP_001132660.1| NPL4 family [Zea mays]
 gi|194695028|gb|ACF81598.1| unknown [Zea mays]
 gi|195622926|gb|ACG33293.1| NPL4 family protein [Zea mays]
 gi|219888319|gb|ACL54534.1| unknown [Zea mays]
 gi|413951803|gb|AFW84452.1| NPL4 family [Zea mays]
          Length = 417

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 240/417 (57%), Positives = 309/417 (74%), Gaps = 12/417 (2%)

Query: 1   MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDG +R+TV D A VTVA+L+ +I+S L +PV  Q LS +  LLL    + + 
Sbjct: 1   MILRIRSRDGTDRITVPDSASVTVADLQRLIESHLTVPVPLQRLSLDPALLLPNPSAVVP 60

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGE-RNVRGP---SFNPAGSFGRKMTMDDLIAK 115
             TD A P   L+SL +++G+ V+L+Y  + R+ R P   + + AGSFG+KMTMDDLIA+
Sbjct: 61  LLTDSAAP---LASLRLANGAFVYLAYPPDARSARPPPPKALSAAGSFGKKMTMDDLIAR 117

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
           Q+RVTRQEN  C + SFDRD A+AFQ YV E L F VKR GF+YG V  E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAEFLGFGVKRAGFLYGRVDAETKDVFVDFIY 177

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
           EPPQQG+E+V+++LRD +EE  VD IA GLGM++VG +FTQ + +   +   YT+SNREV
Sbjct: 178 EPPQQGSEDVVHLLRDPDEEARVDTIAEGLGMRRVGLVFTQAVGRKASETGEYTMSNREV 237

Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
           +QA +  AE  + EWVTA+VKLEV ++G+  +HFEAFQMS++CV+LFK+G  ETE+ + D
Sbjct: 238 VQAAQLQAEGGIPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297

Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
           DP+LSKM+K+VV GGKD  EVDNDFFLV VKI DHQGPLS  FPIENR   V++ AL+SH
Sbjct: 298 DPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQGPLSVGFPIENRGNPVSISALRSH 357

Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           L R   L  VKRISDFHLLL LA FLD+ +D+P +A CV+ Q+ VPEGY+LLI S A
Sbjct: 358 LERMKHLAFVKRISDFHLLLKLATFLDVKADMPTIAACVKTQSRVPEGYQLLIESFA 414


>gi|326528657|dbj|BAJ97350.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 415

 Score =  476 bits (1226), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 239/417 (57%), Positives = 313/417 (75%), Gaps = 14/417 (3%)

Query: 1   MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDG +R+TV D A  TV +L+ +I+++  +PV +Q LS +  LLL        
Sbjct: 1   MILRIRSRDGTDRITVPDAAAATVGDLQRLIEARATVPVAAQRLSLDPALLLPAP----- 55

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYD-GERNVRGP---SFNPAGSFGRKMTMDDLIAK 115
           Q   +++P  PL+SL +++GS V+LSY  G R+ + P   +   AGSFG+KMTMDDLIA+
Sbjct: 56  QPPLLSDPAAPLASLPLANGSFVYLSYPPGARSAQPPPPRALTAAGSFGKKMTMDDLIAR 115

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
           Q+RVTRQE P C + SFDRD A+AFQ +V+E+LAFAVKR GF+YG V  E K V V+FIY
Sbjct: 116 QIRVTRQEAPFCSAASFDRDSANAFQLHVSESLAFAVKRAGFLYGRVDAETKEVLVDFIY 175

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
           EPPQQG+ +V+ ++RD +EE  VDAIA GLGM++VGF+FTQ + +   +   YTLSN EV
Sbjct: 176 EPPQQGSSDVVQLMRDADEEARVDAIADGLGMRRVGFVFTQAVGRKASETGEYTLSNSEV 235

Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
           +QA +  AE  + EWVTAVVKLEV E+GSA +HFEAFQMS++CV+LFK+G  ETE+ + D
Sbjct: 236 VQAAQLQAEGGIPEWVTAVVKLEVGEDGSADVHFEAFQMSEVCVKLFKDGVLETEVKDSD 295

Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
           DP+LSKM+K+V+ GGKD  EVDNDFFLV VKI DHQGPLS+ FP+ENR   +   AL+SH
Sbjct: 296 DPRLSKMRKEVIAGGKDTMEVDNDFFLVPVKISDHQGPLSTGFPVENRGIPLPASALRSH 355

Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           ++R+  LP VKRISDFHLLL +A FLD+ +DVP LA CV  Q  VPEGY+LLI+++A
Sbjct: 356 MDRAKHLPFVKRISDFHLLLQIAAFLDVKADVPTLAACVGHQAEVPEGYRLLIDALA 412


>gi|222618478|gb|EEE54610.1| hypothetical protein OsJ_01845 [Oryza sativa Japonica Group]
          Length = 415

 Score =  475 bits (1223), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 245/417 (58%), Positives = 307/417 (73%), Gaps = 13/417 (3%)

Query: 1   MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDG +R+TV D A  TV +L+ +I +++ +PV  Q LS +  LLL  S S  L
Sbjct: 1   MILRIRSRDGTDRITVPDPAAATVGDLQRLIAARVTVPVPLQRLSLDPALLLPSSASAAL 60

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGP----SFNPAGSFGRKMTMDDLIAK 115
                A     LSSL +S+GS V+LSY  +     P    + + AGSFG+KMTMDDLIA+
Sbjct: 61  LADPAAP----LSSLRLSNGSFVYLSYPPDARSSQPPPPKALSAAGSFGKKMTMDDLIAR 116

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLED-KRVEVNFIY 174
           Q+RVTRQE P C + SFDRD A+AFQ +V E+LAFA KR GF+YG V  D K V V+FIY
Sbjct: 117 QIRVTRQEAPLCAAASFDRDSANAFQLHVAESLAFATKRAGFLYGRVDADTKEVFVDFIY 176

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
           EPPQ GTE+V+ ++RD +EE  VDAIA GLGM++VG +FTQ + +   D   YT+SNREV
Sbjct: 177 EPPQVGTEDVVQLMRDAQEEARVDAIAHGLGMRRVGLVFTQAVGRKTSDTGEYTMSNREV 236

Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
           LQA E  AE  + EWVTA+VKLEV ++GS  +HFEAFQMS++CV+LFK+G  ETEI + D
Sbjct: 237 LQATELQAEGGIPEWVTAIVKLEVGDDGSGDVHFEAFQMSEICVKLFKDGVLETEIGDKD 296

Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
           DP+LSKM+K+VV GGKD  EVDNDFFLV VKI DHQGPLS+ FPIENR   V M ALKSH
Sbjct: 297 DPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQGPLSTGFPIENRGNPVAMSALKSH 356

Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           L+R+  LP VKRIS+F+LLL +A FLD+ +DVPAL  CV+ Q+ VPEGY+LLI S+A
Sbjct: 357 LDRAKHLPFVKRISNFNLLLLVAAFLDIKADVPALTACVKNQSVVPEGYQLLIESLA 413


>gi|357132177|ref|XP_003567708.1| PREDICTED: NPL4-like protein-like [Brachypodium distachyon]
          Length = 415

 Score =  473 bits (1217), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 232/417 (55%), Positives = 308/417 (73%), Gaps = 14/417 (3%)

Query: 1   MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDG +R+TV DGA  TV EL+ +I +++ +PV  Q LS +  LLL  +   LL
Sbjct: 1   MILRIRSRDGTDRITVPDGASATVGELQRLIAARVTVPVTLQRLSLDPALLLPAAQPQLL 60

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGP----SFNPAGSFGRKMTMDDLIAK 115
                ++P  PL+SL +++GS V+LSY  +     P    +   AGSFG+KMTMDDLIA+
Sbjct: 61  -----SDPATPLASLPLANGSFVYLSYPPDARSSQPPPPKALTVAGSFGKKMTMDDLIAR 115

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
           Q+RVTRQE P C + SFDRD A+AFQ YV E+LAFAVKR GF+YG V  E K V V+FIY
Sbjct: 116 QIRVTRQEAPLCAAASFDRDSANAFQLYVAESLAFAVKRAGFLYGRVDAETKEVFVDFIY 175

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
           EPPQ G+E+V+ ++RD +EE  VDAIA GLGM++VG +FTQ + +   +   YT+SNREV
Sbjct: 176 EPPQSGSEDVVQLMRDSDEELRVDAIADGLGMRRVGLVFTQAVGRKASETGEYTMSNREV 235

Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
           +QA E  AE  + EWVTA+VK++V ++G+  +HFEAFQ+S++CV+LFK+G  +TE+ + D
Sbjct: 236 VQAAELQAEGGIPEWVTAIVKIKVGDDGTGDVHFEAFQLSEVCVKLFKDGVLQTEVGDSD 295

Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
           DP+LSKM+K+V+ GGKD  EVDNDFFLV VKI DHQGPLS  FPIENR + + M  L++H
Sbjct: 296 DPRLSKMRKEVIAGGKDTMEVDNDFFLVPVKISDHQGPLSIGFPIENRGSPLAMSTLRNH 355

Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           L+R+  LP V+RISDFHLLL +  FLD+ +DVP LA CV+ Q  VPEGY+LLI ++A
Sbjct: 356 LDRAKHLPFVRRISDFHLLLQVTTFLDVKADVPILAACVKHQVKVPEGYQLLIETLA 412


>gi|414879508|tpg|DAA56639.1| TPA: NPL4 family protein [Zea mays]
          Length = 417

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 233/417 (55%), Positives = 306/417 (73%), Gaps = 12/417 (2%)

Query: 1   MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDG +R++V D A  TVA+L+ +I+S L +PV  Q +S +  LLL  +PS  +
Sbjct: 1   MILRIRSRDGTDRISVPDSASATVADLQRLIESHLTVPVPLQRISLDPALLL-PNPSAAV 59

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPS----FNPAGSFGRKMTMDDLIAK 115
               +A+P  PL+SL +++G+ ++L+Y  +     P+     + AGSFG+KMTMDDLIA+
Sbjct: 60  PL--LADPAAPLASLRLANGAFIYLAYPPDARSARPTPPKVLSAAGSFGKKMTMDDLIAR 117

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
           Q+RVTRQEN  C + SFDRD A+AFQ YV E LAFAVKR GF+YG +  E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAEYLAFAVKRAGFLYGRIDAETKEVLVDFIY 177

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREV 231
           EPPQQG E+V++++RD +EE  VD IA GLGM++VG +FTQ + +      +YT+SNREV
Sbjct: 178 EPPQQGFEDVVHLMRDPDEEARVDTIAEGLGMRRVGLVFTQAVGRRASKTGEYTMSNREV 237

Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
           +QA +  AE    EWVTA+VKLEV ++G+  +HFEAFQMS++CV+LFK+G  ETE+ + D
Sbjct: 238 VQAAQLQAEGGTPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297

Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
           DP+LSKM+K+VV GGKD  +VDNDFFLV VKI DHQGPLS  FPIENR + V M AL+SH
Sbjct: 298 DPRLSKMRKEVVAGGKDTMKVDNDFFLVPVKISDHQGPLSVGFPIENRGSPVGMSALRSH 357

Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           L R   L  V+RISDFHLLL LA FLD  SD+P +A CV+ Q+ +PEGY+ LI S A
Sbjct: 358 LERMKHLTFVRRISDFHLLLKLATFLDAKSDMPTIAACVKTQSRMPEGYQFLIESFA 414


>gi|226529197|ref|NP_001141001.1| uncharacterized protein LOC100273080 [Zea mays]
 gi|194702118|gb|ACF85143.1| unknown [Zea mays]
          Length = 417

 Score =  468 bits (1205), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 233/417 (55%), Positives = 306/417 (73%), Gaps = 12/417 (2%)

Query: 1   MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDG +R++V D A  TVA+L+ +I+S L +PV  Q +S +  LLL  +PS  +
Sbjct: 1   MILRIRSRDGTDRISVPDSASATVADLQRLIESHLTVPVPLQRISLDPALLL-PNPSAAV 59

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPS----FNPAGSFGRKMTMDDLIAK 115
               +A+P  PL+SL +++G+ ++L+Y  +     P+     + AGSFG+KMTMDDLIA+
Sbjct: 60  PL--LADPAAPLASLRLANGAFIYLAYPPDARSARPTPPKVLSAAGSFGKKMTMDDLIAR 117

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
           Q+RVTRQEN  C + SFDRD A+AFQ YV E LAFAVKR GF+YG +  E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAEYLAFAVKRAGFLYGRIDAETKEVLVDFIY 177

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREV 231
           EPPQQG E+V++++RD +EE  VD IA GLGM++VG +FTQ + +      +YT+SNREV
Sbjct: 178 EPPQQGFEDVVHLMRDPDEEARVDTIAEGLGMRRVGLVFTQAVGRRASKTGEYTMSNREV 237

Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
           +QA +  AE    EWVTA+VKLEV ++G+  +HFEAFQMS++CV+LFK+G  ETE+ + D
Sbjct: 238 VQAAQLQAEGGTPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297

Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
           DP+LSKM+K+VV GGKD  +VDNDFFLV VKI DHQGPLS  FPIENR + V M AL+SH
Sbjct: 298 DPRLSKMRKEVVAGGKDTMKVDNDFFLVPVKISDHQGPLSVGFPIENRGSPVGMSALRSH 357

Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           L R   L  V+RISDFHLLL LA FLD  SD+P +A CV+ Q+ +PEGY+ LI S A
Sbjct: 358 LERMKHLTFVRRISDFHLLLKLATFLDAKSDMPTIAVCVKTQSRMPEGYQFLIESFA 414


>gi|195636888|gb|ACG37912.1| NPL4 family protein [Zea mays]
          Length = 417

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 232/417 (55%), Positives = 307/417 (73%), Gaps = 12/417 (2%)

Query: 1   MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDG +R++V D A  TVA+L+ +I+S+L +PV  Q +S +  LLL  +PS  +
Sbjct: 1   MILRIRSRDGTDRISVPDSASATVADLQRLIESRLTVPVPLQRISLDPALLL-PNPSAAV 59

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPS----FNPAGSFGRKMTMDDLIAK 115
               +A+P  PL+SL +++G+ ++L+Y  +     P+     + AGSFG+KMTMDDLIA+
Sbjct: 60  PL--LADPAAPLASLRLANGAFIYLAYPPDARSARPTPPKVLSAAGSFGKKMTMDDLIAR 117

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
           Q+RVTRQEN  C + SFDRD A+AFQ YV E LAFAVKR GF+YG +  E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAEYLAFAVKRAGFLYGRIDAETKEVLVDFIY 177

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREV 231
           EPPQQG E+V++++RD +EE  VD IA GLGM++VG +FTQ + +      +YT+SNREV
Sbjct: 178 EPPQQGFEDVVHLMRDPDEEARVDTIAEGLGMRRVGLVFTQAVGRRASKTGEYTMSNREV 237

Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
           +QA +  AE    EWVTA+VKLEV ++G+  +HFEAFQMS++CV+LFK+G  ETE+ + D
Sbjct: 238 VQAAQLQAEGGTPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297

Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
           DP+LSKM+K+VV GGKD  +VDNDFFLV VKI DHQGPLS  FPIENR + V M AL+SH
Sbjct: 298 DPRLSKMRKEVVAGGKDTMKVDNDFFLVPVKISDHQGPLSVGFPIENRGSPVGMSALRSH 357

Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           L R   L  V+RISDFHLLL LA FLD  S++P +A CV+ Q+ +PEGY+ LI S A
Sbjct: 358 LERMKHLTFVRRISDFHLLLKLATFLDAKSEMPTIAACVKTQSRMPEGYQFLIESFA 414


>gi|356514356|ref|XP_003525872.1| PREDICTED: LOW QUALITY PROTEIN: NPL4-like protein 2-like [Glycine
           max]
          Length = 359

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 190/296 (64%), Positives = 234/296 (79%), Gaps = 1/296 (0%)

Query: 2   LLRIRSRDGLERVTVDGAH-VTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           ++ I S D L+ V+++  H  TV +LK II++QL IP+ +QTLSTNQNLLL KS  DL  
Sbjct: 1   MVIIHSHDNLKXVSIENRHRTTVLDLKQIIETQLGIPIHNQTLSTNQNLLLXKSLKDLHX 60

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
           F DMANPD PL +LN+SH SIVFL+Y+GER V  P+FNP GSFGRKMT+D LIAK M  T
Sbjct: 61  FIDMANPDAPLXALNLSHDSIVFLAYEGERRVTDPTFNPTGSFGRKMTVDHLIAKXMCTT 120

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
           RQENPHCE VSFDRD A+AFQ  VN+TLAFAVK   FMY  VL+ ++V+V+FIY+P QQG
Sbjct: 121 RQENPHCELVSFDRDYANAFQLCVNDTLAFAVKHDEFMYEIVLKVRKVKVDFIYKPHQQG 180

Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
           +++ L   RD +EEK V+AI  GLGM +VGFIF QTI Q+KKDYTLSNREVLQAVE+H++
Sbjct: 181 SKDNLVFFRDPKEEKFVEAIMXGLGMMEVGFIFMQTITQDKKDYTLSNREVLQAVEYHSD 240

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLS 296
             ++EWVT V+KL+V+ E    +HFEAFQ+SD+CVRLFKEGWF+TEI E DDPKLS
Sbjct: 241 SRLKEWVTIVIKLKVNGEMGVDVHFEAFQISDVCVRLFKEGWFQTEIKEDDDPKLS 296


>gi|414879509|tpg|DAA56640.1| TPA: hypothetical protein ZEAMMB73_509597 [Zea mays]
          Length = 358

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 184/336 (54%), Positives = 248/336 (73%), Gaps = 12/336 (3%)

Query: 1   MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           M+LRIRSRDG +R++V D A  TVA+L+ +I+S L +PV  Q +S +  LLL  +PS  +
Sbjct: 1   MILRIRSRDGTDRISVPDSASATVADLQRLIESHLTVPVPLQRISLDPALLL-PNPSAAV 59

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPS----FNPAGSFGRKMTMDDLIAK 115
               +A+P  PL+SL +++G+ ++L+Y  +     P+     + AGSFG+KMTMDDLIA+
Sbjct: 60  PL--LADPAAPLASLRLANGAFIYLAYPPDARSARPTPPKVLSAAGSFGKKMTMDDLIAR 117

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
           Q+RVTRQEN  C + SFDRD A+AFQ YV E LAFAVKR GF+YG +  E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAEYLAFAVKRAGFLYGRIDAETKEVLVDFIY 177

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREV 231
           EPPQQG E+V++++RD +EE  VD IA GLGM++VG +FTQ + +      +YT+SNREV
Sbjct: 178 EPPQQGFEDVVHLMRDPDEEARVDTIAEGLGMRRVGLVFTQAVGRRASKTGEYTMSNREV 237

Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
           +QA +  AE    EWVTA+VKLEV ++G+  +HFEAFQMS++CV+LFK+G  ETE+ + D
Sbjct: 238 VQAAQLQAEGGTPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297

Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
           DP+LSKM+K+VV GGKD  +VDNDFFLV VKI DHQ
Sbjct: 298 DPRLSKMRKEVVAGGKDTMKVDNDFFLVPVKISDHQ 333


>gi|384245338|gb|EIE18832.1| NPL4-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 412

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 196/418 (46%), Positives = 267/418 (63%), Gaps = 16/418 (3%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           ML+R+RSRDGLER+ +   H TVAELK  I S L IPVE  TLS N++LL A   S    
Sbjct: 1   MLIRLRSRDGLERIQLP-EHATVAELKKQINSNLSIPVEDITLSRNKDLLTALDKSG--- 56

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPA---GSFGRKMTMDDLIAKQM 117
           F D++ P  PL  L I HG +VF+ Y  ER V       A    +FG  MT+D+L+AKQ 
Sbjct: 57  FEDLSRPGTPLVKLGIQHGDVVFMHYPFEREVASTVPRTAYENRAFGTHMTVDELVAKQT 116

Query: 118 RVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPP 177
           R+ RQE PH  S SF+R  ADAFQ YV   +AF++KRGG +YG+V E+ +V V  IYEPP
Sbjct: 117 RIERQEAPHAASASFERHAADAFQSYVQGAIAFSIKRGGLLYGSVDEEGKVTVEAIYEPP 176

Query: 178 QQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF 237
           Q G+ E L + R  EEE+  D +A  LG++KVG+IF Q+    ++++ +S  EV Q    
Sbjct: 177 QSGSAESLQLERSTEEEERADFVAKQLGLQKVGWIFAQST--KEREFIMSAEEVCQMAAV 234

Query: 238 HAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFE--TEIAEGD---D 292
            AE   E  VTAVV + V+EEG+  IHFEAFQ+SD  VRL++EGWF   TE +      +
Sbjct: 235 QAELG-ERAVTAVVSMTVAEEGTPEIHFEAFQVSDQTVRLWQEGWFTNGTEFSGVSILRN 293

Query: 293 PKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHL 352
           PK    K  V+V GKD  E DND+ L+ V ILDH+GPL+++FP+ENR        L+ HL
Sbjct: 294 PKEPDHKAPVIVAGKDQGEADNDYLLIPVNILDHEGPLTASFPVENRLLPQGKAELRHHL 353

Query: 353 NRSPSLPLVKRISDFHLLLFLARFLDLN-SDVPALAQCVQAQTAVPEGYKLLINSMAE 409
            ++   P  +R++DFHLLL+LA+  +L+ +D+  + + V+ +  VPEGY L+I+S+AE
Sbjct: 354 QKNRGKPYAERLADFHLLLYLAKQPNLDLADLAIIIEAVKTKGTVPEGYTLIIDSLAE 411


>gi|302853590|ref|XP_002958309.1| hypothetical protein VOLCADRAFT_69322 [Volvox carteri f.
           nagariensis]
 gi|300256334|gb|EFJ40602.1| hypothetical protein VOLCADRAFT_69322 [Volvox carteri f.
           nagariensis]
          Length = 421

 Score =  320 bits (819), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 179/423 (42%), Positives = 259/423 (61%), Gaps = 21/423 (4%)

Query: 1   MLLRIRSRDGLERVTVD-GAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
           MLLR+RSRDGLER+ VD GA+++   LK  I ++L IP++   LS    LL +K P    
Sbjct: 1   MLLRLRSRDGLERIEVDDGANLSA--LKAAINAKLAIPLDDMLLSKQPGLLTSKEP---Y 55

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAG---SFGRKMTMDDLIAKQ 116
            F D+A  D  L S  +SHG +V+L Y  ER V  P+  PAG    FG  MT++D++AK 
Sbjct: 56  SFRDLAINDASLRSQGVSHGDMVYLLYSFERQVE-PAVRPAGRSKPFGVHMTLNDVMAKV 114

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEP 176
           +++ RQE P   ++SFDR+ A+ FQ Y+     F++KRGG +YG V +DK V+V+FIYEP
Sbjct: 115 IKIERQEKPLVSAISFDRNAANVFQSYLQGAFNFSIKRGGLLYGNVDDDKTVKVDFIYEP 174

Query: 177 PQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNK----KDYTLSNREVL 232
           PQ+G+ + L + R   EE+ VD +A  LG +KVGFIF Q++   K     DY ++  E++
Sbjct: 175 PQEGSSDKLQLQRHTPEEQQVDLLAQLLGYRKVGFIFCQSVKAQKAAAESDYIINTDELI 234

Query: 233 QAVEFHA-ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE----- 286
                   E   E   TA+V L    E    +HFEAFQ SD+ VRL +EGW         
Sbjct: 235 MMAAMQVREGRGEHCTTALVTLVEEPETGPQVHFEAFQCSDLAVRLVREGWVVARDPVDG 294

Query: 287 IAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMR 346
           ++   +PK  ++K+ V++  +D  EVDND+FL  V I DH G L+++FP+ENR T     
Sbjct: 295 VSRMVNPKEPELKQPVMLNNRDTDEVDNDWFLCPVSIRDHDGRLTTSFPVENRLTPQGRT 354

Query: 347 ALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLN-SDVPALAQCVQAQTAVPEGYKLLIN 405
            L+ HL R  S P V+R+SDFHLLL+LA+  +L+ +D+  L + V+    V EGY+++I+
Sbjct: 355 ELREHLKRMSSRPYVERLSDFHLLLWLAKQPNLDPNDMALLCEAVREHRPVLEGYRVIID 414

Query: 406 SMA 408
           S+A
Sbjct: 415 SIA 417


>gi|145346114|ref|XP_001417539.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577766|gb|ABO95832.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 416

 Score =  314 bits (805), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 182/425 (42%), Positives = 269/425 (63%), Gaps = 29/425 (6%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M++R+R+RDG ER+TV  + +TVA+LK I++++L++P + Q LST+ ++L +K+P    +
Sbjct: 1   MIVRVRTRDGTERITVPSSDITVAKLKEILETELKVPKDEQILSTSLDVLTSKTPE---E 57

Query: 61  FTDMANPDRPLSSLNI-SHGSIVFLSYDGERNVRGPSFNPA-------GSFGRKMTMDDL 112
           F DMAN    ++S  I ++G++V L Y  ER    P+   A       G  GRKMT+DD+
Sbjct: 58  FKDMANGRAKIASCGIDTNGAVVHLMYGCERE---PTKTTAAVRDAARGLTGRKMTVDDM 114

Query: 113 IAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF 172
           IA Q+R+ RQE PHC SVSFD + A+ FQ YVN TL F   R G++YG   +D +  V+ 
Sbjct: 115 IAAQVRIERQEAPHCVSVSFDANAANVFQSYVNYTLGFKQVRFGWLYGARDDDGKTRVDA 174

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL 232
           IYEP Q+GTEE + +L D  E++  DA+A  LGM +VG I +    ++++D+TLS REV 
Sbjct: 175 IYEPEQEGTEEDVEVLEDTLEDEKADALAGALGMIRVGCIISTP--KDERDFTLSAREVS 232

Query: 233 QAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIA---E 289
           +     ++   +E+VTAVV     E+ +  + FEAFQ+SD  +++ + GWF  E     +
Sbjct: 233 RVARLQSKFG-KEFVTAVVSTFEDEDDNWQVSFEAFQLSDAAMKVQERGWFTGECGVDED 291

Query: 290 GDDPK-LSKMKKDVVVGG----KDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQV- 343
           GD+ K +SK+  +V+V       DV  VDND FLV V ILDH+G +S+TFP+ENR   V 
Sbjct: 292 GDELKGVSKLTNEVIVRDGPSINDVNAVDNDMFLVTVPILDHEGAISTTFPVENRLHPVQ 351

Query: 344 TMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLL 403
           T+  LK  L +    P  +R+ DFHLLLFLA+ +D+N ++  +A  V     VPEGY LL
Sbjct: 352 TVDDLKDALRKDK--PYGERLRDFHLLLFLAKHIDIN-ELAVIASAVHHGEDVPEGYTLL 408

Query: 404 INSMA 408
           I+ +A
Sbjct: 409 IDCLA 413


>gi|303282283|ref|XP_003060433.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457904|gb|EEH55202.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 432

 Score =  301 bits (772), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 191/439 (43%), Positives = 260/439 (59%), Gaps = 43/439 (9%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           +LLRIR+RDG ER+TVD A+ T+ +L+  I+++  +P E QTLS NQ LLLA + ++   
Sbjct: 3   LLLRIRTRDGTERLTVD-ANCTLRQLREKIETEYGVPQEDQTLSLNQGLLLANARAE--D 59

Query: 61  FTDMANPDRP-----LSSLNISHGSIVFLSYDGERNVRGPSFNPAGSF-----GRKMTMD 110
           F D+A P R      +  L I HGSIV+LSY  ER    P+   A S      G KMT++
Sbjct: 60  FDDLA-PARAKDAKLVRQLGIDHGSIVYLSYSVERE---PTPTAAASHRSELRGGKMTVE 115

Query: 111 DLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEV 170
            +IA Q RV RQE P+C SVSFD   A+AFQQYVN TL F   R G+MYGTV +   V V
Sbjct: 116 AMIAAQTRVERQETPNCASVSFDAHAANAFQQYVNGTLGFNQMRFGWMYGTVNDAGEVLV 175

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
           + IYEP Q+G+E+   I    EE+   DA+A+ LGM+KVG +F  T  + +K +TLS  E
Sbjct: 176 DAIYEPEQEGSEDAFIISEGSEEDARADALASALGMQKVGCVFNITTTKPRK-FTLSAFE 234

Query: 231 VLQAVEFHAECNMEE-----------------WVTAVVKLEVSEEGSAAIHFEAFQMSDM 273
           V +  +  A                       +VTAVV +   E+    + FE FQ+SD 
Sbjct: 235 VREMAKRQAAAAAAAAAAAAADGAEKKDGKTTFVTAVVMMLEDEDEGPQVSFEPFQVSDQ 294

Query: 274 CVRLFKEGWFETEIAEGDDPKLSKMKKDVVVG---GKDVKEVDNDFFLVVVKILDHQGPL 330
           CVRL+ +GWF  E  E  D  L++M K+V+V     KDV EVDND +LV V I DH G L
Sbjct: 295 CVRLYADGWFHDEPCE--DRGLTRMTKEVIVADGVSKDVTEVDNDRWLVAVPIKDHSGAL 352

Query: 331 SSTFPIENRTTQV-TMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
           S+ FPIENR   + TM  LK  L++S  +    R+ DFHLLLFL++ L  + D+  +A+ 
Sbjct: 353 STKFPIENRLHPIQTMEDLKDSLSQS-GMEYSARLRDFHLLLFLSKHLGTD-DLVDVAKT 410

Query: 390 VQAQTAVPEGYKLLINSMA 408
            +    + EGYK++I+S+A
Sbjct: 411 ARRGGTIAEGYKIIIDSLA 429


>gi|308803090|ref|XP_003078858.1| Nuclear pore complex, rNpl4 component (sc Npl4) (ISS) [Ostreococcus
           tauri]
 gi|116057311|emb|CAL51738.1| Nuclear pore complex, rNpl4 component (sc Npl4) (ISS) [Ostreococcus
           tauri]
          Length = 412

 Score =  298 bits (764), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 166/418 (39%), Positives = 261/418 (62%), Gaps = 19/418 (4%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M++R+R+RDG ERV +  A  TV ELK  I+ +L++P + Q LST+ ++L +K PS    
Sbjct: 1   MIVRVRTRDGTERVEIPSAASTVGELKDAIERELKVPRDDQILSTSPDVLTSKDPS---A 57

Query: 61  FTDMANPDRPLSSLNIS-HGSIVFLSY----DGERNVRGPSFNPAGSFGRKMTMDDLIAK 115
           F DM++    L S+ +  +G+IV   Y    +  + +     +  G  G KMT+D +IA 
Sbjct: 58  FLDMSDDGDALESVGVKQNGAIVHARYTVCREPTKTIAATHNDARGLTGGKMTVDQMIAA 117

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYE 175
           Q R+ RQE PHC SVSFD + A+ FQ YVN TL F   R G++YGT  ++ +V V  IYE
Sbjct: 118 QTRIERQEAPHCVSVSFDANAANVFQSYVNYTLGFKQVRFGWLYGTRDDENKVRVEAIYE 177

Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAV 235
           P Q+GTE+ + +L + +E+   + +A+ +G+ +VG I +    ++++D+TLS +EV +  
Sbjct: 178 PEQEGTEDDVEVLENTQEDHNAECLASSMGLTRVGCIISTP--KDERDFTLSAKEVARIA 235

Query: 236 EFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIA---EGDD 292
              ++   + + TAV+     E+ +  + FEAFQ+SD+ ++++  GWF  + A    G+D
Sbjct: 236 RLQSKYG-QHFATAVMMTFEDEDDNWQVSFEAFQLSDLAIKIYDRGWFTGKCAVDENGED 294

Query: 293 PK-LSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQV-TMRALKS 350
            K +SK+  +V+V  KDV +VDNDFFLV V ILDH+GPLSSTFP+ENR   + T+  L+ 
Sbjct: 295 LKGVSKLTNEVMVERKDVFDVDNDFFLVTVPILDHEGPLSSTFPVENRLHPIQTVDDLRE 354

Query: 351 HLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
            L +    P   R+ DFHLLLFLA+ +D+N ++ A++        +PEGY+L+I+ +A
Sbjct: 355 ALQKDK--PYGDRLRDFHLLLFLAKHIDIN-ELVAISGAAHEGEEIPEGYRLMIDCLA 409


>gi|255085502|ref|XP_002505182.1| predicted protein [Micromonas sp. RCC299]
 gi|226520451|gb|ACO66440.1| predicted protein [Micromonas sp. RCC299]
          Length = 395

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 169/391 (43%), Positives = 244/391 (62%), Gaps = 18/391 (4%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           ML+R+R+RDG ERV VD    T+++L+T+I+ +  +P   Q LS +Q LL++K P     
Sbjct: 1   MLIRLRTRDGTERVEVDDK-CTLSQLRTVIEEKFGVPTSDQRLSLDQGLLMSKDPH---A 56

Query: 61  FTDMA----NPDRPLSSLNISHGSIVFLSYDGERNVR---GPSFNPA-GSFGRKMTMDDL 112
           + D+     N  +PL SL I HG+IV+++Y  ER V      + N A G  G KMT++ +
Sbjct: 57  YDDLKPGRKNDGKPLKSLKIEHGTIVYMTYSVERKVTVCTAAAHNDARGLTGGKMTVEAM 116

Query: 113 IAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF 172
           IA Q R+  QE+P+ ES+S D   A+AFQ YVN TL F   R G+MYGT  +D  V V+ 
Sbjct: 117 IAAQTRIETQESPNVESLSLDGHAANAFQAYVNGTLGFNQMRFGWMYGTCSDDGEVRVHA 176

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL 232
           IYEP Q+GTE+   ++ D +E+   +AIA  LGM +VG +F  T  + + DYT+S  E+ 
Sbjct: 177 IYEPEQEGTEDAFVVVEDADEDARAEAIAESLGMTRVGMVFNVTTTKERPDYTMSAFEIR 236

Query: 233 QAVEFHAECNMEEWVTAVVK-LEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG- 290
           Q  ++ ++   +++VTAVV  LE  +EG+  + FE FQ+S+ CVRL+ EGWF  E  E  
Sbjct: 237 QMAKYQSKYG-DKFVTAVVMMLEDEDEGTPQVSFEPFQVSNQCVRLYSEGWFHDEPCEKP 295

Query: 291 DDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQV-TMRALK 349
              +L+K    V    KDV EVDND +LV VKILDH+GP  ++FPIENR   V T+  LK
Sbjct: 296 GHTRLTKDVVVVDKVSKDVTEVDNDRWLVPVKILDHEGPWRTSFPIENRLHPVQTVEDLK 355

Query: 350 SHLNRSPSLPLVKRISDFHLLLFLARFLDLN 380
             +++     +  +++DFHLLLFL++ LD N
Sbjct: 356 DAVSKGGQYCV--KLADFHLLLFLSKHLDAN 384


>gi|414589343|tpg|DAA39914.1| TPA: hypothetical protein ZEAMMB73_187365 [Zea mays]
          Length = 392

 Score =  270 bits (691), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 131/222 (59%), Positives = 170/222 (76%), Gaps = 4/222 (1%)

Query: 110 DDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRV 168
           ++LI +Q+RVTRQEN  C + SFDRD A+AFQ Y  + L F VKR GF+YG V  E K V
Sbjct: 18  ENLITRQIRVTRQENALCAAASFDRDAANAFQLYDAKFLDFGVKRAGFLYGRVDAETKDV 77

Query: 169 EVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YT 225
            V+FIYEPPQQG+E+V+++LRD +EE  VD IA GLGM+ VG +FTQ + +   +   YT
Sbjct: 78  FVDFIYEPPQQGSEDVVHLLRDPDEEARVDTIAEGLGMRWVGLVFTQAVGRKPSETGEYT 137

Query: 226 LSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET 285
           +SNREV+QA +  AE  + +WVTA+VKLEV ++G+  +HFEAFQMS++CV+LFK+G  ET
Sbjct: 138 MSNREVMQAAQLQAEGGIPKWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLET 197

Query: 286 EIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
           E+ + DDP+LSKM+K+VV GGKD  EVDNDFFLV VKI DHQ
Sbjct: 198 EVQDADDPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQ 239


>gi|307108993|gb|EFN57232.1| hypothetical protein CHLNCDRAFT_143718 [Chlorella variabilis]
          Length = 564

 Score =  261 bits (668), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 166/422 (39%), Positives = 224/422 (53%), Gaps = 57/422 (13%)

Query: 7   SRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQFTDMA 65
           +RDGLER+ V DG+  T+A LK  IQ +L +P+    LS N  LL AK+ +    F+DM 
Sbjct: 178 NRDGLERIEVEDGS--TLAGLKLAIQQKLGVPLADMLLSNNAQLLTAKNAAG---FSDML 232

Query: 66  NPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAG----SFGRKMTMDDLIAKQMRVTR 121
           +    L  L + HG +VFL Y  ER V  P F P       FG  M +  ++A Q R+ R
Sbjct: 233 DNSASLRQLGVHHGDMVFLLYHFERQVE-PVFRPKDWERRPFGAHMDVAAMVAAQTRIER 291

Query: 122 QENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVL--------------EDKR 167
           Q+ PH  S SFD    +AFQ YV+  + F++KRGG M+  V               E+  
Sbjct: 292 QDTPHAASASFDFATTNAFQSYVSSAIVFSIKRGGIMWWYVRCEGGWANTGQDRMDEEGA 351

Query: 168 VEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLS 227
           V VN +YEPPQQG+ + L + R   EE+  D IA  LG +KVG+IF Q+    ++D+ ++
Sbjct: 352 VFVNAVYEPPQQGSADSLSLERGTPEERQADFIAERLGWRKVGWIFAQST--KERDFIMT 409

Query: 228 NREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI 287
             EV Q      E   E  VTAVV     E+G   +HFE                   E 
Sbjct: 410 GEEVCQMAGIQDEMG-EHAVTAVVATWPGEDGQPEVHFE-------------------EP 449

Query: 288 AEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRA 347
            E            V+V GKD+ EVDND+FLV V + DH+GPL + FP+ENR     M  
Sbjct: 450 GE---------AAPVIVAGKDLGEVDNDYFLVPVAVRDHEGPLETKFPLENRLLPQGMPE 500

Query: 348 LKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS-DVPALAQCVQAQTAVPEGYKLLINS 406
           LK+HL    S P   RI+DFHLLLFLAR  +L   +V  L   V+ +  VPEG++++I+S
Sbjct: 501 LKAHLQARRSAPYWARIADFHLLLFLARQPNLEEPEVGLLVDAVRDKAPVPEGFQIIIDS 560

Query: 407 MA 408
           MA
Sbjct: 561 MA 562


>gi|440794522|gb|ELR15682.1| NPL4 family protein [Acanthamoeba castellanii str. Neff]
          Length = 392

 Score =  221 bits (564), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 153/405 (37%), Positives = 224/405 (55%), Gaps = 38/405 (9%)

Query: 22  TVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQFTDMANPDRPLSSLNISHGSI 81
           TV +LK  +Q +  IPV +QTLS      L ++  ++L  +  AN    LSSL I HG +
Sbjct: 6   TVQDLKKKVQQKTLIPVANQTLS------LDRAGKEVL--SSGANT---LSSLGIRHGQM 54

Query: 82  VFLSYDGER---NVRGPS--FNPAGSFGRKMTMDDLIAKQMRVTRQENPHCESVSFDRDC 136
           VFLSY GER   N   P    N    +  K  M+ L ++ +    QE  HC+  S D   
Sbjct: 55  VFLSYTGERPINNDEDPKKLRNIKKKWDLKSLMEHLDSQGLVFKAQETTHCKVASLDFAS 114

Query: 137 ADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKL 196
            D FQQY  E  AF  +R G++YG  LED  V   +IYEPPQ+ +     +L D  EEKL
Sbjct: 115 CDKFQQYPRE-FAFLKRREGYLYGKYLEDGNVTAEYIYEPPQEQSPSGTILLDDPAEEKL 173

Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTA-----VV 251
           VD IA  LG++KVG+IFT     +++DY +S+ EV++A + +AE  ++++  A      V
Sbjct: 174 VDTIANALGVQKVGWIFT-VPPNDERDYIMSSHEVVRAAQQNAEA-IKKYGDAGKSFIAV 231

Query: 252 KLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVV-----GG 306
           K+   EEG   +HFE FQ++D  V+L +    +       DPK+   K++V +     G 
Sbjct: 232 KVSTDEEGQ--VHFEPFQVTDQAVKLVQNDLLKPP----KDPKVVVAKQEVAMARQGAGR 285

Query: 307 KDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQV-TMRALKSHLNRSPSLPLVKRIS 365
           KD  E+D D  +  V +++H+GPLS  F IENR      M ALK HL +    P +  ++
Sbjct: 286 KDSPELDPDVLIKPVAVVNHEGPLSCVFEIENRPEHPQNMAALKRHLLQFKGKPFLLAVT 345

Query: 366 DFHLLLFLAR-FLDLNSDVPALAQCVQ-AQTAVPEGYKLLINSMA 408
           DFH + +L + FL   +D+PAL + V+   +   EG+KLLI+S+A
Sbjct: 346 DFHFIRYLGKEFLSPETDIPALCEAVKTGDSDALEGFKLLIDSIA 390


>gi|325185911|emb|CCA20415.1| NPL4like protein putative [Albugo laibachii Nc14]
          Length = 429

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 144/446 (32%), Positives = 213/446 (47%), Gaps = 53/446 (11%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           ML+R+RS++G  R+       TV +LK  ++    I   +Q    +         SD LQ
Sbjct: 1   MLVRVRSKNGTWRIDNLSPQSTVTDLKECVEKIHGIASHNQRYHLDNQGKAPVHESDTLQ 60

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-------------------------P 95
                       S+ + HG +++L +DG+  V G                          
Sbjct: 61  ------------SIRVGHGDLIYLQHDGDTIVTGRTAVSRKVNADGSITHVEYHVVNGKK 108

Query: 96  SFNPAGSFGRKMTMDDLIAKQMRVTR--------QENPHCESVSFDRDCADAFQQYVNET 147
           SF P     R + M   I + MR+          QE  H  +VS D+   D FQ Y+ + 
Sbjct: 109 SFRPGLKALRDIKMHWTIGEFMRMDEEFEFKLKAQEKAHVANVSLDKSSCDNFQSYLRQ- 167

Query: 148 LAFAVKRGGFMYGTV-LEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
            AF   R G++YG +  E K+V V+FIYEPPQ+GT     +  D   EK  DA+A  LG 
Sbjct: 168 FAFQQSRCGWLYGNIDQESKQVTVDFIYEPPQEGTPHDFQVFEDPLGEK-ADAVAKALGR 226

Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFE 266
           +KVG++F+    ++  D+  S+ E+L A +   +   +  +   +K+ + +EG     FE
Sbjct: 227 EKVGWVFSHPPREDA-DFHFSSNEILMAAQLQLDAGGKGSLFVSLKVTLDKEGQTT--FE 283

Query: 267 AFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDH 326
           AFQ+SD C+ +F  G    E A              +V GK   EVDN+FFL VV +  H
Sbjct: 284 AFQVSDQCLDMFAAGALSAEHASNSKVCAISETYTAIVEGKTATEVDNNFFLCVVPVSTH 343

Query: 327 QGPLSSTFPIENRTTQVTMR-ALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPA 385
              L S FP  +R     +R AL+ HL +  S P ++R+ DFHLLLFL   LD N+DVP 
Sbjct: 344 DSGLKSEFPRPHREIPRPLREALRRHLQKYRSEPYIERLKDFHLLLFLTEVLDPNTDVPT 403

Query: 386 LAQCV-QAQTAVPEGYKLLINSMAET 410
           + Q +   Q A+  GY +LI+S+A T
Sbjct: 404 ICQSIMDPQIALDSGYPILIDSVAGT 429


>gi|348686646|gb|EGZ26461.1| hypothetical protein PHYSODRAFT_345180 [Phytophthora sojae]
          Length = 428

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 156/447 (34%), Positives = 226/447 (50%), Gaps = 61/447 (13%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRI-PVESQTLSTNQNLLLAKSPSDLL 59
           ML+R+RS++G  RV       T+A++K  I  Q  I P + Q     Q    A++ SD  
Sbjct: 1   MLVRVRSKNGTWRVGDLTPSSTIADIKRWIFEQHSIAPAQQQLALDQQG---AQTTSDT- 56

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGER----NVRGPSFNPAGSFGR---------- 105
                    + L  L + HG ++FL YDGE      V G   N  G+  R          
Sbjct: 57  ---------QTLRELRVGHGDMLFLDYDGETLGTGGVVGTKINADGTLTRVAYHDRADKQ 107

Query: 106 ----------KMTMDDLIAKQMR--------VTRQENPHCESVSFDRDCADAFQQYVNET 147
                      M M   + + MR        V+ Q+ P    V  D    + FQ Y+ + 
Sbjct: 108 AFRPGMKSLRDMKMYWTLGEFMRMDAQFEFKVSAQKGPQVAKVHLDGASCNDFQAYLRQ- 166

Query: 148 LAFAVKRGGFMYGTV-LEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
            AF   R G++YG+V  E K V V+FIYEPPQ+G      +L D   +K VDA+A  LG 
Sbjct: 167 FAFQQSRCGWLYGSVDAETKEVTVDFIYEPPQEGNPYGFEVLDDPNADK-VDAVAEALGY 225

Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFE 266
           +KVG++F+    ++  D+  S REVL A +  A+   +  +   +K+ + +EG A+  FE
Sbjct: 226 EKVGWVFSHPPREDD-DFHFSAREVLLAAQLQADAGGKGSLFLTLKVTLDKEGQAS--FE 282

Query: 267 AFQMSDMCVRLFKEGWFETEIAEGD-DPKLSKMKKD--VVVGGKDVKEVDNDFFLVVVKI 323
           AFQ+SD CV +F  G     +AE + +P+   +      +V  K   EVDN+FFL VV +
Sbjct: 283 AFQVSDQCVEMFAAG----ALAENEQNPQACAVSDTFTALVEAKPATEVDNNFFLCVVPV 338

Query: 324 LDHQGPLSSTFPIENRTTQVTMRA-LKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
           + H+ PL S FP  NR      RA LK+HL +    P  KRISDF LL+FL+ FLDL +D
Sbjct: 339 VPHESPLRSEFPKLNREGSYRSRAALKAHLQKYRDEPYAKRISDFQLLVFLSDFLDLRTD 398

Query: 383 VPALAQCVQ-AQTAVPEGYKLLINSMA 408
           +PA+ Q V+     +  GY +LI+S+A
Sbjct: 399 IPAICQAVRDPNIPLDSGYPILIDSVA 425


>gi|301103316|ref|XP_002900744.1| NPL4-like protein [Phytophthora infestans T30-4]
 gi|262101499|gb|EEY59551.1| NPL4-like protein [Phytophthora infestans T30-4]
          Length = 428

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 151/445 (33%), Positives = 221/445 (49%), Gaps = 57/445 (12%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           ML+R+RS++G  RV       T+A++K  I  Q  I    Q L+ +Q            Q
Sbjct: 1   MLIRVRSKNGTWRVGDLTPSSTIADIKRWIFEQHSIAPARQQLALDQQ--------GAQQ 52

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGER-----------NVRGP-------------S 96
            TD     + L  L + HG ++FL YDGE            N  G              +
Sbjct: 53  TTDT----QTLRELRVGHGDMLFLDYDGETVSTGGVVGTKINADGTLTHVAYHGRADKQA 108

Query: 97  FNPAGSFGRKMTMDDLIAKQMR--------VTRQENPHCESVSFDRDCADAFQQYVNETL 148
           F P     R M M   + + MR        ++ Q+ P    V  D    + FQ Y+ +  
Sbjct: 109 FRPGMKSLRDMKMYWTLGEFMRMDAQFEFKISAQKEPQVAKVHLDGASCNDFQAYLRQ-F 167

Query: 149 AFAVKRGGFMYGTV-LEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMK 207
           AF   R G++YG+V  E K V V+FIYEPPQ+G      +L D   +K VDA+A  LG +
Sbjct: 168 AFQQSRCGWLYGSVDSETKEVTVDFIYEPPQEGNPYGFEVLDDPNADK-VDAVAEALGYE 226

Query: 208 KVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEA 267
           KVG++F+    ++  D+  S RE L A +  A+   +  +   +K+ + ++G A+  FEA
Sbjct: 227 KVGWVFSHPPREDD-DFHFSARETLLAAQLQADAGGKASLFLTLKVTLDKQGQAS--FEA 283

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKD--VVVGGKDVKEVDNDFFLVVVKILD 325
           FQ+SD CV +F  G     +    +PK   +      +V  K   EVDN+FFL VV ++ 
Sbjct: 284 FQVSDQCVEMFAAGAL---MENEHNPKACAVTDTFTALVEAKRATEVDNNFFLCVVPVVP 340

Query: 326 HQGPLSSTFPIENRTTQVTMRA-LKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
           H+ PL   FP  NR      RA LKS L +  + P  KRISDF LL+FL+ FLD+ +D+P
Sbjct: 341 HESPLRCEFPKLNREGAYRNRAALKSQLQKYRNEPYAKRISDFQLLVFLSDFLDVQTDIP 400

Query: 385 ALAQCVQ-AQTAVPEGYKLLINSMA 408
           A+ Q V+   T +  GY +LI+S+A
Sbjct: 401 AICQAVRDPNTPLDSGYPILIDSVA 425


>gi|412988215|emb|CCO17551.1| predicted protein [Bathycoccus prasinos]
          Length = 499

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 158/503 (31%), Positives = 247/503 (49%), Gaps = 103/503 (20%)

Query: 1   MLLRIRSRDGLERV-TVDGAH-------VTVAELKTIIQSQ-LRIPVESQ---------- 41
           +LLRIR+RDG ER+   DGA         T+++++ +I  Q   +PVE+Q          
Sbjct: 2   VLLRIRTRDGTERLECADGAGGGENDVVKTISDVQDLISKQPWGVPVENQSLSLDPSLLL 61

Query: 42  -TLSTNQNLLLAKSPSDLLQFTDMANPDRPLSSLNISHGSIVFLSYDG--ERNVRGPSFN 98
            T S N +   AK+  +++    + +P + L SL + HG IV+++Y    ER V     N
Sbjct: 62  RTSSENADETKAKTKKEVV----VLSPRKTLKSLQLKHGDIVYMTYPDSIEREVTKMRVN 117

Query: 99  -PAGSFGRKMTMDDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGF 157
                FG K T++++I+K++R+ RQE  +  SVSF    A+ FQ YVN TL F   R G+
Sbjct: 118 VETKPFGAKRTIEEMISKEVRIGRQERGNVASVSFCGHAANMFQSYVNNTLGFKQSRFGW 177

Query: 158 MYGTVLEDKRVE-----------------------------------VNFIYEPPQQGTE 182
           +YGT  + K+ +                                    + IYEP Q G  
Sbjct: 178 LYGTKTKRKKEKTEEEKEKEKSKDASSKKKEKDEEEGKEDEMITELFADVIYEPNQNGNA 237

Query: 183 EVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDY--TLSNREVLQAVEFHAE 240
            V ++  ++EE +  +AIA  +G +++G +F+Q+ +    +    +S  EV    +  A+
Sbjct: 238 YVAHVDEEDEESQTAEAIANAMGYERIGCVFSQSDVDRGDESVSAMSAHEVTLCAKLQAK 297

Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG---------- 290
              +E+VT +V     E+    + FEAFQ+SD CV+L+++G+FE +   G          
Sbjct: 298 YG-KEFVTVIVVQFEDEDEEMQVTFEAFQVSDQCVKLYEDGFFELDEKTGVAKLPASSSA 356

Query: 291 -----------------DDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSST 333
                            +    +K+ K V+V  KDV EVDNDF++V V I DH G   S 
Sbjct: 357 GITKEEIEEEEEGSSFSNWGSHTKLVKPVMVERKDVTEVDNDFWIVAVPIKDHTGTAQSG 416

Query: 334 FPIENR-----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQ 388
           FPIENR     T      A++S      S    +++ DFHLLLFL++ +D N DV A+A+
Sbjct: 417 FPIENRLHPPQTADDIRSAIQSRSATCSS--YAEKLRDFHLLLFLSKQMDKN-DVLAIAE 473

Query: 389 CVQAQT---AVPEGYKLLINSMA 408
                     V EGYK+LI+++A
Sbjct: 474 STNDYIDGGDVSEGYKVLIDAIA 496


>gi|255642006|gb|ACU21270.1| unknown [Glycine max]
          Length = 114

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 88/113 (77%), Positives = 100/113 (88%)

Query: 298 MKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPS 357
           MKKDVVVG KD +EVDNDFFLVVVKI DHQGPLSS+FPIENR TQV M+ALK+HL+R+ +
Sbjct: 1   MKKDVVVGVKDTREVDNDFFLVVVKIADHQGPLSSSFPIENRNTQVPMKALKNHLDRTKN 60

Query: 358 LPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
           LP VKRISDFHLLL LAR LDLN+DVPAL +CVQ QT+VPEGY++LI SMA T
Sbjct: 61  LPFVKRISDFHLLLVLARVLDLNADVPALTECVQTQTSVPEGYQILIESMAST 113


>gi|296086647|emb|CBI32282.3| unnamed protein product [Vitis vinifera]
          Length = 114

 Score =  184 bits (468), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 91/111 (81%), Positives = 103/111 (92%)

Query: 298 MKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPS 357
           MKKDVVVG KD +EVDNDFFLV+VKILDHQGPLSSTFPIENR TQVTMRALK+HL+R+ +
Sbjct: 1   MKKDVVVGVKDTREVDNDFFLVLVKILDHQGPLSSTFPIENRITQVTMRALKNHLDRAKN 60

Query: 358 LPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           LP VKRISDFHLLL L+RFLD+NSDVPALA+CVQ Q+AVPEGY+LLI S+A
Sbjct: 61  LPFVKRISDFHLLLLLSRFLDINSDVPALAECVQTQSAVPEGYQLLIESLA 111


>gi|452823734|gb|EME30742.1| nuclear import protein NPL4-like protein [Galdieria sulphuraria]
          Length = 466

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 112/316 (35%), Positives = 171/316 (54%), Gaps = 33/316 (10%)

Query: 114 AKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFI 173
           A +  +  QE  +  SVS D   A +FQ  +  ++ F  +R GF+YG + +   V V+ I
Sbjct: 158 AMKFHIKAQEESYTNSVSVDHAAAFSFQANLV-SIGFQQQRLGFLYGRIEDSSFVFVDCI 216

Query: 174 YEPPQQGTEEVLYILRDEEE-------EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTL 226
           YEPPQ GT ++ Y L   EE       +K  D +A+ LG++KVG+IF+       +    
Sbjct: 217 YEPPQSGTYQI-YKLESPEENPDAAESKKRADKLASLLGLEKVGWIFSS----KPRKCIF 271

Query: 227 SNREVLQAVEFHAECNMEEW-------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFK 279
           S  ++L A +F  +C  EE            + + V+EEG     FEA+Q+SD+C+ ++K
Sbjct: 272 SAVDILTACQF--QCEAEEMHGKDVSKTFVTITVSVTEEG--MTRFEAYQISDLCLDMYK 327

Query: 280 EGWFETEIAEGDDPKLSKM--KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-SSTFPI 336
              F     E   P   K+  K++V+V G + + +D DFFLV V I DH   L ++ FP+
Sbjct: 328 ASVFAP--VEQQKPNSGKIRTKEEVIVEGVECRSIDTDFFLVSVPIKDHHSWLKNNNFPV 385

Query: 337 ENR-TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQ-- 393
           ENR    V    L S L +S   P ++RISDFHLLL+++  LD++ D+P L   V+ Q  
Sbjct: 386 ENREQAPVQPSDLSSCLGKSLDEPFIRRISDFHLLLYISNVLDMDGDMPLLCNAVREQRE 445

Query: 394 -TAVPEGYKLLINSMA 408
            T   EGY+L+I ++A
Sbjct: 446 LTDEEEGYRLMIEAIA 461


>gi|449016512|dbj|BAM79914.1| similar to nuclear import protein NPL4 [Cyanidioschyzon merolae
           strain 10D]
          Length = 485

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 115/331 (34%), Positives = 166/331 (50%), Gaps = 40/331 (12%)

Query: 105 RKMTMDDL---IAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT 161
           + MT+D L    A+  R+  Q+   C   S DR  ADAFQ Y+   +  +  R G +YG 
Sbjct: 166 KGMTVDYLEWLDAQHFRLRSQDESPCSQCSIDRQAADAFQSYLLR-IHDSQMRYGILYGD 224

Query: 162 VL-EDKRVEVNFIYEPPQQG-TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ 219
           V+  +  V+V  I+EPPQ G  +  L +   E   +  D +A  LGM++VG+IFT     
Sbjct: 225 VVASEGLVKVEAIFEPPQAGDADRYLPLPESEVAIREADTLAHYLGMRRVGWIFTHRA-- 282

Query: 220 NKKDYTLSNREVLQAVEFHAECNME---------------EWVTAVVKLEVSEEGSAAIH 264
             + + L  R+V  A +   E                   +W    V + VS +     H
Sbjct: 283 EDRSHALHARDVAYAAKLQLETTAYHLAEQERDQTPEASFDWSRYFVTIAVSIQPDGRRH 342

Query: 265 FEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKIL 324
           FEA+Q+SD C+ + + G     +   D  +L  +  +VVV G  V+ VD DFFLV+V I 
Sbjct: 343 FEAYQLSDQCLLMAQAGIL---MPVSDSGELHTLD-EVVVEGNRVRRVDTDFFLVLVPIR 398

Query: 325 DHQGPLSSTFPIENRTTQV-----TMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDL 379
           DHQG L + FPIENR  +        R L+S   +       +RI+DFHLLLFL+R L +
Sbjct: 399 DHQGWLRTAFPIENREVEPQSITDVQRVLESGYGK-----FTERIADFHLLLFLSRILGM 453

Query: 380 NSDVPALAQC-VQAQTAVPE--GYKLLINSM 407
            SD+P LA   V+ +   P   GY+LLI S+
Sbjct: 454 ESDMPLLANAIVERRDLEPAELGYQLLIESL 484


>gi|330840932|ref|XP_003292461.1| hypothetical protein DICPUDRAFT_83074 [Dictyostelium purpureum]
 gi|325077301|gb|EGC31023.1| hypothetical protein DICPUDRAFT_83074 [Dictyostelium purpureum]
          Length = 572

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 100/286 (34%), Positives = 155/286 (54%), Gaps = 21/286 (7%)

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYE 175
           +M++  Q+NPHC     D   A+ FQQYV ++  F V+R GF+YG +L+D  V ++ IYE
Sbjct: 278 KMKLKSQDNPHCPGALVDFTSANIFQQYV-QSKKFEVQRVGFLYGNILQDGSVVIDSIYE 336

Query: 176 PPQQGTE-EVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQA 234
           PPQ+  + + + IL D   EK V++IA+ LGM +VG+IF+       + Y +S+++++QA
Sbjct: 337 PPQECKDGQTVTILNDPRGEK-VESIASMLGMTRVGWIFSHP----SRKYVMSSKDIIQA 391

Query: 235 VEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
             F    N      A + L  + EG + I  EAFQ+SD  + L K G F   +    DP 
Sbjct: 392 AAFQ---NKYGSSFATLILSANTEGQSNI--EAFQVSDQALALEKRGEF---LQNQPDPT 443

Query: 295 LSKMKKDVVVGGKDVKEVDNDFFLVVV--KILDHQGPLSSTFPIENRTTQVTMRALKSH- 351
             K+K  V   G +    DN+FF+V V  K  + +      FP+ENR    T+  L S+ 
Sbjct: 444 KCKLKTAVFEEGTETNNADNNFFIVTVPLKAREDKSIFKCNFPVENRQPGTTISDLASYK 503

Query: 352 LNRSPSLPLVKRISDFHLLLFL--ARFLDLNSDVPALAQCVQAQTA 395
           L    + PL +  SDFH L+FL      D + D P + + +Q +++
Sbjct: 504 LELKDTSPL-EFFSDFHFLIFLLENELFDFSVDFPVICESIQKRSS 548


>gi|218188252|gb|EEC70679.1| hypothetical protein OsI_02004 [Oryza sativa Indica Group]
          Length = 124

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 70/102 (68%), Positives = 84/102 (82%)

Query: 226 LSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET 285
           +SNREVLQA E  AE  + EWVTA+VKLEV ++GS  +HFEAFQMS++CV+LFK+G  ET
Sbjct: 1   MSNREVLQATELQAEGGIPEWVTAIVKLEVGDDGSGDVHFEAFQMSEICVKLFKDGVLET 60

Query: 286 EIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
           EI + DDP+LSKM+K+VV GGKD  EVDNDFFLV VKI DHQ
Sbjct: 61  EIGDKDDPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQ 102


>gi|281209275|gb|EFA83448.1| nuclear protein localization 4 [Polysphondylium pallidum PN500]
          Length = 602

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 93/284 (32%), Positives = 161/284 (56%), Gaps = 18/284 (6%)

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYE 175
           +M++  Q+  HC   +     A+ FQ YV+ +  F  +R GF+YG  L+D  V V+ IYE
Sbjct: 311 KMKLKSQDAAHCPGAAIAVTPANEFQHYVS-SRKFEEQRIGFLYGNFLQDGSVSVDVIYE 369

Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAV 235
           PPQ+G ++ + +L D+  +K ++++A+ LGM +VG+IF+       + Y +S+ E+LQA 
Sbjct: 370 PPQKGDKKSVTLLEDKSIDK-IESLASLLGMTRVGWIFSHP----SRKYVMSSSEILQAA 424

Query: 236 EFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKL 295
            +  +     +VT +  L V+ EG++  + EAFQ+SD  ++L K G F++  +E   P +
Sbjct: 425 NYQNKFG-NSFVTLI--LSVNSEGNS--NLEAFQVSDQAMKLEKTGEFQSNQSE---PTI 476

Query: 296 SKMKKDVVVGGKDVKEVDNDFFLVV--VKILDHQGPLSSTFPIENRTTQVTMRALKSHLN 353
            K+K  V+V G +    DN+FF+V   VK  + +  L+  FP+ NR    +   L S+  
Sbjct: 477 CKLKSPVLVEGSETLNADNNFFIVTVPVKAREDKPVLNCVFPVNNRNPPTSDSDLASYKI 536

Query: 354 RSPSLPLVKRISDFHLLLFL--ARFLDLNSDVPALAQCVQAQTA 395
            +     +   SDFH L++L      DLN+D+P +   V+++T+
Sbjct: 537 DNQDQSKLTFFSDFHFLVYLLMHGIFDLNTDLPLICDNVRSRTS 580


>gi|299117192|emb|CBN75156.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 441

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 126/451 (27%), Positives = 197/451 (43%), Gaps = 57/451 (12%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M++R+R+  G+ RV       T+ EL+  + ++    +      T Q L L  +P     
Sbjct: 1   MIIRVRTHAGVWRVNDVAPDTTIKELRQRLSTEHNANLSD---DTRQPLTLKPNPRGD-- 55

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYD------GERNVRGP------------------- 95
             D    +  L SL + HG ++ L+ D            GP                   
Sbjct: 56  -QDTLALESTLQSLGLGHGDMIHLNLDESIRDMAHEEAGGPKRINKDGTIEQQSFDEISN 114

Query: 96  --SFNPAGSFGRKMTMDDLIAK--------QMRVTRQENPHCESVSFDRDCADAFQQYVN 145
              F P     R M M   +A           R+ + E   C  VS D    ++FQ +V 
Sbjct: 115 KTGFRPGMMSLRSMKMKWTLADFTEMNDQFTFRLKKPEKGVCTKVSLDSAACNSFQSFVR 174

Query: 146 ETLAFAVKRGGFMYGTVLEDK-RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
           +   F   R G++YG   +D  +V V  +YEPPQ+   E   +  D + +  V+A+A  L
Sbjct: 175 Q-FGFHRARMGYLYGHFTDDDTKVRVECVYEPPQENYPEGFQVSEDPKADT-VEALAGLL 232

Query: 205 GMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIH 264
           G+K+VG+IF       ++ +  S+ EV+ A     E       T  V ++V+ E   + H
Sbjct: 233 GLKRVGWIFAHP--PREEGFLFSSAEVVTAATLQLEAADGVNDTPFVTVKVTAEEDGSAH 290

Query: 265 FEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKD--VVVGGKDVKEVDNDFFLVVVK 322
           F+AFQ+S  C+ +  EG  E     G++P    + K    +V  K+ KEVD   FL  V 
Sbjct: 291 FDAFQVSKQCMEMVAEGALEV----GENPGHCMVPKTFTAIVEMKEAKEVDTTMFLNTVP 346

Query: 323 ILDHQ-GPLSSTFPIENRTTQV-TMRALKSHLNRS--PSLPLVKRISDFHLLLFLARFLD 378
           I  H+       FP  NR   + T   +K  L R+       +  +SDFHLLLFL  FLD
Sbjct: 347 IEQHESAKFLHDFPRANRDGVMQTWDDVKRQLGRAGGQGFTYMDVLSDFHLLLFLTAFLD 406

Query: 379 LNSDVPALA-QCVQAQTAVPEGYKLLINSMA 408
           + +D+P +    V  +  + +G+K LIN+ A
Sbjct: 407 MQTDMPKICDSVVDREKPLDDGFKFLINNFA 437


>gi|159163292|pdb|1WF9|A Chain A, Solution Structure Of A Novel Beta-Grasp Fold Like Domain
           Of Hypothetical Protein (Arabidopsis Thaliana)
          Length = 107

 Score =  144 bits (363), Expect = 8e-32,   Method: Composition-based stats.
 Identities = 64/93 (68%), Positives = 85/93 (91%)

Query: 2   LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQF 61
           +LR+RSRDGLERV+VDG H+TV++LKT+IQ QL+IP+ +QTLSTN+NLLLAKSPSD L F
Sbjct: 9   MLRVRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAKSPSDFLAF 68

Query: 62  TDMANPDRPLSSLNISHGSIVFLSYDGERNVRG 94
           TDMA+P+  +SSLN++HGS+V+L+Y+GER +RG
Sbjct: 69  TDMADPNLRISSLNLAHGSMVYLAYEGERTIRG 101


>gi|413922638|gb|AFW62570.1| hypothetical protein ZEAMMB73_980053 [Zea mays]
          Length = 502

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 69/121 (57%), Positives = 90/121 (74%), Gaps = 4/121 (3%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIYE 175
           +RVTRQEN  C + SFDRD A+AFQ YV E L F VKR GF+YG V +E K V V+FI E
Sbjct: 324 IRVTRQENALCTAASFDRDAANAFQLYVAEFLGFGVKRAGFLYGRVDVETKDVFVDFIDE 383

Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREVL 232
           PPQQG+E+V+++LRD +EE  VD IA GLGM++VGF+ TQ + +      +YT+SN+EV+
Sbjct: 384 PPQQGSEDVVHLLRDPDEEARVDTIAEGLGMRRVGFVLTQAVGRKASETGEYTMSNQEVM 443

Query: 233 Q 233
           Q
Sbjct: 444 Q 444



 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 65/152 (42%), Positives = 89/152 (58%), Gaps = 42/152 (27%)

Query: 179 QGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI---MQNKKDYTLSNREVLQAV 235
           QG+E+++++LRD +EE  VD IA GLGM++VG +FTQ +        +YT+SNRE     
Sbjct: 124 QGSEDMVHLLRDPDEEARVDTIAEGLGMRRVGLVFTQAVGGKASETGEYTMSNRE----- 178

Query: 236 EFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKL 295
                                             MS++CV+LFK+G  ETE+ + DDP+ 
Sbjct: 179 ----------------------------------MSEICVKLFKDGVLETEVQDADDPRQ 204

Query: 296 SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
           SKM+K+VV GGKD  EVDNDFFL++VKI DHQ
Sbjct: 205 SKMQKEVVAGGKDTMEVDNDFFLMLVKISDHQ 236


>gi|413922639|gb|AFW62571.1| hypothetical protein ZEAMMB73_980053 [Zea mays]
          Length = 198

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 69/121 (57%), Positives = 90/121 (74%), Gaps = 4/121 (3%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIYE 175
           +RVTRQEN  C + SFDRD A+AFQ YV E L F VKR GF+YG V +E K V V+FI E
Sbjct: 20  IRVTRQENALCTAASFDRDAANAFQLYVAEFLGFGVKRAGFLYGRVDVETKDVFVDFIDE 79

Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREVL 232
           PPQQG+E+V+++LRD +EE  VD IA GLGM++VGF+ TQ + +      +YT+SN+EV+
Sbjct: 80  PPQQGSEDVVHLLRDPDEEARVDTIAEGLGMRRVGFVLTQAVGRKASETGEYTMSNQEVM 139

Query: 233 Q 233
           Q
Sbjct: 140 Q 140


>gi|328872908|gb|EGG21275.1| nuclear protein localization 4 [Dictyostelium fasciculatum]
          Length = 570

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 91/285 (31%), Positives = 157/285 (55%), Gaps = 18/285 (6%)

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYE 175
           +M++  Q+  H   V+ + + A+ FQQY++    F  +R GF+YG  L+D  V V  IYE
Sbjct: 279 KMKLKAQDIAHAPGVAINFESANVFQQYISNK-KFEEQRCGFLYGNYLDDGSVVVECIYE 337

Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAV 235
           PPQ+G +    +L D+  +K V+++A+ LG+ +VG+IF+       + Y +S+ +++QA 
Sbjct: 338 PPQKGNKTNFTLLEDKFIDK-VESMASMLGLTRVGWIFSHP----SRKYIMSSTDIIQAA 392

Query: 236 EFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKL 295
            F  +   + +VT +  L ++ EG +  + EAFQ+SD  ++L K G F   +A  +DP  
Sbjct: 393 SFQNKYG-KSFVTLI--LTINSEGKS--NLEAFQVSDQALKLEKTGEF---LANQEDPTK 444

Query: 296 SKMKKDVVVGGKDVKEVDNDFFL--VVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLN 353
            K+K  V V G +    D +FF+  V VK  + +  L +TFP+ NR  + +   L S+  
Sbjct: 445 CKLKSPVFVEGSETNLADINFFIANVPVKAREDKPFLHTTFPVNNRIPETSNSDLVSYKI 504

Query: 354 RSPSLPLVKRISDFHLLLFL--ARFLDLNSDVPALAQCVQAQTAV 396
            +     +   SDFH L+FL      DL +D P + + ++++T  
Sbjct: 505 DNQDQSKLYFFSDFHFLIFLLINNIFDLKTDFPPICENIKSKTTT 549


>gi|66804083|ref|XP_635845.1| nuclear protein localization 4 [Dictyostelium discoideum AX4]
 gi|74851898|sp|Q54GD3.1|NPL4_DICDI RecName: Full=Nuclear protein localization protein 4 homolog
 gi|60464175|gb|EAL62335.1| nuclear protein localization 4 [Dictyostelium discoideum AX4]
          Length = 576

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 148/284 (52%), Gaps = 17/284 (5%)

Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYE 175
           +MR+  Q+NPH      D   A+ FQQY+  +  +  +R GF++G  L D  V V+ IYE
Sbjct: 282 KMRLKSQDNPHAPGALVDFQSANIFQQYIANS-KYEQQRIGFLFGNFLSDGSVVVDSIYE 340

Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAV 235
           PPQ+  ++    L  +     ++++A+ LG+ +VG+IF+       + YT+S+ E++QA 
Sbjct: 341 PPQECKDKQTPTLLPDPLADKIESMASMLGLTRVGWIFSHP----SRKYTMSSTEIIQAA 396

Query: 236 EFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKL 295
            +       ++  + V L +S       + EAFQ+SD  ++L K G F   +    DP  
Sbjct: 397 SYQ-----NKYGPSFVTLILSVNSDGQSNMEAFQVSDQALKLEKTGEF---LPTQPDPTK 448

Query: 296 SKMKKDVVVGGKDVKEVDNDFFLVVV--KILDHQGPLSSTFPIENRTTQVTMRALKSHLN 353
            K+K  V   G +    D  FF+V V  K  + +   + +FP+ENR    T+  L S+  
Sbjct: 449 CKLKSPVFEEGTETINADTHFFIVTVPLKAREDKSIFNISFPVENRIPVNTLSDLASYKL 508

Query: 354 RSPSLPLVKRISDFHLLLFL--ARFLDLNSDVPALAQCVQAQTA 395
               +  +K  SDFH L+FL   +FLD  SD P + + ++++++
Sbjct: 509 EHKDVSPLKFFSDFHFLIFLLENQFLDFQSDFPIICENIRSRSS 552


>gi|422293770|gb|EKU21070.1| nuclear protein localization 4-like protein [Nannochloropsis
           gaditana CCMP526]
          Length = 448

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 125/454 (27%), Positives = 197/454 (43%), Gaps = 75/454 (16%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           MLLR+RS+ G  RV       T+  L++ I S+         +S + + + +  P+    
Sbjct: 1   MLLRVRSKMGTWRVEGVEPGTTIKALRSRILSE-------HGVSPHPSQIFSLDPAGKRV 53

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSY---------------DGE-------RNVRGPSFN 98
             D    ++ + S N+ HG +V L+                DG           R   F 
Sbjct: 54  MDD----EQTVKSFNLQHGDLVHLAIREEDCAHEAGKKIAADGSIVNKTHLEASRVKGFR 109

Query: 99  PAGSFGRKMTM----------DDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETL 148
           P     R M M          D     +++  + E P C     +  C + FQ Y+   +
Sbjct: 110 PGMMSLRSMKMHWTLTDFVELDAQFEYKIKAQKNETPWCTKAIMEPICGNDFQSYMRR-V 168

Query: 149 AFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKK 208
            F   R G++YG   E K  +V  +YEPPQ+ T E   +L D   EK V+A+AA LG+ +
Sbjct: 169 GFNQCRMGYLYGQFTEAKEAKVEVLYEPPQESTHEGFELLEDPRAEK-VEALAAALGLVR 227

Query: 209 VGFIFTQTIMQNKKDYTLSNREVLQAV--EFHAECNMEE--WVTAVVKLEVSEEG----- 259
           VG+IF       ++ +  S  EV+QA   +  A   ++E  +VT  V L   E+      
Sbjct: 228 VGWIFAHP--PREEGFVFSGLEVMQAAMGQLEAADGVKETPFVTVRVSLAPVEKAGGEDG 285

Query: 260 --------SAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKD--VVVGGKDV 309
                   +   HF+AFQ+S  C+ +  EG  E     G +P    + +    +V GK  
Sbjct: 286 KEGGKEEVATVAHFDAFQVSLQCMEMVAEGAIEP----GPEPGAMLVNESFTAIVEGKPA 341

Query: 310 KEVDNDFFLVVVKILDHQGP-LSSTFPIENRTTQV-TMRALKSHLNRS--PSLPLVKRIS 365
             VD +FFLV V +   +   L+  FP  NR  ++ +  A+ + L  +       +  +S
Sbjct: 342 PTVDVNFFLVNVPVGSAESSLLTHRFPKANRLDELPSAAAMAAQLGEAGKAGWRFIDLMS 401

Query: 366 DFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEG 399
           DFHLLLFL  FL +  +V  L + V+ +  VP G
Sbjct: 402 DFHLLLFLTDFLGME-EVAVLCRSVREKETVPLG 434


>gi|219113663|ref|XP_002186415.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209583265|gb|ACI65885.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 473

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 140/478 (29%), Positives = 208/478 (43%), Gaps = 81/478 (16%)

Query: 1   MLLRIRSRDGLERVT-VDGAHVTVAE-LKTIIQSQ---------LRIPVESQTLST---- 45
           ML+R+RS  G+ RV  +D    TV + L  I +S+          R P   +TLST    
Sbjct: 1   MLIRVRSNVGVWRVDGLDAETATVTDVLAGIARSRPHVVYERPLARDPACHETLSTTETL 60

Query: 46  -----NQNLLLAKSPSDLLQFTDMANPDRPLSSLNISHGSIV-----------FLSYDG- 88
                +QN  +     D     D   P   +S   ++  + V            +  DG 
Sbjct: 61  RAQGLDQNGAMIHCRVDPTTCADWTAPAAEMSPSTMTGNATVEPKEQGQHFRRVIDKDGG 120

Query: 89  -------------ERNVRGPSFNPAGSFGRKMTMDDLIAK----QMRVTRQENPHCESVS 131
                        +R  R     P        T++D IA     + ++ RQE      VS
Sbjct: 121 IKLVPTKDVPTQQDRGFRK-GMLPLRDMKMSWTLNDFIALDSQFEFKIQRQEKATTSKVS 179

Query: 132 FDRDCADAFQQYVNETLAFAVKRGGFMYGT-VLEDK------RVEVNFIYEPPQQGTEEV 184
            D      FQ Y+ +   F  KR GF+YG  V ED+      +V V  IYEPPQ+   + 
Sbjct: 180 LDVPSISNFQTYL-QRFQFQRKRCGFLYGKFVKEDESDEKPTKVLVEAIYEPPQEINPDS 238

Query: 185 L--YILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAEC- 241
              + L D+ +E+ V+ +A  LG+++VG++F     Q    Y LS  E + A EF  E  
Sbjct: 239 AEGFTLEDDPQEEEVNQLAEWLGLQRVGWVFGHAPRQG---YVLSAAETILAAEFQLEAA 295

Query: 242 -NMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
             +EE     V +    +G  A+  EAFQ+S  C+ +  E   E    + D+P+L K+ +
Sbjct: 296 GGVEETPFCTVTVAPKLDGQVAV--EAFQVSQQCMAMVAE---EALQVDPDNPQLCKVNE 350

Query: 301 D--VVVGGKDVKEVDNDFFLVVVKILDHQG-PLSSTFPIENRT---TQVTMRALKSHLNR 354
               +  GK    V+  FFL VV I+ H      + FP  NR    T     AL   L +
Sbjct: 351 TFTAIQEGKASPTVETAFFLTVVPIVQHTSETFVADFPRANRDLDDTAPNKDALTRQLRQ 410

Query: 355 SPS--LPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVP--EGYKLLINSMA 408
           + S    LV R++DF+LL++L    D  SD P +   + A   +P  +GYKLLI S+A
Sbjct: 411 AGSNGWTLVDRLADFNLLIYLKSSFDFQSDFPKICAAI-ANRDIPLDDGYKLLIASLA 467


>gi|428186314|gb|EKX55164.1| hypothetical protein GUITHDRAFT_98945 [Guillardia theta CCMP2712]
          Length = 396

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 87/290 (30%), Positives = 150/290 (51%), Gaps = 32/290 (11%)

Query: 128 ESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYI 187
           +S+S +   A  FQ YV   L+F   R G+M+G V +++ V  + IYEPPQ+G EE   +
Sbjct: 127 QSLSVEFSAASQFQSYVVGDLSFKSHRVGYMFGHVDKNRNVFCDVIYEPPQRGDEEAYNL 186

Query: 188 LRDEEEEKLVDAIAAGLGMKKVGFIFT--QTIMQNKKDYTLSNREVLQAVEFHAECNMEE 245
              + +E+ V+ +A+ LGM+    +    Q  M+ K  Y  ++                 
Sbjct: 187 TEVDPDEEKVERLASLLGMRMDVMLACSLQHKMEKKHGYDFAS----------------S 230

Query: 246 WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK--LSKMKKDVV 303
            V+AV+      E + +I FEA+Q+SD+C+ +++       +   ++P    ++  +DV+
Sbjct: 231 MVSAVI---TRNETTDSIVFEAYQISDLCLDMYRRDI----LVPPNNPNRGYTRTVQDVL 283

Query: 304 VGGKDVKEVDNDFFLVVVKILDHQ----GPLSSTFPIENRTTQVTMRALKSHLNRS-PSL 358
           V  KD ++VDNDFF+  V I  H     G   + FP++NR  Q   R    H+  S   +
Sbjct: 284 VERKDTQKVDNDFFINTVPIKTHNSEIWGNREAFFPVKNRPGQDQNRMEVKHILLSHEEV 343

Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
           P   R+ DFH+LL+L+  LD  +D+P + + V+    + +G+  L+ S+A
Sbjct: 344 PFCIRMRDFHMLLYLSNMLDPLTDLPIICKSVREDVELQQGHIELVKSIA 393


>gi|291000524|ref|XP_002682829.1| NPL4 family protein [Naegleria gruberi]
 gi|284096457|gb|EFC50085.1| NPL4 family protein [Naegleria gruberi]
          Length = 435

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 113/441 (25%), Positives = 189/441 (42%), Gaps = 74/441 (16%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           +L+R R +DG  R+ +  A  TV ++K +I+ +  IP  S +++        K+P     
Sbjct: 2   LLIRARGKDGTFRLEL-SADQTVLDIKKLIEKETSIPHSSISVTK----FREKTP----- 51

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVR--------------GPSFNPAGSFGRK 106
              + N D    S  + HG ++   YD  ++                    +  G   RK
Sbjct: 52  ---LKNTDTVQGS-QLKHGDMIEFVYDTTQHKSKQEVEEEKKSQLRDSEDLDNLGLRERK 107

Query: 107 --MTMDDLI--AKQMRVT--RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYG 160
              T+ + +    Q+++T   Q+   C +   D+    +FQQ++ +T  F  +R  F+YG
Sbjct: 108 KHWTLSEYLDLVNQVKITIKHQKYAICSTAYLDQLVGQSFQQFLLQT-RFQFQRFAFLYG 166

Query: 161 TVLEDKR-------------------------VEVNFIYEPPQQGTEEVLYILRDEEEEK 195
              E K+                         V +  ++EP Q  T E    + D     
Sbjct: 167 ETKEGKKPQPTEEEKKKATKDENINVDDDYLDVYIKVLFEPKQDSTPEGFKQVEDTVFND 226

Query: 196 LVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEV 255
             D +A+ LGMKKVG+IF+      +++Y L   E+++A E+ ++     +  + V +  
Sbjct: 227 RADTLASMLGMKKVGWIFSH---DGQREYPLLGPEIIKAAEYQSK-----YGPSFVTITA 278

Query: 256 SEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDND 315
                  I FEA+Q+S  CV L+++G         ++ +    KK V +  K   ++D  
Sbjct: 279 CPNEIGEIQFEAYQVSKQCVELWEKGLLS---ENPENLEFLIAKKQVEIERKMTNDIDAL 335

Query: 316 FFLVVVKILDHQGPLSSTFPIENRTTQ---VTMRALKSHLNRSPSLPLVKRISDFHLLLF 372
             +  V I  H+      FP  NR        M  LKS L    S   V+R+SDF LLLF
Sbjct: 336 LLVSNVAIAPHKSSFQVGFPPRNRPGNEMFQNMVELKSVLLERRSQKFVQRVSDFDLLLF 395

Query: 373 LARFLDLNSDVPALAQCVQAQ 393
           L  +L L+SD P L + V +Q
Sbjct: 396 LTDYLSLSSDFPGLCEAVLSQ 416


>gi|361067653|gb|AEW08138.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
          Length = 85

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 54/86 (62%), Positives = 68/86 (79%), Gaps = 1/86 (1%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
          ML+R+RSRDGLERV VD  + TV +LK++IQSQL++P+ SQ LST+QNLLLAK   ++ Q
Sbjct: 1  MLIRVRSRDGLERVKVDKPNPTVGDLKSLIQSQLQVPISSQILSTSQNLLLAKG-GNIQQ 59

Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY 86
          F DM +P+  LSSL I HG IVFLSY
Sbjct: 60 FKDMEDPNMLLSSLGIDHGHIVFLSY 85


>gi|383167791|gb|AFG66945.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167793|gb|AFG66946.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167795|gb|AFG66947.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167797|gb|AFG66948.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167799|gb|AFG66949.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167801|gb|AFG66950.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167803|gb|AFG66951.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167805|gb|AFG66952.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167807|gb|AFG66953.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167809|gb|AFG66954.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167811|gb|AFG66955.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167813|gb|AFG66956.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167815|gb|AFG66957.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167817|gb|AFG66958.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167819|gb|AFG66959.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167821|gb|AFG66960.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167823|gb|AFG66961.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
 gi|383167825|gb|AFG66962.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
          Length = 85

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 54/86 (62%), Positives = 68/86 (79%), Gaps = 1/86 (1%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
          ML+R+RSRDGLERV VD  + TV +LK++IQSQL++P+ SQ LST+QNLLLAK   ++ Q
Sbjct: 1  MLIRVRSRDGLERVKVDKPNPTVGDLKSLIQSQLQVPISSQILSTSQNLLLAKG-GNVQQ 59

Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY 86
          F DM +P+  LSSL I HG IVFLSY
Sbjct: 60 FKDMEDPNMLLSSLGIDHGHIVFLSY 85


>gi|407044247|gb|EKE42468.1| NPL4 family protein [Entamoeba nuttalli P19]
          Length = 461

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 158/302 (52%), Gaps = 25/302 (8%)

Query: 95  PSFNPAGSFGRKMTMDDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKR 154
           P+  P G     +   D   +++R+ +QE   C+    +    D FQ+YV +  +F V R
Sbjct: 157 PNHGPHGCCTLCLAKWDY--EKIRIQKQEA-VCKLCRMNFMETDNFQRYVFQR-SFYVNR 212

Query: 155 GGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFT 214
            G +YGT  +D RV+V  IYEPPQ   EE  + ++ E+    V  +A  LG+++VG IF+
Sbjct: 213 IGILYGTFKQD-RVQVEAIYEPPQVINEEGDFEMKQEQTLLQVQQMAVELGLERVGIIFS 271

Query: 215 QTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
                 +++  L++RE++ A +   + + +  VT VV    +E GS +   E +Q S+  
Sbjct: 272 HA----RREEILTSREIISAAQRQLDWD-KRCVTVVVT--PNETGSES-KVECYQASEQL 323

Query: 275 VRLFKEGWFETEIAEGDDPKLSK--MKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
           + L K+      I +   PK ++  +K+ V +  K++  VD   F+  V I   +     
Sbjct: 324 MELVKKA-----IVQPSQPKNNEINLKESVYLSQKEIMTVDPIVFVCNVPIRAFKAKTMF 378

Query: 330 LSSTFPIENRTTQ-VTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQ 388
           LSS +PIENR  + VT+   K +     +LP+ ++ISDFHLL+F   +  + S + ++ +
Sbjct: 379 LSSKYPIENRPQEDVTLLQFKKYFESVKTLPIYQQISDFHLLVFCLDY-GVFSSLHSVKE 437

Query: 389 CV 390
           C+
Sbjct: 438 CI 439


>gi|145489333|ref|XP_001430669.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397768|emb|CAK63271.1| unnamed protein product [Paramecium tetraurelia]
          Length = 580

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 86/271 (31%), Positives = 139/271 (51%), Gaps = 36/271 (13%)

Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R GF+YG    D       R  V  IYEPPQ+GT   + +L+D  +    D IA  LGM
Sbjct: 306 QRMGFLYGYYAADPNYQNGVRAIVEAIYEPPQKGTFGHVDLLQDPAQTH-ADEIATNLGM 364

Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEF-------HAE-CNMEEWVTAVVKLEVSEE 258
           +K+G+IFT        D  LS+ E+ QA  +       H E C + +++T V  L   ++
Sbjct: 365 EKIGWIFTNV----NHDCFLSSEELRQAARYQQMHSIQHPEGCLVSKFLTVV--LRTKKD 418

Query: 259 GSAAIHFEAFQMSDMCVRLFKEGWFE-TEIAEGDDPKLSKMKKDV----VVGGKDVKEVD 313
               +  E + +SD   +L ++G F+ +E  +    + +K + DV    V  GK VKE +
Sbjct: 419 NPDEVVPEVYMVSDQGQQLEQDGIFQNSERRKVLQVREAKNELDVLPSFVYNGKSVKEFE 478

Query: 314 NDFFLVVVKILDHQGPLSST-------FPIENRTTQVTMRALKSHLNRSPSLPLVKRISD 366
            DFF  +V +     P S+        FP+ENR+     + LK+ L++    P   R +D
Sbjct: 479 PDFF--IVNVAHGYNPNSNYSILKLYDFPVENRSVVAKQQDLKTFLHKHQKKPSHHRFAD 536

Query: 367 FHLLLFLARFLDLNSDVPALAQCVQAQTAVP 397
           FHLLL+L++ +D++S +  +AQ + ++  VP
Sbjct: 537 FHLLLYLSKLIDIHS-IANIAQFIASECEVP 566


>gi|67480009|ref|XP_655374.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56472505|gb|EAL49987.1| hypothetical protein EHI_026450 [Entamoeba histolytica HM-1:IMSS]
 gi|449702247|gb|EMD42922.1| nuclear protein localization, putative [Entamoeba histolytica KU27]
          Length = 461

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 158/302 (52%), Gaps = 25/302 (8%)

Query: 95  PSFNPAGSFGRKMTMDDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKR 154
           P+  P G     +   D   +++R+ +QE   C+    +    D FQ+YV +  +F V R
Sbjct: 157 PNHGPHGCCTLCLAKWDY--EKVRIQKQEA-VCKLCRMNFMETDNFQRYVFQR-SFYVNR 212

Query: 155 GGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFT 214
            G +YGT  +D RV+V  IYEPPQ   EE  + ++ E+    V  +A  LG+++VG IF+
Sbjct: 213 IGILYGTFKQD-RVQVEAIYEPPQVINEEGDFEMKQEQTLLQVQQMAVELGLERVGIIFS 271

Query: 215 QTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
                 +++  L++RE++ A +   + + +  VT VV    +E GS +   E +Q S+  
Sbjct: 272 HA----RREEILTSREIISAAQRQLDWD-KRCVTVVVT--PNETGSES-KVECYQASEQL 323

Query: 275 VRLFKEGWFETEIAEGDDPKLSK--MKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
           + L K+      I +   PK ++  +K+ V +  K++  VD   F+  V I   +     
Sbjct: 324 MELVKKA-----IVQPSQPKNNEINLKESVYLSQKEIMTVDPIVFVCNVPIRAFKAKTMF 378

Query: 330 LSSTFPIENRTTQ-VTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQ 388
           LSS +PIENR  + VT+   K +     +LP+ ++ISDFHLL+F   +  + S + ++ +
Sbjct: 379 LSSKYPIENRPQEDVTLLQFKKYFESVKTLPIYQQISDFHLLVFCLDY-GVFSSLHSVKE 437

Query: 389 CV 390
           C+
Sbjct: 438 CI 439


>gi|145496668|ref|XP_001434324.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401449|emb|CAK66927.1| unnamed protein product [Paramecium tetraurelia]
          Length = 580

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 87/274 (31%), Positives = 139/274 (50%), Gaps = 42/274 (15%)

Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R GF+YG    D       R  V  IYEPPQ+GT   + +L+D  +    D IA  LGM
Sbjct: 306 QRMGFLYGYYAADPNYQNGVRAIVEAIYEPPQKGTFGHVDLLQDPAQSH-ADEIATSLGM 364

Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEF-------HAE-CNMEEWVTAVVKLEVSEE 258
           +K+G+IFT        D  LS+ E+ QA ++       H E C + +++T V  L   ++
Sbjct: 365 EKIGWIFTNV----NHDCFLSSEELRQAAQYQQIYSIKHPEGCLVSKFLTVV--LRTKKD 418

Query: 259 GSAAIHFEAFQMSDMCVRLFKEGWFET----EIAEGDDPKLSKMKKDV----VVGGKDVK 310
               +  E + +SD   +L  +G F+     ++ +  D   +K + DV    V  GK VK
Sbjct: 419 SPDEVVPEVYMVSDQGQQLENDGIFQNSGRRKVLQVRD---AKNELDVLPSFVYNGKSVK 475

Query: 311 EVDNDFFLVVVKILDHQGPLSST-------FPIENRTTQVTMRALKSHLNRSPSLPLVKR 363
           E + DFF  +V +     P S+        FP+ENR+     + LK+ L++    P   R
Sbjct: 476 EFEPDFF--IVNVAHGYNPNSNYSILKLYDFPVENRSAVAKQQDLKTFLHKHQKKPSHNR 533

Query: 364 ISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVP 397
            +DFHLLL+L++ +D++S V  +AQ + ++  VP
Sbjct: 534 FADFHLLLYLSKLIDIHS-VVNIAQFIGSECEVP 566


>gi|167539988|ref|XP_001741490.1| nuclear protein localization [Entamoeba dispar SAW760]
 gi|165893905|gb|EDR22020.1| nuclear protein localization, putative [Entamoeba dispar SAW760]
          Length = 463

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 83/259 (32%), Positives = 137/259 (52%), Gaps = 22/259 (8%)

Query: 138 DAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLV 197
           D FQ+YV++  +F V R G +YG   ED RV+V  IYEPPQ   EE  + ++ E+    V
Sbjct: 199 DNFQRYVSQR-SFYVNRIGILYGIFKED-RVQVEAIYEPPQVINEEGDFEMKQEQTLLQV 256

Query: 198 DAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSE 257
             +A  LG+++VG IF+      +++  L++RE++ A +   E + +  VT VV      
Sbjct: 257 QQMAIELGLERVGIIFSHA----RREEILTSREIIAAAQRQLEWD-KRCVTVVV---TPN 308

Query: 258 EGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSK--MKKDVVVGGKDVKEVDND 315
           E       E +Q S+  + L K+      I +   PK ++  +K+ V +  K++  VD  
Sbjct: 309 ETGIESKVECYQASEQLMELVKKA-----IVQPSQPKNNEINLKESVYLSQKEIMTVDPV 363

Query: 316 FFLVVVKILDHQGP---LSSTFPIENRTTQ-VTMRALKSHLNRSPSLPLVKRISDFHLLL 371
            F+  V I   +     LSS +PIENR  + VT+   K +     +LP+ ++ISDFHLL+
Sbjct: 364 VFVCNVPIRAFKAKTMFLSSKYPIENRPQEDVTLLQFKKYFESVKTLPIYQQISDFHLLV 423

Query: 372 FLARFLDLNSDVPALAQCV 390
           F   +  + S + ++ +CV
Sbjct: 424 FCLDY-GVFSSLHSVKECV 441


>gi|388490756|gb|AFK33444.1| unknown [Lotus japonicus]
          Length = 67

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 46/64 (71%), Positives = 57/64 (89%)

Query: 345 MRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLI 404
           MRALK+HL R+ +LP VKRI+DFHLLL LARFLDLN+DVPAL +C+Q +T VPEGY++LI
Sbjct: 1   MRALKNHLERTKNLPFVKRIADFHLLLVLARFLDLNADVPALTECIQTETTVPEGYQILI 60

Query: 405 NSMA 408
           +SMA
Sbjct: 61  DSMA 64


>gi|296203397|ref|XP_002748882.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Callithrix jacchus]
          Length = 608

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 93/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  GF+YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGFLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|237833025|ref|XP_002365810.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
 gi|211963474|gb|EEA98669.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
 gi|221488270|gb|EEE26484.1| nuclear protein localization, putative [Toxoplasma gondii GT1]
 gi|221508777|gb|EEE34346.1| nuclear protein localization, putative [Toxoplasma gondii VEG]
          Length = 502

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 85/303 (28%), Positives = 143/303 (47%), Gaps = 41/303 (13%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
           F ++  E L    +R G+M+G  +ED       R     IYEPPQ+ T   L + +D+EE
Sbjct: 207 FVRFWQEDLQMLQQRFGYMFGYYVEDPHYPDGIRAVCEAIYEPPQENTLTSLNVKKDDEE 266

Query: 194 EKLVDAIAAGLGMKKVGFIFTQT----IMQNKKDYTLSNREVLQAVEFH-AECNMEEWVT 248
            K+ + IA  LG++ +G IFT      ++ + +   L+  ++ + V  H     + ++V 
Sbjct: 267 VKVAEKIADRLGLELIGCIFTHAPREELLTSHEVVDLAKTQLSRQVSTHFTGYPVSKFVC 326

Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDD-------PKLSKMKKD 301
             V L+ S     A+   AF +SDM V L ++G       + DD       P+  ++   
Sbjct: 327 CTVSLDKSTRDGEAVP-NAFMVSDMGVALVRDGVVSE--TQPDDTHIQLRSPEKGELLPQ 383

Query: 302 VVVGGKDVKEVDNDFFLVVVKILDHQGP-------LSSTFPIENRTTQVTMRALKSHLNR 354
           V+  G++    D  +F+V V   +   P        SS+FP  NR    T + +  HL R
Sbjct: 384 VLESGRETTRFDASWFIVRV---NESAPKKVRSFFCSSSFPRANRLVAQTPKDITDHLTR 440

Query: 355 SPSL----PLVK-----RISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLIN 405
             +L    P+ K     R +DFHLLL++A+  DL++   ++  CV+ +  V EG +  + 
Sbjct: 441 VAALAGPSPVAKKENWRRFADFHLLLYVAKLFDLDTAF-SICDCVRNRQPVDEGLEDTLK 499

Query: 406 SMA 408
           S  
Sbjct: 500 SFG 502


>gi|355707795|gb|AES03067.1| nuclear protein localization 4-like protein [Mustela putorius furo]
          Length = 550

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 208 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 264

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 265 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 323

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 324 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 379

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 380 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 436

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       S FPIENR     TQ    +L ++L+++ S   
Sbjct: 437 PLPVEYLIIDITTTFPKDPVHTFSISQSPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 495

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 496 LDTISDFHLLLFL 508


>gi|147898562|ref|NP_001080427.1| nuclear protein localization 4 homolog [Xenopus laevis]
 gi|33585642|gb|AAH56013.1| Npl4 protein [Xenopus laevis]
          Length = 610

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 90/309 (29%), Positives = 149/309 (48%), Gaps = 60/309 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +R G++YG   E K      R E
Sbjct: 215 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQRIGYLYGRYTEHKDIPLGLRAE 271

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D+ + K+VD IAA LG++KVG+IFT  + +          
Sbjct: 272 VAAIYEPPQIGTQNSLQLL-DDPKAKVVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 330

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
            NK  Y LS  E + A  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 331 RNKDTYFLSAEECITAGYFQNKYPNLCRLSPDAHFGSKFVTVVATGGPDNQVHFEGYQVS 390

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
           + C+ L ++   E  +   D P+L   K             KD+   G ++ ++      
Sbjct: 391 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 447

Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
           ++ ++ +     + P+      S+ FPIENR     TQ    +L ++L+++ S   ++ I
Sbjct: 448 EYLIIDITTTFPKDPVYTFSTSSNLFPIENRDVLGETQ-DFHSLATYLSQNLSSVFLEII 506

Query: 365 SDFHLLLFL 373
           SDFHLLLFL
Sbjct: 507 SDFHLLLFL 515


>gi|401408699|ref|XP_003883798.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325118215|emb|CBZ53766.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 504

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 84/305 (27%), Positives = 140/305 (45%), Gaps = 45/305 (14%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
           F Q+  + L    +R G+M+G  +ED       R     IYEPPQ+ T     + +D+EE
Sbjct: 209 FVQFWQQDLQMLQQRFGYMFGYYVEDPHYPDGIRAVCEAIYEPPQENTLTSTNVKKDDEE 268

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQ------AVEFHAECN---ME 244
            K+ + IA  LG++ +G IFT       ++  L++ EV+       A +F        M 
Sbjct: 269 VKMAEKIAERLGLEPIGCIFTHA----PREELLTSHEVVDLAKTQLARQFSTHFTGYPMS 324

Query: 245 EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD-----DPKLSKMK 299
           ++V   V L+ S     A+   AF +SD+ V L ++G    +  +        P+  ++ 
Sbjct: 325 KYVCCTVSLDKSTRDGEAVP-NAFMVSDLGVALVRDGVICDKQPDNTHIQLRSPEKGELL 383

Query: 300 KDVVVGGKDVKEVDNDFFLVVVKILDHQGP-------LSSTFPIENRTTQVTMRALKSHL 352
             V+  G++    D  +F+V V   +   P        SSTFP  NR    T + +  HL
Sbjct: 384 PQVLESGRETTRFDASWFIVRV---NESAPRKVRSFFCSSTFPRANRLVHQTPKDITDHL 440

Query: 353 NRSPSL----PLVK-----RISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLL 403
            R  +     P  K     R +DFHLLL++A+  DL++   ++  CV+++  V EG +  
Sbjct: 441 TRGAACAGSGPQAKKENWRRFADFHLLLYVAKLFDLDTAF-SICDCVRSRQPVDEGLEDT 499

Query: 404 INSMA 408
           + S  
Sbjct: 500 LKSFG 504


>gi|351706408|gb|EHB09327.1| Nuclear protein localization protein 4-like protein [Heterocephalus
           glaber]
          Length = 658

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 227 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 283

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  I +          
Sbjct: 284 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLISEDTRKGTVRYS 342

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 343 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 398

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 399 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 455

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 456 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 514

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 515 LDTISDFHLLLFL 527


>gi|7959259|dbj|BAA96023.1| KIAA1499 protein [Homo sapiens]
          Length = 660

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 256 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 312

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 313 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 371

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 372 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 427

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 428 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 484

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 485 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 543

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 544 LDTISDFHLLLFL 556


>gi|7020902|dbj|BAA91314.1| unnamed protein product [Homo sapiens]
          Length = 608

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 150/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     ++ G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQQFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|410982004|ref|XP_003997354.1| PREDICTED: nuclear protein localization protein 4 homolog [Felis
           catus]
          Length = 755

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 360 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 416

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 417 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 475

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 476 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 531

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 532 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 588

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 589 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 647

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 648 LDTISDFHLLLFL 660


>gi|332251481|ref|XP_003274874.1| PREDICTED: nuclear protein localization protein 4 homolog [Nomascus
           leucogenys]
          Length = 607

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|431908659|gb|ELK12251.1| Nuclear protein localization protein 4 like protein [Pteropus
           alecto]
          Length = 565

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 148/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +   T     +  G++YG   E K      R E
Sbjct: 170 ITLNRQKYRHVDNIMFENHTVADRFLDFWRRT---GSQHLGYLYGRYTEHKDIPLGVRAE 226

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 227 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 285

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS  E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 286 RNKDTYFLSAEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 341

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L  +K             KD+   G ++ ++  
Sbjct: 342 YQVSNQCMALVRD---ECLLPCKDAPELGYVKESSSEQYVPDVFYKDLDKFGNEITQLAR 398

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L++S S   
Sbjct: 399 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQSTSAVF 457

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 458 LDTISDFHLLLFL 470


>gi|332849283|ref|XP_003315818.1| PREDICTED: nuclear protein localization protein 4 homolog [Pan
           troglodytes]
 gi|410227436|gb|JAA10937.1| nuclear protein localization 4 homolog [Pan troglodytes]
 gi|410256178|gb|JAA16056.1| nuclear protein localization 4 homolog [Pan troglodytes]
 gi|410305070|gb|JAA31135.1| nuclear protein localization 4 homolog [Pan troglodytes]
 gi|410353911|gb|JAA43559.1| nuclear protein localization 4 homolog [Pan troglodytes]
          Length = 608

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|297702026|ref|XP_002827994.1| PREDICTED: nuclear protein localization protein 4 homolog [Pongo
           abelii]
          Length = 608

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|148702798|gb|EDL34745.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
           CRA_c [Mus musculus]
          Length = 549

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 154 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 210

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 211 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 269

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 270 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 325

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 326 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 382

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 383 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 441

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 442 LDTISDFHLLLFL 454


>gi|380814530|gb|AFE79139.1| nuclear protein localization protein 4 homolog [Macaca mulatta]
 gi|383409511|gb|AFH27969.1| nuclear protein localization protein 4 homolog [Macaca mulatta]
 gi|384948116|gb|AFI37663.1| nuclear protein localization protein 4 homolog [Macaca mulatta]
          Length = 608

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|194377504|dbj|BAG57700.1| unnamed protein product [Homo sapiens]
          Length = 456

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 52  ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 108

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 109 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 167

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 168 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 223

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 224 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 280

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 281 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 339

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 340 LDTISDFHLLLFL 352


>gi|157426879|ref|NP_060391.2| nuclear protein localization protein 4 homolog [Homo sapiens]
 gi|50428974|sp|Q8TAT6.3|NPL4_HUMAN RecName: Full=Nuclear protein localization protein 4 homolog;
           Short=Protein NPL4
 gi|19683967|gb|AAH25930.1| Nuclear protein localization 4 homolog (S. cerevisiae) [Homo
           sapiens]
 gi|119610067|gb|EAW89661.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
           CRA_a [Homo sapiens]
 gi|119610068|gb|EAW89662.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
           CRA_a [Homo sapiens]
 gi|123997361|gb|ABM86282.1| nuclear protein localization 4 homolog (S. cerevisiae) [synthetic
           construct]
          Length = 608

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|119610069|gb|EAW89663.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
           CRA_b [Homo sapiens]
 gi|208967577|dbj|BAG72434.1| nuclear protein localization 4 homolog [synthetic construct]
          Length = 617

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|73964791|ref|XP_540482.2| PREDICTED: nuclear protein localization protein 4 homolog [Canis
           lupus familiaris]
          Length = 615

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 220 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 276

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 277 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 335

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 336 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 391

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 392 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 448

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 449 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 507

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 508 LDTISDFHLLLFL 520


>gi|37360414|dbj|BAC98185.1| mKIAA1499 protein [Mus musculus]
 gi|148702797|gb|EDL34744.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
           CRA_b [Mus musculus]
          Length = 662

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 267 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 323

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 324 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 382

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 383 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 438

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 439 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 495

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 496 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 554

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 555 LDTISDFHLLLFL 567


>gi|10435132|dbj|BAB14499.1| unnamed protein product [Homo sapiens]
          Length = 520

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 125 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GSQHFGYLYGRYTEHKDIPLGIRAE 181

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 182 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 240

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 241 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 296

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 297 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 353

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 354 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 412

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 413 LDTISDFHLLLFL 425


>gi|303324586|ref|NP_001181952.1| nuclear protein localization protein 4 homolog isoform A [Mus
           musculus]
 gi|46397631|sp|P60670.3|NPL4_MOUSE RecName: Full=Nuclear protein localization protein 4 homolog;
           Short=Protein NPL4
          Length = 608

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|74207973|dbj|BAE29105.1| unnamed protein product [Mus musculus]
          Length = 608

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|348558274|ref|XP_003464943.1| PREDICTED: nuclear protein localization protein 4 homolog [Cavia
           porcellus]
          Length = 668

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 273 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 329

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 330 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 388

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 389 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 444

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 445 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 501

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 502 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 560

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 561 LDTISDFHLLLFL 573


>gi|335297237|ref|XP_003357980.1| PREDICTED: nuclear protein localization protein 4 homolog [Sus
           scrofa]
          Length = 564

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 169 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 225

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG+K+VG+IFT  + +          
Sbjct: 226 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLKEVGWIFTDLVSEDARKGTVRYS 284

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 285 RNKDTYFLSSEECITAGDFQNKHPNVCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 340

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 341 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 397

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L++S S   
Sbjct: 398 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQSTSSVF 456

Query: 361 VKRISDFHLLLFL 373
           +  +SDFHLLLFL
Sbjct: 457 LDTVSDFHLLLFL 469


>gi|354469126|ref|XP_003496981.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Cricetulus griseus]
          Length = 689

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 294 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 350

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 351 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 409

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 410 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 465

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 466 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 522

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 523 PLPVEYLIIDITTTFPKDPVFTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNNSSVF 581

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 582 LDTISDFHLLLFL 594


>gi|327264572|ref|XP_003217087.1| PREDICTED: nuclear protein localization protein 4 homolog [Anolis
           carolinensis]
          Length = 609

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 89/309 (28%), Positives = 145/309 (46%), Gaps = 60/309 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +  G++YG   E K      R E
Sbjct: 214 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLAIRAE 270

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 271 VAAIYEPPQIGTQNTLELLEDSKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 329

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
            NK  Y LS  E + A  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 330 RNKDSYFLSAEECITAGYFQNQQPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 389

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDN 314
           + C+ L ++   E  +   D P+L   K+        DV   G+D          + +  
Sbjct: 390 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKGRDKFGNEITQLARPLPV 446

Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
           ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   +  I
Sbjct: 447 EYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDII 505

Query: 365 SDFHLLLFL 373
           SDFHLLLFL
Sbjct: 506 SDFHLLLFL 514


>gi|340501211|gb|EGR28019.1| npl4 family protein, putative [Ichthyophthirius multifiliis]
          Length = 495

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 86/284 (30%), Positives = 140/284 (49%), Gaps = 43/284 (15%)

Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           ++ GF+YG   ED       R  +  IYEPPQ G      +L DE E+  VD IA  L +
Sbjct: 223 QKMGFLYGYYAEDPNYKGGVRAIIEAIYEPPQVGDISGFKLLYDEYEQ-YVDQIAEALTL 281

Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNM--------EEWVTAVVKLEVSEE 258
           +K+G+ FT TI     D  L + E+  A ++  +  +          +++ V  L   ++
Sbjct: 282 EKIGWAFT-TI---NHDTFLCSHEIRMAAKYQEQYRVLHESGYPFSTFISVV--LRTGKD 335

Query: 259 GSAAIHFEAFQMSDMCVRLFKEGWF-ETE---IAEGDDP-KLSKMKKDVVVGGKDVKEVD 313
             + +  E + +SD    + K   F E+E     +  DP   + +   ++VGGK VKE +
Sbjct: 336 NPSEVRPEVYMVSDQAQIMEKNNIFGESERRKFMQIRDPVNDTDVLPQIIVGGKTVKEFE 395

Query: 314 NDFFLVVV----------KILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKR 363
            D+F+V V           I+ H     + FP+ NR  Q  M  +K +L +  +    ++
Sbjct: 396 PDYFIVNVAHGHSNHNKFSIIKH-----ADFPVANRQQQSPME-VKKYLQKFKNQKSFEK 449

Query: 364 ISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSM 407
            SDFHLLL+LA+ +DL + +  +AQ V  +T +PEG + LIN +
Sbjct: 450 YSDFHLLLYLAKLVDLQTAL-VIAQSVAQETDIPEGCENLINQI 492


>gi|440897699|gb|ELR49339.1| hypothetical protein M91_17353, partial [Bos grunniens mutus]
          Length = 603

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 148/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 208 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 264

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 265 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 323

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 324 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 379

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 380 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 436

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+ + S   
Sbjct: 437 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSHNTSSVF 495

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 496 LDTISDFHLLLFL 508


>gi|300794221|ref|NP_001179118.1| nuclear protein localization protein 4 homolog [Bos taurus]
 gi|296476170|tpg|DAA18285.1| TPA: nuclear protein localization 4 homolog [Bos taurus]
          Length = 608

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 148/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+ + S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSHNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|426239181|ref|XP_004013504.1| PREDICTED: nuclear protein localization protein 4 homolog [Ovis
           aries]
          Length = 614

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 219 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 275

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 276 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 334

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 335 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 390

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 391 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 447

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 448 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 506

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 507 LDTISDFHLLLFL 519


>gi|10434779|dbj|BAB14372.1| unnamed protein product [Homo sapiens]
          Length = 520

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 125 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 181

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 182 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 240

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 241 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 296

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 297 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 353

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 354 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 412

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 413 LDTISDFHLLLFL 425


>gi|397522213|ref|XP_003831172.1| PREDICTED: nuclear protein localization protein 4 homolog [Pan
           paniscus]
          Length = 520

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 125 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 181

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 182 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 240

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 241 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 296

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 297 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 353

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 354 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 412

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 413 LDTISDFHLLLFL 425


>gi|395533255|ref|XP_003768676.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Sarcophilus harrisii]
          Length = 730

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 92/311 (29%), Positives = 147/311 (47%), Gaps = 68/311 (21%)

Query: 119 VTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVN 171
           + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R EV 
Sbjct: 337 LNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAEVA 393

Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ-----------N 220
            IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +           N
Sbjct: 394 AIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRN 452

Query: 221 KKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEAFQ 269
           K  Y LS+ E + A  F  +    C +        ++VTAV       +    +HFE +Q
Sbjct: 453 KDTYFLSSEECITAGNFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQ 508

Query: 270 MSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN-- 314
           +S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++    
Sbjct: 509 VSNQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPL 565

Query: 315 --DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVK 362
             ++ ++ +     + P+       S FPIENR     TQ    +L ++L+++ S   + 
Sbjct: 566 PVEYLIIDITTTFPKDPVYTFSISQSPFPIENRDVLGETQ-DFHSLATYLSQNTSSIFLD 624

Query: 363 RISDFHLLLFL 373
            ISDFHLLLFL
Sbjct: 625 IISDFHLLLFL 635


>gi|21739895|emb|CAD38971.1| hypothetical protein [Homo sapiens]
          Length = 519

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 124 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 180

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 181 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 239

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 240 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 295

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ ++  
Sbjct: 296 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 352

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 353 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 411

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 412 LDTISDFHLLLFL 424


>gi|348522316|ref|XP_003448671.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Oreochromis niloticus]
          Length = 624

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 95/308 (30%), Positives = 140/308 (45%), Gaps = 59/308 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +R G++YG   E K      R E
Sbjct: 229 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GSQRMGYLYGRYTEHKDIPLGIRAE 285

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ  T+  L +L D +    VD IAA LGM KVG+IFT  + +          
Sbjct: 286 VAAIYEPPQNATQNSLELLEDPKAAA-VDEIAAKLGMCKVGWIFTDLLSEDTRIGTVRYS 344

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
            N+  Y LS  E + A  F  E    C +     + +  V +  +      +HFE +Q+S
Sbjct: 345 RNQDSYYLSAEECITAGYFQNEHSNPCRLSRDGHFGSKFVTVVATGGPDNQVHFEGYQVS 404

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDVKEVDNDFF------ 317
           + C+ L ++   E  +   D P+L   K+        DV    KD    D  F       
Sbjct: 405 NQCMALVRD---ECLLPCKDAPELGYAKESSPEQYVPDVFYKDKDKFGNDVTFLARPLPV 461

Query: 318 -LVVVKIL-----DHQGPLSST--FPIENR----TTQVTMRALKSHLNRSPSLPLVKRIS 365
             +++ I      D Q   SST  FPIENR     TQ    +L ++L++  S   +  +S
Sbjct: 462 EYLIIDITTTFPKDPQYTFSSTQRFPIENRDILGETQ-DFHSLATYLSQCTSTSFLNIVS 520

Query: 366 DFHLLLFL 373
           DFHLLLFL
Sbjct: 521 DFHLLLFL 528


>gi|344291384|ref|XP_003417415.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Loxodonta africana]
          Length = 609

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E +      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHRDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPLSS------TFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+ +       FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVHTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSMF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|149055014|gb|EDM06831.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
           CRA_b [Rattus norvegicus]
          Length = 549

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 154 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 210

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IA+ LG++KVG+IFT  + +          
Sbjct: 211 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIASKLGLRKVGWIFTDLVSEDTRKGTVRYS 269

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 270 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 325

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 326 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 382

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 383 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 441

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 442 LDTISDFHLLLFL 454


>gi|301754217|ref|XP_002912994.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Ailuropoda melanoleuca]
          Length = 605

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 210 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 266

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 267 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 325

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 326 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 381

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 382 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 438

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 439 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 497

Query: 361 VKRISDFHLLLFL 373
           +  +SDFHLLLFL
Sbjct: 498 LDTVSDFHLLLFL 510


>gi|281348660|gb|EFB24244.1| hypothetical protein PANDA_000715 [Ailuropoda melanoleuca]
          Length = 559

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 208 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 264

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 265 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 323

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 324 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 379

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 380 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 436

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 437 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 495

Query: 361 VKRISDFHLLLFL 373
           +  +SDFHLLLFL
Sbjct: 496 LDTVSDFHLLLFL 508


>gi|18034779|ref|NP_542144.1| nuclear protein localization protein 4 homolog [Rattus norvegicus]
 gi|48429023|sp|Q9ES54.3|NPL4_RAT RecName: Full=Nuclear protein localization protein 4 homolog;
           Short=Protein NPL4
 gi|11037252|gb|AAG27534.1|AF234600_1 NPL4 [Rattus norvegicus]
 gi|74353675|gb|AAI01888.1| Nuclear protein localization 4 homolog (S. cerevisiae) [Rattus
           norvegicus]
 gi|149055013|gb|EDM06830.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
           CRA_a [Rattus norvegicus]
          Length = 608

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IA+ LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIASKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|417411892|gb|JAA52365.1| Putative nuclear pore complex rnpl4 component sc npl4, partial
           [Desmodus rotundus]
          Length = 603

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 147/313 (46%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 208 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 264

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 265 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 323

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS  E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 324 RNKDTYFLSAEECITAGDFQNKHPNVCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 379

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD    G ++ ++  
Sbjct: 380 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDTDKFGNEITQLAR 436

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       S FPIENR     TQ    +L ++L+++ S   
Sbjct: 437 PLPVEYLIIDITTTFPKDPVHTFSTSQSPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 495

Query: 361 VKRISDFHLLLFL 373
           +  +SDFHLLLFL
Sbjct: 496 LDTVSDFHLLLFL 508


>gi|148702796|gb|EDL34743.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
           CRA_a [Mus musculus]
          Length = 627

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 138/286 (48%), Gaps = 46/286 (16%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 264 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 320

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 321 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 379

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 380 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 435

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
           +Q+S+ C+ L ++   E  +   D P+L   K+         + V + F+  + K  +  
Sbjct: 436 YQVSNQCMALVRD---ECLLPCKDAPELGYAKES-----SSEQYVPDVFYKDIDKFGNEI 487

Query: 328 GPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
             L+   P+E         +L ++L+++ S   +  ISDFHLLLFL
Sbjct: 488 TQLARPLPVEYLIID-DFHSLATYLSQNTSSVFLDTISDFHLLLFL 532


>gi|149723455|ref|XP_001489914.1| PREDICTED: nuclear protein localization protein 4 homolog [Equus
           caballus]
          Length = 615

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 220 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 276

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + + +VD IAA LG++KVG+IFT  + +          
Sbjct: 277 VAAIYEPPQIGTQNSLELLEDPKAD-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 335

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 336 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 391

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 392 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 448

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 449 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 507

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 508 LDTISDFHLLLFL 520


>gi|41054974|ref|NP_955763.1| nuclear protein localization protein 4 homolog isoform B [Mus
           musculus]
 gi|40787841|gb|AAH65156.1| Nuclear protein localization 4 homolog (S. cerevisiae) [Mus
           musculus]
          Length = 576

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 138/286 (48%), Gaps = 46/286 (16%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
           +Q+S+ C+ L ++   E  +   D P+L   K+         + V + F+  + K  +  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKES-----SSEQYVPDVFYKDIDKFGNEI 436

Query: 328 GPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
             L+   P+E         +L ++L+++ S   +  ISDFHLLLFL
Sbjct: 437 TQLARPLPVEYLIID-DFHSLATYLSQNTSSVFLDTISDFHLLLFL 481


>gi|363740846|ref|XP_415704.3| PREDICTED: nuclear protein localization protein 4 homolog [Gallus
           gallus]
          Length = 609

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 145/309 (46%), Gaps = 60/309 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +  G++YG   E K      R E
Sbjct: 214 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHLGYLYGRYTEHKDIPLGIRAE 270

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L IL D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 271 VAAIYEPPQIGTQNSLEILEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 329

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
            NK  Y LS  E + A  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 330 RNKDTYYLSAEECITAGNFQNQHPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 389

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
           + C+ L ++   E  +   D P+L   K             KD+   G ++ ++      
Sbjct: 390 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 446

Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
           ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   +  +
Sbjct: 447 EYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDIV 505

Query: 365 SDFHLLLFL 373
           SDFHLLLFL
Sbjct: 506 SDFHLLLFL 514


>gi|334322635|ref|XP_001379846.2| PREDICTED: nuclear protein localization protein 4 homolog isoform 1
           [Monodelphis domestica]
          Length = 608

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 147/313 (46%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS  E + A  F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSAEECITAGNFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSIF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDIISDFHLLLFL 513


>gi|402901341|ref|XP_003913609.1| PREDICTED: nuclear protein localization protein 4 homolog [Papio
           anubis]
          Length = 608

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 93/317 (29%), Positives = 149/317 (47%), Gaps = 76/317 (23%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRG----GFMYGTVLEDK----- 166
           + + RQ+  H +++ F +   AD F       L    ++G    G++YG   E K     
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRF-------LTSGERQGTQHFGYLYGRYTEHKDIPLG 265

Query: 167 -RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ------ 219
            R EV  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +      
Sbjct: 266 IRAEVAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGT 324

Query: 220 -----NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAI 263
                NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +
Sbjct: 325 VRYSRNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----V 380

Query: 264 HFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVK 310
           HFE +Q+S+ C+ L ++   E  +   D P+L   K             KDV   G ++ 
Sbjct: 381 HFEGYQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEIT 437

Query: 311 EVDN----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSP 356
           ++      ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ 
Sbjct: 438 QLARPLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNT 496

Query: 357 SLPLVKRISDFHLLLFL 373
           S   +  ISDFHLLLFL
Sbjct: 497 SSVFLDTISDFHLLLFL 513


>gi|387017400|gb|AFJ50818.1| Nuclear protein localization protein 4-like protein [Crotalus
           adamanteus]
          Length = 609

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 87/309 (28%), Positives = 145/309 (46%), Gaps = 60/309 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +  G++YG   E K      R E
Sbjct: 214 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 270

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V+ IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 271 VSAIYEPPQIGTQNSLELLEDSKAE-IVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 329

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
            NK  Y LS  E + A  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 330 RNKDAYFLSAEECITAGHFQNQQPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 389

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
           + C+ L ++   E  +   D P+L   K             KD+   G ++ ++      
Sbjct: 390 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 446

Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
           ++ ++ +     + P+       + FPIENR     TQ     L ++L+++ S   +  +
Sbjct: 447 EYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHILATYLSQNTSSVFLDIV 505

Query: 365 SDFHLLLFL 373
           SDFHLLLFL
Sbjct: 506 SDFHLLLFL 514


>gi|291415843|ref|XP_002724159.1| PREDICTED: nuclear protein localization 4, partial [Oryctolagus
           cuniculus]
          Length = 737

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 83/284 (29%), Positives = 137/284 (48%), Gaps = 46/284 (16%)

Query: 119 VTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVN 171
           + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R EV 
Sbjct: 376 LNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAEVA 432

Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ-----------N 220
            IYEPPQ GT+  L +L D + + +VD IAA LG++KVG+IFT  + +           N
Sbjct: 433 AIYEPPQIGTQNSLELLEDPKAD-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRN 491

Query: 221 KKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEAFQ 269
           K  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE +Q
Sbjct: 492 KDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQ 547

Query: 270 MSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP 329
           +S+ C+ L ++   E  +   D P+L   K+         + V + F+  + K  +    
Sbjct: 548 VSNQCMALVRD---ECLLPCKDAPELGYAKES-----SSEQYVPDVFYKDIDKFGNEITQ 599

Query: 330 LSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
           L+   P+E         +L ++L+++ S   +  +SDFHLLLFL
Sbjct: 600 LARPLPVEYLIID-DFHSLAAYLSQNTSSMFLDTVSDFHLLLFL 642


>gi|74208119|dbj|BAE29162.1| unnamed protein product [Mus musculus]
          Length = 608

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 90/313 (28%), Positives = 148/313 (47%), Gaps = 68/313 (21%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  I EPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAICEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
           +Q+S+ C+ L ++   E  +   D P+L   K             KD+   G ++ ++  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441

Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
               ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++ S   
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500

Query: 361 VKRISDFHLLLFL 373
           +  ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513


>gi|223996267|ref|XP_002287807.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976923|gb|EED95250.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 511

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 76/252 (30%), Positives = 126/252 (50%), Gaps = 17/252 (6%)

Query: 168 VEVNFIYEPPQQGTEEVL--YILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT 225
           V V  IYEPPQ+   E    +I+ D+  E+ V+ +   LG+ KVG+I        ++ + 
Sbjct: 260 VIVEAIYEPPQEPDSEAAEGFIILDDPMEENVEELVKMLGLVKVGWIVGHP--PREEGFQ 317

Query: 226 LSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET 285
           LS+ EV+ A E   E       T  V + V+ +    +  EAFQ+S  C+ +  E   + 
Sbjct: 318 LSSAEVIMAAELQLESAGGIEATPFVTVRVTVDDDGNVVVEAFQVSLQCMEMVAEEALDV 377

Query: 286 EIAEGDDPKLSKMKKD--VVVGGKDVKEVDNDFFLVVVKILDHQGPL-SSTFPIENRT-- 340
               G +P    +      +  GK    V+N+FFL +V I  +Q  +  S FP  NR   
Sbjct: 378 ----GPNPGFCYVNDTFTAIQEGKPSATVENNFFLTLVPINGYQSEMFVSMFPKANRAFD 433

Query: 341 -TQVTMRALKSHLNRSPS--LPLVKRISDFHLLLFLARFLDLNSDVPALAQCV-QAQTAV 396
               T   +K  L++S +     +  +SDF LLL+L +FLD+ +D+P +   V + +  +
Sbjct: 434 DRAQTHDEMKRQLSKSGTAGWSFIDLLSDFGLLLYLTQFLDMKTDMPKICASVKEREIPL 493

Query: 397 PEGYKLLINSMA 408
            +GYK++I+S+A
Sbjct: 494 DDGYKIIISSLA 505


>gi|326930855|ref|XP_003211554.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Meleagris gallopavo]
          Length = 621

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 87/309 (28%), Positives = 144/309 (46%), Gaps = 60/309 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +  G++YG   E K      R E
Sbjct: 226 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHLGYLYGRYTEHKDIPLGIRAE 282

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L IL D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 283 VAAIYEPPQIGTQNSLEILEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 341

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
            NK  Y LS  E + A  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 342 RNKDTYYLSAEECITAGNFQNQHPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 401

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
           + C+ L ++   E  +   D P+L   K             KD+   G ++ ++      
Sbjct: 402 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 458

Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
           ++ ++ +     + P+       + FPIEN      TQ    +L ++L+++ S   +  +
Sbjct: 459 EYLIIDITTTFPKDPVYTFSISQNPFPIENHDVLGETQ-DFHSLATYLSQNTSSVFLDIV 517

Query: 365 SDFHLLLFL 373
           SDFHLLLFL
Sbjct: 518 SDFHLLLFL 526


>gi|334322637|ref|XP_003340282.1| PREDICTED: nuclear protein localization protein 4 homolog isoform 2
           [Monodelphis domestica]
          Length = 576

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 136/286 (47%), Gaps = 46/286 (16%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS  E + A  F  +    C +        ++VTAV       +    +HFE 
Sbjct: 329 RNKDTYFLSAEECITAGNFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
           +Q+S+ C+ L ++   E  +   D P+L   K+         + V + F+  + K  +  
Sbjct: 385 YQVSNQCMALVRD---ECLLPCRDAPELGYAKES-----SSEQYVPDVFYKDIDKFGNEI 436

Query: 328 GPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
             L+   P+E         +L ++L+++ S   +  ISDFHLLLFL
Sbjct: 437 TQLARPLPVEYLIID-DFHSLATYLSQNTSSIFLDIISDFHLLLFL 481


>gi|213983169|ref|NP_001135712.1| nuclear protein localization 4 homolog [Xenopus (Silurana)
           tropicalis]
 gi|197246525|gb|AAI69147.1| Unknown (protein for MGC:189610) [Xenopus (Silurana) tropicalis]
          Length = 610

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 148/309 (47%), Gaps = 60/309 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +R G++YG   E K      R E
Sbjct: 215 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQRIGYLYGRYTEHKDIPLGLRAE 271

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D+ + K+VD IAA LG++KVG+IFT  + +          
Sbjct: 272 VAAIYEPPQIGTQNSLQLL-DDPKAKVVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 330

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
            NK  Y LS  E + +  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 331 RNKDTYFLSAEECITSGFFQNKHPNLCRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 390

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
           + C+ L ++   E  +   D P+L   K             KD+   G ++ ++      
Sbjct: 391 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 447

Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
           ++ ++ +     + P+      S+ F IENR     TQ    +L ++L+++ S   ++ I
Sbjct: 448 EYLIIDITATFPKDPVYTFSTSSNLFSIENRDALGETQ-DFHSLATYLSQNLSSVFLEII 506

Query: 365 SDFHLLLFL 373
           SDFHLLLFL
Sbjct: 507 SDFHLLLFL 515


>gi|444727732|gb|ELW68210.1| Nuclear protein localization protein 4 like protein [Tupaia
           chinensis]
          Length = 420

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 86/285 (30%), Positives = 134/285 (47%), Gaps = 68/285 (23%)

Query: 148 LAFAVKRG----GFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLV 197
           L F  K G    G++YG   E K      R EV  IYEPPQ GT+  L +L D + E +V
Sbjct: 12  LDFWRKTGNQHFGYLYGRYTEHKDIPLGIRAEVAAIYEPPQIGTQNSLELLEDPKAE-VV 70

Query: 198 DAIAAGLGMKKVGFIFTQTIMQ-----------NKKDYTLSNREVLQAVEFHAE----CN 242
           D IAA LG++KVG+IFT  + +           NK  Y LS+ E + A +F  +    C 
Sbjct: 71  DEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRNKDTYFLSSEECITAGDFQNKHPNICR 130

Query: 243 M-------EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKL 295
           +        ++VTAV       +    +HFE +Q+S+ C+ L ++   E  +   D P+L
Sbjct: 131 LSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQVSNQCMALVRD---ECLLPCKDAPEL 183

Query: 296 SKMK-------------KDVVVGGKDVKEVDN----DFFLVVVKILDHQGPLSS------ 332
              K             KD    G ++ ++      ++ ++ +     + P+ +      
Sbjct: 184 GYAKESSSEQYVPDVFYKDTDKFGNEITQLARPLPVEYLIIDITTTFPKDPVYTFSISPN 243

Query: 333 TFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
            FPIENR     TQ    +L ++L+++ S   +  +SDFHLLLFL
Sbjct: 244 PFPIENRDVLGETQ-DFHSLATYLSQNTSAMFLDTVSDFHLLLFL 287


>gi|118348606|ref|XP_001007778.1| NPL4 family protein [Tetrahymena thermophila]
 gi|89289545|gb|EAR87533.1| NPL4 family protein [Tetrahymena thermophila SB210]
          Length = 1157

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/333 (30%), Positives = 152/333 (45%), Gaps = 61/333 (18%)

Query: 106  KMTMDDLIAKQMRVTRQENPHCESVSFD--RDCADAFQQYVNETLAFAVKRGGFMYGTVL 163
            K   ++ +   + V RQ   H +   F   R+ ++  Q +   +   A +R G+MYG   
Sbjct: 852  KAMCNNCLPPNVVVYRQPYRHVDFAQFMNVREVSNFVQAWTQNS--HAEQRVGYMYGYYA 909

Query: 164  EDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI 217
            ED       R  +  IYEPPQ G      IL DE E  LV+ IA  L ++K+G+IFT TI
Sbjct: 910  EDPNYKGGVRAIIEAIYEPPQIGDISGFQILEDENE-TLVNTIAEALTLEKIGWIFT-TI 967

Query: 218  MQNKKDYTLSNREVLQAVEFHAECNM--------EEWVTAVVKLEVSEEGSAAIHFEAFQ 269
                 D  LS+ EV  A  F  +  +          ++T V  L   ++  + I  E + 
Sbjct: 968  ---NHDTFLSSHEVRMAARFQEQYRVLHESGYPFSTFITTV--LRTGKDNPSEIKPEVYM 1022

Query: 270  MSDMCVRLFKEGWF-ETEIAEGDDPKLSKMKKDV----VVGGKDVKEVDNDFFLVVV--- 321
            +SD    L K   F ++E  +    + SK ++DV    +V GK V E + DF LV V   
Sbjct: 1023 VSDQAQVLEKNNVFGDSERRKFMKVRESKDERDVLPSIIVSGKQVNEFEPDFLLVNVAHG 1082

Query: 322  -------KILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLA 374
                    I+ H     + FP+ENR  Q          N+S      ++ SDFHLL++LA
Sbjct: 1083 HSNHNKFSIVKH-----ADFPVENRQEQ----------NKS-----WEKYSDFHLLIYLA 1122

Query: 375  RFLDLNSDVPALAQCVQAQTAVPEGYKLLINSM 407
            + +DL + +  +A  V  +  VPE  + LI  +
Sbjct: 1123 KLVDLETTL-VIADAVAKEKDVPETCEELIRGI 1154


>gi|148226640|ref|NP_001088114.1| uncharacterized protein LOC494816 [Xenopus laevis]
 gi|52430472|gb|AAH82846.1| LOC494816 protein [Xenopus laevis]
          Length = 610

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 144/309 (46%), Gaps = 60/309 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +R G++YG   E K      R E
Sbjct: 215 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQRIGYLYGRYTEHKDIPLGLRAE 271

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D+ +  +VD IAA LG++KVG+IFT  + +          
Sbjct: 272 VAAIYEPPQIGTQNSLQLL-DDPKSNVVDGIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 330

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
            NK  Y LS  E + A  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 331 RNKDTYFLSAEECITAGYFQNKYPNLCRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 390

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
           + C+ L ++  F   +   D P+L   K             KD+   G ++ ++      
Sbjct: 391 NQCMALVRDECF---LPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 447

Query: 315 DFFLVVVKILDHQGPL------SSTFPIENRT----TQVTMRALKSHLNRSPSLPLVKRI 364
           ++ ++ +     + P+       + FPIENR     TQ    +L ++L+++     +  I
Sbjct: 448 EYLIIDITTTFPKDPVYTFSTSCNLFPIENRDVLGETQ-DFHSLATYLSQNLPSVFLDII 506

Query: 365 SDFHLLLFL 373
           SDFHLL FL
Sbjct: 507 SDFHLLQFL 515


>gi|410895239|ref|XP_003961107.1| PREDICTED: nuclear protein localization protein 4 homolog [Takifugu
           rubripes]
          Length = 624

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 93/308 (30%), Positives = 143/308 (46%), Gaps = 59/308 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +R G++YG   E K      R E
Sbjct: 229 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQRMGYLYGRYTEHKDIPLGIRAE 285

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ  T+  L ++ D+ +   VD IAA LG+ KVG+IFT  + +          
Sbjct: 286 VAAIYEPPQNATQNSLELI-DDPKAAAVDEIAAKLGLCKVGWIFTDLLSEDMRIGTVCYS 344

Query: 220 -NKKDYTLSNREVLQAVEF---HAE-CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
            NK  + LS  E + A  F   H+  C +     + +  V +  +      +HFE +Q+S
Sbjct: 345 RNKDSHYLSAEECITAGYFQNLHSNPCRLSRDGYFGSKFVTVVATGGPDNQVHFEGYQVS 404

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKKDVV------VGGKDVKEVDNDF-FL------ 318
           + C+ L ++   E  +   D P+L   K+         V  KD  +  ND  FL      
Sbjct: 405 NQCMALVRD---ECLLPCKDAPELGFAKESSPEQYVPDVFYKDTDKFGNDVTFLARPLPV 461

Query: 319 --VVVKIL-----DHQGPLSST--FPIENR----TTQVTMRALKSHLNRSPSLPLVKRIS 365
             +++ I      D Q   SS   FPIENR     TQ    +L ++L++  S   +  +S
Sbjct: 462 EYLIIDITTTFPKDPQYTFSSAQRFPIENRDILGETQ-NFHSLSTYLSQCTSTAFLGIVS 520

Query: 366 DFHLLLFL 373
           DFHLLLFL
Sbjct: 521 DFHLLLFL 528


>gi|449478740|ref|XP_004177023.1| PREDICTED: LOW QUALITY PROTEIN: nuclear protein localization
           protein 4 homolog [Taeniopygia guttata]
          Length = 650

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 88/310 (28%), Positives = 143/310 (46%), Gaps = 61/310 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +  G++YG   E K      R E
Sbjct: 225 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHLGYLYGRYTEHKDIPLGIRAE 281

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L IL D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 282 VAAIYEPPQIGTQNSLEILEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 340

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
            NK  Y LS  E + A  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 341 RNKDTYFLSAEECITAGNFQNQQPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 400

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
           + C+ L ++   E  +   D P+L   K             KD+   G ++ ++      
Sbjct: 401 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 457

Query: 315 DFFLVVVKILDHQGP-------LSSTFPIENRT----TQVTMRALKSHLNRSPSLPLVKR 363
           ++ ++ +     +G        L +  PIEN      TQ    +L ++L+++ S   +  
Sbjct: 458 EYLIIDITTTFPKGSSLHFFYFLKTHSPIENPMCLGETQ-DFHSLATYLSQNTSSIFLDI 516

Query: 364 ISDFHLLLFL 373
           ISDFHLLLFL
Sbjct: 517 ISDFHLLLFL 526


>gi|351705102|gb|EHB08021.1| Nuclear protein localization protein 4-like protein [Heterocephalus
           glaber]
          Length = 583

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 83/271 (30%), Positives = 125/271 (46%), Gaps = 64/271 (23%)

Query: 158 MYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGF 211
           +YG   E K      R EV  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+
Sbjct: 227 LYGRYTEKKDIPLGIRAEVAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGW 285

Query: 212 IFTQTI-----------MQNKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTA 249
           IFT  I            QNK  Y L + E + A +F  +    C +        ++VTA
Sbjct: 286 IFTDLISEDTRKGTVRYSQNKDTYFLCSEECITAGDFQNKHPNICRLSSDGHFGSKFVTA 345

Query: 250 VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK---------- 299
           V       +    +HFE FQ+S  C+ L ++   E  +   D P+L   K          
Sbjct: 346 VATGGPDNQ----VHFEGFQVSSQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPD 398

Query: 300 ---KDVVVGGKDVKEVDND----FFLVVVKILDHQGPL------SSTFPIENR----TTQ 342
              KD+   G ++ ++       + ++ +     + P+       + FPIEN      TQ
Sbjct: 399 VFHKDIDKSGNEITQLARPLPVVYLIIDITTTFPKDPVYTFSISQNPFPIENWDVLGETQ 458

Query: 343 VTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
               +L ++L+++ S   +  ISDFHLLLFL
Sbjct: 459 -DFHSLATYLSQNTSFVFLDTISDFHLLLFL 488


>gi|260817485|ref|XP_002603617.1| hypothetical protein BRAFLDRAFT_126926 [Branchiostoma floridae]
 gi|229288937|gb|EEN59628.1| hypothetical protein BRAFLDRAFT_126926 [Branchiostoma floridae]
          Length = 530

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 145/331 (43%), Gaps = 61/331 (18%)

Query: 119 VTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVN 171
           + RQ+  H +++ F+     D F  Y  +T     +R GF+YG     K      R  V 
Sbjct: 134 LNRQKYRHVDNIMFENHTIVDRFLDYWRKT---GNQRLGFLYGKYEHHKDVPLGIRATVA 190

Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQN 220
            IYEPPQ GT   L +L D   + +VD IA  LG++KVG+IFT  +            +N
Sbjct: 191 AIYEPPQVGTANSLELLEDRNAD-VVDDIARKLGLRKVGWIFTDLVADDSGKGTVKNFRN 249

Query: 221 KKDYTLSNREVLQAVEF---HAECNM----EEWVTAVVKLEVSEEGSAAIHFEAFQMSDM 273
                LS  E + A EF   H  C        + +  V +  S + S  IHFE +Q+S+ 
Sbjct: 250 ADSVFLSAEECIMAGEFQNQHPNCTKLSPDGTFGSKFVTVVASGDESHQIHFEGYQVSNQ 309

Query: 274 CVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDNDF 316
           C+ L ++   +  +   D P+L  +K+        DV    KD          + +  ++
Sbjct: 310 CMALVRD---DCLVPTKDAPELGYIKESTPEQYVPDVFYKEKDSYGNLVTLLARPLPVEY 366

Query: 317 FLVVVKILDHQGPLSS-------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISD 366
            LV V       P+S+        FPIENR     ++    L +++ + P   L++ +SD
Sbjct: 367 LLVQVTTGFPVDPVSTFNIRVAKPFPIENRADIGDIQDFGQLTAYMQQFPGYRLLETMSD 426

Query: 367 FHLLLFLAR--FLDLNSDVPALAQCVQAQTA 395
            H L +LA    L L   +  L + V+ Q A
Sbjct: 427 LHFLSYLATMDMLPLKDHMDPLLEGVREQNA 457


>gi|41054207|ref|NP_956101.1| nuclear protein localization protein 4 homolog [Danio rerio]
 gi|28277776|gb|AAH45845.1| Nuclear protein localization 4 homolog (S. cerevisiae) [Danio
           rerio]
 gi|182889954|gb|AAI65856.1| Nploc4 protein [Danio rerio]
          Length = 624

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 145/309 (46%), Gaps = 60/309 (19%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +R G++YG   E K      R E
Sbjct: 229 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQRMGYLYGRYTEHKDIPLGIRAE 285

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
           V  IYEPPQ  T+  L ++ D + +  V+ IAA LG++KVG+IFT  +            
Sbjct: 286 VAAIYEPPQIATQNSLELIEDPKADA-VEEIAAKLGLRKVGWIFTDLLSEDTRIGTVRYT 344

Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
           +NK  Y LS  E + A  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 345 RNKDSYFLSAEECITAGHFQNQQANMCRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 404

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDN 314
           + C+ L ++   E  +   D P+L   K+        DV    KD          + +  
Sbjct: 405 NQCMALVRD---ECLLPCRDAPELGYAKESSTEQYVPDVFYKDKDKFGNDITHLARPLPV 461

Query: 315 DFFLVVVKILDHQGPL---SST--FPIENR----TTQVTMRALKSHLNR-SPSLPLVKRI 364
           ++ ++ +     + P+   SST  FPIENR     TQ    +L ++L++ + S   +  +
Sbjct: 462 EYLIIDITTTFPKDPVFTFSSTFRFPIENRDALGETQ-DFHSLATYLSQCTSSTSFLDIV 520

Query: 365 SDFHLLLFL 373
           SDFHLLLFL
Sbjct: 521 SDFHLLLFL 529


>gi|383170952|gb|AFG68738.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170953|gb|AFG68739.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170954|gb|AFG68740.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170955|gb|AFG68741.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170956|gb|AFG68742.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170957|gb|AFG68743.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170958|gb|AFG68744.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170960|gb|AFG68746.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170961|gb|AFG68747.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170962|gb|AFG68748.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170963|gb|AFG68749.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170964|gb|AFG68750.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170965|gb|AFG68751.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170966|gb|AFG68752.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170967|gb|AFG68753.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
 gi|383170968|gb|AFG68754.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
          Length = 51

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 39/51 (76%), Positives = 43/51 (84%)

Query: 110 DDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYG 160
           DDLIA+Q+R+ RQE PH  S SFDRD A+ FQQYVNE LAFAVKRGGFMYG
Sbjct: 1   DDLIARQVRIERQEKPHSVSASFDRDAANVFQQYVNENLAFAVKRGGFMYG 51


>gi|221123865|ref|XP_002157402.1| PREDICTED: nuclear protein localization protein 4 homolog [Hydra
           magnipapillata]
          Length = 621

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 80/306 (26%), Positives = 144/306 (47%), Gaps = 58/306 (18%)

Query: 119 VTRQENPHCESVSFDRD-CADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVN 171
           ++RQ+  H +++ F+     D F  Y   T     +R G++YG   +        R  V 
Sbjct: 225 LSRQKYRHVDNIMFENPMIMDRFLNYWRNT---GFQRLGYLYGKYEKHDNVPLGIRATVA 281

Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQN 220
            IYEPPQ+ T++ + +L D  +E  V+ +A+ +G+++VG+IFT  +           +++
Sbjct: 282 AIYEPPQENTKDRIQLLEDNHDE-FVEILASYMGLRRVGWIFTDLLPDEENSGLVKHLRH 340

Query: 221 KKDYTLSNREVLQAVEFHA-------ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDM 273
            K + LS +E + A  F         + +  ++ +  V + VS      IH++ +Q+S+ 
Sbjct: 341 SKSHFLSAQECITAAHFQNKYPNSCRDSSAGKFGSKFVTVSVSGHEDKHIHYDGWQVSNQ 400

Query: 274 CVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KEVDNDF 316
           C+ L ++G     I   DDP L  +K+        DV    KDV         + +  ++
Sbjct: 401 CMALVRDGCLVPTI---DDPALGYIKESSSDQYVPDVFYKEKDVYGNEVTRIGRPMPMEY 457

Query: 317 FLVVVKI---LDHQ----GPLSSTFPIENRTT--QVTMRALKSHLNRSPSLPLVKRISDF 367
           FL+ V     ++ Q    G   S FPIENR+   ++    + +   R+ S  +V   SDF
Sbjct: 458 FLIEVPAAFPIEPQYTFVGCRDSPFPIENRSQIGEIQNFDVFAQYVRTISSNIVDGFSDF 517

Query: 368 HLLLFL 373
           H LL++
Sbjct: 518 HFLLYM 523


>gi|383170959|gb|AFG68745.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
          Length = 51

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 38/51 (74%), Positives = 42/51 (82%)

Query: 110 DDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYG 160
           DDLIA+Q+R+ RQE PH    SFDRD A+ FQQYVNE LAFAVKRGGFMYG
Sbjct: 1   DDLIARQVRIERQEKPHSVFASFDRDAANVFQQYVNENLAFAVKRGGFMYG 51


>gi|429327757|gb|AFZ79517.1| hypothetical protein BEWA_023660 [Babesia equi]
          Length = 384

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 78/273 (28%), Positives = 121/273 (44%), Gaps = 43/273 (15%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
           F  Y  +TL  A +R G++YG  +ED       R     IYEPPQ+   E   +L D   
Sbjct: 109 FANYWIDTLEMAEQRAGWLYGYYIEDSHYPLGIRAVCEAIYEPPQRSFMEYSELLSDPFL 168

Query: 194 EKLVDAIAAGLGMKKVGFIFT----QTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTA 249
              VD IA  LG++++G+IFT    + +M  K     + R++  A + H        V+ 
Sbjct: 169 NT-VDRIANKLGLERIGYIFTHLPRENVMTPKDIIDAAKRQLDTAKDIHYTGYP---VST 224

Query: 250 VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------- 300
            V   +  +G       AF  SD  + L ++G F     E +D K+ +++K         
Sbjct: 225 HVTCTMCPDGEGKPVLNAFMASDSAMALLRDGMFS---EEQNDIKMVEIRKASSSFEILP 281

Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPL----SSTFPIENRTTQVTMRALKSHLNRSP 356
            +   GK+V   D+D+ +V V       P+     + FP ENR     +   K     +P
Sbjct: 282 QIFESGKEVSSFDSDWLIVRVNDSAPIHPMPFFKHTGFPRENREVSTDVNIWK--YTNTP 339

Query: 357 SLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
                    DFHLLL++A  LD+++   ALA C
Sbjct: 340 --------CDFHLLLYIANTLDVDT---ALAVC 361


>gi|355569013|gb|EHH25294.1| hypothetical protein EGK_09090 [Macaca mulatta]
          Length = 629

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 98/349 (28%), Positives = 148/349 (42%), Gaps = 104/349 (29%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 198 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 254

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 255 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 313

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 314 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 369

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DV----------------- 302
           +Q+S+ C+ L ++   E  +   D P+L   K+        DV                 
Sbjct: 370 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKVKVCVPITASRARN 426

Query: 303 ------------VVGGKDVKEVDNDFFLVVVKILDHQGP-------LSSTF--------- 334
                       V GG      D D F   +  L    P       +++TF         
Sbjct: 427 VEAVLTVAATRHVEGGHTTLTHDVDKFGNEITQLARPLPVEYLIIDITTTFPKDPVYTFS 486

Query: 335 ------PIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
                 PIENR     TQ    +L ++L+++ S   +  ISDFHLLLFL
Sbjct: 487 ISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDTISDFHLLLFL 534


>gi|355754452|gb|EHH58417.1| hypothetical protein EGM_08267 [Macaca fascicularis]
          Length = 704

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 98/349 (28%), Positives = 148/349 (42%), Gaps = 104/349 (29%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 273 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 329

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 330 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 388

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
            NK  Y LS+ E + A +F  +    C +        ++VTAV       +    +HFE 
Sbjct: 389 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 444

Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DV----------------- 302
           +Q+S+ C+ L ++   E  +   D P+L   K+        DV                 
Sbjct: 445 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKVKVCVPITASRARN 501

Query: 303 ------------VVGGKDVKEVDNDFFLVVVKILDHQGP-------LSSTFP-------- 335
                       V GG      D D F   +  L    P       +++TFP        
Sbjct: 502 VEAVLTVAATRHVEGGHTTLTHDVDKFGNEITQLARPLPVEYLIIDITTTFPKDPVYTFS 561

Query: 336 -------IENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
                  IENR     TQ    +L ++L+++ S   +  ISDFHLLLFL
Sbjct: 562 ISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDTISDFHLLLFL 609


>gi|395826791|ref|XP_003786598.1| PREDICTED: nuclear protein localization protein 4 homolog [Otolemur
           garnettii]
          Length = 625

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 83/288 (28%), Positives = 136/288 (47%), Gaps = 51/288 (17%)

Query: 119 VTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVN 171
           + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R EV 
Sbjct: 261 LNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAEVA 317

Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ-----------N 220
            IYEPPQ GT+  L +L D + E +VD IAA LG++KVG+IFT  + +           N
Sbjct: 318 AIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRN 376

Query: 221 KKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE 280
           K  Y LS+ E + A +F  +         + +L  S +G    HF + +       +F +
Sbjct: 377 KDTYFLSSEECITAGDFQNK------HPNICRL--SPDG----HFGS-KFVTAVATVFGQ 423

Query: 281 GWFET-EIAEGDDPKLSKMKKDVVVGGKDVKEVDN----DFFLVVVKILDHQGPLSS--- 332
           GW  T  +        S   +D+   G ++ ++      ++ ++ +     + P+ +   
Sbjct: 424 GWVRTCHLRYMGLAPYSFEPQDIDKFGNEITQLARPLPVEYLIIDITTTFPKDPVHTFSI 483

Query: 333 ---TFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
               FPIENR     TQ    +L ++L+++ S   +  ISDFHLLLFL
Sbjct: 484 SQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSMFLDTISDFHLLLFL 530


>gi|91088135|ref|XP_970927.1| PREDICTED: similar to nuclear protein localization [Tribolium
           castaneum]
          Length = 638

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 78/310 (25%), Positives = 143/310 (46%), Gaps = 58/310 (18%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
           + + RQ   H ++V+F+    D  ++++N   +   +R GF+YG       V    R  V
Sbjct: 235 ITLNRQVYRHVDNVAFEN--KDLVERFLNYWRSTGHQRIGFLYGNYEIHNDVPLGIRANV 292

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
             IYEPPQ+ + + +++LRD++EE +V+ +A  LG+++VG+IFT  I           ++
Sbjct: 293 VAIYEPPQESSRDSIHLLRDDKEE-IVEELAQNLGLRRVGWIFTDLIPEDVQKGTVKHLR 351

Query: 220 NKKDYTLSNREVLQAVEFH----AECNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
           +   + LS  E + A  F       C       + + +V + V+ + +  +H E +Q+S+
Sbjct: 352 HINSHFLSAEECIMAGHFQNMYPNPCRFASGGYFGSKIVTVCVTGDKTNQVHMEGYQVSN 411

Query: 273 MCVRLFKEGWFETEIAEGDDPKLSKMKKDV---VVGGKDVKEVDN--------------D 315
            C+ L ++      +   D P+L  +++      V     KE DN              +
Sbjct: 412 QCMALVRDNCL---LPTKDAPELGYVRESSDKQFVPDVYYKEKDNYGNEVSRLGRPLPVE 468

Query: 316 FFLVVVKILD--------HQGPLSSTFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRI 364
           + L+ V            +  P    FP+ENR     ++   AL  +L++  S      I
Sbjct: 469 YLLLDVPASTPVTPTYTFNCDPSKQPFPVENRFIDGEIQDFNALNQYLSQFTSNEFFTAI 528

Query: 365 SDFHLLLFLA 374
           +DFHLLL++A
Sbjct: 529 NDFHLLLYIA 538


>gi|403333884|gb|EJY66073.1| Nuclear pore associated protein (NLP4), putative [Oxytricha
           trifallax]
          Length = 695

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/275 (30%), Positives = 133/275 (48%), Gaps = 40/275 (14%)

Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R  ++YG   ED       RV +  IYEPPQ G    +  L D  + K VD IA  L +
Sbjct: 269 QRMAWLYGYYSEDPNYPEGVRVNIEAIYEPPQVGEINGVSELDDPMQAK-VDMIAEALTL 327

Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEF-------HAE-CNMEEWVTAVVKLEVSEE 258
           +KVG+IFT        D  LS +EV +  ++       H E  N+ ++VT VVK    + 
Sbjct: 328 EKVGWIFTSI----NHDAFLSAQEVRKISKYQEQYKVEHPEGYNVSKFVTVVVK---PKG 380

Query: 259 GSAAIHFEAFQMSDMCVRLFKEGWF-ETE------IAEGDDPKLSKMKKDVVVGGKDVKE 311
             + I  E + +SD C  L ++  F E+E      + EG     ++M   VV  GK  KE
Sbjct: 381 DGSEIGIECYMVSDQCQALERDNIFGESESRKKMVLREGGP---NEMIPSVVKEGKPAKE 437

Query: 312 VDNDFFLVVV------KILDHQGPLSSTFPIENRTTQV-TMRALKSHLNRSPSLPLVKRI 364
            + DFF+V +         D++   +  FP++NR  +    +    +L R    P  ++ 
Sbjct: 438 FEPDFFIVSLANGQPKSNKDYKYLKNYDFPMQNRIGRTPAKKEFSDYLKRHKGEPSQRKY 497

Query: 365 SDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEG 399
           ++ HLL++LA  LD+++ + A+AQ V  +  +  G
Sbjct: 498 ANMHLLVYLADLLDIHTAM-AIAQNVAQEQPLDPG 531


>gi|156085300|ref|XP_001610127.1| NPL4 family protein [Babesia bovis]
 gi|154797379|gb|EDO06559.1| NPL4 family protein [Babesia bovis]
          Length = 480

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 82/280 (29%), Positives = 124/280 (44%), Gaps = 42/280 (15%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRDEEE 193
           F  Y    L  A +R G++YG  +ED    +        IYEPPQ  T   +  L DE  
Sbjct: 193 FANYWMNDLEMAEQRAGWLYGYYVEDSHYPLGIRAVCEGIYEPPQHSTLCDVEFLPDEFI 252

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF-----HAECNMEEWVT 248
              VDAIAA LG++++G I T    +N     LS ++V+ + +      H        V+
Sbjct: 253 ST-VDAIAARLGLERIGHILTHLPRENY----LSPQDVIDSAKVQLGRVHKTHYTRYPVS 307

Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAE-GDDPKLSKMKK------- 300
             +   +  +        AF +SD  + L ++G     IA+   DP +  M++       
Sbjct: 308 THITCTLHPDSEGKPALNAFMVSDTAMALLRDGI----IADVQPDPMVISMRQPTNKNEV 363

Query: 301 --DVVVGGKDVKEVDNDFFLVVVKILDHQGPLS----STFPIENRTTQ-VTMRALK---- 349
              ++  GK+V   D  +F++ V       P S    S FP ENRT + +T  A+K    
Sbjct: 364 LPQIIESGKEVTSFDPSWFVIRVNDSAPINPNSIFKYSDFPRENRTIRPITPDAVKKFFS 423

Query: 350 SHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
           S L            SDFHLL+++AR LD+ +   ALA C
Sbjct: 424 SRLAEGLETSDANIFSDFHLLIYMARVLDVET---ALAIC 460


>gi|426346408|ref|XP_004040871.1| PREDICTED: nuclear protein localization protein 4 homolog [Gorilla
           gorilla gorilla]
          Length = 584

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 85/302 (28%), Positives = 139/302 (46%), Gaps = 70/302 (23%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNR 229
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++             K  Y LS+ 
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLR-------------KDTYFLSSE 315

Query: 230 EVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLF 278
           E + A +F  +    C +        ++VTAV       +    +HFE +Q+S+ C+ L 
Sbjct: 316 ECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQVSNQCMALV 371

Query: 279 KEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN----DFFLVVV 321
           ++   E  +   D P+L   K             KDV   G ++ ++      ++ ++ +
Sbjct: 372 RD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLARPLPVEYLIIDI 428

Query: 322 KILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLL 371
                + P+       + FPIENR     TQ    +L ++L+++ S   +  ISDFHLLL
Sbjct: 429 TTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDTISDFHLLL 487

Query: 372 FL 373
           FL
Sbjct: 488 FL 489


>gi|403280765|ref|XP_003931880.1| PREDICTED: nuclear protein localization protein 4 homolog [Saimiri
           boliviensis boliviensis]
          Length = 542

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 85/302 (28%), Positives = 139/302 (46%), Gaps = 70/302 (23%)

Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F +   AD F  +  +T     +  G++YG   E K      R E
Sbjct: 171 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 227

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNR 229
           V  IYEPPQ GT+  L +L D + E +VD IAA LG++             K  Y LS+ 
Sbjct: 228 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLR-------------KDTYFLSSE 273

Query: 230 EVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLF 278
           E + A +F  +    C +        ++VTAV       +    +HFE +Q+S+ C+ L 
Sbjct: 274 ECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQVSNQCMALV 329

Query: 279 KEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN----DFFLVVV 321
           ++   E  +   D P+L   K             KDV   G ++ ++      ++ ++ +
Sbjct: 330 RD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLARPLPVEYLIIDI 386

Query: 322 KILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLL 371
                + P+       + FPIENR     TQ    +L ++L+++ S   +  ISDFHLLL
Sbjct: 387 TTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDTISDFHLLL 445

Query: 372 FL 373
           FL
Sbjct: 446 FL 447


>gi|195395696|ref|XP_002056472.1| GJ10209 [Drosophila virilis]
 gi|194143181|gb|EDW59584.1| GJ10209 [Drosophila virilis]
          Length = 655

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 147/329 (44%), Gaps = 60/329 (18%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNF 172
           + RQ   H ++V F+       ++++N       +R GF+YGT  +        R +V  
Sbjct: 253 LNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGFLYGTYEQHADVPLGIRAKVAA 310

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT------- 225
           IYEPPQ+ + + + I  DE  ++ VDA+A  LG+KK+G+IFT  I ++    T       
Sbjct: 311 IYEPPQESSRDSINICPDEGADE-VDAVAKALGLKKIGWIFTDLITEDASAGTVKQIRGI 369

Query: 226 ----LSNREVLQAVEF-HAECNMEEWVT------AVVKLEVSEEGSAAIHFEAFQMSDMC 274
               L+ +E + A E  +   N  ++ T        V + V+ + +  +H E + +S  C
Sbjct: 370 ETHFLTAQECISAGELQNRHPNPCKYATNGTFGSKFVTICVTGDQTKQVHMEGYAVSAQC 429

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDNDFF 317
           + L ++      I   D P+L  +++        DV    KD          + +  ++ 
Sbjct: 430 MALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARPLPVEYL 486

Query: 318 LVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISD 366
           LV V       P  +         FPIENR     ++   AL S+L+       ++ ISD
Sbjct: 487 LVDVPASTPLQPQYTFTEYDKRQPFPIENRYIDGHLQDFNALSSYLSAWAEEDFLEAISD 546

Query: 367 FHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
           FHLL++L +   L L   +  L + V+A+
Sbjct: 547 FHLLIYLYKMDMLPLRQHMGPLLEAVRAK 575


>gi|432926050|ref|XP_004080805.1| PREDICTED: nuclear protein localization protein 4 homolog [Oryzias
           latipes]
          Length = 550

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 80/288 (27%), Positives = 124/288 (43%), Gaps = 64/288 (22%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +R G++YG   E K      R E
Sbjct: 229 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GSQRVGYLYGRYTEHKDIPLGIRAE 285

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
           V  IYEPPQ   +  L +L D +    VD IAA LG+ KVG+IFT  +           +
Sbjct: 286 VAAIYEPPQTANQNSLELLEDPKAAA-VDEIAAKLGLCKVGWIFTDLLSEDTRIGTVRYL 344

Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
           +N+  Y LS  E + +  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 345 RNQDSYYLSAEECITSGYFQNQHPNPCRLSRDGFFGSKFVTVVATGGPDNQVHFEGYQVS 404

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLS 331
           + C+ L +            D  L      +                      D Q   +
Sbjct: 405 NQCMALVR------------DECLLPFPGSITTTFPK----------------DPQYTFT 436

Query: 332 ST--FPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
           ST  FPIENR     TQ    +L ++L++  S   +  +SDFHLLLFL
Sbjct: 437 STQRFPIENRDILGETQ-DFHSLATYLSQCSSTSFLDIVSDFHLLLFL 483


>gi|449269906|gb|EMC80643.1| Nuclear protein localization protein 4 like protein, partial
           [Columba livia]
          Length = 453

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 72/255 (28%), Positives = 118/255 (46%), Gaps = 37/255 (14%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+    AD F  +  +T     +  G++YG   E K      R E
Sbjct: 209 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHLGYLYGRYTEHKDIPLGIRAE 265

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
           V  IYEPPQ GT+  L IL D + E +VD IAA LG++KVG+IFT  + +          
Sbjct: 266 VAAIYEPPQIGTQNSLEILDDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 324

Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
            NK  Y LS  E + A  F  +    C +     + +  V +  +      +HFE +Q+S
Sbjct: 325 RNKDTYYLSAEECITAGHFQNQQPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 384

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLS 331
           + C+ L ++   E  +   D P+L   K+         + V + F+  + K  +    L+
Sbjct: 385 NQCMALVRD---ECLLPCRDAPELGYAKES-----SSEQYVPDVFYKDIDKFGNEITQLA 436

Query: 332 STFPIENRTTQVTMR 346
              P+E     V++R
Sbjct: 437 RPLPVEYLIIDVSVR 451


>gi|427793885|gb|JAA62394.1| Putative nuclear pore complex rnpl4 component sc npl4, partial
           [Rhipicephalus pulchellus]
          Length = 588

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 145/317 (45%), Gaps = 66/317 (20%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ   H + V F+     + F  +  ET+    +R GF+YG     K      R  
Sbjct: 198 LTLNRQVYRHVDMVMFENPTIVEHFLDFWRETMH---QRIGFLYGRYEAHKDVPLGIRAT 254

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
           V  IYEPPQ+ T + + +L DE+E+ LVD +A+ LG+ +VG+IFT  +           +
Sbjct: 255 VAAIYEPPQESTPDAVKLLPDEKEQ-LVDELASRLGLVRVGWIFTDLVPEDLQKGTVKHL 313

Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
           +N +   LS +E + A  F       C +     + +  V + V+ +    +H E +Q+S
Sbjct: 314 RNIESCFLSAQECIMAGHFQNSHPNPCRLSPDGYFGSKFVTVCVTGDSENRVHMEGYQVS 373

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVG-----GKDVKEVDN---- 314
           + C+ L ++      +   D P+L  +++        DV        G +V ++      
Sbjct: 374 NQCMALVRDNCL---VPTRDAPELGYVRESSNAQYVPDVFYKYTDSYGNEVTQMARPLPI 430

Query: 315 DFFLVVVKILDHQGPLSST---------FPIENRTTQVTMR---ALKSHL-----NRSPS 357
           ++ LV V +   + P   T         FP+ENR  +  ++   A+  HL     N + S
Sbjct: 431 EYLLVDVPVSTPKDPKEPTFYAAPGKRHFPVENRLLEGHIQDLGAVARHLSQFTANNAGS 490

Query: 358 LPLVKRISDFHLLLFLA 374
             +++  SDFHLLL+LA
Sbjct: 491 ENVLEAFSDFHLLLYLA 507


>gi|427793273|gb|JAA62088.1| Putative nuclear pore complex rnpl4 component sc npl4, partial
           [Rhipicephalus pulchellus]
          Length = 628

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 83/317 (26%), Positives = 143/317 (45%), Gaps = 66/317 (20%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ   H + V F+     + F  +  ET+    +R GF+YG     K      R  
Sbjct: 238 LTLNRQVYRHVDMVMFENPTIVEHFLDFWRETMH---QRIGFLYGRYEAHKDVPLGIRAT 294

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
           V  IYEPPQ+ T + + +L DE+E+ LVD +A+ LG+ +VG+IFT  +           +
Sbjct: 295 VAAIYEPPQESTPDAVKLLPDEKEQ-LVDELASRLGLVRVGWIFTDLVPEDLQKGTVKHL 353

Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
           +N +   LS +E + A  F       C +     + +  V + V+ +    +H E +Q+S
Sbjct: 354 RNIESCFLSAQECIMAGHFQNSHPNPCRLSPDGYFGSKFVTVCVTGDSENRVHMEGYQVS 413

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDN 314
           + C+ L ++      +   D P+L  +++        DV     D          + +  
Sbjct: 414 NQCMALVRDNCL---VPTRDAPELGYVRESSNAQYVPDVFYKYTDSYGNEVTQMARPLPI 470

Query: 315 DFFLVVVKILDHQGPLSST---------FPIENRTTQVTMR---ALKSHL-----NRSPS 357
           ++ LV V +   + P   T         FP+ENR  +  ++   A+  HL     N + S
Sbjct: 471 EYLLVDVPVSTPKDPKEPTFYAAPGKRHFPVENRLLEGHIQDLGAVARHLSQFTANNAGS 530

Query: 358 LPLVKRISDFHLLLFLA 374
             +++  SDFHLLL+LA
Sbjct: 531 ENVLEAFSDFHLLLYLA 547


>gi|428183916|gb|EKX52773.1| NPL4-like protein, forms a complex with UFD1 and CDC48 [Guillardia
           theta CCMP2712]
          Length = 656

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 141/323 (43%), Gaps = 56/323 (17%)

Query: 103 FGRKMTMDDLIAKQMRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGT 161
           +G +      + K M + RQ+  H + V   D    D F     +T A    R G++YGT
Sbjct: 212 WGERTVSMSRLPKPMTLDRQKYRHVDRVEIEDPQILDKFLDAWRQTGAM---RYGYLYGT 268

Query: 162 VLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIF-- 213
              D  V       V  IYEPPQ+   + + +LRD   +  +D+ AA LG+ KVG+IF  
Sbjct: 269 YEVDTNVPLGIKAVVKAIYEPPQECAADGVNMLRDPYSDA-IDSTAAMLGLVKVGWIFID 327

Query: 214 --------TQTIMQNKKD-YTLSNREVLQAVEFHA-------ECNMEEWVTAVVKLEVSE 257
                    + +    KD Y L + E + A            EC+  E+ +  V + V+ 
Sbjct: 328 LEVMADDHNKYVCSRGKDTYFLRSNECIIAARLQNMNPSKCLECDDLEFGSKFVTVVVTG 387

Query: 258 EGSAAIHFEAFQMSDMCVRLFK-EGWFETEIAEGDDPKLSKMKKDVVVGG---KDVKEVD 313
                + + A+Q+S   + L + EG  E   A+ +  ++++   D  V     +DV E  
Sbjct: 388 NEKNELDYVAYQVSQQGMDLERAEGILEY-TAKPNLCRVAQSTPDRYVPAIFYQDVSEFG 446

Query: 314 ND-------FFLVVVKILDHQGP---------LSSTFPIENRT-TQVTMRALKSHLNRSP 356
           N        +F V   ++   GP             FPIENR     T+ ALK+HLN   
Sbjct: 447 NKMVHEAKPYFPVDYLVVSVNGPTFPQQANPLFKRNFPIENRMYAPQTLEALKAHLNSPD 506

Query: 357 SLPL-----VKRISDFHLLLFLA 374
           + P      +K ISDFHL+++LA
Sbjct: 507 NKPFRPTGQLKTISDFHLIVYLA 529


>gi|427797169|gb|JAA64036.1| Putative nuclear pore complex rnpl4 component sc npl4, partial
           [Rhipicephalus pulchellus]
          Length = 645

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 145/317 (45%), Gaps = 66/317 (20%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ   H + V F+     + F  +  ET+    +R GF+YG     K      R  
Sbjct: 255 LTLNRQVYRHVDMVMFENPTIVEHFLDFWRETMH---QRIGFLYGRYEAHKDVPLGIRAT 311

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
           V  IYEPPQ+ T + + +L DE+E+ LVD +A+ LG+ +VG+IFT  +           +
Sbjct: 312 VAAIYEPPQESTPDAVKLLPDEKEQ-LVDELASRLGLVRVGWIFTDLVPEDLQKGTVKHL 370

Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
           +N +   LS +E + A  F       C +     + +  V + V+ +    +H E +Q+S
Sbjct: 371 RNIESCFLSAQECIMAGHFQNSHPNPCRLSPDGYFGSKFVTVCVTGDSENRVHMEGYQVS 430

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVG-----GKDVKEVDN---- 314
           + C+ L ++      +   D P+L  +++        DV        G +V ++      
Sbjct: 431 NQCMALVRDNCL---VPTRDAPELGYVRESSNAQYVPDVFYKYTDSYGNEVTQMARPLPI 487

Query: 315 DFFLVVVKILDHQGPLSST---------FPIENRTTQVTMR---ALKSHL-----NRSPS 357
           ++ LV V +   + P   T         FP+ENR  +  ++   A+  HL     N + S
Sbjct: 488 EYLLVDVPVSTPKDPKEPTFYAAPGKRHFPVENRLLEGHIQDLGAVARHLSQFTANNAGS 547

Query: 358 LPLVKRISDFHLLLFLA 374
             +++  SDFHLLL+LA
Sbjct: 548 ENVLEAFSDFHLLLYLA 564


>gi|427783009|gb|JAA56956.1| Putative nuclear pore complex rnpl4 component sc npl4
           [Rhipicephalus pulchellus]
          Length = 592

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 145/317 (45%), Gaps = 66/317 (20%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ   H + V F+     + F  +  ET+    +R GF+YG     K      R  
Sbjct: 202 LTLNRQVYRHVDMVMFENPTIVEHFLDFWRETMH---QRIGFLYGRYEAHKDVPLGIRAT 258

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
           V  IYEPPQ+ T + + +L DE+E+ LVD +A+ LG+ +VG+IFT  +           +
Sbjct: 259 VAAIYEPPQESTPDAVKLLPDEKEQ-LVDELASRLGLVRVGWIFTDLVPEDLQKGTVKHL 317

Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
           +N +   LS +E + A  F       C +     + +  V + V+ +    +H E +Q+S
Sbjct: 318 RNIESCFLSAQECIMAGHFQNSHPNPCRLSPDGYFGSKFVTVCVTGDSENRVHMEGYQVS 377

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVG-----GKDVKEVDN---- 314
           + C+ L ++      +   D P+L  +++        DV        G +V ++      
Sbjct: 378 NQCMALVRDNCL---VPTRDAPELGYVRESSNAQYVPDVFYKYTDSYGNEVTQMARPLPI 434

Query: 315 DFFLVVVKILDHQGPLSST---------FPIENRTTQVTMR---ALKSHL-----NRSPS 357
           ++ LV V +   + P   T         FP+ENR  +  ++   A+  HL     N + S
Sbjct: 435 EYLLVDVPVSTPKDPKEPTFYAAPGKRHFPVENRLLEGHIQDLGAVARHLSQFTANNAGS 494

Query: 358 LPLVKRISDFHLLLFLA 374
             +++  SDFHLLL+LA
Sbjct: 495 ENVLEAFSDFHLLLYLA 511


>gi|397607010|gb|EJK59513.1| hypothetical protein THAOC_20248 [Thalassiosira oceanica]
          Length = 499

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 127/514 (24%), Positives = 215/514 (41%), Gaps = 127/514 (24%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           M++R+R+  G+ RV    A  T A++ + I +     V  + LS++        P D   
Sbjct: 1   MIVRVRTNVGVWRVEGLDASSTTADVLSGIAATRPHVVYERPLSSDPGCA---RPLD--- 54

Query: 61  FTDMANPDRPLSSLNISHGSIVF-------------------LSYDGERNVRGPSFN--- 98
                 P  PL   N+ HGS+V                    +  D  R+ R P+ N   
Sbjct: 55  ------PAVPLGRQNVGHGSMVHCRVDPGTCAENTADGGGESMDTDDSRDGRRPAGNTKR 108

Query: 99  -----------------PAGS-----FGRKMTMDDLIAKQM------------RVTRQEN 124
                             AGS      G+K   D  +A  +            ++ RQ+ 
Sbjct: 109 IIDKDGSIKLVHAEGSSSAGSDKGFRKGQKALRDMKMAWTLQEFTAMDDQYNFKIKRQDA 168

Query: 125 PHCE--SVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK---------------- 166
                  V  +    D FQ Y+     F  +R G++YG  ++++                
Sbjct: 169 SFVGKGGVHLESSAVDGFQSYLRR-FQFQQQRCGYLYGRFVDEEGDGDEEKDDGADKPAA 227

Query: 167 -------------------RVEVNFIYEPPQQGTEEVL--YILRDEEEEKLVDAIAAGLG 205
                              RV V  IYEPPQ+   +    ++L ++E E+ V+A+A  LG
Sbjct: 228 SSGWGARTPEEKKEAKKNARVVVEAIYEPPQEADADAAEGFVLLEDEAEEDVEALAGMLG 287

Query: 206 MKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAEC--NMEEWVTAVVKLEVSEEGSAAI 263
           +++VG+I        ++ + LS+ EV+ A E   E    +E      V++ V ++G+ A+
Sbjct: 288 LRRVGWIVGHP--PREEGFQLSSAEVIMAAELQLESAGGVEPTPFVTVRVTVGDDGNVAV 345

Query: 264 HFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKD--VVVGGKDVKEVDNDFFLVVV 321
             EAFQ+S  C+ +  E      +  G +P    + +    +  GK    V+N+FFL VV
Sbjct: 346 --EAFQVSLQCMHMVAE----EALGVGPNPGFCYVDETFTAIQEGKPSATVENNFFLTVV 399

Query: 322 KILDHQGP-LSSTFPIENRTTQVTMRA---LKSHLNRS--PSLPLVKRISDFHLLLFLAR 375
            I  +Q   L + FP  NR     M+    +K  L++S       +  +SDF LLL+L +
Sbjct: 400 PINAYQSDMLVAQFPTANRAFDDRMQTHDEMKRQLSKSGQAGWNFIDLLSDFSLLLYLCK 459

Query: 376 FLDLNSDVPALAQCVQ-AQTAVPEGYKLLINSMA 408
           F+D+ +D+P +   V+  +  + +GYK++I SMA
Sbjct: 460 FMDVKTDMPKICASVKDREIPLDDGYKIIITSMA 493


>gi|157112064|ref|XP_001651776.1| nuclear protein localization [Aedes aegypti]
 gi|108878169|gb|EAT42394.1| AAEL006056-PA [Aedes aegypti]
          Length = 619

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/330 (25%), Positives = 148/330 (44%), Gaps = 61/330 (18%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
           + RQ   H ++V F+       ++++N       +R G ++GT      V    R  V  
Sbjct: 224 LNRQAYRHIDNVMFEN--TGIVERFLNYWRTTGHQRIGMLFGTYEVHPDVPLGIRARVAA 281

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNK----------- 221
           IYEPPQ+ T + + +L DE E + VD +A  LG+++VG+IFT  + +N            
Sbjct: 282 IYEPPQESTRDSILLLEDEHEAE-VDEMARLLGLRRVGWIFTDLLAENAAAGTVKHVRGI 340

Query: 222 KDYTLSNREVLQAV----EFHAEC---NMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
           K + L+ +E + A     ++   C   N   + +  V + V+ +    +H E + ++  C
Sbjct: 341 KTHFLTAQECILAGHLQNKYPNRCKYANGGYFGSKFVTVCVTGDDKKQVHMEGYAVTAQC 400

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KEVDNDFF 317
           + L ++      I   D P+L  +++        DV    KDV         + +  ++ 
Sbjct: 401 MALVRDNCL---IPTKDAPELGYIRESSDKQYVPDVYYKEKDVYGNEVQRLGRPLPVEYL 457

Query: 318 LVVVKILDHQGPL---------SSTFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRIS 365
           LV V       PL         S  FP+ENR     ++   AL  +L +S S+  ++ +S
Sbjct: 458 LVDVPASTPMVPLFTFHERKDVSQYFPVENRMIDGHIQDFAALSDYLAKSRSMDFLECMS 517

Query: 366 DFHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
           DFHLL +L R   L + S +  L + V+ +
Sbjct: 518 DFHLLFYLYRMDMLPMKSQMGPLLEAVRTK 547


>gi|195036638|ref|XP_001989777.1| GH18610 [Drosophila grimshawi]
 gi|193893973|gb|EDV92839.1| GH18610 [Drosophila grimshawi]
          Length = 655

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/329 (25%), Positives = 147/329 (44%), Gaps = 60/329 (18%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNF 172
           + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V  
Sbjct: 253 LNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKVAA 310

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT------- 225
           IYEPPQ+ + + + I  DE  ++ VDA+A  LG+KK+G+IFT  I ++    T       
Sbjct: 311 IYEPPQESSRDSINICPDEGADE-VDAVAKALGLKKIGWIFTDLITEDASAGTVKQIRGI 369

Query: 226 ----LSNREVLQAVEF-HAECNMEEWVT------AVVKLEVSEEGSAAIHFEAFQMSDMC 274
               L+ +E + A E  +   N  ++ T        V + V+ + +  +H E + +S  C
Sbjct: 370 ETHFLTAQECISAGELQNRHPNPCKYATNGTFGSKFVTICVTGDQTKQVHMEGYAVSAQC 429

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDNDFF 317
           + L ++      I   D P+L  +++        DV    KD          + +  ++ 
Sbjct: 430 MALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARPLPVEYL 486

Query: 318 LVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISD 366
           LV V       P  +         FPIENR     ++   AL S+L+       ++ ISD
Sbjct: 487 LVDVPASTPLQPQYTFTEYDKRQAFPIENRYIDGQLQDFNALSSYLSAWADDEFLEAISD 546

Query: 367 FHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
           FHLL++L +   L L   +  L + V+A+
Sbjct: 547 FHLLVYLYKMDMLPLRQHMGPLLEAVRAR 575


>gi|48133028|ref|XP_393332.1| PREDICTED: nuclear protein localization protein 4 homolog [Apis
           mellifera]
          Length = 646

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 82/310 (26%), Positives = 142/310 (45%), Gaps = 58/310 (18%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
           + + RQ   H ++V F+   +   ++++N   +   +R GF+YG       V    R  V
Sbjct: 230 ITLNRQTYRHVDNVMFEN--SSLVERFLNYWRSTGHQRIGFLYGRYEIHTDVPLGIRAVV 287

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
             IYEPPQ+ T+  + +L DE+E  +VD +A  L +KK+G+IFT  I           ++
Sbjct: 288 TAIYEPPQESTKNSIRLLPDEKE-TIVDELAHLLNLKKIGWIFTDLIADDIKIGTVKHVR 346

Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
           N + + LS +E + A  F  +    C       + +  V + V+ +    IH E +Q+S+
Sbjct: 347 NIESHFLSAQECIMAGYFQNKYPNPCRFSPNNYFGSKFVTVCVTGDEKNQIHMEGYQVSN 406

Query: 273 MCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKD---------VKEVDND 315
            C  L ++G     +   D P+L         +   DV    KD          + +  +
Sbjct: 407 QCTALVRDGCL---VPTKDAPELGYVIESTDKQYVPDVFYKEKDSYGNEVSRLARPLPVE 463

Query: 316 FFLVVVKILDHQGP-----LSST---FPIENRTTQVTMR---ALKSHLNRSPSLPLVKRI 364
           + LV V       P     +S+T   FPIENR     ++   +L S++ +      ++ +
Sbjct: 464 YLLVDVPASTPLTPQFTFHISNTITPFPIENRFIDGQIQEFSSLCSYMQQFTKEQFLEAV 523

Query: 365 SDFHLLLFLA 374
           SDFHLL+F+A
Sbjct: 524 SDFHLLIFIA 533


>gi|114809978|gb|ABI81494.1| nuclear protein localization 4 [Squalus mitsukurii]
          Length = 355

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 118/252 (46%), Gaps = 50/252 (19%)

Query: 167 RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI--------- 217
           R EV  IYEPPQ GT   L  L D + E +VD IAA LG++KVG+IFT  +         
Sbjct: 14  RAEVAAIYEPPQVGTANSLEFLEDPKAE-VVDEIAAKLGLRKVGWIFTDLLSEDTRKGTV 72

Query: 218 --MQNKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAF 268
             ++NK  + LS  E + A +F  +    C +     + +  V +  +      +HFE +
Sbjct: 73  KFIRNKDAHFLSAEECITAGDFQNKHPNVCRLSPVNHFGSKFVTVLATGGPDNQVHFEGY 132

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN- 314
           Q+S+ C+ L  +  F   +   D P+L  +K             KD+   G ++ ++   
Sbjct: 133 QVSNQCMALVGDDCF---LPCRDAPELGYVKESSSGQCVPDVFYKDIDKFGNEITQLARP 189

Query: 315 ---DFFLVVVKILDHQGPLSS------TFPIENR----TTQVTMRALKSHLNRSPSLPLV 361
              ++ ++ +     + P+ +       FP+ NR     TQ     L S+L+++ S   +
Sbjct: 190 LPVEYLIIDIPTTFPKDPVYTFAASPHPFPVVNRDVVGKTQ-DFHTLASYLSQNQSAVFL 248

Query: 362 KRISDFHLLLFL 373
             IS+FHLLLFL
Sbjct: 249 DVISNFHLLLFL 260


>gi|195504288|ref|XP_002099014.1| GE10682 [Drosophila yakuba]
 gi|194185115|gb|EDW98726.1| GE10682 [Drosophila yakuba]
          Length = 624

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 75/352 (21%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V
Sbjct: 219 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 276

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + I  DE  +  VDA+A+ LG+KK+G+IFT  I +   D ++   +
Sbjct: 277 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITE---DASIGTVK 332

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 333 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 391

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
            +S  C+ L ++      I   D P+L  +++        DV    KD          + 
Sbjct: 392 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 448

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P+ +         FPIENR     ++   AL  +L+       
Sbjct: 449 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGDEEF 508

Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
           ++ ISDFHLL++L +   L L   +  L + V+       AQ  V + +KLL
Sbjct: 509 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNQAAQFKVVDVWKLL 560


>gi|62484495|ref|NP_733110.2| Npl4 ortholog, isoform B [Drosophila melanogaster]
 gi|281362576|ref|NP_001163731.1| Npl4 ortholog, isoform D [Drosophila melanogaster]
 gi|61679394|gb|AAN14058.2| Npl4 ortholog, isoform B [Drosophila melanogaster]
 gi|272477171|gb|ACZ95025.1| Npl4 ortholog, isoform D [Drosophila melanogaster]
          Length = 624

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 75/352 (21%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V
Sbjct: 219 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 276

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + I  DE  +  VDA+A+ LG+KK+G+IFT  I     D ++   +
Sbjct: 277 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITD---DASIGTVK 332

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 333 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 391

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KE 311
            +S  C+ L ++      I   D P+L  +++        DV    KD+         + 
Sbjct: 392 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDLYGNEVQRLARP 448

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P+ +         FPIENR     ++   AL  +L+       
Sbjct: 449 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 508

Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
           ++ ISDFHLL++L +   L L   +  L + V+       AQ  V + +KLL
Sbjct: 509 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNQAAQFKVVDVWKLL 560


>gi|194908465|ref|XP_001981774.1| GG12234 [Drosophila erecta]
 gi|190656412|gb|EDV53644.1| GG12234 [Drosophila erecta]
          Length = 624

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 75/352 (21%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V
Sbjct: 219 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 276

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + I  DE  +  VDA+A+ LG+KK+G+IFT  I +   D ++   +
Sbjct: 277 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITE---DASVGTVK 332

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 333 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 391

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
            +S  C+ L ++      I   D P+L  +++        DV    KD          + 
Sbjct: 392 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 448

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P+ +         FPIENR     ++   AL  +L+       
Sbjct: 449 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 508

Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
           ++ ISDFHLL++L +   L L   +  L + V+       AQ  V + +KLL
Sbjct: 509 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNEAAQFKVVDVWKLL 560


>gi|379699068|gb|AFD10756.1| FI19201p1 [Drosophila melanogaster]
          Length = 628

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 75/352 (21%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V
Sbjct: 223 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 280

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + I  DE  +  VDA+A+ LG+KK+G+IFT  I     D ++   +
Sbjct: 281 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITD---DASIGTVK 336

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 337 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 395

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KE 311
            +S  C+ L ++      I   D P+L  +++        DV    KD+         + 
Sbjct: 396 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDLYGNEVQRLARP 452

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P+ +         FPIENR     ++   AL  +L+       
Sbjct: 453 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 512

Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
           ++ ISDFHLL++L +   L L   +  L + V+       AQ  V + +KLL
Sbjct: 513 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNQAAQFKVVDVWKLL 564


>gi|291226637|ref|XP_002733298.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Saccoglossus kowalevskii]
          Length = 637

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 83/309 (26%), Positives = 136/309 (44%), Gaps = 61/309 (19%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYG------TVLEDKRVEVNF 172
           + RQ   H ++V F+      F ++++       +R G +YG      +V    +  V+ 
Sbjct: 232 LNRQHYRHVDNVLFEN--PRLFDRFIDYWRKSGNQRAGLLYGRYEYHESVPLGIKASVSA 289

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
           IYEPPQ  T   +  L D+    +VD IA  LG++KVG+IFT  +           ++N 
Sbjct: 290 IYEPPQDCTPNSIEFL-DDPHGSVVDEIAGKLGLRKVGWIFTDLVAEDLRKGTVKHVRNM 348

Query: 222 KDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
             + LS  E + A EF  +    C +     + +  V + V+ + +  IH EA+Q+S+ C
Sbjct: 349 DAHFLSAEECIMAGEFQNQNPNKCKLATDGNFGSKFVTVIVTGDSTNQIHTEAYQVSNQC 408

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDVKEVDNDFFL------VV 320
           + L  +   +  I   D P+L  +K+        DV    KD    +N+  L      V 
Sbjct: 409 MALVND---DCLIPTKDAPELGYIKESSNEQYVPDVFYKMKD--SYNNEVALLARPLPVE 463

Query: 321 VKILDHQG--PLSS----------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRIS 365
             I+D     PL             FPIENR+    ++    L  +L   P+  L+  + 
Sbjct: 464 YLIIDVPAGFPLEPKYTFLDENERKFPIENRSAIAEIQNFETLSQYLQYYPAAKLLDAMC 523

Query: 366 DFHLLLFLA 374
           D H+LL+LA
Sbjct: 524 DLHVLLYLA 532


>gi|62484318|ref|NP_651407.3| Npl4 ortholog, isoform A [Drosophila melanogaster]
 gi|74868217|sp|Q9VBP9.3|NPL4_DROME RecName: Full=Nuclear protein localization protein 4 homolog
 gi|16182532|gb|AAL13515.1| GH03617p [Drosophila melanogaster]
 gi|61679395|gb|AAF56480.3| Npl4 ortholog, isoform A [Drosophila melanogaster]
 gi|220951542|gb|ACL88314.1| CG4673-PA [synthetic construct]
          Length = 652

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 75/352 (21%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V
Sbjct: 247 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 304

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + I  DE  +  VDA+A+ LG+KK+G+IFT  I     D ++   +
Sbjct: 305 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITD---DASIGTVK 360

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 361 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 419

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KE 311
            +S  C+ L ++      I   D P+L  +++        DV    KD+         + 
Sbjct: 420 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDLYGNEVQRLARP 476

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P+ +         FPIENR     ++   AL  +L+       
Sbjct: 477 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 536

Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
           ++ ISDFHLL++L +   L L   +  L + V+       AQ  V + +KLL
Sbjct: 537 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNQAAQFKVVDVWKLL 588


>gi|125602545|gb|EAZ41870.1| hypothetical protein OsJ_26416 [Oryza sativa Japonica Group]
          Length = 202

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 58/94 (61%), Gaps = 2/94 (2%)

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTF 334
           ++LFK+G  +T+I + DDP+LSKM+K+VV GGKD  EVDNDFFLV VKI DHQ  L ++ 
Sbjct: 101 LQLFKDGVLQTKIRDKDDPRLSKMRKEVVAGGKDAMEVDNDFFLVPVKISDHQ--LQTSS 158

Query: 335 PIENRTTQVTMRALKSHLNRSPSLPLVKRISDFH 368
             + +      R+   H   + +   + R  D H
Sbjct: 159 QNKKKDKAAVFRSCPFHFYNAKNTAEIFRAHDEH 192


>gi|320163920|gb|EFW40819.1| nuclear protein localization protein [Capsaspora owczarzaki ATCC
           30864]
          Length = 526

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 88/331 (26%), Positives = 146/331 (44%), Gaps = 66/331 (19%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVN-ETLAFAVKRGGFM------YGTVLEDKRVEVN 171
           + RQ   H + V F++  A    Q+++    +  V+R G++      Y  V    +  V 
Sbjct: 135 LARQAYRHVDYVEFEQ--ASILNQFIDFWRQSGGVQRLGYLIGRYEPYPEVPLGIKAVVA 192

Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIF-------TQTIM----QN 220
            IYEPPQ G  + L +L D  EE ++  +A+ LG+++VG+IF       T+T M    ++
Sbjct: 193 AIYEPPQTGAVDSLELLDDPNEEAVL-GVASALGLQRVGWIFTDLEDDGTRTGMVQCKRH 251

Query: 221 KKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDM 273
           K  + LS  E++ A +  A         +  E+ +  V + +S   S  +  EA+Q+S+ 
Sbjct: 252 KDTFFLSASEIMMAADLQARHPNPTTAASTGEFGSKFVTVVISGNASREVSLEAYQVSNQ 311

Query: 274 CVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEV--------------------D 313
           C  L +    E  +    DP  S M        + + EV                     
Sbjct: 312 CATLVQN---EFVVPSTRDP--SAMTVRETTDAQYIPEVFYRYKNEYGTEVVSTARPSFP 366

Query: 314 NDFFLVVVKILDHQGP-------LSSTFPIENRTT---QVTMRALKSHLNRSPS-LPLVK 362
            D+ +V +     Q P       +S+ F I NR+       + ALKS+L   PS    + 
Sbjct: 367 VDYLIVNLGNGSPQSPNPTFGLGISTPFTIGNRSVLGQNADVVALKSYLAAQPSNAAFLA 426

Query: 363 RISDFHLLLFLA--RFLDLNSDVPALAQCVQ 391
            +SDFHLL+FLA    L L++ +P L Q ++
Sbjct: 427 LVSDFHLLVFLATSSVLMLSNSLPELLQAIK 457


>gi|195573927|ref|XP_002104943.1| GD18187 [Drosophila simulans]
 gi|194200870|gb|EDX14446.1| GD18187 [Drosophila simulans]
          Length = 641

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 139/315 (44%), Gaps = 66/315 (20%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V
Sbjct: 247 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 304

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + I  DE  +  VDA+A+ LG+KK+G+IFT  I +   D ++   +
Sbjct: 305 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITE---DASIGTVK 360

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 361 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 419

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
            +S  C+ L ++      I   D P+L  +++        DV    KD          + 
Sbjct: 420 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 476

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P+ +         FPIENR     ++   AL  +L+       
Sbjct: 477 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 536

Query: 361 VKRISDFHLLLFLAR 375
           ++ ISDFHLL++L +
Sbjct: 537 LEAISDFHLLVYLYK 551


>gi|195349473|ref|XP_002041269.1| GM10234 [Drosophila sechellia]
 gi|194122964|gb|EDW45007.1| GM10234 [Drosophila sechellia]
          Length = 725

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 139/315 (44%), Gaps = 66/315 (20%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V
Sbjct: 247 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 304

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + I  DE  +  VDA+A+ LG+KK+G+IFT  I +   D ++   +
Sbjct: 305 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITE---DASIGTVK 360

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 361 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 419

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
            +S  C+ L ++      I   D P+L  +++        DV    KD          + 
Sbjct: 420 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 476

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P+ +         FPIENR     ++   AL  +L+       
Sbjct: 477 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 536

Query: 361 VKRISDFHLLLFLAR 375
           ++ ISDFHLL++L +
Sbjct: 537 LEAISDFHLLVYLYK 551


>gi|261338793|gb|ACX70078.1| UT01770p [Drosophila melanogaster]
          Length = 628

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 88/352 (25%), Positives = 153/352 (43%), Gaps = 75/352 (21%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V
Sbjct: 223 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 280

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + I  DE  +  VDA+A+ LG+KK+G+IFT  I     D ++   +
Sbjct: 281 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITD---DASIGTVK 336

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 337 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 395

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
            +S  C+ L ++      I   D P+L  +++        DV    KD          + 
Sbjct: 396 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDRYGNEVQRLARP 452

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P+ +         FPIENR     ++   AL  +L+       
Sbjct: 453 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 512

Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
           ++ ISDFHLL++L +   L L   +  L + V+       AQ  V + +KLL
Sbjct: 513 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNQAAQFKVVDVWKLL 564


>gi|346469505|gb|AEO34597.1| hypothetical protein [Amblyomma maculatum]
          Length = 611

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 139/314 (44%), Gaps = 63/314 (20%)

Query: 117 MRVTRQENPHCESVSFDR-DCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ   H +SV F+     + F  +  ET+    +R GF+YG     K      R  
Sbjct: 224 LTLNRQVYRHVDSVMFENPSIVEHFLNFWRETMH---QRIGFLYGRYEHHKDVPLGIRAT 280

Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
           V  IYEPPQ  T + + +L DE++  LVD +A+ LG+ +VG+IFT  +           +
Sbjct: 281 VAAIYEPPQDSTPDSIKLLPDEKQ-PLVDELASRLGLVRVGWIFTDLVPEDLQKGTVKHL 339

Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
           +N +   LS +E + A  F       C +     + +  V + V+ +    +H E +Q+S
Sbjct: 340 RNIESCFLSAQECIMAGHFQNNHPNPCKLSPDGYFGSKFVTVCVTGDLENRVHMEGYQVS 399

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDN 314
           + C+ L ++      +   D P+L  +++        DV     D          + +  
Sbjct: 400 NQCMALVRDNCL---VPTRDAPELGYVRESSNAQYVPDVFYKYTDSYGNEVTQLARPLPI 456

Query: 315 DFFLVVVKILDHQGPL-----SSTFPIENRTTQ--------VTMRALK-SHLNRSPSLPL 360
           ++ LV V +   + PL        FP+ENR  +        VT R  + +  N      +
Sbjct: 457 EYLLVDVPVSTPKEPLFTFSAKHCFPVENRLLEGHIQDLGAVTARLSRFTAANNGGQNAV 516

Query: 361 VKRISDFHLLLFLA 374
           ++  SDFHLLL+LA
Sbjct: 517 LEAFSDFHLLLYLA 530


>gi|156353418|ref|XP_001623063.1| predicted protein [Nematostella vectensis]
 gi|156209717|gb|EDO30963.1| predicted protein [Nematostella vectensis]
          Length = 541

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 81/311 (26%), Positives = 141/311 (45%), Gaps = 59/311 (18%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ+  H +++ F+       Q +++       +R GF+YG   E K      R  V
Sbjct: 137 ITLKRQDYRHVDNILFEN--PSIVQPFLDYWRRTGNQRLGFLYGRYEEHKDVPLGIRAVV 194

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT----- 225
             IYEPPQ+GT + L +L +   +  V+ +AA LG++ VG+IF+  +  + K  T     
Sbjct: 195 AAIYEPPQEGTPDSLELLPNTNLDT-VNKLAADLGLRIVGWIFSDLVPDDVKKGTVKHLR 253

Query: 226 ------LSNREVLQAVEFH----AECNME---EWVTAVVKLEVSEEGSAAIHFEAFQMSD 272
                 L+  E + A +F     + C      ++ +  V + VS      I F+ FQ+S+
Sbjct: 254 DINTHFLTAEECIMAGDFQNQYVSPCRYASSGKFGSKFVTVVVSGNDENQIGFDGFQVSN 313

Query: 273 MCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDV----KEVDND 315
            C+ L ++   +  +   DDP L  ++             KD    G +V    + +  +
Sbjct: 314 QCMALVRD---DCLVPTLDDPGLGYIRESSSEQYVPDVFYKDTDSYGNEVTLLARPMPIE 370

Query: 316 FFLVVVK-------ILDHQGPLS--STFPIENRTTQVTMR---ALKSHLNRSPSLPLVKR 363
           + LV V        +     P S  + FPIENR     ++   A++ HL++ P    +  
Sbjct: 371 YLLVQVSAAFPVQPVYTFSPPPSNGAVFPIENRENIGELQDFDAVRRHLSQYPPGDFLSA 430

Query: 364 ISDFHLLLFLA 374
           +S+FHLL+FLA
Sbjct: 431 MSNFHLLVFLA 441


>gi|380028897|ref|XP_003698120.1| PREDICTED: nuclear protein localization protein 4 homolog [Apis
           florea]
          Length = 646

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 142/313 (45%), Gaps = 64/313 (20%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
           + + RQ   H ++V F+   +   ++++N   +   +R GF+YG       V    R  V
Sbjct: 230 ITLNRQTYRHVDNVMFEN--SSLVERFLNYWRSTGHQRIGFLYGRYEIHTDVPLGIRAVV 287

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
             IYEPPQ+ T+  + +L DE+E  +VD +A  L +K++G+IFT  I           ++
Sbjct: 288 TAIYEPPQESTKNSIRLLPDEKE-TIVDELAHLLNLKRIGWIFTDLIADDIKIGTVKHVR 346

Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
           N + + LS +E + A  F  +    C       + +  V + V+ +    IH E +Q+S+
Sbjct: 347 NIESHFLSAQECIMAGYFQNKYPNLCRFSPNNYFGSKFVTVCVTGDEKNQIHMEGYQVSN 406

Query: 273 MCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKD---------VKEVDND 315
            C+ L ++G     +   D P+L         +   DV    KD          + +  +
Sbjct: 407 QCMALVRDGCL---VPTKDAPELGYVIESTDKQYVPDVFYKEKDSYGNEVSRLARPLPVE 463

Query: 316 FFLVVVKILDHQGPLS-----------STFPIENRTTQVTMR---ALKSHLNRSPSLPLV 361
           + LV V       PL+           + FPIENR     ++   +L S++ +      +
Sbjct: 464 YLLVDVPA---STPLTPQFTFYINNTITPFPIENRFIDGQIQEFSSLCSYMQQFTKEQFL 520

Query: 362 KRISDFHLLLFLA 374
           + +SDFHLL+F+A
Sbjct: 521 EAVSDFHLLIFIA 533


>gi|388329682|gb|AFK29236.1| CG4673-PB-like protein [Drosophila buzzatii]
          Length = 626

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 78/316 (24%), Positives = 139/316 (43%), Gaps = 66/316 (20%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V
Sbjct: 222 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 279

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ + + + I  DE  ++ V+A+A  LG+KK+G+IFT  I +   D T    +
Sbjct: 280 AAIYEPPQESSRDSINICSDEGADE-VEAVAKALGLKKIGWIFTDLITE---DATAGTVK 335

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 336 QIRGIETHF-LTAQECITAGELQNRHPNPCKYAKNGTFGSKFVTICVTGDQTKQVHMEGY 394

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
            +S  C+ L ++   +  I   D P+L  +++        DV    KD          + 
Sbjct: 395 AVSAQCMALVRD---DCLIPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 451

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P  +         FPIENR     ++   AL S+L+       
Sbjct: 452 LPVEYLLVDVPASTPLQPQYTFTEYDKRQPFPIENRYIDGHLQDFNALSSYLSAWAEEDF 511

Query: 361 VKRISDFHLLLFLARF 376
           ++ ISDFHLL++L + 
Sbjct: 512 LEAISDFHLLIYLYKM 527


>gi|388329683|gb|AFK29237.1| CG4673-PA-like protein [Drosophila buzzatii]
          Length = 655

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 139/315 (44%), Gaps = 66/315 (20%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V
Sbjct: 251 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 308

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ + + + I  DE  ++ V+A+A  LG+KK+G+IFT  I +   D T    +
Sbjct: 309 AAIYEPPQESSRDSINICSDEGADE-VEAVAKALGLKKIGWIFTDLITE---DATAGTVK 364

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 365 QIRGIETHF-LTAQECITAGELQNRHPNPCKYAKNGTFGSKFVTICVTGDQTKQVHMEGY 423

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
            +S  C+ L ++   +  I   D P+L  +++        DV    KD          + 
Sbjct: 424 AVSAQCMALVRD---DCLIPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 480

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P  +         FPIENR     ++   AL S+L+       
Sbjct: 481 LPVEYLLVDVPASTPLQPQYTFTEYDKRQPFPIENRYIDGHLQDFNALSSYLSAWAEEDF 540

Query: 361 VKRISDFHLLLFLAR 375
           ++ ISDFHLL++L +
Sbjct: 541 LEAISDFHLLIYLYK 555


>gi|347966940|ref|XP_550827.4| AGAP001986-PA [Anopheles gambiae str. PEST]
 gi|333469830|gb|EAL38499.4| AGAP001986-PA [Anopheles gambiae str. PEST]
          Length = 651

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 140/314 (44%), Gaps = 59/314 (18%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
           + RQ   H ++V F+       ++++N       +R GF++G       V    R  V  
Sbjct: 253 LNRQPYRHVDNVMFEN--TGIVERFLNYWRTTGHQRIGFLFGKYEIHPDVPLGIRARVAA 310

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
           IYEPPQ+   + + +L D+  +  VD +A  LG+++VG+IFT  +           ++N 
Sbjct: 311 IYEPPQESNRDSIRLL-DDPHDADVDELARQLGLQRVGWIFTDLLSENLASGTVKHVRNI 369

Query: 222 KDYTLSNREVLQAV----EFHAECNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
           K + L+ +E + A     ++   C       + +  V + V+ +    +H E + +S  C
Sbjct: 370 KTHFLTAQECILAGHLQNKYPNRCKDSSNGYFGSKFVTVCVTGDEKKQVHMEGYAVSAQC 429

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KEVDNDFF 317
           + L ++      I   D P+L  +++        DV    KDV         + +  ++ 
Sbjct: 430 MALVRDNCL---IPTKDAPELGYIRESSDKQYVPDVYYKEKDVYGNEVQRLGRPLPVEYL 486

Query: 318 LVVVKILDHQGPLSS---------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRIS 365
           LV V       PL +          FP+ENR     ++   AL ++L +S SLP +  +S
Sbjct: 487 LVDVPASTPLVPLYTFLERKDAKQYFPVENRMIDGHIQDFAALANYLAKSRSLPFLDAVS 546

Query: 366 DFHLLLFLARFLDL 379
           DFHLL +L R  D+
Sbjct: 547 DFHLLFYLYRMEDM 560


>gi|383861559|ref|XP_003706253.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Megachile rotundata]
          Length = 687

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 81/313 (25%), Positives = 140/313 (44%), Gaps = 64/313 (20%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YG       V    R  V
Sbjct: 234 ITLNRQTYRHVDNVMFEN--PSLVERFLNYWRNTGHQRIGYLYGRYEIHTDVPLGIRAVV 291

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
             IYEPPQ+ T++ + +L DE +  LVD +A  L ++KVG+IFT  I           ++
Sbjct: 292 AAIYEPPQESTKDTIRLLPDERK-ALVDELARLLNLRKVGWIFTDLIADDVKKGTVKHVR 350

Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
           N + + LS +E + A  F  E    C       + +  V + V+ +    +H E +Q+S+
Sbjct: 351 NIESHFLSAQECIMAGYFQNEYPNPCRFSPSNYFGSKFVTVCVTGDDKNQVHMEGYQVSN 410

Query: 273 MCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKD---------VKEVDND 315
            C+ L ++G     +   D P+L         +   DV    KD          + +  +
Sbjct: 411 QCMALVRDGCL---VPTKDAPELGYVIESTDKQYVPDVFYKEKDSYGNEVPRLARPLPVE 467

Query: 316 FFLVVVKILDHQGPLS-----------STFPIENRTTQVTMR---ALKSHLNRSPSLPLV 361
           + LV V       PL+           + FPIENR     ++   +L S++ +      +
Sbjct: 468 YLLVDVPA---STPLTPQFTFYVSDNITPFPIENRLVDGQVQEFSSLCSYMQQFNQDEFL 524

Query: 362 KRISDFHLLLFLA 374
           + +SDFHLLLF+A
Sbjct: 525 EAVSDFHLLLFIA 537


>gi|42761319|dbj|BAD11562.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|45735811|dbj|BAD12847.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 178

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 35/53 (66%), Positives = 44/53 (83%)

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
           ++LFK+G  +T+I + DDP+LSKM+K+VV GGKD  EVDNDFFLV VKI DHQ
Sbjct: 124 LQLFKDGVLQTKIRDKDDPRLSKMRKEVVAGGKDAMEVDNDFFLVPVKISDHQ 176


>gi|195453878|ref|XP_002073983.1| GK14396 [Drosophila willistoni]
 gi|194170068|gb|EDW84969.1| GK14396 [Drosophila willistoni]
          Length = 660

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 83/334 (24%), Positives = 146/334 (43%), Gaps = 66/334 (19%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT  +        R  V
Sbjct: 253 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRARV 310

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ + + L IL DE     V+A+AA LG+KK+G+IFT  I     D T+   +
Sbjct: 311 AAIYEPPQESSRDSLNILPDEGSAD-VEAVAAALGLKKIGWIFTDLIT---ADATVGTVK 366

Query: 231 VLQAVEFH----AEC-----------NMEEWVT------AVVKLEVSEEGSAAIHFEAFQ 269
            ++ ++ H     EC           N  ++ T        V + V+ + +  +H E + 
Sbjct: 367 QIRGIDTHFLTAQECITAGELQNRHPNPCKYATNGVFGSKFVTICVTGDQTKQVHMEGYA 426

Query: 270 MSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KEV 312
           +S  C+ L ++      +   D P+L  +++        DV    KD          + +
Sbjct: 427 VSAQCMALVRDNCL---LPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLGRPL 483

Query: 313 DNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPLV 361
             ++ LV V       P  +         FP+ENR     ++   AL ++L+       +
Sbjct: 484 PVEYLLVDVPASTPLQPQYTFTKYDQRQPFPVENRYIDGHLQDFNALSNYLSAWGDEEFL 543

Query: 362 KRISDFHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
           + ISDFHLL+FL +   L L   +  L + V+ +
Sbjct: 544 EAISDFHLLIFLYKMDMLPLRQHMAPLMEAVRTK 577


>gi|156349269|ref|XP_001621988.1| hypothetical protein NEMVEDRAFT_v1g176363 [Nematostella vectensis]
 gi|156208367|gb|EDO29888.1| predicted protein [Nematostella vectensis]
          Length = 614

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 84/315 (26%), Positives = 144/315 (45%), Gaps = 66/315 (20%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ+  H +++ F+       Q +++       +R GF+YG   E K      R  V
Sbjct: 209 ITLKRQDYRHVDNILFEN--PSIVQPFLDYWRRTGNQRLGFLYGRYEEHKDVPLGIRAVV 266

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT----- 225
             IYEPPQ+GT + L +L +   +  V+ +AA LG++ VG+IF+  +  + K  T     
Sbjct: 267 AAIYEPPQEGTPDSLELLPNPNLDT-VNKLAADLGLRIVGWIFSDLVPDDVKKGTVKHLR 325

Query: 226 ------LSNREVLQAVEFH----AECNM-------EEWVTAVVKLEVSEEGSAAIHFEAF 268
                 L+  E + A +F     + C          ++VT VV  E ++E    I F+ F
Sbjct: 326 DINTHFLTAEECIMAGDFQNQYVSPCRYASSGKFGSKFVTVVVS-EGNDENQ--IGFDGF 382

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDV----KE 311
           Q+S+ C+ L ++   +  +   DDP L  ++             KD    G +V    + 
Sbjct: 383 QVSNQCMALVRD---DCLVPTLDDPGLGYIRESSSEQYVPDVFYKDTDSYGNEVTLLARP 439

Query: 312 VDNDFFLVVVK-------ILDHQGPLS--STFPIENRTTQVTMR---ALKSHLNRSPSLP 359
           +  ++ LV V        +     P S  + FPIENR     ++   A++ HL++ P   
Sbjct: 440 MPIEYLLVQVSAAFPVQPVYTFSPPPSNGAVFPIENRENIGELQDFDAVRRHLSQYPPGN 499

Query: 360 LVKRISDFHLLLFLA 374
            +  +S+FHLL+FLA
Sbjct: 500 FLSAMSNFHLLVFLA 514


>gi|57899187|dbj|BAD87239.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 196

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 35/53 (66%), Positives = 44/53 (83%)

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
           ++LFK+G  +T+I + DDP+LSKM+K+VV GGKD  EVDNDFFLV VKI DHQ
Sbjct: 100 LQLFKDGVLQTKIRDKDDPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQ 152


>gi|195112190|ref|XP_002000657.1| GI10357 [Drosophila mojavensis]
 gi|193917251|gb|EDW16118.1| GI10357 [Drosophila mojavensis]
          Length = 653

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 79/316 (25%), Positives = 138/316 (43%), Gaps = 72/316 (22%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNF 172
           + RQ   H ++V F+       ++++N       +R G++YGT  +        R  V  
Sbjct: 252 LNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGWLYGTYEQHTDVPLGIRARVAA 309

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL 232
           IYEPPQ+ + + + I  DE  ++ VDA+A  LG+KK+G+IFT  I ++    T+     +
Sbjct: 310 IYEPPQESSRDSINICPDENADE-VDAVAKALGLKKIGWIFTDLITEDASAGTVKQ---I 365

Query: 233 QAVEFH----AEC-----------NMEEWVT------AVVKLEVSEEGSAAIHFEAFQMS 271
           + +E H     EC           N  ++ T        V + V+ + +  +H E + +S
Sbjct: 366 RGIETHFLTAQECITAGELQNRHPNPCKYATNGTFGSKFVTICVTGDQTKQVHMEGYAVS 425

Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDN 314
             C+ L ++      I   D P+L  +++        DV    KD          + +  
Sbjct: 426 AQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARPLPV 482

Query: 315 DFFLVVVKI------------LDHQGPLSSTFPIENRTTQVTMR---ALKSHLNRSPSLP 359
           ++ LV V               D + P    FPIENR     ++   AL ++L+      
Sbjct: 483 EYLLVDVPASTPLQPQFTFTEYDKRQP----FPIENRYIDGHLQDFNALSNYLSAWGEED 538

Query: 360 LVKRISDFHLLLFLAR 375
            ++ ISDFHLL++L +
Sbjct: 539 FLEAISDFHLLVYLYK 554


>gi|218200668|gb|EEC83095.1| hypothetical protein OsI_28230 [Oryza sativa Indica Group]
          Length = 163

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 41/92 (44%), Positives = 56/92 (60%), Gaps = 2/92 (2%)

Query: 277 LFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPI 336
           LFK+G  +T+I + DDP+LSKM+K+VV GGKD  EVDNDFFLV VKI DHQ  L ++   
Sbjct: 64  LFKDGVLQTKIRDKDDPRLSKMRKEVVAGGKDAMEVDNDFFLVPVKISDHQ--LQTSSQN 121

Query: 337 ENRTTQVTMRALKSHLNRSPSLPLVKRISDFH 368
           + +      R+   H   + +   + R  D H
Sbjct: 122 KKKDKAAVFRSCPFHFYNAKNTAEIFRAHDEH 153


>gi|361131707|gb|EHL03359.1| putative Nuclear protein localization protein 4 [Glarea lozoyensis
           74030]
          Length = 477

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 103/379 (27%), Positives = 156/379 (41%), Gaps = 73/379 (19%)

Query: 98  NPAGSFGRKMTMDDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG-- 155
           NP     +    D L  K  R+ R+ +P    V F    ADA    + + L F  K G  
Sbjct: 57  NPWEVVKQHPVDDRLDKKDGRIPRKRDPKMYHVEF----ADA--SLIEKLLDFWRKSGAQ 110

Query: 156 --GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLGM 206
             G++YG   E   V       V  IYEPPQ   +E+  + L + E EK VD +A   G+
Sbjct: 111 RLGYLYGRYTEYTEVPLGIKAVVEAIYEPPQ--VDEIDGVTLNEWENEKDVDEVARLCGL 168

Query: 207 KKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWVT 248
           ++VG I+T  +     D           Y LS++E++ A    A+        +  ++ +
Sbjct: 169 QRVGVIWTDLLDSGVGDGSVVCKRHIDSYFLSSQEIMFAARQQAQHPKPTKWSDTGKFGS 228

Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKMK 299
           A V   +S + S  I   A+Q S+  V + +    E  +          E DD  +S+ +
Sbjct: 229 AFVTCVISGDESGQISISAYQASNSAVEMVRAELVEPSVDPSVMLVRNEEEDDGSISRTR 288

Query: 300 -------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST---------FPIENRTT 341
                  + +   G  V+E     F V  ++  L H  P +           FPIENR  
Sbjct: 289 YIPEVFYRKINEYGAQVQENAKPSFPVEYLLITLTHGFPSNPMPQFIADPPGFPIENRAV 348

Query: 342 QVTMRALKS-----HLNRSPSL----PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQA 392
               + LK       L+R+  L      +K ISDFHLL FL  +  L  D  AL   V  
Sbjct: 349 IGVDQDLKDLSKRLQLDRNGELTNQDAAIKSISDFHLLCFLHGYGVLGKDEEALLCKVAT 408

Query: 393 QTAVPEGYKLLINSMAETV 411
           Q  + +G++LL  S   T+
Sbjct: 409 QHDLADGFQLLSTSGWATL 427


>gi|326434890|gb|EGD80460.1| hypothetical protein PTSG_11104 [Salpingoeca sp. ATCC 50818]
          Length = 550

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 86/330 (26%), Positives = 139/330 (42%), Gaps = 64/330 (19%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNF 172
           + RQE  H + V F+    +   ++++       +R GFMYG+  E  +V       V  
Sbjct: 202 LNRQEYRHVDYVEFESH--EILDKFLDAWRRSGAQRAGFMYGSYEEYDQVPLGVKAVVKA 259

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFT-------------QTIMQ 219
           IYEPPQ  +   L I  +  + + VD +A  LG+K+VG+IFT             + I  
Sbjct: 260 IYEPPQSNSALGLEIALESPQIQQVDQMAKQLGLKRVGWIFTDLEPDANGLAKYKRYIQP 319

Query: 220 NKKD-YTLSNREVLQAVEF---HAECNMEEWVTAVVK------LEVSEEGSAAIHFEAFQ 269
             +D Y +S  E+L A  F   H       + T  V       + VS +    I   A+Q
Sbjct: 320 GDEDSYVISAEEILNAAMFQNQHPNPVPPRFSTTGVHGSKFTTVVVSGDSEHQIVPRAYQ 379

Query: 270 MSDMCVRLFKEGWFETEIAEGDDPKLSKMK--------KDVVVGGKD-----VKEVDNDF 316
            S   + L ++          D P L  ++         +V+   +D     +K   +  
Sbjct: 380 ASTQAMNLVRDDILRP----SDSPALVAVRPADDTTFVPEVLFTQRDKYNNEIKHKADPL 435

Query: 317 FLVVVKILDHQG---------PLSSTFPIENRT---TQVTMRALKSHLNRSPSLPLVKRI 364
           F V    +D              S +FPIENR       ++ ALK HL  + S   ++++
Sbjct: 436 FPVAYLAVDLPAGGRMDATSTTFSVSFPIENREALGAPASLSALKQHLQDT-SKSFLQKV 494

Query: 365 SDFHLLLFLAR---FLDLNSDVPALAQCVQ 391
           SDFHLL++L     F+ L   +P+L + V+
Sbjct: 495 SDFHLLVYLMTDDMFVSLKESLPSLLRAVR 524


>gi|321464097|gb|EFX75107.1| hypothetical protein DAPPUDRAFT_306878 [Daphnia pulex]
          Length = 620

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 140/313 (44%), Gaps = 61/313 (19%)

Query: 115 KQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF-- 172
           K + +  Q   H ++V F+    +  +Q+++       +R GF+YG    D  ++V    
Sbjct: 225 KAITLNSQAYRHVDNVVFEN--PNLVEQFLHYWRVTGRQRIGFLYGRF--DIHLDVPLGI 280

Query: 173 ------IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI--------- 217
                 IYEPPQ+ T + + +L  +  + +VD +AA LG++ +G+IFT  +         
Sbjct: 281 KGIVAAIYEPPQESTRDSVRLLLPDNRQSVVDRVAAKLGLQCIGWIFTDLVAEDVSKGTV 340

Query: 218 --MQNKKDYTLSNREVLQAVEFHA----ECNMEE---WVTAVVKLEVSEEGSAAIHFEAF 268
              +N + + LS +E + A  F +     C +     + +  V + V+ +    +H E +
Sbjct: 341 RHFRNAETHFLSAQECIMAGNFQSLHPNPCRLSSDGYFGSKFVTVCVTGDSQNQVHMEGY 400

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV--KEVDND--- 315
           Q+S+ C  L ++   E  +   D P+L+  ++        DV    KD    EV  D   
Sbjct: 401 QVSNQCQALVRD---ECLVPTKDAPELAYARQSTAEKYIPDVYYKEKDSYGNEVTRDARP 457

Query: 316 ----FFLVVVKILDHQGPL-------SSTFPIENRTTQVTMR---ALKSHLNRSPSLPLV 361
               + LV V +     P+          FPIENR     ++   AL S+L ++     +
Sbjct: 458 LPVEYLLVDVPVSTPLEPIFTFSVGSRKPFPIENRLIDGHLQDLSALDSYLKQN-RQNFL 516

Query: 362 KRISDFHLLLFLA 374
           +  SDFH L+++A
Sbjct: 517 EAASDFHFLIYVA 529


>gi|225677845|gb|EEH16129.1| nuclear protein localization 4 [Paracoccidioides brasiliensis Pb03]
          Length = 667

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 92/320 (28%), Positives = 135/320 (42%), Gaps = 59/320 (18%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L +   
Sbjct: 286 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWSN 344

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
           EK +D +A   G++KVG IFT  +     D           Y LS+ E++ A +  A   
Sbjct: 345 EKDIDEVAKLCGLEKVGVIFTDLLDSGAGDGTVICKRHIDSYYLSSLEIVFAAQLQARYP 404

Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
              +W       +  V   +S +   AI   A+Q S+  V + K    E     G     
Sbjct: 405 KASKWSETGRFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 464

Query: 291 --DDPKLSKMK---------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------- 330
             ++P+    K         + V   G +V+E     F V  ++  L H  P        
Sbjct: 465 QENEPENDSSKIRYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPNAVFK 524

Query: 331 SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPAL 386
           +STFPIENR      + LKS  ++  S + P  +++ +SDFHLL FL     L+ D  +L
Sbjct: 525 NSTFPIENRAAIGESQELKSLFDQLVSHNDPHQVLRAVSDFHLLCFLCGLGILSKDEESL 584

Query: 387 AQCVQAQTAVPEGYKLLINS 406
             C  A+T  P     L+N+
Sbjct: 585 L-CTVARTHDPADGMQLVNT 603


>gi|226287417|gb|EEH42930.1| endoplasmic reticulum and nuclear membrane proteinc Npl4
           [Paracoccidioides brasiliensis Pb18]
          Length = 662

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 93/320 (29%), Positives = 138/320 (43%), Gaps = 59/320 (18%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L +   
Sbjct: 281 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWSN 339

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
           EK +D +A   G++KVG IFT  +     D           Y LS+ E++ A +  A   
Sbjct: 340 EKDIDEVAKLCGLEKVGVIFTDLLDSGAGDGTVICKRHIDSYYLSSLEIVFAAQLQARYP 399

Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
              +W       +  V   +S +   AI   A+Q S+  V + K    E     G     
Sbjct: 400 KASKWSETGRFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 459

Query: 291 --DDPK--LSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------- 330
             ++P+   SK++       + V   G +V+E     F V  ++  L H  P        
Sbjct: 460 QENEPEDDSSKIRYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPNAVFK 519

Query: 331 SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPAL 386
           +STFPIENR      + LKS  ++  S + P  +++ +SDFHLL FL     L+ D  +L
Sbjct: 520 NSTFPIENRAAIGESQELKSLFDQLVSHNDPHQVLRAVSDFHLLCFLCGLGILSKDEESL 579

Query: 387 AQCVQAQTAVPEGYKLLINS 406
             C  A+T  P     L+N+
Sbjct: 580 L-CTVARTHDPADGMQLVNT 598


>gi|198453799|ref|XP_001359342.2| GA18345, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198132518|gb|EAL28487.2| GA18345, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 659

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 82/333 (24%), Positives = 144/333 (43%), Gaps = 68/333 (20%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
           + RQ   H ++V F+       ++++N       +R G++YGT      V    R  V  
Sbjct: 255 LNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEPHTDVPLGIRATVAA 312

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL 232
           IYEPPQ+ T + + +  DE + + VDA+A  LG+KK+G+IFT  I     D T    + +
Sbjct: 313 IYEPPQESTRDSINMFPDEGDAE-VDAVANALGLKKIGWIFTDLITD---DATAGTVKQI 368

Query: 233 QAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAFQM 270
           + +E H     +E +TA                       V + V+ + +  +H E + +
Sbjct: 369 RGIETHF-LTAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGYAV 427

Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVD 313
           S  C+ L ++   +  I   + P+L  +++        DV    KD          + + 
Sbjct: 428 SAQCMALVRD---DCLIPTKEAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARPLP 484

Query: 314 NDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVK 362
            ++ LV V       P  +         FPIENR     ++   AL  +L+       ++
Sbjct: 485 VEYLLVDVPASTPVKPQYTFTNYDKRQAFPIENRYIDGHLQDFSALSGYLSAWGEEEFLE 544

Query: 363 RISDFHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
            ISDFHLL++L +   L L   + +L + V+ +
Sbjct: 545 AISDFHLLVYLYKMDMLPLRQHMASLLEAVRTK 577


>gi|156094424|ref|XP_001613249.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148802123|gb|EDL43522.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 531

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 76/285 (26%), Positives = 128/285 (44%), Gaps = 46/285 (16%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
           F QY     +   +R G+MYG   ED       R     IYEPPQ    + + +L D+  
Sbjct: 242 FVQYWYTNGSMCEQRVGWMYGYYREDPHYTLGIRAVCECIYEPPQINEVDKVKLLPDDFL 301

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF----------HAECNM 243
              VD IA  LG++++G+IFT      +K++ L++ EV+Q              +   ++
Sbjct: 302 PS-VDVIANRLGLERIGWIFTHL---PRKEH-LTSEEVVQIANLQLANLKKNMHYTNYSV 356

Query: 244 EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD-----DPKLSKM 298
             ++T  +  +         +  AF +SDM + L +    ++   +       +P  +++
Sbjct: 357 SNFITCTISPDPMLSNEPVTN--AFMVSDMGMALIRSSLVDSTQTDPSHIQLRNPVRNEL 414

Query: 299 KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSH 351
              ++ GGK+  + D D+F+V +   +   P        +  FP ENR    T   +K +
Sbjct: 415 LPLILEGGKETNKFDTDWFIVRI---NESAPKVVRSLFKNFHFPRENRLHSPTAYDVKEY 471

Query: 352 LNRSPSLPL----VKRISDFHLLLFLARFLDLNSDVPALAQCVQA 392
              SP L      + R SDFHL+LF+ R LD+ +   ALA C  A
Sbjct: 472 F-ASPKLERGTNGMHRCSDFHLILFVCRVLDMET---ALALCDAA 512


>gi|390178809|ref|XP_003736733.1| GA18345, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388859598|gb|EIM52806.1| GA18345, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 630

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 82/335 (24%), Positives = 145/335 (43%), Gaps = 68/335 (20%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT      V    R  V
Sbjct: 224 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEPHTDVPLGIRATV 281

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + +  DE + + VDA+A  LG+KK+G+IFT  I     D T    +
Sbjct: 282 AAIYEPPQESTRDSINMFPDEGDAE-VDAVANALGLKKIGWIFTDLITD---DATAGTVK 337

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 338 QIRGIETHF-LTAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 396

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
            +S  C+ L ++   +  I   + P+L  +++        DV    KD          + 
Sbjct: 397 AVSAQCMALVRD---DCLIPTKEAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 453

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P  +         FPIENR     ++   AL  +L+       
Sbjct: 454 LPVEYLLVDVPASTPVKPQYTFTNYDKRQAFPIENRYIDGHLQDFSALSGYLSAWGEEEF 513

Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
           ++ ISDFHLL++L +   L L   + +L + V+ +
Sbjct: 514 LEAISDFHLLVYLYKMDMLPLRQHMASLLEAVRTK 548


>gi|390178811|ref|XP_003736734.1| GA18345, isoform C [Drosophila pseudoobscura pseudoobscura]
 gi|388859599|gb|EIM52807.1| GA18345, isoform C [Drosophila pseudoobscura pseudoobscura]
          Length = 560

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 82/335 (24%), Positives = 145/335 (43%), Gaps = 68/335 (20%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
           + + RQ   H ++V F+       ++++N       +R G++YGT      V    R  V
Sbjct: 154 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEPHTDVPLGIRATV 211

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + +  DE + + VDA+A  LG+KK+G+IFT  I     D T    +
Sbjct: 212 AAIYEPPQESTRDSINMFPDEGDAE-VDAVANALGLKKIGWIFTDLITD---DATAGTVK 267

Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
            ++ +E H     +E +TA                       V + V+ + +  +H E +
Sbjct: 268 QIRGIETHF-LTAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 326

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
            +S  C+ L ++   +  I   + P+L  +++        DV    KD          + 
Sbjct: 327 AVSAQCMALVRD---DCLIPTKEAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 383

Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
           +  ++ LV V       P  +         FPIENR     ++   AL  +L+       
Sbjct: 384 LPVEYLLVDVPASTPVKPQYTFTNYDKRQAFPIENRYIDGHLQDFSALSGYLSAWGEEEF 443

Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
           ++ ISDFHLL++L +   L L   + +L + V+ +
Sbjct: 444 LEAISDFHLLVYLYKMDMLPLRQHMASLLEAVRTK 478


>gi|295663611|ref|XP_002792358.1| endoplasmic reticulum and nuclear membrane proteinc Npl4
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226279028|gb|EEH34594.1| endoplasmic reticulum and nuclear membrane proteinc Npl4
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 906

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 94/321 (29%), Positives = 134/321 (41%), Gaps = 61/321 (19%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L +   
Sbjct: 525 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWSN 583

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
           EK +D +A    ++KVG IFT  +     D           Y LS+ E+  A +  A   
Sbjct: 584 EKDIDEVARLCALEKVGVIFTDLLDSGAGDGTVICKRHIDSYYLSSLEIAFAAQLQARYP 643

Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
              +W       +  V   +S +   AI   A+Q S+  V + K    E     G     
Sbjct: 644 KASKWSETGRFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 703

Query: 291 -----DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------ 330
                DD   SK++       + V   G +V+E     F V  ++  L H  P       
Sbjct: 704 QENELDD-DSSKIRYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPNAVF 762

Query: 331 -SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPA 385
            +STFPIENR      + LKS  N+  S + P  +++ +SDFHLL FL     L+ D  +
Sbjct: 763 KNSTFPIENRAAIGESQELKSLFNKLVSHNDPHQVIRAVSDFHLLCFLCGLGILSKDEES 822

Query: 386 LAQCVQAQTAVPEGYKLLINS 406
           L  C  A+T  P     L+N+
Sbjct: 823 LL-CTVARTHDPADGMQLVNT 842


>gi|345485482|ref|XP_001606190.2| PREDICTED: nuclear protein localization protein 4 homolog [Nasonia
           vitripennis]
          Length = 652

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 80/321 (24%), Positives = 137/321 (42%), Gaps = 71/321 (22%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H ++V F+   A   ++++N       +R G++YG   +        R  V
Sbjct: 237 ITLNRQPYRHVDNVMFEN--AHLVERFLNYWRITGHQRIGYLYGRYEQHSDVPLGIRTVV 294

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ+ T + + +L DE+E  LVD +A  L +KKVG+IFT  I  + K  T+ +  
Sbjct: 295 AAIYEPPQESTRDSIVLLPDEKE-ALVDELAQSLNLKKVGWIFTDLIADDMKKGTVKH-- 351

Query: 231 VLQAVEFH----AECNMEEWV-----------------TAVVKLEVSEEGSAAIHFEAFQ 269
            ++ +E H     EC M   +                 +  V + V+ +    +H E +Q
Sbjct: 352 -VRNIESHFLSAQECIMAGHIQNKLPNPCRFSPNGHFGSKAVTVCVTGDDKNQVHMEGYQ 410

Query: 270 MSDMCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKDV---------KEV 312
           ++  C+ L ++G     +   D P+L         +   DV    KD          + +
Sbjct: 411 VTYQCMALVRDGCL---VPTKDAPELGYVIESTDKQYVPDVYYKEKDTYGNEVSRLARPL 467

Query: 313 DNDFFLVVVKILDHQGPLS-----------STFPIENRTTQVTMRALKSHLNRSPSLP-- 359
             ++ LV V       PL+           + FP+E+R     ++   S  N        
Sbjct: 468 PVEYLLVDVPA---STPLTPQFTFYANENITPFPVEHRLIDGQVQEFGSLCNYMQQFTPD 524

Query: 360 -LVKRISDFHLLLFLARFLDL 379
             +  +SDFHLL+F A F+D+
Sbjct: 525 QFLDAVSDFHLLIFFA-FMDV 544


>gi|270011854|gb|EFA08302.1| hypothetical protein TcasGA2_TC005937 [Tribolium castaneum]
          Length = 483

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 53/206 (25%), Positives = 104/206 (50%), Gaps = 30/206 (14%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
           + RQ   H ++V+F+    D  ++++N   +   +R GF+YG       V    R  V  
Sbjct: 237 LNRQVYRHVDNVAFEN--KDLVERFLNYWRSTGHQRIGFLYGNYEIHNDVPLGIRANVVA 294

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
           IYEPPQ+ + + +++LRD++EE +V+ +A  LG+++VG+IFT  I           +++ 
Sbjct: 295 IYEPPQESSRDSIHLLRDDKEE-IVEELAQNLGLRRVGWIFTDLIPEDVQKGTVKHLRHI 353

Query: 222 KDYTLSNREVLQAVEFH----AECNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
             + LS  E + A  F       C       + + +V + V+ + +  +H E +Q+S+ C
Sbjct: 354 NSHFLSAEECIMAGHFQNMYPNPCRFASGGYFGSKIVTVCVTGDKTNQVHMEGYQVSNQC 413

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK 300
           + L ++      +   D P+L  +++
Sbjct: 414 MALVRDNCL---LPTKDAPELGYVRE 436


>gi|357606887|gb|EHJ65269.1| putative nuclear protein localization [Danaus plexippus]
          Length = 608

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 77/308 (25%), Positives = 134/308 (43%), Gaps = 62/308 (20%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
           +TRQ   H ++V  +   A   +++++   A   +R GF+YG       V    R  V  
Sbjct: 228 LTRQPYRHVDNVLLEH--AAPVERFLSYWRATGHQRVGFLYGRYELHPDVPLGIRARVAA 285

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
           +YEPPQ+ + + + +  D+    L+D +AA LG+++VG+IFT  +           ++  
Sbjct: 286 VYEPPQECSRDAVRLASDDHA-ALLDRLAARLGLERVGWIFTDLLPLDLVSGTVQCLRGV 344

Query: 222 KDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
             + LS +E + A  F  E    C       + +  V + V+ +    IH E +Q+S  C
Sbjct: 345 DTHFLSAQECITAGHFQNEHPNACRHASSGYFGSKFVTVCVTGDADNHIHLEGYQVSGQC 404

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMK--------KDVVVGGKD---------VKEVDNDFF 317
             L ++G     +   D P+L  ++         DV    KD          K +   + 
Sbjct: 405 AALVRDGIL---LPTRDAPELGYIRDCSPEQYVPDVYYKEKDAYGNEVGVSAKRLPVAYL 461

Query: 318 LVVVKILDHQGPLS--------STFPIENRTTQ---VTMRALKSHLNRSPSLPLVKRISD 366
           LV V +     P +        ++FP  +R  Q    ++  L +H+ R+ S   +   SD
Sbjct: 462 LVDVPV--GVAPAAGEPTFDPRASFPPAHRPLQQHVQSLSGLHAHVERAES--FLAAASD 517

Query: 367 FHLLLFLA 374
            H+LLFLA
Sbjct: 518 LHVLLFLA 525


>gi|261196446|ref|XP_002624626.1| nuclear protein localization protein 4 [Ajellomyces dermatitidis
           SLH14081]
 gi|239595871|gb|EEQ78452.1| nuclear protein localization protein 4 [Ajellomyces dermatitidis
           SLH14081]
          Length = 660

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 131/320 (40%), Gaps = 59/320 (18%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L +   
Sbjct: 279 INCLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQLGEIDGV-TLHEWSN 337

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
           EK VD +A   G++K+G IFT  +     D           Y LS+ E++ A +  A   
Sbjct: 338 EKDVDEVAKLCGLEKIGVIFTDLMDARAGDGTVICKRHIDSYYLSSLEIIFAAQLQARYP 397

Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
              +W       +  V   +S +   AI   A+Q S+  V + K    E     G     
Sbjct: 398 KATKWSETGQFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 457

Query: 291 ----DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------- 330
                D  LSK +       + V   G +V+E     F V  ++  L H  P        
Sbjct: 458 QENEPDDDLSKARYIPEVFYRRVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFN 517

Query: 331 SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPAL 386
            STFPIENR      + L++  N+  S   P   +  +SDFHLL FL     LN D  +L
Sbjct: 518 DSTFPIENREVIGESQDLRTLANKLVSHGDPNKAICAVSDFHLLCFLHGLGILNKDEESL 577

Query: 387 AQCVQAQTAVPEGYKLLINS 406
             C  A+T  P     L+N+
Sbjct: 578 L-CTVARTHDPADGMQLVNT 596


>gi|239609444|gb|EEQ86431.1| nuclear protein localization protein 4 [Ajellomyces dermatitidis
           ER-3]
          Length = 660

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 131/320 (40%), Gaps = 59/320 (18%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L +   
Sbjct: 279 INCLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQLGEIDGV-TLHEWSN 337

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
           EK VD +A   G++K+G IFT  +     D           Y LS+ E++ A +  A   
Sbjct: 338 EKDVDEVAKLCGLEKIGVIFTDLMDARAGDGTVICKRHIDSYYLSSLEIIFAAQLQARYP 397

Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
              +W       +  V   +S +   AI   A+Q S+  V + K    E     G     
Sbjct: 398 KATKWSETGQFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 457

Query: 291 ----DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------- 330
                D  LSK +       + V   G +V+E     F V  ++  L H  P        
Sbjct: 458 QENEPDDDLSKARYIPEVFYRRVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFN 517

Query: 331 SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPAL 386
            STFPIENR      + L++  N+  S   P   +  +SDFHLL FL     LN D  +L
Sbjct: 518 DSTFPIENREVIGESQDLRTLANKLVSHGDPNKAICAVSDFHLLCFLHGLGILNKDEESL 577

Query: 387 AQCVQAQTAVPEGYKLLINS 406
             C  A+T  P     L+N+
Sbjct: 578 L-CTVARTHDPADGMQLVNT 596


>gi|327357005|gb|EGE85862.1| nuclear protein localization protein 4 [Ajellomyces dermatitidis
           ATCC 18188]
          Length = 660

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 131/320 (40%), Gaps = 59/320 (18%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L +   
Sbjct: 279 INCLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQLGEIDGV-TLHEWSN 337

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
           EK VD +A   G++K+G IFT  +     D           Y LS+ E++ A +  A   
Sbjct: 338 EKDVDEVAKLCGLEKIGVIFTDLMDARAGDGTVICKRHIDSYYLSSLEIIFAAQLQARYP 397

Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
              +W       +  V   +S +   AI   A+Q S+  V + K    E     G     
Sbjct: 398 KATKWSETGQFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 457

Query: 291 ----DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------- 330
                D  LSK +       + V   G +V+E     F V  ++  L H  P        
Sbjct: 458 QENEPDDDLSKARYIPEVFYRRVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFN 517

Query: 331 SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPAL 386
            STFPIENR      + L++  N+  S   P   +  +SDFHLL FL     LN D  +L
Sbjct: 518 DSTFPIENREVIGESQDLRTLANKLVSHGDPNKAICAVSDFHLLCFLHGLGILNKDEESL 577

Query: 387 AQCVQAQTAVPEGYKLLINS 406
             C  A+T  P     L+N+
Sbjct: 578 L-CTVARTHDPADGMQLVNT 596


>gi|82594712|ref|XP_725542.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23480584|gb|EAA17107.1| Drosophila melanogaster GH03617p [Plasmodium yoelii yoelii]
          Length = 520

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 75/271 (27%), Positives = 121/271 (44%), Gaps = 50/271 (18%)

Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R G+MYG   ED       R     IYEPPQ    + + +L D+     VD IA  LG+
Sbjct: 244 QRVGWMYGYYKEDSHYNLGIRAVCECIYEPPQFCEGDKVRLLPDDFMNN-VDLIAEKLGL 302

Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEF----------HAECNMEEWVTAVVKLEVS 256
           +K+G IFT      +K+Y L++ E+L   +           +   N+  ++T  +  + S
Sbjct: 303 EKIGIIFTHLP---RKEY-LTSYELLNIAKLQLANLKNNTHYTNYNISNFITCTISPDPS 358

Query: 257 EEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD 308
                  +  AF +SD+ + L K      EI   DDP   K++          ++ GGK+
Sbjct: 359 LNNEPITN--AFMVSDLGMSLVKYNLI-NEIQ--DDPSHIKLRTPNKYELLPQILEGGKE 413

Query: 309 VKEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSHLNRSP---SL 358
             + D D+ ++ V   +   P        +  FP ENR    T+  +K + + S     +
Sbjct: 414 TNKFDTDWLIIRV---NDSAPKVVRSIFKNYHFPRENRNKPQTVYDVKEYFSSSKLNRGV 470

Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
               R SDFHL++F A+ LD+ +   AL  C
Sbjct: 471 NGNSRCSDFHLIIFAAKVLDIET---ALVLC 498


>gi|72090152|ref|XP_789485.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Strongylocentrotus purpuratus]
          Length = 627

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 80/306 (26%), Positives = 132/306 (43%), Gaps = 58/306 (18%)

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF------IY 174
           RQ   H ++V F+      F + ++       +R GF+YG     K V +        IY
Sbjct: 231 RQHYRHVDNVMFENPMM--FDRLLDFWRKCGNQRIGFLYGKYEHHKDVPLGIKATVLAIY 288

Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNKKD 223
           EPPQ  T   L +L D   E  VD +A+ LG+ KVG+I+T  +           ++N   
Sbjct: 289 EPPQNSTSNSLELLEDPFAEA-VDYVASKLGLCKVGWIYTDLMADDLTKGTVKHVRNMDA 347

Query: 224 YTLSNREVLQAVEFH----AECNME---EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVR 276
           + +S  E + A EF     + C +     + +  V   V+ + +  IH EA+Q+S+ C+ 
Sbjct: 348 HFMSAEECIMAGEFQNQHPSPCKLATEGHFGSKFVTCIVTGDSTNQIHTEAYQVSNQCMA 407

Query: 277 LFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDNDFFLV 319
           L ++      I   D P+L  +++        DV     D          + +  +F LV
Sbjct: 408 LVRDDCLVPTI---DAPELGYIRESTNEQYVPDVFYSETDSYGNSSKQLARPLPVEFLLV 464

Query: 320 VVK--------ILDHQGPLSSTFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISDFH 368
            +            +  P    FPIENR     ++   AL  +L +  S  L++ + DFH
Sbjct: 465 DLPAGFPKDATFAFNSQPGLKPFPIENRAGVGEVQGIEALTQYLGQFESGQLMEAMLDFH 524

Query: 369 LLLFLA 374
           LL++LA
Sbjct: 525 LLIYLA 530


>gi|193613344|ref|XP_001951828.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Acyrthosiphon pisum]
          Length = 630

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 73/310 (23%), Positives = 140/310 (45%), Gaps = 58/310 (18%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQ   H +++ F+   ++  ++++N       +R GF+YG            R  V
Sbjct: 235 ITLNRQTYRHVDNIMFEN--SELVERFLNYWRITGHQRIGFLYGRYETHPDIPLGIRAIV 292

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
             IYEPPQ+ + + + +L DE+++ +V+ +A  LG+  VG+IFT  +           ++
Sbjct: 293 AAIYEPPQESSRDHVKLLPDEKQD-IVNELARCLGLSCVGWIFTDLLTEDMQTGTVKYVR 351

Query: 220 NKKDYTLSNREVLQAVEFH----AECNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
           N K + LS +E + A  F       C       + +  V + V+ +    +H E + +S+
Sbjct: 352 NIKSHFLSAQECIMAGYFQNLHPNPCRYASNGYFGSKFVTVCVTGDEKCQVHMEGYSVSN 411

Query: 273 MCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDND 315
            C+ L K+   +  +   D P+L  +++        DV    KD          + +  +
Sbjct: 412 QCMALVKD---DCLVPTKDVPELGFVRESSDKQYVPDVYYKEKDSYGNEVSKLARPLPVE 468

Query: 316 FFLVVVKILDHQGPLS--------STFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRI 364
           + L+ V       PL+        + FP+ENR     ++   +L  +L +   L   + +
Sbjct: 469 YLLLDVPASTPLTPLNTFTSIKDITKFPVENRLIDGHIQDFDSLCKYLRQFTPLQFYESV 528

Query: 365 SDFHLLLFLA 374
           SDFHLLL++A
Sbjct: 529 SDFHLLLYIA 538


>gi|350406695|ref|XP_003487852.1| PREDICTED: nuclear protein localization protein 4 homolog [Bombus
           impatiens]
          Length = 647

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 78/310 (25%), Positives = 138/310 (44%), Gaps = 58/310 (18%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
           + + RQ   H ++V F+       ++++N   +   +R GF+YG       V    R  V
Sbjct: 231 ITLNRQTYRHVDNVMFEN--PSLVERFLNYWRSTGHQRIGFLYGRYEIHTDVPLGIRAVV 288

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
             IYEPPQ+ T+  + +L DE+E  +VD +A  L +K+VG+IFT  I           ++
Sbjct: 289 AAIYEPPQESTKNSIRLLPDEKE-VVVDELAHLLNLKRVGWIFTDLIVDDIKKGTVKHVR 347

Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
           N + + LS +E + A  F  +    C       + +  V + V+ +    IH E +Q+S+
Sbjct: 348 NIESHFLSAQECIMAGYFQNKYPNPCRFSPNNYFGSKFVTVCVTGDEKNQIHMEGYQVSN 407

Query: 273 MCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKD---------VKEVDND 315
            C+ L ++G     +   D P+L         +   DV    KD          + +  +
Sbjct: 408 QCMALVRDGCL---VPTKDAPELGYVIESTDKQYVPDVFYKEKDSYGNEVSRLARPLPVE 464

Query: 316 FFLVVVKILDHQGPL--------SSTFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRI 364
           + LV +       P          + F IENR     ++   +L S++ +      ++ +
Sbjct: 465 YLLVDIPASTPLNPQFTFYVSNNITPFLIENRLIDGQIQEFSSLCSYMQQFSKEQFLEAV 524

Query: 365 SDFHLLLFLA 374
           SDFHLL+F+A
Sbjct: 525 SDFHLLIFIA 534


>gi|340721568|ref|XP_003399190.1| PREDICTED: nuclear protein localization protein 4 homolog [Bombus
           terrestris]
          Length = 647

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 140/313 (44%), Gaps = 64/313 (20%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
           + + RQ   H ++V F+       ++++N   +   +R GF+YG       V    R  V
Sbjct: 231 ITLNRQTYRHVDNVMFEN--PSLVERFLNYWRSTGHQRIGFLYGRYEIHTDVPLGIRAVV 288

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
             IYEPPQ+ T+  + +L DE+E  +VD +A  L +K+VG+IFT  I           ++
Sbjct: 289 AAIYEPPQESTKNSIRLLPDEKE-VVVDELAHLLNLKRVGWIFTDLIVDDIKKGTVKHVR 347

Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
           N + + LS +E + A  F  +    C       + +  V + V+ +    IH E +Q+S+
Sbjct: 348 NIESHFLSAQECIMAGYFQNKYPNPCRFSPNNYFGSKFVTVCVTGDEKNQIHMEGYQVSN 407

Query: 273 MCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKD---------VKEVDND 315
            C+ L ++  F   +   D P+L         +   DV    KD          + +  +
Sbjct: 408 QCMALVRDSCF---VPTKDAPELGYVIESTDKQYVPDVFYKEKDSYGNEVSRLARPLPVE 464

Query: 316 FFLVVVKILDHQGPLS-----------STFPIENRTTQVTMR---ALKSHLNRSPSLPLV 361
           + LV +       PL+           + F IENR     ++   +L S++ +      +
Sbjct: 465 YLLVDIPA---STPLNPQFTFYVSNNITPFLIENRLIDGQIQEFSSLCSYMQQFSKEQFL 521

Query: 362 KRISDFHLLLFLA 374
           + +SDFHLL+F+A
Sbjct: 522 EAVSDFHLLIFIA 534


>gi|67624425|ref|XP_668495.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54659695|gb|EAL38261.1| hypothetical protein Chro.70227 [Cryptosporidium hominis]
          Length = 490

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 112/503 (22%), Positives = 189/503 (37%), Gaps = 126/503 (25%)

Query: 2   LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLS--TNQNLLLAKSPSDLL 59
           ++R+ S  G  RVT+      ++ LK+ I   L++P E Q LS  ++ N LL      +L
Sbjct: 6   IIRVFSSSGRSRVTIPST-CNLSMLKSKIAQSLKVPEEQQLLSLDSHGNTLLKGDSLSML 64

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNV--------------------------- 92
           Q               +S+GSIV+L  D    +                           
Sbjct: 65  QH-------------GLSNGSIVYLITDATPQIATQSIQRPKHMTTSDLPQLEKKPSNSN 111

Query: 93  -------------------RGPSFNPAGSF--GRKMTMDDLIAKQ--------------- 116
                               GP F    SF   R    DDL  KQ               
Sbjct: 112 YSSSTSADSKDMNNSVEKSEGPHFKSFDSFLSERNFVTDDLPLKQSYKSFFISKGVMNKI 171

Query: 117 ---MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------ 166
              + +  Q   H + +   +   A  F  Y    L    +R G+MYG   ED       
Sbjct: 172 PPSVTLRHQAYRHVDHLEMMNLSEAMQFVDYWRSKLGMMKQRVGWMYGYYREDSTYPMGI 231

Query: 167 RVEVNFIYEPPQ-QGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT 225
           R  +  IYEPPQ +  E    IL ++  +  VD +A  LG++ +G +FT     N++D  
Sbjct: 232 RAVMEAIYEPPQDEKAEPGKLILENDSFKSKVDRVAQSLGLECLGLVFT----HNERDEV 287

Query: 226 LSNREVLQAVEFHAEC---------NMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVR 276
           L++ E++   +   E           +  ++   +    S +G   I   AF +SD+ + 
Sbjct: 288 LTSNEIVTLGKLQLEALKTLHYTGYPVSSFICCTIAPCKSIQGGDPIP-NAFAVSDLGLA 346

Query: 277 LFKEGWFETEIAEGD------DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL 330
           L ++   + +  + D      + +  ++   ++  G   ++ D  + +V +   +   P+
Sbjct: 347 LLRDKIIDEKGLKDDTHIVIREEEKGEILPQILEKGVSTRKFDAHWLIVRI---NESAPI 403

Query: 331 -------SSTFPIENRTTQVTMRALKSHL-NRSPSLPLV--KRISDFHLLLFLARFLDLN 380
                  SS FP ENR        +   + +R  S+PL     ++DFHLLL+LA+  D  
Sbjct: 404 QPKPFFSSSQFPRENRIVPQKPSDVSEFIKSRLTSVPLTSCNLLNDFHLLLYLAKLFD-- 461

Query: 381 SDVPALAQCVQAQTAVPEGYKLL 403
            +  A++ C       P    LL
Sbjct: 462 -EATAISICESIINKTPVDQHLL 483


>gi|258566089|ref|XP_002583789.1| protein NPL4 [Uncinocarpus reesii 1704]
 gi|237907490|gb|EEP81891.1| protein NPL4 [Uncinocarpus reesii 1704]
          Length = 669

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 89/311 (28%), Positives = 128/311 (41%), Gaps = 58/311 (18%)

Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
             +R GF+YGT  E   V       V  IYEPPQ   +EV  I LR+ E E+ VD +A  
Sbjct: 298 GAQRLGFLYGTYEEYSEVPLGIKAVVQAIYEPPQ--VDEVDGITLREWENEQDVDQVAKL 355

Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEWV---- 247
            G++KVG IFT  +     D           Y LS+ EV+ A +  A      +W     
Sbjct: 356 CGLEKVGVIFTDLLDSGHGDGTVICRRHIDSYYLSSLEVIFAAQLQARYPKPSKWSETGR 415

Query: 248 --TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEG-------------WFETEIAEGDD 292
             +  V   +S + + AI   A+Q S+  V + +                 E+E+   D 
Sbjct: 416 FGSNFVTCILSGDENGAISISAYQASNSAVEMVRADIVEPSADPSVMLVQLESELENADT 475

Query: 293 PKLSKMK----KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSS-------TFPIENR 339
             +  +     + V   G +V+E     F V  +   L H  P  S       TFPIENR
Sbjct: 476 GTVRYIPEVFYRKVNEYGANVQENAKPAFPVEYLFVTLTHGFPTGSSPLFTDNTFPIENR 535

Query: 340 ----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
                +Q   R     L+R+     ++ +S+FHLL F+     L  D   L  C  A+T 
Sbjct: 536 EVIGESQDIRRVANKLLSRTDPDSAIRAVSNFHLLCFMHGLGILTKDEETLL-CTVARTH 594

Query: 396 VPEGYKLLINS 406
            P     LIN+
Sbjct: 595 DPADGVQLINT 605


>gi|66362816|ref|XP_628374.1| nuclear pore associated protein (NLP4) with N-terminal ubiquitin
           domain [Cryptosporidium parvum Iowa II]
 gi|46229793|gb|EAK90611.1| nuclear pore associated protein (NLP4) with N-terminal ubiquitin
           domain [Cryptosporidium parvum Iowa II]
          Length = 491

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 112/497 (22%), Positives = 190/497 (38%), Gaps = 126/497 (25%)

Query: 2   LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLS--TNQNLLLAKSPSDLL 59
           ++R+ S  G  RVT+      ++ LK+ I   L++P E Q LS  ++ N LL      +L
Sbjct: 7   IIRVFSSSGRSRVTIPST-CNLSMLKSKIAQSLKVPEEQQLLSLDSHGNTLLKGDSLSVL 65

Query: 60  QFTDMANPDRPLSSLNISHGSIVFLSYDGERNV--------------------------- 92
           Q               + +GSIV+L  D    +                           
Sbjct: 66  QH-------------GLFNGSIVYLITDATPQIATQSIQRPKHMTTSDLPPLEKKPSNSN 112

Query: 93  -------------------RGPSFNPAGSF--GRKMTMDDLIAKQ--------------- 116
                               GP F    SF   R    DDL  KQ               
Sbjct: 113 YPSSSSADSKDMNNSVEKSEGPHFKSFDSFLSERNFVTDDLPLKQSYKSFFISKGVMNKI 172

Query: 117 ---MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------ 166
              + +  Q   H + +   +   A  F  Y    L    +R G+MYG   ED       
Sbjct: 173 PPSVTLRHQAYRHVDHLEMMNLSEAMQFVDYWRSKLGMMKQRVGWMYGYYREDSTYPMGI 232

Query: 167 RVEVNFIYEPPQ-QGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT 225
           R  +  IYEPPQ +  E    IL ++  +  VD +A  LG++ +G +FT     N++D  
Sbjct: 233 RAVMEAIYEPPQDEKAEPGKLILENDSFKSTVDRVAQSLGLECLGLVFT----HNERDEV 288

Query: 226 LSNREVLQAVEFHAEC---------NMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVR 276
           L++ E++   +   E           +  ++   +    S +G   I   AF +SD+ + 
Sbjct: 289 LTSNEIVTLGKLQLEALKTPHYTGYPVSSFICCTIAPCKSIQGGDPIP-NAFAVSDLGLA 347

Query: 277 LFKEGWFETEIAEGDDPKLSKMKKD-------VVVGGKDVKEVDNDFFLVVVKILDHQGP 329
           L ++   + E    DD  +   ++D       ++  G   ++ D  + +V +   +   P
Sbjct: 348 LLRDKIID-EKGLKDDTHIVIREEDKGEILPQILEKGVSTRKFDAHWLIVRI---NESAP 403

Query: 330 L-------SSTFPIENRTTQVTMRALKSHL-NRSPSLPLV--KRISDFHLLLFLARFLDL 379
           +       SS FP ENR        +   + +R  S+PL     ++DFHLLL+LA+  D 
Sbjct: 404 IQPKPFFSSSQFPRENRIVPQKPSDVSEFIKSRLTSVPLTSCNLLNDFHLLLYLAKLFDE 463

Query: 380 NSDVPALAQCVQAQTAV 396
            + + ++ + +  +TAV
Sbjct: 464 ATAI-SICESIINKTAV 479


>gi|294925935|ref|XP_002779039.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239887885|gb|EER10834.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 363

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/330 (24%), Positives = 137/330 (41%), Gaps = 83/330 (25%)

Query: 126 HCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQ 179
           HCE ++ +     +F QY         +R G++YG  L D       R+ V  IYEPP+Q
Sbjct: 69  HCEWMNTEE--IKSFVQYWQFDKEMLQQRCGWLYGYYLSDPNYDDGCRIVVEGIYEPPKQ 126

Query: 180 GT----------EEVLYILRDEEEEK----------------LVDAIAAGLGMKKVGFIF 213
                        E+L  +++  + K                 VD + + LG+++VG IF
Sbjct: 127 EVYNAATGGIRMNEMLVAMQEASKGKPLDPGVLGGMVDPALQKVDVVMSKLGLERVGCIF 186

Query: 214 TQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDM 273
           T       +DY +S+ E+L A                 KL              +  SD 
Sbjct: 187 TAL----PRDYEMSSGELLAAARLQ-------------KL-------------VWMASDQ 216

Query: 274 CVRLFKEGWFET-EIAEGD------DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDH 326
              + ++G F+  + AE        +P   +M   V+  G +V E D D+ LV V   + 
Sbjct: 217 AEGMLQDGLFDVKKTAETPTRVQLREPFSQEMMPPVLASGSEVTEFDPDWLLVKV---ND 273

Query: 327 QGPLS-------STFPIENRTTQVTMRALKSHLNRSPS-LPLVKRISDFHLLLFLARFLD 378
             PL        S FP ENR+ Q +   +K ++ + P+  P   R +DFHLL+++   LD
Sbjct: 274 GVPLKKRSMFRFSHFPRENRSRQQSPDDIKQYMRQIPAGTPSWARYADFHLLVYITLLLD 333

Query: 379 LNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
            ++   A+A  +  +  + +    L+ +MA
Sbjct: 334 EDT-AGAIAGSISREEEIDKAMDELLTNMA 362


>gi|70952299|ref|XP_745327.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56525615|emb|CAH80963.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 511

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 73/270 (27%), Positives = 121/270 (44%), Gaps = 48/270 (17%)

Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R G+MYG   ED       R     IYEPPQ    + + +L D+     VD IA  LG+
Sbjct: 235 QRIGWMYGYYKEDTHYNLGIRAVCECIYEPPQFCEGDQVRLLPDDFMSN-VDLIAEKLGL 293

Query: 207 KKVGFIFTQTIMQNKKDYTLSNREV---------LQAVEFHAECNMEEWVTAVVKLEVSE 257
           +K+G IFT      +K+Y  S+  V         L+    +   ++  ++T  +  +   
Sbjct: 294 EKIGIIFTHLP---RKEYLASHELVNIAKLQLSNLKKNTHYTNYSISNFITCTISPDPLL 350

Query: 258 EGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV 309
                 +  AF +SD+ + L ++   + EI   DDP   K++          ++ GGK+ 
Sbjct: 351 NNEPITN--AFMVSDLGMSLIRDNLID-EIQ--DDPSHIKLRSPNKHELLPQILEGGKET 405

Query: 310 KEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSHLNRSP---SLP 359
            + D D+F++ V   +   P        +  FP ENR    T+  +K + + S     + 
Sbjct: 406 NKFDTDWFIIRV---NDSAPKVVRSIFKNYHFPRENRNKPQTVYDVKEYFSSSKLDRGVN 462

Query: 360 LVKRISDFHLLLFLARFLDLNSDVPALAQC 389
              R SDFHL++F A+ LD+ +   AL  C
Sbjct: 463 GKSRCSDFHLIIFAAKVLDIET---ALVLC 489


>gi|221057203|ref|XP_002259739.1| Nuclear pore associated protein (NLP4) [Plasmodium knowlesi strain
           H]
 gi|193809811|emb|CAQ40515.1| Nuclear pore associated protein (NLP4),putative [Plasmodium
           knowlesi strain H]
          Length = 523

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 71/284 (25%), Positives = 126/284 (44%), Gaps = 44/284 (15%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
           F QY     +   +R G+MYG   ED       R     IYEPPQ    + + +L D+  
Sbjct: 234 FVQYWYTNGSMCEQRVGWMYGYYREDPHYTMGIRAVCECIYEPPQINEVDKVKLLPDDFL 293

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF----------HAECNM 243
           +  VD IA  LG++++G+IFT      +K++ L++ EV+Q  +           +   ++
Sbjct: 294 DS-VDVIANRLGLERIGWIFTHL---PRKEH-LTSEEVVQIAKLQLANLKKNVHYTNYSV 348

Query: 244 EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET-----EIAEGDDPKLSKM 298
             ++T  +  +         +  AF +SD+ + L +    +       + +  +P  +++
Sbjct: 349 SNFITCTISPDPMLSNEPVTN--AFMVSDLGMSLIQSNLVDPVQTDPSVIQLRNPMPNEL 406

Query: 299 KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSH 351
              ++ GGK+  + D D+F+V +   +   P        +  FP ENR        +K +
Sbjct: 407 LPLILEGGKETNKFDTDWFIVRI---NESAPKVVRSLFKNFHFPRENRLRSPNAYDVKEY 463

Query: 352 LNRSP---SLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQA 392
              +        + R SDFHL+LF  R LD+ +   ALA C  A
Sbjct: 464 FASTKLERGTNGMHRCSDFHLILFACRVLDMET---ALALCDAA 504


>gi|399217022|emb|CCF73709.1| unnamed protein product [Babesia microti strain RI]
          Length = 450

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 110/458 (24%), Positives = 195/458 (42%), Gaps = 98/458 (21%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           +++RI    G+ RVT+    + +++L + I  +  +P+E Q LS         +PSD   
Sbjct: 3   IVVRIFCPTGIIRVTLPTDSM-LSDLMSEIAERTGVPIEKQCLSI--------APSDT-- 51

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGE--------------RNVRGPSFNPAGSFGRK 106
            +   N +  L  + I HG+I+ L+ D                ++V+  SF+   S  R 
Sbjct: 52  -SIDPNVNVKLDKIGIVHGTILNLNSDYPVEFQDTSKEIPEFMQDVKFKSFDSFLS-ERG 109

Query: 107 MTMDDL------------------IAKQMRVTRQENPHCESVSFDRDCADAFQQYVN--- 145
             + DL                  I + + + RQ   H + +       +  + +VN   
Sbjct: 110 YNVSDLPLKNEYLPVKIEVGKMNKIPQSIALKRQPYRHVDHLEIMN--TEEIRNFVNFWM 167

Query: 146 ETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDA 199
                  +R G+MYG  +ED       R     IYEP Q G  E + I  D     +VD 
Sbjct: 168 HDCEMLTQRCGWMYGYYIEDVHYPMGIRAVCEAIYEPIQTGDIESVRIEPDNFL-SVVDT 226

Query: 200 IAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF---------HAECNMEEWVTAV 250
           IA  LG++++GFIF+     N   + L+ +E+    +          +   N    VT  
Sbjct: 227 IAERLGLERIGFIFSHLPRDN---FFLTAQEIYDIAKIQNSRIDTGHYTGYNASHHVTCT 283

Query: 251 VKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKD-------VV 303
           +  +V+  G  +++  AF +SDM + L ++G  + +  + D+ K+  +  +       V+
Sbjct: 284 ISPDVNNNGECSLN--AFMVSDMGMALVRDGLIDPK--QDDNLKIHLVNPNEYVIFPGVL 339

Query: 304 VGGKDVKEVDNDFFLVVVKILDHQGPLS-------STFPIENRTTQVTMRA--LKSH-LN 353
             G +    D  + LV V   +   P++       S F  ENR     ++A  +K++  N
Sbjct: 340 EEGTETTSFDTSWVLVRV---NESAPITPNSFFKNSLFQRENRGKAPNIQAYFMKANGAN 396

Query: 354 RSPSLPLVKRI--SDFHLLLFLARFLDLNSDVPALAQC 389
            +P       +  SDFHLLL +A+ +D+++   ALA C
Sbjct: 397 DNPKDGFDSHVLYSDFHLLLHIAKNVDIDT---ALAIC 431


>gi|124506075|ref|XP_001351635.1| Nuclear pore associated protein (NLP4), putative [Plasmodium
           falciparum 3D7]
 gi|23504562|emb|CAD51442.1| Nuclear pore associated protein (NLP4), putative [Plasmodium
           falciparum 3D7]
          Length = 531

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 71/266 (26%), Positives = 121/266 (45%), Gaps = 40/266 (15%)

Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R G+MYG   ED       R     IYEPPQ   +  +++L D+     VD IA  LG+
Sbjct: 255 QRIGWMYGYYREDNHYNLGIRAVCECIYEPPQFCEDNKIHLLEDDFLPT-VDLIAERLGL 313

Query: 207 KKVGFIFTQTIMQNKKDYTLSNREV----LQAVEFHAECNMEEW-VTAVVKLEVSEEG-- 259
           +++G+IFT    Q   +Y  S+  V    +Q        +   + ++  +   +S +   
Sbjct: 314 ERIGWIFTHLPRQ---EYLTSDEVVNIAKMQLTNIKKNMHYTNYPISNFITCTISPDPLL 370

Query: 260 SAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD-----DPKLSKMKKDVVVGGKDVKEVDN 314
           S      AF +SD+ + L ++   E    +       +P  +++   ++  GK+  + D 
Sbjct: 371 SNEPVTNAFMVSDLGMALMRDNLIEENQIDPSHIQLRNPNKNELLPQILESGKETNKFDT 430

Query: 315 DFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSHLNRSPSLPL----VKR 363
           D+F+V V   +   P        +  FP ENR    +   +K +  RS  L      + +
Sbjct: 431 DWFIVRV---NESAPKVVRSIFKNFHFPRENRQHSQSAFHVKEYF-RSNKLDRGTKGIHQ 486

Query: 364 ISDFHLLLFLARFLDLNSDVPALAQC 389
            SDFHL+LF+A+ LD+ +   ALA C
Sbjct: 487 CSDFHLILFVAKVLDIET---ALALC 509


>gi|240279728|gb|EER43233.1| endoplasmic reticulum and nuclear membrane protein [Ajellomyces
           capsulatus H143]
 gi|325092857|gb|EGC46167.1| endoplasmic reticulum and nuclear membrane protein [Ajellomyces
           capsulatus H88]
          Length = 658

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 91/325 (28%), Positives = 130/325 (40%), Gaps = 59/325 (18%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L +   
Sbjct: 278 INSLLDFWRKSGTQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWSN 336

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
           EK VD +A   G++K+G IFT  +     D           Y LS+ E++ A +  A   
Sbjct: 337 EKDVDEVAKFCGLEKIGVIFTDLLDAGAGDGTVICKRHIDSYYLSSLEIVFAAQLQARYP 396

Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
              +W       +  V   +S +   AI   A+Q S+  V + K    E     G     
Sbjct: 397 KATKWSETGRFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 456

Query: 291 -----DDPKLSKMKKDVVVG-----GKDVKEVDNDFFLV--VVKILDHQGPL-------S 331
                DD   ++   +V        G +V+E     F V  ++  L H  P        +
Sbjct: 457 QENQPDDASKARYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFNN 516

Query: 332 STFPIENR-----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPAL 386
           STFPIENR     +  + M A K   +  P+  +   +SDFHLL FL     LN D   L
Sbjct: 517 STFPIENREVIGESQDLRMLADKLVSHGDPNKTICG-VSDFHLLCFLNSLGILNKDEEFL 575

Query: 387 AQCVQAQTAVPEGYKLLINSMAETV 411
              V       +G +L+  S   T+
Sbjct: 576 LCTVARSHDTADGMQLINTSGWATL 600


>gi|225562903|gb|EEH11182.1| endoplasmic reticulum and nuclear membrane protein [Ajellomyces
           capsulatus G186AR]
          Length = 659

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 91/326 (27%), Positives = 130/326 (39%), Gaps = 60/326 (18%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L +   
Sbjct: 278 INSLLDFWRKSGTQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWSN 336

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
           EK VD +A   G++K+G IFT  +     D           Y LS+ E++ A +  A   
Sbjct: 337 EKDVDEVAKFCGLEKIGVIFTDLLDAGAGDGTVICKRHIDSYYLSSLEIVFAAQLQARYP 396

Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
              +W       +  V   +S +   AI   A+Q S+  V + K    E     G     
Sbjct: 397 KATKWSETGRFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 456

Query: 291 ------DDPKLSKMKKDVVVG-----GKDVKEVDNDFFLV--VVKILDHQGPL------- 330
                 DD   ++   +V        G +V+E     F V  ++  L H  P        
Sbjct: 457 QENQLDDDASKARYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFN 516

Query: 331 SSTFPIENR-----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPA 385
           +STFPIENR     +  + M A K   +  P+  +   +SDFHLL FL     LN D   
Sbjct: 517 NSTFPIENREVIGESQDLRMLADKLVSHGDPNKTICG-VSDFHLLCFLNSLGILNKDEEF 575

Query: 386 LAQCVQAQTAVPEGYKLLINSMAETV 411
           L   V       +G +L+  S   T+
Sbjct: 576 LLCTVARSHDTADGMQLINTSGWATL 601


>gi|296810816|ref|XP_002845746.1| NPL4 [Arthroderma otae CBS 113480]
 gi|238843134|gb|EEQ32796.1| NPL4 [Arthroderma otae CBS 113480]
          Length = 652

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 93/318 (29%), Positives = 134/318 (42%), Gaps = 61/318 (19%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  + L + E
Sbjct: 278 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWE 335

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   GM+K+G IFT  I     D           Y LS+ EV+ A +  A  
Sbjct: 336 NEKDVDQVARLCGMEKIGVIFTDLIDSGAGDGTVICKRHIDSYYLSSLEVVFASQLQARN 395

Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------- 287
              C   E   + +  V   +S + + AI   A+Q S+  V + +    E          
Sbjct: 396 PKPCKWSETGQFGSNFVTCVLSGDENGAISISAYQASNSAVEMVRADIIEPSADPSVMLV 455

Query: 288 ---AEGDDPKLSKMKKDVVVGGKD-----VKEVDNDFFLV--VVKILDHQGP-------L 330
              +E D   +++   +V     +     V+E  N  F V  +   L H  P        
Sbjct: 456 QQESELDSANITRYIPEVFYRRINEYKVSVQENANPSFPVEYLFVTLTHGFPTMPTTVFT 515

Query: 331 SSTFPIENRTT---QVTMRALKSHLNRS--PSLPLVKRISDFHLLLFLARFLDLNSDVPA 385
            S+FPIENR        +R + S L  S  P++  +K +S+FHLL FL     L+ D  A
Sbjct: 516 DSSFPIENREIIGESQDLRRVASKLTGSSDPNV-AIKAVSNFHLLAFLHGMGILSEDEEA 574

Query: 386 LAQCVQAQTAVPEGYKLL 403
           L   V  +    +G +L+
Sbjct: 575 LLCNVATKHDPADGVQLI 592


>gi|242007387|ref|XP_002424522.1| nuclear protein localization, putative [Pediculus humanus corporis]
 gi|212507948|gb|EEB11784.1| nuclear protein localization, putative [Pediculus humanus corporis]
          Length = 451

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 100/203 (49%), Gaps = 30/203 (14%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
           + + RQ   H ++V F+   A   ++++N   +  ++R GF+YG       V    R  V
Sbjct: 220 ITLNRQVYRHTDNVMFEN--AHLVERFLNYWRSSGLQRVGFLYGKYEVHSDVPLGIRATV 277

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
             IYEPPQ+ +++ + IL D+ E+ +V+ +A  LG+ +VG+IFT  I           ++
Sbjct: 278 AAIYEPPQESSKDSINILPDDREQ-IVEELAQHLGLVRVGWIFTDLIADDVQKGTVKHVR 336

Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
           N + + LS +E + A     +    C       + +  V + V+ + S  +H E +Q+S 
Sbjct: 337 NIESHFLSAQECITAGYLQNQRPNPCRFSSTGFFGSKFVTVCVTGDSSKQVHMEGYQVSG 396

Query: 273 MCVRLFKEGWFETEIAEGDDPKL 295
            C+ L ++G     +   D P+L
Sbjct: 397 QCMALVRDGCL---VPTKDAPEL 416


>gi|332020704|gb|EGI61109.1| Nuclear protein localization protein 4-like protein [Acromyrmex
           echinatior]
          Length = 515

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 107/229 (46%), Gaps = 35/229 (15%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
           + RQ   H ++V F+   A   ++++N   +   +R G++YG       V    R  V  
Sbjct: 248 LNRQTYRHVDNVMFEN--ASLVERFLNYWRSTGHQRIGYLYGRYEVHSDVPLGIRAVVAA 305

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
           IYEPPQ+ T++ + +L D E E LV+ +   L +++VG+IFT  I           ++N 
Sbjct: 306 IYEPPQESTKDTIRLLPD-EREALVEELGRTLNLRRVGWIFTDLIADDVKKGTVKHVRNI 364

Query: 222 KDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
           + + LS +E + A  F  +    C       + +  V + V+ +    +H E +Q+S+ C
Sbjct: 365 ESHFLSAQECIMAGYFQNQYPNPCRFSPNGYFGSKFVTVCVTGDDKNQVHMEGYQVSNQC 424

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKI 323
           + L ++G     +   D P+L       V+   D + V + F+ V   I
Sbjct: 425 MALVRDGCL---VPTKDAPELG-----YVIESTDKQYVPDVFYKVSSNI 465


>gi|358366858|dbj|GAA83478.1| endoplasmic reticulum and nuclear membrane proteinc Npl4
           [Aspergillus kawachii IFO 4308]
          Length = 667

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 94/330 (28%), Positives = 135/330 (40%), Gaps = 70/330 (21%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E K V       V  IYEPPQ   +EV  + L +  
Sbjct: 277 INSLLDFWRKSGAQRLGFLYGTYEEYKEVPLGVKAVVQAIYEPPQ--VDEVDGVTLHEWP 334

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   G+++VG IFT  +   + D           Y LS+ E+  A    A+ 
Sbjct: 335 NEKEVDEVAHLCGLERVGVIFTDLLDAGRGDGSVICKRHIDSYYLSSLEIAFAARLQAQH 394

Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG+ AI   A+Q S   V + +    E      
Sbjct: 395 PKATKWSRTGRFGSNFVTCV--LSGDEEGAIAI--SAYQASVAAVEMVRADIVEPSAEPS 450

Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
                +E DD        P++   K+ +  +   ++ K      +L+V   L H  P   
Sbjct: 451 VMLVQSEDDDTENKSRYIPEVFYRKINEYGISAQQNAKPSFPVEYLLVT--LTHGFPTDA 508

Query: 330 ----LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
                 S+FPIENR     +Q  +   K  +        ++ +SDFH+L FL      N 
Sbjct: 509 SPLFTDSSFPIENREVIGESQELIHVAKKLVTHGDPDKAIRAVSDFHMLCFLHSLSTFNK 568

Query: 382 DVPALAQCVQAQTAVPEGYKLLINSMAETV 411
           D  AL   V  Q A  +G +L+  S   T+
Sbjct: 569 DEEALLCRVATQRAPADGLQLINTSGWATL 598


>gi|196000646|ref|XP_002110191.1| hypothetical protein TRIADDRAFT_63700 [Trichoplax adhaerens]
 gi|190588315|gb|EDV28357.1| hypothetical protein TRIADDRAFT_63700 [Trichoplax adhaerens]
          Length = 606

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 80/338 (23%), Positives = 143/338 (42%), Gaps = 71/338 (21%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF---- 172
           + + RQ   H + V F    +   + ++N   +   +R G++YG     K V +      
Sbjct: 223 ITLNRQSYRHIDYVMFQN--SKIMENFLNYWRSCGCQRLGYLYGYYDPHKDVPLGIKTVV 280

Query: 173 --IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT----- 225
             IYEPPQ+ T+  + +L D + +  VDA+A  LG++KVG+I T  + ++    T     
Sbjct: 281 VAIYEPPQKNTKNSIQLLEDAQADT-VDAVAKILGLRKVGWIVTDLLPKDVHKGTVQHTR 339

Query: 226 ------LSNREVLQAVEF-HAECNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSD 272
                 +S  E + A  F +A  N  ++       +  V + VS + + +I    +Q+S+
Sbjct: 340 GVDMHLMSAEECIMAAAFQNAHPNPCKYARDKYFGSKFVTVIVSGDCNDSIDIHGYQVSN 399

Query: 273 MCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------------- 308
            C+ L ++      +   DDP L+ ++         DV    KD                
Sbjct: 400 QCMSLVRDNCL---VPTRDDPTLAYVRNSTNIQYVPDVFYKMKDEYNNEKLALGRPLPVE 456

Query: 309 --VKEVDNDFFLVVVKILDHQGPLSSTFPIENRTT-------QVTMRALKSHLNRSPSLP 359
             + +V N F   +  +LD    L   FPIENR          V  +  + H++      
Sbjct: 457 YLLIDVPNGFPKNMKFLLDK---LDDQFPIENRDNIGQPQNFNVLAKYAEQHMDGK---- 509

Query: 360 LVKRISDFHLLLFLARFLDLN-SDVPALAQCVQAQTAV 396
            ++  SDFH L +L +   +   D+  LA  V+ ++A+
Sbjct: 510 FLEMASDFHFLTYLVKTEIIKLEDLQPLATAVKQESAL 547


>gi|21360364|gb|AAM47486.1| unknown [Squalus mitsukurii]
          Length = 167

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 76/152 (50%), Gaps = 22/152 (14%)

Query: 167 RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI--------- 217
           R EV  IYEPPQ GT   L  L D + E +VD IAA LG++KVG+IFT  +         
Sbjct: 14  RAEVAAIYEPPQVGTANSLEFLEDPKAE-VVDEIAAKLGLRKVGWIFTDLLSEDTRKGTV 72

Query: 218 --MQNKKDYTLSNREVLQAVEFHAE----CNM---EEWVTAVVKLEVSEEGSAAIHFEAF 268
             ++NK  + LS  E + A +F  +    C +     + +  V +  +      +HFE +
Sbjct: 73  KFIRNKDAHFLSAEECITAGDFQNKHPNVCRLSPVNHFGSKFVTVLATGGPDNQVHFEGY 132

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
           Q+S+ C+ L  +  F   +   D P+L  +K+
Sbjct: 133 QVSNQCMALVGDDCF---LPCRDAPELGYVKE 161


>gi|440291930|gb|ELP85172.1| nuclear protein localization, putative [Entamoeba invadens IP1]
          Length = 375

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 13/184 (7%)

Query: 138 DAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLV 197
           + FQ+Y+ E   F V R G ++G   ++  V V  IYEPPQ   +   ++ +D+     V
Sbjct: 194 NRFQKYILER-NFYVNRIGILFGEFAKNNNVRVEAIYEPPQVQEDNGDFVFKDQAIIANV 252

Query: 198 DAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSE 257
           D + A LG+ KVG I++      K+   L++RE + A +        +W    + L VS 
Sbjct: 253 DKMMAELGLSKVGVIYSHN---GKRKEILTSREAIAAAQRQI-----DWNPNCLTLVVSP 304

Query: 258 EGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFF 317
           + +A+   E +Q+S+  V L K+   +   A+ +   L   K+ V +  K++ EV+   F
Sbjct: 305 DKNAS-KVECYQVSEQLVDLVKKALVQPTQADNNSILL---KESVYLSQKEIMEVEPVVF 360

Query: 318 LVVV 321
           +  V
Sbjct: 361 VCNV 364


>gi|68071893|ref|XP_677860.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56498134|emb|CAH99046.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 516

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 76/272 (27%), Positives = 120/272 (44%), Gaps = 52/272 (19%)

Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R G+MYG   ED       R     IYEPPQ    + + +L D      VD IA  LG+
Sbjct: 240 QRVGWMYGYYKEDSHYNLGIRAVCECIYEPPQFCEGDKVRLLPDYFMNN-VDLIAEKLGL 298

Query: 207 KKVGFIFTQTIMQNKKDYTLSNREV---------LQAVEFHAECNMEEWVTAVVKLEVSE 257
           +K+G IFT      +K+Y  S   V         L+    +   ++  ++T  +  + S 
Sbjct: 299 EKIGIIFTHLP---RKEYLTSEELVNIAKLQLSNLKKNTHYTNYSISNFITCTISPDPSL 355

Query: 258 EGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV 309
                 +  AF +SD+ + L ++   + EI   DDP   K++          ++ GGK+ 
Sbjct: 356 NNEPITN--AFMVSDLGMSLIRDNLID-EIQ--DDPSHIKLRTPNKYELLPQILEGGKET 410

Query: 310 KEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALK-----SHLNRSPS 357
            + D D+F++ V   +   P        +  F  ENR    T+  +K     S LNR  +
Sbjct: 411 NKFDTDWFIIRV---NDSAPKIVRSIFKNYHFSRENRNKPQTVYDVKEYFSSSKLNRGVN 467

Query: 358 LPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
                R SDFHL++F A+ LD+ +   AL  C
Sbjct: 468 GN--SRCSDFHLIIFAAKVLDIET---ALVLC 494


>gi|303323093|ref|XP_003071538.1| NPL4 family protein [Coccidioides posadasii C735 delta SOWgp]
 gi|240111240|gb|EER29393.1| NPL4 family protein [Coccidioides posadasii C735 delta SOWgp]
 gi|320033362|gb|EFW15310.1| endoplasmic reticulum and nuclear membrane protein c Npl4
           [Coccidioides posadasii str. Silveira]
          Length = 657

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 93/322 (28%), Positives = 130/322 (40%), Gaps = 62/322 (19%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  I LR+ E
Sbjct: 276 INSLLDFWRKSGSQRLGFLYGTYEEYSEVPLGVKAVVQAIYEPPQ--VDEVDGITLREWE 333

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
            E+ VD +A   G++KVG IFT  +     D           Y LS+ EV  A    A  
Sbjct: 334 NERDVDEVAKLCGLEKVGVIFTDLLDSGLGDGTVISRRHIDSYYLSSLEVAFAARLQARY 393

Query: 242 -NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWF----------- 283
               +W       +  V   +S + + AI   A+Q S+  V + +               
Sbjct: 394 PKPSKWSETGRFGSNFVTCILSGDENGAISISAYQASNSAVEMVRADIVEPSADPSVMLV 453

Query: 284 --ETEIAEGDDPKLSKMK----KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL----- 330
             E E+   D      +     + V   G +V+E     F V  +   L H  P      
Sbjct: 454 QRENELENADTGNARYIPEVFYRKVNEYGANVQENAKPAFPVEYLFVTLTHGFPTEPSAL 513

Query: 331 --SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
              +TFPIENR     +Q      K  L+RS     ++ +S+FHLL F+     L+ +  
Sbjct: 514 FADTTFPIENREVIGESQDIRNVAKKLLSRSDPDGAIRAVSNFHLLCFMHGLGILSKEEE 573

Query: 385 ALAQCVQAQTAVPEGYKLLINS 406
           AL  C  A+T  P     LIN+
Sbjct: 574 ALL-CTVARTHDPADGVQLINT 594


>gi|159487106|ref|XP_001701576.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158271517|gb|EDO97334.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 102

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 29/71 (40%), Positives = 44/71 (61%), Gaps = 9/71 (12%)

Query: 113 IAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF 172
           +AK +++ RQ+ PH  ++SFDR+ A+ FQ Y+     F++KRGG +YGTVLE+       
Sbjct: 1   MAKVVKIERQDKPHVSAISFDRNAANVFQSYLQGAFNFSIKRGGILYGTVLEE------- 53

Query: 173 IYEPPQQGTEE 183
             E P+ G  E
Sbjct: 54  --EGPEPGKTE 62


>gi|47217659|emb|CAG03056.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 584

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/376 (24%), Positives = 147/376 (39%), Gaps = 125/376 (33%)

Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
           + + RQ+  H +++ F+ D  AD F  +  +T     +R G++YG   E K      R E
Sbjct: 119 ITLNRQKYRHVDNIMFENDTIADRFLDFWRKT---GNQRMGYLYGRYTEHKDIPLGIRAE 175

Query: 170 VNFIYEPPQQG--TEEVLYILR-----------------------DEEEEKLVDAIAAGL 204
           V  IYEPPQ    + +V +I+                        D+ + + VD IAA L
Sbjct: 176 VAAIYEPPQVNLFSHKVSFIMSTVCMVTLSQYLFQNATQNSLELIDDPKAEAVDEIAAKL 235

Query: 205 GMKKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEF---HAE-CNMEE---W 246
           G+ KVG+IFT  +            +NK  + LS  E + A  F   H+  C +     +
Sbjct: 236 GLCKVGWIFTDLLSEDTRIGTVRYTRNKDSHYLSAEECITAGYFQNLHSNPCRLSRDGYF 295

Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK------ 300
            +  V +  +      +HFE +Q+S+ C+ L ++   E  +   D P+L   K+      
Sbjct: 296 GSKFVTVVATGGPDNQVHFEGYQVSNQCMALVRD---ECLLPCKDAPELGFAKESSPEQY 352

Query: 301 --DVVVGGKDVKEVDNDFF-------LVVVKIL-----DHQGPLSST--FPIENR----T 340
             DV    KD    D  F         +++ I      D Q   SS   FPIENR     
Sbjct: 353 VPDVFYKDKDKFGNDVTFLARPLPVEYLIIDITTTFPKDPQYTFSSAQRFPIENRDILGE 412

Query: 341 TQVT-------------------------------------------MRALKSHLNRSPS 357
           TQ++                                             +L ++L++  S
Sbjct: 413 TQLSQLFSSRYRSSPPSAVMCRCNNKYSEELLLESSRPVTLEPLLNNFHSLSTYLSQCTS 472

Query: 358 LPLVKRISDFHLLLFL 373
              ++ +SDFHLLLFL
Sbjct: 473 TSFLEIVSDFHLLLFL 488


>gi|119189423|ref|XP_001245318.1| hypothetical protein CIMG_04759 [Coccidioides immitis RS]
 gi|121755658|sp|Q1DY54.1|NPL4_COCIM RecName: Full=Nuclear protein localization protein 4
          Length = 657

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/322 (28%), Positives = 130/322 (40%), Gaps = 62/322 (19%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  I LR+ E
Sbjct: 276 INSLLDFWRKSGSQRLGFLYGTYEEYSEVPLGVKAVVQAIYEPPQ--VDEVDGITLREWE 333

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
            E+ VD +A   G++KVG IFT  +     D           Y LS+ EV  A    A  
Sbjct: 334 NERDVDEVAKLCGLEKVGVIFTDLLDSGLGDGTVICRRHIDSYYLSSLEVAFAARLQARY 393

Query: 242 -NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWF----------- 283
               +W       +  V   +S + + AI   A+Q S+  V + +               
Sbjct: 394 PKPSKWSETGRFGSNFVTCILSGDENGAISISAYQASNSAVEMVRADIVEPSADPSVMLV 453

Query: 284 --ETEIAEGDDPKLSKMK----KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL----- 330
             E E+   D      +     + V   G +V+E     F V  +   L H  P      
Sbjct: 454 QRENELENADTGNARYIPEVFYRKVNEYGANVQENAKPAFPVEYLFVTLTHGFPTEPSAL 513

Query: 331 --SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
              +TFPIENR     +Q      K  L+RS     ++ +S+FHLL F+     L+ +  
Sbjct: 514 FADTTFPIENREVIGESQDIRNVAKKLLSRSDPDEAIRAVSNFHLLCFMHGLGILSKEEE 573

Query: 385 ALAQCVQAQTAVPEGYKLLINS 406
           AL  C  A+T  P     LIN+
Sbjct: 574 ALL-CTVARTHDPADGVQLINT 594


>gi|392868223|gb|EAS33974.2| nuclear protein localization protein 4 [Coccidioides immitis RS]
          Length = 608

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/322 (28%), Positives = 130/322 (40%), Gaps = 62/322 (19%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  I LR+ E
Sbjct: 276 INSLLDFWRKSGSQRLGFLYGTYEEYSEVPLGVKAVVQAIYEPPQ--VDEVDGITLREWE 333

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
            E+ VD +A   G++KVG IFT  +     D           Y LS+ EV  A    A  
Sbjct: 334 NERDVDEVAKLCGLEKVGVIFTDLLDSGLGDGTVICRRHIDSYYLSSLEVAFAARLQARY 393

Query: 242 -NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWF----------- 283
               +W       +  V   +S + + AI   A+Q S+  V + +               
Sbjct: 394 PKPSKWSETGRFGSNFVTCILSGDENGAISISAYQASNSAVEMVRADIVEPSADPSVMLV 453

Query: 284 --ETEIAEGDDPKLSKMK----KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL----- 330
             E E+   D      +     + V   G +V+E     F V  +   L H  P      
Sbjct: 454 QRENELENADTGNARYIPEVFYRKVNEYGANVQENAKPAFPVEYLFVTLTHGFPTEPSAL 513

Query: 331 --SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
              +TFPIENR     +Q      K  L+RS     ++ +S+FHLL F+     L+ +  
Sbjct: 514 FADTTFPIENREVIGESQDIRNVAKKLLSRSDPDEAIRAVSNFHLLCFMHGLGILSKEEE 573

Query: 385 ALAQCVQAQTAVPEGYKLLINS 406
           AL  C  A+T  P     LIN+
Sbjct: 574 ALL-CTVARTHDPADGVQLINT 594


>gi|307175902|gb|EFN65715.1| Nuclear protein localization protein 4-like protein [Camponotus
           floridanus]
          Length = 513

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 96/202 (47%), Gaps = 30/202 (14%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
           + RQ   H ++V F+   A   ++++N   +   +R GF+YG       V    R  V  
Sbjct: 287 LNRQTYRHVDNVMFEN--ASLVERFLNYWRSTGHQRIGFLYGRYEVHSDVPLGIRAVVAA 344

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
           IYEPPQ+ T++ + +L D E E LV+ +   L ++++G+IFT  I           ++N 
Sbjct: 345 IYEPPQESTKDTIRLLPD-EREALVEELGRTLNLRRIGWIFTDLIADDVKKGTVKHVRNI 403

Query: 222 KDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
           + + LS +E + A  F       C       + +  V + V+ +    +H E +Q+S+ C
Sbjct: 404 ESHFLSAQECIMAGYFQNRYPNLCRFSPNGLFGSKFVTVCVTGDDKNQVHMEGYQVSNQC 463

Query: 275 VRLFKEGWFETEIAEGDDPKLS 296
           + L ++G     +   D P+L 
Sbjct: 464 MALVRDGCL---VPTKDAPELG 482


>gi|407406442|gb|EKF30802.1| hypothetical protein MOQ_005377 [Trypanosoma cruzi marinkellei]
          Length = 393

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 66/278 (23%), Positives = 123/278 (44%), Gaps = 31/278 (11%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLED-KRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVD 198
           FQ +V + L ++++R G +YG        +EV+ IYEP Q G     + L D+     VD
Sbjct: 104 FQSFVRQ-LDWSLQRYGILYGKYDASLNLIEVHTIYEPEQHGDAYTFHYLPDKRLPD-VD 161

Query: 199 AIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL----QAVEFHAECNM----EEWVTAV 250
            IAA LG+++VG + T    ++ ++  LS+RE+L    +   F  EC +        T  
Sbjct: 162 RIAALLGLRRVGAVCTHP-PRDIEEMVLSSREILLCAREQSRFGDECVLLTMSPNLTTGR 220

Query: 251 VKLEVSEEGSAAIHF--------EAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDV 302
           ++ +  +    A+HF        ++   SD+    +       E+A+ +  +    +   
Sbjct: 221 IECQAWQTSPQAVHFYRLGALREKSDDYSDLESAKYVHSSIPLEVAQQETDEKGHPR--- 277

Query: 303 VVGGKDVKEVDNDFFLVVVKILDHQGPLSSTF------PIENRTTQVTMRALKSHLNRSP 356
           VV      ++D  +F   V +   + P+          P       + +R       R  
Sbjct: 278 VVTRAPSHDIDTRWFTSYVAVQQFESPVVRNLFMRVSRPGMEPPAMINLRNYMEDPKRR- 336

Query: 357 SLPLVKRISDFHLLLFLAR-FLDLNSDVPALAQCVQAQ 393
           ++PL+++++DFH+L+FLA     L  D+P +   +  Q
Sbjct: 337 NVPLIEKLADFHVLIFLAESVFSLADDMPVIIGAITKQ 374


>gi|350638102|gb|EHA26458.1| hypothetical protein ASPNIDRAFT_46653 [Aspergillus niger ATCC 1015]
          Length = 666

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 93/322 (28%), Positives = 131/322 (40%), Gaps = 70/322 (21%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E K V       V  IYEPPQ   +EV  + L +  
Sbjct: 277 INSLLDFWRKSGAQRLGFLYGTYEEYKEVPLGVKAVVQAIYEPPQ--VDEVDGVTLHEWP 334

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   G+++VG IFT  +   + D           Y LS+ E+  A    A+ 
Sbjct: 335 NEKEVDEVAHLCGLERVGVIFTDLLDAGRGDGSVICKRHIDSYYLSSLEIAFAARMQAQH 394

Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG  AI   A+Q S   V + +    E      
Sbjct: 395 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 450

Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
                +E DD        P++   K+ +  V   ++ K      +L+V   L H  P   
Sbjct: 451 VMLVQSEDDDTENKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEYLLVT--LTHGFPTVA 508

Query: 330 ----LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
                 S+FPIENR     +Q  +   K  +        ++ +SDFH+L FL      N 
Sbjct: 509 SPLFTDSSFPIENREVIGESQELVHVAKKLVTHGDPDKAIQAVSDFHMLCFLHSLSTFNK 568

Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
           D  AL   V  Q A  +G +L+
Sbjct: 569 DEEALLCRVATQRAPADGLQLI 590


>gi|317025379|ref|XP_001388958.2| nuclear protein localization protein 4 [Aspergillus niger CBS
           513.88]
          Length = 606

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 92/322 (28%), Positives = 131/322 (40%), Gaps = 70/322 (21%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E K V       V  IYEPPQ   +EV  + L +  
Sbjct: 277 INSLLDFWRKSGAQRLGFLYGTYEEYKEVPLGVKAVVQAIYEPPQ--VDEVDGVTLHEWP 334

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   G++++G IFT  +   + D           Y LS+ E+  A    A+ 
Sbjct: 335 NEKEVDEVAHLCGLERIGVIFTDLLDAGRGDGSVICKRHIDSYYLSSLEIAFAARMQAQH 394

Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG  AI   A+Q S   V + +    E      
Sbjct: 395 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 450

Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
                +E DD        P++   K+ +  V   ++ K      +L+V   L H  P   
Sbjct: 451 VMLVQSEDDDTENKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEYLLVT--LTHGFPTDA 508

Query: 330 ----LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
                 S+FPIENR     +Q  +   K  +        ++ +SDFH+L FL      N 
Sbjct: 509 SPLFTDSSFPIENREVIGESQELVHVAKKLVTHGDPDKAIQAVSDFHMLCFLHSLSTFNK 568

Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
           D  AL   V  Q A  +G +L+
Sbjct: 569 DEEALLCRVATQRAPADGLQLI 590


>gi|194741672|ref|XP_001953313.1| GF17266 [Drosophila ananassae]
 gi|190626372|gb|EDV41896.1| GF17266 [Drosophila ananassae]
          Length = 657

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 75/313 (23%), Positives = 135/313 (43%), Gaps = 66/313 (21%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNF 172
           + RQ   H ++V F+       ++++N       +R G++YGT  +        R +V  
Sbjct: 254 LNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHADVPLGIRAKVAA 311

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL 232
           IYEPPQ+ T + + I  DE  +++    AA LG+KK+G+IFT  I +   D ++   + +
Sbjct: 312 IYEPPQESTRDSINIQPDEGADEVEAVAAA-LGLKKIGWIFTDLITE---DASVGTVKQI 367

Query: 233 QAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAFQM 270
           + +E H     +E +TA                       V + V+ + +  +H E + +
Sbjct: 368 RGIETHF-LTAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDNTKQVHMEGYAV 426

Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVD 313
           S  C+ L ++      I   D P+L  +++        DV    KD          + + 
Sbjct: 427 SAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARPLP 483

Query: 314 NDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVK 362
            ++ LV V       P  +         FPIENR     ++   AL  +L+       ++
Sbjct: 484 VEYLLVDVPASTPLQPQYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEFLE 543

Query: 363 RISDFHLLLFLAR 375
            +SDFHLL++L +
Sbjct: 544 AVSDFHLLVYLYK 556


>gi|307193797|gb|EFN76470.1| Nuclear protein localization protein 4-like protein [Harpegnathos
           saltator]
          Length = 432

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 56/226 (24%), Positives = 106/226 (46%), Gaps = 35/226 (15%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
           + RQ   H ++V F+   A   ++++N      ++R G++YG       V    R  +  
Sbjct: 206 LNRQTYRHVDNVMFEN--ASLVERFLNYWRGTGLQRIGYLYGRYEVHSDVPLGVRAVIAA 263

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
           IYEPPQ+ T++ + +L D E E LV+ +   L + +VG+IFT  I           ++N 
Sbjct: 264 IYEPPQESTKDTIRLLPD-EREALVEELGRMLNLWRVGWIFTDLIADDVKKGTVKHVRNI 322

Query: 222 KDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
           + + LS +E + A  F  +    C       + +  V + V+ +    +H E +Q+S+ C
Sbjct: 323 ESHFLSAQECIMAGYFQNQYPNPCRFSPNGYFGSKFVTVCVTGDDKNQVHMEGYQVSNQC 382

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVV 320
           + L ++G     +   D P+L       V+   D + V + F+ ++
Sbjct: 383 MALVRDGCL---VPTKDAPELG-----YVIESTDKQYVPDVFYKLI 420


>gi|190359911|sp|A2Q8R9.1|NPL4_ASPNC RecName: Full=Nuclear protein localization protein 4
 gi|134055061|emb|CAK43702.1| unnamed protein product [Aspergillus niger]
          Length = 654

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 92/322 (28%), Positives = 131/322 (40%), Gaps = 70/322 (21%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E K V       V  IYEPPQ   +EV  + L +  
Sbjct: 265 INSLLDFWRKSGAQRLGFLYGTYEEYKEVPLGVKAVVQAIYEPPQ--VDEVDGVTLHEWP 322

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   G++++G IFT  +   + D           Y LS+ E+  A    A+ 
Sbjct: 323 NEKEVDEVAHLCGLERIGVIFTDLLDAGRGDGSVICKRHIDSYYLSSLEIAFAARMQAQH 382

Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG  AI   A+Q S   V + +    E      
Sbjct: 383 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 438

Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
                +E DD        P++   K+ +  V   ++ K      +L+V   L H  P   
Sbjct: 439 VMLVQSEDDDTENKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEYLLVT--LTHGFPTDA 496

Query: 330 ----LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
                 S+FPIENR     +Q  +   K  +        ++ +SDFH+L FL      N 
Sbjct: 497 SPLFTDSSFPIENREVIGESQELVHVAKKLVTHGDPDKAIQAVSDFHMLCFLHSLSTFNK 556

Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
           D  AL   V  Q A  +G +L+
Sbjct: 557 DEEALLCRVATQRAPADGLQLI 578


>gi|407847213|gb|EKG03039.1| hypothetical protein TCSYLVIO_005923 [Trypanosoma cruzi]
          Length = 393

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 122/278 (43%), Gaps = 31/278 (11%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLED-KRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVD 198
           FQ +V + L ++++R G +YG        +EV+ IYEP Q G     + L D+     VD
Sbjct: 104 FQSFVRQ-LDWSLQRYGILYGKYDASLNLIEVHAIYEPEQHGDAYTFHYLPDKRLPD-VD 161

Query: 199 AIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL----QAVEFHAECNM----EEWVTAV 250
            IAA LG+++VG + T    ++ ++  LS+RE+L    +   F  EC +        T  
Sbjct: 162 RIAALLGLRRVGAVCTHP-PRDVEEMVLSSREILLCAREQSRFGDECVLLTMSPSLTTGR 220

Query: 251 VKLEVSEEGSAAIHF--------EAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDV 302
           ++ +  +    A+HF        ++   SD+    +       E+A+ +  +    +   
Sbjct: 221 IECQAWQTSPQAVHFYRLGVLREKSDNYSDLESAKYVHSSIPLEVAQQETDEKGHPR--- 277

Query: 303 VVGGKDVKEVDNDFFLVVVKILDHQGPLSSTF------PIENRTTQVTMRALKSHLNRSP 356
           VV      ++D  +F   V +   + P+          P       + +R       R  
Sbjct: 278 VVTRAPSHDIDTRWFTSYVAVQQFESPVVRNLFMRVSRPGMEPPAMINLRNYMEDPKRR- 336

Query: 357 SLPLVKRISDFHLLLFLAR-FLDLNSDVPALAQCVQAQ 393
            +PL+++++DFH+L+FLA     +  D+P +   +  Q
Sbjct: 337 KVPLIEKLADFHVLIFLAESVFSVADDMPVIIGAITKQ 374


>gi|154280186|ref|XP_001540906.1| hypothetical protein HCAG_04746 [Ajellomyces capsulatus NAm1]
 gi|190359909|sp|A6R538.1|NPL4_AJECN RecName: Full=Nuclear protein localization protein 4
 gi|150412849|gb|EDN08236.1| hypothetical protein HCAG_04746 [Ajellomyces capsulatus NAm1]
          Length = 642

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 89/315 (28%), Positives = 126/315 (40%), Gaps = 55/315 (17%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L +   
Sbjct: 278 INSLLDFWRKSGTQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWGN 336

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAEC-NMEEWV----- 247
           EK VD +A   G++K+G IFT  +     D      E++ A +  A      +W      
Sbjct: 337 EKDVDEVAKFCGLEKIGVIFTDLLDAGAGD------EIVFAAQLQARYPKATKWSETGRF 390

Query: 248 -TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG---------DDPKLSK 297
            +  V   +S +   AI   A+Q S+  V + K    E     G          D   SK
Sbjct: 391 GSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQQENHSDDDASK 450

Query: 298 MK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL-------SSTFPIENR-- 339
            +       + V   G +V+E     F V  ++  L H  P        +STFPIENR  
Sbjct: 451 ARYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFNNSTFPIENREV 510

Query: 340 ---TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAV 396
              +  + M A K   +  P+  +   +SDFHLL FL     LN D   L   V      
Sbjct: 511 IGESQDLRMLADKLVSHGDPNKTICG-VSDFHLLCFLNSLGILNKDEEFLLCTVARSHDT 569

Query: 397 PEGYKLLINSMAETV 411
            +G +L+  S   T+
Sbjct: 570 ADGMQLINTSGWATL 584


>gi|71408582|ref|XP_806685.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70870504|gb|EAN84834.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 393

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 122/278 (43%), Gaps = 31/278 (11%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLED-KRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVD 198
           FQ +V + L ++++R G +YG        +EV+ IYEP Q G     + L D+     VD
Sbjct: 104 FQSFVRQ-LDWSLQRYGILYGKYDASLNLIEVHAIYEPEQHGDAYTFHYLPDKRLPN-VD 161

Query: 199 AIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL----QAVEFHAECNM----EEWVTAV 250
            IAA LG+++VG + T    ++ ++  LS+RE+L    +   F  EC +        T  
Sbjct: 162 RIAALLGLRRVGAVCTHP-PRDIEEMVLSSREILLCAREQSRFGDECVLLTMSPSLTTGR 220

Query: 251 VKLEVSEEGSAAIHF--------EAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDV 302
           ++ +  +    A+HF        ++   SD+    +       E+A+ +  +    +   
Sbjct: 221 IECQAWQTSPQAVHFYRLGVLREKSDDYSDLESAKYVHSSIPLEVAQQETDEKGHPR--- 277

Query: 303 VVGGKDVKEVDNDFFLVVVKILDHQGPLSSTF------PIENRTTQVTMRALKSHLNRSP 356
           VV      ++D  +F   V +   + P+          P       + +R       R  
Sbjct: 278 VVTMAPSHDIDTRWFTSYVAVQQFESPVVRNLFMRVSRPGMEPPAMINLRNYMEDPKRR- 336

Query: 357 SLPLVKRISDFHLLLFLAR-FLDLNSDVPALAQCVQAQ 393
            +PL+++++DFH+L+FLA     +  D+P +   +  Q
Sbjct: 337 KVPLIEKLADFHVLIFLAESVFSVADDMPVIIGAITKQ 374


>gi|391343438|ref|XP_003746017.1| PREDICTED: nuclear protein localization protein 4 homolog
           [Metaseiulus occidentalis]
          Length = 600

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 86/337 (25%), Positives = 142/337 (42%), Gaps = 78/337 (23%)

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFA----VKRGGFMYGTVLEDKRV------EV 170
           +Q+  H ++V F+        Q VN  L F     ++R G + G       V       V
Sbjct: 216 QQKYRHVDAVLFEN------SQIVNRFLDFWRQSLLQRVGILLGRYETYNEVPLGICARV 269

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
             IYEPPQ    +   +L D  EE  VD + A LG+++VG+IF   +  + +  T+    
Sbjct: 270 AAIYEPPQNSARDGFELLDDPNEET-VDELCAKLGLRRVGWIFCDLLPLDVQKGTV---H 325

Query: 231 VLQAVEFHAECNMEEWVTAV----------------------VKLEVSEEGSAAIHFEAF 268
            L+ V+F    + +E +TA                       V + V+ + +  IH EA+
Sbjct: 326 YLRDVDFSYLLSAQECITAAALQNKYPNPCKLSSDGYSGSKFVTVCVTGDKNNQIHMEAY 385

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
           Q+S+ C+ L K+      I   D P+L+ +K+        DV    KD          + 
Sbjct: 386 QVSNQCMALVKDDCL---IPTQDAPELAWVKESTPKKFVPDVFYKTKDRYGLEVQKVARP 442

Query: 312 VDNDFFLV---------VVKILDHQGPLSSTFPIENR--TTQVT-MRALKSHL-NRSPSL 358
           +  +F L+          V     +GP    F +ENR    QV  + A+  +L N + S 
Sbjct: 443 LPVEFLLIDLPVFSPNDSVNTFRSEGP---HFTVENRAFANQVQDLPAVNQYLRNFTASK 499

Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
            L+    DFH LL+LA    ++ ++  L + V+ + A
Sbjct: 500 DLLAACRDFHFLLYLATLDVVHLEMDKLLEAVKTKNA 536


>gi|357528824|sp|Q5BGN5.2|NPL4_EMENI RecName: Full=Nuclear protein localization protein 4
 gi|259489478|tpe|CBF89782.1| TPA: Nuclear protein localization protein 4
           [Source:UniProtKB/Swiss-Prot;Acc:Q5BGN5] [Aspergillus
           nidulans FGSC A4]
          Length = 652

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 97/330 (29%), Positives = 130/330 (39%), Gaps = 82/330 (24%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    G++YGT  E   V       V  IYEPPQ   +EV  + L + E
Sbjct: 276 INSLLDFWRKSGTQRLGYLYGTYEEYDEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWE 333

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFH-AE 240
            EK VD IA   G++KVG IFT  +   + D           Y LS+ E+  A      +
Sbjct: 334 NEKDVDEIARLCGLEKVGVIFTDLLDAGRGDGSVLCKRHIDSYYLSSLEIAFASRLQMQQ 393

Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
                W          VT V  L   EEG  AI   ++Q S   V + +    E      
Sbjct: 394 PKATRWSRTGYFGSNFVTCV--LSGDEEG--AITISSYQASVAAVEMIRADIIEPSAEPS 449

Query: 288 -----AEGDDPKLSKMKKDVV------------VGGKDVKEVDNDFFLVVVKILDHQGP- 329
                +E DD   S+   +V             V  K    V+   +L+V   L H  P 
Sbjct: 450 VMLVQSEDDDTNKSRYIPEVFYRKINEYGVSAQVNAKPAFPVE---YLLVT--LTHGFPT 504

Query: 330 ------LSSTFPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARF 376
                 + ST+PIENR             R L SH +   +   ++ +SDFHLL FL  F
Sbjct: 505 ESKPLFIDSTYPIENREVIGEGQEFHSLARKLVSHGDPQKA---IRAVSDFHLLCFLRTF 561

Query: 377 LDLNSDVPALAQCVQAQTAVPEGYKLLINS 406
              + +  AL  C  A T  P     LIN+
Sbjct: 562 STFSKEEEALL-CRVATTQNPTDGLQLINT 590


>gi|67516027|ref|XP_657899.1| hypothetical protein AN0295.2 [Aspergillus nidulans FGSC A4]
 gi|40746545|gb|EAA65701.1| hypothetical protein AN0295.2 [Aspergillus nidulans FGSC A4]
          Length = 649

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 97/330 (29%), Positives = 130/330 (39%), Gaps = 82/330 (24%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    G++YGT  E   V       V  IYEPPQ   +EV  + L + E
Sbjct: 273 INSLLDFWRKSGTQRLGYLYGTYEEYDEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWE 330

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFH-AE 240
            EK VD IA   G++KVG IFT  +   + D           Y LS+ E+  A      +
Sbjct: 331 NEKDVDEIARLCGLEKVGVIFTDLLDAGRGDGSVLCKRHIDSYYLSSLEIAFASRLQMQQ 390

Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
                W          VT V  L   EEG  AI   ++Q S   V + +    E      
Sbjct: 391 PKATRWSRTGYFGSNFVTCV--LSGDEEG--AITISSYQASVAAVEMIRADIIEPSAEPS 446

Query: 288 -----AEGDDPKLSKMKKDVV------------VGGKDVKEVDNDFFLVVVKILDHQGP- 329
                +E DD   S+   +V             V  K    V+   +L+V   L H  P 
Sbjct: 447 VMLVQSEDDDTNKSRYIPEVFYRKINEYGVSAQVNAKPAFPVE---YLLVT--LTHGFPT 501

Query: 330 ------LSSTFPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARF 376
                 + ST+PIENR             R L SH +   +   ++ +SDFHLL FL  F
Sbjct: 502 ESKPLFIDSTYPIENREVIGEGQEFHSLARKLVSHGDPQKA---IRAVSDFHLLCFLRTF 558

Query: 377 LDLNSDVPALAQCVQAQTAVPEGYKLLINS 406
              + +  AL  C  A T  P     LIN+
Sbjct: 559 STFSKEEEALL-CRVATTQNPTDGLQLINT 587


>gi|149245254|ref|XP_001527151.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|190359912|sp|A5DX93.1|NPL4_LODEL RecName: Full=Nuclear protein localization protein 4
 gi|146449545|gb|EDK43801.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 600

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/161 (30%), Positives = 80/161 (49%), Gaps = 25/161 (15%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKL 196
           ++N   +  V+R G++YG   +  +V       V  IYEPPQ    + L +L D E+E +
Sbjct: 276 FINVWRSTGVQRFGYLYGRYEKFDKVPMGIKAVVEAIYEPPQHDELDGLTLL-DWEDEPI 334

Query: 197 VDAIAAGLGMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAVEFHAE----- 240
           VDAIAA LG++KVG +FT          T++  ++K  Y L+N E++ A +F  +     
Sbjct: 335 VDAIAAKLGLQKVGIVFTDLTDSGNRDGTVLCKRHKDSYFLTNLEIIMAAKFQIKNPNIT 394

Query: 241 --CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFK 279
              N+ E+ +  V   +S      I   ++Q+S     L K
Sbjct: 395 KYANLGEFSSKFVTCVISGGLQGEIEPRSYQVSSSAEGLVK 435


>gi|327296223|ref|XP_003232806.1| nuclear protein localization protein 4 [Trichophyton rubrum CBS
           118892]
 gi|326465117|gb|EGD90570.1| nuclear protein localization protein 4 [Trichophyton rubrum CBS
           118892]
          Length = 652

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/321 (26%), Positives = 128/321 (39%), Gaps = 67/321 (20%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  + L +  
Sbjct: 278 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWN 335

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   GM+K+G IFT  I     D           Y LS+ EV+ A +  A  
Sbjct: 336 NEKDVDHVAKLCGMEKIGVIFTDLIDAGAGDGTVVCKRHIDSYYLSSLEVVFASQLQARN 395

Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
              C   E   + +  V   +S + + AI   A+Q S+  V + +    E       DP 
Sbjct: 396 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 451

Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
           +  ++++  +   ++     + F        V + ++  P                    
Sbjct: 452 VMLVQQEDELNNANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 511

Query: 330 ---LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
                ++FPIENR     +Q   R      + S    ++K +S+FHLL FL     LN D
Sbjct: 512 TVFTDTSFPIENREFIGESQDLRRVASKLTDSSDPNTIIKAVSNFHLLTFLHGMGILNED 571

Query: 383 VPALAQCVQAQTAVPEGYKLL 403
             AL   V  +    +G +LL
Sbjct: 572 EEALLCTVATKHDPADGIQLL 592


>gi|326477582|gb|EGE01592.1| nuclear protein localization protein 4 [Trichophyton equinum CBS
           127.97]
          Length = 652

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/321 (26%), Positives = 127/321 (39%), Gaps = 67/321 (20%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  + L +  
Sbjct: 278 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWN 335

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   GM+K+G IFT  I     D           Y LS+ EV+ A +  A  
Sbjct: 336 NEKDVDYVAKLCGMEKIGVIFTDLIDAGAGDGTVVCKRHIDSYYLSSLEVVFASQLQARN 395

Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
              C   E   + +  V   +S + + AI   A+Q S+  V + +    E       DP 
Sbjct: 396 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 451

Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
           +  ++++  +   ++     + F        V + ++  P                    
Sbjct: 452 IMLVQQEDELDNANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 511

Query: 330 ---LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
                 +FPIENR     +Q   R      + S    ++K +S+FHLL FL     LN D
Sbjct: 512 TVFTDPSFPIENREIIGESQDLRRVASKLTDSSDPNAVIKAVSNFHLLTFLHGMGILNED 571

Query: 383 VPALAQCVQAQTAVPEGYKLL 403
             AL   V  +    +G +LL
Sbjct: 572 EEALLCAVATKHDPADGIQLL 592


>gi|302503091|ref|XP_003013506.1| hypothetical protein ARB_00324 [Arthroderma benhamiae CBS 112371]
 gi|291177070|gb|EFE32866.1| hypothetical protein ARB_00324 [Arthroderma benhamiae CBS 112371]
          Length = 652

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 89/322 (27%), Positives = 131/322 (40%), Gaps = 69/322 (21%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  + L +  
Sbjct: 278 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWN 335

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD IA   GM+K+G IFT  I     D           Y LS+ EV+ A +  A  
Sbjct: 336 NEKDVDHIAKLCGMEKIGVIFTDLIDAGAGDGTVVCKRHIDSYYLSSLEVVFASQLQARN 395

Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
              C   E   + +  V   +S + + AI   A+Q S+  V + +    E       DP 
Sbjct: 396 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 451

Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
           +  ++++  +   ++     + F        V + ++  P                    
Sbjct: 452 VMLVQQEDELDNANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 511

Query: 330 ---LSSTFPIENRTT---QVTMRALKSHLNRSPSLP--LVKRISDFHLLLFLARFLDLNS 381
                S+FPIENR        +R + S L  S S P  ++K +S+FHLL FL     L+ 
Sbjct: 512 TVFTDSSFPIENREVIGESQDLRRVASKLTDS-SDPNAVIKAVSNFHLLTFLHGMGILSE 570

Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
           D  AL   V  +    +G +LL
Sbjct: 571 DEEALLCTVATKHDPADGIQLL 592


>gi|452001809|gb|EMD94268.1| hypothetical protein COCHEDRAFT_1028190 [Cochliobolus
           heterostrophus C5]
          Length = 1071

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 86/324 (26%), Positives = 137/324 (42%), Gaps = 66/324 (20%)

Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRD 190
           Q V++ + F      +R GF+YG   E   V       V  IYEPPQ    EV  I L D
Sbjct: 259 QVVDDLINFWRQTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQ--INEVDGISLGD 316

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHA 239
            + E+++D IAA  G+++VG IFT  +  +K D           Y LS+ EV  A  + A
Sbjct: 317 WDNEQVIDDIAAQCGLQRVGVIFTDLLDADKGDGSVICKRHIDSYYLSSLEVCFAARYQA 376

Query: 240 EC-NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI----- 287
           +     +W       +  V   +S +    I   ++Q+S+  V + +    E        
Sbjct: 377 KYPRPSKWSDTGRHGSNFVTCVISGDDQGQIGISSYQVSNDAVEMVRADIIEPSAEPKVM 436

Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP------ 329
              +E D+  L++ +       + +   G +V+E+    F V  ++  L H  P      
Sbjct: 437 LVQSEEDNEALNRTRYIPEVFYRKINEHGANVQEIAKPDFPVEYLLLTLTHGFPTQPNPL 496

Query: 330 -LSSTFPIENRTTQVTMRALKSHLN-----RSPSLPL-----VKRISDFHLLLFLARFLD 378
                FPIENR     M  + +HL+     ++  L L     +K IS+FH+L F+     
Sbjct: 497 FTGGKFPIENREAMGEMPEI-THLSKALNAKANGLALNTASGLKAISNFHMLCFIHNLGI 555

Query: 379 LNSDVPALAQCVQAQTAVPEGYKL 402
           L+ D  +L   V +     EG  L
Sbjct: 556 LSKDEESLLFKVASTHDTSEGSAL 579


>gi|451849988|gb|EMD63291.1| hypothetical protein COCSADRAFT_161800 [Cochliobolus sativus
           ND90Pr]
          Length = 655

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 86/324 (26%), Positives = 137/324 (42%), Gaps = 66/324 (20%)

Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRD 190
           Q V++ + F      +R GF+YG   E   V       V  IYEPPQ    EV  I L D
Sbjct: 259 QVVDDLINFWRQTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQ--INEVDGISLGD 316

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHA 239
            + E+++D IAA  G+++VG IFT  +  +K D           Y LS+ EV  A  + A
Sbjct: 317 WDNEQVIDDIAAQCGLQRVGVIFTDLLDADKGDGSVICKRHIDSYYLSSLEVCFAARYQA 376

Query: 240 EC-NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI----- 287
           +     +W       +  V   +S +    I   ++Q+S+  V + +    E        
Sbjct: 377 KYPRPSKWSDTGRHGSNFVTCVISGDDQGQIGISSYQVSNDAVEMVRADIIEPSAEPKVM 436

Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP------ 329
              +E D+  L++ +       + +   G +V+E+    F V  ++  L H  P      
Sbjct: 437 LVQSEEDNEALNRTRYIPEVFYRKINEHGANVQEIAKPDFPVEYLLLTLTHGFPTQPNPL 496

Query: 330 -LSSTFPIENRTTQVTMRALKSHLN-----RSPSLPL-----VKRISDFHLLLFLARFLD 378
                FPIENR     M  + +HL+     ++  L L     +K IS+FH+L F+     
Sbjct: 497 FTGGKFPIENREAMGEMPEI-THLSKALNAKANGLALNTASGLKAISNFHMLCFIHNLGI 555

Query: 379 LNSDVPALAQCVQAQTAVPEGYKL 402
           L+ D  +L   V +     EG  L
Sbjct: 556 LSKDEESLLFKVASTHDTSEGSAL 579


>gi|326474161|gb|EGD98170.1| nuclear protein localization protein 4 [Trichophyton tonsurans CBS
           112818]
          Length = 641

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 85/321 (26%), Positives = 127/321 (39%), Gaps = 67/321 (20%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  + L +  
Sbjct: 267 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWN 324

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   GM+K+G IFT  I     D           Y LS+ EV+ A +  A  
Sbjct: 325 NEKDVDYVAKLCGMEKIGVIFTDLIDAGAGDGTVVCKRHIDSYYLSSLEVVFASQLQARN 384

Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
              C   E   + +  V   +S + + AI   A+Q S+  V + +    E       DP 
Sbjct: 385 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 440

Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
           +  ++++  +   ++     + F        V + ++  P                    
Sbjct: 441 VMLVQQEDELDNANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 500

Query: 330 ---LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
                 +FPIENR     +Q   R      + S    ++K +S+FHLL FL     LN D
Sbjct: 501 TVFTDPSFPIENREIIGESQDLRRVASKLTDSSDPNAVIKAVSNFHLLTFLHGMGILNED 560

Query: 383 VPALAQCVQAQTAVPEGYKLL 403
             AL   V  +    +G +LL
Sbjct: 561 EEALLCAVATKHDPADGIQLL 581


>gi|218511719|sp|Q6BRJ9.2|NPL4_DEBHA RecName: Full=Nuclear protein localization protein 4
          Length = 566

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 77/287 (26%), Positives = 121/287 (42%), Gaps = 47/287 (16%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
             ++++      V+R G +YG   +  +V       V  IYEPPQ G  + L +L   E 
Sbjct: 246 LNKFIDSWRTTGVQRFGILYGRYEQFDKVPLGIKAVVEAIYEPPQSGELDGLTLL-PWEN 304

Query: 194 EKLVDAIAAGLGMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAVEFHA--- 239
           E+ VD IAA LG+ KVG  FT          T++  ++K  Y LS  EV+ A ++     
Sbjct: 305 ERQVDEIAASLGLYKVGMTFTDLTDSGAKNGTVLCKRHKNSYFLSCLEVIMAAKYQVSNP 364

Query: 240 ----ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE----GWFETEIAEGD 291
                 N   + +  V   +S   +  I   ++Q+S+    L K     G  +  +   +
Sbjct: 365 NITKHSNSGRFSSKFVTCVISGGMNGEIEPRSYQVSNSAEALIKADIITGSTQPSMLYIN 424

Query: 292 DPKLSKMKKDVVVG-----GKDVKEVDN-----DFFLVVVK---ILDHQGPLSSTFPIEN 338
           D +  +   DV        G +VK         +F LV +     LD        FPIEN
Sbjct: 425 DSEGKRYVPDVFYSKINEYGLEVKTNAKPAFPVEFLLVSLSDSFPLDPSPMFRENFPIEN 484

Query: 339 RTTQVTMRALKS---HLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
           R     ++ LKS   +LN  P      ++ DFH L ++A+   L+ D
Sbjct: 485 RDFMGDLQDLKSAYNYLNADPG--DGSQLFDFHFLTYIAKTGILSHD 529


>gi|440640577|gb|ELR10496.1| nuclear protein localization protein 4, variant [Geomyces
           destructans 20631-21]
          Length = 612

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 124/314 (39%), Gaps = 67/314 (21%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G+MYG   E  +V       V  IYEPPQ   +E+  I L +   EK VD +A   G
Sbjct: 287 QRLGYMYGRYEEYTKVPLGIKAVVEAIYEPPQH--DEIDGITLNEWTNEKDVDGVARLCG 344

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
           ++KVG I+T  +     D           Y LS+ E+       A+        +   + 
Sbjct: 345 LQKVGVIWTDLLDSGGGDGSVVCKRHIDSYFLSSLEIAFISRLQAQHPKPTKWSDTGRFG 404

Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG---------DDPKLSKM 298
           +  V   V+ +    I   A+Q S+  V + +    E     G         DD  +S+ 
Sbjct: 405 SNFVTCVVTGDEQGQISISAYQASNSAVEMIRADIVEPSADPGVMIVWAEEEDDASVSRT 464

Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP---------LSSTFPIENRT 340
           +       + +   G  V+E     F V  ++  L H  P           S FP+ENR 
Sbjct: 465 RYIPEVFFRKINEYGAAVQENAKPSFPVEYLLVTLTHGFPQEPKPAFTAPPSAFPVENRE 524

Query: 341 TQVTMRALKS-----------HLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
                +  KS            L RS +   ++ +SDFHLL FL     LN D  AL   
Sbjct: 525 AIGESQEYKSVSQAIGLRPDGQLKRSDTG--IEAVSDFHLLCFLRNMAILNKDEEALLCR 582

Query: 390 VQAQTAVPEGYKLL 403
           V  Q  + +GY+LL
Sbjct: 583 VATQHDLADGYQLL 596


>gi|294656853|ref|XP_459171.2| DEHA2D15818p [Debaryomyces hansenii CBS767]
 gi|199431790|emb|CAG87342.2| DEHA2D15818p [Debaryomyces hansenii CBS767]
          Length = 576

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 77/287 (26%), Positives = 121/287 (42%), Gaps = 47/287 (16%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
             ++++      V+R G +YG   +  +V       V  IYEPPQ G  + L +L   E 
Sbjct: 256 LNKFIDSWRTTGVQRFGILYGRYEQFDKVPLGIKAVVEAIYEPPQSGELDGLTLL-PWEN 314

Query: 194 EKLVDAIAAGLGMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAVEFHA--- 239
           E+ VD IAA LG+ KVG  FT          T++  ++K  Y LS  EV+ A ++     
Sbjct: 315 ERQVDEIAASLGLYKVGMTFTDLTDSGAKNGTVLCKRHKNSYFLSCLEVIMAAKYQVSNP 374

Query: 240 ----ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE----GWFETEIAEGD 291
                 N   + +  V   +S   +  I   ++Q+S+    L K     G  +  +   +
Sbjct: 375 NITKHSNSGRFSSKFVTCVISGGMNGEIEPRSYQVSNSAEALIKADIITGSTQPSMLYIN 434

Query: 292 DPKLSKMKKDVVVG-----GKDVKEVDN-----DFFLVVVK---ILDHQGPLSSTFPIEN 338
           D +  +   DV        G +VK         +F LV +     LD        FPIEN
Sbjct: 435 DSEGKRYVPDVFYSKINEYGLEVKTNAKPAFPVEFLLVSLSDSFPLDPSPMFRENFPIEN 494

Query: 339 RTTQVTMRALKS---HLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
           R     ++ LKS   +LN  P      ++ DFH L ++A+   L+ D
Sbjct: 495 RDFMGDLQDLKSAYNYLNADPG--DGSQLFDFHFLTYIAKTGILSHD 539


>gi|315051616|ref|XP_003175182.1| NPL4 [Arthroderma gypseum CBS 118893]
 gi|311340497|gb|EFQ99699.1| NPL4 [Arthroderma gypseum CBS 118893]
          Length = 652

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 86/322 (26%), Positives = 131/322 (40%), Gaps = 69/322 (21%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  + L + E
Sbjct: 278 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWE 335

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   GM+K+G IFT  I     D           Y LS+ EV+ A +F A  
Sbjct: 336 NEKDVDHVAKLCGMQKIGVIFTDLIDAGAGDGTVICKRHIDSYYLSSLEVVFASQFQARN 395

Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
              C   E   + +  V   +S + + AI   A+Q S+  V + +    E       DP 
Sbjct: 396 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 451

Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
           +  ++++  +   ++     + F        V + ++  P                    
Sbjct: 452 VMLVQQEDELNDANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 511

Query: 330 ---LSSTFPIENRTT---QVTMRALKSHLNRS--PSLPLVKRISDFHLLLFLARFLDLNS 381
                S+FPIENR        +R + + L  S  P +  +K +S+FHLL FL     L+ 
Sbjct: 512 TVFTDSSFPIENREIIGESQDLRRVATKLTGSNNPDV-AIKAVSNFHLLTFLHGMGILSQ 570

Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
           D   L   V  +    +G +L+
Sbjct: 571 DEETLLCTVATKHDPADGIQLI 592


>gi|212539570|ref|XP_002149940.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Talaromyces marneffei ATCC 18224]
 gi|210067239|gb|EEA21331.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Talaromyces marneffei ATCC 18224]
          Length = 606

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 139/329 (42%), Gaps = 67/329 (20%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    G++YGT  E   V       V  IYEPPQ   +EV  I L +  
Sbjct: 276 INGLLDFWRKSGTQRLGYLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGITLNEWT 333

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            E  +D +A   G+++VG IFT  +   K D           Y LS+ E++ A    A+ 
Sbjct: 334 NEAEIDEVAQLCGLQRVGVIFTDLLDTGKGDGSVICKRHADSYYLSSLEIVFASRQQAQH 393

Query: 241 CNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------- 287
               +W       +  V   +S + + AI   ++Q S+  V + +    E          
Sbjct: 394 PKATKWSETGRFGSNFVTCVLSGDENGAIAVSSYQASNSAVEMVRADIIEPSADPSVMLV 453

Query: 288 -AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------L 330
            +E DD   SK++       + +   G +V+E     F V  ++  L H  P       +
Sbjct: 454 QSEDDDDLGSKLRYIPEVFYRRINEYGANVQENAKPSFPVEYLLVTLTHGFPTEESPTFV 513

Query: 331 SSTFPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDV 383
            STFPIENR         +   + L +H + + +   ++ +SDFHLL FL      N++ 
Sbjct: 514 DSTFPIENREVIGESQELRTLAKKLVAHGDPNKA---IRAVSDFHLLAFLRGLGTFNAEE 570

Query: 384 PALAQCVQAQTAVP-EGYKLLINSMAETV 411
             L  C  A T  P EG +L+  S   T+
Sbjct: 571 EKLL-CRVATTHDPAEGMQLVNTSGWATL 598


>gi|440640578|gb|ELR10497.1| nuclear protein localization protein 4 [Geomyces destructans
           20631-21]
          Length = 640

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 84/314 (26%), Positives = 124/314 (39%), Gaps = 67/314 (21%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G+MYG   E  +V       V  IYEPPQ   +E+  I L +   EK VD +A   G
Sbjct: 287 QRLGYMYGRYEEYTKVPLGIKAVVEAIYEPPQH--DEIDGITLNEWTNEKDVDGVARLCG 344

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
           ++KVG I+T  +     D           Y LS+ E+       A+        +   + 
Sbjct: 345 LQKVGVIWTDLLDSGGGDGSVVCKRHIDSYFLSSLEIAFISRLQAQHPKPTKWSDTGRFG 404

Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG---------DDPKLSKM 298
           +  V   V+ +    I   A+Q S+  V + +    E     G         DD  +S+ 
Sbjct: 405 SNFVTCVVTGDEQGQISISAYQASNSAVEMIRADIVEPSADPGVMIVWAEEEDDASVSRT 464

Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP---------LSSTFPIENRT 340
           +       + +   G  V+E     F V  ++  L H  P           S FP+ENR 
Sbjct: 465 RYIPEVFFRKINEYGAAVQENAKPSFPVEYLLVTLTHGFPQEPKPAFTAPPSAFPVENRE 524

Query: 341 TQVTMRALKS-----------HLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
                +  KS            L RS +   ++ +SDFHLL FL     LN D  AL   
Sbjct: 525 AIGESQEYKSVSQAIGLRPDGQLKRSDTG--IEAVSDFHLLCFLRNMAILNKDEEALLCR 582

Query: 390 VQAQTAVPEGYKLL 403
           V  Q  + +GY+LL
Sbjct: 583 VATQHDLADGYQLL 596


>gi|198418032|ref|XP_002123974.1| PREDICTED: similar to Nuclear protein localization protein 4
           homolog (Protein NPL4) [Ciona intestinalis]
          Length = 565

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 80/323 (24%), Positives = 142/323 (43%), Gaps = 59/323 (18%)

Query: 102 SFGRKMTMDDLIAKQMRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYG 160
           S  +K ++ DL +  + + RQ+  H +++ F +R+  D F  Y  ++     +R G++YG
Sbjct: 176 SNSKKHSVVDLPS-AVTLNRQKFRHVDNIMFENREIVDRFLNYWRKS---GNQRMGYLYG 231

Query: 161 TVLEDKRV------EVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFT 214
                  V      EV  IYEPPQ+ T+  L    D  EE  V+ IA  L ++KVG+I +
Sbjct: 232 RYSPHSEVPLGIKAEVCAIYEPPQESTKNWLQFHPDPNEEA-VEYIAKKLNLRKVGWILS 290

Query: 215 QTIMQNKKDYTLSN-----------REVLQAVEFHAE----CNME---EWVTAVVKLEVS 256
              ++++   T+ N            E + A  F       C +    ++ +    + VS
Sbjct: 291 DLTVEDRSKGTVKNTRHSDTHFLTAEECITAATFQNNYPNPCKLASDGKYGSKFATVIVS 350

Query: 257 EEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD 308
            +    I FE +Q+S+  + L  +   E  +   D P+L  +++        DV    KD
Sbjct: 351 GDNDCQISFEGYQVSNQGMSLVAD---ECLLPTLDAPELGYIRESSPELFVSDVYYKEKD 407

Query: 309 ---------VKEVDNDFFLVVVKI---LDHQGPLSST---FPIENRTTQVTMRALKS--- 350
                     + +  ++ L  V     L+ Q   S+    FP+  R T   +++L S   
Sbjct: 408 KYGNEITKLARPLPLEYLLTDVPTAFSLNSQYSCSNAVAGFPVPYRDTIGEVQSLSSLSK 467

Query: 351 HLNRSPSLPLVKRISDFHLLLFL 373
           +LN+      V  +SDF+LL++L
Sbjct: 468 YLNQFKPDQFVDAVSDFNLLVYL 490


>gi|242804178|ref|XP_002484322.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Talaromyces stipitatus ATCC 10500]
 gi|218717667|gb|EED17088.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Talaromyces stipitatus ATCC 10500]
          Length = 607

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 87/328 (26%), Positives = 136/328 (41%), Gaps = 65/328 (19%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    G++YGT  E   V       V  IYEPPQ   +EV  I L +  
Sbjct: 277 INGLLDFWRKSGAQRLGYLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGITLNEWT 334

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            E  +D +A   G+++VG IFT  +   K D           Y LS+ E++ A    A+ 
Sbjct: 335 NEAEIDEVARLCGLQRVGVIFTDLLDAGKGDGTVVCKRHVDSYYLSSLEIVFASRQQAQH 394

Query: 241 CNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------- 287
               +W       +  V   +S + + AI   ++Q S+  V + +    E          
Sbjct: 395 PKATKWSETGRFGSNFVTCVLSGDENGAITVSSYQASNSAVEMVRADIIEPSADPSVMLV 454

Query: 288 -AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------L 330
            +E DD   SK++       + +   G +V+E     F V  ++  L H  P       +
Sbjct: 455 QSEDDDDPGSKLRYIPEVFYRRINEYGANVQENAKPSFPVEYLLVTLTHGFPTEESPMFI 514

Query: 331 SSTFPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDV 383
            STFPIENR         +   + L +H + + +   +  +SDFHLL FL      N++ 
Sbjct: 515 DSTFPIENREVIGESQELRTLAKKLVAHGDPNKA---ISAVSDFHLLAFLRSLGTFNAEE 571

Query: 384 PALAQCVQAQTAVPEGYKLLINSMAETV 411
             L   V +     EG +L+  S   T+
Sbjct: 572 ERLLYRVASTHDPAEGMQLVNTSGWATL 599


>gi|407929695|gb|EKG22507.1| NPL4 zinc-binding putative, partial [Macrophomina phaseolina MS6]
          Length = 598

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 90/326 (27%), Positives = 133/326 (40%), Gaps = 74/326 (22%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YG   E   V       V  IYEPPQ G  + +  L D + 
Sbjct: 264 INTLLDFWRKSGCQRLGFLYGRYEEYTEVPLGTKAVVEAIYEPPQVGEVDGIS-LNDWDN 322

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-C 241
           EK +D +A   G+++VG IFT  +     D           Y LS+ E   A  +  +  
Sbjct: 323 EKEIDEVARMCGLERVGVIFTDLLDAGAGDGSVVCKRHADSYYLSSLETCFAARYQGKFP 382

Query: 242 NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---- 287
              +W          VT +V  +  EEG   I   A+Q S+  V + +    E       
Sbjct: 383 RPSKWSETGRFGSNFVTCIVSGD--EEGQIGI--SAYQASNAAVEMVRADIIEPSADPTM 438

Query: 288 ----AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL---- 330
                E ++ +L + +       + V   G +V+E     F V  +   L H  P     
Sbjct: 439 MLVQNEEEESELGRRRYIPEVFYRKVNEYGANVQESAKPAFPVEYLFVTLTHGFPTNPQP 498

Query: 331 ---SSTFPIENR-----TTQVTMRALKSHLNRSPSLPLVK------RISDFHLLLFLARF 376
              +++FPIENR     + +VT  AL   LN   S P +        ++DFHLL F+   
Sbjct: 499 KFPNTSFPIENREVLGISQEVT--ALAKQLNAKSSGPKLSDATGLAAVADFHLLTFIHGL 556

Query: 377 LDLNSDVPALAQCVQAQTAVPEGYKL 402
             L+ D  AL   V A+  V +G +L
Sbjct: 557 GILHKDEEALLCRVAAEHNVSDGAQL 582


>gi|330921786|ref|XP_003299562.1| hypothetical protein PTT_10586 [Pyrenophora teres f. teres 0-1]
 gi|311326697|gb|EFQ92338.1| hypothetical protein PTT_10586 [Pyrenophora teres f. teres 0-1]
          Length = 655

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 84/323 (26%), Positives = 133/323 (41%), Gaps = 64/323 (19%)

Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDE 191
           Q V++ + F      +R GF+YG   E   V       V  IYEPPQ    + +  L + 
Sbjct: 259 QVVDDLINFWRQTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQVNEADGIS-LGEW 317

Query: 192 EEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE 240
           + EK +D IAA  G+++VG IFT  +  N  D           Y L++ EV  A  + A+
Sbjct: 318 DNEKEIDEIAAQCGLQRVGVIFTDLLDANNGDGSVICKRHMDSYYLASLEVCFAARYQAK 377

Query: 241 C-NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
                +W       +  V   +S +    I   ++Q+S+  V + +    E         
Sbjct: 378 YPRPSKWSDTGRHGSNFVTCVISGDDKGQIAISSYQVSNDAVEMVRADIIEPSAEPSVML 437

Query: 288 --AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST--- 333
             +E DD  L + +       + +   G +V+EV    F V  ++  L H  P   +   
Sbjct: 438 VQSEEDDEALGRARYIPEVFYRKINEHGANVQEVAKPAFPVEYLLLTLTHGFPTQPSPLF 497

Query: 334 ----FPIENRTTQVTMRALKSHLN-----RSPSLPL-----VKRISDFHLLLFLARFLDL 379
               FPIENR     +  + SHL+     ++  L L     +  IS+FH+L F+     L
Sbjct: 498 TGGKFPIENREAMGEVPEV-SHLSKALNAKANGLALNTASDLNAISNFHMLCFIHNLGVL 556

Query: 380 NSDVPALAQCVQAQTAVPEGYKL 402
           + D  AL   V +     EG  L
Sbjct: 557 SKDEEALLFKVASSHDTSEGSAL 579


>gi|302659770|ref|XP_003021572.1| hypothetical protein TRV_04314 [Trichophyton verrucosum HKI 0517]
 gi|291185476|gb|EFE40954.1| hypothetical protein TRV_04314 [Trichophyton verrucosum HKI 0517]
          Length = 582

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 85/321 (26%), Positives = 127/321 (39%), Gaps = 67/321 (20%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  + L +  
Sbjct: 208 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWN 265

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD IA   GM+K+G IFT  I     D           Y LS+ EV+ A +  A  
Sbjct: 266 NEKDVDHIAKLCGMEKIGVIFTDLIDAGAGDGTVVCKRHIDSYYLSSLEVVFASQLQARN 325

Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
              C   E   + +  V   +S + + AI   A+Q S+  V + +    E       DP 
Sbjct: 326 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 381

Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
           +  ++++  +   ++     + F        V + ++  P                    
Sbjct: 382 VMLVQQEDELDNANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 441

Query: 330 ---LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
                 +FPIENR     +Q   R      + S    ++K +S+FHLL FL     L+ D
Sbjct: 442 TVFTDPSFPIENREVIGESQDLRRVASKLTDSSDPNAVIKAVSNFHLLTFLHGMGILSED 501

Query: 383 VPALAQCVQAQTAVPEGYKLL 403
             AL   V  +    +G +LL
Sbjct: 502 EEALLCTVATKHDPADGIQLL 522


>gi|195152455|ref|XP_002017152.1| GL21676 [Drosophila persimilis]
 gi|194112209|gb|EDW34252.1| GL21676 [Drosophila persimilis]
          Length = 443

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 60/226 (26%), Positives = 101/226 (44%), Gaps = 47/226 (20%)

Query: 85  SYDGERNVRGPSFNPAGSFGRKMT--MDDLIAKQMRVTRQENPHCESVSFDRDCADAFQQ 142
           +Y  E N++  SF+   S+ RK T  MD       R       H ++V F+       ++
Sbjct: 192 AYLKENNIKHLSFH---SYIRKQTSGMDHGKYFTYR-------HVDNVMFEN--TKIVER 239

Query: 143 YVNETLAFAVKRGGFMYGT------VLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKL 196
           ++N       +R G++YGT      V    R  V  IYEPPQ+ T + + +  DE + + 
Sbjct: 240 FLNYWRTTGHQRMGYLYGTYEPHTDVPLGIRATVAAIYEPPQESTRDSINMFPDEGDAE- 298

Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTA------- 249
           VDA+A  LG+KK+G+IFT  I     D T    + ++ +E H     +E +TA       
Sbjct: 299 VDAVANALGLKKIGWIFTDLITD---DATAGTVKQIRGIETHF-LTAQECITAGELQNRH 354

Query: 250 ---------------VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE 280
                           V + V+ + +  +H E + +S  C+ L ++
Sbjct: 355 PNPCKYASNGVFGSKFVTICVTGDQTKQVHMEGYAVSAQCMALVRD 400


>gi|119497211|ref|XP_001265368.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Neosartorya fischeri NRRL 181]
 gi|190359914|sp|A1D4X8.1|NPL4_NEOFI RecName: Full=Nuclear protein localization protein 4
 gi|119413530|gb|EAW23471.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Neosartorya fischeri NRRL 181]
          Length = 652

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 96/323 (29%), Positives = 132/323 (40%), Gaps = 72/323 (22%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +E+  I L +  
Sbjct: 273 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEIDGITLHEWP 330

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   G++KVG IFT  +   + D           Y LS+ E+  A    A+ 
Sbjct: 331 NEKEVDEVARQCGLEKVGVIFTDLLDAGRGDGSVVCKRHIDSYYLSSLEIAFASRMQAKH 390

Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG  AI   ++Q S   V + +    E      
Sbjct: 391 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AITVSSYQASISAVEMVRADIVEPSAEPS 446

Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-- 330
                +E DD        P++   K+ +  V   ++ K      +L+V   L H  P   
Sbjct: 447 VMLVQSEDDDTDNKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEYLLVT--LTHGFPTEA 504

Query: 331 -----SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
                +STFPIENR     +Q      K  ++       ++ +SDFHLL FL      + 
Sbjct: 505 SPMFSASTFPIENREVIGESQELRHVAKKLVSHGDPDKAIREVSDFHLLCFLHSLSMFSK 564

Query: 382 DVPALAQCVQAQTAVP-EGYKLL 403
           +  AL  C  A T  P EG KLL
Sbjct: 565 EEEALL-CRVATTHDPTEGLKLL 586


>gi|396471846|ref|XP_003838966.1| hypothetical protein LEMA_P026390.1 [Leptosphaeria maculans JN3]
 gi|312215535|emb|CBX95487.1| hypothetical protein LEMA_P026390.1 [Leptosphaeria maculans JN3]
          Length = 759

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 83/322 (25%), Positives = 133/322 (41%), Gaps = 62/322 (19%)

Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDE 191
           Q V++ + F      +R GF+YG   E   V       V  IYEPPQ   ++ +  L D 
Sbjct: 357 QVVDDLINFWRNTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQVDEQDGIS-LTDW 415

Query: 192 EEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE 240
             EK +D IAA  G+++VG IFT  +   + D           Y LS+ E++ A  + A+
Sbjct: 416 NNEKEIDDIAAQCGLQRVGVIFTDLLDAGQGDGSVVCKRHFDSYYLSSLEIVFAARYQAK 475

Query: 241 -------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
                   +  ++ +  V   +S +    I   ++Q+S+  V + +    E         
Sbjct: 476 YPRPSKWSDTGKFGSNFVTCVISGDDKGQIGISSYQVSNDAVEMVRADIIEPSAEPSVML 535

Query: 288 --AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP------- 329
             +E D+  L++ +       + +   G +V+E+    F V  ++  L H  P       
Sbjct: 536 VQSEEDNEALNRARYIPEVFYRRINEHGANVQEIAKPDFPVEYLLLTLTHGFPTQPNPLF 595

Query: 330 LSSTFPIENRTTQVTM---RALKSHLN-RSPSLPL-----VKRISDFHLLLFLARFLDLN 380
               FPIENR     M     L   LN ++  L L     V  IS+FHLL F+     L+
Sbjct: 596 TGGKFPIENREIMGEMADVSTLGKTLNAKANGLALNTSSGVNAISNFHLLCFIHNLGILS 655

Query: 381 SDVPALAQCVQAQTAVPEGYKL 402
            D  +L   V +     EG  L
Sbjct: 656 KDEESLLFKVASTHDTSEGSAL 677


>gi|255951286|ref|XP_002566410.1| Pc22g25230 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211593427|emb|CAP99811.1| Pc22g25230 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 665

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 80/292 (27%), Positives = 117/292 (40%), Gaps = 69/292 (23%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L D   
Sbjct: 269 INSLLNFWRKSGAQRLGFLYGTYEEYAEVPLGVKAVVEAIYEPPQTGEIDGV-TLHDWHN 327

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-- 240
           EK VD +A   G++KVG IFT  I   + D           Y LS+ E+  A    A+  
Sbjct: 328 EKEVDEVARLCGLQKVGVIFTDLIDAGRGDGSVVCKRHIDSYYLSSLEIAFAARLQAQNP 387

Query: 241 -----CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI-------- 287
                    ++ +  V   +S + + AI   ++Q S   V + +    E           
Sbjct: 388 KSTKWSRTGQFGSDFVTCVLSGDEAGAISVSSYQASVSAVEMVRGDIVEPSADPTVMLVQ 447

Query: 288 AEGDDPKLSKMK-------KDVVVGGKDVKEVDN-----DFFLVVVKILDHQGPLSST-- 333
           +E D+   S+ +       + +   G  V+E        DF LV    L H  P  S+  
Sbjct: 448 SEHDEDLGSRTRYIPEVFYRRINEYGASVQESAEPSFPVDFLLVT---LTHGFPTESSPL 504

Query: 334 -----FPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
                FPIENR         +   + L SH +   +   ++ +SDFH+L FL
Sbjct: 505 FTDSRFPIENREVIGESQELRHVAQKLMSHGDPDKA---IQGVSDFHILCFL 553


>gi|238483481|ref|XP_002372979.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Aspergillus flavus NRRL3357]
 gi|220701029|gb|EED57367.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Aspergillus flavus NRRL3357]
 gi|391871001|gb|EIT80170.1| nuclear pore complex, rNpl4 component [Aspergillus oryzae 3.042]
          Length = 603

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 89/322 (27%), Positives = 128/322 (39%), Gaps = 70/322 (21%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +E+  + L +  
Sbjct: 273 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGVKAVVQAIYEPPQ--VDEIDGVTLHEWH 330

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
            EK VD +A   G++KVG IFT  +   + D           Y LS+ E++ A    A+ 
Sbjct: 331 NEKEVDEVARLCGLEKVGVIFTDLLDAGQGDGSVICKRHIDSYFLSSLEIIFAARLQAQY 390

Query: 242 -NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG  AI   A+Q S   V + +    E      
Sbjct: 391 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 446

Query: 288 -----AEGDDPK----------LSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSS 332
                +E DD +            K+ +  V   ++ K      +L+V   L H  P  S
Sbjct: 447 VMLVQSEEDDSENKSRYIPEVFYRKINEYGVSAQQNAKPAFPVEYLLVT--LTHGFPTES 504

Query: 333 T-------FPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
           +       FPIENR     +Q      K  ++       ++ +SDFHLL FL      + 
Sbjct: 505 SPLFVEGNFPIENREVIGESQELRHVAKKLVSHGDPDKAIRAVSDFHLLCFLHSLSTFSK 564

Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
           D  AL   V  +    +G +L+
Sbjct: 565 DEEALLGRVATKHDPADGVQLI 586


>gi|317139892|ref|XP_001817829.2| nuclear protein localization protein 4 [Aspergillus oryzae RIB40]
          Length = 603

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 89/322 (27%), Positives = 127/322 (39%), Gaps = 70/322 (21%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +E+  + L +  
Sbjct: 273 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGVKAVVQAIYEPPQ--VDEIDGVTLHEWH 330

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
            EK VD +A   G++KVG IFT  +   + D           Y LS+ E+  A    A+ 
Sbjct: 331 NEKEVDEVARLCGLEKVGVIFTDLLDAGQGDGSVICKRHIDSYFLSSLEITFAARLQAQY 390

Query: 242 -NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG  AI   A+Q S   V + +    E      
Sbjct: 391 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 446

Query: 288 -----AEGDDPK----------LSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSS 332
                +E DD +            K+ +  V   ++ K      +L+V   L H  P  S
Sbjct: 447 VMLVQSEEDDSENKSRYIPEVFYRKINEYGVSAQQNAKPAFPVEYLLVT--LTHGFPTES 504

Query: 333 T-------FPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
           +       FPIENR     +Q      K  ++       ++ +SDFHLL FL      + 
Sbjct: 505 SPLFVEGNFPIENREVIGESQELRHVAKKLVSHGDPDKAIRAVSDFHLLCFLHSLSTFSK 564

Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
           D  AL   V  +    +G +L+
Sbjct: 565 DEEALLGRVATKHDPADGVQLI 586


>gi|123448649|ref|XP_001313052.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121894921|gb|EAY00123.1| hypothetical protein TVAG_471060 [Trichomonas vaginalis G3]
          Length = 403

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 96/423 (22%), Positives = 183/423 (43%), Gaps = 48/423 (11%)

Query: 2   LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQN-LLLAKSPSDLLQ 60
           ++RIR + G E+V V+  H T+  L   + ++  I  E  TL  ++   +L K       
Sbjct: 6   VVRIRWKKGTEKVPVN-PHDTIGTLIQYMVNKFEIKEEISTLIQDKTKTVLQKRIK---- 60

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFN-----PAGSFGRKMTMDDLIAK 115
                     LSSL++ +G +  LS   + N    S       P     R +   D    
Sbjct: 61  ----------LSSLDMENGELFELSLPSDANPNKDSKTSKPDTPIDDHQRNVYTLDYFKD 110

Query: 116 Q---MRVTRQENPHCESVSFDRDCADAFQQY--VNETLAFAVKRGGFMYGTVLEDKRVEV 170
               ++  + EN   + V F    +   Q+   ++  L F   +   +YGT      + +
Sbjct: 111 HNPTIQFQKYENSILKRVLFP---SAELQRIGRISNALHFNSTQIFLLYGTTPNPAAIRI 167

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
           + +  P Q+ T E L I  +E   K    +A   G+K +GF+    I+  K +  ++   
Sbjct: 168 HALSLPSQECTGEELLI--NESHIKDSHHVALACGLKFLGFL----IISPKTNVPIAPHL 221

Query: 231 VLQAVEFHAECNMEEWV--TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIA 288
           +++ +   ++ ++E ++   ++  L+ S   S + + EA+ +S   + +++ G F     
Sbjct: 222 LIKLMPLISD-DLESFIFIRSLPTLK-SMVDSVSCNLEAYSVSKQFIDMYRRGLFT---G 276

Query: 289 EGDDPKLSKMKKDVVVGGKDVKEVDNDFFL--VVVKILDHQGPLSSTFPIENRTTQVTMR 346
           E  D  L   K  V+V  K +KEVD  +F+  +  K+ +   P S++FP +  +   T+ 
Sbjct: 277 ESTDTSLI-TKSTVLVSAKSLKEVDVSYFVTPLATKLRESWFP-SNSFPYQ--SFYPTIA 332

Query: 347 ALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINS 406
              +   +   +P   R  DF+LLLF     D   +VP +A+   A+  +P+   + I  
Sbjct: 333 DFVAICYKFIDVPNYIRFLDFNLLLFCCTIFDREEEVPLIARSCIARDDIPDKVLVKIEE 392

Query: 407 MAE 409
           M +
Sbjct: 393 MID 395


>gi|209879848|ref|XP_002141364.1| NPL4 family protein [Cryptosporidium muris RN66]
 gi|209556970|gb|EEA07015.1| NPL4 family protein [Cryptosporidium muris RN66]
          Length = 489

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 73/290 (25%), Positives = 126/290 (43%), Gaps = 43/290 (14%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYI-LRDEE 192
           F +Y    L    +R G+MYG   ED       R  +  IYEPPQ    +   + L  ++
Sbjct: 198 FVKYWTCNLQLQKQRIGWMYGYYKEDNTYPMGIRAVMEAIYEPPQTNNHKFGNLELEFDD 257

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQ---------AVEFHAECNM 243
            +  VD IA  LG++ +G IFT      ++D  L++RE++              +    +
Sbjct: 258 FKSTVDTIAHSLGLECIGLIFT----HKERDELLTSREIISLGKLQLDYLKYNHYTGYPV 313

Query: 244 EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDP-KLSKMKK-- 300
             ++   +      E SA +   AF +SD+ +   ++   + +  + D   KL   +K  
Sbjct: 314 SNFIACTIAPCKKSEDSAPVP-NAFAVSDLGLAFIRDHLLDEDNLDDDLHMKLRAEQKGE 372

Query: 301 ---DVVVGGKDVKEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKS 350
               V+ GG++    D  +F+V V   +   P+        + FP ENR +  T   + +
Sbjct: 373 LLPQVLEGGQETSRFDAHWFVVRV---NESAPIRPRSLFNCNQFPRENRISTQTPADVAT 429

Query: 351 HL-NR--SPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVP 397
            + NR  + S    + ++DFHLLL LA+  D ++   AL+ C       P
Sbjct: 430 FIKNRLVNISSTSYQLLNDFHLLLHLAKLFDEST---ALSICNSIANKTP 476


>gi|189203729|ref|XP_001938200.1| endoplasmic reticulum and nuclear membrane protein Npl4
           [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985299|gb|EDU50787.1| endoplasmic reticulum and nuclear membrane protein Npl4
           [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 655

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 83/323 (25%), Positives = 131/323 (40%), Gaps = 64/323 (19%)

Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDE 191
           Q V++ + F      +R GF+YG   E   V       V  IYEPPQ    + +  L + 
Sbjct: 259 QVVDDLINFWRQTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQVNEADGIS-LGEW 317

Query: 192 EEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE 240
           + EK +D IAA  G+++VG IFT  +  N  D           Y L++ EV  A  + A+
Sbjct: 318 DNEKEIDEIAAQCGLQRVGVIFTDLLDTNNGDGSVICKRHMDSYYLASLEVCFAARYQAK 377

Query: 241 C-NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
                +W       +  V   +S +    I   ++Q+S+  V + +    E         
Sbjct: 378 YPRPSKWSDTGRHGSNFVTCVISGDDKGQIGISSYQVSNDAVEMVRADIIEPSAEPSVML 437

Query: 288 --AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP------- 329
             +E DD  L + +       + +   G +V+EV    F V  ++  L H  P       
Sbjct: 438 VQSEEDDEALGRARYIPEVFYRRINEHGANVQEVAKPAFPVEYLLLTLTHGFPTQPNPLF 497

Query: 330 LSSTFPIENRTTQVTMRALKSHLN-----RSPSLPL-----VKRISDFHLLLFLARFLDL 379
               FPIENR     +  + SHL+     ++  L L     +  IS+FH+L F+     L
Sbjct: 498 TGGKFPIENREAMGEVPEV-SHLSKVLNAKANGLALNTTSDLNAISNFHMLCFIHNLGVL 556

Query: 380 NSDVPALAQCVQAQTAVPEGYKL 402
           + D   L   V +     EG  L
Sbjct: 557 SKDEETLLFKVASSHDTSEGSAL 579


>gi|121805628|sp|Q2URI8.1|NPL4_ASPOR RecName: Full=Nuclear protein localization protein 4
 gi|83765684|dbj|BAE55827.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 652

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 89/322 (27%), Positives = 127/322 (39%), Gaps = 70/322 (21%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +E+  + L +  
Sbjct: 273 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGVKAVVQAIYEPPQ--VDEIDGVTLHEWH 330

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
            EK VD +A   G++KVG IFT  +   + D           Y LS+ E+  A    A+ 
Sbjct: 331 NEKEVDEVARLCGLEKVGVIFTDLLDAGQGDGSVICKRHIDSYFLSSLEITFAARLQAQY 390

Query: 242 -NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG  AI   A+Q S   V + +    E      
Sbjct: 391 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 446

Query: 288 -----AEGDDPK----------LSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSS 332
                +E DD +            K+ +  V   ++ K      +L+V   L H  P  S
Sbjct: 447 VMLVQSEEDDSENKSRYIPEVFYRKINEYGVSAQQNAKPAFPVEYLLVT--LTHGFPTES 504

Query: 333 T-------FPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
           +       FPIENR     +Q      K  ++       ++ +SDFHLL FL      + 
Sbjct: 505 SPLFVEGNFPIENREVIGESQELRHVAKKLVSHGDPDKAIRAVSDFHLLCFLHSLSTFSK 564

Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
           D  AL   V  +    +G +L+
Sbjct: 565 DEEALLGRVATKHDPADGVQLI 586


>gi|70990368|ref|XP_750033.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Aspergillus fumigatus Af293]
 gi|74669989|sp|Q4WKD7.1|NPL4_ASPFU RecName: Full=Nuclear protein localization protein 4
 gi|66847665|gb|EAL87995.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Aspergillus fumigatus Af293]
 gi|159130511|gb|EDP55624.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Aspergillus fumigatus A1163]
          Length = 652

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 95/323 (29%), Positives = 132/323 (40%), Gaps = 72/323 (22%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +E+  I L +  
Sbjct: 273 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEIDGITLHEWP 330

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
            EK VD +A   G++KVG IFT  +   + D           Y LS+ E+  A    A+ 
Sbjct: 331 NEKEVDEVARQCGLEKVGVIFTDLLDAGRGDGSVVCKRHIDSYYLSSLEIAFASRMQAKH 390

Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG  AI   ++Q S   V + +    E      
Sbjct: 391 PKPTKWSRTGRFGSNFVTCV--LSGDEEG--AITVSSYQASISAVEMVRADIIEPSAEPS 446

Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-- 330
                +E +D        P++   K+ +  V   ++ K      +L+V   L H  P   
Sbjct: 447 VMLVQSEDEDTDNKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEYLLVT--LTHGFPTEA 504

Query: 331 -----SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
                +STFPIENR     +Q      K  ++       ++ +SDFHLL FL      + 
Sbjct: 505 SPIFPASTFPIENREVIGESQELRHVAKKLVSHGDPDKAIREVSDFHLLCFLHSLSMFSK 564

Query: 382 DVPALAQCVQAQTAVP-EGYKLL 403
           +  AL  C  A T  P EG KLL
Sbjct: 565 EEEALL-CRVATTHDPTEGLKLL 586


>gi|403222165|dbj|BAM40297.1| uncharacterized protein TOT_020000556 [Theileria orientalis strain
           Shintoku]
          Length = 540

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 74/343 (21%), Positives = 139/343 (40%), Gaps = 88/343 (25%)

Query: 139 AFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEV-------- 184
           +F  Y    L  + +R G+MYG   ED+      R     IYEPPQ     V        
Sbjct: 206 SFANYWLADLEMSFQRIGWMYGYYTEDQHYPYGIRAVCEAIYEPPQMSQNVVKVASPRNK 265

Query: 185 LYILRDEE----------EEKL---VDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREV 231
           +Y   D +          E+ L   VD IA  LG++++G IFT+       +  L+ +E+
Sbjct: 266 VYKHNDSDNVQVNYVSLMEDNLLPNVDKIANRLGLERIGLIFTRL----PNECFLTAKEL 321

Query: 232 LQAVEFH----AECNMEEW-VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE 286
           ++ +E         +   + V+ +V   +S + +     +AF  SDM + LF++   +  
Sbjct: 322 MRTIEMQYNSLKHIHYTGYPVSTMVTCTMSPDDNGQPVLDAFMASDMALALFRDELLDNN 381

Query: 287 IAEGDDPKLSKMKKD------VVVGGKDVKEVDNDFFLVVVKILDHQGPL---------- 330
               +  K+ +++        ++  GK+V   D  + +V +   +   P+          
Sbjct: 382 QDHDNIIKMREVEHSQEIMPPIIESGKEVTSFDTSWLMVRI---NESAPIKPNSFFGNNG 438

Query: 331 -SSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRIS------------------------ 365
            +  FP+ENRT   T    K+ ++ S       R++                        
Sbjct: 439 RNHKFPVENRTYSSTA---KTRMHESLVSDCSARVTNSNERFRKWDKKILTDYFNTHDPE 495

Query: 366 ----DFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLI 404
               DFHLL+++ + LD+++ + A+  C+  +  V +  K L+
Sbjct: 496 TLWNDFHLLIYVCQVLDVDTAL-AICDCIVRKEQVDDLVKDLL 537


>gi|340992720|gb|EGS23275.1| hypothetical protein CTHT_0009420 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 659

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 82/318 (25%), Positives = 124/318 (38%), Gaps = 64/318 (20%)

Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
            V+R G +YG  LE   V       V  IYEPPQ   +E+  I L   E E+ V+ +A  
Sbjct: 279 GVQRLGILYGRYLEYDAVPLGIKAVVEAIYEPPQ--VDEIDGITLNPWENEQEVNQVAKY 336

Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNMEEWV---- 247
            G+++VG I+T  +   K D           Y L+ +E++ A    A+     +W     
Sbjct: 337 CGLEQVGVIWTDLLDAGKGDGSVVCKRHADSYFLAAQEIVFAARLQAQHPKPSKWSDTGR 396

Query: 248 --TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLS 296
             +  V   VS      I   A+QMS+  V + +    E             E DD   S
Sbjct: 397 FGSNFVTCVVSGNEQGEISISAYQMSNDAVEMVRADIIEPSADPTLMLVREEEEDDGSPS 456

Query: 297 KMK--------------KDVVVGGKDVKEVDNDFFLVVVKILDHQGPL--SSTFPIENRT 340
           K +               +V+   K    V+  F  +     D   PL   + FPIENR 
Sbjct: 457 KTRYIPEVFYRKINEYGANVLENAKPAFPVEYLFVTLTHGFPDSPSPLFTDNIFPIENRE 516

Query: 341 -------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQ 393
                       +ALK H   +P       +SDFHLL F+ +   L+ +  AL   V   
Sbjct: 517 YVGEAQEVSAVAKALKVHEADAP-----MNVSDFHLLCFIHQMSVLSKEEEALLCRVATL 571

Query: 394 TAVPEGYKLLINSMAETV 411
             + E ++L   +  +T+
Sbjct: 572 HDLAESFQLRSTTGWQTL 589


>gi|339252344|ref|XP_003371395.1| eukaryotic initiation factor 4A-III [Trichinella spiralis]
 gi|316968374|gb|EFV52655.1| eukaryotic initiation factor 4A-III [Trichinella spiralis]
          Length = 970

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 74/282 (26%), Positives = 130/282 (46%), Gaps = 48/282 (17%)

Query: 153 KRGGFMYGT------VLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R GF++G       V    R  V  IYEPPQ    + + IL D  +E +VD++A  LG+
Sbjct: 562 QRMGFLFGRYEPMTDVPLGIRAVVTVIYEPPQITYTDGIDILEDPHDE-IVDSLAECLGL 620

Query: 207 KKVGFIFTQ--------TIMQNKKD---YTLSNREVLQAVEFHAE----CNM-------E 244
           K++G+IFT         T++ N ++   Y LS  E L A ++  +    C          
Sbjct: 621 KRIGWIFTDFLRDPDDPTLLVNTRNANTYMLSAAECLIAAKYQNKYPNACKYSLDGTFGS 680

Query: 245 EWVTAVVK-LEVSEEGSAAIHFEAFQMSD-----MCVRLFKEGWFETEIAE-GDDPKLSK 297
           ++VT V+   +VS +GSA +  E    S       CVR      +  ++     D   +K
Sbjct: 681 KFVTLVLSAYQVSNQGSALVAAECLLPSRSSPNMACVRRSTPTHYVPDVYYMCHDEYNNK 740

Query: 298 MKKDVVVGGKDVKEVDNDFFLVVVKIL-DHQGPLSST---FPIENRT---TQVTMRALKS 350
           ++K        +  V+  F  + V +  + +  L+S    FPIENR+   +   +R +  
Sbjct: 741 IRKTA-----GLLPVEYLFVTIPVGLAKEMRYTLTSQKKPFPIENRSSIGSAQNIRVVGE 795

Query: 351 HLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQA 392
           ++ +      +K IS+FH L++L    ++  ++  LA+ + A
Sbjct: 796 YVKQFEESEFLKMISNFHFLVYLLTNTEVTFELDDLAELLDA 837


>gi|345312165|ref|XP_001519304.2| PREDICTED: nuclear protein localization protein 4 homolog, partial
           [Ornithorhynchus anatinus]
          Length = 502

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 93/208 (44%), Gaps = 53/208 (25%)

Query: 177 PQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ-----------NKKDYT 225
           PQ GT+  L +L D + E +VD +AA LG++KVG+IFT  + +           NK  Y 
Sbjct: 296 PQIGTQNSLELLEDPKAE-VVDEMAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRNKDTYF 354

Query: 226 LSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
           LS  E + A  F  +    C +        ++VTAV       +    +HFE +Q+S+ C
Sbjct: 355 LSAEECITAGNFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQVSNQC 410

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN----DFF 317
           + L ++   E  +   D P+L   K             KD+   G ++ ++      ++ 
Sbjct: 411 MALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPVEYL 467

Query: 318 LVVVKILDHQGPL------SSTFPIENR 339
           ++ +     + P+       S FPIENR
Sbjct: 468 IIDITTTFPKDPVYTFSISQSPFPIENR 495


>gi|347829803|emb|CCD45500.1| similar to nuclear protein localization protein 4 [Botryotinia
           fuckeliana]
          Length = 640

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 83/316 (26%), Positives = 129/316 (40%), Gaps = 65/316 (20%)

Query: 149 AFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIA 201
           A   +R G++YG   E   V       V  IYEPPQ   +E+  + L + E EK +D IA
Sbjct: 284 ATGAQRFGYLYGRYEEYTEVPLGVKAVVEAIYEPPQ--VDELDGVTLNEWESEKDIDEIA 341

Query: 202 AGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNM 243
              GM++VG I+T  I     D           Y LS+ E+  A    A+        + 
Sbjct: 342 RLCGMERVGVIWTDLIDSGVGDGTVLCKRHIDSYYLSSLEIAFAARLQAKHPKPTKWSDT 401

Query: 244 EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--------AEGDDPKL 295
            ++ +  V   V+ + +  I   A+Q+S+  V + +    E           +EGDD   
Sbjct: 402 GKFGSNFVTCVVTADENGGIAISAYQVSNSAVEMVRADIVEPSADPAVMIVRSEGDDDSD 461

Query: 296 SKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP--------LSSTFPIEN 338
           +  +       + +   G+ V+E     F V  ++  L H  P          S+F IEN
Sbjct: 462 TSSRYIPEVFFRKINEYGRSVQENAKPSFPVEYLLVTLTHGFPSDPKPAFMAKSSFTIEN 521

Query: 339 RTTQVTMRALK-----------SHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALA 387
           R      + LK             L +SP   L   +SDFHLL F+     L+ +  AL 
Sbjct: 522 RLVIGQEQDLKDVGKQLGLDKSGQLTQSPDGVLA--VSDFHLLSFIYSMGILSKEEMALL 579

Query: 388 QCVQAQTAVPEGYKLL 403
             V  Q  + +GY+L+
Sbjct: 580 CRVATQHDLSDGYQLI 595


>gi|425766761|gb|EKV05359.1| Endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Penicillium digitatum Pd1]
 gi|425775414|gb|EKV13686.1| Endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Penicillium digitatum PHI26]
          Length = 675

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 79/292 (27%), Positives = 118/292 (40%), Gaps = 69/292 (23%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ G  + +  L + + 
Sbjct: 269 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGVKAVVEAIYEPPQTGEVDGV-TLHEWQN 327

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-- 240
           EK VD +A   G++KVG IFT  I   + D           Y LS+ E+  A    A+  
Sbjct: 328 EKEVDDVARLCGLQKVGVIFTDLIDGGRGDGSVVCKRHIDSYFLSSLEIAFAARLQAQNP 387

Query: 241 -----CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI-------- 287
                    ++ +  V   +S + + AI   ++Q S   V + +    E           
Sbjct: 388 KSTKWSRTGQFGSDFVTCVLSGDETGAIAISSYQASVSAVEMVRGDIVEPSADPTVMLVQ 447

Query: 288 AEGDDPKLSKMK-------KDVVVGGKDVKEVDN-----DFFLVVVKILDHQGPLSST-- 333
           +E D+   S+ +       + +   G  V+E        DF LV    L H  P  S+  
Sbjct: 448 SEHDEDLGSRTRYIPEVFYRRINEYGASVQESAEPSFPVDFLLVT---LTHGFPTESSPL 504

Query: 334 -----FPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
                FPIENR         +   + L SH +   +   ++ +SDFH+L FL
Sbjct: 505 FTNSRFPIENREVIGESQELRHVAQKLMSHGDPDKA---IQGVSDFHILCFL 553


>gi|169602695|ref|XP_001794769.1| hypothetical protein SNOG_04350 [Phaeosphaeria nodorum SN15]
 gi|160706232|gb|EAT88110.2| hypothetical protein SNOG_04350 [Phaeosphaeria nodorum SN15]
          Length = 1017

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 85/324 (26%), Positives = 132/324 (40%), Gaps = 66/324 (20%)

Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDE 191
           Q V++ + F      +R GF+YG   E   V       V  IYEPPQ    + +  L D 
Sbjct: 259 QVVDDLINFWRNTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQVNELDGIS-LGDW 317

Query: 192 EEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE 240
           + EK +D IAA  G+++VG IFT  +  +K D           Y LS+ E+  A  + A+
Sbjct: 318 DNEKEIDEIAAQCGLQRVGVIFTDLLDADKGDGSVICKRHIDSYYLSSLEIAFAARYQAK 377

Query: 241 -------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
                      ++ +  V   +S +    I   ++Q S+  V + +    E         
Sbjct: 378 YPRPTKWSETGKFGSNFVTCVISGDDQGQIGISSYQASNDAVEMVRADIIEPSAEPSVML 437

Query: 288 --AEGDDPKLSKMK-------KDVVVGGKDVKE-VDNDF---FLVVVKILDHQGP----- 329
             +E D+  L++ +       + +   G +V+E    DF   +L V   L H  P     
Sbjct: 438 VQSEDDNEALNRARYIPEVFYRRINEHGANVQENAKPDFPVEYLFVT--LTHGFPTQPNP 495

Query: 330 --LSSTFPIENRTTQVTM---RALKSHLN-RSPSLPL-----VKRISDFHLLLFLARFLD 378
                 FPIENR     M    AL   LN ++  L L     +  IS+FH+L F+     
Sbjct: 496 LFTGGKFPIENREIMGEMPDVSALGKSLNAKANGLALNTTSGLNAISNFHMLCFIHNLGI 555

Query: 379 LNSDVPALAQCVQAQTAVPEGYKL 402
           L+ D  +L   V +     EG  L
Sbjct: 556 LSKDEESLLFKVASTHDTSEGSAL 579


>gi|448113826|ref|XP_004202429.1| Piso0_001263 [Millerozyma farinosa CBS 7064]
 gi|359383297|emb|CCE79213.1| Piso0_001263 [Millerozyma farinosa CBS 7064]
          Length = 566

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/121 (36%), Positives = 59/121 (48%), Gaps = 20/121 (16%)

Query: 137 ADA--FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
           ADA    ++++   A  V+R G MYGT    ++V       V  IYEPPQ G  + L +L
Sbjct: 241 ADATILNKFLDSWRASGVQRFGIMYGTYEPFEKVPLGTKAVVQAIYEPPQAGEVDGLTLL 300

Query: 189 RDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEF 237
             E E + VD++AA LG+ KVG +FT       +D           Y LS  EVL A  F
Sbjct: 301 EWENESQ-VDSVAAALGLYKVGMVFTDLTDSGARDGSVICKRHTDTYFLSCLEVLMASRF 359

Query: 238 H 238
            
Sbjct: 360 Q 360


>gi|71030974|ref|XP_765129.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68352085|gb|EAN32846.1| hypothetical protein, conserved [Theileria parva]
          Length = 531

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 50/188 (26%), Positives = 84/188 (44%), Gaps = 17/188 (9%)

Query: 113 IAKQMRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK----- 166
           + K + +  Q+  H + +   + D    F  +    L  +++R G+MYG   ED      
Sbjct: 167 LPKSITIRHQKYRHVDHLEMMNVDEVRGFANFWLSELQMSLQRIGWMYGYYTEDHHYPYG 226

Query: 167 -RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT 225
            R     IYEPPQ  T   + +L DE     VD IA  LG+++VG IFT+       +  
Sbjct: 227 IRAVCEAIYEPPQFSTINDVVMLEDEMLCS-VDKIAEKLGLERVGMIFTRL----PNECF 281

Query: 226 LSNREVLQAVEFHAECNMEEW-----VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE 280
           L+ RE+++ +E               V+ +V   +S + S      AF  SD+ + L+++
Sbjct: 282 LTARELMRVIEMQYNSLRHTHYTGYPVSTMVTCTMSPDDSGQPVLNAFMASDLALALYRD 341

Query: 281 GWFETEIA 288
             F +  A
Sbjct: 342 ELFTSSQA 349


>gi|72386595|ref|XP_843722.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62175416|gb|AAX69558.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70800254|gb|AAZ10163.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 408

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 67/284 (23%), Positives = 121/284 (42%), Gaps = 47/284 (16%)

Query: 130 VSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIYEPPQQGTEEVLYIL 188
           V  D    + FQ +V ++  +AV+R G +YG     +  +EV+ +YEP Q+G  +    L
Sbjct: 94  VRVDVTSLNLFQSFVRQS-GWAVQRCGILYGKYDSAESTIEVHAVYEPEQEGGLQKFICL 152

Query: 189 RDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL----QAVEFHAECNM- 243
           RD      VD +A  LG+++VG + T       K   LS  E+L    +   F  EC + 
Sbjct: 153 RD-SRVGTVDRLAERLGLRRVGMVCTHQARDPDK-MVLSGYELLLCAREQSRFGDECVLL 210

Query: 244 ---EEWVTAVVKLEVSEEGSAAIHF----------EAFQMSDMCVRLFKEGWFETEIA-- 288
                  T  ++ +  +    A+HF          +A   +   V    +G+   EI   
Sbjct: 211 TMSPSITTGRIECQAWQASPQAVHFYRLGTLTEKLDADPRNSTVVTGTADGFCGDEIPSH 270

Query: 289 --------EGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRT 340
                   E    +     +  V+     KE+D  +F   V +     P  S   ++N  
Sbjct: 271 FVYSSISLEVAQEQTDDKGRPQVITKAPSKEIDTRWFTSYVAV----DPFDSHV-VKNLF 325

Query: 341 TQV--------TMRALKSHLN--RSPSLPLVKRISDFHLLLFLA 374
            ++        T+  L+++L   +   +P++++++DFH+L+FLA
Sbjct: 326 VRISRPGMDPPTITNLRNYLKDPKRADVPILQKLADFHVLIFLA 369


>gi|121703177|ref|XP_001269853.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Aspergillus clavatus NRRL 1]
 gi|190359910|sp|A1CS06.1|NPL4_ASPCL RecName: Full=Nuclear protein localization protein 4
 gi|119397996|gb|EAW08427.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
           [Aspergillus clavatus NRRL 1]
          Length = 674

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 87/295 (29%), Positives = 120/295 (40%), Gaps = 76/295 (25%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  + L +  
Sbjct: 273 INSLLDFWRKSGSQRLGFLYGTYEEYTEVPLGVKAVVQAIYEPPQ--VDEVDGVTLHEWP 330

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
            EK VD +A   G++KVG IFT  +   + D           Y LS+ E+  A    A+ 
Sbjct: 331 NEKEVDEVARLCGLEKVGVIFTDLLDAGRGDGSVVCKRHIDSYYLSSLEIAFASRLQAQY 390

Query: 242 -NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG  AI   ++Q S   V + +    E      
Sbjct: 391 PKTTKWSRTGRFGSNFVTCV--LSGDEEG--AITISSYQASVSAVEMVRADIVEPSAEPS 446

Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
                +E DD        P++   K+ +  V   ++ K      FL+V   L H  P   
Sbjct: 447 VMLVQSEDDDSDNKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEFLLVT--LTHGFPTES 504

Query: 330 ----LSSTFPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
                 STFPIENR         +   + L SH + +  +P    +SDFHLL +L
Sbjct: 505 NPLFTKSTFPIENREVIGESQDLRSVAKKLVSHRDSNEVIP---EVSDFHLLCYL 556


>gi|190359915|sp|P0C7N6.1|NPL4_PHANO RecName: Full=Nuclear protein localization protein 4
          Length = 653

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 87/325 (26%), Positives = 133/325 (40%), Gaps = 68/325 (20%)

Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRD 190
           Q V++ + F      +R GF+YG   E   V       V  IYEPPQ    E+  I L D
Sbjct: 259 QVVDDLINFWRNTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQ--VNELDGISLGD 316

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHA 239
            + EK +D IAA  G+++VG IFT  +  +K D           Y LS+ E+  A  + A
Sbjct: 317 WDNEKEIDEIAAQCGLQRVGVIFTDLLDADKGDGSVICKRHIDSYYLSSLEIAFAARYQA 376

Query: 240 E-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI----- 287
           +           ++ +  V   +S +    I   ++Q S+  V + +    E        
Sbjct: 377 KYPRPTKWSETGKFGSNFVTCVISGDDQGQIGISSYQASNDAVEMVRADIIEPSAEPSVM 436

Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKE-VDNDF---FLVVVKILDHQGP---- 329
              +E D+  L++ +       + +   G +V+E    DF   +L V   L H  P    
Sbjct: 437 LVQSEDDNEALNRARYIPEVFYRRINEHGANVQENAKPDFPVEYLFVT--LTHGFPTQPN 494

Query: 330 ---LSSTFPIENRTTQVTM---RALKSHLN-RSPSLPL-----VKRISDFHLLLFLARFL 377
                  FPIENR     M    AL   LN ++  L L     +  IS+FH+L F+    
Sbjct: 495 PLFTGGKFPIENREIMGEMPDVSALGKSLNAKANGLALNTTSGLNAISNFHMLCFIHNLG 554

Query: 378 DLNSDVPALAQCVQAQTAVPEGYKL 402
            L+ D  +L   V +     EG  L
Sbjct: 555 ILSKDEESLLFKVASTHDTSEGSAL 579


>gi|261326791|emb|CBH09764.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 500

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 66/284 (23%), Positives = 122/284 (42%), Gaps = 47/284 (16%)

Query: 130 VSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIYEPPQQGTEEVLYIL 188
           V  D    + FQ +V ++  +AV+R G +YG     +  +EV+ +YEP Q+G  +    L
Sbjct: 186 VRVDVTSLNLFQSFVRQS-GWAVQRCGILYGKYDSAESTIEVHAVYEPEQEGGLQKFVCL 244

Query: 189 RDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL----QAVEFHAECNM- 243
           RD      VD +A  LG+++VG + T    ++     LS  E+L    +   F  EC + 
Sbjct: 245 RD-SRVGTVDRLAERLGLRRVGMVCTHQ-ARDPDKMVLSGYELLLCAREQSRFGDECVLL 302

Query: 244 ---EEWVTAVVKLEVSEEGSAAIHF----------EAFQMSDMCVRLFKEGWFETEIA-- 288
                  T  ++ +  +    A+HF          +A   +   V    +G+   EI   
Sbjct: 303 TMSPSITTGRIECQAWQASPQAVHFYRLGTLTEKLDADPRNSTVVTGTADGFCGDEIPSH 362

Query: 289 --------EGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRT 340
                   E    +     +  V+     KE+D  +F   V +     P  S   ++N  
Sbjct: 363 FVYSSISLEVAQEQTDDKGRPQVITKAPSKEIDTRWFTSYVAV----DPFDSHV-VKNLF 417

Query: 341 TQV--------TMRALKSHLN--RSPSLPLVKRISDFHLLLFLA 374
            ++        T+  L+++L   +   +P++++++DFH+L+FLA
Sbjct: 418 VRISRPGMDPPTITNLRNYLKDPKRADVPILQKLADFHVLIFLA 461


>gi|389584256|dbj|GAB66989.1| nuclear pore associated protein [Plasmodium cynomolgi strain B]
          Length = 507

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 70/275 (25%), Positives = 108/275 (39%), Gaps = 76/275 (27%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
           F QY       +  R G+MYG   ED       R     IYEPPQ    + + +L D+  
Sbjct: 268 FVQYWYTNGCMSEHRVGWMYGYYREDPHYNLGIRAVCECIYEPPQINEVDKVKLLPDDFL 327

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKL 253
           +  VD IA  LG++++             D  LSN  V  A                   
Sbjct: 328 DS-VDVIANRLGLERI-------------DPMLSNEPVTNA------------------- 354

Query: 254 EVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET-----EIAEGDDPKLSKMKKDVVVGGKD 308
                         F +SD+ + L +    ++      + +  +P  +++   ++ GGK+
Sbjct: 355 --------------FMVSDLGMALIRSNLVDSTQTDPSVIQLRNPVRNELLPLILEGGKE 400

Query: 309 VKEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSHLNRSPSLPL- 360
             + D D+F+V +   +   P        +  FP ENR    T   +K +   SP L   
Sbjct: 401 TNKFDADWFIVRI---NESAPKVVRSLFKNFHFPRENRLHSPTAYDVKEYF-ASPKLERG 456

Query: 361 ---VKRISDFHLLLFLARFLDLNSDVPALAQCVQA 392
              + R SDFHL+LF  R LD+ +   ALA C  A
Sbjct: 457 TNGMHRCSDFHLILFACRVLDMET---ALALCDAA 488


>gi|115387239|ref|XP_001211125.1| protein NPL4 [Aspergillus terreus NIH2624]
 gi|114195209|gb|EAU36909.1| protein NPL4 [Aspergillus terreus NIH2624]
          Length = 566

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 84/297 (28%), Positives = 113/297 (38%), Gaps = 66/297 (22%)

Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           +N  L F  K G    GF+YGT  E   V       V  IYEPPQ   +EV  I L +  
Sbjct: 273 INSLLDFWRKSGAQRMGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGITLHEWP 330

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
            E+ VD +A   G++KVG IFT  +   + D           Y LS+ EV  A    A+ 
Sbjct: 331 NEREVDEVARLCGLEKVGVIFTDLLDAGQGDGSVVCKRHIDSYYLSSLEVAFASRLQAQY 390

Query: 242 -NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
               +W          VT V  L   EEG  AI   A+Q S   V + +    E      
Sbjct: 391 PKATKWSRNGRFGSNFVTCV--LSGDEEG--AITVSAYQASVAAVEMVRADIVEPSAEPS 446

Query: 288 -----AEGDDP-------------KLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP 329
                +E DD              K+++         K    V+  F  +         P
Sbjct: 447 VMLVQSEEDDTDNKSRYIPEVFYRKINEYGVSAQENAKPAFPVEYLFVTLTHGFPTESSP 506

Query: 330 L--SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLN 380
           L   S+FPIENR     +Q      K  ++       ++ +SDFHLL FL      N
Sbjct: 507 LFTDSSFPIENREVIGESQELRHVAKKLVSHGDPDKAIQAVSDFHLLCFLHSLSTFN 563


>gi|123477189|ref|XP_001321763.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121904596|gb|EAY09540.1| hypothetical protein TVAG_276130 [Trichomonas vaginalis G3]
          Length = 430

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 59/240 (24%), Positives = 111/240 (46%), Gaps = 21/240 (8%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK---RVEVNFIYEPPQQGTEEVLYILRDEEEEKL 196
           FQQ    T  +   R  F++G +  DK   +V V+   EP Q+   + + IL +E    L
Sbjct: 169 FQQKALMT-GYKTHRIMFLFGRI--DKYTGKVTVHCSCEPQQKNFVDHVEIL-EEFNMDL 224

Query: 197 VDAIAAGLGMKKVGFIFTQT-IMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEV 255
              IA   GM+ VG   ++    +  +DY     E +  +    +  + E+ T +V   V
Sbjct: 225 DRNIAHEFGMECVGMAISRPEPTEKSEDYKHEMTEYMVRLAAKYQVEITEYFTTIV---V 281

Query: 256 SEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVG-GKDVKEVDN 314
           +E+     H E +Q SD  ++ +++G F     +   P L K K +V +G  K +KE+D+
Sbjct: 282 TEK-----HAEPYQASDAAMKAYQQGLF----LQSSSPTLLKFKSEVKLGTAKHLKELDS 332

Query: 315 DFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLA 374
           ++ +  ++I +       T P    +   ++  +K + +     P+  R+ +F+LLL+  
Sbjct: 333 NYAVATLRIRNANSHFPDTEPFPPLSEYPSVMDVKKYFDDMQYCPMWYRLFNFNLLLYFC 392


>gi|241949039|ref|XP_002417242.1| nuclear-envelope, endoplasmic-reticulum-associated-degradation
           network protein NPL4, putative [Candida dubliniensis
           CD36]
 gi|223640580|emb|CAX44837.1| nuclear-envelope, endoplasmic-reticulum-associated-degradation
           network protein NPL4, putative [Candida dubliniensis
           CD36]
          Length = 580

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 62/123 (50%), Gaps = 22/123 (17%)

Query: 137 ADAF--QQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
           AD+F    ++N      V+R G++YG   +  +        V  I EPPQ    + + +L
Sbjct: 247 ADSFILNDFINVWRISGVQRFGYLYGKYAKSDKTPLGIKAIVEMIIEPPQHDELDGITLL 306

Query: 189 ---RDEEEEKLVDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQA 234
              ++EEEEKLVD +A   G+ KVG IFT          + + +  KD Y L+N E++ A
Sbjct: 307 DWNKEEEEEKLVDNLANKFGLYKVGMIFTDLTDDGNKNGKVLCKRHKDSYFLTNLEIIMA 366

Query: 235 VEF 237
            +F
Sbjct: 367 AKF 369


>gi|68474832|ref|XP_718457.1| hypothetical protein CaO19.2434 [Candida albicans SC5314]
 gi|74590800|sp|Q5AA50.1|NPL4_CANAL RecName: Full=Nuclear protein localization protein 4
 gi|46440223|gb|EAK99531.1| hypothetical protein CaO19.2434 [Candida albicans SC5314]
          Length = 598

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 69/134 (51%), Gaps = 21/134 (15%)

Query: 137 ADAF--QQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
           AD+F    ++N      V+R G++YG   + ++        V  I EPPQ    + + +L
Sbjct: 267 ADSFILNDFINVWRVSGVQRFGYLYGRYAKSEKTPLGIKAIVETIIEPPQHDELDGITLL 326

Query: 189 -RDEEEEKLVDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQAVE 236
             D++EEK+VD +A   G+ KVG IFT          + + +  KD Y L+N E++ A +
Sbjct: 327 DWDQQEEKMVDQVANKFGLYKVGIIFTDLTDAGTKNGKVLCKRHKDSYFLTNLEIIMAAK 386

Query: 237 FHAEC-NMEEWVTA 249
           F  +  N+ ++ TA
Sbjct: 387 FQLKYPNISKYSTA 400


>gi|384490701|gb|EIE81923.1| hypothetical protein RO3G_06628 [Rhizopus delemar RA 99-880]
          Length = 565

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 72/289 (24%), Positives = 121/289 (41%), Gaps = 57/289 (19%)

Query: 137 ADAFQQYVNETLAFAVKRGGFMYG------TVLEDKRVEVNFIYEPPQQGTEEVLYILRD 190
           A     ++N   A   +R G++YG       V    +  V  IYEPPQQ   + + +   
Sbjct: 197 ASIIDSFLNYWRASGSQRFGYLYGRYEPYLEVPLGVKAVVEAIYEPPQQDHSDGIQLTLP 256

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEF-- 237
            EEE  V+ +A   G+ +VG I++  +            ++   Y LS+ E   A E   
Sbjct: 257 WEEENRVNEVAEACGLVRVGMIYSDLLDDGSGSGKVIAKRHVNSYFLSSVECAFAAEMQR 316

Query: 238 -HAECNME----EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDD 292
            H   + +    ++ +  V   +S +    I   A+Q+SD  V L +  + E       +
Sbjct: 317 RHPNASKQSISGKFSSKFVTCVISGDTEGNIDVNAYQVSDTLVALQEANFTEP----SRN 372

Query: 293 PKLSKMKKDVVVGGKDVKEVDNDF---FLVVVK-------------------ILDHQGPL 330
           P L ++ KD +   + V EV   F   + VVVK                      +  PL
Sbjct: 373 PSLMRV-KDSIPHERYVPEVFYKFKNEYNVVVKQSAKPTFPVEYLLINVTHGFPHNPSPL 431

Query: 331 ---SSTFPIENRT--TQVTMRALKSHLNRSPSLPLVKR-ISDFHLLLFL 373
              +S+FP+ENR   +  +M  L   +N + +   +K  ++DFHLL F+
Sbjct: 432 FNSTSSFPVENREGLSHQSMSELAKVINSADNEEALKNALADFHLLCFI 480



 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 8/86 (9%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
          MLLRIRS++G+ RV VD    T   L   I   L+I  +  T++  ++     +P+    
Sbjct: 1  MLLRIRSKEGMHRVQVDNGE-TFWTLAQKISELLKIE-DPSTIAMGRD----PNPATAAS 54

Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY 86
           + +A  D+ + S N+ HG IV++SY
Sbjct: 55 LSQLA--DKTIESANLKHGDIVYVSY 78


>gi|68474665|ref|XP_718540.1| hypothetical protein CaO19.9970 [Candida albicans SC5314]
 gi|46440312|gb|EAK99619.1| hypothetical protein CaO19.9970 [Candida albicans SC5314]
          Length = 631

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 69/134 (51%), Gaps = 21/134 (15%)

Query: 137 ADAF--QQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
           AD+F    ++N      V+R G++YG   + ++        V  I EPPQ    + + +L
Sbjct: 300 ADSFILNDFINVWRVSGVQRFGYLYGRYAKSEKTPLGIKAIVETIIEPPQHDELDGITLL 359

Query: 189 -RDEEEEKLVDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQAVE 236
             D++EEK+VD +A   G+ KVG IFT          + + +  KD Y L+N E++ A +
Sbjct: 360 DWDQQEEKMVDQVANKFGLYKVGIIFTDLTDAGTKNGKVLCKRHKDSYFLTNLEIIMAAK 419

Query: 237 FHAEC-NMEEWVTA 249
           F  +  N+ ++ TA
Sbjct: 420 FQLKYPNISKYSTA 433


>gi|238878941|gb|EEQ42579.1| protein NPL4 [Candida albicans WO-1]
          Length = 593

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 69/134 (51%), Gaps = 21/134 (15%)

Query: 137 ADAF--QQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
           AD+F    ++N      V+R G++YG   + ++        V  I EPPQ    + + +L
Sbjct: 262 ADSFILNDFINVWRVSGVQRFGYLYGRYAKSEKTPLGIKAIVETIIEPPQHDELDGITLL 321

Query: 189 -RDEEEEKLVDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQAVE 236
             D++EEK+VD +A   G+ KVG IFT          + + +  KD Y L+N E++ A +
Sbjct: 322 DWDQQEEKMVDQVANKFGLYKVGIIFTDLTDAGTKNGKVLCKRHKDSYFLTNLEIIMAAK 381

Query: 237 FHAEC-NMEEWVTA 249
           F  +  N+ ++ TA
Sbjct: 382 FQLKYPNISKYSTA 395


>gi|328770163|gb|EGF80205.1| hypothetical protein BATDEDRAFT_25045 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 587

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 119/292 (40%), Gaps = 63/292 (21%)

Query: 138 DAFQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVL------ 185
           D F Q+   + A   +R G +YG       V       V+ IYEP Q+ + + +      
Sbjct: 214 DNFLQFWRSSGA---QRVGILYGKYEPYAEVPLGVKAVVSAIYEPLQEDSHDTIQLAVTS 270

Query: 186 -YILRDEEEEKLVDAIAAGLGMKKVGFIFTQ-----------TIMQNKKDYTLSNREVLQ 233
            + + +      VDA+A  LG+ +VG I+T               ++   Y LS+ E++ 
Sbjct: 271 SHSILNNATSATVDAVATSLGLVRVGIIYTDLTDDGSGNGTVVCKRHADSYFLSSAEIVF 330

Query: 234 AVEFHAECNMEEWVTAV-------VKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE 286
           A     +  M    +A+       V   +S      I   ++Q+S+MC  + ++   E  
Sbjct: 331 AGVNQEQHPMVTPYSALHTFGSRFVTCVISGNSEGGIDIFSYQVSNMCAAMVRDNIIEAS 390

Query: 287 -------IAEGDDPKLS-------KMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSS 332
                  +AE  D +         K + +++V          ++ LV V    H  P + 
Sbjct: 391 VEPSLMRVAESTDKQYVPEVFYKFKNEYNIMVKNAAKPTFPVEYLLVTVT---HGFPSTP 447

Query: 333 T--------FPIENRT---TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
           +        FPIENR    +   M ALK HL  + +  L+  +SDFH+LLF+
Sbjct: 448 SPTFTNTKAFPIENRVEMGSAQDMHALKQHLTFTENH-LLDLLSDFHVLLFI 498


>gi|448101102|ref|XP_004199483.1| Piso0_001263 [Millerozyma farinosa CBS 7064]
 gi|359380905|emb|CCE81364.1| Piso0_001263 [Millerozyma farinosa CBS 7064]
          Length = 566

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 44/121 (36%), Positives = 57/121 (47%), Gaps = 20/121 (16%)

Query: 137 ADA--FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
           ADA    ++++   A  V+R G MYGT    ++V       V  IYEPPQ G  + L +L
Sbjct: 241 ADATILNKFLDSWRASGVQRFGIMYGTYEPFEKVPLGTKAVVQAIYEPPQAGEVDGLTLL 300

Query: 189 RDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEF 237
             E E + VD+ AA LG+ KVG +FT        D           Y LS  EVL A  F
Sbjct: 301 EWENESQ-VDSAAAALGLYKVGMVFTDLTDSGTGDGSVICKRHTDTYFLSCLEVLMASRF 359

Query: 238 H 238
            
Sbjct: 360 Q 360


>gi|367042586|ref|XP_003651673.1| hypothetical protein THITE_2112228 [Thielavia terrestris NRRL 8126]
 gi|346998935|gb|AEO65337.1| hypothetical protein THITE_2112228 [Thielavia terrestris NRRL 8126]
          Length = 603

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 83/316 (26%), Positives = 128/316 (40%), Gaps = 60/316 (18%)

Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
             +R G +YG  LE   V       V  IYEPPQ   +EV  I L   + E+ V+ +A  
Sbjct: 285 GTQRLGILYGKYLEYDVVPLGIKAVVEAIYEPPQ--IDEVDGISLNPWDNEQEVNQVAKF 342

Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNMEEWV---- 247
            G+++VG I+T  +   K D           Y L+++E+  A    A+     +W     
Sbjct: 343 CGLEQVGVIWTDLLDAGKGDGSAVCKRHADSYFLASQEIAFAARMQAQHPKPSKWSDTGR 402

Query: 248 --TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG---------DDPKLS 296
             +  V   VS      I   A+QMS+  V + + G  E     G         DD  +S
Sbjct: 403 FGSNFVTCVVSGNEKGEISIAAYQMSNDAVEMVRAGIVEPSADPGLMLVREEEEDDGSVS 462

Query: 297 KMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLS-------STFPIENR- 339
           + +       + +   G +V+E     F V  +   L H  P +       S FPIENR 
Sbjct: 463 RTRYIPEVFYRRINEYGANVQENAKPAFPVEYLFVTLTHGFPEAPKPLFQDSGFPIENRE 522

Query: 340 ----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
                 +V+  A    +N+         +SDFHLL F+ +   L  D  AL   V  Q  
Sbjct: 523 HVGEAQEVSAVAKTLKVNQKDD---NLHLSDFHLLCFIHQMGVLAKDEEALLCRVATQHD 579

Query: 396 VPEGYKLLINSMAETV 411
           + E ++L   +  +T+
Sbjct: 580 LAESFQLRSTTGWQTL 595


>gi|342180158|emb|CCC89634.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 422

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 62/279 (22%), Positives = 115/279 (41%), Gaps = 49/279 (17%)

Query: 138 DAFQQYVNETLAFAVKRGGFMYGTVLEDK-RVEVNFIYEPPQQGTEEVLYILRDEEEEKL 196
           + F+ +V +   ++++R G +YGT       +EV+ IYEP Q+G  +   +L+D +    
Sbjct: 102 NVFESFVRQ-CNWSLQRYGVLYGTYNATTFTIEVHAIYEPEQEGNTQTFTVLQD-DRIAT 159

Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVS 256
           VD +A  LG+++VG + T    ++  +  LS RE+L      +    +E V   V   V+
Sbjct: 160 VDLVAKQLGLRRVGVVCTHG-PRDTSEMALSARELLLCAREQSRFG-DECVILTVSPNVT 217

Query: 257 EEGSAAIHFEAFQMSDMCVRLFKEGWF-ETEIAEGDDPKLSKMKKDVVVGGKDVKEVDND 315
              +     +A+Q S   V L+  G   E  +A+  D    +  +    G          
Sbjct: 218 ---TGRTECQAWQASPQAVHLYSLGLLAENPVADVKDDNRGETSEAGTSGSTAAASGATG 274

Query: 316 FFL--------------------VVVKILDHQGPLSSTFPIENRT--------------- 340
            F+                    VV K   H+G     + +  R                
Sbjct: 275 RFVYSSIPLEVAQEQRDASGHPQVVTKAPSHKGRNLLVYRLHRRAVISPVVRNSFVRISR 334

Query: 341 ---TQVTMRALKSHLN--RSPSLPLVKRISDFHLLLFLA 374
              +  T   L+++L   +   + +++R++DFH+L+FLA
Sbjct: 335 PGMSPPTRENLRTYLEDPKRKGMKMLQRLADFHVLVFLA 373


>gi|299116579|emb|CBN74767.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 478

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 63/143 (44%), Gaps = 22/143 (15%)

Query: 266 EAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVV--GGKDVKEVDNDFFLVVVKI 323
           EAF +S+  V++F EG       E   P   ++     V  G K+    +    +  V I
Sbjct: 319 EAFSVSNQTVQMFSEGVLGHHQPE---PTAGRVATTAAVKEGSKETTTPETLGLICNVAI 375

Query: 324 LDHQGPLSSTFPIENRTTQVTMRALKSHL---------------NRSPSLPLVKRISDFH 368
           + H+G L + FP  NR+   ++  LK  L                + PS   +KR+SDF 
Sbjct: 376 VQHEGVLQTRFPSANRSKAASLSDLKECLFGKIGSKAAAKNQKKQKQPSF--LKRLSDFQ 433

Query: 369 LLLFLARFLDLNSDVPALAQCVQ 391
           LLLFL+R +  + D   L   VQ
Sbjct: 434 LLLFLSRQMGFSRDFEELCSVVQ 456


>gi|171680155|ref|XP_001905023.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939704|emb|CAP64930.1| unnamed protein product [Podospora anserina S mat+]
          Length = 547

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 123/309 (39%), Gaps = 63/309 (20%)

Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
             +R G +YG  +E + V       V  IYEPPQ   +EV  + L   E E+ ++ IA  
Sbjct: 228 GAQRLGILYGRYMEYEVVPLGIKAVVEAIYEPPQ--IDEVDGVSLNAWENEQEINQIAKF 285

Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNMEEWVTA-- 249
            G+++VG I+T  +   K D           Y L+++EV  A    A+     +W  A  
Sbjct: 286 CGLEQVGVIWTDLLDAGKGDGSAVCKRHADSYFLASQEVCFAARMQAQHPKPSKWSDAGR 345

Query: 250 ----VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLS 296
                V   VS      I   A+QMS+  V + +    E             E DD  +S
Sbjct: 346 FGSNFVTCVVSGNEQGEISISAYQMSNDAVEMVRADIIEPSADPTQMLVREEEEDDGSVS 405

Query: 297 KMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENR- 339
           + +       + +   G +V+E     F V  +   L H  P S         FPIENR 
Sbjct: 406 RTRYIPEVFYRRINEYGVNVQENAKPAFPVEYLFVTLTHGFPESPKPLFTDEGFPIENRE 465

Query: 340 ------TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQ 393
                       +AL+ +      LP    +SDFHLL ++ +   L+ D  AL   V   
Sbjct: 466 YVGEAQQPSAVAKALRVNQKGDNRLP----VSDFHLLCYIHQMSVLSKDEEALLCKVATH 521

Query: 394 TAVPEGYKL 402
             + E ++L
Sbjct: 522 QDLAEAFQL 530


>gi|167516094|ref|XP_001742388.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163779012|gb|EDQ92626.1| predicted protein [Monosiga brevicollis MX1]
          Length = 932

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 125/302 (41%), Gaps = 77/302 (25%)

Query: 138 DAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRDE 191
           D   +++        +R G++YG     ++V +        IYEPPQ GT + L  L+D+
Sbjct: 238 DILNRFLEPYRKTGAQRVGYLYGRYAPYEQVPLGIQAVVAAIYEPPQVGTADGLE-LQDD 296

Query: 192 EEEKLVDAIAAGLGMKKVGFIFT---------------QTIMQNKKDYT--LSNREVLQA 234
           E + LV+ +AA LG+++VG+IFT               + +     D T  L+  E+L A
Sbjct: 297 EHQMLVEQMAAALGLQRVGWIFTDLEAEDLAAGTVKHKRYVTHEPDDDTTVLTAPEILTA 356

Query: 235 VEFHAEC-------------NMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEG 281
             F                 +  ++VT V    VS + +  I  +A+Q+S   + L ++ 
Sbjct: 357 AMFQQSYPNPVPLRYNDTGFHGSKFVTVV----VSGDDTNMISPKAYQVSTQAMALTRDD 412

Query: 282 WFE-TEIAE-------GDD---PKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILD----- 325
               T+ A+        D+   P +  ++KD    G  +K   N  F V    +D     
Sbjct: 413 ILRPTDRADMMAVKPRTDEVYVPDIFYVEKDKY--GNQIKTPANPVFPVEYLFVDMEAGA 470

Query: 326 --HQGPL----SSTFPIENR-------TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLF 372
              QG      S+ FPIE+R         Q   R   +   R  +L     + DFHLL F
Sbjct: 471 GTGQGTFVNNPSAPFPIEHREAIGEGQNMQALGRYFAAGGARDGNL-----LRDFHLLFF 525

Query: 373 LA 374
           LA
Sbjct: 526 LA 527


>gi|340052592|emb|CCC46874.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 416

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 70/293 (23%), Positives = 122/293 (41%), Gaps = 50/293 (17%)

Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLE-DKRVEVNFIYEPPQQ 179
           R E      V  D+   + FQ +   T  + V+R   +YG      K VEV+ IYEP Q 
Sbjct: 92  RWEKNRTNLVLVDQTSINIFQTFCR-TCGWCVQRYALLYGRYDNATKTVEVHAIYEPKQH 150

Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVE--- 236
           G     +  +D E  + VD +A GL +++VG I T    ++     L+ RE+L       
Sbjct: 151 GDARTFHFEKD-ENIRTVDMVAEGLELRRVGAICTHP-KRDSNAVVLTAREMLLCAREQS 208

Query: 237 -FHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWF-ETEIAEG--DD 292
            F  EC        V+     ++     H  A+Q+S   + L++ G+  E+  A G  D+
Sbjct: 209 IFGDEC--------VLLTVYHDDVEDRTHCSAWQVSTQAIELYRNGFLSESNGARGPDDE 260

Query: 293 P---------------------KLSKMKKDVVVGGKDV-----KEVDNDFFLVVVKILDH 326
           P                     ++++  KD   G + V      E+D  +F+  + I + 
Sbjct: 261 PATTDEQTMWEREARLCSEIELEVAETVKDDKGGCRSVIKKMHHEIDVTWFVSYIGIKEF 320

Query: 327 QGPLSSTFPIENRTTQV---TMRALKSHLN--RSPSLPLVKRISDFHLLLFLA 374
              +     +      +   T   LK +L+  +   +  +++ +DFH+L+FLA
Sbjct: 321 DSNVVRNLFVRLSRPGMDPPTYENLKIYLDDPKRRDMRRLEKYADFHVLVFLA 373


>gi|154338814|ref|XP_001565629.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134062681|emb|CAM39124.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 427

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 72/341 (21%), Positives = 140/341 (41%), Gaps = 68/341 (19%)

Query: 130 VSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLED--KRVEVNFIYEPPQQGTEEVLYI 187
           V  D    + FQ +V + + + ++R   +YG + +D    +EV+ +YEP Q G+      
Sbjct: 95  VKVDAQSINIFQSFVAQ-MDWKLQRYAVLYG-LYDDATHTIEVHAVYEPEQHGSTYAFDP 152

Query: 188 LRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVE----FHAECNM 243
           L D   +K V+ IA  LG+++VG   T   M++ +   L+ RE+L   +    +  EC +
Sbjct: 153 LPDAHMDK-VEKIAKALGLRRVGVACTHP-MRDPEHILLNYRELLLCTKEQSRYGDECAL 210

Query: 244 --------------------EEWVT--AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEG 281
                                E +T  A      + E +  +  +A+Q S  CV L++ G
Sbjct: 211 LTVAPAAMPSTESSGNTSAPGELLTDSAAAASGATRESTIVVSCQAWQTSPQCVHLYRLG 270

Query: 282 WFET-----------------------EIAEGD-DPKLSKMKKDVVVGGKDVKEVDNDFF 317
             +                        E+A+ + DP   +      V      E+D  +F
Sbjct: 271 VLQKPPGGEEALQDVEKARQVHCTMPLEVAQTETDPSGHRR----FVTKSPSTEIDTRWF 326

Query: 318 LVVVKILDHQGPL-SSTFPIENRT--TQVTMRALKSHLN--RSPSLPLVKRISDFHLLLF 372
              + +     P+    F   +R       ++ L++++N  +   +P  +RI+DFH+L++
Sbjct: 327 TSYIAVQQFVSPIVRGAFMRLSRPGMPPPALQNLRNYMNDPKRKGMPFAERIADFHVLVY 386

Query: 373 -LARFLDLNSDVPALAQCVQAQTAVPEG--YKLLINSMAET 410
            L +    + ++ AL    + +    E   Y+ ++  M  T
Sbjct: 387 LLTQIFTTDDELRALCSVARTKMMTEEAANYQAILQGMMPT 427


>gi|354543615|emb|CCE40336.1| hypothetical protein CPAR2_103740 [Candida parapsilosis]
          Length = 577

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 56/115 (48%), Gaps = 18/115 (15%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
              ++N      V+R G++YG   + ++V       V  I EPPQ    + +  L D  +
Sbjct: 254 LNDFINVWRVSGVQRFGYLYGRYEKFEKVPMGIKAVVEAIVEPPQSDELDGI-TLSDWPD 312

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEF 237
           E+LVD IAA LG+ KVG +FT      +KD           Y L+N E+L A +F
Sbjct: 313 EQLVDEIAAKLGIYKVGIVFTDLTDSGQKDGTVLCKRHKDSYFLTNLEILMAAKF 367


>gi|367020574|ref|XP_003659572.1| hypothetical protein MYCTH_2296798 [Myceliophthora thermophila ATCC
           42464]
 gi|347006839|gb|AEO54327.1| hypothetical protein MYCTH_2296798 [Myceliophthora thermophila ATCC
           42464]
          Length = 597

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 86/322 (26%), Positives = 134/322 (41%), Gaps = 72/322 (22%)

Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
             +R G +YG  LE   V       V  IYEPPQ   +E+  + L   E E+ V+A+A  
Sbjct: 279 GTQRLGILYGRYLEYDVVPLGIKAVVEAIYEPPQ--VDEIDGVTLGAWENEQEVNAVAKL 336

Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHA-ECNMEEW----- 246
            G+++VG I+T  +     D           Y L+++EV+ A    A      +W     
Sbjct: 337 CGLEQVGVIWTDLLDAGNGDGTAICKRHADSYFLASQEVVFAARMQARHPKPSKWSDTGR 396

Query: 247 -----VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDD 292
                VT VV    +EEG  +I   A+QMS+  V + +    E             E +D
Sbjct: 397 FGSNFVTCVVS--GNEEGEISI--SAYQMSNDAVEMVRADIIEPSADPTLMLVREEEEED 452

Query: 293 PKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPI 336
             +S+ +       + +   G +V+E     F V  +   L H  P         S FPI
Sbjct: 453 GSVSRTRYIPEVFYRRINEYGANVQENAKPAFPVEYLFVTLTHGFPETPKPVFTDSGFPI 512

Query: 337 ENRT-------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
           ENR             +ALK +  +  +LP    +SDFHLL F+ +   L+ +  AL   
Sbjct: 513 ENREYLGEAQEVSAVSKALKVN-QKGDALP----VSDFHLLCFIHQMSVLSKEEEALLCR 567

Query: 390 VQAQTAVPEGYKLLINSMAETV 411
           V  Q  + E ++L   +  +T+
Sbjct: 568 VATQHDLAESFQLRSTTGWQTL 589



 Score = 39.3 bits (90), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 50/94 (53%), Gaps = 11/94 (11%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQ--TLSTNQNLLLAKSPSDL 58
          MLLR+RS DG+ R++ D    + A+L   +  QL   V+ Q  TLS + +   AK   ++
Sbjct: 1  MLLRLRSPDGMFRLSADKDE-SFADLIGKLVPQLPSTVDPQSITLSNHPSGGDAKKIGEI 59

Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSYDGERNV 92
           +F         +S + +SHG ++F+SY  +  V
Sbjct: 60 ARFK--------ISQVGLSHGDLIFVSYKHQDTV 85


>gi|378728298|gb|EHY54757.1| hypothetical protein HMPREF1120_02922 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 597

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 80/292 (27%), Positives = 118/292 (40%), Gaps = 63/292 (21%)

Query: 137 ADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRD 190
           AD   + ++       +R GF+YG   E   V       V  IYEPPQ    + L +L D
Sbjct: 267 ADMVDKVIDFWRKSGCQRLGFLYGRYEEYTEVPLGIKAVVEAIYEPPQMCEVDGLTLL-D 325

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQAVEFHA 239
            + EK VD +A   G+++VG IFT          Q + +   D Y LS+ E+  A  F A
Sbjct: 326 WDNEKDVDEMARLCGLERVGVIFTDLMSPEDGEGQAVCKRHIDSYFLSSLEIAFASRFQA 385

Query: 240 EC-NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDD 292
           +     +W       +  V   V+ +   +I   A+Q S+  V + +    E       +
Sbjct: 386 KYPKPSKWSDTGRFGSNFVTCVVTGDEDQSITINAYQASNTAVEMVRADIVEPSA----E 441

Query: 293 PK--LSKMKKDVVVG-----------------GKDVKEVDNDFFLVVVKILD-------H 326
           P   L + ++D VVG                 G +V+E     F V   ++         
Sbjct: 442 PSVMLVQSEEDEVVGARTRYIPEVFYRRINEYGANVQENAKPAFPVEYLLVSLTAGMPIK 501

Query: 327 QGPL--SSTFPIENRTTQVT---MRALKSHLNRSPSLPLVKRISDFHLLLFL 373
             PL  ++ FP+ENR         R +   L  S S   V  ISDFHLL FL
Sbjct: 502 PNPLFKNNKFPVENREAVAEGQDPREVAKLLRSSQS---VDAISDFHLLCFL 550


>gi|149603388|ref|XP_001513178.1| PREDICTED: nuclear protein localization protein 4 homolog, partial
           [Ornithorhynchus anatinus]
          Length = 333

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 11/104 (10%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
           F  +    L    +R G+MYG  +ED       R     IYEPPQ+       IL+D+  
Sbjct: 196 FANFWMRDLQMITQRHGWMYGYYIEDAHYGRGIRAVCEAIYEPPQRDINGYGEILQDDFL 255

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF 237
             +VD IA+ LG++++G IFT       +D  L+  EVLQA + 
Sbjct: 256 -NVVDTIASKLGLERIGQIFTHL----PRDSYLTPTEVLQAAKL 294


>gi|344305066|gb|EGW35298.1| nuclear protein localization protein 4 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 563

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 64/128 (50%), Gaps = 19/128 (14%)

Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
             +++N   +  V+R G++YG      +V       V  IYEPPQ G  + + +L  ++E
Sbjct: 239 LNEFLNGWRSTGVQRFGYLYGRYEAFDKVPLGIKAVVESIYEPPQVGEFDGVTLLNWDDE 298

Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEF-HAEC 241
           ++ +DAIA+ LG+ KVG  +T      +KD           Y L+N E + A  F H   
Sbjct: 299 QQ-IDAIASKLGLTKVGIAYTDLTDSGRKDGTVLCKRHKDSYFLTNLECIMAARFQHKYP 357

Query: 242 NMEEWVTA 249
           N+ ++ ++
Sbjct: 358 NVTKYTSS 365


>gi|296421545|ref|XP_002840325.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295636540|emb|CAZ84516.1| unnamed protein product [Tuber melanosporum]
          Length = 675

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 78/312 (25%), Positives = 126/312 (40%), Gaps = 53/312 (16%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R G++YG +     V       V  IYEPPQ G  + + +      EK V  +A+  G+
Sbjct: 284 QRIGYLYGRLAPYPEVPLGIKAIVEAIYEPPQNGESDGVTLQLPWTGEKDVAEVASLCGL 343

Query: 207 KKVGFIF-----------TQTIMQNKKDYTLSNREVLQAVEFHAE-CNMEEWV------T 248
           +K+G IF           T    ++   Y LS+ E++ A    AE  +  +W       +
Sbjct: 344 RKIGVIFSDLLDDGTGQGTVVCKRHIDSYYLSSLEIVFAARLQAEHPHPSKWSETGQFGS 403

Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE------IAEGDDPKLSKMK--- 299
           + V   VS      I   A+Q+S+  V + +    E        +   +D K   +    
Sbjct: 404 SFVTCVVSGNEDNGIDVAAYQVSNTAVEMVRADIIEPSADPSVMMVHEEDGKTRYVPEVF 463

Query: 300 -KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSS---------TFPIENR---TTQVT 344
            + +   G +VKE     F V  ++  L H  P  +          FPI NR        
Sbjct: 464 FRHINEYGLNVKENAKPSFPVEYLLVTLTHGFPNDTKPIFTAPLGKFPIGNREIMGISQD 523

Query: 345 MRALKSHL---NRSPSLPL--VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEG 399
            RA+   L   ++S + P   +  +SDFHLL F+  +  L+ +   L   V     V +G
Sbjct: 524 HRAVAKQLGITSQSANRPTDGIDVVSDFHLLTFIKGWGVLDKNEEELLAKVAVGHDVNDG 583

Query: 400 YKLLINSMAETV 411
           YKLL     +T+
Sbjct: 584 YKLLQTPGWQTL 595


>gi|312375034|gb|EFR22482.1| hypothetical protein AND_15202 [Anopheles darlingi]
          Length = 333

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 48/206 (23%), Positives = 95/206 (46%), Gaps = 30/206 (14%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
           + RQ+  H ++V F+       ++++N       +R G++YGT      V    R  V  
Sbjct: 92  LNRQQYRHVDNVMFEN--TGIVERFLNYWRTTGHQRIGYLYGTYEIHPDVPLGIRARVAA 149

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
           IYEP Q+   + + +L D  ++  VD +A  LG+++VG+IFT  +           ++N 
Sbjct: 150 IYEPQQESNRDTIRLL-DGADDGDVDEMAKLLGLQRVGWIFTDLLSENLANGTVKHVRNI 208

Query: 222 KDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
           K + L+++E + A     +    C       + +  V + V+ +    +H E + +S  C
Sbjct: 209 KTHFLTSQECILAGHLQNKHPNRCKYSSNGYFGSKFVTVCVTGDEKKQVHMEGYAVSAQC 268

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK 300
           + L ++      I   D P+L  +++
Sbjct: 269 MALVRDNCL---IPTKDAPELGYIRE 291


>gi|401423393|ref|XP_003876183.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322492424|emb|CBZ27698.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 631

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 69/301 (22%), Positives = 126/301 (41%), Gaps = 65/301 (21%)

Query: 130 VSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLED--KRVEVNFIYEPPQQGTEEVLYI 187
           V  D    + FQ +V + + + ++R   +YG + +D    +EV+ +YEP Q G+      
Sbjct: 299 VKVDAQSINIFQSFVAQ-MDWRLQRYAILYG-LYDDATHTIEVHAVYEPEQHGSTYTFEP 356

Query: 188 LRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVE----FHAECNM 243
           L D   +K V+ IA  LG+++VG   T   M++ +D  L+ RE+L   +    +  EC +
Sbjct: 357 LPDAHLDK-VEKIAKALGLRRVGVACTHP-MRDPEDILLNFRELLLCAKEQSRYGDECAL 414

Query: 244 ---------------------EEWVTAVVKLE-VSEEGSAAIHFEAFQMSDMCVRLFKEG 281
                                E   ++V  L   + E +  +  +A+Q S  CV L++ G
Sbjct: 415 LTVAPAAMPSTKSGDTTSAPGEFPASSVAALSGATPEPTIVVSCQAWQASPQCVHLYRLG 474

Query: 282 WFET-----------------------EIAEGD-DPKLSKMKKDVVVGGKDVKEVDNDFF 317
             +                        E+A+ D DP  S  ++ V        E+D  +F
Sbjct: 475 VLQESPGGEAALQGVEQARQVHCTMPLEVAQTDTDP--SGHQRFVTTAPS--TEIDTRWF 530

Query: 318 LVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLN--RSP---SLPLVKRISDFHLLLF 372
              + +   + P+     +      +   AL++  N  R P   S+   + I+DFH+L++
Sbjct: 531 TSYIAVQQFESPIVRGAFMRLSRPGMPPPALQNLRNYIRDPKRKSMSFAETIADFHVLVY 590

Query: 373 L 373
           L
Sbjct: 591 L 591


>gi|324508917|gb|ADY43759.1| Nuclear protein localization protein 4 [Ascaris suum]
          Length = 507

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 74/316 (23%), Positives = 135/316 (42%), Gaps = 70/316 (22%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG----GFMYGT------VLEDK 166
           + + RQ   H +++S +       ++ VN  LAF  K G    G++ G       V    
Sbjct: 117 LTLNRQRFRHVDNISIEN------KEVVNRFLAFWRKSGNQRVGYLIGRYEPFPDVPLGI 170

Query: 167 RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQ----------- 215
           +  V  IYEPPQ  + E +    ++  EKLVD + + L +++VG+IFT            
Sbjct: 171 KATVAAIYEPPQSSSSESVK-FEEDPNEKLVDELCSYLNLRRVGWIFTDLWSADSAKGTV 229

Query: 216 TIMQNKKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAF 268
              ++K  + LS +E + A  F  +       C+   + +    L  S + S  I+F  +
Sbjct: 230 HCTRHKDSFFLSAKECITAGYFQNKYPNVTTFCSDRYFGSKFTTLVASGDQSEQINFNGY 289

Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK----------DV---------VVGGKDV 309
           Q+S+ C  + +       +     P+L+ +++          DV         V   KD 
Sbjct: 290 QVSNQCAAMVEANI----LCPTSHPELAYLRQTPLYPAHYITDVHFMDKNEYGVETMKDG 345

Query: 310 KEVDNDFFLVVVKILDHQGP------LSST---FPIENRTTQVTMR---ALKSHLNRSPS 357
           + +  ++ LV V     + P      LSS    FP ENR     ++   A+ +++++  +
Sbjct: 346 RPMPVEYLLVDVPAGMPKEPHATFHILSSKGYPFPNENRDIIGELQDTNAIGAYISQFSA 405

Query: 358 LPLVKRISDFHLLLFL 373
              ++  ++FH LLFL
Sbjct: 406 NQFLELATNFHFLLFL 421


>gi|302925668|ref|XP_003054140.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256735081|gb|EEU48427.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 645

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 76/304 (25%), Positives = 121/304 (39%), Gaps = 56/304 (18%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G++YG   E   V       V  IYEPPQ   +E+  I +   E +K +DA+A   G
Sbjct: 270 QRLGYLYGKYAEYTEVPLGVKAVVEAIYEPPQ--VDELDGITMNAWESQKEIDAVAKHCG 327

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEWV------ 247
           ++ VG I+T  +     D           Y LS+ E+  +    A+     +W       
Sbjct: 328 LEPVGVIWTDLLDAGNGDGSVICRRHADSYFLSSLEITFSSRLQAQYPKSSKWSDTGSFG 387

Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
           +  V   ++      I   ++QMS+  V + +    E             E DD  +S+ 
Sbjct: 388 SNFVTCIITGNEDGEISISSYQMSNESVEMVRADIVEPSADPNVMLVRDEEEDDGSVSRT 447

Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENR--- 339
           +       + V   G +V+E     F V  +   L H  P           FPIENR   
Sbjct: 448 RYIPEVFYRRVNEYGANVQENAKPSFPVEYLFVTLTHGFPAEPKPMFTQQGFPIENREYI 507

Query: 340 TTQVTMRALKSHLNRSPS-LPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPE 398
                  AL   +  SP   P  + +S+FHLL F+ +   L+ D  AL   V +Q  + +
Sbjct: 508 GESQEHSALSKFVKSSPGKQPTGQEVSNFHLLCFIHQMSVLSKDEEALLCRVASQHDLAD 567

Query: 399 GYKL 402
            ++L
Sbjct: 568 SFQL 571



 Score = 38.5 bits (88), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 11/88 (12%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQT--LSTNQNLLLAKSPSDL 58
          MLLR+R  DG+ R+TV+    T  +L   +  QL   V+  T  LS   N   +K  +D+
Sbjct: 1  MLLRLRGPDGMARLTVEPT-TTFGDLGKQLLDQLPKTVDPTTISLSNAPNGGDSKRLADI 59

Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
          + F         +  + + HG ++FL+Y
Sbjct: 60 INFK--------VGQIGLKHGDLIFLTY 79


>gi|385303973|gb|EIF48012.1| nuclear protein localization protein [Dekkera bruxellensis
           AWRI1499]
          Length = 508

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 72/285 (25%), Positives = 123/285 (43%), Gaps = 50/285 (17%)

Query: 137 ADAFQQYVNETLAFAVKRGGFMYGTVLEDKRV------EVNFIYEPPQQGTEEVLYILRD 190
           ++    ++N      ++R G M+G+     +V      +V  IYE PQ   E+ + + + 
Sbjct: 196 SEIVNNFINAWRLSGMQRIGLMFGSYARYDKVPLGIKAKVQTIYELPQVDREDGVMLQKW 255

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHA 239
           ++E+K++  +A+ L +  VG IFT  +   K D           + LS+ E++ AV++  
Sbjct: 256 DDEDKVL-KLASELNLVPVGIIFTDLMDAGKGDGSVICKRHKSSFFLSSLELIFAVKWQL 314

Query: 240 EC-NMEEW------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE------ 286
           +  N+  W       +  V   +S   S  I  EA+Q S+    L K             
Sbjct: 315 KFPNICHWSSSGKFSSKFVTCVISGNPSGEIDMEAYQASESAEGLVKADLISPSTHPSQV 374

Query: 287 -IAEGDDPKLSK--MKKDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------- 333
            + E +B +     M   V   G +VK+     F V  ++  L H  P  S+        
Sbjct: 375 FVNEQNBKRYVPDIMYNMVNEYGLEVKKNGKPTFPVEYLLVSLTHGFPEKSSPFFKDTEK 434

Query: 334 FPIENRTT---QVTMRALKSHLNRSPSLP--LVKRISDFHLLLFL 373
           FPIENR       TM  +K H N  P+ P   + R+S+FHLL ++
Sbjct: 435 FPIENRNYIDEVPTMSDIKEHFN-IPNDPAVFISRLSNFHLLXYI 478


>gi|84994858|ref|XP_952151.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65302312|emb|CAI74419.1| hypothetical protein, conserved [Theileria annulata]
          Length = 549

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 76/154 (49%), Gaps = 17/154 (11%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKL 196
           ++NE L  +++R G+MYG   ED+      R     IYEPPQ      + ++ DE     
Sbjct: 207 WLNE-LQMSLQRIGWMYGYYTEDQHYPYGIRAVCEAIYEPPQYSNINDVIMIEDEMLMN- 264

Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFH----AECNMEEW-VTAVV 251
           VD IA  LG++++G IFT+       +  L+ RE+++ +E         +   + ++ +V
Sbjct: 265 VDKIAERLGLERIGMIFTRL----PNECFLTARELIRVIEMQYNSLKHIHYTGYPISTMV 320

Query: 252 KLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET 285
              +S + +      A+  SD+ + LF++  F +
Sbjct: 321 TCTMSPDDNGQPVLNAYMASDLALALFRDELFTS 354


>gi|170042588|ref|XP_001849002.1| nuclear protein localization [Culex quinquefasciatus]
 gi|167866115|gb|EDS29498.1| nuclear protein localization [Culex quinquefasciatus]
          Length = 464

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 52/230 (22%), Positives = 101/230 (43%), Gaps = 35/230 (15%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
           + RQ   H ++V F+       ++++N       +R G ++GT      V    R  V  
Sbjct: 244 LNRQTYRHVDNVMFEN--TGIVERFLNYWRTTGHQRIGMLFGTYEVHPDVPLGIRARVAA 301

Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNK----------- 221
           IYEPPQ+   + + +L DE   + ++ +A  LG+++VG+IFT  + +N            
Sbjct: 302 IYEPPQESNRDSIRLLEDEHAVE-IEELARQLGLQRVGWIFTDLLAENAAAGTVKHVRGI 360

Query: 222 KDYTLSNREVLQAV----EFHAECNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
           K + L+  E + A     ++   C       + +  V + V+ +    +H E + +S  C
Sbjct: 361 KTHFLTAHECILAGHLQNKYPNRCKHASKGLFGSKFVTVCVTGDDKKQVHMEGYAVSAQC 420

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKIL 324
           + L ++      I   D P+L  +++       D + V + ++  VV  L
Sbjct: 421 MALVRDNCL---IPTKDAPELGYIRES-----SDKQYVPDVYYKTVVPNL 462


>gi|156057431|ref|XP_001594639.1| hypothetical protein SS1G_04446 [Sclerotinia sclerotiorum 1980]
 gi|190359918|sp|A7EGK5.1|NPL4_SCLS1 RecName: Full=Nuclear protein localization protein 4
 gi|154702232|gb|EDO01971.1| hypothetical protein SS1G_04446 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 611

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 78/312 (25%), Positives = 128/312 (41%), Gaps = 61/312 (19%)

Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
             +R G++YG   E   V       V  IYEPPQ   +E+  I L   E EK VD +A  
Sbjct: 286 GAQRFGYLYGRYEEYTEVPLGVKAVVEAIYEPPQ--VDELDGITLNKWESEKDVDEMARL 343

Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEE 245
            GM++VG I+T  +     D           Y LS+ E+  A    A+        +  +
Sbjct: 344 CGMERVGVIWTDLLDSGAGDGTVICKRHIDSYYLSSLEIAFAARLQAKHPKPTKWSDTGK 403

Query: 246 WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--------AEGDDPKLSK 297
           + +  V   V+ + +  I   A+Q+S+  V + +    E           +EGDD   + 
Sbjct: 404 FGSNFVTCVVTADETGGIAISAYQVSNTAVEMVRADIVEPSADPAVMIVRSEGDDDSDTS 463

Query: 298 MK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP--------LSSTFPIENRT 340
            +       + +   G+ V+E     F V  ++  L H  P          ++F IENR 
Sbjct: 464 ARYIPEVFYRKINEYGRSVQENAKPSFPVEYLLVTLTHGFPSDPKPAFMAKNSFTIENRL 523

Query: 341 TQVTMRALKS-----HLNRSPSLPL----VKRISDFHLLLFLARFLDLNSDVPALAQCVQ 391
                + +K       L+++  L      V  +SDFHLL ++     L+ +  AL   V 
Sbjct: 524 VIGQEQDIKDVGKQLGLDKNGQLTQSSDGVLAVSDFHLLCYIYSMGILSKEEMALLCRVA 583

Query: 392 AQTAVPEGYKLL 403
            Q  + +GY+L+
Sbjct: 584 TQHDLADGYQLI 595


>gi|219116761|ref|XP_002179175.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409066|gb|EEC48998.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 501

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 56/221 (25%), Positives = 98/221 (44%), Gaps = 18/221 (8%)

Query: 168 VEVNFIYEPPQQGTEEVLY---ILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDY 224
            +V+ I+EPPQ       Y    LR      L   IAA LG+K VG+IFT +  ++K   
Sbjct: 241 AKVHAIWEPPQNSQPGTAYDGKALRKIPSNVL--RIAAHLGLKPVGWIFTYSDDRHKDQD 298

Query: 225 TLS--NREVLQAVEFHAECNMEEWVTA-VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEG 281
            L   +++V    +  A  NM+    A   +       +     EAFQ+SD+ V++  E 
Sbjct: 299 ALPVWSKDVQHGAQLQA-FNMQRTDRAEGARFATLAMDAKVGTTEAFQLSDVAVQIVAED 357

Query: 282 WFETEIAEGD---DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPI-- 336
               +  + D     +       ++V G++   +D    LV   +L H+G  S    +  
Sbjct: 358 MLLIQNGKKDVDGGGRYVTTSHAILVDGQETTTLDCVLCLVNTALLSHEGSFSGGSAVAA 417

Query: 337 ----ENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
               +   T  T +A+ ++++ S    L++ +S+F +LL L
Sbjct: 418 VKKKDGGLTSKTRKAILAYIDGSDDAALLEILSNFAVLLAL 458


>gi|150951498|ref|XP_001387827.2| nuclear protein localization factor and ER translocation component
           [Scheffersomyces stipitis CBS 6054]
 gi|284018106|sp|A3GFS1.2|NPL4_PICST RecName: Full=Nuclear protein localization protein 4
 gi|149388644|gb|EAZ63804.2| nuclear protein localization factor and ER translocation component
           [Scheffersomyces stipitis CBS 6054]
          Length = 577

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 18/102 (17%)

Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
            V+R G MYG      +V       V  IYEPPQ G  + + +L  E E + VDAIA+ L
Sbjct: 265 GVQRFGVMYGRYEPFDKVPLGIKAVVEAIYEPPQSGELDGITMLPWENEAE-VDAIASEL 323

Query: 205 GMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAV 235
           G+ KVG +FT          T++  ++K  Y LSN E+L A 
Sbjct: 324 GIYKVGVVFTDLTDSGQKNGTVLCKRHKDSYFLSNLEILMAA 365


>gi|430814351|emb|CCJ28392.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 530

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 73/299 (24%), Positives = 121/299 (40%), Gaps = 57/299 (19%)

Query: 153 KRGGFMYG------TVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R G++YG       V    +  V  IYEPPQ+     + ++    EE  V+ +A   G+
Sbjct: 223 QRFGYLYGRYEPYDAVPLGIKAVVEAIYEPPQENEPNGISLILPWNEESTVNEVAHLFGL 282

Query: 207 KKVGFIFTQ-TIMQNKKDYTLSNREVLQAVEFHAEC-NMEEWV------TAVVKLEVSEE 258
           KKVG IFT  T   + K   +  R V+ A     +  N+ +W       +  V   VS  
Sbjct: 283 KKVGMIFTDLTDDDSGKGTVICKRHVVFAATMQLKHKNVTKWSKTGEFGSKFVTCVVSAN 342

Query: 259 GSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--DVVVGGKDVKEVDN-- 314
               I   A+Q+S+  + L      E  +    DP    ++K  DV +     ++++   
Sbjct: 343 KEGGIDIFAYQVSNTAMALVDAKIIEPSV----DPNTMLIRKEHDVYIPEVFFQKINQYK 398

Query: 315 -------------DFFLVVVKILDHQGPL---------SSTFPIENRTTQVTMRALKSHL 352
                        ++ LV    L H  P+         S TF IENR +   ++ + +  
Sbjct: 399 TQVQENAKSSFPIEYLLVT---LTHGFPIKSESIFSSASGTFLIENRASIGQVQNINALA 455

Query: 353 NR------SPSLPLVKRISDFHLLLFLARFLDL--NSDVPALAQCVQAQTAVPEGYKLL 403
            R       P    +K + DFH L +L   LD+    D   L +C++ +  + + Y LL
Sbjct: 456 ERIKPILEQPYGDAIKALCDFHFLTYLWS-LDILDKEDGTLLIKCIK-EWQINDLYNLL 512


>gi|297798574|ref|XP_002867171.1| hypothetical protein ARALYDRAFT_913062 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313007|gb|EFH43430.1| hypothetical protein ARALYDRAFT_913062 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 50

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/45 (62%), Positives = 34/45 (75%), Gaps = 8/45 (17%)

Query: 345 MRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
           MR LK+H      LPL    SDFHLLLF+A+FLD++SDVPALA+C
Sbjct: 1   MRTLKTH----SELPL----SDFHLLLFVAQFLDVSSDVPALAEC 37


>gi|448516776|ref|XP_003867639.1| Npl4 ubiquitin-binding protein [Candida orthopsilosis Co 90-125]
 gi|380351978|emb|CCG22202.1| Npl4 ubiquitin-binding protein [Candida orthopsilosis]
          Length = 589

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 56/113 (49%), Gaps = 18/113 (15%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKL 196
           ++N      V+R G++YG   +  +V       V  I EPPQ    + + +L +  +E+L
Sbjct: 269 FINVWRVSGVQRFGYLYGRYEKFDKVPMGIKAVVEAIVEPPQSDELDGITLL-EWPDEQL 327

Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFH 238
           VD IAA LG+ KVG +FT      +KD           Y L+N E+L A +F 
Sbjct: 328 VDEIAAKLGIYKVGVVFTDLTDSGQKDGTVLCKRHKDSYFLTNLEILMAAKFQ 380


>gi|320582084|gb|EFW96302.1| Nuclear protein localization protein [Ogataea parapolymorpha DL-1]
          Length = 578

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 81/326 (24%), Positives = 140/326 (42%), Gaps = 62/326 (19%)

Query: 137 ADAFQQYVNETLAFAVKRGGFMYGTVLEDKRV------EVNFIYEPPQQGTEEVLYILRD 190
           ++   +++N       +R G M G+    ++V      +V  IYE PQ   E+ + IL++
Sbjct: 240 SEIINEFINAWRLSGTQRIGLMLGSYDRYEKVPLGIKAKVEAIYELPQVDQEDGI-ILQN 298

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEFHA 239
            EEE+ + ++   L ++ VG IFT  +            ++K  + LS+ E + A+++  
Sbjct: 299 WEEEQQILSLIRKLDLQPVGIIFTDLLDAGSGNGSVICKRHKDSFFLSSLEAIFAIKWQL 358

Query: 240 EC-NMEEW------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWF-------ET 285
           +  N+ +W       +  V   +S   S  I  EA+Q+S+    L K           E 
Sbjct: 359 KFPNICKWSDSGIFSSKFVTCVISGNTSGEIDIEAYQISESGEGLVKADLISPSTHPNEV 418

Query: 286 EIAEGDD----PKLSKMKKDVVVGGKDVKEVDNDFFLV--VVKILDHQGP---------- 329
            I E +D    P++   K +    G  VK+     F V  ++  L H  P          
Sbjct: 419 YINEQNDERYVPEIFYQKINEY--GLQVKQHATPSFPVEYLLVSLTHGFPERSSPFFKAG 476

Query: 330 LSSTFPIENRT---TQVTMRALKSHL---NRSPSLPLVKRISDFHLLLFLARFLDLNSD- 382
            S+ FPIENR       +M  LK++L   N      L   +S+FHLL++     D+ S  
Sbjct: 477 ASNKFPIENRAYIGESASMPDLKNYLSSINGDDMSVLATLLSNFHLLVYFTLNQDILSGH 536

Query: 383 -----VPALAQCVQAQTAVPEGYKLL 403
                V    +  + + A+P+ YKL+
Sbjct: 537 EYELLVEITRKIGKGEEALPDCYKLV 562


>gi|221054019|ref|XP_002261757.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|193808217|emb|CAQ38920.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 529

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/119 (26%), Positives = 58/119 (48%), Gaps = 5/119 (4%)

Query: 297 KMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSP 356
           K K DV++    VK +D  F +  V IL H+   ++ FP  N +   T+    ++++RS 
Sbjct: 406 KCKSDVLIEKIQVKHIDVLFCVNNVPILSHKSVYNTFFPYPNNSNYYTILQRFNYMSRSL 465

Query: 357 SLP--LVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQT---AVPEGYKLLINSMAET 410
             P  +V    DF+ LLFL        D+P + + +        +P+ Y  ++ S++++
Sbjct: 466 HNPKDMVSIFRDFNFLLFLTNIFSFEHDIPPICKAINDSNNSLGIPDQYVSILRSLSQS 524


>gi|255730104|ref|XP_002549977.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240133046|gb|EER32603.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 581

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 53/112 (47%), Gaps = 18/112 (16%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKL 196
           ++N      V+R G++YG   + ++        V  I EPPQ    + + +L D EEEKL
Sbjct: 265 FINIWRLSGVQRFGYLYGHYEKSEKSPLGIKAVVETIVEPPQHDELDGITLL-DWEEEKL 323

Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEF 237
           VD +AA  G+ KVG  FT       KD           Y L+N E++ A  F
Sbjct: 324 VDELAAQFGIYKVGITFTDLTDSGNKDGTVLCKRHKDSYFLTNLEIIMASRF 375


>gi|406603385|emb|CCH45063.1| Nuclear protein localization protein 4 [Wickerhamomyces ciferrii]
          Length = 566

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 76/310 (24%), Positives = 125/310 (40%), Gaps = 67/310 (21%)

Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
             +R GF+YGT  +  +V       V  IYEP Q    + L +    E+E+ +D +A   
Sbjct: 251 GTQRIGFLYGTYQKYDKVPLGIKAVVEAIYEPTQHDENDGLTLGLPWEDEERIDNLAKSA 310

Query: 205 GMKKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEFHAE-------CNMEEW 246
           G+ K+G IF+  I            ++K  Y LS+ EV+ A +  ++           ++
Sbjct: 311 GLYKIGVIFSDLIDAGNGDGSVICKRHKDSYFLSSLEVIFAAKLQSKYPNYTQFSQSGKF 370

Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG-------------DD- 292
            +  V   VS      I   ++Q+S+       EG  + +I  G             DD 
Sbjct: 371 SSKFVTCVVSGNLQGEIDIASYQVSEGA-----EGLVKADILTGSTHPSFAYINETNDDR 425

Query: 293 -------PKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-------SSTFPIEN 338
                   K+++ K  V    K    VD   +L+V   L H  P        S+ FPIEN
Sbjct: 426 YVPDIFYTKMNEYKLQVKSNAKPAFPVD---YLLVT--LSHGFPQEPNPVFKSNKFPIEN 480

Query: 339 R----TTQVTMRALKS-HLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQ 393
           R     +Q  +   K   ++   +  ++  ISDFHLL +L     L+     L   +  +
Sbjct: 481 RQAAGISQDFLEIKKQLKIDSFDTNDVINSISDFHLLAYLFSLDILSPSEQKLVVKIAVE 540

Query: 394 TAVPEGYKLL 403
             + + Y+LL
Sbjct: 541 HRLEDAYRLL 550


>gi|406867683|gb|EKD20721.1| NPL4 family protein [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 709

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 78/318 (24%), Positives = 129/318 (40%), Gaps = 67/318 (21%)

Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
             +R G++YG   E   V       V  IYEPPQ    + + +L D +EE LV+ IA   
Sbjct: 271 GAQRIGYLYGRYTEYTEVPLGIKAVVEAIYEPPQINEVDGVTLL-DWQEESLVEGIARQC 329

Query: 205 GMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEW 246
           G++KVG I+T  +     D           Y LS+ E+  A    A+        +   +
Sbjct: 330 GLQKVGVIWTDLLDAGAGDGSVVCKRHIDSYYLSSLEIAFAARLQAQHPKPTKWSDTGRF 389

Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSK 297
            +  V   VS +    I   ++Q+S+  V + +    E             E DD  +S+
Sbjct: 390 GSNFVTCVVSGDEKGQIAISSYQVSNSAVEMVRANIVEPSADPTVMLVRNEEEDDGSISR 449

Query: 298 MK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP--------LSSTFPIENRT 340
            +       + +   G  V+E     F V  ++  L H  P         +  F I NR 
Sbjct: 450 TRYIPEVFYRRINEYGASVQENAKPSFPVEYLLVTLTHGFPDAPKPMFIAAPGFTIANRL 509

Query: 341 T------QVTMRALKSHLNRSPSLP----LVKRISDFHLLLFLAR-----FLDLNSDVPA 385
           T      + T+ A +  ++R+  +      ++ +SDFHLL +L +     F   + +  A
Sbjct: 510 TLGEEQDKKTL-AKRIGVDRAGEVKNADGALRAVSDFHLLCYLKKEYVNLFEKASLEDEA 568

Query: 386 LAQCVQAQTAVPEGYKLL 403
           L   V  +  + EGY+LL
Sbjct: 569 LLYRVATKHDLAEGYQLL 586


>gi|342876146|gb|EGU77804.1| hypothetical protein FOXB_11668 [Fusarium oxysporum Fo5176]
          Length = 646

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 80/308 (25%), Positives = 125/308 (40%), Gaps = 64/308 (20%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G++YG   E   V       V  IYEPPQ   +E+  I +   E +K VD +A   G
Sbjct: 271 QRLGYLYGKYTEYTEVPLGVKAVVEAIYEPPQ--VDELDGITMNAWESQKEVDRVAKQCG 328

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEW------- 246
           ++ VG I+T  +     D           Y LS+ E+  +    A+     +W       
Sbjct: 329 LEPVGVIWTDLLDAGNGDGTVVCKRHADSYFLSSLEICFSSRLQAQYPKPSKWSDTGSFG 388

Query: 247 ---VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPK 294
              VT ++    +E+G  +I   ++QMS+  V + +    E             E DD  
Sbjct: 389 SNFVTCIIT--GNEQGEISI--SSYQMSNESVEMVRADIVEPSADPNVMLVRDEEEDDGS 444

Query: 295 LSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIEN 338
           +S+ +       + V   G +V+E     F V  +   L H  P           FPIEN
Sbjct: 445 VSRTRYIPEVFYRRVNEYGANVQENAKPSFPVEYLFVTLTHGFPADPKPMFTQQGFPIEN 504

Query: 339 R---TTQVTMRALKSHLNRSPSL-PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQT 394
           R          AL   L  SP   P  + +S+FHLL F+ +   L+ D  AL   V +Q 
Sbjct: 505 REYVGESQEHSALSKFLKLSPGQKPTGQEVSNFHLLCFIQQMGVLSKDEEALLCRVASQH 564

Query: 395 AVPEGYKL 402
            + + ++L
Sbjct: 565 DLADTFQL 572



 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 45/86 (52%), Gaps = 7/86 (8%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
          MLLR+R  DG+ R+ V+    T  +L  ++  QL   V+ +TL+    L  A +  D  +
Sbjct: 1  MLLRLRGPDGMARLNVE-PTTTFGDLGKLLLDQLPKTVDPKTLA----LSNAPNGGDSKR 55

Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY 86
            D+ N    +  + + HG ++FL+Y
Sbjct: 56 LGDIVN--FKVGQIGLKHGDLIFLTY 79


>gi|340517387|gb|EGR47631.1| nuclear pore complex, npl4 component [Trichoderma reesei QM6a]
          Length = 663

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 75/317 (23%), Positives = 123/317 (38%), Gaps = 61/317 (19%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G++YG   E  +V       V  IYEPPQ   +E+  I +   E +K VD IA   G
Sbjct: 273 QRLGYLYGRYAEYTKVPLGVKAIVEAIYEPPQ--VDEIDGITMNAWENQKEVDHIAKLCG 330

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
           ++ VG I+T  +   + D           Y LS+ E+  +    A+        +  ++ 
Sbjct: 331 LEPVGVIWTDLLDSGRGDGSVVCKRHLESYYLSSLEICFSSRLQAQYPKPSKWSDSGQFG 390

Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
           +  V   ++ +    I   A+QMS+  V + +    E             E DD  +S+ 
Sbjct: 391 SNFVTCVITGDEDGNIGISAYQMSNEAVEMVRADIVEPSADPNSMLVREEEEDDGSVSRT 450

Query: 299 K--------------KDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSST---FPIENRTT 341
           +                V    K    V+  F  +     D   P  +T   FPIENR  
Sbjct: 451 RYIPEVFYSKINEYGAKVQQNAKPAFPVEYLFVTLTHGFPDEPKPTFATQTGFPIENREY 510

Query: 342 -------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQT 394
                      RAL     + PS    + +S+FH+L F+ +   L+ D  AL   V  Q 
Sbjct: 511 IGESQEHSALARALNLRPGQKPS-DNGQEVSNFHILCFIHQLGVLSKDEEALLCRVATQH 569

Query: 395 AVPEGYKLLINSMAETV 411
            + + ++L      +T+
Sbjct: 570 DLADSFQLRSTGGWQTL 586


>gi|336272553|ref|XP_003351033.1| hypothetical protein SMAC_04337 [Sordaria macrospora k-hell]
          Length = 661

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 83/322 (25%), Positives = 129/322 (40%), Gaps = 72/322 (22%)

Query: 142 QYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEE 194
           +++N       +R G +YG  LE   V       V  IYEPPQ   +E+  + L   E E
Sbjct: 274 KFLNPWRMTGCQRLGILYGKYLEYDVVPLGVKAVVEAIYEPPQ--VDEIDGVTLNVWENE 331

Query: 195 KLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CN 242
           K V+ +A   G++ VG I+T  +   K D           Y L+ +E+  A  F A+   
Sbjct: 332 KDVNEVARLCGLEPVGVIWTDLLDSGKGDGSAICKRHTDSYFLAAQEICFAARFQAQHPK 391

Query: 243 MEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG-- 290
             +W          VT V+    +EEG  +I   A+QMS+  V + +    E     G  
Sbjct: 392 ATKWSDTGRFGSNFVTCVIS--GNEEGEISI--SAYQMSNDAVEMVRADIIEPSTDPGQM 447

Query: 291 -------DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST- 333
                  DD  +S+ +       + +   G +V+E     F V  +   L H  P S   
Sbjct: 448 LVREEEEDDGSVSRTRYIPEVFYRKINEYGANVQENAKPAFPVEYLFVTLTHGFPESPKP 507

Query: 334 ------FPIENRT-------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLN 380
                 FPI NR             + LK +  +S        +S+FHLL F+ +   L+
Sbjct: 508 VFTNDGFPIANREFVGEAQEASAVAKILKVN-QKSDQF----DVSNFHLLCFIHQMSVLS 562

Query: 381 SDVPALAQCVQAQTAVPEGYKL 402
            D  AL   V  Q  + + ++L
Sbjct: 563 KDEEALLCRVATQHDLADAFQL 584


>gi|46108668|ref|XP_381392.1| hypothetical protein FG01216.1 [Gibberella zeae PH-1]
          Length = 648

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 75/304 (24%), Positives = 121/304 (39%), Gaps = 56/304 (18%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G++YG  +E   V       V  IYEPPQ   +E+  I +   E +K +D +A   G
Sbjct: 272 QRLGYLYGKYVEYTEVPLGVKAVVEAIYEPPQ--VDELDGITMNAWESQKEIDKVAKQCG 329

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEWVTA---- 249
           ++ VG I+T  +     D           Y LS+ E+  +    A+     +W  +    
Sbjct: 330 LEPVGVIWTDLLDSGNGDGSVVCKRHADSYFLSSLEICFSSRLQAQYPKASKWSDSGNFG 389

Query: 250 --VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
              V   ++      I   ++QMS+  V + +    E             E DD  +S+ 
Sbjct: 390 SNFVTCIITGNEDGEISISSYQMSNESVEMVRADIVEPSADPNVMLVRDEEEDDGSVSRT 449

Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENR--- 339
           +       + V   G +V+E     F V  +   L H  P           FPIENR   
Sbjct: 450 RYIPEVFYRRVNEYGANVQENAKPSFPVEYLFVTLTHGFPADPKPMFTQQGFPIENREYV 509

Query: 340 TTQVTMRALKSHLNRSPSL-PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPE 398
                  AL   +  SP+  P  + +S+FHLL F+ +   L+ D  AL   V +Q    +
Sbjct: 510 GESQEHSALSKFVKISPNQKPTGQEVSNFHLLCFIQQMGVLSKDEEALLCRVASQHDFAD 569

Query: 399 GYKL 402
            ++L
Sbjct: 570 TFQL 573



 Score = 42.4 bits (98), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 11/88 (12%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQT--LSTNQNLLLAKSPSDL 58
          MLLR+R  DG+ R+TV+ A  T  +L  ++  QL   V+ +T  LS   N   +K   D+
Sbjct: 1  MLLRLRGPDGMARLTVE-ATTTFGDLGKLLLDQLPKTVDPKTIALSNAPNGGDSKRLGDI 59

Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
          + F         +  + + HG ++FL+Y
Sbjct: 60 INFK--------VGQIGLKHGDLIFLTY 79


>gi|380090800|emb|CCC04970.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 601

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 83/322 (25%), Positives = 129/322 (40%), Gaps = 72/322 (22%)

Query: 142 QYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEE 194
           +++N       +R G +YG  LE   V       V  IYEPPQ   +E+  + L   E E
Sbjct: 274 KFLNPWRMTGCQRLGILYGKYLEYDVVPLGVKAVVEAIYEPPQ--VDEIDGVTLNVWENE 331

Query: 195 KLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CN 242
           K V+ +A   G++ VG I+T  +   K D           Y L+ +E+  A  F A+   
Sbjct: 332 KDVNEVARLCGLEPVGVIWTDLLDSGKGDGSAICKRHTDSYFLAAQEICFAARFQAQHPK 391

Query: 243 MEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG-- 290
             +W          VT V+    +EEG  +I   A+QMS+  V + +    E     G  
Sbjct: 392 ATKWSDTGRFGSNFVTCVIS--GNEEGEISI--SAYQMSNDAVEMVRADIIEPSTDPGQM 447

Query: 291 -------DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST- 333
                  DD  +S+ +       + +   G +V+E     F V  +   L H  P S   
Sbjct: 448 LVREEEEDDGSVSRTRYIPEVFYRKINEYGANVQENAKPAFPVEYLFVTLTHGFPESPKP 507

Query: 334 ------FPIENRT-------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLN 380
                 FPI NR             + LK +  +S        +S+FHLL F+ +   L+
Sbjct: 508 VFTNDGFPIANREFVGEAQEASAVAKILKVN-QKSDQF----DVSNFHLLCFIHQMSVLS 562

Query: 381 SDVPALAQCVQAQTAVPEGYKL 402
            D  AL   V  Q  + + ++L
Sbjct: 563 KDEEALLCRVATQHDLADAFQL 584


>gi|157870668|ref|XP_001683884.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68126951|emb|CAJ05219.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 627

 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 67/301 (22%), Positives = 124/301 (41%), Gaps = 65/301 (21%)

Query: 130 VSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLED--KRVEVNFIYEPPQQGTEEVLYI 187
           V  D    + FQ +V + + + ++R   +YG + +D    +EV+ +YEP Q G+      
Sbjct: 295 VKVDAQSINIFQSFVAQ-MDWRLQRYAILYG-LYDDATHTIEVHAVYEPEQHGSTYTFEP 352

Query: 188 LRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVE----FHAECNM 243
           L D   +K V+ IA  LG+++VG   T   M++ +   L+ RE+L   +    +  EC +
Sbjct: 353 LPDAHLDK-VERIAKALGLRRVGVACTHP-MRDPEHILLNFRELLLCAKEQSRYGDECAL 410

Query: 244 EEWVTAVVKLEVS----------EEGSAA------------IHFEAFQMSDMCVRLFKEG 281
                A +    S            GSAA            +  +A+Q S  CV L++ G
Sbjct: 411 LTVAPAAMPSTKSGGATSAPGELPTGSAAASSGATLEPTIVVSCQAWQTSPQCVHLYRLG 470

Query: 282 WFET-----------------------EIAEGD-DPKLSKMKKDVVVGGKDVKEVDNDFF 317
             +                        E+A+ D DP          V      E+D  +F
Sbjct: 471 VLQESPSGEAALQDVEQARQVHCTMPLEVAQTDTDPS----GHQRFVTKAPSTEIDTRWF 526

Query: 318 LVVVKILDHQGPL-SSTFPIENRT--TQVTMRALKSHLN--RSPSLPLVKRISDFHLLLF 372
              + +   + P+    F   +R       ++ L+++++  +  S+   + I+DFH+L++
Sbjct: 527 TSYIAVQQFESPIVRGAFMRLSRPGMPPPVLQNLRNYISDPKRKSMSFAETIADFHVLVY 586

Query: 373 L 373
           L
Sbjct: 587 L 587


>gi|408398882|gb|EKJ78008.1| hypothetical protein FPSE_01796 [Fusarium pseudograminearum CS3096]
          Length = 648

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 75/304 (24%), Positives = 121/304 (39%), Gaps = 56/304 (18%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G++YG  +E   V       V  IYEPPQ   +E+  I +   E +K +D +A   G
Sbjct: 272 QRLGYLYGKYVEYTEVPLGVKAVVEAIYEPPQ--VDELDGITMNAWESQKEIDKVAKQCG 329

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEWVTA---- 249
           ++ VG I+T  +     D           Y LS+ E+  +    A+     +W  +    
Sbjct: 330 LEPVGVIWTDLLDSGNGDGSVVCKRHADSYFLSSLEICFSSRLQAQYPKASKWSDSGNFG 389

Query: 250 --VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
              V   ++      I   ++QMS+  V + +    E             E DD  +S+ 
Sbjct: 390 SNFVTCIITGNEDGEISISSYQMSNESVEMVRADIVEPSADPNVMLVRDEEEDDGSVSRT 449

Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENR--- 339
           +       + V   G +V+E     F V  +   L H  P           FPIENR   
Sbjct: 450 RYIPEVFYRRVNEYGANVQENAKPSFPVEYLFVTLTHGFPADPKPMFTQQGFPIENREYV 509

Query: 340 TTQVTMRALKSHLNRSPSL-PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPE 398
                  AL   +  SP+  P  + +S+FHLL F+ +   L+ D  AL   V +Q    +
Sbjct: 510 GESQEHSALSKFVKISPNQKPTGQEVSNFHLLCFIQQMGVLSKDEEALLCRVASQHDFAD 569

Query: 399 GYKL 402
            ++L
Sbjct: 570 TFQL 573



 Score = 42.4 bits (98), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 11/88 (12%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQT--LSTNQNLLLAKSPSDL 58
          MLLR+R  DG+ R+TV+ A  T  +L  ++  QL   V+ +T  LS   N   +K   D+
Sbjct: 1  MLLRLRGPDGMARLTVE-ATTTFGDLGKLLLDQLPKTVDPKTIALSNAPNGGDSKRLGDI 59

Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
          + F         +  + + HG ++FL+Y
Sbjct: 60 INFK--------VGQIGLKHGDLIFLTY 79


>gi|322705619|gb|EFY97203.1| NPL4 family protein [Metarhizium anisopliae ARSEF 23]
          Length = 644

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 78/303 (25%), Positives = 121/303 (39%), Gaps = 58/303 (19%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R GF+YG   E   V       V  IYEPPQ   +E+  I L     +K V+ IA   G
Sbjct: 274 QRIGFLYGRYAEYTEVPLGVKAVVEAIYEPPQ--VDELDGITLNAWGNQKEVEHIAQLCG 331

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
           ++ VG I+T  +   + D           Y LS+ E+  +    A+        +   + 
Sbjct: 332 LEPVGVIWTDLLDAGRGDGSVVCKRHADSYFLSSLEICFSSRLQAQHPKPSKWSDTGRYG 391

Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
           +  V   ++      I   ++QMS+  V + +    E             E DD  +S+ 
Sbjct: 392 SNFVTCIITGNEQGEIGISSYQMSNEAVEMVRADIVEPSADPTVMLVRDEEEDDGSVSRT 451

Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPIENR--- 339
           +       + V   G +V+E     F V  +   L H  P         + FPIENR   
Sbjct: 452 RYIPEVFYRKVNEYGANVQENAKPSFPVEYLFVTLTHGFPADPKPTFTQTGFPIENREYI 511

Query: 340 TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEG 399
                  AL   LN S +    + +S+FHLL FL +   L+ D  AL   V +Q  + E 
Sbjct: 512 GESQEHSALSKALNLSDT---GQEVSNFHLLCFLHQMGILSKDEEALLCRVASQHDLAES 568

Query: 400 YKL 402
           ++L
Sbjct: 569 FQL 571


>gi|312067660|ref|XP_003136847.1| NPL4 family protein [Loa loa]
          Length = 503

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 69/322 (21%), Positives = 130/322 (40%), Gaps = 73/322 (22%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG----GFMYGTVLEDKRVEVNF-- 172
           + RQ   H +++S +       ++ VN  LAF  K G    G+M G     + + +    
Sbjct: 147 LNRQRFRHVDNISIEN------EEVVNRFLAFWRKTGHQRMGYMIGRYEPFQEIPLGIKA 200

Query: 173 ----IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQ-----------TI 217
               IYEPPQ  + + +  L  + +EK+VD + + L +K+VG+IFT              
Sbjct: 201 TVAAIYEPPQSCSADSV-CLEADPQEKMVDELCSYLNLKRVGWIFTDLWSADSSKGTVCC 259

Query: 218 MQNKKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQM 270
            ++K  + L+ +E + A     +       C    + +    +  S      I F  +Q+
Sbjct: 260 TRHKDSFFLTAQECITAGWLQNKYPNITTFCTDGYFGSKFTTVVASGNEWNQIDFSGYQV 319

Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK----------DVVVGGKDVKEVDN------ 314
           S+ C  L +       +     P+L+ +++          DV    K+   V+       
Sbjct: 320 SNQCASLVEANL----LCPTSHPELAYLREVPLTPSQYITDVYYMEKNEYGVETRKNGRP 375

Query: 315 ---DFFLVVVKILDHQGPLSS-------TFPIENRTTQVTM---RALKSHLNRSPSLPLV 361
              ++ LV V     + P ++        FP ENR     +   +A+  +++       +
Sbjct: 376 MPVEYLLVDVPAGMPKEPHATFNISKKCNFPTENRVLIGELQDSKAVGRYISEFSQNQFI 435

Query: 362 KRISDFHLLLFLARFLDLNSDV 383
           +  ++FH LLFL     LN+D+
Sbjct: 436 ELATNFHFLLFL-----LNNDI 452


>gi|393910799|gb|EFO27216.2| NPL4 family protein [Loa loa]
          Length = 548

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 69/322 (21%), Positives = 130/322 (40%), Gaps = 73/322 (22%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG----GFMYGTVLEDKRVEVNF-- 172
           + RQ   H +++S +       ++ VN  LAF  K G    G+M G     + + +    
Sbjct: 161 LNRQRFRHVDNISIEN------EEVVNRFLAFWRKTGHQRMGYMIGRYEPFQEIPLGIKA 214

Query: 173 ----IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQ-----------TI 217
               IYEPPQ  + + +  L  + +EK+VD + + L +K+VG+IFT              
Sbjct: 215 TVAAIYEPPQSCSADSV-CLEADPQEKMVDELCSYLNLKRVGWIFTDLWSADSSKGTVCC 273

Query: 218 MQNKKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQM 270
            ++K  + L+ +E + A     +       C    + +    +  S      I F  +Q+
Sbjct: 274 TRHKDSFFLTAQECITAGWLQNKYPNITTFCTDGYFGSKFTTVVASGNEWNQIDFSGYQV 333

Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK----------DVVVGGKDVKEVDN------ 314
           S+ C  L +       +     P+L+ +++          DV    K+   V+       
Sbjct: 334 SNQCASLVEANL----LCPTSHPELAYLREVPLTPSQYITDVYYMEKNEYGVETRKNGRP 389

Query: 315 ---DFFLVVVKILDHQGPLSS-------TFPIENRTTQVTM---RALKSHLNRSPSLPLV 361
              ++ LV V     + P ++        FP ENR     +   +A+  +++       +
Sbjct: 390 MPVEYLLVDVPAGMPKEPHATFNISKKCNFPTENRVLIGELQDSKAVGRYISEFSQNQFI 449

Query: 362 KRISDFHLLLFLARFLDLNSDV 383
           +  ++FH LLFL     LN+D+
Sbjct: 450 ELATNFHFLLFL-----LNNDI 466


>gi|322697947|gb|EFY89721.1| NPL4 family protein [Metarhizium acridum CQMa 102]
          Length = 650

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 79/306 (25%), Positives = 121/306 (39%), Gaps = 58/306 (18%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R GF+YG   E   V       V  IYEPPQ   +E+  I L     +K V+ IA   G
Sbjct: 274 QRIGFLYGRYAEYTEVPLGVKAVVEAIYEPPQ--VDELDGITLNAWGNQKEVEHIAQLCG 331

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
           ++ VG I+T  +   + D           Y LS+ E+  +    A+        +   + 
Sbjct: 332 LEPVGVIWTDLLDAGRGDGSVICKRHADSYFLSSLEICFSSRLQAQHPKPSKWSDTGRYG 391

Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
           +  V   ++      I   ++QMS+  V + +    E             E DD  +S+ 
Sbjct: 392 SNFVTCIITGNEQGEIGISSYQMSNEAVEMVRADIVEPSADPTVMLVRDEEEDDGSVSRT 451

Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPIENR--- 339
           +       + V   G +V+E     F V  +   L H  P         + FPIENR   
Sbjct: 452 RYIPEVFYRKVNEYGANVQENAKPSFPVEYLFVTLTHGFPAEPKPTFTQTGFPIENREYI 511

Query: 340 TTQVTMRALKSHLNRSP---SLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAV 396
                  AL   LN S    S    + +S+FHLL FL +   L+ D  AL   V +Q  +
Sbjct: 512 GESQEHSALSKALNLSSAQKSSDTGQEVSNFHLLCFLHKMGILSKDEEALLCRVASQHDL 571

Query: 397 PEGYKL 402
            E ++L
Sbjct: 572 AESFQL 577


>gi|389631719|ref|XP_003713512.1| hypothetical protein MGG_04413 [Magnaporthe oryzae 70-15]
 gi|374095430|sp|A4RN19.2|NPL4_MAGO7 RecName: Full=Nuclear protein localization protein 4
 gi|351645845|gb|EHA53705.1| hypothetical protein MGG_04413 [Magnaporthe oryzae 70-15]
          Length = 669

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 133/332 (40%), Gaps = 69/332 (20%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEK 195
           ++N       +R G MYG   E + V       V  IYEPPQ   +EV  + L   + EK
Sbjct: 274 FINTWRRTGGQRYGIMYGKYSEYEEVPLGIKAVVQAIYEPPQ--VDEVDGVSLNSWDNEK 331

Query: 196 LVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNM 243
            V+ +A   G++ VG I+T  +     D           Y LS+ EV  A    A+    
Sbjct: 332 DVNQVARLCGLEPVGAIWTDLLDAGAGDGSVVCKRHADSYFLSSLEVCFAARLQAQHPKP 391

Query: 244 EEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
            +W          VT ++    +E+G  AI   ++Q+S+  V + +    E         
Sbjct: 392 SKWSDTGRFGSNFVTCIIS--GNEQGEIAI--SSYQVSNEAVEMVRADIMEPSADPTVML 447

Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKE-------VDNDFFLVVVKILDHQGPL 330
               E DD   S+ +       + +   G +V+E       V+  F  +     D   P+
Sbjct: 448 VREEEEDDGSTSRTRYIPDVFYRRINEYGANVQENAKPSFPVEYLFVTLTHGFPDVAKPM 507

Query: 331 SS---TFPIENRTT-------QVTMRALKSHLNRSP-SLPLVKRISDFHLLLFLARFLDL 379
            S    FPIENR             +ALK H   S  S     ++S+FHLL FL +   L
Sbjct: 508 FSDEGAFPIENREYMGESQEHSAAAKALKVHEKASSGSSKDGMKVSNFHLLCFLHQMSVL 567

Query: 380 NSDVPALAQCVQAQTAVPEGYKLLINSMAETV 411
           + D  +L   V  Q  + + ++L   +  +T+
Sbjct: 568 SKDEESLLCRVATQHDLADAFQLRSTTGWQTL 599


>gi|402588870|gb|EJW82803.1| NPL4 family protein, partial [Wuchereria bancrofti]
          Length = 492

 Score = 48.9 bits (115), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 69/322 (21%), Positives = 131/322 (40%), Gaps = 73/322 (22%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG----GFMYGTVLEDKRVEVNF-- 172
           + RQ   H +++S +       ++ VN  LAF  K G    G+M G     + + +    
Sbjct: 101 LNRQRFRHVDNISIEN------EEVVNRFLAFWRKTGHQRIGYMIGRYESFQEIPLGIKA 154

Query: 173 ----IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQ-----------TI 217
               IYEPPQ  + + +  L  + +EK+VD + + L +K+VG+IFT              
Sbjct: 155 TVAAIYEPPQSCSADSV-CLEADPQEKVVDELCSYLNLKRVGWIFTDLWSADSSKGTVYC 213

Query: 218 MQNKKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQM 270
            ++K  + L+ +E + A     +       C    + +    +  S      I F  +Q+
Sbjct: 214 TRHKDSFFLTAQECITAGWLQNKYPNITTFCTDGYFGSKFTTVVASGNEWNQIDFSGYQV 273

Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK----------DVVVGGKDVKEVDN------ 314
           S+ C  L +       +     P+L+ +++          DV    K+   V+       
Sbjct: 274 SNQCASLVE----ANLLCPTSHPELAYLREVPLTPSQYITDVYYMEKNEYGVETRKNGRP 329

Query: 315 ---DFFLVVVKILDHQGPLSS-------TFPIENRTTQVTM---RALKSHLNRSPSLPLV 361
              ++ LV V     + P ++        FP ENR     +   +A+  ++++      +
Sbjct: 330 MPVEYLLVDVPAGMPKEPHATFNISKKCYFPTENRILIGELQDSKAVGRYISKFSQNQFI 389

Query: 362 KRISDFHLLLFLARFLDLNSDV 383
           +  ++FH LLFL     LN+D+
Sbjct: 390 ELATNFHFLLFL-----LNNDI 406


>gi|310790815|gb|EFQ26348.1| NPL4 family protein [Glomerella graminicola M1.001]
          Length = 649

 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 122/307 (39%), Gaps = 60/307 (19%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G MYG   E + V       V  IYEPPQ   +EV  I L   E +  VD +A   G
Sbjct: 280 QRLGIMYGRYTEYEVVPLGIKAVVEAIYEPPQ--VDEVDGISLNAWENQNDVDEVARLCG 337

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
           +++VG IFT  +     D           Y LS++E+       A+        +  ++ 
Sbjct: 338 LQQVGVIFTDLLDAGAGDGSVVCKRHADSYFLSSQEICFISRLQAQHPKPTKWSDTGKFG 397

Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
           +  V   +S   S  I   A+Q S+  V + +    E             E DD   S+ 
Sbjct: 398 SNFVTCIISGNESGEISISAYQASNDAVEMVRADIVEPSADPNAMLVREEEEDDGSTSRT 457

Query: 299 K-------KDVVVGGKDVKE-------VDNDFFLVVVKILDHQGPLSST--FPIENRTT- 341
           +       + +   G +V+E       V+  F  +     DH  P  +   FPIENR   
Sbjct: 458 RYIPEVFYRRINEYGANVQENAKPTFPVEYLFVTLTHGFPDHSKPTFTQLGFPIENREYM 517

Query: 342 ------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
                     R LK    + P+   ++  S+FHLL F+ +   L+ D  +L   V +Q  
Sbjct: 518 GESQEHSAASRLLKFSGGQKPAEGGLE-TSNFHLLCFIHQMGILSKDEESLLCRVASQHD 576

Query: 396 VPEGYKL 402
           + + Y+L
Sbjct: 577 LADTYQL 583



 Score = 41.2 bits (95), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 48/88 (54%), Gaps = 11/88 (12%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPS--DL 58
          MLLR+R  DG+ R+T+D    +  EL      +L +P   +T+  N  + L+ +P+  D 
Sbjct: 1  MLLRLRGPDGMSRLTID-KDASFGEL-----GRLLLPTLPKTVDPN-TITLSNAPTGGDA 53

Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
             TD+A+    +  + + HG ++F+SY
Sbjct: 54 KNLTDIAS--FKVGQIGLKHGDLIFISY 79


>gi|389631717|ref|XP_003713511.1| hypothetical protein, variant [Magnaporthe oryzae 70-15]
 gi|351645844|gb|EHA53704.1| hypothetical protein, variant [Magnaporthe oryzae 70-15]
          Length = 607

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 133/332 (40%), Gaps = 69/332 (20%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEK 195
           ++N       +R G MYG   E + V       V  IYEPPQ   +EV  + L   + EK
Sbjct: 274 FINTWRRTGGQRYGIMYGKYSEYEEVPLGIKAVVQAIYEPPQ--VDEVDGVSLNSWDNEK 331

Query: 196 LVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNM 243
            V+ +A   G++ VG I+T  +     D           Y LS+ EV  A    A+    
Sbjct: 332 DVNQVARLCGLEPVGAIWTDLLDAGAGDGSVVCKRHADSYFLSSLEVCFAARLQAQHPKP 391

Query: 244 EEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
            +W          VT ++    +E+G  AI   ++Q+S+  V + +    E         
Sbjct: 392 SKWSDTGRFGSNFVTCIIS--GNEQGEIAI--SSYQVSNEAVEMVRADIMEPSADPTVML 447

Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKE-------VDNDFFLVVVKILDHQGPL 330
               E DD   S+ +       + +   G +V+E       V+  F  +     D   P+
Sbjct: 448 VREEEEDDGSTSRTRYIPDVFYRRINEYGANVQENAKPSFPVEYLFVTLTHGFPDVAKPM 507

Query: 331 SS---TFPIENRTT-------QVTMRALKSHLNRSP-SLPLVKRISDFHLLLFLARFLDL 379
            S    FPIENR             +ALK H   S  S     ++S+FHLL FL +   L
Sbjct: 508 FSDEGAFPIENREYMGESQEHSAAAKALKVHEKASSGSSKDGMKVSNFHLLCFLHQMSVL 567

Query: 380 NSDVPALAQCVQAQTAVPEGYKLLINSMAETV 411
           + D  +L   V  Q  + + ++L   +  +T+
Sbjct: 568 SKDEESLLCRVATQHDLADAFQLRSTTGWQTL 599


>gi|400601268|gb|EJP68911.1| NPL4 family protein [Beauveria bassiana ARSEF 2860]
          Length = 662

 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 75/306 (24%), Positives = 118/306 (38%), Gaps = 58/306 (18%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R G++YG   E   V       V  IYEPPQ    + +  L   E +K +D IA   G+
Sbjct: 272 QRLGYLYGRYAEYTEVPLGTKAVVEAIYEPPQVDEMDGV-TLNAWENQKDIDHIAKLCGL 330

Query: 207 KKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEFHAEC-NMEEWV------T 248
           + VG I+T  +            ++   Y LS+ E   A    A+     +W       +
Sbjct: 331 EPVGVIWTDLLDAGQGNGSVVCKRHADSYFLSSLETCFAARLQAQYPKATKWSDTGRFGS 390

Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKMK 299
           + V   VS      I   A+Q S+  V + +    E             E DD   S+ +
Sbjct: 391 SFVTCIVSGNEDGEIAISAYQASNDAVEMVRADIIEPSADPTQMLVRSEEEDDGSTSRTR 450

Query: 300 -------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPIENRTT-- 341
                  + +   G +V+E     F V  +   L H  P         + FPIENR    
Sbjct: 451 YIPEVFFRKINEYGANVQENAKPAFPVEYLFVTLTHGFPADPKPTFTQTDFPIENREYIG 510

Query: 342 -----QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAV 396
                    +AL     + PS    + +S+FHLL FL +   ++ D  AL   V +Q  +
Sbjct: 511 ESQEHTAVAKALGMKPGQKPS-DNCQEVSNFHLLCFLHQMGTMSKDEEALLCKVASQHDL 569

Query: 397 PEGYKL 402
            + ++L
Sbjct: 570 ADSFQL 575



 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 48/88 (54%), Gaps = 11/88 (12%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPS--DL 58
          MLLR+R  DG+ R+TV+    T  E+   + S L   V+ +T+S      ++ SP+  D 
Sbjct: 1  MLLRVRGPDGMARLTVEPT-TTFGEMGQQLLSHLPPTVDPKTIS------MSNSPTGGDA 53

Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
           +  D+ N    +  + +SHG ++F++Y
Sbjct: 54 KRLGDIVN--FKIGQIGLSHGDLIFITY 79


>gi|358395872|gb|EHK45259.1| hypothetical protein TRIATDRAFT_265090 [Trichoderma atroviride IMI
           206040]
          Length = 667

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 80/311 (25%), Positives = 123/311 (39%), Gaps = 68/311 (21%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G++YG   E  +V       V  IYEPPQ   +E+  I ++  E +K +D IA   G
Sbjct: 272 QRLGYLYGRYAEYTKVPLGVKAVVEAIYEPPQ--VDEIDGISMKAWENQKEIDHIAKLCG 329

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEW------- 246
           ++ VG I+T  +     D           Y LS+ E+  +    A+     +W       
Sbjct: 330 LEPVGAIWTDLLDSGHGDGSVVCKRHLDSYFLSSLEICFSSRLQAQYPKPSKWSDSGQFG 389

Query: 247 ---VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPK 294
              VT V+  +  EEG+ AI   A+QMS+  V + +    E             E DD  
Sbjct: 390 SNFVTCVITGD--EEGNIAI--SAYQMSNDAVEMVRADIVEPSADPNSMLVREEEEDDGS 445

Query: 295 LSKMK--------------KDVVVGGKDVKEVDNDFFLVVVKILDHQGPL--SSTFPIEN 338
           LS+ +                V    K    V+  F  +     D   P    + FPIEN
Sbjct: 446 LSRTRYIPEVFYSKVNEYGAKVQQNAKPAFPVEYLFVTLTHGFPDESKPTFTQTGFPIEN 505

Query: 339 RTTQVTMRALKSHLNRSPSLPLVKRISD-------FHLLLFLARFLDLNSDVPALAQCVQ 391
           R   +      S L R+ +L   ++ SD       FH+L F+     L+ D  AL   V 
Sbjct: 506 RE-YIGESQEHSALARALNLRSGQKASDNGQEVSNFHILCFIHTLGVLSKDEEALLCRVA 564

Query: 392 AQTAVPEGYKL 402
            Q  + + ++L
Sbjct: 565 TQHDLADSFQL 575



 Score = 41.6 bits (96), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 11/91 (12%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQT--LSTNQNLLLAKSPSDL 58
          ML+R+R  DG+ R+T+D    T  +L   + +QL   ++ +T  LS +     AK   D+
Sbjct: 1  MLIRLRGPDGMIRMTLDPT-TTFGDLGQQLLAQLPSTIDPKTIVLSNSPTGRDAKRLGDI 59

Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSYDGE 89
          +QF         +  + + HG ++FLSY  E
Sbjct: 60 VQFQ--------IGQIGLKHGDLIFLSYQHE 82


>gi|170579617|ref|XP_001894909.1| NPL4 family protein [Brugia malayi]
 gi|158598324|gb|EDP36238.1| NPL4 family protein [Brugia malayi]
          Length = 532

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 69/322 (21%), Positives = 130/322 (40%), Gaps = 73/322 (22%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG----GFMYGTVLEDKRVEVNF-- 172
           + RQ   H +++S + +      + VN  LAF  K G    G+M G     + + +    
Sbjct: 135 LNRQRFRHVDNISIENE------EVVNRFLAFWRKTGHQRIGYMIGRYESFQEIPLGIKA 188

Query: 173 ----IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQ-----------TI 217
               IYEPPQ  + + +  L  + +EK+VD + + L +K+VG+IFT              
Sbjct: 189 TVAAIYEPPQSCSADSV-CLEADPQEKVVDELCSYLNLKRVGWIFTDLWSADSSKGTVYC 247

Query: 218 MQNKKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQM 270
            ++K  + L+ +E + A     +       C    + +    +  S      I F  +Q+
Sbjct: 248 TRHKDSFFLTAQECITAGWLQNKYPNITTFCTDGYFGSKFTTVVASGNEWNQIDFSGYQV 307

Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK----------DVVVGGKDVKEVDN------ 314
           S+ C  L +       +     P+L+ +++          DV    K+   V+       
Sbjct: 308 SNQCASLVE----ANLLCPTSHPELAYLREVPLTPSQYITDVYYMEKNEYGVETRKNGRP 363

Query: 315 ---DFFLVVVKILDHQGPLSS-------TFPIENRTTQVTM---RALKSHLNRSPSLPLV 361
              ++ LV V     + P ++        FP ENR     +   +A+  +++       +
Sbjct: 364 MPVEYLLVDVPAGMPKEPHATFNISKKCYFPTENRILIGELQDSKAVGRYISEFSQNQFI 423

Query: 362 KRISDFHLLLFLARFLDLNSDV 383
           +  ++FH LLFL     LN+D+
Sbjct: 424 ELATNFHFLLFL-----LNNDI 440


>gi|344234048|gb|EGV65918.1| polyubiquitin-tagged protein recognition complex, Npl4 component
           [Candida tenuis ATCC 10573]
          Length = 568

 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 71/270 (26%), Positives = 112/270 (41%), Gaps = 52/270 (19%)

Query: 151 AVKRGGFMYGT------VLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
             +R G++YG+      V    + +V FIYEPPQ    + + ++  E EE  +DA+AA L
Sbjct: 258 GTQRFGYLYGSYEPFDQVPLGIKAKVEFIYEPPQACELDGITLIPWENEEA-IDALAAEL 316

Query: 205 GMKKVGFIFTQ-----------TIMQNKKDYTLSNREVLQAVEFHAE-------CNMEEW 246
            + KVG IFT               ++K  Y L+N EV  A     +        N  ++
Sbjct: 317 NLYKVGVIFTDLTDSGLKNGSVLCKRHKDSYFLTNIEVAMAARNQLKYAYSTKYSNDRKF 376

Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE-------IAEGDD------- 292
            +  V   VS   +  I   ++Q+S     L +     T        I E +D       
Sbjct: 377 SSRFVTCVVSGGLNGEIEPRSYQVSVNAEALLEADIVTTSTQPSMMYINESNDTRYVPDV 436

Query: 293 --PKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP----LSSTFPIENRT---TQV 343
              K+++   +V    K    V  D+ LV +     + P     S++F IE+R       
Sbjct: 437 FYSKINEYGLEVKTNAKPAFPV--DYLLVTLSDAMPKEPKPFFTSNSFVIEHRDFLGELQ 494

Query: 344 TMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
            ++AL +HLN       V  + DFHLL +L
Sbjct: 495 NLKALFNHLNNDIGDGSV--LLDFHLLAYL 522


>gi|268533954|ref|XP_002632107.1| Hypothetical protein CBG06960 [Caenorhabditis briggsae]
          Length = 527

 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 75/165 (45%), Gaps = 29/165 (17%)

Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
           Q+ VN+ L +      +R G++ G       V +        IYEPPQ   E+ +  L D
Sbjct: 134 QELVNQFLDYWRVSGHQRVGYLIGQYQPHPEVPLGIKATVAAIYEPPQHCREDGIEFLED 193

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQT-----------IMQNKKDYTLSNREVLQAVEF-H 238
           + E K+VD +   LG+++VG+IFT               ++K  + LS  E + A    +
Sbjct: 194 KNE-KIVDKLLEMLGLQRVGWIFTDCWTAARAEGTVHYTRHKDSFFLSAEECITAGMLQN 252

Query: 239 AECNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
           A  N+ ++       +  V +  S + S  ++F  +Q+S+ C  +
Sbjct: 253 AHPNVTDYSMDRRYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 297


>gi|341896803|gb|EGT52738.1| hypothetical protein CAEBREN_20908 [Caenorhabditis brenneri]
          Length = 524

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 75/165 (45%), Gaps = 29/165 (17%)

Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
           Q+ VN+ L +      +R G++ G       V +        IYEPPQ   E+ +  L D
Sbjct: 132 QELVNQFLDYWRLSGHQRVGYLIGQYQPHPEVPLGIKATVAAIYEPPQHCREDGVEFLED 191

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT-----------LSNREVLQAVEF-H 238
           + E K+VD +   LG+++VG+IFT     N+ + T           LS  E + A    +
Sbjct: 192 KNE-KVVDKLLEMLGLQRVGWIFTDCWTANRAEGTVHYTRHKDSFFLSAEECITAGMLQN 250

Query: 239 AECNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
           +  N+ ++       +  V +  S + S  ++F  +Q+S+ C  +
Sbjct: 251 SHPNVTDYSMDRRYGSKFVTVVASGDDSMHVNFHGYQVSNQCAAM 295


>gi|17534585|ref|NP_495093.1| Protein NPL-4.2, isoform b [Caenorhabditis elegans]
 gi|351050428|emb|CCD64971.1| Protein NPL-4.2, isoform b [Caenorhabditis elegans]
          Length = 529

 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 29/165 (17%)

Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
           Q+ VN+ L +      +R GF+ G       V +        IYEPPQ   E+ +  L D
Sbjct: 134 QELVNQFLDYWRLSGHQRVGFLIGQYQPHLEVPLGIKATVAAIYEPPQHCREDGIEFLED 193

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQN-----------KKDYTLSNREVLQAVEFH- 238
           + + K +D +   LG+++VG+IFT     N           K  + LS  E + A     
Sbjct: 194 KNQ-KTIDNLLEMLGLQRVGWIFTDCWTANSAEGTVHYTRHKDSFFLSAEECITAAMLQN 252

Query: 239 -----AECNME-EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
                 E +M+  + +  V +  S + S  ++F  +Q+S+ C  +
Sbjct: 253 QHPNITEYSMDRHYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 297


>gi|17534583|ref|NP_495094.1| Protein NPL-4.2, isoform a [Caenorhabditis elegans]
 gi|351050427|emb|CCD64970.1| Protein NPL-4.2, isoform a [Caenorhabditis elegans]
          Length = 527

 Score = 47.4 bits (111), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 29/165 (17%)

Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
           Q+ VN+ L +      +R GF+ G       V +        IYEPPQ   E+ +  L D
Sbjct: 134 QELVNQFLDYWRLSGHQRVGFLIGQYQPHLEVPLGIKATVAAIYEPPQHCREDGIEFLED 193

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQN-----------KKDYTLSNREVLQAVEFH- 238
           + + K +D +   LG+++VG+IFT     N           K  + LS  E + A     
Sbjct: 194 KNQ-KTIDNLLEMLGLQRVGWIFTDCWTANSAEGTVHYTRHKDSFFLSAEECITAAMLQN 252

Query: 239 -----AECNME-EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
                 E +M+  + +  V +  S + S  ++F  +Q+S+ C  +
Sbjct: 253 QHPNITEYSMDRHYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 297


>gi|429856201|gb|ELA31125.1| endoplasmic reticulum and nuclear membrane proteinc [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 654

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 77/307 (25%), Positives = 124/307 (40%), Gaps = 60/307 (19%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G+MYG   E + V       V  IYEPPQ   +E+  I L   E +K +D +A   G
Sbjct: 283 QRLGYMYGRYTEYEVVPLGIKAVVEAIYEPPQ--VDELDGISLNGWENQKDIDDVARLCG 340

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNMEEWVTA---- 249
           ++K+G I+T  +     D           Y LS++E+       A+     +W  +    
Sbjct: 341 LQKIGVIWTDLLDAGAGDGSVVCKRHADSYFLSSQEICFISRLQAQHPKPTKWSDSGRFG 400

Query: 250 --VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
              V   +S   S  I   ++Q S+  V + +    E             E DD  +S+ 
Sbjct: 401 SNFVTCIISGNESGEISISSYQASNDAVEMVRADIIEPSADPNVMLVREEEEDDGSVSRT 460

Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENRTT- 341
           +       + +   G +V+E     F V  +   L H  P  S        FPIENR   
Sbjct: 461 RYIPEVFYRRINEYGANVQENAKPSFPVEYLFVTLTHGFPDESKPTFTQLGFPIENREYM 520

Query: 342 ------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
                     R LK    + P+   ++ +S+FHLL F+ +   L+ D  +L   V +Q  
Sbjct: 521 GESQEHSAVARILKFSGGQKPAEGGLE-MSNFHLLCFIHQMDILSKDEESLLCRVASQHD 579

Query: 396 VPEGYKL 402
           + + Y+L
Sbjct: 580 LADTYQL 586



 Score = 39.7 bits (91), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 52/112 (46%), Gaps = 14/112 (12%)

Query: 1   MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
           MLLR+R  DG+ R+T+D    T  EL   +   L   V+  T++    L  A +  D  +
Sbjct: 1   MLLRLRGPDGMSRLTIDKDE-TFGELGRQLLPNLPKTVDPSTIT----LSNAPTGGDAKK 55

Query: 61  FTDMANPDRPLSSLNISHGSIVFLSY-------DGERNVRGPSFNPAGSFGR 105
             D+ N    +  + + HG ++F++Y       +G  N   P+ +P  S  R
Sbjct: 56  LVDITN--FKIGQIGLKHGDLIFINYQHQDSLSNGATNGAQPTSHPLASSNR 105


>gi|17534581|ref|NP_495096.1| Protein NPL-4.1, isoform b [Caenorhabditis elegans]
 gi|351050426|emb|CCD64969.1| Protein NPL-4.1, isoform b [Caenorhabditis elegans]
          Length = 529

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 29/165 (17%)

Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
           Q+ VN+ L +      +R GF+ G       V +        IYEPPQ   E+ +  L D
Sbjct: 134 QELVNQFLDYWRLSGHQRVGFLIGQYQPHLEVPLGIKATVAAIYEPPQHCREDGIEFLED 193

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQN-----------KKDYTLSNREVLQAVEFH- 238
           + + K +D +   LG+++VG+IFT     N           K  + LS  E + A     
Sbjct: 194 KNQ-KTIDNLLEMLGLQRVGWIFTDCWTANSAEGTVHYTRHKDSFFLSAEECITAAMLQN 252

Query: 239 -----AECNME-EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
                 E +M+  + +  V +  S + S  ++F  +Q+S+ C  +
Sbjct: 253 QHPNITEYSMDRHYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 297


>gi|17534579|ref|NP_495097.1| Protein NPL-4.1, isoform a [Caenorhabditis elegans]
 gi|351050425|emb|CCD64968.1| Protein NPL-4.1, isoform a [Caenorhabditis elegans]
          Length = 527

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 29/165 (17%)

Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
           Q+ VN+ L +      +R GF+ G       V +        IYEPPQ   E+ +  L D
Sbjct: 134 QELVNQFLDYWRLSGHQRVGFLIGQYQPHLEVPLGIKATVAAIYEPPQHCREDGIEFLED 193

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQN-----------KKDYTLSNREVLQAVEFH- 238
           + + K +D +   LG+++VG+IFT     N           K  + LS  E + A     
Sbjct: 194 KNQ-KTIDNLLEMLGLQRVGWIFTDCWTANSAEGTVHYTRHKDSFFLSAEECITAAMLQN 252

Query: 239 -----AECNME-EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
                 E +M+  + +  V +  S + S  ++F  +Q+S+ C  +
Sbjct: 253 QHPNITEYSMDRHYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 297


>gi|85118482|ref|XP_965453.1| hypothetical protein NCU02680 [Neurospora crassa OR74A]
 gi|74618839|sp|Q7SH49.1|NPL4_NEUCR RecName: Full=Nuclear protein localization protein 4
 gi|28927262|gb|EAA36217.1| hypothetical protein NCU02680 [Neurospora crassa OR74A]
 gi|38567196|emb|CAE76489.1| probable nuclear protein localization factor and ER translocation
           component NPL4 [Neurospora crassa]
          Length = 651

 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 72/313 (23%), Positives = 120/313 (38%), Gaps = 54/313 (17%)

Query: 142 QYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEE 194
           +++N       +R G +YG  LE   V       V  IYEPPQ   +E+  + L   E E
Sbjct: 264 KFLNPWRMTGCQRLGILYGKYLEYDVVPLGVKAVVEAIYEPPQ--VDEIDGVTLNAWENE 321

Query: 195 KLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE--- 240
           K V+ +A   G++ VG I+T  +   K D           Y L+ +E+  A    A+   
Sbjct: 322 KDVNEVARLCGLEPVGVIWTDLLDAGKGDGSAICKRHSDSYFLAAQEICFAARLQAQHPK 381

Query: 241 ----CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG------ 290
                +   + +  V   +S      I   A+QMS+  V + +    E     G      
Sbjct: 382 PTKWSDTGRFGSNFVTCVISGNEQGEISISAYQMSNDAVEMVRADIIEPSTDPGQMLVRE 441

Query: 291 ---DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSSTFPIEN 338
              DD  +S+ +       + +   G +V+E     F V  +   L H  P S      N
Sbjct: 442 EEEDDGSVSRTRYIPEVFYRKINEYGANVQENAKPAFPVEYLFVTLTHGFPESPRPVFTN 501

Query: 339 RTTQVTMRALKSHLNRSPSLPLVKRI---------SDFHLLLFLARFLDLNSDVPALAQC 389
               +  R        + S+  + ++         S+FHLL F+ +   L+ D  AL   
Sbjct: 502 DGFPIANREFVGEAQEASSVAKILKVNQKSDQFDVSNFHLLCFIRQMSVLSKDEEALLCR 561

Query: 390 VQAQTAVPEGYKL 402
           V  Q  + + ++L
Sbjct: 562 VATQHDLADAFQL 574


>gi|350296839|gb|EGZ77816.1| nuclear protein localization protein 4 [Neurospora tetrasperma FGSC
           2509]
          Length = 651

 Score = 47.0 bits (110), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 123/318 (38%), Gaps = 64/318 (20%)

Query: 142 QYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEE 194
           +++N       +R G +YG  LE   V       V  IYEPPQ   +E+  + L   E E
Sbjct: 264 KFLNPWRMTGCQRLGILYGKYLEYDVVPLGVKAVVEAIYEPPQ--VDEIDGVTLNAWENE 321

Query: 195 KLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE--- 240
           K V+ +A   G++ VG I+T  +   K D           Y L+ +E+  A    A+   
Sbjct: 322 KDVNEVARLCGLEPVGVIWTDLLDAGKGDGSAVCKRHTDSYFLAAQEICFAARLQAQHPK 381

Query: 241 ----CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG------ 290
                +   + +  V   +S      I   A+QMS+  V + +    E     G      
Sbjct: 382 PTKWSDTGRFGSNFVTCVISGNEQGEISISAYQMSNDAVEMVRADIIEPSTDPGQMLVRE 441

Query: 291 ---DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST----- 333
              DD  +S+ +       + +   G +V+E     F V  +   L H  P S       
Sbjct: 442 EEEDDGSVSRTRYIPEVFYRKINEYGANVQENAKPAFPVEYLFVTLTHGFPESPRPVFTN 501

Query: 334 --FPIENRT-------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
             FPI NR             + LK +  +S        +S+FHLL F+ +   L+ D  
Sbjct: 502 DGFPIANREFVGEAQEASAVAKILKVN-QKSDQF----DVSNFHLLCFIRQMSVLSKDEE 556

Query: 385 ALAQCVQAQTAVPEGYKL 402
           AL   V  Q  + + ++L
Sbjct: 557 ALLCRVATQHDLADAFQL 574


>gi|358388811|gb|EHK26404.1| hypothetical protein TRIVIDRAFT_167022 [Trichoderma virens Gv29-8]
          Length = 593

 Score = 47.0 bits (110), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 80/311 (25%), Positives = 125/311 (40%), Gaps = 68/311 (21%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G++YG   E   V       V  IYEPPQ   +EV  I +     +K +D IA   G
Sbjct: 273 QRLGYLYGRYTEYTEVPLGVKAVVEAIYEPPQ--VDEVDGISMNAWGNQKDIDHIAKLCG 330

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEW------- 246
           ++ VG I+T  +     D           Y LS+ E+  +    A+  N  +W       
Sbjct: 331 LEPVGVIWTDLLDSGLGDGSVVCKRHLESYYLSSLEICFSSRLQAQYPNPSKWSDSGQFG 390

Query: 247 ---VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPK 294
              VT V+  +  E+G+ A+   A+QMS+  V + +    E             E DD  
Sbjct: 391 SKFVTCVITGD--EDGNIAV--SAYQMSNESVEMVRADIVEPSADPNSMLVREEEEDDGS 446

Query: 295 LSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPIEN 338
           +S+ +       + V   G  V+E     F V  +   L H  P         + FPIEN
Sbjct: 447 VSRTRYIPEVFYRKVNEYGAKVQENAKPAFPVEYLFVTLTHGFPADPKPTFTQTGFPIEN 506

Query: 339 RTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQ 391
           R             +AL     + PS    + +S+FH+L F+ +   L+ D  AL   V 
Sbjct: 507 REYIGESQEHSALAKALNLRKGQKPS-DNGQEVSNFHILCFIHQLGVLSKDEEALLCRVA 565

Query: 392 AQTAVPEGYKL 402
            Q  + + ++L
Sbjct: 566 TQHDLADSFQL 576


>gi|7504790|pir||T15260 hypothetical protein F59E12.4 - Caenorhabditis elegans
          Length = 601

 Score = 46.6 bits (109), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 45/187 (24%), Positives = 81/187 (43%), Gaps = 35/187 (18%)

Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF-- 172
           + RQ+  H +++  +       Q+ VN+ L +      +R GF+ G       V +    
Sbjct: 192 LNRQKFRHVDNIQIEN------QELVNQFLDYWRLSGHQRVGFLIGQYQPHLEVPLGIKA 245

Query: 173 ----IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQN-------- 220
               IYEPPQ   E+ +  L D+ + K +D +   LG+++VG+IFT     N        
Sbjct: 246 TVAAIYEPPQHCREDGIEFLEDKNQ-KTIDNLLEMLGLQRVGWIFTDCWTANSAEGTVHY 304

Query: 221 ---KKDYTLSNREVLQAVEFH------AECNME-EWVTAVVKLEVSEEGSAAIHFEAFQM 270
              K  + LS  E + A           E +M+  + +  V +  S + S  ++F  +Q+
Sbjct: 305 TRHKDSFFLSAEECITAAMLQNQHPNITEYSMDRHYGSKFVTVVASGDESMHVNFHGYQV 364

Query: 271 SDMCVRL 277
           S+ C  +
Sbjct: 365 SNQCAAM 371


>gi|336464738|gb|EGO52978.1| hypothetical protein NEUTE1DRAFT_105928 [Neurospora tetrasperma
           FGSC 2508]
          Length = 653

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 123/318 (38%), Gaps = 64/318 (20%)

Query: 142 QYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEE 194
           +++N       +R G +YG  LE   V       V  IYEPPQ   +E+  + L   E E
Sbjct: 266 KFLNPWRMTGCQRLGILYGKYLEYDVVPLGVKAVVEAIYEPPQ--VDEIDGVTLNAWENE 323

Query: 195 KLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE--- 240
           K V+ +A   G++ VG I+T  +   K D           Y L+ +E+  A    A+   
Sbjct: 324 KDVNEVARLCGLEPVGVIWTDLLDAGKGDGSAICKRHTDSYFLAAQEICFAARLQAQHPK 383

Query: 241 ----CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG------ 290
                +   + +  V   +S      I   A+QMS+  V + +    E     G      
Sbjct: 384 PTKWSDTGRFGSNFVTCVISGNEQGEISISAYQMSNDAVEMVRADIIEPSTDPGQMLVRE 443

Query: 291 ---DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST----- 333
              DD  +S+ +       + +   G +V+E     F V  +   L H  P S       
Sbjct: 444 EEDDDGSVSRTRYIPEVFYRKINEYGANVQENAKPAFPVEYLFVTLTHGFPESPRPVFTN 503

Query: 334 --FPIENRT-------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
             FPI NR             + LK +  +S        +S+FHLL F+ +   L+ D  
Sbjct: 504 DGFPIANREFVGEAQEASAVAKILKVN-QKSDQF----DVSNFHLLCFIRQMSVLSKDEE 558

Query: 385 ALAQCVQAQTAVPEGYKL 402
           AL   V  Q  + + ++L
Sbjct: 559 ALLCRVATQHDLADAFQL 576


>gi|403213522|emb|CCK68024.1| hypothetical protein KNAG_0A03410 [Kazachstania naganishii CBS
           8797]
          Length = 609

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 85/321 (26%), Positives = 128/321 (39%), Gaps = 67/321 (20%)

Query: 142 QYVNETLAFAVKRGGFMYGTV-------LEDKRVEVNFIYEPPQQGTEEVLYILRDE--E 192
           Q++    +  ++R  +MYGT        L  K V V  IYEPPQ   ++ L +  D+  +
Sbjct: 259 QFIESWRSTGMQRFAYMYGTYSRYDSTPLGIKAV-VEAIYEPPQHDEQDGLTMDMDQVTK 317

Query: 193 EEKLVDAIAAGLGMKKVGFIFTQT-----------IMQNKKDYTLSNREVLQAVEF---H 238
           E   VDA+A  +G+ +VGFIFT               ++K  Y L++ E + A      H
Sbjct: 318 EMDAVDALAFQMGLFRVGFIFTDLTDAGQGDGSVFCKRHKDSYFLTSFETIMAARHQMAH 377

Query: 239 AE-CNMEE-------WVTAVVKLEVSEE---GSAAIHFEAFQMSDMCVRLFKEGWFETEI 287
           A  C   E       +VT VV   +  E    S  +  +A  + D  +           I
Sbjct: 378 ANACKFSERGTFSSKFVTCVVSGNLEGEIDIASYQVSIDAEALVDAQMISGSTHPSMAYI 437

Query: 288 AEGDD----PKLSKMKKDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------- 333
            E  D    P+L  MKK+    G  VKE     F V  ++  L H  P  S         
Sbjct: 438 NETTDERYVPELFYMKKNEY--GLTVKENAKPAFPVDYLIVSLTHGFPKDSAAAAAAGAT 495

Query: 334 ---------FPIENRTT---QVTMRALKSHLNRSPSLP----LVKRISDFHLLLFLARFL 377
                    FP  NR         + LK +L  + +      L K++S+FH+LL++    
Sbjct: 496 AGKFATTMGFPWANRQIMGHSQDYQELKKYLEAATTAGDYNLLRKKLSNFHVLLYIHSLQ 555

Query: 378 DLNSDVPALAQCVQAQTAVPE 398
            L+ +  +L     A  A PE
Sbjct: 556 ILSHEEWSLLIQAAASGASPE 576


>gi|156082071|ref|XP_001608528.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148801099|gb|EDL42504.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 505

 Score = 46.6 bits (109), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 33/133 (24%), Positives = 65/133 (48%), Gaps = 6/133 (4%)

Query: 283 FETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQ 342
           F TE+   ++  L K K D+++    VK++D  F +  V IL H+   +S FP    +  
Sbjct: 369 FNTEVDLMNELYL-KCKSDILIEKIQVKQIDVLFCVNNVPILSHKSLYNSFFPYPKNSNY 427

Query: 343 VTMRALKSHLNRS--PSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQT---AVP 397
             +    ++++RS      +V    DF+ LLFL      + D+P + + V   +    +P
Sbjct: 428 YAILQRFNYMSRSLRNQKDMVSIFRDFNFLLFLTNIFSFDRDIPPICKAVNDASNAVGIP 487

Query: 398 EGYKLLINSMAET 410
           + Y  ++ S++++
Sbjct: 488 DQYVSILRSLSQS 500


>gi|308487254|ref|XP_003105823.1| hypothetical protein CRE_17909 [Caenorhabditis remanei]
 gi|308255279|gb|EFO99231.1| hypothetical protein CRE_17909 [Caenorhabditis remanei]
          Length = 523

 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 77/165 (46%), Gaps = 29/165 (17%)

Query: 141 QQYVNETLAF----AVKRGGFM---YGTVLE---DKRVEVNFIYEPPQQGTEEVLYILRD 190
           Q+ VN+ L +      +R G++   Y T LE     +  V  IYEPPQ   E+ +  L D
Sbjct: 132 QELVNQFLDYWRLSGHQRVGYLIGQYQTHLEVPLGIKATVAAIYEPPQHCREDGIEFLED 191

Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQT-----------IMQNKKDYTLSNREVLQAVEF-H 238
           +   K+VD +   LG+++VG+IFT               ++K  + LS  E + A    +
Sbjct: 192 KNA-KIVDKLLEMLGLQRVGWIFTDCWTAARAEGTVHYTRHKDSFFLSAEECITAGMLQN 250

Query: 239 AECNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
           A  N+ ++       +  V +  S + S  ++F  +Q+S+ C  +
Sbjct: 251 AHPNITDYSMDRHYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 295


>gi|156836568|ref|XP_001642339.1| hypothetical protein Kpol_242p4 [Vanderwaltozyma polyspora DSM
           70294]
 gi|190359919|sp|A7TTC4.1|NPL4_VANPO RecName: Full=Nuclear protein localization protein 4
 gi|156112844|gb|EDO14481.1| hypothetical protein Kpol_242p4 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 568

 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 49/87 (56%), Gaps = 10/87 (11%)

Query: 137 ADAFQQYVNETLAFAVKRGGFMYG-------TVLEDKRVEVNFIYEPPQQGTEEVLYILR 189
           +D   Q++    A  ++R G++YG       T L  K V V+ IYEPPQ   ++ L +  
Sbjct: 231 SDLINQFIESWRATGMQRFGYLYGSYEKYDSTPLGVKAV-VHAIYEPPQHDEQDGLTMDL 289

Query: 190 DEEEEKL--VDAIAAGLGMKKVGFIFT 214
           ++ EE++  VD IA  +G+ +VG IF+
Sbjct: 290 EQVEEEMQKVDQIAMSMGLLRVGLIFS 316


>gi|380478569|emb|CCF43518.1| nuclear protein localization protein 4 [Colletotrichum
           higginsianum]
          Length = 649

 Score = 45.8 bits (107), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 123/307 (40%), Gaps = 60/307 (19%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R G MYG   E + V       V  IYEPPQ   +E+  I L   E +K VD IA   G
Sbjct: 280 QRLGVMYGRYTEYEVVPLGIKAVVEAIYEPPQ--VDEMDGISLNAWENQKDVDEIARLCG 337

Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
           +++VG IFT  +     D           Y LS++E+       A+        +  ++ 
Sbjct: 338 LQQVGVIFTDLLDAGAGDGSVVCKRHADSYFLSSQEICFISRLQAQHPKPTKWSDTGKFG 397

Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
           +  V   +S   S  I   ++Q S+  V + +    E             E DD   S+ 
Sbjct: 398 SNFVTCIISGNESGEISISSYQASNDAVEMVRADIVEPSADPNAMLVREEEEDDGSTSRT 457

Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENRTT- 341
           +       + +   G +V+E     F V  +   L H  P  +        FPIENR   
Sbjct: 458 RYIPEVFYRRINEYGANVQENAKPSFPVEYLFVTLTHGFPDEAKPAFTQLGFPIENREYM 517

Query: 342 ------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
                     R LK    + P+   ++ +S+FHLL F+ +   L+ D  +L   V +Q  
Sbjct: 518 GESQEHSAAARILKFSGGQKPAEGGLE-MSNFHLLCFIRQMGILSKDEESLLCRVASQHD 576

Query: 396 VPEGYKL 402
           + + Y+L
Sbjct: 577 LADTYQL 583


>gi|440465534|gb|ELQ34853.1| hypothetical protein OOU_Y34scaffold00744g17 [Magnaporthe oryzae
           Y34]
 gi|440481024|gb|ELQ61651.1| hypothetical protein OOW_P131scaffold01166g17 [Magnaporthe oryzae
           P131]
          Length = 683

 Score = 45.8 bits (107), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 84/320 (26%), Positives = 125/320 (39%), Gaps = 82/320 (25%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEK 195
           ++N       +R G MYG   E + V       V  IYEPPQ   +EV  + L   + EK
Sbjct: 274 FINTWRRTGGQRYGIMYGKYSEYEEVPLGIKAVVQAIYEPPQ--VDEVDGVSLNSWDNEK 331

Query: 196 LVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNM 243
            V+ +A   G++ VG I+T  +     D           Y LS+ EV  A    A+    
Sbjct: 332 DVNQVARLCGLEPVGAIWTDLLDAGAGDGSVVCKRHADSYFLSSLEVCFAARLQAQHPKP 391

Query: 244 EEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
            +W          VT ++    +E+G  AI   ++Q+S+  V + +    E         
Sbjct: 392 SKWSDTGRFGSNFVTCII--SGNEQGEIAI--SSYQVSNEAVEMVRADIMEPSADPTVML 447

Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKE-------VDNDFFLVVVKILDHQGPL 330
               E DD   S+ +       + +   G +V+E       V+  F  +     D   P+
Sbjct: 448 VREEEEDDGSTSRTRYIPDVFYRRINEYGANVQENAKPSFPVEYLFVTLTHGFPDVAKPM 507

Query: 331 SS---TFPIENRTT-------QVTMRALKSHLNRSP-SLPLVKRISDFHLLLFL------ 373
            S    FPIENR             +ALK H   S  S     ++S+FHLL FL      
Sbjct: 508 FSDEGAFPIENREYMGESQEHSAAAKALKVHEKASSGSSKDGMKVSNFHLLCFLHQMSVL 567

Query: 374 --ARFLDLNSDVPALAQCVQ 391
             A FL L+     L +C Q
Sbjct: 568 SKASFLRLD-----LLRCFQ 582


>gi|154324156|ref|XP_001561392.1| hypothetical protein BC1G_00477 [Botryotinia fuckeliana B05.10]
          Length = 608

 Score = 45.4 bits (106), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 48/177 (27%), Positives = 75/177 (42%), Gaps = 35/177 (19%)

Query: 149 AFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIA 201
           A   +R G++YG   E   V       V  IYEPPQ   +E+  + L + E EK +D IA
Sbjct: 270 ATGAQRFGYLYGRYEEYTEVPLGVKAVVEAIYEPPQ--VDELDGVTLNEWESEKDIDEIA 327

Query: 202 AGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNM 243
              GM++VG I+T  I     D           Y LS+ E+  A    A+        + 
Sbjct: 328 RLCGMERVGVIWTDLIDSGVGDGTVLCKRHIDSYYLSSLEIAFAARLQAKHPKPTKWSDT 387

Query: 244 EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--------AEGDD 292
            ++ +  V   V+ + +  I   A+Q+S+  V + +    E           +EGDD
Sbjct: 388 GKFGSNFVTCVVTADENGGIAISAYQVSNSAVEMVRADIVEPSADPAVMIVRSEGDD 444


>gi|402078068|gb|EJT73417.1| hypothetical protein GGTG_10255 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 677

 Score = 45.4 bits (106), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 123/325 (37%), Gaps = 60/325 (18%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKL 196
           ++N       +R G +YG   E   V       V  IYEPPQ   E     L   + EK 
Sbjct: 275 FINAWRVSGGQRFGILYGKYTEYDVVPLGIKAVVEAIYEPPQV-CETDGVSLNPWDNEKD 333

Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNME 244
           V  +A   G++ VG ++T  +     D           Y LS+ EV  A    A+     
Sbjct: 334 VHEVARFCGLEPVGAVWTDLVDAGAGDGSVICKRHADSYYLSSLEVCFAARMQAQHPKSS 393

Query: 245 EWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AE 289
           +W       +  +   +S      I   A+Q+S+  V + +    E             E
Sbjct: 394 KWSDTGRFGSNFITCVISGNEQGEISISAYQVSNDAVEMVRADIVEPSADPNVMLVREEE 453

Query: 290 GDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSST 333
            DD   S+ +       + +   G +V+E     F V  +   L H  P         S 
Sbjct: 454 EDDGSQSRTRYIPEVFYRRINEYGANVQENAKPSFPVEYLYVTLTHGFPDAAKPIFTDSG 513

Query: 334 FPIENRTTQVTMRALKSHLNRSPSLPLVKR-------ISDFHLLLFLARFLDLNSDVPAL 386
           FPIENR     M   + H + S +L + ++       +SDFHLL FL +   L+ D   L
Sbjct: 514 FPIENREY---MGESQEHSSASKALKVHQKADTGGLQVSDFHLLCFLHQMGVLSKDEENL 570

Query: 387 AQCVQAQTAVPEGYKLLINSMAETV 411
              V  Q  + + ++L   +  +T+
Sbjct: 571 LCRVATQRDLADAFQLRSTTGWQTL 595


>gi|223590104|sp|A5DBC9.2|NPL4_PICGU RecName: Full=Nuclear protein localization protein 4
 gi|190344745|gb|EDK36486.2| hypothetical protein PGUG_00584 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 564

 Score = 45.4 bits (106), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 73/272 (26%), Positives = 109/272 (40%), Gaps = 57/272 (20%)

Query: 151 AVKRGGFMYGTV-LEDK-----RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
            V+R GFM+G   + DK     +  V  IYEPPQ G  + + +L  E +E LV  +A  L
Sbjct: 285 GVQRFGFMFGRYEVFDKVPLGIKAVVEAIYEPPQAGETDGITLLPWENQE-LVLKVAEKL 343

Query: 205 GMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAVEFHAE-------CNMEEW 246
            +  VG  FT          T++  ++K  Y LS  EVL A  F  E        N   +
Sbjct: 344 NLYPVGIAFTDLTDSGARNGTVLCKRHKDTYFLSCLEVLMAARFQIEHPNITKHSNSGRF 403

Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMS---------DMCVRLFKEGWFETEIAEGDD----- 292
            +  V   VS      I   +FQ+S         D+     +        ++G       
Sbjct: 404 SSKFVTCVVSGGLEGEIEPRSFQVSTNAEALVRADIITGSTQPSMLYINSSQGKRYVPDV 463

Query: 293 --PKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSST------FPIENRTTQVT 344
              K+++   +V    K    VD   FL+V   L    PL+ T      F IENR     
Sbjct: 464 FYSKINEYGLEVKTNAKPAFPVD---FLLVT--LSDAFPLNPTPRFTNGFTIENRDFMGN 518

Query: 345 ---MRALKSHLNRSPSLPLVKRISDFHLLLFL 373
              +RA   ++N  P       +S+FH +++L
Sbjct: 519 LQDLRAAYRYINSDPGNGSC--LSNFHFIVYL 548


>gi|402078067|gb|EJT73416.1| hypothetical protein, variant [Gaeumannomyces graminis var. tritici
           R3-111a-1]
          Length = 603

 Score = 45.4 bits (106), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 78/325 (24%), Positives = 123/325 (37%), Gaps = 60/325 (18%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKL 196
           ++N       +R G +YG   E   V       V  IYEPPQ   E     L   + EK 
Sbjct: 275 FINAWRVSGGQRFGILYGKYTEYDVVPLGIKAVVEAIYEPPQV-CETDGVSLNPWDNEKD 333

Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNME 244
           V  +A   G++ VG ++T  +     D           Y LS+ EV  A    A+     
Sbjct: 334 VHEVARFCGLEPVGAVWTDLVDAGAGDGSVICKRHADSYYLSSLEVCFAARMQAQHPKSS 393

Query: 245 EWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AE 289
           +W       +  +   +S      I   A+Q+S+  V + +    E             E
Sbjct: 394 KWSDTGRFGSNFITCVISGNEQGEISISAYQVSNDAVEMVRADIVEPSADPNVMLVREEE 453

Query: 290 GDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSST 333
            DD   S+ +       + +   G +V+E     F V  +   L H  P         S 
Sbjct: 454 EDDGSQSRTRYIPEVFYRRINEYGANVQENAKPSFPVEYLYVTLTHGFPDAAKPIFTDSG 513

Query: 334 FPIENRTTQVTMRALKSHLNRSPSLPLVKR-------ISDFHLLLFLARFLDLNSDVPAL 386
           FPIENR     M   + H + S +L + ++       +SDFHLL FL +   L+ D   L
Sbjct: 514 FPIENREY---MGESQEHSSASKALKVHQKADTGGLQVSDFHLLCFLHQMGVLSKDEENL 570

Query: 387 AQCVQAQTAVPEGYKLLINSMAETV 411
              V  Q  + + ++L   +  +T+
Sbjct: 571 LCRVATQRDLADAFQLRSTTGWQTL 595


>gi|320588312|gb|EFX00781.1| endoplasmic reticulum and nuclear membrane proteinc [Grosmannia
           clavigera kw1407]
          Length = 649

 Score = 45.4 bits (106), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 76/322 (23%), Positives = 123/322 (38%), Gaps = 68/322 (21%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDE-EEEK 195
           ++N       +R G++YG + E   V       V  IYEPPQ  T+E+  +  D   +E+
Sbjct: 281 FINAWRKSGSQRFGYLYGRLDEYDEVPLGVKAVVEAIYEPPQ--TDELDGLTLDAWSQEQ 338

Query: 196 LVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNM 243
            VD +A   G++K+G ++T        D           Y LS+ EV+ A    A+    
Sbjct: 339 AVDGVARLCGLEKLGVVWTDLTDAGAGDGTVVCKRHADSYYLSSLEVVFAARLQAQHPKA 398

Query: 244 EEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFE---------TEIA 288
             W       +  V   +S   +  I   A+QMS+  V + +    E             
Sbjct: 399 SRWSDTGRFGSDFVTCLISGNEAGEIGIAAYQMSNEAVEMVRADIVEPSAEPETMLVRAE 458

Query: 289 EGDDPKLSKMK-------KDVVVGGKDVKE-------VDNDFFLVVVKILDHQGPLSS-- 332
           E DD   S+ +       + +   G +V+E       V+  F  +     D   PL +  
Sbjct: 459 EEDDGASSRTRYIPEVFYRHINEYGANVQENAKPAFPVEYLFVTLTHGFPDQARPLFTDN 518

Query: 333 --TFPIENR----------TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLN 380
              FPIENR              T++A          L     +S+FHLL  L +   L+
Sbjct: 519 LAGFPIENREHIGESQEHTAVARTLKATSGGQTTDGDL----SVSNFHLLCALHQMEVLS 574

Query: 381 SDVPALAQCVQAQTAVPEGYKL 402
            D  +L   V  Q  + + ++L
Sbjct: 575 KDEESLLCRVATQHDLADKFQL 596


>gi|224002495|ref|XP_002290919.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972695|gb|EED91026.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 569

 Score = 45.1 bits (105), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 53/215 (24%), Positives = 88/215 (40%), Gaps = 42/215 (19%)

Query: 189 RDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREVLQAVEFHAECNMEE 245
           R E   +    +A  LGM+ +G+IF+       +D     +  R+ +   +   E     
Sbjct: 324 RKESATERAVRVANYLGMQPIGWIFSYADENRHEDGDALPVHGRDAIVGSKLQIE----- 378

Query: 246 WVTAVVKLEVSEEGSAAIHF---------EAFQMSDMCVRLFKEGWFETEIAEGDDPKLS 296
                +KL   E+G   I           EAFQ+SD+CV++  E      +    D KL+
Sbjct: 379 ----TMKLLGREDGGKFITLTLDGRLGATEAFQLSDVCVQMVAEDVLCVPVV---DEKLT 431

Query: 297 -----KMKKDVVVGGKDVKEVDNDFFLVVVKILDHQG--------PLSSTFPIENRTTQV 343
                K+K  V+V G++ K +D+   LV   +L H G        P+  +    + T  V
Sbjct: 432 STRFIKLKDPVLVSGEETKRLDSVLLLVNTAMLSHVGLYSGGEDAPVGGSVKKGSGTLLV 491

Query: 344 TMR-----ALKSHLNRSPSLPLVKRISDFHLLLFL 373
             R     AL++         L++++ DF  LL L
Sbjct: 492 KTRKRILSALENSNGSGSDGKLLQQLCDFDTLLAL 526


>gi|346321034|gb|EGX90634.1| protein NPL4 [Cordyceps militaris CM01]
          Length = 660

 Score = 45.1 bits (105), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 76/306 (24%), Positives = 118/306 (38%), Gaps = 58/306 (18%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
           +R GF+YG   E   V       V  IYEPPQ   +E+  + L   E +K VD IA   G
Sbjct: 272 QRLGFLYGRYAEYTEVPLGIKAVVEAIYEPPQ--VDEMDGVSLSAWENQKDVDYIAGLCG 329

Query: 206 MKKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEFHAE-------CNMEEWV 247
           ++ VG I+T  +            ++   Y LS+ E   +    A+        +  ++ 
Sbjct: 330 LEPVGVIWTDLLDAGQSNGSVVCKRHADSYFLSSLETCFSARMQAQYPKATKWSDTGKFG 389

Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
           ++ V   VS      I   A+Q S+  V + +    E             E DD   S+ 
Sbjct: 390 SSFVTCIVSGNEDGEIAISAYQASNDAVEMVRADIVEPSADPTQMLVRSEEEDDGSTSRT 449

Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENRT-- 340
           +       + V   G +V+E     F V  +   L H  P           FPIENR   
Sbjct: 450 RYIPEVFFRRVNEYGANVQENAKPAFPVEYLFVTLTHGFPADPKPTFTQPGFPIENRQYI 509

Query: 341 -TQVTMRALKSHLNRSPSLPL---VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAV 396
                  A+   L   P        + +S+FHLL FL +   ++ D  AL   V +Q  +
Sbjct: 510 GESQEHTAVAKALGMRPGQRTSENCQEVSNFHLLCFLHQMGTMSKDEEALLCKVASQHDL 569

Query: 397 PEGYKL 402
            + ++L
Sbjct: 570 ADSFQL 575



 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 49/88 (55%), Gaps = 11/88 (12%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPS--DL 58
          MLLRIR  DG+ R+TV+   +T  E+   + + L   V+ +T+S      ++ SPS  D 
Sbjct: 1  MLLRIRGPDGMVRLTVEPT-MTFGEMGQQLLTHLPPTVDPKTIS------MSNSPSGGDA 53

Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
           +  D+ N +  +  + + HG ++F++Y
Sbjct: 54 KRLGDIVNFE--IGRIGLKHGDLIFITY 79


>gi|146422541|ref|XP_001487207.1| hypothetical protein PGUG_00584 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 564

 Score = 44.7 bits (104), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 72/272 (26%), Positives = 110/272 (40%), Gaps = 57/272 (20%)

Query: 151 AVKRGGFMYGTV-LEDK-----RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
            V+R GFM+G   + DK     +  V  IYEPPQ G  + + +L  E +E LV  +A  L
Sbjct: 285 GVQRFGFMFGRYEVFDKVPLGIKAVVEAIYEPPQAGETDGITLLPWENQE-LVLKVAEKL 343

Query: 205 GMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAVEFHAE-------CNMEEW 246
            +  VG  FT          T++  ++K  Y LS  EVL A  F  E        N   +
Sbjct: 344 NLYPVGIAFTDLTDSGARNGTVLCKRHKDTYFLSCLEVLMAARFQIEHPNITKHSNSGRF 403

Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMS---------DMCVRLFKEGWFETEIAEGDDP---- 293
            +  V   VS      I   +FQ+S         D+     +        ++G       
Sbjct: 404 SSKFVTCVVSGGLEGEIEPRSFQVSTNAEALVRADIITGSTQPSMLYINSSQGKRYVPDV 463

Query: 294 ---KLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSST------FPIENRTTQVT 344
              K+++   +V    K    VD     ++V +LD   PL+ T      F IENR     
Sbjct: 464 FYLKINEYGLEVKTNAKPAFPVD----FLLVTLLD-AFPLNPTPRFTNGFTIENRDFMGN 518

Query: 345 ---MRALKSHLNRSPSLPLVKRISDFHLLLFL 373
              +RA   ++N  P       +S+FH +++L
Sbjct: 519 LQDLRAAYRYINSDPGNGSC--LSNFHFIVYL 548


>gi|452979305|gb|EME79067.1| hypothetical protein MYCFIDRAFT_79805 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 638

 Score = 44.7 bits (104), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 45/179 (25%), Positives = 74/179 (41%), Gaps = 31/179 (17%)

Query: 126 HCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYG------TVLEDKRVEVNFIYEPPQQ 179
           H E  SFD        ++++      V+R GF+YG       V    +  V  IYEPPQ 
Sbjct: 251 HVEFQSFD-----IVNRFIDFWRKSGVQRIGFLYGRYEAYDQVPLGTKAVVEAIYEPPQ- 304

Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD------------YTLS 227
           G E     + + E+E  +D +A   GM+++G +FT  I     +            Y LS
Sbjct: 305 GDEVDGVTMTEWEKEAQIDKVAELTGMQRLGVVFTDLISPENPEEGAAICKRHVDSYFLS 364

Query: 228 NREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFK 279
           + EV  A  + A+           ++ +  V   +S +    I   A+Q S+  V + +
Sbjct: 365 SLEVCFASRYQAKYPRPSKWSETGQFGSNFVTCVISGDSEGQIGIAAYQASNAAVEMVR 423


>gi|389582714|dbj|GAB65451.1| hypothetical protein PCYB_061830, partial [Plasmodium cynomolgi
           strain B]
          Length = 521

 Score = 44.3 bits (103), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 28/119 (23%), Positives = 59/119 (49%), Gaps = 5/119 (4%)

Query: 297 KMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRS- 355
           K K  V++   +VK++D  F +  + I+ H+   ++ FP  N +   T+    ++++RS 
Sbjct: 398 KCKSAVLIEKIEVKQIDVLFCVNNMPIVSHKSFYNTFFPYPNNSNYYTILQRFNYMSRSL 457

Query: 356 -PSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCV---QAQTAVPEGYKLLINSMAET 410
                +V    DF+ LLFL        D+P + + V   +    +P+ Y  ++ S++++
Sbjct: 458 HNQKDMVNIFRDFNFLLFLTNIFSFERDIPPICKAVNDSKNSVGIPDQYVSILRSLSQS 516


>gi|449299195|gb|EMC95209.1| hypothetical protein BAUCODRAFT_35205 [Baudoinia compniacensis UAMH
           10762]
          Length = 596

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 120/292 (41%), Gaps = 75/292 (25%)

Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
             +R G++YG   E + V       V  IYEPPQ   +E+  + L + + E  +D +A+ 
Sbjct: 273 GAQRVGYLYGHYEEYEVVPLGTKAVVEAIYEPPQ--ADEIDGVTLGEWDNEDSIDELASK 330

Query: 204 LGMKKVGFIFTQTIMQNKK------------DYTLSNREVLQAVEFHAEC-NMEEW---- 246
            G+++VG IFT  +  +               + LS+ E+L A  + A+     +W    
Sbjct: 331 CGLQRVGVIFTDLVRPDDPAQGNAICKRHIDSFFLSSLEILFASRYQAKYPRPSKWSETG 390

Query: 247 ------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIA--------EGDD 292
                 VT VV  +  EEG  AI   A+Q S+  V + +    E            E +D
Sbjct: 391 KFGSNFVTCVVSGD--EEGQIAI--SAYQASNSAVEMVRADIIEPSAEPSVMLVQDEDED 446

Query: 293 PKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPI 336
            +L +++       + +   G+ V+E     F V  ++  L H  P       + + F I
Sbjct: 447 NELGRVRYIPEVFYRRINEYGRSVQENAKPSFPVEYLLVTLTHGFPNEAAPVFVQNKFAI 506

Query: 337 ENR------------TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARF 376
           ENR             TQ+   + K  L+       V  +SDFHLL F+ + 
Sbjct: 507 ENREAIGEAQEPRDLQTQLKAGSNKVALDTPQG---VLAVSDFHLLCFVHQL 555


>gi|256077991|ref|XP_002575282.1| nuclear protein localization [Schistosoma mansoni]
 gi|256077993|ref|XP_002575283.1| nuclear protein localization [Schistosoma mansoni]
 gi|350644612|emb|CCD60660.1| nuclear protein localization, putative [Schistosoma mansoni]
 gi|350644613|emb|CCD60661.1| nuclear protein localization, putative [Schistosoma mansoni]
          Length = 692

 Score = 43.9 bits (102), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 42/157 (26%), Positives = 68/157 (43%), Gaps = 33/157 (21%)

Query: 170 VNFIYEPPQQGTEEVLYILR--DEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---- 223
           V  IYEP Q+ T + + ++   D    K V  +A  +G++ VG+IFT  + QN       
Sbjct: 277 VAVIYEPAQESTPKSVKLIGPLDSVLPKHVMEVAQRIGLQPVGWIFTDLVAQNSNGNGVV 336

Query: 224 ---------YTLSNREVLQAVEF---HAE--------CNMEEWVTAVVKLEVSEEGSAAI 263
                    + LS  E + A      H          C   ++ T VV    + + S  I
Sbjct: 337 KHFRGTVDTFFLSAEECITAAHLQNLHPNICRLSPDGCFGSKFTTVVV----TGDASNHI 392

Query: 264 HFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
           HFEA+Q+S+  + L K+      +   D P+L  +K+
Sbjct: 393 HFEAYQVSNQAMALVKDNII---VPTYDAPELGYVKE 426


>gi|365985301|ref|XP_003669483.1| hypothetical protein NDAI_0C05810 [Naumovozyma dairenensis CBS 421]
 gi|343768251|emb|CCD24240.1| hypothetical protein NDAI_0C05810 [Naumovozyma dairenensis CBS 421]
          Length = 587

 Score = 43.1 bits (100), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 74/293 (25%), Positives = 119/293 (40%), Gaps = 60/293 (20%)

Query: 151 AVKRGGFMYG-------TVLEDKRVEVNFIYEPPQQGTEE--VLYILRDEEEEKLVDAIA 201
            ++R  FMYG       T L  K + +  IYEPPQ   ++   L I +  +E   +D +A
Sbjct: 263 GMQRFAFMYGSYEKYDSTPLGIKAI-IEAIYEPPQHDEQDGLTLDIEQTRQEMDEIDKLA 321

Query: 202 AGLGMKKVGFIFTQT-----------IMQNKKDYTLSNREVLQAVEFHA-ECNMEEW--- 246
             +G+ +VG IFT               ++K  + LS+ EV+ A ++     N+ ++   
Sbjct: 322 KTMGLSRVGVIFTDLTDAGNGDGSVFCKRHKDSFFLSSLEVIMAAKYQKRHPNVSKYSEQ 381

Query: 247 ---VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFK----EGWFETEIAEGDD------- 292
               +  V   VS      I   ++Q+S     L       G     +A  ++       
Sbjct: 382 GIFSSKFVTCVVSGNTEGEIDISSYQVSTAAEALVDATMISGSTHPSMAYINETTNDRYV 441

Query: 293 PKLSKMKKDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST----------FPIENRT 340
           P++  MKK+    G  VKE     F V  ++  L H  P+  T          FP  NR 
Sbjct: 442 PEIFYMKKNEY--GLTVKENAKPAFPVDYLLVSLTHGFPIDDTVSGKFRTIQGFPWANRQ 499

Query: 341 T---QVTMRALKSHLNRSPSLP----LVKRISDFHLLLFLARFLDLNSDVPAL 386
           +       + LK ++ +S        L  RIS+FH LL++     L+ D  AL
Sbjct: 500 SMGQSQDYQELKKYIYQSAIAGDFNVLHDRISNFHFLLYVHTLQILSEDEWAL 552


>gi|429240358|ref|NP_595883.2| Cdc48-Npl4-Ufd1 complex component Npl4 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|384872595|sp|Q9P780.2|NPL4_SCHPO RecName: Full=Nuclear protein localization protein 4
 gi|347834321|emb|CAB88240.2| Cdc48-Npl4-Ufd1 complex component Npl4 (predicted)
           [Schizosaccharomyces pombe]
          Length = 572

 Score = 43.1 bits (100), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 43/187 (22%), Positives = 82/187 (43%), Gaps = 37/187 (19%)

Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRDEEEEKL 196
           ++N+      +R G+ YG   +   V +        IYEPPQ    + +  L +  +E L
Sbjct: 253 FLNKWRQSGFQRIGYTYGHFEQYNNVPLGIKGVIEAIYEPPQVSEADGV-TLEEWADEAL 311

Query: 197 VDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQAVEFHAEC-NME 244
           V+ +A   G++++G IFT          + + +   D Y LS+ EV  +  F  +  N  
Sbjct: 312 VEQVATACGLRRIGIIFTDLTDDGSNSGKVLCKRHSDSYFLSSLEVYNSANFQTKFKNPC 371

Query: 245 EW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
           +W          VT+V+   ++ E    I   ++Q+S++   L++    +  +    DP 
Sbjct: 372 KWSRSGYFGSKFVTSVISGNLNGE----IEVMSYQVSNIGTALYQADLIQPSV----DPD 423

Query: 295 LSKMKKD 301
              +KK+
Sbjct: 424 RMLVKKE 430


>gi|300120945|emb|CBK21187.2| unnamed protein product [Blastocystis hominis]
          Length = 444

 Score = 42.7 bits (99), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 37/169 (21%), Positives = 77/169 (45%), Gaps = 17/169 (10%)

Query: 169 EVNFIYE---PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT 225
           E+ F+Y    P Q+     + +L  +   K  DA+A  L ++ +G I+T+     K+  T
Sbjct: 185 EIAFVYSLYIPKQRFINGKIELLASDPYRKAADAVARRLDLRIIGAIYTRP--SQKERIT 242

Query: 226 LSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET 285
             + +++  +++  E    ++ +  V + V    S  I   A+ +SD C+ ++++G  E+
Sbjct: 243 AGDVDLIAKLQW-GERTRGDFCSRFVSVVVERNESREIEPFAYMLSDQCMCMYRDGMIES 301

Query: 286 EIAEGDDPKLSK-----------MKKDVVVGGKDVKEVDNDFFLVVVKI 323
                D   +             ++ D  +G ++V E +  FFLV + I
Sbjct: 302 PRKGSDLVHVRNRTPAGELLSVVIRNDAKLGSQEVTEFEAIFFLVELTI 350


>gi|254584574|ref|XP_002497855.1| ZYRO0F15048p [Zygosaccharomyces rouxii]
 gi|238940748|emb|CAR28922.1| ZYRO0F15048p [Zygosaccharomyces rouxii]
          Length = 572

 Score = 42.7 bits (99), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 50/101 (49%), Gaps = 11/101 (10%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
          M+LRIRS++G++RV+ D   +    ++ I+ S L   V   T++       A  P   L 
Sbjct: 1  MILRIRSKEGMQRVSCDSGDLFGTVVEKIL-SHLSPSVNPATITVGSQESTANEPIAGLV 59

Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY---DGERNVRGPSFN 98
          +       R ++ L ++HG IVF+ Y   +GE+     S N
Sbjct: 60 Y-------RSVADLGLNHGDIVFVKYQEKEGEQPKEAASVN 93


>gi|367009150|ref|XP_003679076.1| hypothetical protein TDEL_0A05330 [Torulaspora delbrueckii]
 gi|359746733|emb|CCE89865.1| hypothetical protein TDEL_0A05330 [Torulaspora delbrueckii]
          Length = 583

 Score = 42.4 bits (98), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 47/87 (54%), Gaps = 10/87 (11%)

Query: 137 ADAFQQYVNETLAFAVKRGGFMYG-------TVLEDKRVEVNFIYEPPQQGTEEVLYILR 189
           +D   +++        +R G+MYG       T L  K V V  IYEPPQ   ++ L +  
Sbjct: 243 SDLINEFIEAWRDTGTQRFGYMYGSYQTYDATPLGTKAV-VEAIYEPPQHDEQDGLTMDM 301

Query: 190 DEEEEKL--VDAIAAGLGMKKVGFIFT 214
           ++ E+++  VD +A  +G++++G IFT
Sbjct: 302 EQVEKEMDHVDNLAQQMGLQRLGLIFT 328


>gi|213403230|ref|XP_002172387.1| NPL4 [Schizosaccharomyces japonicus yFS275]
 gi|212000434|gb|EEB06094.1| NPL4 [Schizosaccharomyces japonicus yFS275]
          Length = 583

 Score = 42.4 bits (98), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 37/151 (24%), Positives = 63/151 (41%), Gaps = 24/151 (15%)

Query: 153 KRGGFMYG------TVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R G+MYG       VL   +  V  I +P Q    + + +    + E  VD +AA  G+
Sbjct: 273 QRIGYMYGRYQKYENVLLGIKAVVELIIDPAQVSESDGVTLQNPWDREAEVDKLAAACGL 332

Query: 207 KKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEFHAEC-NMEEWV------T 248
           ++VG IFT  +            ++   Y LS+ E   +  F ++  N  +W       +
Sbjct: 333 RRVGIIFTDLVDDGSGKGTVLCKRHAGSYFLSSLETYNSAYFQSQTPNPSKWSKSGHFGS 392

Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFK 279
             V   +S      I   A+Q+S+  V L +
Sbjct: 393 KFVTCVLSGNEKGEIEVMAYQVSNTAVALLQ 423


>gi|366989755|ref|XP_003674645.1| hypothetical protein NCAS_0B01870 [Naumovozyma castellii CBS 4309]
 gi|342300509|emb|CCC68271.1| hypothetical protein NCAS_0B01870 [Naumovozyma castellii CBS 4309]
          Length = 579

 Score = 42.0 bits (97), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 69/298 (23%), Positives = 127/298 (42%), Gaps = 67/298 (22%)

Query: 137 ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRD 190
           ++   Q++       ++R  +MYG+ +E        +  V  IYEPPQ   ++ L +  +
Sbjct: 239 SELINQFIESWRYTGMQRFAYMYGSYVEYDSTPLGIKALVEVIYEPPQHDEQDGLTMDME 298

Query: 191 E--EEEKLVDAIAAGLGMKKVGFIFTQT-----------IMQNKKDYTLSNREVLQAVEF 237
           +  +E +  D +AA +G+ ++G IFT               ++K  + LS+ EV+ + + 
Sbjct: 299 QVTKEMEQCDKMAAQMGLSRIGLIFTDLTDAGNGDGSVFCKRHKDSFFLSSLEVIMSAKH 358

Query: 238 ---------HAECNM--EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE----GW 282
                    ++E  +   ++VT VV   +  E    I   ++Q+S     L       G 
Sbjct: 359 QLRHPNTSKYSEQGIFSSKFVTCVVSGNLEGE----IDISSYQVSTDAEALVDASMIGGS 414

Query: 283 FETEIAEGDD-------PKLSKMKKDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST 333
               +A  ++       P++  M+K+    G  VKE     F V  ++  L H  P  ++
Sbjct: 415 THPSMAYINETTKDRYVPEIFYMRKNEY--GLTVKENAKPAFPVDYLLVSLTHGFPRDAS 472

Query: 334 -----------FPIENRTT---QVTMRALKSHLNRSPSL----PLVKRISDFHLLLFL 373
                      FP  NR +       + LK +L R+ +      L K+IS+FHLLL++
Sbjct: 473 ETTGKFLTVGGFPWANRQSMGQSQDYQELKKYLYRAATSGDFNELHKKISNFHLLLYI 530


>gi|397634455|gb|EJK71430.1| hypothetical protein THAOC_07131 [Thalassiosira oceanica]
          Length = 540

 Score = 41.6 bits (96), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 34/119 (28%), Positives = 58/119 (48%), Gaps = 13/119 (10%)

Query: 266 EAFQMSDMCVRLFKEGWFETEIAEGD--DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKI 323
           EAFQ+SD+CV++  EG  +    +      +  ++   VV+ G++  ++D    LV   +
Sbjct: 381 EAFQLSDVCVQMVAEGALKAPTVDESLQGTRSMELTDPVVISGEETTKLDTVLLLVNTAM 440

Query: 324 LDHQG--------PLSSTFPIENRTTQV-TMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
           L H G        P+  +    +    V T + L S L+RS S  ++K + DF +LL L
Sbjct: 441 LSHVGLYSGGEDAPVGGSVKKSSGALLVKTRKRLLSALDRSNS--VLKELCDFDVLLAL 497


>gi|398397621|ref|XP_003852268.1| hypothetical protein MYCGRDRAFT_109593 [Zymoseptoria tritici
           IPO323]
 gi|339472149|gb|EGP87244.1| hypothetical protein MYCGRDRAFT_109593 [Zymoseptoria tritici
           IPO323]
          Length = 613

 Score = 41.2 bits (95), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 73/293 (24%), Positives = 122/293 (41%), Gaps = 66/293 (22%)

Query: 144 VNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
           VN+ L F      +R GF+YG   E  +V       V  IYEPPQ   +EV  + L +  
Sbjct: 262 VNKFLNFWRGGGAQRIGFLYGHYEEYDQVPLGTKAVVEAIYEPPQ--VDEVDGVTLSEWP 319

Query: 193 EEKLVDAIAAGLGMKKVGFIFT---------QTIMQNKK--DYTLSNREVLQAVEFHAE- 240
            E  ++ +A   G++++G IFT         QT +  +    Y LS+ E++ A  + A+ 
Sbjct: 320 NEAQIEKLAGFCGLQRLGVIFTDLLRPEEGEQTAVCKRHIDSYFLSSLEIMFAARYQAQY 379

Query: 241 ------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE-------- 286
                     ++ +  V   +S +    I   A+Q S+  V +      E          
Sbjct: 380 PRPSKWSETGQFGSNFVTCVISGDAEGNIGISAYQASNAAVEMVTADIIEASADPSVMLV 439

Query: 287 IAEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------L 330
           I E ++  + + +       + +   G++V+E     F V  ++  L H  P       +
Sbjct: 440 IDEDEENAIGRKRYIPEVFYRRLNEYGRNVQENAKPSFPVEYLLVTLTHGFPNEVNPKFV 499

Query: 331 SSTFPIENRTTQVTM---RALKSHL----NRSPSLPLVK---RISDFHLLLFL 373
              F IENR     +   R L+ HL    NR  +L  V+    +SDFHLL ++
Sbjct: 500 KCDFAIENRDALGELQEQRDLQKHLRAGANRV-ALDTVEGTLAVSDFHLLGYV 551


>gi|452841970|gb|EME43906.1| hypothetical protein DOTSEDRAFT_71645 [Dothistroma septosporum
           NZE10]
          Length = 596

 Score = 41.2 bits (95), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 52/210 (24%), Positives = 85/210 (40%), Gaps = 34/210 (16%)

Query: 98  NPAGSFGRKMTMDDLIAK----QMRVTRQENPHCESVSFDR-DCADAFQQYVNETLAFAV 152
           NP    G     D + +K     + +  QE    + V F+  D  D F  +  +  A   
Sbjct: 215 NPECPGGHAPFPDGICSKCQPSAISLVPQEYRMVDHVEFETFDMVDKFIGFWRQGGA--- 271

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R GF+YG   E  +V       V  IYEPPQ    + +  L D   E  ++ +A   G+
Sbjct: 272 QRMGFLYGHYEEYDQVPLGTKAVVEAIYEPPQVDEPDGV-TLGDWNNENEIEQLAELSGL 330

Query: 207 KKVGFIFTQTIMQNKKD------------YTLSNREVLQAVEFHAEC-NMEEWV------ 247
           ++VG I+T  + Q + +            Y LS+ EV  A  + A+     +W       
Sbjct: 331 QRVGVIYTDLVQQTEPEKGLAVCKRHADSYYLSSLEVCFASRYQAKYPRPSKWSETGKFG 390

Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
           +  V   +S +    I   A+Q S+  V +
Sbjct: 391 SNFVTCVISGDSDGNIAISAYQASNAAVEM 420


>gi|312378145|gb|EFR24796.1| hypothetical protein AND_10383 [Anopheles darlingi]
          Length = 194

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 3/49 (6%)

Query: 334 FPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISDFHLLLFLARFLDL 379
           FP+ENR     ++   AL S+L +S S+  +  +SDFHLLLFL R  D+
Sbjct: 49  FPVENRLIDGHIQDFSALASYLAKSRSMDFLDVVSDFHLLLFLYRMEDM 97


>gi|302653597|ref|XP_003018622.1| hypothetical protein TRV_07382 [Trichophyton verrucosum HKI 0517]
 gi|291182280|gb|EFE37977.1| hypothetical protein TRV_07382 [Trichophyton verrucosum HKI 0517]
          Length = 773

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 46/91 (50%), Gaps = 16/91 (17%)

Query: 14  VTVD-GAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQFTDMANPDRPLS 72
           VT+D G  +T+A+LK +IQS+++IP +SQ L  N   L              A+  +PL 
Sbjct: 315 VTLDVGEDMTLADLKAVIQSEIQIPPQSQHLFHNNRPL--------------ADESKPLG 360

Query: 73  SLNISHGSIVFLSYDGERNVRGPS-FNPAGS 102
            L IS G ++ +         GP   NP+G+
Sbjct: 361 QLGISEGDMLGMHIRVPTPASGPGQGNPSGA 391


>gi|297725717|ref|NP_001175222.1| Os07g0514801 [Oryza sativa Japonica Group]
 gi|255677805|dbj|BAH93950.1| Os07g0514801 [Oryza sativa Japonica Group]
          Length = 182

 Score = 40.8 bits (94), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/38 (57%), Positives = 28/38 (73%), Gaps = 1/38 (2%)

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVV-GGKDVKE 311
           V LFK+G  + EI + +DP LSKM+K+VVV GGKD  E
Sbjct: 109 VALFKDGVLQAEIGDKEDPLLSKMRKEVVVAGGKDTME 146


>gi|453085771|gb|EMF13814.1| endoplasmic reticulum and nuclear membrane protein Npl4
           [Mycosphaerella populorum SO2202]
          Length = 644

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 23/121 (19%)

Query: 142 QYVNETLAFAVKRG----GFMYGTVLE------DKRVEVNFIYEPPQQGTEEVLYILRDE 191
           Q V+  L F  K G    GF+YG   E        +  V  IYEPPQ    + +  + + 
Sbjct: 259 QIVDRFLDFWRKSGAQRIGFLYGRYEEYDLVPLGTKAVVEAIYEPPQVNEPDGV-TMTEW 317

Query: 192 EEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD------------YTLSNREVLQAVEFHA 239
           E E  VD +A   G+++VG IFT  +  +  +            Y LS+ EV  A  + A
Sbjct: 318 ENEAAVDKLAEFSGLQRVGIIFTDLLAPDSPEQGTAVCKRHVDSYFLSSLEVCLAARYQA 377

Query: 240 E 240
           +
Sbjct: 378 K 378


>gi|363755074|ref|XP_003647752.1| hypothetical protein Ecym_7082 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356891788|gb|AET40935.1| hypothetical protein Ecym_7082 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 573

 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 78/338 (23%), Positives = 134/338 (39%), Gaps = 63/338 (18%)

Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
           + + RQE    + V F    ++   ++++      ++R G++YGT  +        +  V
Sbjct: 210 ISLQRQEFRMVDHVEFQH--SELVNEFIDSWRTTGMQRFGYLYGTYSKYTNTPLGIKAVV 267

Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKL----VDAIAAGLGMKKVGFIFTQT---------- 216
             I+EPPQ   ++ L +  D E+  L    VD +A+ +G+ +VG IFT            
Sbjct: 268 EAIWEPPQHDEQDGLTM--DIEQVALEVGHVDNLASEMGLMRVGMIFTDLTDSGNGDGTV 325

Query: 217 -IMQNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAF 268
              ++K  + LS+ EV+ A +        C   +   + +      VS      I   A+
Sbjct: 326 FCKRHKDSFFLSSLEVIMAAKHQLNAPNACRFSQQSRFSSKFFTCVVSGNLEGEIDISAY 385

Query: 269 QMSDMCVRLFK----EGWFETEIAEGDDPKLSKMKKDVVVGGKD-----VKEVDNDFFLV 319
           Q+S     L       G     +A  ++   ++   D+    K+     VKE     F V
Sbjct: 386 QVSTDAEALVDSDIISGSTHPSMAYINETTHNRYVPDIFYMRKNEYNITVKENAKPAFPV 445

Query: 320 --VVKILDHQGPLSST-----------FPIENRTT---QVTMRALKSHL----NRSPSLP 359
             ++  L H  P+ S            FP  NR           LK ++    N S    
Sbjct: 446 DYLIVSLTHGFPVPSASNAPKFHRHTGFPWTNRQALGQSQDYLELKRYILPTANSSDLFK 505

Query: 360 LVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVP 397
           L +++S+FHLLL++     LNS   +L   + A T  P
Sbjct: 506 LQQQLSNFHLLLYIHSLHILNSKEWSL--LLAATTCTP 541


>gi|170042583|ref|XP_001849000.1| nuclear protein localization 4 [Culex quinquefasciatus]
 gi|167866113|gb|EDS29496.1| nuclear protein localization 4 [Culex quinquefasciatus]
          Length = 192

 Score = 40.4 bits (93), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 330 LSSTFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISDFHLLLFLAR 375
           +S  FP+ENR     ++   AL  +L RS S+  +  +SDFHLLLFL R
Sbjct: 45  VSQYFPVENRLIDGHIQDFSALSDYLARSRSMDFLDAMSDFHLLLFLYR 93


>gi|222637133|gb|EEE67265.1| hypothetical protein OsJ_24441 [Oryza sativa Japonica Group]
          Length = 178

 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/38 (57%), Positives = 28/38 (73%), Gaps = 1/38 (2%)

Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVV-GGKDVKE 311
           V LFK+G  + EI + +DP LSKM+K+VVV GGKD  E
Sbjct: 109 VALFKDGVLQAEIGDKEDPLLSKMRKEVVVAGGKDTME 146


>gi|302414408|ref|XP_003005036.1| NPL4 [Verticillium albo-atrum VaMs.102]
 gi|261356105|gb|EEY18533.1| NPL4 [Verticillium albo-atrum VaMs.102]
          Length = 250

 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 11/88 (12%)

Query: 1  MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQ--TLSTNQNLLLAKSPSDL 58
          MLLR+R  DG+ R+TVD  +    +L   +   L   V+    TLS N N   +K   D+
Sbjct: 1  MLLRVRGPDGMSRLTVD-QNDNFGDLGRQLLPNLPTTVDPTTITLSNNPNGNDSKRLGDI 59

Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
            F         L  + + HG ++F++Y
Sbjct: 60 ANFK--------LGQIGLKHGDLIFITY 79


>gi|401826979|ref|XP_003887582.1| nuclear pore protein [Encephalitozoon hellem ATCC 50504]
 gi|392998588|gb|AFM98601.1| nuclear pore protein [Encephalitozoon hellem ATCC 50504]
          Length = 505

 Score = 38.9 bits (89), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 38/173 (21%), Positives = 74/173 (42%), Gaps = 32/173 (18%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R GF+ G  +E + +       V+ I+EP Q+   +   I         ++ +  G G+
Sbjct: 201 QRFGFLIGKYMEHEMIPLGTKAVVSGIWEPDQEDYPDGFVITGP------IEDLFTGTGL 254

Query: 207 KKVGFIFTQTIMQNK--------KDYTLSNREV--------LQAVEFHAECNMEEWVTAV 250
             VG I+T   M+N         +DY LS+ E+        +         N  E+ +  
Sbjct: 255 GIVGMIYTDISMENGSVTSDKIIRDYFLSSLEIEFIAKMQLMHPYVLKDGGNEMEFGSRF 314

Query: 251 VKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVV 303
           V +  + +    I  + +Q+S+ C+ L K  W    I   ++PK+   ++D++
Sbjct: 315 VTIIATVDKDGNIGLQEYQVSNQCMALVKGSW----ILPTENPKMLLARRDIL 363


>gi|52081992|ref|YP_080783.1| transcriptional regulator Slr [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|319647856|ref|ZP_08002074.1| slr protein [Bacillus sp. BT1B_CT2]
 gi|404490872|ref|YP_006714978.1| HTH-type transcriptional regulator SlrR [Bacillus licheniformis DSM
           13 = ATCC 14580]
 gi|423683996|ref|ZP_17658835.1| transcriptional regulator Slr [Bacillus licheniformis WX-02]
 gi|52005203|gb|AAU25145.1| transcriptional regulator Slr [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|52349881|gb|AAU42515.1| HTH-type transcriptional regulator SlrR [Bacillus licheniformis DSM
           13 = ATCC 14580]
 gi|317390197|gb|EFV71006.1| slr protein [Bacillus sp. BT1B_CT2]
 gi|383440770|gb|EID48545.1| transcriptional regulator Slr [Bacillus licheniformis WX-02]
          Length = 152

 Score = 38.9 bits (89), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 14/94 (14%)

Query: 181 TEEVLYILRDEEEE---KLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTL-SNREVLQAVE 236
           TE +LY   ++E+E    LV A+ AG+  +++ F FT  + Q KK+  L  NRE+ +A  
Sbjct: 64  TEVILYRRTEQEDEWRTHLVQAVQAGMNQEEL-FQFTHRLKQKKKERALYRNRELTRA-- 120

Query: 237 FHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQM 270
                N++EW    + LE  E G      EAF M
Sbjct: 121 -----NIDEW--RALMLEAKEMGLTVKEVEAFLM 147


>gi|449329529|gb|AGE95800.1| protein involved in er translocation [Encephalitozoon cuniculi]
          Length = 505

 Score = 38.5 bits (88), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 79/172 (45%), Gaps = 32/172 (18%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R GF+ G  ++ + +       V+ I+EP Q+   +   I         +DA  +G G+
Sbjct: 201 QRFGFLVGRYMDHEMIPLGTKAVVSGIWEPEQEDYPDGFVITGS------MDAPFSGTGL 254

Query: 207 KKVGFIFTQTIMQN--------KKDYTLSNREV-----LQAVEFHA--ECNME-EWVTAV 250
           + VG I+T  +M+          K Y LS+ E+     +Q +  H   +   E E+ + +
Sbjct: 255 EIVGMIYTDILMERGSVTSDKVAKGYFLSSLEIEFIAKMQLMHPHTVRDGGREMEFGSRL 314

Query: 251 VKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDV 302
             + V+ E   +I  + +Q+S+ C+ L K G+    I   +DP+     +D+
Sbjct: 315 ATIVVTAEKDGSIGLQEYQVSNQCMALVKGGY----ILPTEDPRRFLATRDI 362


>gi|19074499|ref|NP_586005.1| PROTEIN INVOLVED IN ER TRANSLOCATION [Encephalitozoon cuniculi
           GB-M1]
 gi|19069141|emb|CAD25609.1| PROTEIN INVOLVED IN ER TRANSLOCATION [Encephalitozoon cuniculi
           GB-M1]
          Length = 505

 Score = 38.5 bits (88), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 79/172 (45%), Gaps = 32/172 (18%)

Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
           +R GF+ G  ++ + +       V+ I+EP Q+   +   I         +DA  +G G+
Sbjct: 201 QRFGFLVGRYMDHEMIPLGTKAVVSGIWEPEQEDYPDGFVITGS------MDAPFSGTGL 254

Query: 207 KKVGFIFTQTIMQN--------KKDYTLSNREV-----LQAVEFHA--ECNME-EWVTAV 250
           + VG I+T  +M+          K Y LS+ E+     +Q +  H   +   E E+ + +
Sbjct: 255 EIVGMIYTDILMERGSVTSDKVAKGYFLSSLEIEFIAKMQLMHPHTVRDGGREMEFGSRL 314

Query: 251 VKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDV 302
             + V+ E   +I  + +Q+S+ C+ L K G+    I   +DP+     +D+
Sbjct: 315 ATIVVTAEKDGSIGLQEYQVSNQCMALVKGGY----ILPTEDPRRFLATRDI 362


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.134    0.380 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,918,132,237
Number of Sequences: 23463169
Number of extensions: 236421254
Number of successful extensions: 562097
Number of sequences better than 100.0: 404
Number of HSP's better than 100.0 without gapping: 85
Number of HSP's successfully gapped in prelim test: 319
Number of HSP's that attempted gapping in prelim test: 561067
Number of HSP's gapped (non-prelim): 709
length of query: 411
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 266
effective length of database: 8,957,035,862
effective search space: 2382571539292
effective search space used: 2382571539292
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)