BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015194
(411 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224068755|ref|XP_002302817.1| predicted protein [Populus trichocarpa]
gi|118484264|gb|ABK94012.1| unknown [Populus trichocarpa]
gi|222844543|gb|EEE82090.1| predicted protein [Populus trichocarpa]
Length = 411
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/410 (80%), Positives = 378/410 (92%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
+++RIRSRDGLERV++D ++TV++LKT+IQ+QL+IP+ SQTLSTNQ+LLLAKSPSDLL+
Sbjct: 2 VMVRIRSRDGLERVSIDNTNITVSQLKTLIQNQLQIPIRSQTLSTNQSLLLAKSPSDLLK 61
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
FTDM+NPD PLSSL+ISHGS++FL+YDGER + GP+ PAGSFGRKMT+DDLIAKQMRVT
Sbjct: 62 FTDMSNPDTPLSSLSISHGSLIFLAYDGERTIAGPAVRPAGSFGRKMTIDDLIAKQMRVT 121
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQENPHC+SVSFDRDCA+AFQ YVNETLAFAVKRGGFMYGTV E+ +VEV+FIYEPPQQG
Sbjct: 122 RQENPHCDSVSFDRDCANAFQHYVNETLAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQG 181
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
TEEVL +LRD +EEKLV+AI A LGM++VGFIFTQTIMQ+KKDYTLS+REVLQA E HAE
Sbjct: 182 TEEVLMLLRDSDEEKLVEAITACLGMRRVGFIFTQTIMQDKKDYTLSHREVLQAAELHAE 241
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
++EWVTAVVKLEVSE+G A +HFEAFQMSDMC+RLFKEGWFETEI E DPKLSKMKK
Sbjct: 242 SELKEWVTAVVKLEVSEDGGADVHFEAFQMSDMCIRLFKEGWFETEIGEDADPKLSKMKK 301
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
DVVVG KDVKEVDNDFFLVVVKILDH GPLSSTFPIENR TQVTMRAL+SHL+R+ +LP
Sbjct: 302 DVVVGTKDVKEVDNDFFLVVVKILDHLGPLSSTFPIENRITQVTMRALRSHLDRAKNLPF 361
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
VKRISDFHLLLFLA++LDLNSDVPALA+CV AQTAVPEGY++LI SMA T
Sbjct: 362 VKRISDFHLLLFLAKYLDLNSDVPALAECVLAQTAVPEGYQILIESMATT 411
>gi|356505489|ref|XP_003521523.1| PREDICTED: NPL4-like protein 1-like [Glycine max]
Length = 413
Score = 672 bits (1735), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/412 (78%), Positives = 363/412 (88%), Gaps = 2/412 (0%)
Query: 1 MLLRIRSRDGLERVTVDGAH-VTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDGLERV+V+ H TV+ LK II++QL IPV +QTLSTNQNLLLAKS DL
Sbjct: 1 MMLRIRSRDGLERVSVENPHSTTVSYLKRIIETQLGIPVHNQTLSTNQNLLLAKSLEDLH 60
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRV 119
+FTDM+NPD PLSSLN+ HGS+VFL+Y+GER V GP+FNPAGSFGRKMTMDDLIAKQMRV
Sbjct: 61 RFTDMSNPDAPLSSLNLGHGSMVFLAYEGERRVAGPAFNPAGSFGRKMTMDDLIAKQMRV 120
Query: 120 TRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQ 179
TRQENPHCE VSFDRDCA+AFQ YVN+TLAFAVKRGGFMYGTV E+ +VEV+FIYEPPQQ
Sbjct: 121 TRQENPHCELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGTVSEEGKVEVDFIYEPPQQ 180
Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHA 239
G+E+ L RD EEEK V+AIA GLGM KVGFIFTQTI Q+KKDYTLSNREVLQAVE+HA
Sbjct: 181 GSEDNLVFFRDTEEEKFVEAIAVGLGMTKVGFIFTQTITQDKKDYTLSNREVLQAVEYHA 240
Query: 240 ECNMEEWVTAVVKLEVSEE-GSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKM 298
E ++EWVTAVVKLE +EE G A +HFEAFQMSD+CVRLFKEGWFETEI E DDPKLSKM
Sbjct: 241 ESGLKEWVTAVVKLEANEEMGGADVHFEAFQMSDVCVRLFKEGWFETEIKEDDDPKLSKM 300
Query: 299 KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSL 358
KKDVVVG KD +EVDNDFFLVVVKI DHQGPLSS+FPIENR TQV M+ALK+HL+R+ +L
Sbjct: 301 KKDVVVGVKDTREVDNDFFLVVVKIADHQGPLSSSFPIENRNTQVPMKALKNHLDRTKNL 360
Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
P VKRISDFHLLL LAR LDLN+DVPAL +CVQ QT+VPEGY++LI SMA T
Sbjct: 361 PFVKRISDFHLLLVLARVLDLNADVPALTECVQTQTSVPEGYQILIESMAST 412
>gi|356572673|ref|XP_003554491.1| PREDICTED: NPL4-like protein 1-like [Glycine max]
Length = 413
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/412 (78%), Positives = 362/412 (87%), Gaps = 2/412 (0%)
Query: 1 MLLRIRSRDGLERVTVDGA-HVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDGLERV+V+ TV++LK IIQ+QL IPV +QTLSTNQNLLLAKS DLL
Sbjct: 1 MMLRIRSRDGLERVSVENPLATTVSDLKRIIQTQLGIPVHNQTLSTNQNLLLAKSLEDLL 60
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRV 119
+FTDM+N D LSSLN+ HGSIVFL+Y+GER V GP+FNPAGSFGRKMTMDDLIAKQMRV
Sbjct: 61 RFTDMSNLDASLSSLNLGHGSIVFLAYEGERRVAGPAFNPAGSFGRKMTMDDLIAKQMRV 120
Query: 120 TRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQ 179
TRQENPHCE VSFDRDCA+AFQ YVN+TLAFAVKRGGFMYGTV E +VEV+FIYEPPQQ
Sbjct: 121 TRQENPHCELVSFDRDCANAFQHYVNDTLAFAVKRGGFMYGTVSEVGKVEVDFIYEPPQQ 180
Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHA 239
G+E+ L RD EEEK V+AIA GLGM+KVGFIFTQTI Q+KKDYTLSNREVLQAVE+HA
Sbjct: 181 GSEDNLVFFRDPEEEKFVEAIAVGLGMRKVGFIFTQTITQDKKDYTLSNREVLQAVEYHA 240
Query: 240 ECNMEEWVTAVVKLEVSEE-GSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKM 298
E ++EW TAVVKLEV+EE G A +HFEAFQMSD+CVRLFKEGWFETEI E DDPKLSKM
Sbjct: 241 ESGLKEWATAVVKLEVNEEMGGADVHFEAFQMSDVCVRLFKEGWFETEIKEDDDPKLSKM 300
Query: 299 KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSL 358
KKDVVVG KD +EVDNDFFLVVVKI DHQGPLSS+FPIENR TQVTM+ALK+HL RS +L
Sbjct: 301 KKDVVVGVKDTREVDNDFFLVVVKIADHQGPLSSSFPIENRNTQVTMKALKNHLERSKNL 360
Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
P VKRISDFHLLL LAR LDLN+DVPAL +CVQ QT+VPEGY++LI SMA T
Sbjct: 361 PFVKRISDFHLLLVLARVLDLNADVPALTECVQTQTSVPEGYQILIESMAST 412
>gi|449456429|ref|XP_004145952.1| PREDICTED: NPL4-like protein 1-like [Cucumis sativus]
Length = 411
Score = 657 bits (1695), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/408 (76%), Positives = 360/408 (88%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M+LRIRSRDGLERV V+ H+T+A+LK IIQSQL+IP+ +QTLSTNQN+LLAK+ DL +
Sbjct: 1 MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQLKIPIHNQTLSTNQNILLAKTQDDLSK 60
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
FTDM+NP+ LSSLN+SHGSIVFL+Y+GER V GP+ +PAGSFGRKMTMDDLIAKQMR+T
Sbjct: 61 FTDMSNPNTYLSSLNLSHGSIVFLAYEGERTVAGPTVHPAGSFGRKMTMDDLIAKQMRIT 120
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQENPHCE VSFDRDCA+AFQ YVNETLAFAVKRGG MYGTV + +VEV+FIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGMMYGTVSPEGKVEVDFIYEPPQQG 180
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
TE+ L RD +EE+LV+AIA GLGM+KVGFIFTQTI Q+KKDYTLSNREVLQA +FH+E
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKE WFET+I E DPKLSKMKK
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKECWFETDIGEDFDPKLSKMKK 300
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
DVVVG KD ++VDNDFFLVVVKILDHQGPLS+TFPIENR VTM+ALK+HL+RS LP
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKILDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLPF 360
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
VKRISDFHLLL LAR LD++SDVPALA+CVQ QT VPEGYK+LI SMA
Sbjct: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTGVPEGYKILIESMA 408
>gi|449497385|ref|XP_004160387.1| PREDICTED: LOW QUALITY PROTEIN: NPL4-like protein 1-like [Cucumis
sativus]
Length = 411
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/408 (76%), Positives = 360/408 (88%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M+LRIRSRDGLERV V+ H+T+A+LK IIQSQL+IP+ +QTLSTNQN+LLAK+ DL +
Sbjct: 1 MMLRIRSRDGLERVAVENPHITIAQLKAIIQSQLKIPIHNQTLSTNQNILLAKTQDDLSK 60
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
FTDM+NP+ LSSLN+SHGSIVFL+Y+GER V GP+ +PAGSFGRKMTMDDLIAKQMR+T
Sbjct: 61 FTDMSNPNTYLSSLNLSHGSIVFLAYEGERTVAGPTVHPAGSFGRKMTMDDLIAKQMRIT 120
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQENPHCE VSFDRDCA+AFQ YVNETLAFAVKRGG MYGTV + +VEV+FIYEPPQQG
Sbjct: 121 RQENPHCELVSFDRDCANAFQHYVNETLAFAVKRGGXMYGTVSPEGKVEVDFIYEPPQQG 180
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
TE+ L RD +EE+LV+AIA GLGM+KVGFIFTQTI Q+KKDYTLSNREVLQA +FH+E
Sbjct: 181 TEDNLLFFRDHDEERLVEAIAVGLGMRKVGFIFTQTISQDKKDYTLSNREVLQAAQFHSE 240
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKE WFET+I E DPKLSKMKK
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKECWFETDIGEDFDPKLSKMKK 300
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
DVVVG KD ++VDNDFFLVVVKILDHQGPLS+TFPIENR VTM+ALK+HL+RS LP
Sbjct: 301 DVVVGVKDTRDVDNDFFLVVVKILDHQGPLSTTFPIENRNVPVTMKALKNHLDRSKGLPF 360
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
VKRISDFHLLL LAR LD++SDVPALA+CVQ QT VPEGYK+LI SMA
Sbjct: 361 VKRISDFHLLLLLARVLDVSSDVPALAECVQTQTGVPEGYKILIESMA 408
>gi|225436827|ref|XP_002270748.1| PREDICTED: NPL4-like protein 2 [Vitis vinifera]
Length = 411
Score = 655 bits (1691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 306/408 (75%), Positives = 360/408 (88%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
ML+RIRSRDGLER+T+D H T+++LKT+IQ L IP+ SQTLSTNQNLLLAK+PSD+ +
Sbjct: 1 MLIRIRSRDGLERLTLDNPHATISQLKTLIQQHLHIPISSQTLSTNQNLLLAKTPSDIAR 60
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
FTDM++P P+++L ++HGSIV+L++D +R V GP+F+PAGSFGR+MTMDDLIAKQMRVT
Sbjct: 61 FTDMSDPQTPIAALGVTHGSIVYLAHDTQRTVSGPTFSPAGSFGRRMTMDDLIAKQMRVT 120
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQE PHCESVSFDRD A+AFQ YVNETL FAVKRGGFMYGTV +D V V+FIYEPPQQG
Sbjct: 121 RQETPHCESVSFDRDAANAFQHYVNETLVFAVKRGGFMYGTVADDGAVRVDFIYEPPQQG 180
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
TEE L ++RD +EE+LVDAIA GLGM++VGFIFTQTI QNKKDYT+SN EVLQA E H E
Sbjct: 181 TEENLILMRDTDEERLVDAIAMGLGMRRVGFIFTQTISQNKKDYTMSNSEVLQAAELHGE 240
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
++EWVTAVVKL V+E+ A +HFEAFQMSDMCVRLFKEGWFET+I E DPKLS+MKK
Sbjct: 241 SGLKEWVTAVVKLTVNEDDGADVHFEAFQMSDMCVRLFKEGWFETDIGEEVDPKLSRMKK 300
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
DVVVG KD +EVDNDFFLV+VKILDHQGPLSSTFPIENR TQVTMRALK+HL+R+ +LP
Sbjct: 301 DVVVGVKDTREVDNDFFLVLVKILDHQGPLSSTFPIENRITQVTMRALKNHLDRAKNLPF 360
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
VKRISDFHLLL L+RFLD+NSDVPALA+CVQ Q+AVPEGY+LLI S+A
Sbjct: 361 VKRISDFHLLLLLSRFLDINSDVPALAECVQTQSAVPEGYQLLIESLA 408
>gi|358348088|ref|XP_003638081.1| NPL4-like protein [Medicago truncatula]
gi|355504016|gb|AES85219.1| NPL4-like protein [Medicago truncatula]
Length = 414
Score = 650 bits (1678), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/411 (75%), Positives = 360/411 (87%), Gaps = 3/411 (0%)
Query: 1 MLLRIRSRDGLERVTVDGAH-VTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDGLERVT+D + TV++LKTII+ QLRIP +QTLSTNQNLLLAKS DLL
Sbjct: 1 MMLRIRSRDGLERVTLDNPNNKTVSDLKTIIEKQLRIPTHNQTLSTNQNLLLAKSLQDLL 60
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPS-FNPAGSFGRKMTMDDLIAKQMR 118
FTDM+NP+ PL++LN+SHGSI+FL+Y+GER V+GP+ F+PAGSFG+KMTMDDLIAKQMR
Sbjct: 61 LFTDMSNPNTPLTALNLSHGSIIFLAYEGERTVQGPAAFSPAGSFGKKMTMDDLIAKQMR 120
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQ 178
V+RQENPHCE VSFDRDCA+AFQ YVNE LAFAVKRGGFMYGTV E+ +VEVNFIYEPPQ
Sbjct: 121 VSRQENPHCELVSFDRDCANAFQHYVNENLAFAVKRGGFMYGTVTEEGKVEVNFIYEPPQ 180
Query: 179 QGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFH 238
QG E+ L RD EEEK V+AIAAGLGMK+VGFIFTQ++ Q+KKDYTLS REVLQA E+H
Sbjct: 181 QGLEDNLLFFRDPEEEKCVEAIAAGLGMKRVGFIFTQSVSQDKKDYTLSYREVLQAAEYH 240
Query: 239 AECNMEEWVTAVVKLEVSEE-GSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSK 297
AE ++EWVTAVVKLEV+EE G A +HFEAFQMSD+C+RLFKEGWF+TE+ E DPKLS
Sbjct: 241 AESGLKEWVTAVVKLEVNEEMGGADVHFEAFQMSDVCIRLFKEGWFDTEVKEDHDPKLSI 300
Query: 298 MKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPS 357
MKKDVVVG KD KEVDNDFFLVVVKI DHQGPLSSTFP+ENR TQVT ALK+HL R+ +
Sbjct: 301 MKKDVVVGVKDTKEVDNDFFLVVVKISDHQGPLSSTFPVENRNTQVTAMALKNHLERTKN 360
Query: 358 LPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
LP VKRISDFHLLL LAR LD+++DVPAL +CVQ Q+AVPEGY++LI SMA
Sbjct: 361 LPFVKRISDFHLLLVLARVLDVSADVPALTECVQTQSAVPEGYQILIESMA 411
>gi|15229360|ref|NP_191859.1| nuclear protein localization protein 4-like protein [Arabidopsis
thaliana]
gi|75181052|sp|Q9LYC2.1|NPL41_ARATH RecName: Full=NPL4-like protein 1
gi|7573429|emb|CAB87745.1| putative protein [Arabidopsis thaliana]
gi|14334782|gb|AAK59569.1| unknown protein [Arabidopsis thaliana]
gi|15810647|gb|AAL07248.1| unknown protein [Arabidopsis thaliana]
gi|110735118|gb|ABG89129.1| NPL4a [synthetic construct]
gi|332646901|gb|AEE80422.1| nuclear protein localization protein 4-like protein [Arabidopsis
thaliana]
Length = 413
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/410 (73%), Positives = 362/410 (88%), Gaps = 1/410 (0%)
Query: 2 LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQF 61
+LR+RSRDGLERV+VDG H+TV++LKT+IQ QL+IP+ +QTLSTN+NLLLAKSPSD L F
Sbjct: 3 MLRVRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAKSPSDFLAF 62
Query: 62 TDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-PSFNPAGSFGRKMTMDDLIAKQMRVT 120
TDMA+P+ +SSLN++HGS+V+L+Y+GER +RG P+ PAGSFGRKMT++DLIA+QMRV
Sbjct: 63 TDMADPNLRISSLNLAHGSMVYLAYEGERTIRGGPAVTPAGSFGRKMTVEDLIARQMRVG 122
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQE HC+SVSFDRDCA+AFQ +VNE+LAFAVKRGGFMYG V ED +VEVNFIYEPPQQG
Sbjct: 123 RQEKAHCDSVSFDRDCANAFQHFVNESLAFAVKRGGFMYGNVSEDGQVEVNFIYEPPQQG 182
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
E+ L ++RD EEEK VDAIA GLGM++VGFIF QT+ Q+KK+YTLSN EVL A + HAE
Sbjct: 183 MEDNLILMRDSEEEKRVDAIALGLGMRRVGFIFNQTVTQDKKEYTLSNVEVLLAAQLHAE 242
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
++EWVTAVVKLE++E+G A +HFE FQMSDMCVRLFKEGWFETEI DDPKLSK+KK
Sbjct: 243 SELKEWVTAVVKLEINEDGGADVHFEPFQMSDMCVRLFKEGWFETEIGPEDDPKLSKLKK 302
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
+VVVG KDVKEVDNDFFLV+VKILDHQGPLS TFPIENR TQ TMRALK+H+ R+ SLP
Sbjct: 303 EVVVGVKDVKEVDNDFFLVLVKILDHQGPLSCTFPIENRNTQTTMRALKTHMERARSLPF 362
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
VKRISDFHLLLF+A+FLD++SDVPALA+CV+ Q+ VPEGY+LLI+SMA T
Sbjct: 363 VKRISDFHLLLFVAQFLDVSSDVPALAECVRLQSHVPEGYELLIDSMANT 412
>gi|18407495|ref|NP_566118.1| NPL4-like protein 2 [Arabidopsis thaliana]
gi|75100586|sp|O82264.1|NPL42_ARATH RecName: Full=NPL4-like protein 2
gi|3738309|gb|AAC63651.1| expressed protein [Arabidopsis thaliana]
gi|20197558|gb|AAM15128.1| expressed protein [Arabidopsis thaliana]
gi|110735120|gb|ABG89130.1| NPL4b [synthetic construct]
gi|330255825|gb|AEC10919.1| NPL4-like protein 2 [Arabidopsis thaliana]
Length = 413
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 300/411 (72%), Positives = 364/411 (88%), Gaps = 1/411 (0%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M+LRIRSRDGLERVT +GAH+TV++LKT+I QL+IP+ QTLSTN++LLLAK+P+DLL
Sbjct: 2 MMLRIRSRDGLERVTAEGAHITVSQLKTLIADQLQIPLHKQTLSTNRDLLLAKTPADLLA 61
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-PSFNPAGSFGRKMTMDDLIAKQMRV 119
FTD+ +P+ PLSSLN+ HGS+++L+YDGER++ G P PAGSFGRKMT+DDLIA+QMRV
Sbjct: 62 FTDLTDPNLPLSSLNLGHGSMLYLAYDGERSIPGAPPVTPAGSFGRKMTVDDLIARQMRV 121
Query: 120 TRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQ 179
TRQE HC+SVSFDRD A+AFQ YVNE+LAFAVKRGGFMYGTV E+ +VEV+FIYEPPQQ
Sbjct: 122 TRQETSHCDSVSFDRDAANAFQHYVNESLAFAVKRGGFMYGTVTEEGQVEVDFIYEPPQQ 181
Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHA 239
GTE L ++RD +EEK VDAIA GLGM++VGFIF QT++Q+K +YTLSN EVLQA E HA
Sbjct: 182 GTEANLILMRDADEEKRVDAIAMGLGMRRVGFIFNQTVVQDKTEYTLSNAEVLQAAELHA 241
Query: 240 ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK 299
E ++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKE WFETEI DDPKLSKMK
Sbjct: 242 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEEWFETEIMPDDDPKLSKMK 301
Query: 300 KDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLP 359
K+VVVG KD+KEVDNDFFLV+V+ILDHQGPLSSTFPIENR+++ TMRALK+HL+R+ SLP
Sbjct: 302 KEVVVGVKDLKEVDNDFFLVLVRILDHQGPLSSTFPIENRSSRATMRALKTHLDRAKSLP 361
Query: 360 LVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
LVK++SDFHLLLF+A+FLD++SDVPALA+CV+ Q+ VPEGY LLI SMA T
Sbjct: 362 LVKKMSDFHLLLFVAQFLDVSSDVPALAECVRLQSPVPEGYALLIESMANT 412
>gi|297817634|ref|XP_002876700.1| NPL4 family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322538|gb|EFH52959.1| NPL4 family protein [Arabidopsis lyrata subsp. lyrata]
Length = 413
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 300/410 (73%), Positives = 362/410 (88%), Gaps = 1/410 (0%)
Query: 2 LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQF 61
+LRIRSRDGLERV+VDG H+TV++LKT+IQ QL+IP+ +QTLSTN+NLLLAK+P+D L F
Sbjct: 3 MLRIRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAKTPADFLGF 62
Query: 62 TDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-PSFNPAGSFGRKMTMDDLIAKQMRVT 120
TDMA+P+ +SSLN++HGS++FL+Y+GER +RG P+ PAGSFGRKMT++DLIA+QMRV
Sbjct: 63 TDMADPNLRISSLNLAHGSMLFLAYEGERTIRGGPAVTPAGSFGRKMTVEDLIARQMRVG 122
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQE HC+SVSFDRDCA+AFQ YVNE+LAFAVKRGGFMYG V E+ +VEVNFIYEPPQQG
Sbjct: 123 RQEKSHCDSVSFDRDCANAFQHYVNESLAFAVKRGGFMYGNVSEEGQVEVNFIYEPPQQG 182
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
E+ L ++RD EEEK VDAIA GLGM++VGFIF QT+ Q+KK+YTLSN EVL A + HAE
Sbjct: 183 MEDNLILMRDSEEEKRVDAIALGLGMRRVGFIFNQTVTQDKKEYTLSNVEVLLAAQLHAE 242
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
++EWVTAVVKLE++E+G+A +HFEAFQMSDMCVRLFKEGWFETEI DDPKLSK+KK
Sbjct: 243 SELKEWVTAVVKLEINEDGAADVHFEAFQMSDMCVRLFKEGWFETEIGSEDDPKLSKLKK 302
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
+VVVG KDVKEVDNDFFLV+VKILDHQ PLS TFPIENR Q TMRALK+H+ R+ SLP
Sbjct: 303 EVVVGVKDVKEVDNDFFLVLVKILDHQVPLSCTFPIENRNIQTTMRALKTHMERARSLPF 362
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
VKRISDFHLLLF+A+FLD++SDVPALA+CV+ Q+ VPEGY+LLI+SMA T
Sbjct: 363 VKRISDFHLLLFVAQFLDVSSDVPALAECVRLQSHVPEGYELLIDSMANT 412
>gi|297824909|ref|XP_002880337.1| NPL4 family protein [Arabidopsis lyrata subsp. lyrata]
gi|297326176|gb|EFH56596.1| NPL4 family protein [Arabidopsis lyrata subsp. lyrata]
Length = 412
Score = 639 bits (1649), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 299/411 (72%), Positives = 363/411 (88%), Gaps = 1/411 (0%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M+LRIRSRDGLERVT +GA++TV++LKT+I+ QL+IP+ QTLSTN++LLLAK+P+DLL
Sbjct: 1 MMLRIRSRDGLERVTAEGANITVSQLKTLIEDQLQIPLHKQTLSTNRDLLLAKTPADLLA 60
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-PSFNPAGSFGRKMTMDDLIAKQMRV 119
FTD+ +P+ PLSSLN+ HGS++FL+YDGER++ G P PAGSFGRKMT+DDLIA+QMRV
Sbjct: 61 FTDLTDPNLPLSSLNLGHGSMLFLAYDGERSIPGAPPVTPAGSFGRKMTVDDLIARQMRV 120
Query: 120 TRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQ 179
TRQE HC+SVSFDRDCA+AFQ YVN++LAFAVKRGGFMYGTV E +VEV+FIYEPPQQ
Sbjct: 121 TRQETSHCDSVSFDRDCANAFQHYVNDSLAFAVKRGGFMYGTVTEGGQVEVDFIYEPPQQ 180
Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHA 239
GTE L ++RD +EEK VDAIA GLG++KVGFIF QT++Q+K YTLSN EVLQA E HA
Sbjct: 181 GTEANLILMRDADEEKRVDAIAMGLGVRKVGFIFNQTVVQDKTGYTLSNAEVLQAAELHA 240
Query: 240 ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK 299
E ++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKE WFETEI DDPKLSKMK
Sbjct: 241 EGELQEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEEWFETEIMPEDDPKLSKMK 300
Query: 300 KDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLP 359
K+VVVG KD+KEVDNDFFLV+V+ILDHQGPLSSTFPIENR+++ TMR LK+HL+R+ SLP
Sbjct: 301 KEVVVGVKDLKEVDNDFFLVLVRILDHQGPLSSTFPIENRSSRATMRDLKTHLDRAKSLP 360
Query: 360 LVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
LVK++SDFHLLLF+A+FLD++SDVPALA+CV+ Q+ VPEGY LLI SMA T
Sbjct: 361 LVKKMSDFHLLLFVAQFLDVSSDVPALAECVRLQSPVPEGYALLIESMANT 411
>gi|255559370|ref|XP_002520705.1| nuclear protein localization, putative [Ricinus communis]
gi|223540090|gb|EEF41667.1| nuclear protein localization, putative [Ricinus communis]
Length = 390
Score = 622 bits (1605), Expect = e-176, Method: Compositional matrix adjust.
Identities = 300/410 (73%), Positives = 347/410 (84%), Gaps = 20/410 (4%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M++RIRSRDGLERV+V+ ++TV++LKT+IQ+QL IP+++QTLSTNQNLLLAKSPSD+L+
Sbjct: 1 MMIRIRSRDGLERVSVENPNITVSQLKTLIQNQLNIPIQNQTLSTNQNLLLAKSPSDVLK 60
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
FTDM+NP+ PLSSLNI+HGSI+FL+YDGER + GP+ NPAGSFGRKMTMDDLIAKQMR+T
Sbjct: 61 FTDMSNPNTPLSSLNITHGSILFLTYDGERTIAGPAVNPAGSFGRKMTMDDLIAKQMRIT 120
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQENPHCESVSFDRDCA+AFQ + VK + IYEPPQQG
Sbjct: 121 RQENPHCESVSFDRDCANAFQH-----IPMMVK--------------LRFIIIYEPPQQG 161
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
TEE L +LRD EEEKLVDAIA GLGM++VGFIFTQTI Q+KKDYTLSNREVLQA E HAE
Sbjct: 162 TEENLMLLRDPEEEKLVDAIAVGLGMRRVGFIFTQTITQDKKDYTLSNREVLQAAELHAE 221
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
++EWVTA+VKLEV+E+G A +HFEAFQMSDMC+RLFKE WFE EIAE DPKLSKMKK
Sbjct: 222 SELKEWVTAIVKLEVNEDGGADVHFEAFQMSDMCIRLFKEDWFEKEIAEDSDPKLSKMKK 281
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
DVVVGGKDV+EVDNDFFLVVVKILDH GPLSS+FPIENR QVT RALKSHL+R +L
Sbjct: 282 DVVVGGKDVREVDNDFFLVVVKILDHLGPLSSSFPIENRINQVTKRALKSHLDRLKNLSF 341
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
VKRISDFHLLLFL RFLD+ SDVPALA+CV QTAVPEGY+L+I S+A +
Sbjct: 342 VKRISDFHLLLFLGRFLDV-SDVPALAECVLTQTAVPEGYQLIIESVANS 390
>gi|147777089|emb|CAN67841.1| hypothetical protein VITISV_016664 [Vitis vinifera]
Length = 1616
Score = 597 bits (1538), Expect = e-168, Method: Compositional matrix adjust.
Identities = 286/398 (71%), Positives = 334/398 (83%), Gaps = 16/398 (4%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
ML+RIRSRDGLER+T+D H T+++LKT+IQ L IP+ SQTLSTNQNLLLAK+PSD+ +
Sbjct: 1 MLIRIRSRDGLERLTLDNPHATISQLKTLIQQHLHIPISSQTLSTNQNLLLAKTPSDIAR 60
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
FTDM++P P+++L ++HGSIV+L++D +R V GP+F+PAGSFGR+MTMDDLIAKQMRVT
Sbjct: 61 FTDMSDPQTPIAALGVTHGSIVYLAHDTQRTVSGPTFSPAGSFGRRMTMDDLIAKQMRVT 120
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQE PHCESVSFDRD A+AFQ YVNETL FAVKRGGFMYGTV +D V V+FIYEPPQQG
Sbjct: 121 RQETPHCESVSFDRDAANAFQHYVNETLVFAVKRGGFMYGTVADDGAVRVDFIYEPPQQG 180
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
TEE L ++RD +EE+LVDAIA GLGM +VGFIFTQTI QNKKDYT+SN EVLQA E H E
Sbjct: 181 TEENLILMRDTDEERLVDAIAMGLGMXRVGFIFTQTISQNKKDYTMSNSEVLQAAELHGE 240
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
++EWVTAVVKL E G RLFKEGWFET+I E DPKLS+MKK
Sbjct: 241 SGLKEWVTAVVKL-YGERGR---------------RLFKEGWFETDIGEEVDPKLSRMKK 284
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
DVVVG KD +EVDNDFFLV+VKILDHQGPLSSTFPIENR TQVTMRALK+HL+R+ +LP
Sbjct: 285 DVVVGVKDTREVDNDFFLVLVKILDHQGPLSSTFPIENRITQVTMRALKNHLDRAKNLPF 344
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPE 398
VKRISDFHLLL L+RFLD+NSDVPALA+CVQ Q+AVPE
Sbjct: 345 VKRISDFHLLLLLSRFLDINSDVPALAECVQTQSAVPE 382
>gi|294461152|gb|ADE76140.1| unknown [Picea sitchensis]
Length = 413
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 275/410 (67%), Positives = 341/410 (83%), Gaps = 4/410 (0%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
ML+R+RSRDGLERV VD TV E K++IQSQL++P+ SQ LSTNQNLLLAK ++ Q
Sbjct: 1 MLIRVRSRDGLERVKVDRPSATVGEFKSLIQSQLQVPIPSQILSTNQNLLLAKG-NNTQQ 59
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGP--SFNPAGSFGRKMTMDDLIAKQMR 118
F DM +P+ PLSSL I HG IVFLSY+GER++ GP + NP+GSFG+KMTMDDLI++Q+R
Sbjct: 60 FKDMEDPNIPLSSLGIGHGHIVFLSYEGERSIAGPQINVNPSGSFGKKMTMDDLISRQVR 119
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQ 178
+ RQ+ PHC S SFDRD A+ FQQYVNE LAFA+KRGGFMYGTV E V VNFIYE PQ
Sbjct: 120 IERQDKPHCGSASFDRDAANGFQQYVNENLAFAIKRGGFMYGTVGEGGDVAVNFIYEHPQ 179
Query: 179 QGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFH 238
QGTE+ L++ RD++EEK V+AIA GLGM++VGFIFTQTI Q KKDYTLSN E+ QAVE H
Sbjct: 180 QGTEDSLFLFRDQDEEKRVEAIAIGLGMRRVGFIFTQTITQGKKDYTLSNAEIRQAVELH 239
Query: 239 AECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKM 298
AE ++EWVTA+VKLEV+EEG A +HFEAFQ+SD CV+LFKEGWF + + D K++++
Sbjct: 240 AESELKEWVTALVKLEVNEEGGADVHFEAFQLSDQCVKLFKEGWFVDDCTDL-DLKITRL 298
Query: 299 KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSL 358
KKDVV+ GKD ++VDNDFFLV+VKILDHQGPLSSTFPIENR + VT+RALK+HL+R+ +L
Sbjct: 299 KKDVVILGKDTRDVDNDFFLVLVKILDHQGPLSSTFPIENRASPVTLRALKNHLDRTKNL 358
Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
VKRISDFHLLL LA++LD+N+DVP LA CVQ Q +VPEGY+++I+S+A
Sbjct: 359 SFVKRISDFHLLLLLAKYLDVNADVPQLAACVQVQASVPEGYQIIIDSLA 408
>gi|224131268|ref|XP_002321042.1| predicted protein [Populus trichocarpa]
gi|222861815|gb|EEE99357.1| predicted protein [Populus trichocarpa]
Length = 327
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 255/326 (78%), Positives = 294/326 (90%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M++RIRSRDGLERV++D ++T+++ KT+IQ+QL+IP+ +QTLSTNQNLLLAKSP DLLQ
Sbjct: 1 MMIRIRSRDGLERVSIDNPNITISQFKTLIQNQLQIPIHNQTLSTNQNLLLAKSPPDLLQ 60
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
FTDM+NP LSSLNISHGSI+FL+Y+G R + GP+ +PAGSFGRKMTMDDLIAKQMRV
Sbjct: 61 FTDMSNPSTLLSSLNISHGSIIFLAYEGHRTIAGPAVHPAGSFGRKMTMDDLIAKQMRVG 120
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQENPHC+SVSFDRDCA+AFQ YVNETLAFAVKRGGFMYGTV E+ +VEVNFIYE PQQG
Sbjct: 121 RQENPHCDSVSFDRDCANAFQHYVNETLAFAVKRGGFMYGTVSEEGKVEVNFIYELPQQG 180
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
TEE+L +LRD +EEK+VDAIAAGLGM++VGFIF QTIM +KKDYTLSNREVLQA E HAE
Sbjct: 181 TEEILMLLRDSDEEKIVDAIAAGLGMRRVGFIFNQTIMHDKKDYTLSNREVLQAAELHAE 240
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKEGWFETEI E DPKLS MKK
Sbjct: 241 SELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEGWFETEIGEDVDPKLSIMKK 300
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDH 326
DVVVG KDVKEVDNDFFLVV+KILDH
Sbjct: 301 DVVVGSKDVKEVDNDFFLVVIKILDH 326
>gi|42571281|ref|NP_973714.1| NPL4-like protein 2 [Arabidopsis thaliana]
gi|94442443|gb|ABF19009.1| At2g47970 [Arabidopsis thaliana]
gi|330255824|gb|AEC10918.1| NPL4-like protein 2 [Arabidopsis thaliana]
Length = 354
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 242/342 (70%), Positives = 293/342 (85%), Gaps = 2/342 (0%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M+LRIRSRDGLERVT +GAH+TV++LKT+I QL+IP+ QTLSTN++LLLAK+P+DLL
Sbjct: 2 MMLRIRSRDGLERVTAEGAHITVSQLKTLIADQLQIPLHKQTLSTNRDLLLAKTPADLLA 61
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-PSFNPAGSFGRKMTMDDLIAKQMRV 119
FTD+ +P+ PLSSLN+ HGS+++L+YDGER++ G P PAGSFGRKMT+DDLIA+QMRV
Sbjct: 62 FTDLTDPNLPLSSLNLGHGSMLYLAYDGERSIPGAPPVTPAGSFGRKMTVDDLIARQMRV 121
Query: 120 TRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQ 179
TRQE HC+SVSFDRD A+AFQ YVNE+LAFAVKRGGFMYGTV E+ +VEV+FIYEPPQQ
Sbjct: 122 TRQETSHCDSVSFDRDAANAFQHYVNESLAFAVKRGGFMYGTVTEEGQVEVDFIYEPPQQ 181
Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHA 239
GTE L ++RD +EEK VDAIA GLGM++VGFIF QT++Q+K +YTLSN EVLQA E HA
Sbjct: 182 GTEANLILMRDADEEKRVDAIAMGLGMRRVGFIFNQTVVQDKTEYTLSNAEVLQAAELHA 241
Query: 240 ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK 299
E ++EWVTAVVKLEV+E+G A +HFEAFQMSDMC+RLFKE WFETEI DDPKLSKMK
Sbjct: 242 ESELKEWVTAVVKLEVNEDGGADVHFEAFQMSDMCIRLFKEEWFETEIMPDDDPKLSKMK 301
Query: 300 KDVVVGGKDVKEVDNDFFLVVVKILDHQ-GPLSSTFPIENRT 340
K+VVVG KD+KEVDNDFFLV+V+ILDHQ ST P+ +
Sbjct: 302 KEVVVGVKDLKEVDNDFFLVLVRILDHQVRVFLSTLPLHQSS 343
>gi|168021566|ref|XP_001763312.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685447|gb|EDQ71842.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 409
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 238/410 (58%), Positives = 315/410 (76%), Gaps = 2/410 (0%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
ML+R+RSRDGLERVT+ + TV +L+ I L +PV SQ LS NQ LLL+K P + +
Sbjct: 1 MLIRLRSRDGLERVTL-ANNATVGDLQAAIAENLNVPVSSQLLSRNQALLLSKDPVNGKE 59
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
F DM + LSSL I+HGS+++L Y G R V+GP P G+FGRKMT++DLIA+Q+R+
Sbjct: 60 FADMQDSGATLSSLGIAHGSLLYLFYSGNRVVKGPQVTPFGAFGRKMTVEDLIARQIRIE 119
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQE HC ++SFDR+ A+AFQ YV+E+LAF++KRGG MYG+V E ++V+ IYEPPQQG
Sbjct: 120 RQEKSHCSALSFDREAANAFQLYVHESLAFSIKRGGIMYGSVNEAGEIKVDVIYEPPQQG 179
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
+EE L ++RD +EEK D IAAGLG++KVGFIFTQTI QN+ YTLSN E+ Q EFHAE
Sbjct: 180 SEEGLLLMRDMDEEKRADVIAAGLGLRKVGFIFTQTITQNQGGYTLSNAEIRQTAEFHAE 239
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
+ E+W TA+VKLEV+E+G A +HFEAFQ+SD CV+LFK+G FE + + DPKL++MKK
Sbjct: 240 SDFEQWATAIVKLEVNEDGGADVHFEAFQLSDQCVKLFKDGLFEENVPDL-DPKLTRMKK 298
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
+V+V KD +VDNDFFLV VKILDH+GP+++TFPIENR T++ALK+HL R + P
Sbjct: 299 EVIVLQKDTFDVDNDFFLVPVKILDHEGPMTTTFPIENRAPLATLQALKTHLERHRNKPY 358
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
VKRI+DFHLLL L+ FLD+N+DVP L + ++ + VPEGY+L+I SMA +
Sbjct: 359 VKRIADFHLLLLLSNFLDINTDVPQLTEAIRTEGPVPEGYQLIIESMASS 408
>gi|302814770|ref|XP_002989068.1| hypothetical protein SELMODRAFT_129073 [Selaginella moellendorffii]
gi|300143169|gb|EFJ09862.1| hypothetical protein SELMODRAFT_129073 [Selaginella moellendorffii]
Length = 411
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 236/408 (57%), Positives = 308/408 (75%), Gaps = 7/408 (1%)
Query: 2 LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDL-LQ 60
++RIRSRDGLERV+V TV EL+ I+ +L++PV +Q +S NQ+LL+AK PS L
Sbjct: 5 VVRIRSRDGLERVSVH-PKATVLELQMAIERELQVPVSAQYVSRNQSLLMAKDPSGAALM 63
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
DM +P LSSL I HGS+VFL Y G+R V GP P+G+FG+KMTMDDLIA+Q RV
Sbjct: 64 AADMRDPSAALSSLGIGHGSVVFLFYSGKREVAGPKVTPSGAFGKKMTMDDLIARQTRVE 123
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQ++ HC S+SFDR A+AFQ YV+ETLAF VKRGGFMYG+V E V+FIYEPPQQG
Sbjct: 124 RQQHAHCNSLSFDRAAANAFQHYVHETLAFGVKRGGFMYGSVTEQGDALVHFIYEPPQQG 183
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
+E+ L +LRD +EE+ VD IAAGLGM+++GFIFT TI Q K DYTLSN E+ QA + HAE
Sbjct: 184 SEDGLLLLRDPDEERRVDGIAAGLGMRRLGFIFTHTISQGKADYTLSNAEIRQAAQLHAE 243
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
N + W TA+VKLE + + + FE+FQ+SD C++L+KEGWF + AE +PKLS+M K
Sbjct: 244 SNFDAWTTALVKLEANGD----VQFESFQLSDQCIKLWKEGWF-VQDAEDVNPKLSRMNK 298
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
+VV+ GKD ++VDN FLV VKILDHQGPLS FPIENR T + +L+SHL+ + S P
Sbjct: 299 EVVILGKDTRDVDNVLFLVPVKILDHQGPLSCNFPIENRLTLPSFESLRSHLDSNKSRPY 358
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
V+RISDFHLLLFL++ LD+++D+P L + + Q +VP+GY+LLI+S+A
Sbjct: 359 VQRISDFHLLLFLSKSLDIDTDIPRLTEAISTQGSVPDGYQLLIDSLA 406
>gi|302824858|ref|XP_002994068.1| hypothetical protein SELMODRAFT_187743 [Selaginella moellendorffii]
gi|300138074|gb|EFJ04855.1| hypothetical protein SELMODRAFT_187743 [Selaginella moellendorffii]
Length = 411
Score = 489 bits (1258), Expect = e-135, Method: Compositional matrix adjust.
Identities = 235/408 (57%), Positives = 307/408 (75%), Gaps = 7/408 (1%)
Query: 2 LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDL-LQ 60
++RIRSRDGLERV+V TV EL+ I+ +L++PV +Q +S NQ+LL+AK PS L
Sbjct: 5 VVRIRSRDGLERVSVH-PKATVLELQMAIERELQVPVSAQYVSRNQSLLMAKDPSGAALM 63
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
DM +P LSSL I HGS+VFL Y G+R V GP P+G+FG+KMTMDDLIA+Q RV
Sbjct: 64 AADMRDPSAALSSLGIGHGSVVFLFYSGKREVAGPKVTPSGAFGKKMTMDDLIARQTRVE 123
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQ++ HC S+SFDR A+AFQ YV+ETLAF VKRGGFMYG+V E V+FIYEPPQQG
Sbjct: 124 RQQHAHCNSLSFDRAAANAFQHYVHETLAFGVKRGGFMYGSVTEQGDALVHFIYEPPQQG 183
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
+E+ L +LRD +EE+ D IAAGLGM+++GFIFT TI Q K DYTLSN E+ QA + HAE
Sbjct: 184 SEDGLLLLRDPDEERRADGIAAGLGMRRLGFIFTHTISQGKADYTLSNAEIRQAAQLHAE 243
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
N + W TA+VKLE + + + FE+FQ+SD C++L+KEGWF + AE +PKLS+M K
Sbjct: 244 SNFDAWTTALVKLEANGD----VQFESFQLSDQCIKLWKEGWF-VQDAEDVNPKLSRMNK 298
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPL 360
+VV+ GKD ++VDN FLV VKILDHQGPLS FPIENR T + +L+SHL+ + S P
Sbjct: 299 EVVILGKDTRDVDNVLFLVPVKILDHQGPLSCNFPIENRLTLPSFESLRSHLDSNKSRPY 358
Query: 361 VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
V+RISDFHLLLFL++ LD+++D+P L + + Q +VP+GY+LLI+S+A
Sbjct: 359 VQRISDFHLLLFLSKSLDIDTDIPRLTEAISTQGSVPDGYQLLIDSLA 406
>gi|115436652|ref|NP_001043084.1| Os01g0377700 [Oryza sativa Japonica Group]
gi|75167666|sp|Q9AS33.1|NPL4_ORYSJ RecName: Full=NPL4-like protein
gi|13365982|dbj|BAB39260.1| NPL4 -like protein [Oryza sativa Japonica Group]
gi|113532615|dbj|BAF04998.1| Os01g0377700 [Oryza sativa Japonica Group]
gi|125526006|gb|EAY74120.1| hypothetical protein OsI_02003 [Oryza sativa Indica Group]
Length = 415
Score = 479 bits (1234), Expect = e-133, Method: Compositional matrix adjust.
Identities = 247/417 (59%), Positives = 307/417 (73%), Gaps = 13/417 (3%)
Query: 1 MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDG +R+TV D A TV +L+ +I +++ +PV Q LS + LLL S S L
Sbjct: 1 MILRIRSRDGTDRITVPDPAAATVGDLQRLIAARVTVPVPLQRLSLDPALLLPSSASAAL 60
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGP----SFNPAGSFGRKMTMDDLIAK 115
A LSSL +S+GS V+LSY + P + + AGSFG+KMTMDDLIA+
Sbjct: 61 LADPAAP----LSSLRLSNGSFVYLSYPPDARSSQPPPPKALSAAGSFGKKMTMDDLIAR 116
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLED-KRVEVNFIY 174
Q+RVTRQE P C + SFDRD A+AFQ +V E+LAFA KR GF+YG V D K V V+FIY
Sbjct: 117 QIRVTRQEAPLCAAASFDRDSANAFQLHVAESLAFATKRAGFLYGRVDADTKEVFVDFIY 176
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
EPPQ GTE+V+ ++RD +EE VDAIA GLGM++VG +FTQ + + D YT+SNREV
Sbjct: 177 EPPQVGTEDVVQLMRDAQEEARVDAIAHGLGMRRVGLVFTQAVGRKTSDTGEYTMSNREV 236
Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
LQA E AE + EWVTA+VKLEV ++GS +HFEAFQMS++CV+LFK+G ETEI + D
Sbjct: 237 LQATELQAEGGIPEWVTAIVKLEVGDDGSGDVHFEAFQMSEICVKLFKDGVLETEIGDKD 296
Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
DP+LSKM+K+VV GGKD EVDNDFFLV VKI DHQGPLS+ FPIENR V M ALKSH
Sbjct: 297 DPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQGPLSTGFPIENRGNPVAMSALKSH 356
Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
L+R+ LP VKRISDFHLLL +A FLD+ +DVPAL CV+ Q+ VPEGY+LLI S+A
Sbjct: 357 LDRAKHLPFVKRISDFHLLLLVAAFLDIKADVPALTACVKNQSVVPEGYQLLIESLA 413
>gi|242059373|ref|XP_002458832.1| hypothetical protein SORBIDRAFT_03g041070 [Sorghum bicolor]
gi|241930807|gb|EES03952.1| hypothetical protein SORBIDRAFT_03g041070 [Sorghum bicolor]
Length = 417
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 239/417 (57%), Positives = 313/417 (75%), Gaps = 12/417 (2%)
Query: 1 MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDG +R+TV D A TVA+L+ +I+S L +PV Q LS + LLL +PS +
Sbjct: 1 MILRIRSRDGTDRITVPDSASATVADLQRLIESHLTVPVPLQRLSLDPALLL-PNPSAAV 59
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGE-RNVRGP---SFNPAGSFGRKMTMDDLIAK 115
+A+P PL+SL +++G+ V+L+Y + R+ R P + + AGSFG+KMTMDDLIA+
Sbjct: 60 PL--LADPATPLASLRLANGAFVYLAYPPDARSARPPPPKALSVAGSFGKKMTMDDLIAR 117
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
Q+RVTRQEN C + SFDRD A+AFQ YV E+LAFAVKR GF+YG V E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAESLAFAVKRAGFLYGRVDAETKEVFVDFIY 177
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
EPPQQG+E+V++++RD EEE VD IA GLGM++VG +FTQ + + + YT+SNREV
Sbjct: 178 EPPQQGSEDVVHLMRDPEEEARVDTIAEGLGMRRVGLVFTQAVGRKASETGEYTMSNREV 237
Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
+QA + AE + EWVTA+VKLEV ++G+ +HFEAFQMS++CV+LFK+G ETE+ + D
Sbjct: 238 VQAAQLQAEGGIPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297
Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
DP+LSKM+K+VV GGKD EVDNDFFLV VKI DHQGPLS FPIENR + + + AL+SH
Sbjct: 298 DPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQGPLSVGFPIENRGSPLGISALRSH 357
Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
L R L V+RISDFHLLL LA FLD+ +D+P +A CV+ Q+ VPEGY+ LI S A
Sbjct: 358 LERMKHLTFVRRISDFHLLLKLATFLDVKADMPTIAACVKTQSRVPEGYQFLIESFA 414
>gi|212722814|ref|NP_001132660.1| NPL4 family [Zea mays]
gi|194695028|gb|ACF81598.1| unknown [Zea mays]
gi|195622926|gb|ACG33293.1| NPL4 family protein [Zea mays]
gi|219888319|gb|ACL54534.1| unknown [Zea mays]
gi|413951803|gb|AFW84452.1| NPL4 family [Zea mays]
Length = 417
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 240/417 (57%), Positives = 309/417 (74%), Gaps = 12/417 (2%)
Query: 1 MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDG +R+TV D A VTVA+L+ +I+S L +PV Q LS + LLL + +
Sbjct: 1 MILRIRSRDGTDRITVPDSASVTVADLQRLIESHLTVPVPLQRLSLDPALLLPNPSAVVP 60
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGE-RNVRGP---SFNPAGSFGRKMTMDDLIAK 115
TD A P L+SL +++G+ V+L+Y + R+ R P + + AGSFG+KMTMDDLIA+
Sbjct: 61 LLTDSAAP---LASLRLANGAFVYLAYPPDARSARPPPPKALSAAGSFGKKMTMDDLIAR 117
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
Q+RVTRQEN C + SFDRD A+AFQ YV E L F VKR GF+YG V E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAEFLGFGVKRAGFLYGRVDAETKDVFVDFIY 177
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
EPPQQG+E+V+++LRD +EE VD IA GLGM++VG +FTQ + + + YT+SNREV
Sbjct: 178 EPPQQGSEDVVHLLRDPDEEARVDTIAEGLGMRRVGLVFTQAVGRKASETGEYTMSNREV 237
Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
+QA + AE + EWVTA+VKLEV ++G+ +HFEAFQMS++CV+LFK+G ETE+ + D
Sbjct: 238 VQAAQLQAEGGIPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297
Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
DP+LSKM+K+VV GGKD EVDNDFFLV VKI DHQGPLS FPIENR V++ AL+SH
Sbjct: 298 DPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQGPLSVGFPIENRGNPVSISALRSH 357
Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
L R L VKRISDFHLLL LA FLD+ +D+P +A CV+ Q+ VPEGY+LLI S A
Sbjct: 358 LERMKHLAFVKRISDFHLLLKLATFLDVKADMPTIAACVKTQSRVPEGYQLLIESFA 414
>gi|326528657|dbj|BAJ97350.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 415
Score = 476 bits (1226), Expect = e-132, Method: Compositional matrix adjust.
Identities = 239/417 (57%), Positives = 313/417 (75%), Gaps = 14/417 (3%)
Query: 1 MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDG +R+TV D A TV +L+ +I+++ +PV +Q LS + LLL
Sbjct: 1 MILRIRSRDGTDRITVPDAAAATVGDLQRLIEARATVPVAAQRLSLDPALLLPAP----- 55
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYD-GERNVRGP---SFNPAGSFGRKMTMDDLIAK 115
Q +++P PL+SL +++GS V+LSY G R+ + P + AGSFG+KMTMDDLIA+
Sbjct: 56 QPPLLSDPAAPLASLPLANGSFVYLSYPPGARSAQPPPPRALTAAGSFGKKMTMDDLIAR 115
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
Q+RVTRQE P C + SFDRD A+AFQ +V+E+LAFAVKR GF+YG V E K V V+FIY
Sbjct: 116 QIRVTRQEAPFCSAASFDRDSANAFQLHVSESLAFAVKRAGFLYGRVDAETKEVLVDFIY 175
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
EPPQQG+ +V+ ++RD +EE VDAIA GLGM++VGF+FTQ + + + YTLSN EV
Sbjct: 176 EPPQQGSSDVVQLMRDADEEARVDAIADGLGMRRVGFVFTQAVGRKASETGEYTLSNSEV 235
Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
+QA + AE + EWVTAVVKLEV E+GSA +HFEAFQMS++CV+LFK+G ETE+ + D
Sbjct: 236 VQAAQLQAEGGIPEWVTAVVKLEVGEDGSADVHFEAFQMSEVCVKLFKDGVLETEVKDSD 295
Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
DP+LSKM+K+V+ GGKD EVDNDFFLV VKI DHQGPLS+ FP+ENR + AL+SH
Sbjct: 296 DPRLSKMRKEVIAGGKDTMEVDNDFFLVPVKISDHQGPLSTGFPVENRGIPLPASALRSH 355
Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
++R+ LP VKRISDFHLLL +A FLD+ +DVP LA CV Q VPEGY+LLI+++A
Sbjct: 356 MDRAKHLPFVKRISDFHLLLQIAAFLDVKADVPTLAACVGHQAEVPEGYRLLIDALA 412
>gi|222618478|gb|EEE54610.1| hypothetical protein OsJ_01845 [Oryza sativa Japonica Group]
Length = 415
Score = 475 bits (1223), Expect = e-131, Method: Compositional matrix adjust.
Identities = 245/417 (58%), Positives = 307/417 (73%), Gaps = 13/417 (3%)
Query: 1 MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDG +R+TV D A TV +L+ +I +++ +PV Q LS + LLL S S L
Sbjct: 1 MILRIRSRDGTDRITVPDPAAATVGDLQRLIAARVTVPVPLQRLSLDPALLLPSSASAAL 60
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGP----SFNPAGSFGRKMTMDDLIAK 115
A LSSL +S+GS V+LSY + P + + AGSFG+KMTMDDLIA+
Sbjct: 61 LADPAAP----LSSLRLSNGSFVYLSYPPDARSSQPPPPKALSAAGSFGKKMTMDDLIAR 116
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLED-KRVEVNFIY 174
Q+RVTRQE P C + SFDRD A+AFQ +V E+LAFA KR GF+YG V D K V V+FIY
Sbjct: 117 QIRVTRQEAPLCAAASFDRDSANAFQLHVAESLAFATKRAGFLYGRVDADTKEVFVDFIY 176
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
EPPQ GTE+V+ ++RD +EE VDAIA GLGM++VG +FTQ + + D YT+SNREV
Sbjct: 177 EPPQVGTEDVVQLMRDAQEEARVDAIAHGLGMRRVGLVFTQAVGRKTSDTGEYTMSNREV 236
Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
LQA E AE + EWVTA+VKLEV ++GS +HFEAFQMS++CV+LFK+G ETEI + D
Sbjct: 237 LQATELQAEGGIPEWVTAIVKLEVGDDGSGDVHFEAFQMSEICVKLFKDGVLETEIGDKD 296
Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
DP+LSKM+K+VV GGKD EVDNDFFLV VKI DHQGPLS+ FPIENR V M ALKSH
Sbjct: 297 DPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQGPLSTGFPIENRGNPVAMSALKSH 356
Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
L+R+ LP VKRIS+F+LLL +A FLD+ +DVPAL CV+ Q+ VPEGY+LLI S+A
Sbjct: 357 LDRAKHLPFVKRISNFNLLLLVAAFLDIKADVPALTACVKNQSVVPEGYQLLIESLA 413
>gi|357132177|ref|XP_003567708.1| PREDICTED: NPL4-like protein-like [Brachypodium distachyon]
Length = 415
Score = 473 bits (1217), Expect = e-131, Method: Compositional matrix adjust.
Identities = 232/417 (55%), Positives = 308/417 (73%), Gaps = 14/417 (3%)
Query: 1 MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDG +R+TV DGA TV EL+ +I +++ +PV Q LS + LLL + LL
Sbjct: 1 MILRIRSRDGTDRITVPDGASATVGELQRLIAARVTVPVTLQRLSLDPALLLPAAQPQLL 60
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGP----SFNPAGSFGRKMTMDDLIAK 115
++P PL+SL +++GS V+LSY + P + AGSFG+KMTMDDLIA+
Sbjct: 61 -----SDPATPLASLPLANGSFVYLSYPPDARSSQPPPPKALTVAGSFGKKMTMDDLIAR 115
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
Q+RVTRQE P C + SFDRD A+AFQ YV E+LAFAVKR GF+YG V E K V V+FIY
Sbjct: 116 QIRVTRQEAPLCAAASFDRDSANAFQLYVAESLAFAVKRAGFLYGRVDAETKEVFVDFIY 175
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREV 231
EPPQ G+E+V+ ++RD +EE VDAIA GLGM++VG +FTQ + + + YT+SNREV
Sbjct: 176 EPPQSGSEDVVQLMRDSDEELRVDAIADGLGMRRVGLVFTQAVGRKASETGEYTMSNREV 235
Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
+QA E AE + EWVTA+VK++V ++G+ +HFEAFQ+S++CV+LFK+G +TE+ + D
Sbjct: 236 VQAAELQAEGGIPEWVTAIVKIKVGDDGTGDVHFEAFQLSEVCVKLFKDGVLQTEVGDSD 295
Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
DP+LSKM+K+V+ GGKD EVDNDFFLV VKI DHQGPLS FPIENR + + M L++H
Sbjct: 296 DPRLSKMRKEVIAGGKDTMEVDNDFFLVPVKISDHQGPLSIGFPIENRGSPLAMSTLRNH 355
Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
L+R+ LP V+RISDFHLLL + FLD+ +DVP LA CV+ Q VPEGY+LLI ++A
Sbjct: 356 LDRAKHLPFVRRISDFHLLLQVTTFLDVKADVPILAACVKHQVKVPEGYQLLIETLA 412
>gi|414879508|tpg|DAA56639.1| TPA: NPL4 family protein [Zea mays]
Length = 417
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 233/417 (55%), Positives = 306/417 (73%), Gaps = 12/417 (2%)
Query: 1 MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDG +R++V D A TVA+L+ +I+S L +PV Q +S + LLL +PS +
Sbjct: 1 MILRIRSRDGTDRISVPDSASATVADLQRLIESHLTVPVPLQRISLDPALLL-PNPSAAV 59
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPS----FNPAGSFGRKMTMDDLIAK 115
+A+P PL+SL +++G+ ++L+Y + P+ + AGSFG+KMTMDDLIA+
Sbjct: 60 PL--LADPAAPLASLRLANGAFIYLAYPPDARSARPTPPKVLSAAGSFGKKMTMDDLIAR 117
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
Q+RVTRQEN C + SFDRD A+AFQ YV E LAFAVKR GF+YG + E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAEYLAFAVKRAGFLYGRIDAETKEVLVDFIY 177
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREV 231
EPPQQG E+V++++RD +EE VD IA GLGM++VG +FTQ + + +YT+SNREV
Sbjct: 178 EPPQQGFEDVVHLMRDPDEEARVDTIAEGLGMRRVGLVFTQAVGRRASKTGEYTMSNREV 237
Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
+QA + AE EWVTA+VKLEV ++G+ +HFEAFQMS++CV+LFK+G ETE+ + D
Sbjct: 238 VQAAQLQAEGGTPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297
Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
DP+LSKM+K+VV GGKD +VDNDFFLV VKI DHQGPLS FPIENR + V M AL+SH
Sbjct: 298 DPRLSKMRKEVVAGGKDTMKVDNDFFLVPVKISDHQGPLSVGFPIENRGSPVGMSALRSH 357
Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
L R L V+RISDFHLLL LA FLD SD+P +A CV+ Q+ +PEGY+ LI S A
Sbjct: 358 LERMKHLTFVRRISDFHLLLKLATFLDAKSDMPTIAACVKTQSRMPEGYQFLIESFA 414
>gi|226529197|ref|NP_001141001.1| uncharacterized protein LOC100273080 [Zea mays]
gi|194702118|gb|ACF85143.1| unknown [Zea mays]
Length = 417
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 233/417 (55%), Positives = 306/417 (73%), Gaps = 12/417 (2%)
Query: 1 MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDG +R++V D A TVA+L+ +I+S L +PV Q +S + LLL +PS +
Sbjct: 1 MILRIRSRDGTDRISVPDSASATVADLQRLIESHLTVPVPLQRISLDPALLL-PNPSAAV 59
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPS----FNPAGSFGRKMTMDDLIAK 115
+A+P PL+SL +++G+ ++L+Y + P+ + AGSFG+KMTMDDLIA+
Sbjct: 60 PL--LADPAAPLASLRLANGAFIYLAYPPDARSARPTPPKVLSAAGSFGKKMTMDDLIAR 117
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
Q+RVTRQEN C + SFDRD A+AFQ YV E LAFAVKR GF+YG + E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAEYLAFAVKRAGFLYGRIDAETKEVLVDFIY 177
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREV 231
EPPQQG E+V++++RD +EE VD IA GLGM++VG +FTQ + + +YT+SNREV
Sbjct: 178 EPPQQGFEDVVHLMRDPDEEARVDTIAEGLGMRRVGLVFTQAVGRRASKTGEYTMSNREV 237
Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
+QA + AE EWVTA+VKLEV ++G+ +HFEAFQMS++CV+LFK+G ETE+ + D
Sbjct: 238 VQAAQLQAEGGTPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297
Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
DP+LSKM+K+VV GGKD +VDNDFFLV VKI DHQGPLS FPIENR + V M AL+SH
Sbjct: 298 DPRLSKMRKEVVAGGKDTMKVDNDFFLVPVKISDHQGPLSVGFPIENRGSPVGMSALRSH 357
Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
L R L V+RISDFHLLL LA FLD SD+P +A CV+ Q+ +PEGY+ LI S A
Sbjct: 358 LERMKHLTFVRRISDFHLLLKLATFLDAKSDMPTIAVCVKTQSRMPEGYQFLIESFA 414
>gi|195636888|gb|ACG37912.1| NPL4 family protein [Zea mays]
Length = 417
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 232/417 (55%), Positives = 307/417 (73%), Gaps = 12/417 (2%)
Query: 1 MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDG +R++V D A TVA+L+ +I+S+L +PV Q +S + LLL +PS +
Sbjct: 1 MILRIRSRDGTDRISVPDSASATVADLQRLIESRLTVPVPLQRISLDPALLL-PNPSAAV 59
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPS----FNPAGSFGRKMTMDDLIAK 115
+A+P PL+SL +++G+ ++L+Y + P+ + AGSFG+KMTMDDLIA+
Sbjct: 60 PL--LADPAAPLASLRLANGAFIYLAYPPDARSARPTPPKVLSAAGSFGKKMTMDDLIAR 117
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
Q+RVTRQEN C + SFDRD A+AFQ YV E LAFAVKR GF+YG + E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAEYLAFAVKRAGFLYGRIDAETKEVLVDFIY 177
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREV 231
EPPQQG E+V++++RD +EE VD IA GLGM++VG +FTQ + + +YT+SNREV
Sbjct: 178 EPPQQGFEDVVHLMRDPDEEARVDTIAEGLGMRRVGLVFTQAVGRRASKTGEYTMSNREV 237
Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
+QA + AE EWVTA+VKLEV ++G+ +HFEAFQMS++CV+LFK+G ETE+ + D
Sbjct: 238 VQAAQLQAEGGTPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297
Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSH 351
DP+LSKM+K+VV GGKD +VDNDFFLV VKI DHQGPLS FPIENR + V M AL+SH
Sbjct: 298 DPRLSKMRKEVVAGGKDTMKVDNDFFLVPVKISDHQGPLSVGFPIENRGSPVGMSALRSH 357
Query: 352 LNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
L R L V+RISDFHLLL LA FLD S++P +A CV+ Q+ +PEGY+ LI S A
Sbjct: 358 LERMKHLTFVRRISDFHLLLKLATFLDAKSEMPTIAACVKTQSRMPEGYQFLIESFA 414
>gi|356514356|ref|XP_003525872.1| PREDICTED: LOW QUALITY PROTEIN: NPL4-like protein 2-like [Glycine
max]
Length = 359
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/296 (64%), Positives = 234/296 (79%), Gaps = 1/296 (0%)
Query: 2 LLRIRSRDGLERVTVDGAH-VTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
++ I S D L+ V+++ H TV +LK II++QL IP+ +QTLSTNQNLLL KS DL
Sbjct: 1 MVIIHSHDNLKXVSIENRHRTTVLDLKQIIETQLGIPIHNQTLSTNQNLLLXKSLKDLHX 60
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAGSFGRKMTMDDLIAKQMRVT 120
F DMANPD PL +LN+SH SIVFL+Y+GER V P+FNP GSFGRKMT+D LIAK M T
Sbjct: 61 FIDMANPDAPLXALNLSHDSIVFLAYEGERRVTDPTFNPTGSFGRKMTVDHLIAKXMCTT 120
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQG 180
RQENPHCE VSFDRD A+AFQ VN+TLAFAVK FMY VL+ ++V+V+FIY+P QQG
Sbjct: 121 RQENPHCELVSFDRDYANAFQLCVNDTLAFAVKHDEFMYEIVLKVRKVKVDFIYKPHQQG 180
Query: 181 TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAE 240
+++ L RD +EEK V+AI GLGM +VGFIF QTI Q+KKDYTLSNREVLQAVE+H++
Sbjct: 181 SKDNLVFFRDPKEEKFVEAIMXGLGMMEVGFIFMQTITQDKKDYTLSNREVLQAVEYHSD 240
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLS 296
++EWVT V+KL+V+ E +HFEAFQ+SD+CVRLFKEGWF+TEI E DDPKLS
Sbjct: 241 SRLKEWVTIVIKLKVNGEMGVDVHFEAFQISDVCVRLFKEGWFQTEIKEDDDPKLS 296
>gi|414879509|tpg|DAA56640.1| TPA: hypothetical protein ZEAMMB73_509597 [Zea mays]
Length = 358
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 184/336 (54%), Positives = 248/336 (73%), Gaps = 12/336 (3%)
Query: 1 MLLRIRSRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
M+LRIRSRDG +R++V D A TVA+L+ +I+S L +PV Q +S + LLL +PS +
Sbjct: 1 MILRIRSRDGTDRISVPDSASATVADLQRLIESHLTVPVPLQRISLDPALLL-PNPSAAV 59
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPS----FNPAGSFGRKMTMDDLIAK 115
+A+P PL+SL +++G+ ++L+Y + P+ + AGSFG+KMTMDDLIA+
Sbjct: 60 PL--LADPAAPLASLRLANGAFIYLAYPPDARSARPTPPKVLSAAGSFGKKMTMDDLIAR 117
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIY 174
Q+RVTRQEN C + SFDRD A+AFQ YV E LAFAVKR GF+YG + E K V V+FIY
Sbjct: 118 QIRVTRQENALCAAASFDRDAANAFQLYVAEYLAFAVKRAGFLYGRIDAETKEVLVDFIY 177
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREV 231
EPPQQG E+V++++RD +EE VD IA GLGM++VG +FTQ + + +YT+SNREV
Sbjct: 178 EPPQQGFEDVVHLMRDPDEEARVDTIAEGLGMRRVGLVFTQAVGRRASKTGEYTMSNREV 237
Query: 232 LQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD 291
+QA + AE EWVTA+VKLEV ++G+ +HFEAFQMS++CV+LFK+G ETE+ + D
Sbjct: 238 VQAAQLQAEGGTPEWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLETEVQDAD 297
Query: 292 DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
DP+LSKM+K+VV GGKD +VDNDFFLV VKI DHQ
Sbjct: 298 DPRLSKMRKEVVAGGKDTMKVDNDFFLVPVKISDHQ 333
>gi|384245338|gb|EIE18832.1| NPL4-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 412
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 196/418 (46%), Positives = 267/418 (63%), Gaps = 16/418 (3%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
ML+R+RSRDGLER+ + H TVAELK I S L IPVE TLS N++LL A S
Sbjct: 1 MLIRLRSRDGLERIQLP-EHATVAELKKQINSNLSIPVEDITLSRNKDLLTALDKSG--- 56
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPA---GSFGRKMTMDDLIAKQM 117
F D++ P PL L I HG +VF+ Y ER V A +FG MT+D+L+AKQ
Sbjct: 57 FEDLSRPGTPLVKLGIQHGDVVFMHYPFEREVASTVPRTAYENRAFGTHMTVDELVAKQT 116
Query: 118 RVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPP 177
R+ RQE PH S SF+R ADAFQ YV +AF++KRGG +YG+V E+ +V V IYEPP
Sbjct: 117 RIERQEAPHAASASFERHAADAFQSYVQGAIAFSIKRGGLLYGSVDEEGKVTVEAIYEPP 176
Query: 178 QQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF 237
Q G+ E L + R EEE+ D +A LG++KVG+IF Q+ ++++ +S EV Q
Sbjct: 177 QSGSAESLQLERSTEEEERADFVAKQLGLQKVGWIFAQST--KEREFIMSAEEVCQMAAV 234
Query: 238 HAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFE--TEIAEGD---D 292
AE E VTAVV + V+EEG+ IHFEAFQ+SD VRL++EGWF TE + +
Sbjct: 235 QAELG-ERAVTAVVSMTVAEEGTPEIHFEAFQVSDQTVRLWQEGWFTNGTEFSGVSILRN 293
Query: 293 PKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHL 352
PK K V+V GKD E DND+ L+ V ILDH+GPL+++FP+ENR L+ HL
Sbjct: 294 PKEPDHKAPVIVAGKDQGEADNDYLLIPVNILDHEGPLTASFPVENRLLPQGKAELRHHL 353
Query: 353 NRSPSLPLVKRISDFHLLLFLARFLDLN-SDVPALAQCVQAQTAVPEGYKLLINSMAE 409
++ P +R++DFHLLL+LA+ +L+ +D+ + + V+ + VPEGY L+I+S+AE
Sbjct: 354 QKNRGKPYAERLADFHLLLYLAKQPNLDLADLAIIIEAVKTKGTVPEGYTLIIDSLAE 411
>gi|302853590|ref|XP_002958309.1| hypothetical protein VOLCADRAFT_69322 [Volvox carteri f.
nagariensis]
gi|300256334|gb|EFJ40602.1| hypothetical protein VOLCADRAFT_69322 [Volvox carteri f.
nagariensis]
Length = 421
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 179/423 (42%), Positives = 259/423 (61%), Gaps = 21/423 (4%)
Query: 1 MLLRIRSRDGLERVTVD-GAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLL 59
MLLR+RSRDGLER+ VD GA+++ LK I ++L IP++ LS LL +K P
Sbjct: 1 MLLRLRSRDGLERIEVDDGANLSA--LKAAINAKLAIPLDDMLLSKQPGLLTSKEP---Y 55
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAG---SFGRKMTMDDLIAKQ 116
F D+A D L S +SHG +V+L Y ER V P+ PAG FG MT++D++AK
Sbjct: 56 SFRDLAINDASLRSQGVSHGDMVYLLYSFERQVE-PAVRPAGRSKPFGVHMTLNDVMAKV 114
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEP 176
+++ RQE P ++SFDR+ A+ FQ Y+ F++KRGG +YG V +DK V+V+FIYEP
Sbjct: 115 IKIERQEKPLVSAISFDRNAANVFQSYLQGAFNFSIKRGGLLYGNVDDDKTVKVDFIYEP 174
Query: 177 PQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNK----KDYTLSNREVL 232
PQ+G+ + L + R EE+ VD +A LG +KVGFIF Q++ K DY ++ E++
Sbjct: 175 PQEGSSDKLQLQRHTPEEQQVDLLAQLLGYRKVGFIFCQSVKAQKAAAESDYIINTDELI 234
Query: 233 QAVEFHA-ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE----- 286
E E TA+V L E +HFEAFQ SD+ VRL +EGW
Sbjct: 235 MMAAMQVREGRGEHCTTALVTLVEEPETGPQVHFEAFQCSDLAVRLVREGWVVARDPVDG 294
Query: 287 IAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMR 346
++ +PK ++K+ V++ +D EVDND+FL V I DH G L+++FP+ENR T
Sbjct: 295 VSRMVNPKEPELKQPVMLNNRDTDEVDNDWFLCPVSIRDHDGRLTTSFPVENRLTPQGRT 354
Query: 347 ALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLN-SDVPALAQCVQAQTAVPEGYKLLIN 405
L+ HL R S P V+R+SDFHLLL+LA+ +L+ +D+ L + V+ V EGY+++I+
Sbjct: 355 ELREHLKRMSSRPYVERLSDFHLLLWLAKQPNLDPNDMALLCEAVREHRPVLEGYRVIID 414
Query: 406 SMA 408
S+A
Sbjct: 415 SIA 417
>gi|145346114|ref|XP_001417539.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577766|gb|ABO95832.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 416
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 182/425 (42%), Positives = 269/425 (63%), Gaps = 29/425 (6%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M++R+R+RDG ER+TV + +TVA+LK I++++L++P + Q LST+ ++L +K+P +
Sbjct: 1 MIVRVRTRDGTERITVPSSDITVAKLKEILETELKVPKDEQILSTSLDVLTSKTPE---E 57
Query: 61 FTDMANPDRPLSSLNI-SHGSIVFLSYDGERNVRGPSFNPA-------GSFGRKMTMDDL 112
F DMAN ++S I ++G++V L Y ER P+ A G GRKMT+DD+
Sbjct: 58 FKDMANGRAKIASCGIDTNGAVVHLMYGCERE---PTKTTAAVRDAARGLTGRKMTVDDM 114
Query: 113 IAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF 172
IA Q+R+ RQE PHC SVSFD + A+ FQ YVN TL F R G++YG +D + V+
Sbjct: 115 IAAQVRIERQEAPHCVSVSFDANAANVFQSYVNYTLGFKQVRFGWLYGARDDDGKTRVDA 174
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL 232
IYEP Q+GTEE + +L D E++ DA+A LGM +VG I + ++++D+TLS REV
Sbjct: 175 IYEPEQEGTEEDVEVLEDTLEDEKADALAGALGMIRVGCIISTP--KDERDFTLSAREVS 232
Query: 233 QAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIA---E 289
+ ++ +E+VTAVV E+ + + FEAFQ+SD +++ + GWF E +
Sbjct: 233 RVARLQSKFG-KEFVTAVVSTFEDEDDNWQVSFEAFQLSDAAMKVQERGWFTGECGVDED 291
Query: 290 GDDPK-LSKMKKDVVVGG----KDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQV- 343
GD+ K +SK+ +V+V DV VDND FLV V ILDH+G +S+TFP+ENR V
Sbjct: 292 GDELKGVSKLTNEVIVRDGPSINDVNAVDNDMFLVTVPILDHEGAISTTFPVENRLHPVQ 351
Query: 344 TMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLL 403
T+ LK L + P +R+ DFHLLLFLA+ +D+N ++ +A V VPEGY LL
Sbjct: 352 TVDDLKDALRKDK--PYGERLRDFHLLLFLAKHIDIN-ELAVIASAVHHGEDVPEGYTLL 408
Query: 404 INSMA 408
I+ +A
Sbjct: 409 IDCLA 413
>gi|303282283|ref|XP_003060433.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457904|gb|EEH55202.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 432
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 191/439 (43%), Positives = 260/439 (59%), Gaps = 43/439 (9%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
+LLRIR+RDG ER+TVD A+ T+ +L+ I+++ +P E QTLS NQ LLLA + ++
Sbjct: 3 LLLRIRTRDGTERLTVD-ANCTLRQLREKIETEYGVPQEDQTLSLNQGLLLANARAE--D 59
Query: 61 FTDMANPDRP-----LSSLNISHGSIVFLSYDGERNVRGPSFNPAGSF-----GRKMTMD 110
F D+A P R + L I HGSIV+LSY ER P+ A S G KMT++
Sbjct: 60 FDDLA-PARAKDAKLVRQLGIDHGSIVYLSYSVERE---PTPTAAASHRSELRGGKMTVE 115
Query: 111 DLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEV 170
+IA Q RV RQE P+C SVSFD A+AFQQYVN TL F R G+MYGTV + V V
Sbjct: 116 AMIAAQTRVERQETPNCASVSFDAHAANAFQQYVNGTLGFNQMRFGWMYGTVNDAGEVLV 175
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
+ IYEP Q+G+E+ I EE+ DA+A+ LGM+KVG +F T + +K +TLS E
Sbjct: 176 DAIYEPEQEGSEDAFIISEGSEEDARADALASALGMQKVGCVFNITTTKPRK-FTLSAFE 234
Query: 231 VLQAVEFHAECNMEE-----------------WVTAVVKLEVSEEGSAAIHFEAFQMSDM 273
V + + A +VTAVV + E+ + FE FQ+SD
Sbjct: 235 VREMAKRQAAAAAAAAAAAAADGAEKKDGKTTFVTAVVMMLEDEDEGPQVSFEPFQVSDQ 294
Query: 274 CVRLFKEGWFETEIAEGDDPKLSKMKKDVVVG---GKDVKEVDNDFFLVVVKILDHQGPL 330
CVRL+ +GWF E E D L++M K+V+V KDV EVDND +LV V I DH G L
Sbjct: 295 CVRLYADGWFHDEPCE--DRGLTRMTKEVIVADGVSKDVTEVDNDRWLVAVPIKDHSGAL 352
Query: 331 SSTFPIENRTTQV-TMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
S+ FPIENR + TM LK L++S + R+ DFHLLLFL++ L + D+ +A+
Sbjct: 353 STKFPIENRLHPIQTMEDLKDSLSQS-GMEYSARLRDFHLLLFLSKHLGTD-DLVDVAKT 410
Query: 390 VQAQTAVPEGYKLLINSMA 408
+ + EGYK++I+S+A
Sbjct: 411 ARRGGTIAEGYKIIIDSLA 429
>gi|308803090|ref|XP_003078858.1| Nuclear pore complex, rNpl4 component (sc Npl4) (ISS) [Ostreococcus
tauri]
gi|116057311|emb|CAL51738.1| Nuclear pore complex, rNpl4 component (sc Npl4) (ISS) [Ostreococcus
tauri]
Length = 412
Score = 298 bits (764), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 166/418 (39%), Positives = 261/418 (62%), Gaps = 19/418 (4%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M++R+R+RDG ERV + A TV ELK I+ +L++P + Q LST+ ++L +K PS
Sbjct: 1 MIVRVRTRDGTERVEIPSAASTVGELKDAIERELKVPRDDQILSTSPDVLTSKDPS---A 57
Query: 61 FTDMANPDRPLSSLNIS-HGSIVFLSY----DGERNVRGPSFNPAGSFGRKMTMDDLIAK 115
F DM++ L S+ + +G+IV Y + + + + G G KMT+D +IA
Sbjct: 58 FLDMSDDGDALESVGVKQNGAIVHARYTVCREPTKTIAATHNDARGLTGGKMTVDQMIAA 117
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYE 175
Q R+ RQE PHC SVSFD + A+ FQ YVN TL F R G++YGT ++ +V V IYE
Sbjct: 118 QTRIERQEAPHCVSVSFDANAANVFQSYVNYTLGFKQVRFGWLYGTRDDENKVRVEAIYE 177
Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAV 235
P Q+GTE+ + +L + +E+ + +A+ +G+ +VG I + ++++D+TLS +EV +
Sbjct: 178 PEQEGTEDDVEVLENTQEDHNAECLASSMGLTRVGCIISTP--KDERDFTLSAKEVARIA 235
Query: 236 EFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIA---EGDD 292
++ + + TAV+ E+ + + FEAFQ+SD+ ++++ GWF + A G+D
Sbjct: 236 RLQSKYG-QHFATAVMMTFEDEDDNWQVSFEAFQLSDLAIKIYDRGWFTGKCAVDENGED 294
Query: 293 PK-LSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQV-TMRALKS 350
K +SK+ +V+V KDV +VDNDFFLV V ILDH+GPLSSTFP+ENR + T+ L+
Sbjct: 295 LKGVSKLTNEVMVERKDVFDVDNDFFLVTVPILDHEGPLSSTFPVENRLHPIQTVDDLRE 354
Query: 351 HLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
L + P R+ DFHLLLFLA+ +D+N ++ A++ +PEGY+L+I+ +A
Sbjct: 355 ALQKDK--PYGDRLRDFHLLLFLAKHIDIN-ELVAISGAAHEGEEIPEGYRLMIDCLA 409
>gi|255085502|ref|XP_002505182.1| predicted protein [Micromonas sp. RCC299]
gi|226520451|gb|ACO66440.1| predicted protein [Micromonas sp. RCC299]
Length = 395
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 169/391 (43%), Positives = 244/391 (62%), Gaps = 18/391 (4%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
ML+R+R+RDG ERV VD T+++L+T+I+ + +P Q LS +Q LL++K P
Sbjct: 1 MLIRLRTRDGTERVEVDDK-CTLSQLRTVIEEKFGVPTSDQRLSLDQGLLMSKDPH---A 56
Query: 61 FTDMA----NPDRPLSSLNISHGSIVFLSYDGERNVR---GPSFNPA-GSFGRKMTMDDL 112
+ D+ N +PL SL I HG+IV+++Y ER V + N A G G KMT++ +
Sbjct: 57 YDDLKPGRKNDGKPLKSLKIEHGTIVYMTYSVERKVTVCTAAAHNDARGLTGGKMTVEAM 116
Query: 113 IAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF 172
IA Q R+ QE+P+ ES+S D A+AFQ YVN TL F R G+MYGT +D V V+
Sbjct: 117 IAAQTRIETQESPNVESLSLDGHAANAFQAYVNGTLGFNQMRFGWMYGTCSDDGEVRVHA 176
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL 232
IYEP Q+GTE+ ++ D +E+ +AIA LGM +VG +F T + + DYT+S E+
Sbjct: 177 IYEPEQEGTEDAFVVVEDADEDARAEAIAESLGMTRVGMVFNVTTTKERPDYTMSAFEIR 236
Query: 233 QAVEFHAECNMEEWVTAVVK-LEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG- 290
Q ++ ++ +++VTAVV LE +EG+ + FE FQ+S+ CVRL+ EGWF E E
Sbjct: 237 QMAKYQSKYG-DKFVTAVVMMLEDEDEGTPQVSFEPFQVSNQCVRLYSEGWFHDEPCEKP 295
Query: 291 DDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQV-TMRALK 349
+L+K V KDV EVDND +LV VKILDH+GP ++FPIENR V T+ LK
Sbjct: 296 GHTRLTKDVVVVDKVSKDVTEVDNDRWLVPVKILDHEGPWRTSFPIENRLHPVQTVEDLK 355
Query: 350 SHLNRSPSLPLVKRISDFHLLLFLARFLDLN 380
+++ + +++DFHLLLFL++ LD N
Sbjct: 356 DAVSKGGQYCV--KLADFHLLLFLSKHLDAN 384
>gi|414589343|tpg|DAA39914.1| TPA: hypothetical protein ZEAMMB73_187365 [Zea mays]
Length = 392
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 131/222 (59%), Positives = 170/222 (76%), Gaps = 4/222 (1%)
Query: 110 DDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRV 168
++LI +Q+RVTRQEN C + SFDRD A+AFQ Y + L F VKR GF+YG V E K V
Sbjct: 18 ENLITRQIRVTRQENALCAAASFDRDAANAFQLYDAKFLDFGVKRAGFLYGRVDAETKDV 77
Query: 169 EVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YT 225
V+FIYEPPQQG+E+V+++LRD +EE VD IA GLGM+ VG +FTQ + + + YT
Sbjct: 78 FVDFIYEPPQQGSEDVVHLLRDPDEEARVDTIAEGLGMRWVGLVFTQAVGRKPSETGEYT 137
Query: 226 LSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET 285
+SNREV+QA + AE + +WVTA+VKLEV ++G+ +HFEAFQMS++CV+LFK+G ET
Sbjct: 138 MSNREVMQAAQLQAEGGIPKWVTAIVKLEVGDDGTGDVHFEAFQMSEICVKLFKDGVLET 197
Query: 286 EIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
E+ + DDP+LSKM+K+VV GGKD EVDNDFFLV VKI DHQ
Sbjct: 198 EVQDADDPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQ 239
>gi|307108993|gb|EFN57232.1| hypothetical protein CHLNCDRAFT_143718 [Chlorella variabilis]
Length = 564
Score = 261 bits (668), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 166/422 (39%), Positives = 224/422 (53%), Gaps = 57/422 (13%)
Query: 7 SRDGLERVTV-DGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQFTDMA 65
+RDGLER+ V DG+ T+A LK IQ +L +P+ LS N LL AK+ + F+DM
Sbjct: 178 NRDGLERIEVEDGS--TLAGLKLAIQQKLGVPLADMLLSNNAQLLTAKNAAG---FSDML 232
Query: 66 NPDRPLSSLNISHGSIVFLSYDGERNVRGPSFNPAG----SFGRKMTMDDLIAKQMRVTR 121
+ L L + HG +VFL Y ER V P F P FG M + ++A Q R+ R
Sbjct: 233 DNSASLRQLGVHHGDMVFLLYHFERQVE-PVFRPKDWERRPFGAHMDVAAMVAAQTRIER 291
Query: 122 QENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVL--------------EDKR 167
Q+ PH S SFD +AFQ YV+ + F++KRGG M+ V E+
Sbjct: 292 QDTPHAASASFDFATTNAFQSYVSSAIVFSIKRGGIMWWYVRCEGGWANTGQDRMDEEGA 351
Query: 168 VEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLS 227
V VN +YEPPQQG+ + L + R EE+ D IA LG +KVG+IF Q+ ++D+ ++
Sbjct: 352 VFVNAVYEPPQQGSADSLSLERGTPEERQADFIAERLGWRKVGWIFAQST--KERDFIMT 409
Query: 228 NREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI 287
EV Q E E VTAVV E+G +HFE E
Sbjct: 410 GEEVCQMAGIQDEMG-EHAVTAVVATWPGEDGQPEVHFE-------------------EP 449
Query: 288 AEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRA 347
E V+V GKD+ EVDND+FLV V + DH+GPL + FP+ENR M
Sbjct: 450 GE---------AAPVIVAGKDLGEVDNDYFLVPVAVRDHEGPLETKFPLENRLLPQGMPE 500
Query: 348 LKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS-DVPALAQCVQAQTAVPEGYKLLINS 406
LK+HL S P RI+DFHLLLFLAR +L +V L V+ + VPEG++++I+S
Sbjct: 501 LKAHLQARRSAPYWARIADFHLLLFLARQPNLEEPEVGLLVDAVRDKAPVPEGFQIIIDS 560
Query: 407 MA 408
MA
Sbjct: 561 MA 562
>gi|440794522|gb|ELR15682.1| NPL4 family protein [Acanthamoeba castellanii str. Neff]
Length = 392
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 153/405 (37%), Positives = 224/405 (55%), Gaps = 38/405 (9%)
Query: 22 TVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQFTDMANPDRPLSSLNISHGSI 81
TV +LK +Q + IPV +QTLS L ++ ++L + AN LSSL I HG +
Sbjct: 6 TVQDLKKKVQQKTLIPVANQTLS------LDRAGKEVL--SSGANT---LSSLGIRHGQM 54
Query: 82 VFLSYDGER---NVRGPS--FNPAGSFGRKMTMDDLIAKQMRVTRQENPHCESVSFDRDC 136
VFLSY GER N P N + K M+ L ++ + QE HC+ S D
Sbjct: 55 VFLSYTGERPINNDEDPKKLRNIKKKWDLKSLMEHLDSQGLVFKAQETTHCKVASLDFAS 114
Query: 137 ADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKL 196
D FQQY E AF +R G++YG LED V +IYEPPQ+ + +L D EEKL
Sbjct: 115 CDKFQQYPRE-FAFLKRREGYLYGKYLEDGNVTAEYIYEPPQEQSPSGTILLDDPAEEKL 173
Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTA-----VV 251
VD IA LG++KVG+IFT +++DY +S+ EV++A + +AE ++++ A V
Sbjct: 174 VDTIANALGVQKVGWIFT-VPPNDERDYIMSSHEVVRAAQQNAEA-IKKYGDAGKSFIAV 231
Query: 252 KLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVV-----GG 306
K+ EEG +HFE FQ++D V+L + + DPK+ K++V + G
Sbjct: 232 KVSTDEEGQ--VHFEPFQVTDQAVKLVQNDLLKPP----KDPKVVVAKQEVAMARQGAGR 285
Query: 307 KDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQV-TMRALKSHLNRSPSLPLVKRIS 365
KD E+D D + V +++H+GPLS F IENR M ALK HL + P + ++
Sbjct: 286 KDSPELDPDVLIKPVAVVNHEGPLSCVFEIENRPEHPQNMAALKRHLLQFKGKPFLLAVT 345
Query: 366 DFHLLLFLAR-FLDLNSDVPALAQCVQ-AQTAVPEGYKLLINSMA 408
DFH + +L + FL +D+PAL + V+ + EG+KLLI+S+A
Sbjct: 346 DFHFIRYLGKEFLSPETDIPALCEAVKTGDSDALEGFKLLIDSIA 390
>gi|325185911|emb|CCA20415.1| NPL4like protein putative [Albugo laibachii Nc14]
Length = 429
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 144/446 (32%), Positives = 213/446 (47%), Gaps = 53/446 (11%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
ML+R+RS++G R+ TV +LK ++ I +Q + SD LQ
Sbjct: 1 MLVRVRSKNGTWRIDNLSPQSTVTDLKECVEKIHGIASHNQRYHLDNQGKAPVHESDTLQ 60
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRG-------------------------P 95
S+ + HG +++L +DG+ V G
Sbjct: 61 ------------SIRVGHGDLIYLQHDGDTIVTGRTAVSRKVNADGSITHVEYHVVNGKK 108
Query: 96 SFNPAGSFGRKMTMDDLIAKQMRVTR--------QENPHCESVSFDRDCADAFQQYVNET 147
SF P R + M I + MR+ QE H +VS D+ D FQ Y+ +
Sbjct: 109 SFRPGLKALRDIKMHWTIGEFMRMDEEFEFKLKAQEKAHVANVSLDKSSCDNFQSYLRQ- 167
Query: 148 LAFAVKRGGFMYGTV-LEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
AF R G++YG + E K+V V+FIYEPPQ+GT + D EK DA+A LG
Sbjct: 168 FAFQQSRCGWLYGNIDQESKQVTVDFIYEPPQEGTPHDFQVFEDPLGEK-ADAVAKALGR 226
Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFE 266
+KVG++F+ ++ D+ S+ E+L A + + + + +K+ + +EG FE
Sbjct: 227 EKVGWVFSHPPREDA-DFHFSSNEILMAAQLQLDAGGKGSLFVSLKVTLDKEGQTT--FE 283
Query: 267 AFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDH 326
AFQ+SD C+ +F G E A +V GK EVDN+FFL VV + H
Sbjct: 284 AFQVSDQCLDMFAAGALSAEHASNSKVCAISETYTAIVEGKTATEVDNNFFLCVVPVSTH 343
Query: 327 QGPLSSTFPIENRTTQVTMR-ALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPA 385
L S FP +R +R AL+ HL + S P ++R+ DFHLLLFL LD N+DVP
Sbjct: 344 DSGLKSEFPRPHREIPRPLREALRRHLQKYRSEPYIERLKDFHLLLFLTEVLDPNTDVPT 403
Query: 386 LAQCV-QAQTAVPEGYKLLINSMAET 410
+ Q + Q A+ GY +LI+S+A T
Sbjct: 404 ICQSIMDPQIALDSGYPILIDSVAGT 429
>gi|348686646|gb|EGZ26461.1| hypothetical protein PHYSODRAFT_345180 [Phytophthora sojae]
Length = 428
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 156/447 (34%), Positives = 226/447 (50%), Gaps = 61/447 (13%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRI-PVESQTLSTNQNLLLAKSPSDLL 59
ML+R+RS++G RV T+A++K I Q I P + Q Q A++ SD
Sbjct: 1 MLVRVRSKNGTWRVGDLTPSSTIADIKRWIFEQHSIAPAQQQLALDQQG---AQTTSDT- 56
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGER----NVRGPSFNPAGSFGR---------- 105
+ L L + HG ++FL YDGE V G N G+ R
Sbjct: 57 ---------QTLRELRVGHGDMLFLDYDGETLGTGGVVGTKINADGTLTRVAYHDRADKQ 107
Query: 106 ----------KMTMDDLIAKQMR--------VTRQENPHCESVSFDRDCADAFQQYVNET 147
M M + + MR V+ Q+ P V D + FQ Y+ +
Sbjct: 108 AFRPGMKSLRDMKMYWTLGEFMRMDAQFEFKVSAQKGPQVAKVHLDGASCNDFQAYLRQ- 166
Query: 148 LAFAVKRGGFMYGTV-LEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
AF R G++YG+V E K V V+FIYEPPQ+G +L D +K VDA+A LG
Sbjct: 167 FAFQQSRCGWLYGSVDAETKEVTVDFIYEPPQEGNPYGFEVLDDPNADK-VDAVAEALGY 225
Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFE 266
+KVG++F+ ++ D+ S REVL A + A+ + + +K+ + +EG A+ FE
Sbjct: 226 EKVGWVFSHPPREDD-DFHFSAREVLLAAQLQADAGGKGSLFLTLKVTLDKEGQAS--FE 282
Query: 267 AFQMSDMCVRLFKEGWFETEIAEGD-DPKLSKMKKD--VVVGGKDVKEVDNDFFLVVVKI 323
AFQ+SD CV +F G +AE + +P+ + +V K EVDN+FFL VV +
Sbjct: 283 AFQVSDQCVEMFAAG----ALAENEQNPQACAVSDTFTALVEAKPATEVDNNFFLCVVPV 338
Query: 324 LDHQGPLSSTFPIENRTTQVTMRA-LKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
+ H+ PL S FP NR RA LK+HL + P KRISDF LL+FL+ FLDL +D
Sbjct: 339 VPHESPLRSEFPKLNREGSYRSRAALKAHLQKYRDEPYAKRISDFQLLVFLSDFLDLRTD 398
Query: 383 VPALAQCVQ-AQTAVPEGYKLLINSMA 408
+PA+ Q V+ + GY +LI+S+A
Sbjct: 399 IPAICQAVRDPNIPLDSGYPILIDSVA 425
>gi|301103316|ref|XP_002900744.1| NPL4-like protein [Phytophthora infestans T30-4]
gi|262101499|gb|EEY59551.1| NPL4-like protein [Phytophthora infestans T30-4]
Length = 428
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 151/445 (33%), Positives = 221/445 (49%), Gaps = 57/445 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
ML+R+RS++G RV T+A++K I Q I Q L+ +Q Q
Sbjct: 1 MLIRVRSKNGTWRVGDLTPSSTIADIKRWIFEQHSIAPARQQLALDQQ--------GAQQ 52
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGER-----------NVRGP-------------S 96
TD + L L + HG ++FL YDGE N G +
Sbjct: 53 TTDT----QTLRELRVGHGDMLFLDYDGETVSTGGVVGTKINADGTLTHVAYHGRADKQA 108
Query: 97 FNPAGSFGRKMTMDDLIAKQMR--------VTRQENPHCESVSFDRDCADAFQQYVNETL 148
F P R M M + + MR ++ Q+ P V D + FQ Y+ +
Sbjct: 109 FRPGMKSLRDMKMYWTLGEFMRMDAQFEFKISAQKEPQVAKVHLDGASCNDFQAYLRQ-F 167
Query: 149 AFAVKRGGFMYGTV-LEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMK 207
AF R G++YG+V E K V V+FIYEPPQ+G +L D +K VDA+A LG +
Sbjct: 168 AFQQSRCGWLYGSVDSETKEVTVDFIYEPPQEGNPYGFEVLDDPNADK-VDAVAEALGYE 226
Query: 208 KVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEA 267
KVG++F+ ++ D+ S RE L A + A+ + + +K+ + ++G A+ FEA
Sbjct: 227 KVGWVFSHPPREDD-DFHFSARETLLAAQLQADAGGKASLFLTLKVTLDKQGQAS--FEA 283
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKD--VVVGGKDVKEVDNDFFLVVVKILD 325
FQ+SD CV +F G + +PK + +V K EVDN+FFL VV ++
Sbjct: 284 FQVSDQCVEMFAAGAL---MENEHNPKACAVTDTFTALVEAKRATEVDNNFFLCVVPVVP 340
Query: 326 HQGPLSSTFPIENRTTQVTMRA-LKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
H+ PL FP NR RA LKS L + + P KRISDF LL+FL+ FLD+ +D+P
Sbjct: 341 HESPLRCEFPKLNREGAYRNRAALKSQLQKYRNEPYAKRISDFQLLVFLSDFLDVQTDIP 400
Query: 385 ALAQCVQ-AQTAVPEGYKLLINSMA 408
A+ Q V+ T + GY +LI+S+A
Sbjct: 401 AICQAVRDPNTPLDSGYPILIDSVA 425
>gi|412988215|emb|CCO17551.1| predicted protein [Bathycoccus prasinos]
Length = 499
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 158/503 (31%), Positives = 247/503 (49%), Gaps = 103/503 (20%)
Query: 1 MLLRIRSRDGLERV-TVDGAH-------VTVAELKTIIQSQ-LRIPVESQ---------- 41
+LLRIR+RDG ER+ DGA T+++++ +I Q +PVE+Q
Sbjct: 2 VLLRIRTRDGTERLECADGAGGGENDVVKTISDVQDLISKQPWGVPVENQSLSLDPSLLL 61
Query: 42 -TLSTNQNLLLAKSPSDLLQFTDMANPDRPLSSLNISHGSIVFLSYDG--ERNVRGPSFN 98
T S N + AK+ +++ + +P + L SL + HG IV+++Y ER V N
Sbjct: 62 RTSSENADETKAKTKKEVV----VLSPRKTLKSLQLKHGDIVYMTYPDSIEREVTKMRVN 117
Query: 99 -PAGSFGRKMTMDDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGF 157
FG K T++++I+K++R+ RQE + SVSF A+ FQ YVN TL F R G+
Sbjct: 118 VETKPFGAKRTIEEMISKEVRIGRQERGNVASVSFCGHAANMFQSYVNNTLGFKQSRFGW 177
Query: 158 MYGTVLEDKRVE-----------------------------------VNFIYEPPQQGTE 182
+YGT + K+ + + IYEP Q G
Sbjct: 178 LYGTKTKRKKEKTEEEKEKEKSKDASSKKKEKDEEEGKEDEMITELFADVIYEPNQNGNA 237
Query: 183 EVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDY--TLSNREVLQAVEFHAE 240
V ++ ++EE + +AIA +G +++G +F+Q+ + + +S EV + A+
Sbjct: 238 YVAHVDEEDEESQTAEAIANAMGYERIGCVFSQSDVDRGDESVSAMSAHEVTLCAKLQAK 297
Query: 241 CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG---------- 290
+E+VT +V E+ + FEAFQ+SD CV+L+++G+FE + G
Sbjct: 298 YG-KEFVTVIVVQFEDEDEEMQVTFEAFQVSDQCVKLYEDGFFELDEKTGVAKLPASSSA 356
Query: 291 -----------------DDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSST 333
+ +K+ K V+V KDV EVDNDF++V V I DH G S
Sbjct: 357 GITKEEIEEEEEGSSFSNWGSHTKLVKPVMVERKDVTEVDNDFWIVAVPIKDHTGTAQSG 416
Query: 334 FPIENR-----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQ 388
FPIENR T A++S S +++ DFHLLLFL++ +D N DV A+A+
Sbjct: 417 FPIENRLHPPQTADDIRSAIQSRSATCSS--YAEKLRDFHLLLFLSKQMDKN-DVLAIAE 473
Query: 389 CVQAQT---AVPEGYKLLINSMA 408
V EGYK+LI+++A
Sbjct: 474 STNDYIDGGDVSEGYKVLIDAIA 496
>gi|255642006|gb|ACU21270.1| unknown [Glycine max]
Length = 114
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 88/113 (77%), Positives = 100/113 (88%)
Query: 298 MKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPS 357
MKKDVVVG KD +EVDNDFFLVVVKI DHQGPLSS+FPIENR TQV M+ALK+HL+R+ +
Sbjct: 1 MKKDVVVGVKDTREVDNDFFLVVVKIADHQGPLSSSFPIENRNTQVPMKALKNHLDRTKN 60
Query: 358 LPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMAET 410
LP VKRISDFHLLL LAR LDLN+DVPAL +CVQ QT+VPEGY++LI SMA T
Sbjct: 61 LPFVKRISDFHLLLVLARVLDLNADVPALTECVQTQTSVPEGYQILIESMAST 113
>gi|296086647|emb|CBI32282.3| unnamed protein product [Vitis vinifera]
Length = 114
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 91/111 (81%), Positives = 103/111 (92%)
Query: 298 MKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPS 357
MKKDVVVG KD +EVDNDFFLV+VKILDHQGPLSSTFPIENR TQVTMRALK+HL+R+ +
Sbjct: 1 MKKDVVVGVKDTREVDNDFFLVLVKILDHQGPLSSTFPIENRITQVTMRALKNHLDRAKN 60
Query: 358 LPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
LP VKRISDFHLLL L+RFLD+NSDVPALA+CVQ Q+AVPEGY+LLI S+A
Sbjct: 61 LPFVKRISDFHLLLLLSRFLDINSDVPALAECVQTQSAVPEGYQLLIESLA 111
>gi|452823734|gb|EME30742.1| nuclear import protein NPL4-like protein [Galdieria sulphuraria]
Length = 466
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 171/316 (54%), Gaps = 33/316 (10%)
Query: 114 AKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFI 173
A + + QE + SVS D A +FQ + ++ F +R GF+YG + + V V+ I
Sbjct: 158 AMKFHIKAQEESYTNSVSVDHAAAFSFQANLV-SIGFQQQRLGFLYGRIEDSSFVFVDCI 216
Query: 174 YEPPQQGTEEVLYILRDEEE-------EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTL 226
YEPPQ GT ++ Y L EE +K D +A+ LG++KVG+IF+ +
Sbjct: 217 YEPPQSGTYQI-YKLESPEENPDAAESKKRADKLASLLGLEKVGWIFSS----KPRKCIF 271
Query: 227 SNREVLQAVEFHAECNMEEW-------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFK 279
S ++L A +F +C EE + + V+EEG FEA+Q+SD+C+ ++K
Sbjct: 272 SAVDILTACQF--QCEAEEMHGKDVSKTFVTITVSVTEEG--MTRFEAYQISDLCLDMYK 327
Query: 280 EGWFETEIAEGDDPKLSKM--KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-SSTFPI 336
F E P K+ K++V+V G + + +D DFFLV V I DH L ++ FP+
Sbjct: 328 ASVFAP--VEQQKPNSGKIRTKEEVIVEGVECRSIDTDFFLVSVPIKDHHSWLKNNNFPV 385
Query: 337 ENR-TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQ-- 393
ENR V L S L +S P ++RISDFHLLL+++ LD++ D+P L V+ Q
Sbjct: 386 ENREQAPVQPSDLSSCLGKSLDEPFIRRISDFHLLLYISNVLDMDGDMPLLCNAVREQRE 445
Query: 394 -TAVPEGYKLLINSMA 408
T EGY+L+I ++A
Sbjct: 446 LTDEEEGYRLMIEAIA 461
>gi|449016512|dbj|BAM79914.1| similar to nuclear import protein NPL4 [Cyanidioschyzon merolae
strain 10D]
Length = 485
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 115/331 (34%), Positives = 166/331 (50%), Gaps = 40/331 (12%)
Query: 105 RKMTMDDL---IAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT 161
+ MT+D L A+ R+ Q+ C S DR ADAFQ Y+ + + R G +YG
Sbjct: 166 KGMTVDYLEWLDAQHFRLRSQDESPCSQCSIDRQAADAFQSYLLR-IHDSQMRYGILYGD 224
Query: 162 VL-EDKRVEVNFIYEPPQQG-TEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ 219
V+ + V+V I+EPPQ G + L + E + D +A LGM++VG+IFT
Sbjct: 225 VVASEGLVKVEAIFEPPQAGDADRYLPLPESEVAIREADTLAHYLGMRRVGWIFTHRA-- 282
Query: 220 NKKDYTLSNREVLQAVEFHAECNME---------------EWVTAVVKLEVSEEGSAAIH 264
+ + L R+V A + E +W V + VS + H
Sbjct: 283 EDRSHALHARDVAYAAKLQLETTAYHLAEQERDQTPEASFDWSRYFVTIAVSIQPDGRRH 342
Query: 265 FEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKIL 324
FEA+Q+SD C+ + + G + D +L + +VVV G V+ VD DFFLV+V I
Sbjct: 343 FEAYQLSDQCLLMAQAGIL---MPVSDSGELHTLD-EVVVEGNRVRRVDTDFFLVLVPIR 398
Query: 325 DHQGPLSSTFPIENRTTQV-----TMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDL 379
DHQG L + FPIENR + R L+S + +RI+DFHLLLFL+R L +
Sbjct: 399 DHQGWLRTAFPIENREVEPQSITDVQRVLESGYGK-----FTERIADFHLLLFLSRILGM 453
Query: 380 NSDVPALAQC-VQAQTAVPE--GYKLLINSM 407
SD+P LA V+ + P GY+LLI S+
Sbjct: 454 ESDMPLLANAIVERRDLEPAELGYQLLIESL 484
>gi|330840932|ref|XP_003292461.1| hypothetical protein DICPUDRAFT_83074 [Dictyostelium purpureum]
gi|325077301|gb|EGC31023.1| hypothetical protein DICPUDRAFT_83074 [Dictyostelium purpureum]
Length = 572
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 100/286 (34%), Positives = 155/286 (54%), Gaps = 21/286 (7%)
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYE 175
+M++ Q+NPHC D A+ FQQYV ++ F V+R GF+YG +L+D V ++ IYE
Sbjct: 278 KMKLKSQDNPHCPGALVDFTSANIFQQYV-QSKKFEVQRVGFLYGNILQDGSVVIDSIYE 336
Query: 176 PPQQGTE-EVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQA 234
PPQ+ + + + IL D EK V++IA+ LGM +VG+IF+ + Y +S+++++QA
Sbjct: 337 PPQECKDGQTVTILNDPRGEK-VESIASMLGMTRVGWIFSHP----SRKYVMSSKDIIQA 391
Query: 235 VEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
F N A + L + EG + I EAFQ+SD + L K G F + DP
Sbjct: 392 AAFQ---NKYGSSFATLILSANTEGQSNI--EAFQVSDQALALEKRGEF---LQNQPDPT 443
Query: 295 LSKMKKDVVVGGKDVKEVDNDFFLVVV--KILDHQGPLSSTFPIENRTTQVTMRALKSH- 351
K+K V G + DN+FF+V V K + + FP+ENR T+ L S+
Sbjct: 444 KCKLKTAVFEEGTETNNADNNFFIVTVPLKAREDKSIFKCNFPVENRQPGTTISDLASYK 503
Query: 352 LNRSPSLPLVKRISDFHLLLFL--ARFLDLNSDVPALAQCVQAQTA 395
L + PL + SDFH L+FL D + D P + + +Q +++
Sbjct: 504 LELKDTSPL-EFFSDFHFLIFLLENELFDFSVDFPVICESIQKRSS 548
>gi|218188252|gb|EEC70679.1| hypothetical protein OsI_02004 [Oryza sativa Indica Group]
Length = 124
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 70/102 (68%), Positives = 84/102 (82%)
Query: 226 LSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET 285
+SNREVLQA E AE + EWVTA+VKLEV ++GS +HFEAFQMS++CV+LFK+G ET
Sbjct: 1 MSNREVLQATELQAEGGIPEWVTAIVKLEVGDDGSGDVHFEAFQMSEICVKLFKDGVLET 60
Query: 286 EIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
EI + DDP+LSKM+K+VV GGKD EVDNDFFLV VKI DHQ
Sbjct: 61 EIGDKDDPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQ 102
>gi|281209275|gb|EFA83448.1| nuclear protein localization 4 [Polysphondylium pallidum PN500]
Length = 602
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 161/284 (56%), Gaps = 18/284 (6%)
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYE 175
+M++ Q+ HC + A+ FQ YV+ + F +R GF+YG L+D V V+ IYE
Sbjct: 311 KMKLKSQDAAHCPGAAIAVTPANEFQHYVS-SRKFEEQRIGFLYGNFLQDGSVSVDVIYE 369
Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAV 235
PPQ+G ++ + +L D+ +K ++++A+ LGM +VG+IF+ + Y +S+ E+LQA
Sbjct: 370 PPQKGDKKSVTLLEDKSIDK-IESLASLLGMTRVGWIFSHP----SRKYVMSSSEILQAA 424
Query: 236 EFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKL 295
+ + +VT + L V+ EG++ + EAFQ+SD ++L K G F++ +E P +
Sbjct: 425 NYQNKFG-NSFVTLI--LSVNSEGNS--NLEAFQVSDQAMKLEKTGEFQSNQSE---PTI 476
Query: 296 SKMKKDVVVGGKDVKEVDNDFFLVV--VKILDHQGPLSSTFPIENRTTQVTMRALKSHLN 353
K+K V+V G + DN+FF+V VK + + L+ FP+ NR + L S+
Sbjct: 477 CKLKSPVLVEGSETLNADNNFFIVTVPVKAREDKPVLNCVFPVNNRNPPTSDSDLASYKI 536
Query: 354 RSPSLPLVKRISDFHLLLFL--ARFLDLNSDVPALAQCVQAQTA 395
+ + SDFH L++L DLN+D+P + V+++T+
Sbjct: 537 DNQDQSKLTFFSDFHFLVYLLMHGIFDLNTDLPLICDNVRSRTS 580
>gi|299117192|emb|CBN75156.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 441
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 126/451 (27%), Positives = 197/451 (43%), Gaps = 57/451 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M++R+R+ G+ RV T+ EL+ + ++ + T Q L L +P
Sbjct: 1 MIIRVRTHAGVWRVNDVAPDTTIKELRQRLSTEHNANLSD---DTRQPLTLKPNPRGD-- 55
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYD------GERNVRGP------------------- 95
D + L SL + HG ++ L+ D GP
Sbjct: 56 -QDTLALESTLQSLGLGHGDMIHLNLDESIRDMAHEEAGGPKRINKDGTIEQQSFDEISN 114
Query: 96 --SFNPAGSFGRKMTMDDLIAK--------QMRVTRQENPHCESVSFDRDCADAFQQYVN 145
F P R M M +A R+ + E C VS D ++FQ +V
Sbjct: 115 KTGFRPGMMSLRSMKMKWTLADFTEMNDQFTFRLKKPEKGVCTKVSLDSAACNSFQSFVR 174
Query: 146 ETLAFAVKRGGFMYGTVLEDK-RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
+ F R G++YG +D +V V +YEPPQ+ E + D + + V+A+A L
Sbjct: 175 Q-FGFHRARMGYLYGHFTDDDTKVRVECVYEPPQENYPEGFQVSEDPKADT-VEALAGLL 232
Query: 205 GMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIH 264
G+K+VG+IF ++ + S+ EV+ A E T V ++V+ E + H
Sbjct: 233 GLKRVGWIFAHP--PREEGFLFSSAEVVTAATLQLEAADGVNDTPFVTVKVTAEEDGSAH 290
Query: 265 FEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKD--VVVGGKDVKEVDNDFFLVVVK 322
F+AFQ+S C+ + EG E G++P + K +V K+ KEVD FL V
Sbjct: 291 FDAFQVSKQCMEMVAEGALEV----GENPGHCMVPKTFTAIVEMKEAKEVDTTMFLNTVP 346
Query: 323 ILDHQ-GPLSSTFPIENRTTQV-TMRALKSHLNRS--PSLPLVKRISDFHLLLFLARFLD 378
I H+ FP NR + T +K L R+ + +SDFHLLLFL FLD
Sbjct: 347 IEQHESAKFLHDFPRANRDGVMQTWDDVKRQLGRAGGQGFTYMDVLSDFHLLLFLTAFLD 406
Query: 379 LNSDVPALA-QCVQAQTAVPEGYKLLINSMA 408
+ +D+P + V + + +G+K LIN+ A
Sbjct: 407 MQTDMPKICDSVVDREKPLDDGFKFLINNFA 437
>gi|159163292|pdb|1WF9|A Chain A, Solution Structure Of A Novel Beta-Grasp Fold Like Domain
Of Hypothetical Protein (Arabidopsis Thaliana)
Length = 107
Score = 144 bits (363), Expect = 8e-32, Method: Composition-based stats.
Identities = 64/93 (68%), Positives = 85/93 (91%)
Query: 2 LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQF 61
+LR+RSRDGLERV+VDG H+TV++LKT+IQ QL+IP+ +QTLSTN+NLLLAKSPSD L F
Sbjct: 9 MLRVRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAKSPSDFLAF 68
Query: 62 TDMANPDRPLSSLNISHGSIVFLSYDGERNVRG 94
TDMA+P+ +SSLN++HGS+V+L+Y+GER +RG
Sbjct: 69 TDMADPNLRISSLNLAHGSMVYLAYEGERTIRG 101
>gi|413922638|gb|AFW62570.1| hypothetical protein ZEAMMB73_980053 [Zea mays]
Length = 502
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 69/121 (57%), Positives = 90/121 (74%), Gaps = 4/121 (3%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIYE 175
+RVTRQEN C + SFDRD A+AFQ YV E L F VKR GF+YG V +E K V V+FI E
Sbjct: 324 IRVTRQENALCTAASFDRDAANAFQLYVAEFLGFGVKRAGFLYGRVDVETKDVFVDFIDE 383
Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREVL 232
PPQQG+E+V+++LRD +EE VD IA GLGM++VGF+ TQ + + +YT+SN+EV+
Sbjct: 384 PPQQGSEDVVHLLRDPDEEARVDTIAEGLGMRRVGFVLTQAVGRKASETGEYTMSNQEVM 443
Query: 233 Q 233
Q
Sbjct: 444 Q 444
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 89/152 (58%), Gaps = 42/152 (27%)
Query: 179 QGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI---MQNKKDYTLSNREVLQAV 235
QG+E+++++LRD +EE VD IA GLGM++VG +FTQ + +YT+SNRE
Sbjct: 124 QGSEDMVHLLRDPDEEARVDTIAEGLGMRRVGLVFTQAVGGKASETGEYTMSNRE----- 178
Query: 236 EFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKL 295
MS++CV+LFK+G ETE+ + DDP+
Sbjct: 179 ----------------------------------MSEICVKLFKDGVLETEVQDADDPRQ 204
Query: 296 SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
SKM+K+VV GGKD EVDNDFFL++VKI DHQ
Sbjct: 205 SKMQKEVVAGGKDTMEVDNDFFLMLVKISDHQ 236
>gi|413922639|gb|AFW62571.1| hypothetical protein ZEAMMB73_980053 [Zea mays]
Length = 198
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 69/121 (57%), Positives = 90/121 (74%), Gaps = 4/121 (3%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIYE 175
+RVTRQEN C + SFDRD A+AFQ YV E L F VKR GF+YG V +E K V V+FI E
Sbjct: 20 IRVTRQENALCTAASFDRDAANAFQLYVAEFLGFGVKRAGFLYGRVDVETKDVFVDFIDE 79
Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKK---DYTLSNREVL 232
PPQQG+E+V+++LRD +EE VD IA GLGM++VGF+ TQ + + +YT+SN+EV+
Sbjct: 80 PPQQGSEDVVHLLRDPDEEARVDTIAEGLGMRRVGFVLTQAVGRKASETGEYTMSNQEVM 139
Query: 233 Q 233
Q
Sbjct: 140 Q 140
>gi|328872908|gb|EGG21275.1| nuclear protein localization 4 [Dictyostelium fasciculatum]
Length = 570
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 157/285 (55%), Gaps = 18/285 (6%)
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYE 175
+M++ Q+ H V+ + + A+ FQQY++ F +R GF+YG L+D V V IYE
Sbjct: 279 KMKLKAQDIAHAPGVAINFESANVFQQYISNK-KFEEQRCGFLYGNYLDDGSVVVECIYE 337
Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAV 235
PPQ+G + +L D+ +K V+++A+ LG+ +VG+IF+ + Y +S+ +++QA
Sbjct: 338 PPQKGNKTNFTLLEDKFIDK-VESMASMLGLTRVGWIFSHP----SRKYIMSSTDIIQAA 392
Query: 236 EFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKL 295
F + + +VT + L ++ EG + + EAFQ+SD ++L K G F +A +DP
Sbjct: 393 SFQNKYG-KSFVTLI--LTINSEGKS--NLEAFQVSDQALKLEKTGEF---LANQEDPTK 444
Query: 296 SKMKKDVVVGGKDVKEVDNDFFL--VVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLN 353
K+K V V G + D +FF+ V VK + + L +TFP+ NR + + L S+
Sbjct: 445 CKLKSPVFVEGSETNLADINFFIANVPVKAREDKPFLHTTFPVNNRIPETSNSDLVSYKI 504
Query: 354 RSPSLPLVKRISDFHLLLFL--ARFLDLNSDVPALAQCVQAQTAV 396
+ + SDFH L+FL DL +D P + + ++++T
Sbjct: 505 DNQDQSKLYFFSDFHFLIFLLINNIFDLKTDFPPICENIKSKTTT 549
>gi|66804083|ref|XP_635845.1| nuclear protein localization 4 [Dictyostelium discoideum AX4]
gi|74851898|sp|Q54GD3.1|NPL4_DICDI RecName: Full=Nuclear protein localization protein 4 homolog
gi|60464175|gb|EAL62335.1| nuclear protein localization 4 [Dictyostelium discoideum AX4]
Length = 576
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 148/284 (52%), Gaps = 17/284 (5%)
Query: 116 QMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYE 175
+MR+ Q+NPH D A+ FQQY+ + + +R GF++G L D V V+ IYE
Sbjct: 282 KMRLKSQDNPHAPGALVDFQSANIFQQYIANS-KYEQQRIGFLFGNFLSDGSVVVDSIYE 340
Query: 176 PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAV 235
PPQ+ ++ L + ++++A+ LG+ +VG+IF+ + YT+S+ E++QA
Sbjct: 341 PPQECKDKQTPTLLPDPLADKIESMASMLGLTRVGWIFSHP----SRKYTMSSTEIIQAA 396
Query: 236 EFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKL 295
+ ++ + V L +S + EAFQ+SD ++L K G F + DP
Sbjct: 397 SYQ-----NKYGPSFVTLILSVNSDGQSNMEAFQVSDQALKLEKTGEF---LPTQPDPTK 448
Query: 296 SKMKKDVVVGGKDVKEVDNDFFLVVV--KILDHQGPLSSTFPIENRTTQVTMRALKSHLN 353
K+K V G + D FF+V V K + + + +FP+ENR T+ L S+
Sbjct: 449 CKLKSPVFEEGTETINADTHFFIVTVPLKAREDKSIFNISFPVENRIPVNTLSDLASYKL 508
Query: 354 RSPSLPLVKRISDFHLLLFL--ARFLDLNSDVPALAQCVQAQTA 395
+ +K SDFH L+FL +FLD SD P + + ++++++
Sbjct: 509 EHKDVSPLKFFSDFHFLIFLLENQFLDFQSDFPIICENIRSRSS 552
>gi|422293770|gb|EKU21070.1| nuclear protein localization 4-like protein [Nannochloropsis
gaditana CCMP526]
Length = 448
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 125/454 (27%), Positives = 197/454 (43%), Gaps = 75/454 (16%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
MLLR+RS+ G RV T+ L++ I S+ +S + + + + P+
Sbjct: 1 MLLRVRSKMGTWRVEGVEPGTTIKALRSRILSE-------HGVSPHPSQIFSLDPAGKRV 53
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY---------------DGE-------RNVRGPSFN 98
D ++ + S N+ HG +V L+ DG R F
Sbjct: 54 MDD----EQTVKSFNLQHGDLVHLAIREEDCAHEAGKKIAADGSIVNKTHLEASRVKGFR 109
Query: 99 PAGSFGRKMTM----------DDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETL 148
P R M M D +++ + E P C + C + FQ Y+ +
Sbjct: 110 PGMMSLRSMKMHWTLTDFVELDAQFEYKIKAQKNETPWCTKAIMEPICGNDFQSYMRR-V 168
Query: 149 AFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKK 208
F R G++YG E K +V +YEPPQ+ T E +L D EK V+A+AA LG+ +
Sbjct: 169 GFNQCRMGYLYGQFTEAKEAKVEVLYEPPQESTHEGFELLEDPRAEK-VEALAAALGLVR 227
Query: 209 VGFIFTQTIMQNKKDYTLSNREVLQAV--EFHAECNMEE--WVTAVVKLEVSEEG----- 259
VG+IF ++ + S EV+QA + A ++E +VT V L E+
Sbjct: 228 VGWIFAHP--PREEGFVFSGLEVMQAAMGQLEAADGVKETPFVTVRVSLAPVEKAGGEDG 285
Query: 260 --------SAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKD--VVVGGKDV 309
+ HF+AFQ+S C+ + EG E G +P + + +V GK
Sbjct: 286 KEGGKEEVATVAHFDAFQVSLQCMEMVAEGAIEP----GPEPGAMLVNESFTAIVEGKPA 341
Query: 310 KEVDNDFFLVVVKILDHQGP-LSSTFPIENRTTQV-TMRALKSHLNRS--PSLPLVKRIS 365
VD +FFLV V + + L+ FP NR ++ + A+ + L + + +S
Sbjct: 342 PTVDVNFFLVNVPVGSAESSLLTHRFPKANRLDELPSAAAMAAQLGEAGKAGWRFIDLMS 401
Query: 366 DFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEG 399
DFHLLLFL FL + +V L + V+ + VP G
Sbjct: 402 DFHLLLFLTDFLGME-EVAVLCRSVREKETVPLG 434
>gi|219113663|ref|XP_002186415.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583265|gb|ACI65885.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 473
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 140/478 (29%), Positives = 208/478 (43%), Gaps = 81/478 (16%)
Query: 1 MLLRIRSRDGLERVT-VDGAHVTVAE-LKTIIQSQ---------LRIPVESQTLST---- 45
ML+R+RS G+ RV +D TV + L I +S+ R P +TLST
Sbjct: 1 MLIRVRSNVGVWRVDGLDAETATVTDVLAGIARSRPHVVYERPLARDPACHETLSTTETL 60
Query: 46 -----NQNLLLAKSPSDLLQFTDMANPDRPLSSLNISHGSIV-----------FLSYDG- 88
+QN + D D P +S ++ + V + DG
Sbjct: 61 RAQGLDQNGAMIHCRVDPTTCADWTAPAAEMSPSTMTGNATVEPKEQGQHFRRVIDKDGG 120
Query: 89 -------------ERNVRGPSFNPAGSFGRKMTMDDLIAK----QMRVTRQENPHCESVS 131
+R R P T++D IA + ++ RQE VS
Sbjct: 121 IKLVPTKDVPTQQDRGFRK-GMLPLRDMKMSWTLNDFIALDSQFEFKIQRQEKATTSKVS 179
Query: 132 FDRDCADAFQQYVNETLAFAVKRGGFMYGT-VLEDK------RVEVNFIYEPPQQGTEEV 184
D FQ Y+ + F KR GF+YG V ED+ +V V IYEPPQ+ +
Sbjct: 180 LDVPSISNFQTYL-QRFQFQRKRCGFLYGKFVKEDESDEKPTKVLVEAIYEPPQEINPDS 238
Query: 185 L--YILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAEC- 241
+ L D+ +E+ V+ +A LG+++VG++F Q Y LS E + A EF E
Sbjct: 239 AEGFTLEDDPQEEEVNQLAEWLGLQRVGWVFGHAPRQG---YVLSAAETILAAEFQLEAA 295
Query: 242 -NMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
+EE V + +G A+ EAFQ+S C+ + E E + D+P+L K+ +
Sbjct: 296 GGVEETPFCTVTVAPKLDGQVAV--EAFQVSQQCMAMVAE---EALQVDPDNPQLCKVNE 350
Query: 301 D--VVVGGKDVKEVDNDFFLVVVKILDHQG-PLSSTFPIENRT---TQVTMRALKSHLNR 354
+ GK V+ FFL VV I+ H + FP NR T AL L +
Sbjct: 351 TFTAIQEGKASPTVETAFFLTVVPIVQHTSETFVADFPRANRDLDDTAPNKDALTRQLRQ 410
Query: 355 SPS--LPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVP--EGYKLLINSMA 408
+ S LV R++DF+LL++L D SD P + + A +P +GYKLLI S+A
Sbjct: 411 AGSNGWTLVDRLADFNLLIYLKSSFDFQSDFPKICAAI-ANRDIPLDDGYKLLIASLA 467
>gi|428186314|gb|EKX55164.1| hypothetical protein GUITHDRAFT_98945 [Guillardia theta CCMP2712]
Length = 396
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 150/290 (51%), Gaps = 32/290 (11%)
Query: 128 ESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYI 187
+S+S + A FQ YV L+F R G+M+G V +++ V + IYEPPQ+G EE +
Sbjct: 127 QSLSVEFSAASQFQSYVVGDLSFKSHRVGYMFGHVDKNRNVFCDVIYEPPQRGDEEAYNL 186
Query: 188 LRDEEEEKLVDAIAAGLGMKKVGFIFT--QTIMQNKKDYTLSNREVLQAVEFHAECNMEE 245
+ +E+ V+ +A+ LGM+ + Q M+ K Y ++
Sbjct: 187 TEVDPDEEKVERLASLLGMRMDVMLACSLQHKMEKKHGYDFAS----------------S 230
Query: 246 WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK--LSKMKKDVV 303
V+AV+ E + +I FEA+Q+SD+C+ +++ + ++P ++ +DV+
Sbjct: 231 MVSAVI---TRNETTDSIVFEAYQISDLCLDMYRRDI----LVPPNNPNRGYTRTVQDVL 283
Query: 304 VGGKDVKEVDNDFFLVVVKILDHQ----GPLSSTFPIENRTTQVTMRALKSHLNRS-PSL 358
V KD ++VDNDFF+ V I H G + FP++NR Q R H+ S +
Sbjct: 284 VERKDTQKVDNDFFINTVPIKTHNSEIWGNREAFFPVKNRPGQDQNRMEVKHILLSHEEV 343
Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
P R+ DFH+LL+L+ LD +D+P + + V+ + +G+ L+ S+A
Sbjct: 344 PFCIRMRDFHMLLYLSNMLDPLTDLPIICKSVREDVELQQGHIELVKSIA 393
>gi|291000524|ref|XP_002682829.1| NPL4 family protein [Naegleria gruberi]
gi|284096457|gb|EFC50085.1| NPL4 family protein [Naegleria gruberi]
Length = 435
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 113/441 (25%), Positives = 189/441 (42%), Gaps = 74/441 (16%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
+L+R R +DG R+ + A TV ++K +I+ + IP S +++ K+P
Sbjct: 2 LLIRARGKDGTFRLEL-SADQTVLDIKKLIEKETSIPHSSISVTK----FREKTP----- 51
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVR--------------GPSFNPAGSFGRK 106
+ N D S + HG ++ YD ++ + G RK
Sbjct: 52 ---LKNTDTVQGS-QLKHGDMIEFVYDTTQHKSKQEVEEEKKSQLRDSEDLDNLGLRERK 107
Query: 107 --MTMDDLI--AKQMRVT--RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYG 160
T+ + + Q+++T Q+ C + D+ +FQQ++ +T F +R F+YG
Sbjct: 108 KHWTLSEYLDLVNQVKITIKHQKYAICSTAYLDQLVGQSFQQFLLQT-RFQFQRFAFLYG 166
Query: 161 TVLEDKR-------------------------VEVNFIYEPPQQGTEEVLYILRDEEEEK 195
E K+ V + ++EP Q T E + D
Sbjct: 167 ETKEGKKPQPTEEEKKKATKDENINVDDDYLDVYIKVLFEPKQDSTPEGFKQVEDTVFND 226
Query: 196 LVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEV 255
D +A+ LGMKKVG+IF+ +++Y L E+++A E+ ++ + + V +
Sbjct: 227 RADTLASMLGMKKVGWIFSH---DGQREYPLLGPEIIKAAEYQSK-----YGPSFVTITA 278
Query: 256 SEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDND 315
I FEA+Q+S CV L+++G ++ + KK V + K ++D
Sbjct: 279 CPNEIGEIQFEAYQVSKQCVELWEKGLLS---ENPENLEFLIAKKQVEIERKMTNDIDAL 335
Query: 316 FFLVVVKILDHQGPLSSTFPIENRTTQ---VTMRALKSHLNRSPSLPLVKRISDFHLLLF 372
+ V I H+ FP NR M LKS L S V+R+SDF LLLF
Sbjct: 336 LLVSNVAIAPHKSSFQVGFPPRNRPGNEMFQNMVELKSVLLERRSQKFVQRVSDFDLLLF 395
Query: 373 LARFLDLNSDVPALAQCVQAQ 393
L +L L+SD P L + V +Q
Sbjct: 396 LTDYLSLSSDFPGLCEAVLSQ 416
>gi|361067653|gb|AEW08138.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
Length = 85
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/86 (62%), Positives = 68/86 (79%), Gaps = 1/86 (1%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
ML+R+RSRDGLERV VD + TV +LK++IQSQL++P+ SQ LST+QNLLLAK ++ Q
Sbjct: 1 MLIRVRSRDGLERVKVDKPNPTVGDLKSLIQSQLQVPISSQILSTSQNLLLAKG-GNIQQ 59
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY 86
F DM +P+ LSSL I HG IVFLSY
Sbjct: 60 FKDMEDPNMLLSSLGIDHGHIVFLSY 85
>gi|383167791|gb|AFG66945.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167793|gb|AFG66946.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167795|gb|AFG66947.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167797|gb|AFG66948.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167799|gb|AFG66949.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167801|gb|AFG66950.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167803|gb|AFG66951.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167805|gb|AFG66952.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167807|gb|AFG66953.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167809|gb|AFG66954.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167811|gb|AFG66955.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167813|gb|AFG66956.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167815|gb|AFG66957.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167817|gb|AFG66958.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167819|gb|AFG66959.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167821|gb|AFG66960.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167823|gb|AFG66961.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
gi|383167825|gb|AFG66962.1| Pinus taeda anonymous locus 2_792_01 genomic sequence
Length = 85
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/86 (62%), Positives = 68/86 (79%), Gaps = 1/86 (1%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
ML+R+RSRDGLERV VD + TV +LK++IQSQL++P+ SQ LST+QNLLLAK ++ Q
Sbjct: 1 MLIRVRSRDGLERVKVDKPNPTVGDLKSLIQSQLQVPISSQILSTSQNLLLAKG-GNVQQ 59
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY 86
F DM +P+ LSSL I HG IVFLSY
Sbjct: 60 FKDMEDPNMLLSSLGIDHGHIVFLSY 85
>gi|407044247|gb|EKE42468.1| NPL4 family protein [Entamoeba nuttalli P19]
Length = 461
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 158/302 (52%), Gaps = 25/302 (8%)
Query: 95 PSFNPAGSFGRKMTMDDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKR 154
P+ P G + D +++R+ +QE C+ + D FQ+YV + +F V R
Sbjct: 157 PNHGPHGCCTLCLAKWDY--EKIRIQKQEA-VCKLCRMNFMETDNFQRYVFQR-SFYVNR 212
Query: 155 GGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFT 214
G +YGT +D RV+V IYEPPQ EE + ++ E+ V +A LG+++VG IF+
Sbjct: 213 IGILYGTFKQD-RVQVEAIYEPPQVINEEGDFEMKQEQTLLQVQQMAVELGLERVGIIFS 271
Query: 215 QTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
+++ L++RE++ A + + + + VT VV +E GS + E +Q S+
Sbjct: 272 HA----RREEILTSREIISAAQRQLDWD-KRCVTVVVT--PNETGSES-KVECYQASEQL 323
Query: 275 VRLFKEGWFETEIAEGDDPKLSK--MKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
+ L K+ I + PK ++ +K+ V + K++ VD F+ V I +
Sbjct: 324 MELVKKA-----IVQPSQPKNNEINLKESVYLSQKEIMTVDPIVFVCNVPIRAFKAKTMF 378
Query: 330 LSSTFPIENRTTQ-VTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQ 388
LSS +PIENR + VT+ K + +LP+ ++ISDFHLL+F + + S + ++ +
Sbjct: 379 LSSKYPIENRPQEDVTLLQFKKYFESVKTLPIYQQISDFHLLVFCLDY-GVFSSLHSVKE 437
Query: 389 CV 390
C+
Sbjct: 438 CI 439
>gi|145489333|ref|XP_001430669.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397768|emb|CAK63271.1| unnamed protein product [Paramecium tetraurelia]
Length = 580
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/271 (31%), Positives = 139/271 (51%), Gaps = 36/271 (13%)
Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R GF+YG D R V IYEPPQ+GT + +L+D + D IA LGM
Sbjct: 306 QRMGFLYGYYAADPNYQNGVRAIVEAIYEPPQKGTFGHVDLLQDPAQTH-ADEIATNLGM 364
Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEF-------HAE-CNMEEWVTAVVKLEVSEE 258
+K+G+IFT D LS+ E+ QA + H E C + +++T V L ++
Sbjct: 365 EKIGWIFTNV----NHDCFLSSEELRQAARYQQMHSIQHPEGCLVSKFLTVV--LRTKKD 418
Query: 259 GSAAIHFEAFQMSDMCVRLFKEGWFE-TEIAEGDDPKLSKMKKDV----VVGGKDVKEVD 313
+ E + +SD +L ++G F+ +E + + +K + DV V GK VKE +
Sbjct: 419 NPDEVVPEVYMVSDQGQQLEQDGIFQNSERRKVLQVREAKNELDVLPSFVYNGKSVKEFE 478
Query: 314 NDFFLVVVKILDHQGPLSST-------FPIENRTTQVTMRALKSHLNRSPSLPLVKRISD 366
DFF +V + P S+ FP+ENR+ + LK+ L++ P R +D
Sbjct: 479 PDFF--IVNVAHGYNPNSNYSILKLYDFPVENRSVVAKQQDLKTFLHKHQKKPSHHRFAD 536
Query: 367 FHLLLFLARFLDLNSDVPALAQCVQAQTAVP 397
FHLLL+L++ +D++S + +AQ + ++ VP
Sbjct: 537 FHLLLYLSKLIDIHS-IANIAQFIASECEVP 566
>gi|67480009|ref|XP_655374.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56472505|gb|EAL49987.1| hypothetical protein EHI_026450 [Entamoeba histolytica HM-1:IMSS]
gi|449702247|gb|EMD42922.1| nuclear protein localization, putative [Entamoeba histolytica KU27]
Length = 461
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 158/302 (52%), Gaps = 25/302 (8%)
Query: 95 PSFNPAGSFGRKMTMDDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKR 154
P+ P G + D +++R+ +QE C+ + D FQ+YV + +F V R
Sbjct: 157 PNHGPHGCCTLCLAKWDY--EKVRIQKQEA-VCKLCRMNFMETDNFQRYVFQR-SFYVNR 212
Query: 155 GGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFT 214
G +YGT +D RV+V IYEPPQ EE + ++ E+ V +A LG+++VG IF+
Sbjct: 213 IGILYGTFKQD-RVQVEAIYEPPQVINEEGDFEMKQEQTLLQVQQMAVELGLERVGIIFS 271
Query: 215 QTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
+++ L++RE++ A + + + + VT VV +E GS + E +Q S+
Sbjct: 272 HA----RREEILTSREIISAAQRQLDWD-KRCVTVVVT--PNETGSES-KVECYQASEQL 323
Query: 275 VRLFKEGWFETEIAEGDDPKLSK--MKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
+ L K+ I + PK ++ +K+ V + K++ VD F+ V I +
Sbjct: 324 MELVKKA-----IVQPSQPKNNEINLKESVYLSQKEIMTVDPIVFVCNVPIRAFKAKTMF 378
Query: 330 LSSTFPIENRTTQ-VTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQ 388
LSS +PIENR + VT+ K + +LP+ ++ISDFHLL+F + + S + ++ +
Sbjct: 379 LSSKYPIENRPQEDVTLLQFKKYFESVKTLPIYQQISDFHLLVFCLDY-GVFSSLHSVKE 437
Query: 389 CV 390
C+
Sbjct: 438 CI 439
>gi|145496668|ref|XP_001434324.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401449|emb|CAK66927.1| unnamed protein product [Paramecium tetraurelia]
Length = 580
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/274 (31%), Positives = 139/274 (50%), Gaps = 42/274 (15%)
Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R GF+YG D R V IYEPPQ+GT + +L+D + D IA LGM
Sbjct: 306 QRMGFLYGYYAADPNYQNGVRAIVEAIYEPPQKGTFGHVDLLQDPAQSH-ADEIATSLGM 364
Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEF-------HAE-CNMEEWVTAVVKLEVSEE 258
+K+G+IFT D LS+ E+ QA ++ H E C + +++T V L ++
Sbjct: 365 EKIGWIFTNV----NHDCFLSSEELRQAAQYQQIYSIKHPEGCLVSKFLTVV--LRTKKD 418
Query: 259 GSAAIHFEAFQMSDMCVRLFKEGWFET----EIAEGDDPKLSKMKKDV----VVGGKDVK 310
+ E + +SD +L +G F+ ++ + D +K + DV V GK VK
Sbjct: 419 SPDEVVPEVYMVSDQGQQLENDGIFQNSGRRKVLQVRD---AKNELDVLPSFVYNGKSVK 475
Query: 311 EVDNDFFLVVVKILDHQGPLSST-------FPIENRTTQVTMRALKSHLNRSPSLPLVKR 363
E + DFF +V + P S+ FP+ENR+ + LK+ L++ P R
Sbjct: 476 EFEPDFF--IVNVAHGYNPNSNYSILKLYDFPVENRSAVAKQQDLKTFLHKHQKKPSHNR 533
Query: 364 ISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVP 397
+DFHLLL+L++ +D++S V +AQ + ++ VP
Sbjct: 534 FADFHLLLYLSKLIDIHS-VVNIAQFIGSECEVP 566
>gi|167539988|ref|XP_001741490.1| nuclear protein localization [Entamoeba dispar SAW760]
gi|165893905|gb|EDR22020.1| nuclear protein localization, putative [Entamoeba dispar SAW760]
Length = 463
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 137/259 (52%), Gaps = 22/259 (8%)
Query: 138 DAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLV 197
D FQ+YV++ +F V R G +YG ED RV+V IYEPPQ EE + ++ E+ V
Sbjct: 199 DNFQRYVSQR-SFYVNRIGILYGIFKED-RVQVEAIYEPPQVINEEGDFEMKQEQTLLQV 256
Query: 198 DAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSE 257
+A LG+++VG IF+ +++ L++RE++ A + E + + VT VV
Sbjct: 257 QQMAIELGLERVGIIFSHA----RREEILTSREIIAAAQRQLEWD-KRCVTVVV---TPN 308
Query: 258 EGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSK--MKKDVVVGGKDVKEVDND 315
E E +Q S+ + L K+ I + PK ++ +K+ V + K++ VD
Sbjct: 309 ETGIESKVECYQASEQLMELVKKA-----IVQPSQPKNNEINLKESVYLSQKEIMTVDPV 363
Query: 316 FFLVVVKILDHQGP---LSSTFPIENRTTQ-VTMRALKSHLNRSPSLPLVKRISDFHLLL 371
F+ V I + LSS +PIENR + VT+ K + +LP+ ++ISDFHLL+
Sbjct: 364 VFVCNVPIRAFKAKTMFLSSKYPIENRPQEDVTLLQFKKYFESVKTLPIYQQISDFHLLV 423
Query: 372 FLARFLDLNSDVPALAQCV 390
F + + S + ++ +CV
Sbjct: 424 FCLDY-GVFSSLHSVKECV 441
>gi|388490756|gb|AFK33444.1| unknown [Lotus japonicus]
Length = 67
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 46/64 (71%), Positives = 57/64 (89%)
Query: 345 MRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLI 404
MRALK+HL R+ +LP VKRI+DFHLLL LARFLDLN+DVPAL +C+Q +T VPEGY++LI
Sbjct: 1 MRALKNHLERTKNLPFVKRIADFHLLLVLARFLDLNADVPALTECIQTETTVPEGYQILI 60
Query: 405 NSMA 408
+SMA
Sbjct: 61 DSMA 64
>gi|296203397|ref|XP_002748882.1| PREDICTED: nuclear protein localization protein 4 homolog
[Callithrix jacchus]
Length = 608
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 93/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + GF+YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGFLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|237833025|ref|XP_002365810.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211963474|gb|EEA98669.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|221488270|gb|EEE26484.1| nuclear protein localization, putative [Toxoplasma gondii GT1]
gi|221508777|gb|EEE34346.1| nuclear protein localization, putative [Toxoplasma gondii VEG]
Length = 502
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 143/303 (47%), Gaps = 41/303 (13%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
F ++ E L +R G+M+G +ED R IYEPPQ+ T L + +D+EE
Sbjct: 207 FVRFWQEDLQMLQQRFGYMFGYYVEDPHYPDGIRAVCEAIYEPPQENTLTSLNVKKDDEE 266
Query: 194 EKLVDAIAAGLGMKKVGFIFTQT----IMQNKKDYTLSNREVLQAVEFH-AECNMEEWVT 248
K+ + IA LG++ +G IFT ++ + + L+ ++ + V H + ++V
Sbjct: 267 VKVAEKIADRLGLELIGCIFTHAPREELLTSHEVVDLAKTQLSRQVSTHFTGYPVSKFVC 326
Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDD-------PKLSKMKKD 301
V L+ S A+ AF +SDM V L ++G + DD P+ ++
Sbjct: 327 CTVSLDKSTRDGEAVP-NAFMVSDMGVALVRDGVVSE--TQPDDTHIQLRSPEKGELLPQ 383
Query: 302 VVVGGKDVKEVDNDFFLVVVKILDHQGP-------LSSTFPIENRTTQVTMRALKSHLNR 354
V+ G++ D +F+V V + P SS+FP NR T + + HL R
Sbjct: 384 VLESGRETTRFDASWFIVRV---NESAPKKVRSFFCSSSFPRANRLVAQTPKDITDHLTR 440
Query: 355 SPSL----PLVK-----RISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLIN 405
+L P+ K R +DFHLLL++A+ DL++ ++ CV+ + V EG + +
Sbjct: 441 VAALAGPSPVAKKENWRRFADFHLLLYVAKLFDLDTAF-SICDCVRNRQPVDEGLEDTLK 499
Query: 406 SMA 408
S
Sbjct: 500 SFG 502
>gi|355707795|gb|AES03067.1| nuclear protein localization 4-like protein [Mustela putorius furo]
Length = 550
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 208 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 264
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 265 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 323
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 324 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 379
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 380 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 436
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ S FPIENR TQ +L ++L+++ S
Sbjct: 437 PLPVEYLIIDITTTFPKDPVHTFSISQSPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 495
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 496 LDTISDFHLLLFL 508
>gi|147898562|ref|NP_001080427.1| nuclear protein localization 4 homolog [Xenopus laevis]
gi|33585642|gb|AAH56013.1| Npl4 protein [Xenopus laevis]
Length = 610
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 149/309 (48%), Gaps = 60/309 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T +R G++YG E K R E
Sbjct: 215 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQRIGYLYGRYTEHKDIPLGLRAE 271
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D+ + K+VD IAA LG++KVG+IFT + +
Sbjct: 272 VAAIYEPPQIGTQNSLQLL-DDPKAKVVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 330
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
NK Y LS E + A F + C + + + V + + +HFE +Q+S
Sbjct: 331 RNKDTYFLSAEECITAGYFQNKYPNLCRLSPDAHFGSKFVTVVATGGPDNQVHFEGYQVS 390
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 391 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 447
Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
++ ++ + + P+ S+ FPIENR TQ +L ++L+++ S ++ I
Sbjct: 448 EYLIIDITTTFPKDPVYTFSTSSNLFPIENRDVLGETQ-DFHSLATYLSQNLSSVFLEII 506
Query: 365 SDFHLLLFL 373
SDFHLLLFL
Sbjct: 507 SDFHLLLFL 515
>gi|401408699|ref|XP_003883798.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325118215|emb|CBZ53766.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 504
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 140/305 (45%), Gaps = 45/305 (14%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
F Q+ + L +R G+M+G +ED R IYEPPQ+ T + +D+EE
Sbjct: 209 FVQFWQQDLQMLQQRFGYMFGYYVEDPHYPDGIRAVCEAIYEPPQENTLTSTNVKKDDEE 268
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQ------AVEFHAECN---ME 244
K+ + IA LG++ +G IFT ++ L++ EV+ A +F M
Sbjct: 269 VKMAEKIAERLGLEPIGCIFTHA----PREELLTSHEVVDLAKTQLARQFSTHFTGYPMS 324
Query: 245 EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD-----DPKLSKMK 299
++V V L+ S A+ AF +SD+ V L ++G + + P+ ++
Sbjct: 325 KYVCCTVSLDKSTRDGEAVP-NAFMVSDLGVALVRDGVICDKQPDNTHIQLRSPEKGELL 383
Query: 300 KDVVVGGKDVKEVDNDFFLVVVKILDHQGP-------LSSTFPIENRTTQVTMRALKSHL 352
V+ G++ D +F+V V + P SSTFP NR T + + HL
Sbjct: 384 PQVLESGRETTRFDASWFIVRV---NESAPRKVRSFFCSSTFPRANRLVHQTPKDITDHL 440
Query: 353 NRSPSL----PLVK-----RISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLL 403
R + P K R +DFHLLL++A+ DL++ ++ CV+++ V EG +
Sbjct: 441 TRGAACAGSGPQAKKENWRRFADFHLLLYVAKLFDLDTAF-SICDCVRSRQPVDEGLEDT 499
Query: 404 INSMA 408
+ S
Sbjct: 500 LKSFG 504
>gi|351706408|gb|EHB09327.1| Nuclear protein localization protein 4-like protein [Heterocephalus
glaber]
Length = 658
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 227 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 283
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT I +
Sbjct: 284 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLISEDTRKGTVRYS 342
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 343 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 398
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 399 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 455
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 456 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 514
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 515 LDTISDFHLLLFL 527
>gi|7959259|dbj|BAA96023.1| KIAA1499 protein [Homo sapiens]
Length = 660
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 256 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 312
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 313 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 371
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 372 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 427
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 428 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 484
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 485 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 543
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 544 LDTISDFHLLLFL 556
>gi|7020902|dbj|BAA91314.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 150/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T ++ G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQQFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|410982004|ref|XP_003997354.1| PREDICTED: nuclear protein localization protein 4 homolog [Felis
catus]
Length = 755
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 360 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 416
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 417 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 475
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 476 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 531
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 532 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 588
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 589 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 647
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 648 LDTISDFHLLLFL 660
>gi|332251481|ref|XP_003274874.1| PREDICTED: nuclear protein localization protein 4 homolog [Nomascus
leucogenys]
Length = 607
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|431908659|gb|ELK12251.1| Nuclear protein localization protein 4 like protein [Pteropus
alecto]
Length = 565
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 148/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + T + G++YG E K R E
Sbjct: 170 ITLNRQKYRHVDNIMFENHTVADRFLDFWRRT---GSQHLGYLYGRYTEHKDIPLGVRAE 226
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 227 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 285
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS E + A +F + C + ++VTAV + +HFE
Sbjct: 286 RNKDTYFLSAEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 341
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L +K KD+ G ++ ++
Sbjct: 342 YQVSNQCMALVRD---ECLLPCKDAPELGYVKESSSEQYVPDVFYKDLDKFGNEITQLAR 398
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L++S S
Sbjct: 399 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQSTSAVF 457
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 458 LDTISDFHLLLFL 470
>gi|332849283|ref|XP_003315818.1| PREDICTED: nuclear protein localization protein 4 homolog [Pan
troglodytes]
gi|410227436|gb|JAA10937.1| nuclear protein localization 4 homolog [Pan troglodytes]
gi|410256178|gb|JAA16056.1| nuclear protein localization 4 homolog [Pan troglodytes]
gi|410305070|gb|JAA31135.1| nuclear protein localization 4 homolog [Pan troglodytes]
gi|410353911|gb|JAA43559.1| nuclear protein localization 4 homolog [Pan troglodytes]
Length = 608
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|297702026|ref|XP_002827994.1| PREDICTED: nuclear protein localization protein 4 homolog [Pongo
abelii]
Length = 608
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|148702798|gb|EDL34745.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
CRA_c [Mus musculus]
Length = 549
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 154 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 210
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 211 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 269
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 270 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 325
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 326 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 382
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 383 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 441
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 442 LDTISDFHLLLFL 454
>gi|380814530|gb|AFE79139.1| nuclear protein localization protein 4 homolog [Macaca mulatta]
gi|383409511|gb|AFH27969.1| nuclear protein localization protein 4 homolog [Macaca mulatta]
gi|384948116|gb|AFI37663.1| nuclear protein localization protein 4 homolog [Macaca mulatta]
Length = 608
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|194377504|dbj|BAG57700.1| unnamed protein product [Homo sapiens]
Length = 456
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 52 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 108
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 109 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 167
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 168 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 223
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 224 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 280
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 281 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 339
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 340 LDTISDFHLLLFL 352
>gi|157426879|ref|NP_060391.2| nuclear protein localization protein 4 homolog [Homo sapiens]
gi|50428974|sp|Q8TAT6.3|NPL4_HUMAN RecName: Full=Nuclear protein localization protein 4 homolog;
Short=Protein NPL4
gi|19683967|gb|AAH25930.1| Nuclear protein localization 4 homolog (S. cerevisiae) [Homo
sapiens]
gi|119610067|gb|EAW89661.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
CRA_a [Homo sapiens]
gi|119610068|gb|EAW89662.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
CRA_a [Homo sapiens]
gi|123997361|gb|ABM86282.1| nuclear protein localization 4 homolog (S. cerevisiae) [synthetic
construct]
Length = 608
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|119610069|gb|EAW89663.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
CRA_b [Homo sapiens]
gi|208967577|dbj|BAG72434.1| nuclear protein localization 4 homolog [synthetic construct]
Length = 617
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|73964791|ref|XP_540482.2| PREDICTED: nuclear protein localization protein 4 homolog [Canis
lupus familiaris]
Length = 615
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 220 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 276
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 277 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 335
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 336 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 391
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 392 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 448
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 449 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 507
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 508 LDTISDFHLLLFL 520
>gi|37360414|dbj|BAC98185.1| mKIAA1499 protein [Mus musculus]
gi|148702797|gb|EDL34744.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
CRA_b [Mus musculus]
Length = 662
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 267 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 323
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 324 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 382
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 383 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 438
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 439 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 495
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 496 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 554
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 555 LDTISDFHLLLFL 567
>gi|10435132|dbj|BAB14499.1| unnamed protein product [Homo sapiens]
Length = 520
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 125 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GSQHFGYLYGRYTEHKDIPLGIRAE 181
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 182 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 240
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 241 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 296
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 297 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 353
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 354 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 412
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 413 LDTISDFHLLLFL 425
>gi|303324586|ref|NP_001181952.1| nuclear protein localization protein 4 homolog isoform A [Mus
musculus]
gi|46397631|sp|P60670.3|NPL4_MOUSE RecName: Full=Nuclear protein localization protein 4 homolog;
Short=Protein NPL4
Length = 608
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|74207973|dbj|BAE29105.1| unnamed protein product [Mus musculus]
Length = 608
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|348558274|ref|XP_003464943.1| PREDICTED: nuclear protein localization protein 4 homolog [Cavia
porcellus]
Length = 668
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 273 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 329
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 330 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 388
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 389 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 444
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 445 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 501
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 502 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 560
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 561 LDTISDFHLLLFL 573
>gi|335297237|ref|XP_003357980.1| PREDICTED: nuclear protein localization protein 4 homolog [Sus
scrofa]
Length = 564
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 169 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 225
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG+K+VG+IFT + +
Sbjct: 226 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLKEVGWIFTDLVSEDARKGTVRYS 284
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 285 RNKDTYFLSSEECITAGDFQNKHPNVCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 340
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 341 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 397
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L++S S
Sbjct: 398 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQSTSSVF 456
Query: 361 VKRISDFHLLLFL 373
+ +SDFHLLLFL
Sbjct: 457 LDTVSDFHLLLFL 469
>gi|354469126|ref|XP_003496981.1| PREDICTED: nuclear protein localization protein 4 homolog
[Cricetulus griseus]
Length = 689
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 294 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 350
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 351 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 409
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 410 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 465
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 466 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 522
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 523 PLPVEYLIIDITTTFPKDPVFTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNNSSVF 581
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 582 LDTISDFHLLLFL 594
>gi|327264572|ref|XP_003217087.1| PREDICTED: nuclear protein localization protein 4 homolog [Anolis
carolinensis]
Length = 609
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/309 (28%), Positives = 145/309 (46%), Gaps = 60/309 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T + G++YG E K R E
Sbjct: 214 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLAIRAE 270
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 271 VAAIYEPPQIGTQNTLELLEDSKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 329
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
NK Y LS E + A F + C + + + V + + +HFE +Q+S
Sbjct: 330 RNKDSYFLSAEECITAGYFQNQQPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 389
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDN 314
+ C+ L ++ E + D P+L K+ DV G+D + +
Sbjct: 390 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKGRDKFGNEITQLARPLPV 446
Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
++ ++ + + P+ + FPIENR TQ +L ++L+++ S + I
Sbjct: 447 EYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDII 505
Query: 365 SDFHLLLFL 373
SDFHLLLFL
Sbjct: 506 SDFHLLLFL 514
>gi|340501211|gb|EGR28019.1| npl4 family protein, putative [Ichthyophthirius multifiliis]
Length = 495
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 140/284 (49%), Gaps = 43/284 (15%)
Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
++ GF+YG ED R + IYEPPQ G +L DE E+ VD IA L +
Sbjct: 223 QKMGFLYGYYAEDPNYKGGVRAIIEAIYEPPQVGDISGFKLLYDEYEQ-YVDQIAEALTL 281
Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNM--------EEWVTAVVKLEVSEE 258
+K+G+ FT TI D L + E+ A ++ + + +++ V L ++
Sbjct: 282 EKIGWAFT-TI---NHDTFLCSHEIRMAAKYQEQYRVLHESGYPFSTFISVV--LRTGKD 335
Query: 259 GSAAIHFEAFQMSDMCVRLFKEGWF-ETE---IAEGDDP-KLSKMKKDVVVGGKDVKEVD 313
+ + E + +SD + K F E+E + DP + + ++VGGK VKE +
Sbjct: 336 NPSEVRPEVYMVSDQAQIMEKNNIFGESERRKFMQIRDPVNDTDVLPQIIVGGKTVKEFE 395
Query: 314 NDFFLVVV----------KILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKR 363
D+F+V V I+ H + FP+ NR Q M +K +L + + ++
Sbjct: 396 PDYFIVNVAHGHSNHNKFSIIKH-----ADFPVANRQQQSPME-VKKYLQKFKNQKSFEK 449
Query: 364 ISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINSM 407
SDFHLLL+LA+ +DL + + +AQ V +T +PEG + LIN +
Sbjct: 450 YSDFHLLLYLAKLVDLQTAL-VIAQSVAQETDIPEGCENLINQI 492
>gi|440897699|gb|ELR49339.1| hypothetical protein M91_17353, partial [Bos grunniens mutus]
Length = 603
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 148/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 208 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 264
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 265 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 323
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 324 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 379
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 380 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 436
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+ + S
Sbjct: 437 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSHNTSSVF 495
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 496 LDTISDFHLLLFL 508
>gi|300794221|ref|NP_001179118.1| nuclear protein localization protein 4 homolog [Bos taurus]
gi|296476170|tpg|DAA18285.1| TPA: nuclear protein localization 4 homolog [Bos taurus]
Length = 608
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 148/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+ + S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSHNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|426239181|ref|XP_004013504.1| PREDICTED: nuclear protein localization protein 4 homolog [Ovis
aries]
Length = 614
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 219 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 275
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 276 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 334
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 335 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 390
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 391 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 447
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 448 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 506
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 507 LDTISDFHLLLFL 519
>gi|10434779|dbj|BAB14372.1| unnamed protein product [Homo sapiens]
Length = 520
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 125 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 181
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 182 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 240
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 241 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 296
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 297 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 353
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 354 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 412
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 413 LDTISDFHLLLFL 425
>gi|397522213|ref|XP_003831172.1| PREDICTED: nuclear protein localization protein 4 homolog [Pan
paniscus]
Length = 520
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 125 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 181
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 182 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 240
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 241 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 296
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 297 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 353
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 354 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 412
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 413 LDTISDFHLLLFL 425
>gi|395533255|ref|XP_003768676.1| PREDICTED: nuclear protein localization protein 4 homolog
[Sarcophilus harrisii]
Length = 730
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 147/311 (47%), Gaps = 68/311 (21%)
Query: 119 VTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVN 171
+ RQ+ H +++ F + AD F + +T + G++YG E K R EV
Sbjct: 337 LNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAEVA 393
Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ-----------N 220
IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + + N
Sbjct: 394 AIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRN 452
Query: 221 KKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEAFQ 269
K Y LS+ E + A F + C + ++VTAV + +HFE +Q
Sbjct: 453 KDTYFLSSEECITAGNFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQ 508
Query: 270 MSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN-- 314
+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 509 VSNQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPL 565
Query: 315 --DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVK 362
++ ++ + + P+ S FPIENR TQ +L ++L+++ S +
Sbjct: 566 PVEYLIIDITTTFPKDPVYTFSISQSPFPIENRDVLGETQ-DFHSLATYLSQNTSSIFLD 624
Query: 363 RISDFHLLLFL 373
ISDFHLLLFL
Sbjct: 625 IISDFHLLLFL 635
>gi|21739895|emb|CAD38971.1| hypothetical protein [Homo sapiens]
Length = 519
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 124 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 180
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 181 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 239
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 240 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 295
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KDV G ++ ++
Sbjct: 296 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLAR 352
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 353 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 411
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 412 LDTISDFHLLLFL 424
>gi|348522316|ref|XP_003448671.1| PREDICTED: nuclear protein localization protein 4 homolog
[Oreochromis niloticus]
Length = 624
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 95/308 (30%), Positives = 140/308 (45%), Gaps = 59/308 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T +R G++YG E K R E
Sbjct: 229 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GSQRMGYLYGRYTEHKDIPLGIRAE 285
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ T+ L +L D + VD IAA LGM KVG+IFT + +
Sbjct: 286 VAAIYEPPQNATQNSLELLEDPKAAA-VDEIAAKLGMCKVGWIFTDLLSEDTRIGTVRYS 344
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
N+ Y LS E + A F E C + + + V + + +HFE +Q+S
Sbjct: 345 RNQDSYYLSAEECITAGYFQNEHSNPCRLSRDGHFGSKFVTVVATGGPDNQVHFEGYQVS 404
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDVKEVDNDFF------ 317
+ C+ L ++ E + D P+L K+ DV KD D F
Sbjct: 405 NQCMALVRD---ECLLPCKDAPELGYAKESSPEQYVPDVFYKDKDKFGNDVTFLARPLPV 461
Query: 318 -LVVVKIL-----DHQGPLSST--FPIENR----TTQVTMRALKSHLNRSPSLPLVKRIS 365
+++ I D Q SST FPIENR TQ +L ++L++ S + +S
Sbjct: 462 EYLIIDITTTFPKDPQYTFSSTQRFPIENRDILGETQ-DFHSLATYLSQCTSTSFLNIVS 520
Query: 366 DFHLLLFL 373
DFHLLLFL
Sbjct: 521 DFHLLLFL 528
>gi|344291384|ref|XP_003417415.1| PREDICTED: nuclear protein localization protein 4 homolog
[Loxodonta africana]
Length = 609
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E + R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHRDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPLSS------TFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVHTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSMF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|149055014|gb|EDM06831.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
CRA_b [Rattus norvegicus]
Length = 549
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 154 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 210
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IA+ LG++KVG+IFT + +
Sbjct: 211 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIASKLGLRKVGWIFTDLVSEDTRKGTVRYS 269
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 270 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 325
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 326 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 382
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 383 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 441
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 442 LDTISDFHLLLFL 454
>gi|301754217|ref|XP_002912994.1| PREDICTED: nuclear protein localization protein 4 homolog
[Ailuropoda melanoleuca]
Length = 605
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 210 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 266
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 267 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 325
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 326 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 381
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 382 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 438
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 439 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 497
Query: 361 VKRISDFHLLLFL 373
+ +SDFHLLLFL
Sbjct: 498 LDTVSDFHLLLFL 510
>gi|281348660|gb|EFB24244.1| hypothetical protein PANDA_000715 [Ailuropoda melanoleuca]
Length = 559
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 208 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 264
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 265 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 323
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 324 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 379
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 380 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 436
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 437 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 495
Query: 361 VKRISDFHLLLFL 373
+ +SDFHLLLFL
Sbjct: 496 LDTVSDFHLLLFL 508
>gi|18034779|ref|NP_542144.1| nuclear protein localization protein 4 homolog [Rattus norvegicus]
gi|48429023|sp|Q9ES54.3|NPL4_RAT RecName: Full=Nuclear protein localization protein 4 homolog;
Short=Protein NPL4
gi|11037252|gb|AAG27534.1|AF234600_1 NPL4 [Rattus norvegicus]
gi|74353675|gb|AAI01888.1| Nuclear protein localization 4 homolog (S. cerevisiae) [Rattus
norvegicus]
gi|149055013|gb|EDM06830.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
CRA_a [Rattus norvegicus]
Length = 608
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IA+ LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIASKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|417411892|gb|JAA52365.1| Putative nuclear pore complex rnpl4 component sc npl4, partial
[Desmodus rotundus]
Length = 603
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 147/313 (46%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 208 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 264
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 265 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 323
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS E + A +F + C + ++VTAV + +HFE
Sbjct: 324 RNKDTYFLSAEECITAGDFQNKHPNVCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 379
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD G ++ ++
Sbjct: 380 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDTDKFGNEITQLAR 436
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ S FPIENR TQ +L ++L+++ S
Sbjct: 437 PLPVEYLIIDITTTFPKDPVHTFSTSQSPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 495
Query: 361 VKRISDFHLLLFL 373
+ +SDFHLLLFL
Sbjct: 496 LDTVSDFHLLLFL 508
>gi|148702796|gb|EDL34743.1| nuclear protein localization 4 homolog (S. cerevisiae), isoform
CRA_a [Mus musculus]
Length = 627
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 138/286 (48%), Gaps = 46/286 (16%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 264 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 320
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 321 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 379
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 380 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 435
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
+Q+S+ C+ L ++ E + D P+L K+ + V + F+ + K +
Sbjct: 436 YQVSNQCMALVRD---ECLLPCKDAPELGYAKES-----SSEQYVPDVFYKDIDKFGNEI 487
Query: 328 GPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
L+ P+E +L ++L+++ S + ISDFHLLLFL
Sbjct: 488 TQLARPLPVEYLIID-DFHSLATYLSQNTSSVFLDTISDFHLLLFL 532
>gi|149723455|ref|XP_001489914.1| PREDICTED: nuclear protein localization protein 4 homolog [Equus
caballus]
Length = 615
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 149/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 220 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 276
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + + +VD IAA LG++KVG+IFT + +
Sbjct: 277 VAAIYEPPQIGTQNSLELLEDPKAD-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 335
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 336 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 391
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 392 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 448
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 449 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 507
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 508 LDTISDFHLLLFL 520
>gi|41054974|ref|NP_955763.1| nuclear protein localization protein 4 homolog isoform B [Mus
musculus]
gi|40787841|gb|AAH65156.1| Nuclear protein localization 4 homolog (S. cerevisiae) [Mus
musculus]
Length = 576
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 138/286 (48%), Gaps = 46/286 (16%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
+Q+S+ C+ L ++ E + D P+L K+ + V + F+ + K +
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKES-----SSEQYVPDVFYKDIDKFGNEI 436
Query: 328 GPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
L+ P+E +L ++L+++ S + ISDFHLLLFL
Sbjct: 437 TQLARPLPVEYLIID-DFHSLATYLSQNTSSVFLDTISDFHLLLFL 481
>gi|363740846|ref|XP_415704.3| PREDICTED: nuclear protein localization protein 4 homolog [Gallus
gallus]
Length = 609
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 145/309 (46%), Gaps = 60/309 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T + G++YG E K R E
Sbjct: 214 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHLGYLYGRYTEHKDIPLGIRAE 270
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L IL D + E +VD IAA LG++KVG+IFT + +
Sbjct: 271 VAAIYEPPQIGTQNSLEILEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 329
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
NK Y LS E + A F + C + + + V + + +HFE +Q+S
Sbjct: 330 RNKDTYYLSAEECITAGNFQNQHPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 389
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 390 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 446
Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
++ ++ + + P+ + FPIENR TQ +L ++L+++ S + +
Sbjct: 447 EYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDIV 505
Query: 365 SDFHLLLFL 373
SDFHLLLFL
Sbjct: 506 SDFHLLLFL 514
>gi|334322635|ref|XP_001379846.2| PREDICTED: nuclear protein localization protein 4 homolog isoform 1
[Monodelphis domestica]
Length = 608
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 91/313 (29%), Positives = 147/313 (46%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS E + A F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSAEECITAGNFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSIF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDIISDFHLLLFL 513
>gi|402901341|ref|XP_003913609.1| PREDICTED: nuclear protein localization protein 4 homolog [Papio
anubis]
Length = 608
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/317 (29%), Positives = 149/317 (47%), Gaps = 76/317 (23%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRG----GFMYGTVLEDK----- 166
+ + RQ+ H +++ F + AD F L ++G G++YG E K
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRF-------LTSGERQGTQHFGYLYGRYTEHKDIPLG 265
Query: 167 -RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ------ 219
R EV IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 266 IRAEVAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGT 324
Query: 220 -----NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAI 263
NK Y LS+ E + A +F + C + ++VTAV + +
Sbjct: 325 VRYSRNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----V 380
Query: 264 HFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVK 310
HFE +Q+S+ C+ L ++ E + D P+L K KDV G ++
Sbjct: 381 HFEGYQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEIT 437
Query: 311 EVDN----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSP 356
++ ++ ++ + + P+ + FPIENR TQ +L ++L+++
Sbjct: 438 QLARPLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNT 496
Query: 357 SLPLVKRISDFHLLLFL 373
S + ISDFHLLLFL
Sbjct: 497 SSVFLDTISDFHLLLFL 513
>gi|387017400|gb|AFJ50818.1| Nuclear protein localization protein 4-like protein [Crotalus
adamanteus]
Length = 609
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 145/309 (46%), Gaps = 60/309 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T + G++YG E K R E
Sbjct: 214 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 270
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V+ IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 271 VSAIYEPPQIGTQNSLELLEDSKAE-IVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 329
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
NK Y LS E + A F + C + + + V + + +HFE +Q+S
Sbjct: 330 RNKDAYFLSAEECITAGHFQNQQPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 389
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 390 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 446
Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
++ ++ + + P+ + FPIENR TQ L ++L+++ S + +
Sbjct: 447 EYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHILATYLSQNTSSVFLDIV 505
Query: 365 SDFHLLLFL 373
SDFHLLLFL
Sbjct: 506 SDFHLLLFL 514
>gi|291415843|ref|XP_002724159.1| PREDICTED: nuclear protein localization 4, partial [Oryctolagus
cuniculus]
Length = 737
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 137/284 (48%), Gaps = 46/284 (16%)
Query: 119 VTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVN 171
+ RQ+ H +++ F + AD F + +T + G++YG E K R EV
Sbjct: 376 LNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAEVA 432
Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ-----------N 220
IYEPPQ GT+ L +L D + + +VD IAA LG++KVG+IFT + + N
Sbjct: 433 AIYEPPQIGTQNSLELLEDPKAD-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRN 491
Query: 221 KKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEAFQ 269
K Y LS+ E + A +F + C + ++VTAV + +HFE +Q
Sbjct: 492 KDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQ 547
Query: 270 MSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP 329
+S+ C+ L ++ E + D P+L K+ + V + F+ + K +
Sbjct: 548 VSNQCMALVRD---ECLLPCKDAPELGYAKES-----SSEQYVPDVFYKDIDKFGNEITQ 599
Query: 330 LSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
L+ P+E +L ++L+++ S + +SDFHLLLFL
Sbjct: 600 LARPLPVEYLIID-DFHSLAAYLSQNTSSMFLDTVSDFHLLLFL 642
>gi|74208119|dbj|BAE29162.1| unnamed protein product [Mus musculus]
Length = 608
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 148/313 (47%), Gaps = 68/313 (21%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V I EPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAICEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSSEECITAGDFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN 314
+Q+S+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 385 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLAR 441
Query: 315 ----DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPL 360
++ ++ + + P+ + FPIENR TQ +L ++L+++ S
Sbjct: 442 PLPVEYLIIDITTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVF 500
Query: 361 VKRISDFHLLLFL 373
+ ISDFHLLLFL
Sbjct: 501 LDTISDFHLLLFL 513
>gi|223996267|ref|XP_002287807.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976923|gb|EED95250.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 511
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 76/252 (30%), Positives = 126/252 (50%), Gaps = 17/252 (6%)
Query: 168 VEVNFIYEPPQQGTEEVL--YILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT 225
V V IYEPPQ+ E +I+ D+ E+ V+ + LG+ KVG+I ++ +
Sbjct: 260 VIVEAIYEPPQEPDSEAAEGFIILDDPMEENVEELVKMLGLVKVGWIVGHP--PREEGFQ 317
Query: 226 LSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET 285
LS+ EV+ A E E T V + V+ + + EAFQ+S C+ + E +
Sbjct: 318 LSSAEVIMAAELQLESAGGIEATPFVTVRVTVDDDGNVVVEAFQVSLQCMEMVAEEALDV 377
Query: 286 EIAEGDDPKLSKMKKD--VVVGGKDVKEVDNDFFLVVVKILDHQGPL-SSTFPIENRT-- 340
G +P + + GK V+N+FFL +V I +Q + S FP NR
Sbjct: 378 ----GPNPGFCYVNDTFTAIQEGKPSATVENNFFLTLVPINGYQSEMFVSMFPKANRAFD 433
Query: 341 -TQVTMRALKSHLNRSPS--LPLVKRISDFHLLLFLARFLDLNSDVPALAQCV-QAQTAV 396
T +K L++S + + +SDF LLL+L +FLD+ +D+P + V + + +
Sbjct: 434 DRAQTHDEMKRQLSKSGTAGWSFIDLLSDFGLLLYLTQFLDMKTDMPKICASVKEREIPL 493
Query: 397 PEGYKLLINSMA 408
+GYK++I+S+A
Sbjct: 494 DDGYKIIISSLA 505
>gi|326930855|ref|XP_003211554.1| PREDICTED: nuclear protein localization protein 4 homolog
[Meleagris gallopavo]
Length = 621
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 144/309 (46%), Gaps = 60/309 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T + G++YG E K R E
Sbjct: 226 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHLGYLYGRYTEHKDIPLGIRAE 282
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L IL D + E +VD IAA LG++KVG+IFT + +
Sbjct: 283 VAAIYEPPQIGTQNSLEILEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 341
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
NK Y LS E + A F + C + + + V + + +HFE +Q+S
Sbjct: 342 RNKDTYYLSAEECITAGNFQNQHPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 401
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 402 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 458
Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
++ ++ + + P+ + FPIEN TQ +L ++L+++ S + +
Sbjct: 459 EYLIIDITTTFPKDPVYTFSISQNPFPIENHDVLGETQ-DFHSLATYLSQNTSSVFLDIV 517
Query: 365 SDFHLLLFL 373
SDFHLLLFL
Sbjct: 518 SDFHLLLFL 526
>gi|334322637|ref|XP_003340282.1| PREDICTED: nuclear protein localization protein 4 homolog isoform 2
[Monodelphis domestica]
Length = 576
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 136/286 (47%), Gaps = 46/286 (16%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 328
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS E + A F + C + ++VTAV + +HFE
Sbjct: 329 RNKDTYFLSAEECITAGNFQNKHPNICRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 384
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
+Q+S+ C+ L ++ E + D P+L K+ + V + F+ + K +
Sbjct: 385 YQVSNQCMALVRD---ECLLPCRDAPELGYAKES-----SSEQYVPDVFYKDIDKFGNEI 436
Query: 328 GPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
L+ P+E +L ++L+++ S + ISDFHLLLFL
Sbjct: 437 TQLARPLPVEYLIID-DFHSLATYLSQNTSSIFLDIISDFHLLLFL 481
>gi|213983169|ref|NP_001135712.1| nuclear protein localization 4 homolog [Xenopus (Silurana)
tropicalis]
gi|197246525|gb|AAI69147.1| Unknown (protein for MGC:189610) [Xenopus (Silurana) tropicalis]
Length = 610
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 148/309 (47%), Gaps = 60/309 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T +R G++YG E K R E
Sbjct: 215 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQRIGYLYGRYTEHKDIPLGLRAE 271
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D+ + K+VD IAA LG++KVG+IFT + +
Sbjct: 272 VAAIYEPPQIGTQNSLQLL-DDPKAKVVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 330
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
NK Y LS E + + F + C + + + V + + +HFE +Q+S
Sbjct: 331 RNKDTYFLSAEECITSGFFQNKHPNLCRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 390
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 391 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 447
Query: 315 DFFLVVVKILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRI 364
++ ++ + + P+ S+ F IENR TQ +L ++L+++ S ++ I
Sbjct: 448 EYLIIDITATFPKDPVYTFSTSSNLFSIENRDALGETQ-DFHSLATYLSQNLSSVFLEII 506
Query: 365 SDFHLLLFL 373
SDFHLLLFL
Sbjct: 507 SDFHLLLFL 515
>gi|444727732|gb|ELW68210.1| Nuclear protein localization protein 4 like protein [Tupaia
chinensis]
Length = 420
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 134/285 (47%), Gaps = 68/285 (23%)
Query: 148 LAFAVKRG----GFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLV 197
L F K G G++YG E K R EV IYEPPQ GT+ L +L D + E +V
Sbjct: 12 LDFWRKTGNQHFGYLYGRYTEHKDIPLGIRAEVAAIYEPPQIGTQNSLELLEDPKAE-VV 70
Query: 198 DAIAAGLGMKKVGFIFTQTIMQ-----------NKKDYTLSNREVLQAVEFHAE----CN 242
D IAA LG++KVG+IFT + + NK Y LS+ E + A +F + C
Sbjct: 71 DEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRNKDTYFLSSEECITAGDFQNKHPNICR 130
Query: 243 M-------EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKL 295
+ ++VTAV + +HFE +Q+S+ C+ L ++ E + D P+L
Sbjct: 131 LSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQVSNQCMALVRD---ECLLPCKDAPEL 183
Query: 296 SKMK-------------KDVVVGGKDVKEVDN----DFFLVVVKILDHQGPLSS------ 332
K KD G ++ ++ ++ ++ + + P+ +
Sbjct: 184 GYAKESSSEQYVPDVFYKDTDKFGNEITQLARPLPVEYLIIDITTTFPKDPVYTFSISPN 243
Query: 333 TFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
FPIENR TQ +L ++L+++ S + +SDFHLLLFL
Sbjct: 244 PFPIENRDVLGETQ-DFHSLATYLSQNTSAMFLDTVSDFHLLLFL 287
>gi|118348606|ref|XP_001007778.1| NPL4 family protein [Tetrahymena thermophila]
gi|89289545|gb|EAR87533.1| NPL4 family protein [Tetrahymena thermophila SB210]
Length = 1157
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/333 (30%), Positives = 152/333 (45%), Gaps = 61/333 (18%)
Query: 106 KMTMDDLIAKQMRVTRQENPHCESVSFD--RDCADAFQQYVNETLAFAVKRGGFMYGTVL 163
K ++ + + V RQ H + F R+ ++ Q + + A +R G+MYG
Sbjct: 852 KAMCNNCLPPNVVVYRQPYRHVDFAQFMNVREVSNFVQAWTQNS--HAEQRVGYMYGYYA 909
Query: 164 EDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI 217
ED R + IYEPPQ G IL DE E LV+ IA L ++K+G+IFT TI
Sbjct: 910 EDPNYKGGVRAIIEAIYEPPQIGDISGFQILEDENE-TLVNTIAEALTLEKIGWIFT-TI 967
Query: 218 MQNKKDYTLSNREVLQAVEFHAECNM--------EEWVTAVVKLEVSEEGSAAIHFEAFQ 269
D LS+ EV A F + + ++T V L ++ + I E +
Sbjct: 968 ---NHDTFLSSHEVRMAARFQEQYRVLHESGYPFSTFITTV--LRTGKDNPSEIKPEVYM 1022
Query: 270 MSDMCVRLFKEGWF-ETEIAEGDDPKLSKMKKDV----VVGGKDVKEVDNDFFLVVV--- 321
+SD L K F ++E + + SK ++DV +V GK V E + DF LV V
Sbjct: 1023 VSDQAQVLEKNNVFGDSERRKFMKVRESKDERDVLPSIIVSGKQVNEFEPDFLLVNVAHG 1082
Query: 322 -------KILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLA 374
I+ H + FP+ENR Q N+S ++ SDFHLL++LA
Sbjct: 1083 HSNHNKFSIVKH-----ADFPVENRQEQ----------NKS-----WEKYSDFHLLIYLA 1122
Query: 375 RFLDLNSDVPALAQCVQAQTAVPEGYKLLINSM 407
+ +DL + + +A V + VPE + LI +
Sbjct: 1123 KLVDLETTL-VIADAVAKEKDVPETCEELIRGI 1154
>gi|148226640|ref|NP_001088114.1| uncharacterized protein LOC494816 [Xenopus laevis]
gi|52430472|gb|AAH82846.1| LOC494816 protein [Xenopus laevis]
Length = 610
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 144/309 (46%), Gaps = 60/309 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T +R G++YG E K R E
Sbjct: 215 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQRIGYLYGRYTEHKDIPLGLRAE 271
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D+ + +VD IAA LG++KVG+IFT + +
Sbjct: 272 VAAIYEPPQIGTQNSLQLL-DDPKSNVVDGIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 330
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
NK Y LS E + A F + C + + + V + + +HFE +Q+S
Sbjct: 331 RNKDTYFLSAEECITAGYFQNKYPNLCRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 390
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
+ C+ L ++ F + D P+L K KD+ G ++ ++
Sbjct: 391 NQCMALVRDECF---LPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 447
Query: 315 DFFLVVVKILDHQGPL------SSTFPIENRT----TQVTMRALKSHLNRSPSLPLVKRI 364
++ ++ + + P+ + FPIENR TQ +L ++L+++ + I
Sbjct: 448 EYLIIDITTTFPKDPVYTFSTSCNLFPIENRDVLGETQ-DFHSLATYLSQNLPSVFLDII 506
Query: 365 SDFHLLLFL 373
SDFHLL FL
Sbjct: 507 SDFHLLQFL 515
>gi|410895239|ref|XP_003961107.1| PREDICTED: nuclear protein localization protein 4 homolog [Takifugu
rubripes]
Length = 624
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 93/308 (30%), Positives = 143/308 (46%), Gaps = 59/308 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T +R G++YG E K R E
Sbjct: 229 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQRMGYLYGRYTEHKDIPLGIRAE 285
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ T+ L ++ D+ + VD IAA LG+ KVG+IFT + +
Sbjct: 286 VAAIYEPPQNATQNSLELI-DDPKAAAVDEIAAKLGLCKVGWIFTDLLSEDMRIGTVCYS 344
Query: 220 -NKKDYTLSNREVLQAVEF---HAE-CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
NK + LS E + A F H+ C + + + V + + +HFE +Q+S
Sbjct: 345 RNKDSHYLSAEECITAGYFQNLHSNPCRLSRDGYFGSKFVTVVATGGPDNQVHFEGYQVS 404
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKKDVV------VGGKDVKEVDNDF-FL------ 318
+ C+ L ++ E + D P+L K+ V KD + ND FL
Sbjct: 405 NQCMALVRD---ECLLPCKDAPELGFAKESSPEQYVPDVFYKDTDKFGNDVTFLARPLPV 461
Query: 319 --VVVKIL-----DHQGPLSST--FPIENR----TTQVTMRALKSHLNRSPSLPLVKRIS 365
+++ I D Q SS FPIENR TQ +L ++L++ S + +S
Sbjct: 462 EYLIIDITTTFPKDPQYTFSSAQRFPIENRDILGETQ-NFHSLSTYLSQCTSTAFLGIVS 520
Query: 366 DFHLLLFL 373
DFHLLLFL
Sbjct: 521 DFHLLLFL 528
>gi|449478740|ref|XP_004177023.1| PREDICTED: LOW QUALITY PROTEIN: nuclear protein localization
protein 4 homolog [Taeniopygia guttata]
Length = 650
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 143/310 (46%), Gaps = 61/310 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T + G++YG E K R E
Sbjct: 225 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHLGYLYGRYTEHKDIPLGIRAE 281
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L IL D + E +VD IAA LG++KVG+IFT + +
Sbjct: 282 VAAIYEPPQIGTQNSLEILEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 340
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
NK Y LS E + A F + C + + + V + + +HFE +Q+S
Sbjct: 341 RNKDTYFLSAEECITAGNFQNQQPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 400
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN---- 314
+ C+ L ++ E + D P+L K KD+ G ++ ++
Sbjct: 401 NQCMALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPV 457
Query: 315 DFFLVVVKILDHQGP-------LSSTFPIENRT----TQVTMRALKSHLNRSPSLPLVKR 363
++ ++ + +G L + PIEN TQ +L ++L+++ S +
Sbjct: 458 EYLIIDITTTFPKGSSLHFFYFLKTHSPIENPMCLGETQ-DFHSLATYLSQNTSSIFLDI 516
Query: 364 ISDFHLLLFL 373
ISDFHLLLFL
Sbjct: 517 ISDFHLLLFL 526
>gi|351705102|gb|EHB08021.1| Nuclear protein localization protein 4-like protein [Heterocephalus
glaber]
Length = 583
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 83/271 (30%), Positives = 125/271 (46%), Gaps = 64/271 (23%)
Query: 158 MYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGF 211
+YG E K R EV IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+
Sbjct: 227 LYGRYTEKKDIPLGIRAEVAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGW 285
Query: 212 IFTQTI-----------MQNKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTA 249
IFT I QNK Y L + E + A +F + C + ++VTA
Sbjct: 286 IFTDLISEDTRKGTVRYSQNKDTYFLCSEECITAGDFQNKHPNICRLSSDGHFGSKFVTA 345
Query: 250 VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMK---------- 299
V + +HFE FQ+S C+ L ++ E + D P+L K
Sbjct: 346 VATGGPDNQ----VHFEGFQVSSQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPD 398
Query: 300 ---KDVVVGGKDVKEVDND----FFLVVVKILDHQGPL------SSTFPIENR----TTQ 342
KD+ G ++ ++ + ++ + + P+ + FPIEN TQ
Sbjct: 399 VFHKDIDKSGNEITQLARPLPVVYLIIDITTTFPKDPVYTFSISQNPFPIENWDVLGETQ 458
Query: 343 VTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
+L ++L+++ S + ISDFHLLLFL
Sbjct: 459 -DFHSLATYLSQNTSFVFLDTISDFHLLLFL 488
>gi|260817485|ref|XP_002603617.1| hypothetical protein BRAFLDRAFT_126926 [Branchiostoma floridae]
gi|229288937|gb|EEN59628.1| hypothetical protein BRAFLDRAFT_126926 [Branchiostoma floridae]
Length = 530
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 145/331 (43%), Gaps = 61/331 (18%)
Query: 119 VTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVN 171
+ RQ+ H +++ F+ D F Y +T +R GF+YG K R V
Sbjct: 134 LNRQKYRHVDNIMFENHTIVDRFLDYWRKT---GNQRLGFLYGKYEHHKDVPLGIRATVA 190
Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQN 220
IYEPPQ GT L +L D + +VD IA LG++KVG+IFT + +N
Sbjct: 191 AIYEPPQVGTANSLELLEDRNAD-VVDDIARKLGLRKVGWIFTDLVADDSGKGTVKNFRN 249
Query: 221 KKDYTLSNREVLQAVEF---HAECNM----EEWVTAVVKLEVSEEGSAAIHFEAFQMSDM 273
LS E + A EF H C + + V + S + S IHFE +Q+S+
Sbjct: 250 ADSVFLSAEECIMAGEFQNQHPNCTKLSPDGTFGSKFVTVVASGDESHQIHFEGYQVSNQ 309
Query: 274 CVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDNDF 316
C+ L ++ + + D P+L +K+ DV KD + + ++
Sbjct: 310 CMALVRD---DCLVPTKDAPELGYIKESTPEQYVPDVFYKEKDSYGNLVTLLARPLPVEY 366
Query: 317 FLVVVKILDHQGPLSS-------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISD 366
LV V P+S+ FPIENR ++ L +++ + P L++ +SD
Sbjct: 367 LLVQVTTGFPVDPVSTFNIRVAKPFPIENRADIGDIQDFGQLTAYMQQFPGYRLLETMSD 426
Query: 367 FHLLLFLAR--FLDLNSDVPALAQCVQAQTA 395
H L +LA L L + L + V+ Q A
Sbjct: 427 LHFLSYLATMDMLPLKDHMDPLLEGVREQNA 457
>gi|41054207|ref|NP_956101.1| nuclear protein localization protein 4 homolog [Danio rerio]
gi|28277776|gb|AAH45845.1| Nuclear protein localization 4 homolog (S. cerevisiae) [Danio
rerio]
gi|182889954|gb|AAI65856.1| Nploc4 protein [Danio rerio]
Length = 624
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 145/309 (46%), Gaps = 60/309 (19%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T +R G++YG E K R E
Sbjct: 229 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQRMGYLYGRYTEHKDIPLGIRAE 285
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
V IYEPPQ T+ L ++ D + + V+ IAA LG++KVG+IFT +
Sbjct: 286 VAAIYEPPQIATQNSLELIEDPKADA-VEEIAAKLGLRKVGWIFTDLLSEDTRIGTVRYT 344
Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
+NK Y LS E + A F + C + + + V + + +HFE +Q+S
Sbjct: 345 RNKDSYFLSAEECITAGHFQNQQANMCRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 404
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDN 314
+ C+ L ++ E + D P+L K+ DV KD + +
Sbjct: 405 NQCMALVRD---ECLLPCRDAPELGYAKESSTEQYVPDVFYKDKDKFGNDITHLARPLPV 461
Query: 315 DFFLVVVKILDHQGPL---SST--FPIENR----TTQVTMRALKSHLNR-SPSLPLVKRI 364
++ ++ + + P+ SST FPIENR TQ +L ++L++ + S + +
Sbjct: 462 EYLIIDITTTFPKDPVFTFSSTFRFPIENRDALGETQ-DFHSLATYLSQCTSSTSFLDIV 520
Query: 365 SDFHLLLFL 373
SDFHLLLFL
Sbjct: 521 SDFHLLLFL 529
>gi|383170952|gb|AFG68738.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170953|gb|AFG68739.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170954|gb|AFG68740.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170955|gb|AFG68741.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170956|gb|AFG68742.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170957|gb|AFG68743.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170958|gb|AFG68744.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170960|gb|AFG68746.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170961|gb|AFG68747.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170962|gb|AFG68748.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170963|gb|AFG68749.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170964|gb|AFG68750.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170965|gb|AFG68751.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170966|gb|AFG68752.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170967|gb|AFG68753.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
gi|383170968|gb|AFG68754.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
Length = 51
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 39/51 (76%), Positives = 43/51 (84%)
Query: 110 DDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYG 160
DDLIA+Q+R+ RQE PH S SFDRD A+ FQQYVNE LAFAVKRGGFMYG
Sbjct: 1 DDLIARQVRIERQEKPHSVSASFDRDAANVFQQYVNENLAFAVKRGGFMYG 51
>gi|221123865|ref|XP_002157402.1| PREDICTED: nuclear protein localization protein 4 homolog [Hydra
magnipapillata]
Length = 621
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 80/306 (26%), Positives = 144/306 (47%), Gaps = 58/306 (18%)
Query: 119 VTRQENPHCESVSFDRD-CADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVN 171
++RQ+ H +++ F+ D F Y T +R G++YG + R V
Sbjct: 225 LSRQKYRHVDNIMFENPMIMDRFLNYWRNT---GFQRLGYLYGKYEKHDNVPLGIRATVA 281
Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQN 220
IYEPPQ+ T++ + +L D +E V+ +A+ +G+++VG+IFT + +++
Sbjct: 282 AIYEPPQENTKDRIQLLEDNHDE-FVEILASYMGLRRVGWIFTDLLPDEENSGLVKHLRH 340
Query: 221 KKDYTLSNREVLQAVEFHA-------ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDM 273
K + LS +E + A F + + ++ + V + VS IH++ +Q+S+
Sbjct: 341 SKSHFLSAQECITAAHFQNKYPNSCRDSSAGKFGSKFVTVSVSGHEDKHIHYDGWQVSNQ 400
Query: 274 CVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KEVDNDF 316
C+ L ++G I DDP L +K+ DV KDV + + ++
Sbjct: 401 CMALVRDGCLVPTI---DDPALGYIKESSSDQYVPDVFYKEKDVYGNEVTRIGRPMPMEY 457
Query: 317 FLVVVKI---LDHQ----GPLSSTFPIENRTT--QVTMRALKSHLNRSPSLPLVKRISDF 367
FL+ V ++ Q G S FPIENR+ ++ + + R+ S +V SDF
Sbjct: 458 FLIEVPAAFPIEPQYTFVGCRDSPFPIENRSQIGEIQNFDVFAQYVRTISSNIVDGFSDF 517
Query: 368 HLLLFL 373
H LL++
Sbjct: 518 HFLLYM 523
>gi|383170959|gb|AFG68745.1| Pinus taeda anonymous locus 0_3211_01 genomic sequence
Length = 51
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 38/51 (74%), Positives = 42/51 (82%)
Query: 110 DDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYG 160
DDLIA+Q+R+ RQE PH SFDRD A+ FQQYVNE LAFAVKRGGFMYG
Sbjct: 1 DDLIARQVRIERQEKPHSVFASFDRDAANVFQQYVNENLAFAVKRGGFMYG 51
>gi|429327757|gb|AFZ79517.1| hypothetical protein BEWA_023660 [Babesia equi]
Length = 384
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/273 (28%), Positives = 121/273 (44%), Gaps = 43/273 (15%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
F Y +TL A +R G++YG +ED R IYEPPQ+ E +L D
Sbjct: 109 FANYWIDTLEMAEQRAGWLYGYYIEDSHYPLGIRAVCEAIYEPPQRSFMEYSELLSDPFL 168
Query: 194 EKLVDAIAAGLGMKKVGFIFT----QTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTA 249
VD IA LG++++G+IFT + +M K + R++ A + H V+
Sbjct: 169 NT-VDRIANKLGLERIGYIFTHLPRENVMTPKDIIDAAKRQLDTAKDIHYTGYP---VST 224
Query: 250 VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------- 300
V + +G AF SD + L ++G F E +D K+ +++K
Sbjct: 225 HVTCTMCPDGEGKPVLNAFMASDSAMALLRDGMFS---EEQNDIKMVEIRKASSSFEILP 281
Query: 301 DVVVGGKDVKEVDNDFFLVVVKILDHQGPL----SSTFPIENRTTQVTMRALKSHLNRSP 356
+ GK+V D+D+ +V V P+ + FP ENR + K +P
Sbjct: 282 QIFESGKEVSSFDSDWLIVRVNDSAPIHPMPFFKHTGFPRENREVSTDVNIWK--YTNTP 339
Query: 357 SLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
DFHLLL++A LD+++ ALA C
Sbjct: 340 --------CDFHLLLYIANTLDVDT---ALAVC 361
>gi|355569013|gb|EHH25294.1| hypothetical protein EGK_09090 [Macaca mulatta]
Length = 629
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 98/349 (28%), Positives = 148/349 (42%), Gaps = 104/349 (29%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 198 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 254
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 255 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 313
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 314 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 369
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DV----------------- 302
+Q+S+ C+ L ++ E + D P+L K+ DV
Sbjct: 370 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKVKVCVPITASRARN 426
Query: 303 ------------VVGGKDVKEVDNDFFLVVVKILDHQGP-------LSSTF--------- 334
V GG D D F + L P +++TF
Sbjct: 427 VEAVLTVAATRHVEGGHTTLTHDVDKFGNEITQLARPLPVEYLIIDITTTFPKDPVYTFS 486
Query: 335 ------PIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
PIENR TQ +L ++L+++ S + ISDFHLLLFL
Sbjct: 487 ISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDTISDFHLLLFL 534
>gi|355754452|gb|EHH58417.1| hypothetical protein EGM_08267 [Macaca fascicularis]
Length = 704
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 98/349 (28%), Positives = 148/349 (42%), Gaps = 104/349 (29%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 273 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 329
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + +
Sbjct: 330 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 388
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEA 267
NK Y LS+ E + A +F + C + ++VTAV + +HFE
Sbjct: 389 RNKDTYFLSSEECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEG 444
Query: 268 FQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DV----------------- 302
+Q+S+ C+ L ++ E + D P+L K+ DV
Sbjct: 445 YQVSNQCMALVRD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKVKVCVPITASRARN 501
Query: 303 ------------VVGGKDVKEVDNDFFLVVVKILDHQGP-------LSSTFP-------- 335
V GG D D F + L P +++TFP
Sbjct: 502 VEAVLTVAATRHVEGGHTTLTHDVDKFGNEITQLARPLPVEYLIIDITTTFPKDPVYTFS 561
Query: 336 -------IENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
IENR TQ +L ++L+++ S + ISDFHLLLFL
Sbjct: 562 ISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDTISDFHLLLFL 609
>gi|395826791|ref|XP_003786598.1| PREDICTED: nuclear protein localization protein 4 homolog [Otolemur
garnettii]
Length = 625
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 136/288 (47%), Gaps = 51/288 (17%)
Query: 119 VTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVN 171
+ RQ+ H +++ F + AD F + +T + G++YG E K R EV
Sbjct: 261 LNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAEVA 317
Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ-----------N 220
IYEPPQ GT+ L +L D + E +VD IAA LG++KVG+IFT + + N
Sbjct: 318 AIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRN 376
Query: 221 KKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE 280
K Y LS+ E + A +F + + +L S +G HF + + +F +
Sbjct: 377 KDTYFLSSEECITAGDFQNK------HPNICRL--SPDG----HFGS-KFVTAVATVFGQ 423
Query: 281 GWFET-EIAEGDDPKLSKMKKDVVVGGKDVKEVDN----DFFLVVVKILDHQGPLSS--- 332
GW T + S +D+ G ++ ++ ++ ++ + + P+ +
Sbjct: 424 GWVRTCHLRYMGLAPYSFEPQDIDKFGNEITQLARPLPVEYLIIDITTTFPKDPVHTFSI 483
Query: 333 ---TFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
FPIENR TQ +L ++L+++ S + ISDFHLLLFL
Sbjct: 484 SQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSMFLDTISDFHLLLFL 530
>gi|91088135|ref|XP_970927.1| PREDICTED: similar to nuclear protein localization [Tribolium
castaneum]
Length = 638
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/310 (25%), Positives = 143/310 (46%), Gaps = 58/310 (18%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
+ + RQ H ++V+F+ D ++++N + +R GF+YG V R V
Sbjct: 235 ITLNRQVYRHVDNVAFEN--KDLVERFLNYWRSTGHQRIGFLYGNYEIHNDVPLGIRANV 292
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
IYEPPQ+ + + +++LRD++EE +V+ +A LG+++VG+IFT I ++
Sbjct: 293 VAIYEPPQESSRDSIHLLRDDKEE-IVEELAQNLGLRRVGWIFTDLIPEDVQKGTVKHLR 351
Query: 220 NKKDYTLSNREVLQAVEFH----AECNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
+ + LS E + A F C + + +V + V+ + + +H E +Q+S+
Sbjct: 352 HINSHFLSAEECIMAGHFQNMYPNPCRFASGGYFGSKIVTVCVTGDKTNQVHMEGYQVSN 411
Query: 273 MCVRLFKEGWFETEIAEGDDPKLSKMKKDV---VVGGKDVKEVDN--------------D 315
C+ L ++ + D P+L +++ V KE DN +
Sbjct: 412 QCMALVRDNCL---LPTKDAPELGYVRESSDKQFVPDVYYKEKDNYGNEVSRLGRPLPVE 468
Query: 316 FFLVVVKILD--------HQGPLSSTFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRI 364
+ L+ V + P FP+ENR ++ AL +L++ S I
Sbjct: 469 YLLLDVPASTPVTPTYTFNCDPSKQPFPVENRFIDGEIQDFNALNQYLSQFTSNEFFTAI 528
Query: 365 SDFHLLLFLA 374
+DFHLLL++A
Sbjct: 529 NDFHLLLYIA 538
>gi|403333884|gb|EJY66073.1| Nuclear pore associated protein (NLP4), putative [Oxytricha
trifallax]
Length = 695
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 133/275 (48%), Gaps = 40/275 (14%)
Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R ++YG ED RV + IYEPPQ G + L D + K VD IA L +
Sbjct: 269 QRMAWLYGYYSEDPNYPEGVRVNIEAIYEPPQVGEINGVSELDDPMQAK-VDMIAEALTL 327
Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEF-------HAE-CNMEEWVTAVVKLEVSEE 258
+KVG+IFT D LS +EV + ++ H E N+ ++VT VVK +
Sbjct: 328 EKVGWIFTSI----NHDAFLSAQEVRKISKYQEQYKVEHPEGYNVSKFVTVVVK---PKG 380
Query: 259 GSAAIHFEAFQMSDMCVRLFKEGWF-ETE------IAEGDDPKLSKMKKDVVVGGKDVKE 311
+ I E + +SD C L ++ F E+E + EG ++M VV GK KE
Sbjct: 381 DGSEIGIECYMVSDQCQALERDNIFGESESRKKMVLREGGP---NEMIPSVVKEGKPAKE 437
Query: 312 VDNDFFLVVV------KILDHQGPLSSTFPIENRTTQV-TMRALKSHLNRSPSLPLVKRI 364
+ DFF+V + D++ + FP++NR + + +L R P ++
Sbjct: 438 FEPDFFIVSLANGQPKSNKDYKYLKNYDFPMQNRIGRTPAKKEFSDYLKRHKGEPSQRKY 497
Query: 365 SDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEG 399
++ HLL++LA LD+++ + A+AQ V + + G
Sbjct: 498 ANMHLLVYLADLLDIHTAM-AIAQNVAQEQPLDPG 531
>gi|156085300|ref|XP_001610127.1| NPL4 family protein [Babesia bovis]
gi|154797379|gb|EDO06559.1| NPL4 family protein [Babesia bovis]
Length = 480
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/280 (29%), Positives = 124/280 (44%), Gaps = 42/280 (15%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRDEEE 193
F Y L A +R G++YG +ED + IYEPPQ T + L DE
Sbjct: 193 FANYWMNDLEMAEQRAGWLYGYYVEDSHYPLGIRAVCEGIYEPPQHSTLCDVEFLPDEFI 252
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF-----HAECNMEEWVT 248
VDAIAA LG++++G I T +N LS ++V+ + + H V+
Sbjct: 253 ST-VDAIAARLGLERIGHILTHLPRENY----LSPQDVIDSAKVQLGRVHKTHYTRYPVS 307
Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAE-GDDPKLSKMKK------- 300
+ + + AF +SD + L ++G IA+ DP + M++
Sbjct: 308 THITCTLHPDSEGKPALNAFMVSDTAMALLRDGI----IADVQPDPMVISMRQPTNKNEV 363
Query: 301 --DVVVGGKDVKEVDNDFFLVVVKILDHQGPLS----STFPIENRTTQ-VTMRALK---- 349
++ GK+V D +F++ V P S S FP ENRT + +T A+K
Sbjct: 364 LPQIIESGKEVTSFDPSWFVIRVNDSAPINPNSIFKYSDFPRENRTIRPITPDAVKKFFS 423
Query: 350 SHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
S L SDFHLL+++AR LD+ + ALA C
Sbjct: 424 SRLAEGLETSDANIFSDFHLLIYMARVLDVET---ALAIC 460
>gi|426346408|ref|XP_004040871.1| PREDICTED: nuclear protein localization protein 4 homolog [Gorilla
gorilla gorilla]
Length = 584
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 85/302 (28%), Positives = 139/302 (46%), Gaps = 70/302 (23%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 213 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 269
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNR 229
V IYEPPQ GT+ L +L D + E +VD IAA LG++ K Y LS+
Sbjct: 270 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLR-------------KDTYFLSSE 315
Query: 230 EVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLF 278
E + A +F + C + ++VTAV + +HFE +Q+S+ C+ L
Sbjct: 316 ECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQVSNQCMALV 371
Query: 279 KEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN----DFFLVVV 321
++ E + D P+L K KDV G ++ ++ ++ ++ +
Sbjct: 372 RD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLARPLPVEYLIIDI 428
Query: 322 KILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLL 371
+ P+ + FPIENR TQ +L ++L+++ S + ISDFHLLL
Sbjct: 429 TTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDTISDFHLLL 487
Query: 372 FL 373
FL
Sbjct: 488 FL 489
>gi|403280765|ref|XP_003931880.1| PREDICTED: nuclear protein localization protein 4 homolog [Saimiri
boliviensis boliviensis]
Length = 542
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/302 (28%), Positives = 139/302 (46%), Gaps = 70/302 (23%)
Query: 117 MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F + AD F + +T + G++YG E K R E
Sbjct: 171 ITLNRQKYRHVDNIMFENHTVADRFLDFWRKT---GNQHFGYLYGRYTEHKDIPLGIRAE 227
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNR 229
V IYEPPQ GT+ L +L D + E +VD IAA LG++ K Y LS+
Sbjct: 228 VAAIYEPPQIGTQNSLELLEDPKAE-VVDEIAAKLGLR-------------KDTYFLSSE 273
Query: 230 EVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLF 278
E + A +F + C + ++VTAV + +HFE +Q+S+ C+ L
Sbjct: 274 ECITAGDFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQVSNQCMALV 329
Query: 279 KEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN----DFFLVVV 321
++ E + D P+L K KDV G ++ ++ ++ ++ +
Sbjct: 330 RD---ECLLPCKDAPELGYAKESSSEQYVPDVFYKDVDKFGNEITQLARPLPVEYLIIDI 386
Query: 322 KILDHQGPL------SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLL 371
+ P+ + FPIENR TQ +L ++L+++ S + ISDFHLLL
Sbjct: 387 TTTFPKDPVYTFSISQNPFPIENRDVLGETQ-DFHSLATYLSQNTSSVFLDTISDFHLLL 445
Query: 372 FL 373
FL
Sbjct: 446 FL 447
>gi|195395696|ref|XP_002056472.1| GJ10209 [Drosophila virilis]
gi|194143181|gb|EDW59584.1| GJ10209 [Drosophila virilis]
Length = 655
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 147/329 (44%), Gaps = 60/329 (18%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNF 172
+ RQ H ++V F+ ++++N +R GF+YGT + R +V
Sbjct: 253 LNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGFLYGTYEQHADVPLGIRAKVAA 310
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT------- 225
IYEPPQ+ + + + I DE ++ VDA+A LG+KK+G+IFT I ++ T
Sbjct: 311 IYEPPQESSRDSINICPDEGADE-VDAVAKALGLKKIGWIFTDLITEDASAGTVKQIRGI 369
Query: 226 ----LSNREVLQAVEF-HAECNMEEWVT------AVVKLEVSEEGSAAIHFEAFQMSDMC 274
L+ +E + A E + N ++ T V + V+ + + +H E + +S C
Sbjct: 370 ETHFLTAQECISAGELQNRHPNPCKYATNGTFGSKFVTICVTGDQTKQVHMEGYAVSAQC 429
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDNDFF 317
+ L ++ I D P+L +++ DV KD + + ++
Sbjct: 430 MALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARPLPVEYL 486
Query: 318 LVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISD 366
LV V P + FPIENR ++ AL S+L+ ++ ISD
Sbjct: 487 LVDVPASTPLQPQYTFTEYDKRQPFPIENRYIDGHLQDFNALSSYLSAWAEEDFLEAISD 546
Query: 367 FHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
FHLL++L + L L + L + V+A+
Sbjct: 547 FHLLIYLYKMDMLPLRQHMGPLLEAVRAK 575
>gi|432926050|ref|XP_004080805.1| PREDICTED: nuclear protein localization protein 4 homolog [Oryzias
latipes]
Length = 550
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 80/288 (27%), Positives = 124/288 (43%), Gaps = 64/288 (22%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T +R G++YG E K R E
Sbjct: 229 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GSQRVGYLYGRYTEHKDIPLGIRAE 285
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
V IYEPPQ + L +L D + VD IAA LG+ KVG+IFT + +
Sbjct: 286 VAAIYEPPQTANQNSLELLEDPKAAA-VDEIAAKLGLCKVGWIFTDLLSEDTRIGTVRYL 344
Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
+N+ Y LS E + + F + C + + + V + + +HFE +Q+S
Sbjct: 345 RNQDSYYLSAEECITSGYFQNQHPNPCRLSRDGFFGSKFVTVVATGGPDNQVHFEGYQVS 404
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLS 331
+ C+ L + D L + D Q +
Sbjct: 405 NQCMALVR------------DECLLPFPGSITTTFPK----------------DPQYTFT 436
Query: 332 ST--FPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
ST FPIENR TQ +L ++L++ S + +SDFHLLLFL
Sbjct: 437 STQRFPIENRDILGETQ-DFHSLATYLSQCSSTSFLDIVSDFHLLLFL 483
>gi|449269906|gb|EMC80643.1| Nuclear protein localization protein 4 like protein, partial
[Columba livia]
Length = 453
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/255 (28%), Positives = 118/255 (46%), Gaps = 37/255 (14%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ AD F + +T + G++YG E K R E
Sbjct: 209 ITLNRQKYRHVDNIMFENHTIADRFLDFWRKT---GNQHLGYLYGRYTEHKDIPLGIRAE 265
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ---------- 219
V IYEPPQ GT+ L IL D + E +VD IAA LG++KVG+IFT + +
Sbjct: 266 VAAIYEPPQIGTQNSLEILDDPKAE-VVDEIAAKLGLRKVGWIFTDLVSEDTRKGTVRYS 324
Query: 220 -NKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMS 271
NK Y LS E + A F + C + + + V + + +HFE +Q+S
Sbjct: 325 RNKDTYYLSAEECITAGHFQNQQPNICRLSPDGHFGSKFVTVVATGGPDNQVHFEGYQVS 384
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLS 331
+ C+ L ++ E + D P+L K+ + V + F+ + K + L+
Sbjct: 385 NQCMALVRD---ECLLPCRDAPELGYAKES-----SSEQYVPDVFYKDIDKFGNEITQLA 436
Query: 332 STFPIENRTTQVTMR 346
P+E V++R
Sbjct: 437 RPLPVEYLIIDVSVR 451
>gi|427793885|gb|JAA62394.1| Putative nuclear pore complex rnpl4 component sc npl4, partial
[Rhipicephalus pulchellus]
Length = 588
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 145/317 (45%), Gaps = 66/317 (20%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ H + V F+ + F + ET+ +R GF+YG K R
Sbjct: 198 LTLNRQVYRHVDMVMFENPTIVEHFLDFWRETMH---QRIGFLYGRYEAHKDVPLGIRAT 254
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
V IYEPPQ+ T + + +L DE+E+ LVD +A+ LG+ +VG+IFT + +
Sbjct: 255 VAAIYEPPQESTPDAVKLLPDEKEQ-LVDELASRLGLVRVGWIFTDLVPEDLQKGTVKHL 313
Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
+N + LS +E + A F C + + + V + V+ + +H E +Q+S
Sbjct: 314 RNIESCFLSAQECIMAGHFQNSHPNPCRLSPDGYFGSKFVTVCVTGDSENRVHMEGYQVS 373
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVG-----GKDVKEVDN---- 314
+ C+ L ++ + D P+L +++ DV G +V ++
Sbjct: 374 NQCMALVRDNCL---VPTRDAPELGYVRESSNAQYVPDVFYKYTDSYGNEVTQMARPLPI 430
Query: 315 DFFLVVVKILDHQGPLSST---------FPIENRTTQVTMR---ALKSHL-----NRSPS 357
++ LV V + + P T FP+ENR + ++ A+ HL N + S
Sbjct: 431 EYLLVDVPVSTPKDPKEPTFYAAPGKRHFPVENRLLEGHIQDLGAVARHLSQFTANNAGS 490
Query: 358 LPLVKRISDFHLLLFLA 374
+++ SDFHLLL+LA
Sbjct: 491 ENVLEAFSDFHLLLYLA 507
>gi|427793273|gb|JAA62088.1| Putative nuclear pore complex rnpl4 component sc npl4, partial
[Rhipicephalus pulchellus]
Length = 628
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 83/317 (26%), Positives = 143/317 (45%), Gaps = 66/317 (20%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ H + V F+ + F + ET+ +R GF+YG K R
Sbjct: 238 LTLNRQVYRHVDMVMFENPTIVEHFLDFWRETMH---QRIGFLYGRYEAHKDVPLGIRAT 294
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
V IYEPPQ+ T + + +L DE+E+ LVD +A+ LG+ +VG+IFT + +
Sbjct: 295 VAAIYEPPQESTPDAVKLLPDEKEQ-LVDELASRLGLVRVGWIFTDLVPEDLQKGTVKHL 353
Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
+N + LS +E + A F C + + + V + V+ + +H E +Q+S
Sbjct: 354 RNIESCFLSAQECIMAGHFQNSHPNPCRLSPDGYFGSKFVTVCVTGDSENRVHMEGYQVS 413
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDN 314
+ C+ L ++ + D P+L +++ DV D + +
Sbjct: 414 NQCMALVRDNCL---VPTRDAPELGYVRESSNAQYVPDVFYKYTDSYGNEVTQMARPLPI 470
Query: 315 DFFLVVVKILDHQGPLSST---------FPIENRTTQVTMR---ALKSHL-----NRSPS 357
++ LV V + + P T FP+ENR + ++ A+ HL N + S
Sbjct: 471 EYLLVDVPVSTPKDPKEPTFYAAPGKRHFPVENRLLEGHIQDLGAVARHLSQFTANNAGS 530
Query: 358 LPLVKRISDFHLLLFLA 374
+++ SDFHLLL+LA
Sbjct: 531 ENVLEAFSDFHLLLYLA 547
>gi|428183916|gb|EKX52773.1| NPL4-like protein, forms a complex with UFD1 and CDC48 [Guillardia
theta CCMP2712]
Length = 656
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 141/323 (43%), Gaps = 56/323 (17%)
Query: 103 FGRKMTMDDLIAKQMRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGT 161
+G + + K M + RQ+ H + V D D F +T A R G++YGT
Sbjct: 212 WGERTVSMSRLPKPMTLDRQKYRHVDRVEIEDPQILDKFLDAWRQTGAM---RYGYLYGT 268
Query: 162 VLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIF-- 213
D V V IYEPPQ+ + + +LRD + +D+ AA LG+ KVG+IF
Sbjct: 269 YEVDTNVPLGIKAVVKAIYEPPQECAADGVNMLRDPYSDA-IDSTAAMLGLVKVGWIFID 327
Query: 214 --------TQTIMQNKKD-YTLSNREVLQAVEFHA-------ECNMEEWVTAVVKLEVSE 257
+ + KD Y L + E + A EC+ E+ + V + V+
Sbjct: 328 LEVMADDHNKYVCSRGKDTYFLRSNECIIAARLQNMNPSKCLECDDLEFGSKFVTVVVTG 387
Query: 258 EGSAAIHFEAFQMSDMCVRLFK-EGWFETEIAEGDDPKLSKMKKDVVVGG---KDVKEVD 313
+ + A+Q+S + L + EG E A+ + ++++ D V +DV E
Sbjct: 388 NEKNELDYVAYQVSQQGMDLERAEGILEY-TAKPNLCRVAQSTPDRYVPAIFYQDVSEFG 446
Query: 314 ND-------FFLVVVKILDHQGP---------LSSTFPIENRT-TQVTMRALKSHLNRSP 356
N +F V ++ GP FPIENR T+ ALK+HLN
Sbjct: 447 NKMVHEAKPYFPVDYLVVSVNGPTFPQQANPLFKRNFPIENRMYAPQTLEALKAHLNSPD 506
Query: 357 SLPL-----VKRISDFHLLLFLA 374
+ P +K ISDFHL+++LA
Sbjct: 507 NKPFRPTGQLKTISDFHLIVYLA 529
>gi|427797169|gb|JAA64036.1| Putative nuclear pore complex rnpl4 component sc npl4, partial
[Rhipicephalus pulchellus]
Length = 645
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 145/317 (45%), Gaps = 66/317 (20%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ H + V F+ + F + ET+ +R GF+YG K R
Sbjct: 255 LTLNRQVYRHVDMVMFENPTIVEHFLDFWRETMH---QRIGFLYGRYEAHKDVPLGIRAT 311
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
V IYEPPQ+ T + + +L DE+E+ LVD +A+ LG+ +VG+IFT + +
Sbjct: 312 VAAIYEPPQESTPDAVKLLPDEKEQ-LVDELASRLGLVRVGWIFTDLVPEDLQKGTVKHL 370
Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
+N + LS +E + A F C + + + V + V+ + +H E +Q+S
Sbjct: 371 RNIESCFLSAQECIMAGHFQNSHPNPCRLSPDGYFGSKFVTVCVTGDSENRVHMEGYQVS 430
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVG-----GKDVKEVDN---- 314
+ C+ L ++ + D P+L +++ DV G +V ++
Sbjct: 431 NQCMALVRDNCL---VPTRDAPELGYVRESSNAQYVPDVFYKYTDSYGNEVTQMARPLPI 487
Query: 315 DFFLVVVKILDHQGPLSST---------FPIENRTTQVTMR---ALKSHL-----NRSPS 357
++ LV V + + P T FP+ENR + ++ A+ HL N + S
Sbjct: 488 EYLLVDVPVSTPKDPKEPTFYAAPGKRHFPVENRLLEGHIQDLGAVARHLSQFTANNAGS 547
Query: 358 LPLVKRISDFHLLLFLA 374
+++ SDFHLLL+LA
Sbjct: 548 ENVLEAFSDFHLLLYLA 564
>gi|427783009|gb|JAA56956.1| Putative nuclear pore complex rnpl4 component sc npl4
[Rhipicephalus pulchellus]
Length = 592
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 145/317 (45%), Gaps = 66/317 (20%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ H + V F+ + F + ET+ +R GF+YG K R
Sbjct: 202 LTLNRQVYRHVDMVMFENPTIVEHFLDFWRETMH---QRIGFLYGRYEAHKDVPLGIRAT 258
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
V IYEPPQ+ T + + +L DE+E+ LVD +A+ LG+ +VG+IFT + +
Sbjct: 259 VAAIYEPPQESTPDAVKLLPDEKEQ-LVDELASRLGLVRVGWIFTDLVPEDLQKGTVKHL 317
Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
+N + LS +E + A F C + + + V + V+ + +H E +Q+S
Sbjct: 318 RNIESCFLSAQECIMAGHFQNSHPNPCRLSPDGYFGSKFVTVCVTGDSENRVHMEGYQVS 377
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVG-----GKDVKEVDN---- 314
+ C+ L ++ + D P+L +++ DV G +V ++
Sbjct: 378 NQCMALVRDNCL---VPTRDAPELGYVRESSNAQYVPDVFYKYTDSYGNEVTQMARPLPI 434
Query: 315 DFFLVVVKILDHQGPLSST---------FPIENRTTQVTMR---ALKSHL-----NRSPS 357
++ LV V + + P T FP+ENR + ++ A+ HL N + S
Sbjct: 435 EYLLVDVPVSTPKDPKEPTFYAAPGKRHFPVENRLLEGHIQDLGAVARHLSQFTANNAGS 494
Query: 358 LPLVKRISDFHLLLFLA 374
+++ SDFHLLL+LA
Sbjct: 495 ENVLEAFSDFHLLLYLA 511
>gi|397607010|gb|EJK59513.1| hypothetical protein THAOC_20248 [Thalassiosira oceanica]
Length = 499
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 127/514 (24%), Positives = 215/514 (41%), Gaps = 127/514 (24%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M++R+R+ G+ RV A T A++ + I + V + LS++ P D
Sbjct: 1 MIVRVRTNVGVWRVEGLDASSTTADVLSGIAATRPHVVYERPLSSDPGCA---RPLD--- 54
Query: 61 FTDMANPDRPLSSLNISHGSIVF-------------------LSYDGERNVRGPSFN--- 98
P PL N+ HGS+V + D R+ R P+ N
Sbjct: 55 ------PAVPLGRQNVGHGSMVHCRVDPGTCAENTADGGGESMDTDDSRDGRRPAGNTKR 108
Query: 99 -----------------PAGS-----FGRKMTMDDLIAKQM------------RVTRQEN 124
AGS G+K D +A + ++ RQ+
Sbjct: 109 IIDKDGSIKLVHAEGSSSAGSDKGFRKGQKALRDMKMAWTLQEFTAMDDQYNFKIKRQDA 168
Query: 125 PHCE--SVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK---------------- 166
V + D FQ Y+ F +R G++YG ++++
Sbjct: 169 SFVGKGGVHLESSAVDGFQSYLRR-FQFQQQRCGYLYGRFVDEEGDGDEEKDDGADKPAA 227
Query: 167 -------------------RVEVNFIYEPPQQGTEEVL--YILRDEEEEKLVDAIAAGLG 205
RV V IYEPPQ+ + ++L ++E E+ V+A+A LG
Sbjct: 228 SSGWGARTPEEKKEAKKNARVVVEAIYEPPQEADADAAEGFVLLEDEAEEDVEALAGMLG 287
Query: 206 MKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAEC--NMEEWVTAVVKLEVSEEGSAAI 263
+++VG+I ++ + LS+ EV+ A E E +E V++ V ++G+ A+
Sbjct: 288 LRRVGWIVGHP--PREEGFQLSSAEVIMAAELQLESAGGVEPTPFVTVRVTVGDDGNVAV 345
Query: 264 HFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKD--VVVGGKDVKEVDNDFFLVVV 321
EAFQ+S C+ + E + G +P + + + GK V+N+FFL VV
Sbjct: 346 --EAFQVSLQCMHMVAE----EALGVGPNPGFCYVDETFTAIQEGKPSATVENNFFLTVV 399
Query: 322 KILDHQGP-LSSTFPIENRTTQVTMRA---LKSHLNRS--PSLPLVKRISDFHLLLFLAR 375
I +Q L + FP NR M+ +K L++S + +SDF LLL+L +
Sbjct: 400 PINAYQSDMLVAQFPTANRAFDDRMQTHDEMKRQLSKSGQAGWNFIDLLSDFSLLLYLCK 459
Query: 376 FLDLNSDVPALAQCVQ-AQTAVPEGYKLLINSMA 408
F+D+ +D+P + V+ + + +GYK++I SMA
Sbjct: 460 FMDVKTDMPKICASVKDREIPLDDGYKIIITSMA 493
>gi|157112064|ref|XP_001651776.1| nuclear protein localization [Aedes aegypti]
gi|108878169|gb|EAT42394.1| AAEL006056-PA [Aedes aegypti]
Length = 619
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/330 (25%), Positives = 148/330 (44%), Gaps = 61/330 (18%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
+ RQ H ++V F+ ++++N +R G ++GT V R V
Sbjct: 224 LNRQAYRHIDNVMFEN--TGIVERFLNYWRTTGHQRIGMLFGTYEVHPDVPLGIRARVAA 281
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNK----------- 221
IYEPPQ+ T + + +L DE E + VD +A LG+++VG+IFT + +N
Sbjct: 282 IYEPPQESTRDSILLLEDEHEAE-VDEMARLLGLRRVGWIFTDLLAENAAAGTVKHVRGI 340
Query: 222 KDYTLSNREVLQAV----EFHAEC---NMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
K + L+ +E + A ++ C N + + V + V+ + +H E + ++ C
Sbjct: 341 KTHFLTAQECILAGHLQNKYPNRCKYANGGYFGSKFVTVCVTGDDKKQVHMEGYAVTAQC 400
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KEVDNDFF 317
+ L ++ I D P+L +++ DV KDV + + ++
Sbjct: 401 MALVRDNCL---IPTKDAPELGYIRESSDKQYVPDVYYKEKDVYGNEVQRLGRPLPVEYL 457
Query: 318 LVVVKILDHQGPL---------SSTFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRIS 365
LV V PL S FP+ENR ++ AL +L +S S+ ++ +S
Sbjct: 458 LVDVPASTPMVPLFTFHERKDVSQYFPVENRMIDGHIQDFAALSDYLAKSRSMDFLECMS 517
Query: 366 DFHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
DFHLL +L R L + S + L + V+ +
Sbjct: 518 DFHLLFYLYRMDMLPMKSQMGPLLEAVRTK 547
>gi|195036638|ref|XP_001989777.1| GH18610 [Drosophila grimshawi]
gi|193893973|gb|EDV92839.1| GH18610 [Drosophila grimshawi]
Length = 655
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 147/329 (44%), Gaps = 60/329 (18%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNF 172
+ RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 253 LNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKVAA 310
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT------- 225
IYEPPQ+ + + + I DE ++ VDA+A LG+KK+G+IFT I ++ T
Sbjct: 311 IYEPPQESSRDSINICPDEGADE-VDAVAKALGLKKIGWIFTDLITEDASAGTVKQIRGI 369
Query: 226 ----LSNREVLQAVEF-HAECNMEEWVT------AVVKLEVSEEGSAAIHFEAFQMSDMC 274
L+ +E + A E + N ++ T V + V+ + + +H E + +S C
Sbjct: 370 ETHFLTAQECISAGELQNRHPNPCKYATNGTFGSKFVTICVTGDQTKQVHMEGYAVSAQC 429
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDNDFF 317
+ L ++ I D P+L +++ DV KD + + ++
Sbjct: 430 MALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARPLPVEYL 486
Query: 318 LVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISD 366
LV V P + FPIENR ++ AL S+L+ ++ ISD
Sbjct: 487 LVDVPASTPLQPQYTFTEYDKRQAFPIENRYIDGQLQDFNALSSYLSAWADDEFLEAISD 546
Query: 367 FHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
FHLL++L + L L + L + V+A+
Sbjct: 547 FHLLVYLYKMDMLPLRQHMGPLLEAVRAR 575
>gi|48133028|ref|XP_393332.1| PREDICTED: nuclear protein localization protein 4 homolog [Apis
mellifera]
Length = 646
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/310 (26%), Positives = 142/310 (45%), Gaps = 58/310 (18%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
+ + RQ H ++V F+ + ++++N + +R GF+YG V R V
Sbjct: 230 ITLNRQTYRHVDNVMFEN--SSLVERFLNYWRSTGHQRIGFLYGRYEIHTDVPLGIRAVV 287
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
IYEPPQ+ T+ + +L DE+E +VD +A L +KK+G+IFT I ++
Sbjct: 288 TAIYEPPQESTKNSIRLLPDEKE-TIVDELAHLLNLKKIGWIFTDLIADDIKIGTVKHVR 346
Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
N + + LS +E + A F + C + + V + V+ + IH E +Q+S+
Sbjct: 347 NIESHFLSAQECIMAGYFQNKYPNPCRFSPNNYFGSKFVTVCVTGDEKNQIHMEGYQVSN 406
Query: 273 MCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKD---------VKEVDND 315
C L ++G + D P+L + DV KD + + +
Sbjct: 407 QCTALVRDGCL---VPTKDAPELGYVIESTDKQYVPDVFYKEKDSYGNEVSRLARPLPVE 463
Query: 316 FFLVVVKILDHQGP-----LSST---FPIENRTTQVTMR---ALKSHLNRSPSLPLVKRI 364
+ LV V P +S+T FPIENR ++ +L S++ + ++ +
Sbjct: 464 YLLVDVPASTPLTPQFTFHISNTITPFPIENRFIDGQIQEFSSLCSYMQQFTKEQFLEAV 523
Query: 365 SDFHLLLFLA 374
SDFHLL+F+A
Sbjct: 524 SDFHLLIFIA 533
>gi|114809978|gb|ABI81494.1| nuclear protein localization 4 [Squalus mitsukurii]
Length = 355
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 118/252 (46%), Gaps = 50/252 (19%)
Query: 167 RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI--------- 217
R EV IYEPPQ GT L L D + E +VD IAA LG++KVG+IFT +
Sbjct: 14 RAEVAAIYEPPQVGTANSLEFLEDPKAE-VVDEIAAKLGLRKVGWIFTDLLSEDTRKGTV 72
Query: 218 --MQNKKDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAF 268
++NK + LS E + A +F + C + + + V + + +HFE +
Sbjct: 73 KFIRNKDAHFLSAEECITAGDFQNKHPNVCRLSPVNHFGSKFVTVLATGGPDNQVHFEGY 132
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN- 314
Q+S+ C+ L + F + D P+L +K KD+ G ++ ++
Sbjct: 133 QVSNQCMALVGDDCF---LPCRDAPELGYVKESSSGQCVPDVFYKDIDKFGNEITQLARP 189
Query: 315 ---DFFLVVVKILDHQGPLSS------TFPIENR----TTQVTMRALKSHLNRSPSLPLV 361
++ ++ + + P+ + FP+ NR TQ L S+L+++ S +
Sbjct: 190 LPVEYLIIDIPTTFPKDPVYTFAASPHPFPVVNRDVVGKTQ-DFHTLASYLSQNQSAVFL 248
Query: 362 KRISDFHLLLFL 373
IS+FHLLLFL
Sbjct: 249 DVISNFHLLLFL 260
>gi|195504288|ref|XP_002099014.1| GE10682 [Drosophila yakuba]
gi|194185115|gb|EDW98726.1| GE10682 [Drosophila yakuba]
Length = 624
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 75/352 (21%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 219 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 276
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + I DE + VDA+A+ LG+KK+G+IFT I + D ++ +
Sbjct: 277 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITE---DASIGTVK 332
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 333 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 391
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
+S C+ L ++ I D P+L +++ DV KD +
Sbjct: 392 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 448
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P+ + FPIENR ++ AL +L+
Sbjct: 449 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGDEEF 508
Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
++ ISDFHLL++L + L L + L + V+ AQ V + +KLL
Sbjct: 509 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNQAAQFKVVDVWKLL 560
>gi|62484495|ref|NP_733110.2| Npl4 ortholog, isoform B [Drosophila melanogaster]
gi|281362576|ref|NP_001163731.1| Npl4 ortholog, isoform D [Drosophila melanogaster]
gi|61679394|gb|AAN14058.2| Npl4 ortholog, isoform B [Drosophila melanogaster]
gi|272477171|gb|ACZ95025.1| Npl4 ortholog, isoform D [Drosophila melanogaster]
Length = 624
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 75/352 (21%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 219 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 276
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + I DE + VDA+A+ LG+KK+G+IFT I D ++ +
Sbjct: 277 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITD---DASIGTVK 332
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 333 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 391
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KE 311
+S C+ L ++ I D P+L +++ DV KD+ +
Sbjct: 392 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDLYGNEVQRLARP 448
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P+ + FPIENR ++ AL +L+
Sbjct: 449 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 508
Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
++ ISDFHLL++L + L L + L + V+ AQ V + +KLL
Sbjct: 509 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNQAAQFKVVDVWKLL 560
>gi|194908465|ref|XP_001981774.1| GG12234 [Drosophila erecta]
gi|190656412|gb|EDV53644.1| GG12234 [Drosophila erecta]
Length = 624
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 75/352 (21%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 219 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 276
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + I DE + VDA+A+ LG+KK+G+IFT I + D ++ +
Sbjct: 277 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITE---DASVGTVK 332
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 333 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 391
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
+S C+ L ++ I D P+L +++ DV KD +
Sbjct: 392 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 448
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P+ + FPIENR ++ AL +L+
Sbjct: 449 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 508
Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
++ ISDFHLL++L + L L + L + V+ AQ V + +KLL
Sbjct: 509 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNEAAQFKVVDVWKLL 560
>gi|379699068|gb|AFD10756.1| FI19201p1 [Drosophila melanogaster]
Length = 628
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 75/352 (21%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 223 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 280
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + I DE + VDA+A+ LG+KK+G+IFT I D ++ +
Sbjct: 281 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITD---DASIGTVK 336
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 337 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 395
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KE 311
+S C+ L ++ I D P+L +++ DV KD+ +
Sbjct: 396 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDLYGNEVQRLARP 452
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P+ + FPIENR ++ AL +L+
Sbjct: 453 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 512
Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
++ ISDFHLL++L + L L + L + V+ AQ V + +KLL
Sbjct: 513 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNQAAQFKVVDVWKLL 564
>gi|291226637|ref|XP_002733298.1| PREDICTED: nuclear protein localization protein 4 homolog
[Saccoglossus kowalevskii]
Length = 637
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 136/309 (44%), Gaps = 61/309 (19%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYG------TVLEDKRVEVNF 172
+ RQ H ++V F+ F ++++ +R G +YG +V + V+
Sbjct: 232 LNRQHYRHVDNVLFEN--PRLFDRFIDYWRKSGNQRAGLLYGRYEYHESVPLGIKASVSA 289
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
IYEPPQ T + L D+ +VD IA LG++KVG+IFT + ++N
Sbjct: 290 IYEPPQDCTPNSIEFL-DDPHGSVVDEIAGKLGLRKVGWIFTDLVAEDLRKGTVKHVRNM 348
Query: 222 KDYTLSNREVLQAVEFHAE----CNME---EWVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
+ LS E + A EF + C + + + V + V+ + + IH EA+Q+S+ C
Sbjct: 349 DAHFLSAEECIMAGEFQNQNPNKCKLATDGNFGSKFVTVIVTGDSTNQIHTEAYQVSNQC 408
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDVKEVDNDFFL------VV 320
+ L + + I D P+L +K+ DV KD +N+ L V
Sbjct: 409 MALVND---DCLIPTKDAPELGYIKESSNEQYVPDVFYKMKD--SYNNEVALLARPLPVE 463
Query: 321 VKILDHQG--PLSS----------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRIS 365
I+D PL FPIENR+ ++ L +L P+ L+ +
Sbjct: 464 YLIIDVPAGFPLEPKYTFLDENERKFPIENRSAIAEIQNFETLSQYLQYYPAAKLLDAMC 523
Query: 366 DFHLLLFLA 374
D H+LL+LA
Sbjct: 524 DLHVLLYLA 532
>gi|62484318|ref|NP_651407.3| Npl4 ortholog, isoform A [Drosophila melanogaster]
gi|74868217|sp|Q9VBP9.3|NPL4_DROME RecName: Full=Nuclear protein localization protein 4 homolog
gi|16182532|gb|AAL13515.1| GH03617p [Drosophila melanogaster]
gi|61679395|gb|AAF56480.3| Npl4 ortholog, isoform A [Drosophila melanogaster]
gi|220951542|gb|ACL88314.1| CG4673-PA [synthetic construct]
Length = 652
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 154/352 (43%), Gaps = 75/352 (21%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 247 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 304
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + I DE + VDA+A+ LG+KK+G+IFT I D ++ +
Sbjct: 305 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITD---DASIGTVK 360
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 361 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 419
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KE 311
+S C+ L ++ I D P+L +++ DV KD+ +
Sbjct: 420 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDLYGNEVQRLARP 476
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P+ + FPIENR ++ AL +L+
Sbjct: 477 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 536
Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
++ ISDFHLL++L + L L + L + V+ AQ V + +KLL
Sbjct: 537 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNQAAQFKVVDVWKLL 588
>gi|125602545|gb|EAZ41870.1| hypothetical protein OsJ_26416 [Oryza sativa Japonica Group]
Length = 202
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 58/94 (61%), Gaps = 2/94 (2%)
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTF 334
++LFK+G +T+I + DDP+LSKM+K+VV GGKD EVDNDFFLV VKI DHQ L ++
Sbjct: 101 LQLFKDGVLQTKIRDKDDPRLSKMRKEVVAGGKDAMEVDNDFFLVPVKISDHQ--LQTSS 158
Query: 335 PIENRTTQVTMRALKSHLNRSPSLPLVKRISDFH 368
+ + R+ H + + + R D H
Sbjct: 159 QNKKKDKAAVFRSCPFHFYNAKNTAEIFRAHDEH 192
>gi|320163920|gb|EFW40819.1| nuclear protein localization protein [Capsaspora owczarzaki ATCC
30864]
Length = 526
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/331 (26%), Positives = 146/331 (44%), Gaps = 66/331 (19%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVN-ETLAFAVKRGGFM------YGTVLEDKRVEVN 171
+ RQ H + V F++ A Q+++ + V+R G++ Y V + V
Sbjct: 135 LARQAYRHVDYVEFEQ--ASILNQFIDFWRQSGGVQRLGYLIGRYEPYPEVPLGIKAVVA 192
Query: 172 FIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIF-------TQTIM----QN 220
IYEPPQ G + L +L D EE ++ +A+ LG+++VG+IF T+T M ++
Sbjct: 193 AIYEPPQTGAVDSLELLDDPNEEAVL-GVASALGLQRVGWIFTDLEDDGTRTGMVQCKRH 251
Query: 221 KKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDM 273
K + LS E++ A + A + E+ + V + +S S + EA+Q+S+
Sbjct: 252 KDTFFLSASEIMMAADLQARHPNPTTAASTGEFGSKFVTVVISGNASREVSLEAYQVSNQ 311
Query: 274 CVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEV--------------------D 313
C L + E + DP S M + + EV
Sbjct: 312 CATLVQN---EFVVPSTRDP--SAMTVRETTDAQYIPEVFYRYKNEYGTEVVSTARPSFP 366
Query: 314 NDFFLVVVKILDHQGP-------LSSTFPIENRTT---QVTMRALKSHLNRSPS-LPLVK 362
D+ +V + Q P +S+ F I NR+ + ALKS+L PS +
Sbjct: 367 VDYLIVNLGNGSPQSPNPTFGLGISTPFTIGNRSVLGQNADVVALKSYLAAQPSNAAFLA 426
Query: 363 RISDFHLLLFLA--RFLDLNSDVPALAQCVQ 391
+SDFHLL+FLA L L++ +P L Q ++
Sbjct: 427 LVSDFHLLVFLATSSVLMLSNSLPELLQAIK 457
>gi|195573927|ref|XP_002104943.1| GD18187 [Drosophila simulans]
gi|194200870|gb|EDX14446.1| GD18187 [Drosophila simulans]
Length = 641
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 139/315 (44%), Gaps = 66/315 (20%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 247 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 304
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + I DE + VDA+A+ LG+KK+G+IFT I + D ++ +
Sbjct: 305 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITE---DASIGTVK 360
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 361 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 419
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
+S C+ L ++ I D P+L +++ DV KD +
Sbjct: 420 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 476
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P+ + FPIENR ++ AL +L+
Sbjct: 477 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 536
Query: 361 VKRISDFHLLLFLAR 375
++ ISDFHLL++L +
Sbjct: 537 LEAISDFHLLVYLYK 551
>gi|195349473|ref|XP_002041269.1| GM10234 [Drosophila sechellia]
gi|194122964|gb|EDW45007.1| GM10234 [Drosophila sechellia]
Length = 725
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 139/315 (44%), Gaps = 66/315 (20%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 247 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 304
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + I DE + VDA+A+ LG+KK+G+IFT I + D ++ +
Sbjct: 305 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITE---DASIGTVK 360
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 361 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 419
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
+S C+ L ++ I D P+L +++ DV KD +
Sbjct: 420 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 476
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P+ + FPIENR ++ AL +L+
Sbjct: 477 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 536
Query: 361 VKRISDFHLLLFLAR 375
++ ISDFHLL++L +
Sbjct: 537 LEAISDFHLLVYLYK 551
>gi|261338793|gb|ACX70078.1| UT01770p [Drosophila melanogaster]
Length = 628
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 153/352 (43%), Gaps = 75/352 (21%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 223 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 280
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + I DE + VDA+A+ LG+KK+G+IFT I D ++ +
Sbjct: 281 AAIYEPPQESTRDSINIQPDEFADD-VDAVASALGLKKIGWIFTDLITD---DASIGTVK 336
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 337 QIRGIESHF-ITAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 395
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
+S C+ L ++ I D P+L +++ DV KD +
Sbjct: 396 AVSAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDRYGNEVQRLARP 452
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P+ + FPIENR ++ AL +L+
Sbjct: 453 LPVEYLLVDVPASTPLQPIYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEF 512
Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQ-------AQTAVPEGYKLL 403
++ ISDFHLL++L + L L + L + V+ AQ V + +KLL
Sbjct: 513 LEAISDFHLLVYLYKMDMLPLRQHMGPLLEAVRTKNPNQAAQFKVVDVWKLL 564
>gi|346469505|gb|AEO34597.1| hypothetical protein [Amblyomma maculatum]
Length = 611
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 139/314 (44%), Gaps = 63/314 (20%)
Query: 117 MRVTRQENPHCESVSFDR-DCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ H +SV F+ + F + ET+ +R GF+YG K R
Sbjct: 224 LTLNRQVYRHVDSVMFENPSIVEHFLNFWRETMH---QRIGFLYGRYEHHKDVPLGIRAT 280
Query: 170 VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------M 218
V IYEPPQ T + + +L DE++ LVD +A+ LG+ +VG+IFT + +
Sbjct: 281 VAAIYEPPQDSTPDSIKLLPDEKQ-PLVDELASRLGLVRVGWIFTDLVPEDLQKGTVKHL 339
Query: 219 QNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMS 271
+N + LS +E + A F C + + + V + V+ + +H E +Q+S
Sbjct: 340 RNIESCFLSAQECIMAGHFQNNHPNPCKLSPDGYFGSKFVTVCVTGDLENRVHMEGYQVS 399
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDN 314
+ C+ L ++ + D P+L +++ DV D + +
Sbjct: 400 NQCMALVRDNCL---VPTRDAPELGYVRESSNAQYVPDVFYKYTDSYGNEVTQLARPLPI 456
Query: 315 DFFLVVVKILDHQGPL-----SSTFPIENRTTQ--------VTMRALK-SHLNRSPSLPL 360
++ LV V + + PL FP+ENR + VT R + + N +
Sbjct: 457 EYLLVDVPVSTPKEPLFTFSAKHCFPVENRLLEGHIQDLGAVTARLSRFTAANNGGQNAV 516
Query: 361 VKRISDFHLLLFLA 374
++ SDFHLLL+LA
Sbjct: 517 LEAFSDFHLLLYLA 530
>gi|156353418|ref|XP_001623063.1| predicted protein [Nematostella vectensis]
gi|156209717|gb|EDO30963.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 81/311 (26%), Positives = 141/311 (45%), Gaps = 59/311 (18%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ+ H +++ F+ Q +++ +R GF+YG E K R V
Sbjct: 137 ITLKRQDYRHVDNILFEN--PSIVQPFLDYWRRTGNQRLGFLYGRYEEHKDVPLGIRAVV 194
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT----- 225
IYEPPQ+GT + L +L + + V+ +AA LG++ VG+IF+ + + K T
Sbjct: 195 AAIYEPPQEGTPDSLELLPNTNLDT-VNKLAADLGLRIVGWIFSDLVPDDVKKGTVKHLR 253
Query: 226 ------LSNREVLQAVEFH----AECNME---EWVTAVVKLEVSEEGSAAIHFEAFQMSD 272
L+ E + A +F + C ++ + V + VS I F+ FQ+S+
Sbjct: 254 DINTHFLTAEECIMAGDFQNQYVSPCRYASSGKFGSKFVTVVVSGNDENQIGFDGFQVSN 313
Query: 273 MCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDV----KEVDND 315
C+ L ++ + + DDP L ++ KD G +V + + +
Sbjct: 314 QCMALVRD---DCLVPTLDDPGLGYIRESSSEQYVPDVFYKDTDSYGNEVTLLARPMPIE 370
Query: 316 FFLVVVK-------ILDHQGPLS--STFPIENRTTQVTMR---ALKSHLNRSPSLPLVKR 363
+ LV V + P S + FPIENR ++ A++ HL++ P +
Sbjct: 371 YLLVQVSAAFPVQPVYTFSPPPSNGAVFPIENRENIGELQDFDAVRRHLSQYPPGDFLSA 430
Query: 364 ISDFHLLLFLA 374
+S+FHLL+FLA
Sbjct: 431 MSNFHLLVFLA 441
>gi|380028897|ref|XP_003698120.1| PREDICTED: nuclear protein localization protein 4 homolog [Apis
florea]
Length = 646
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 80/313 (25%), Positives = 142/313 (45%), Gaps = 64/313 (20%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
+ + RQ H ++V F+ + ++++N + +R GF+YG V R V
Sbjct: 230 ITLNRQTYRHVDNVMFEN--SSLVERFLNYWRSTGHQRIGFLYGRYEIHTDVPLGIRAVV 287
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
IYEPPQ+ T+ + +L DE+E +VD +A L +K++G+IFT I ++
Sbjct: 288 TAIYEPPQESTKNSIRLLPDEKE-TIVDELAHLLNLKRIGWIFTDLIADDIKIGTVKHVR 346
Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
N + + LS +E + A F + C + + V + V+ + IH E +Q+S+
Sbjct: 347 NIESHFLSAQECIMAGYFQNKYPNLCRFSPNNYFGSKFVTVCVTGDEKNQIHMEGYQVSN 406
Query: 273 MCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKD---------VKEVDND 315
C+ L ++G + D P+L + DV KD + + +
Sbjct: 407 QCMALVRDGCL---VPTKDAPELGYVIESTDKQYVPDVFYKEKDSYGNEVSRLARPLPVE 463
Query: 316 FFLVVVKILDHQGPLS-----------STFPIENRTTQVTMR---ALKSHLNRSPSLPLV 361
+ LV V PL+ + FPIENR ++ +L S++ + +
Sbjct: 464 YLLVDVPA---STPLTPQFTFYINNTITPFPIENRFIDGQIQEFSSLCSYMQQFTKEQFL 520
Query: 362 KRISDFHLLLFLA 374
+ +SDFHLL+F+A
Sbjct: 521 EAVSDFHLLIFIA 533
>gi|388329682|gb|AFK29236.1| CG4673-PB-like protein [Drosophila buzzatii]
Length = 626
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 78/316 (24%), Positives = 139/316 (43%), Gaps = 66/316 (20%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 222 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 279
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ + + + I DE ++ V+A+A LG+KK+G+IFT I + D T +
Sbjct: 280 AAIYEPPQESSRDSINICSDEGADE-VEAVAKALGLKKIGWIFTDLITE---DATAGTVK 335
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 336 QIRGIETHF-LTAQECITAGELQNRHPNPCKYAKNGTFGSKFVTICVTGDQTKQVHMEGY 394
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
+S C+ L ++ + I D P+L +++ DV KD +
Sbjct: 395 AVSAQCMALVRD---DCLIPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 451
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P + FPIENR ++ AL S+L+
Sbjct: 452 LPVEYLLVDVPASTPLQPQYTFTEYDKRQPFPIENRYIDGHLQDFNALSSYLSAWAEEDF 511
Query: 361 VKRISDFHLLLFLARF 376
++ ISDFHLL++L +
Sbjct: 512 LEAISDFHLLIYLYKM 527
>gi|388329683|gb|AFK29237.1| CG4673-PA-like protein [Drosophila buzzatii]
Length = 655
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 139/315 (44%), Gaps = 66/315 (20%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 251 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRAKV 308
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ + + + I DE ++ V+A+A LG+KK+G+IFT I + D T +
Sbjct: 309 AAIYEPPQESSRDSINICSDEGADE-VEAVAKALGLKKIGWIFTDLITE---DATAGTVK 364
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 365 QIRGIETHF-LTAQECITAGELQNRHPNPCKYAKNGTFGSKFVTICVTGDQTKQVHMEGY 423
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
+S C+ L ++ + I D P+L +++ DV KD +
Sbjct: 424 AVSAQCMALVRD---DCLIPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 480
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P + FPIENR ++ AL S+L+
Sbjct: 481 LPVEYLLVDVPASTPLQPQYTFTEYDKRQPFPIENRYIDGHLQDFNALSSYLSAWAEEDF 540
Query: 361 VKRISDFHLLLFLAR 375
++ ISDFHLL++L +
Sbjct: 541 LEAISDFHLLIYLYK 555
>gi|347966940|ref|XP_550827.4| AGAP001986-PA [Anopheles gambiae str. PEST]
gi|333469830|gb|EAL38499.4| AGAP001986-PA [Anopheles gambiae str. PEST]
Length = 651
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 140/314 (44%), Gaps = 59/314 (18%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
+ RQ H ++V F+ ++++N +R GF++G V R V
Sbjct: 253 LNRQPYRHVDNVMFEN--TGIVERFLNYWRTTGHQRIGFLFGKYEIHPDVPLGIRARVAA 310
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
IYEPPQ+ + + +L D+ + VD +A LG+++VG+IFT + ++N
Sbjct: 311 IYEPPQESNRDSIRLL-DDPHDADVDELARQLGLQRVGWIFTDLLSENLASGTVKHVRNI 369
Query: 222 KDYTLSNREVLQAV----EFHAECNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
K + L+ +E + A ++ C + + V + V+ + +H E + +S C
Sbjct: 370 KTHFLTAQECILAGHLQNKYPNRCKDSSNGYFGSKFVTVCVTGDEKKQVHMEGYAVSAQC 429
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KEVDNDFF 317
+ L ++ I D P+L +++ DV KDV + + ++
Sbjct: 430 MALVRDNCL---IPTKDAPELGYIRESSDKQYVPDVYYKEKDVYGNEVQRLGRPLPVEYL 486
Query: 318 LVVVKILDHQGPLSS---------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRIS 365
LV V PL + FP+ENR ++ AL ++L +S SLP + +S
Sbjct: 487 LVDVPASTPLVPLYTFLERKDAKQYFPVENRMIDGHIQDFAALANYLAKSRSLPFLDAVS 546
Query: 366 DFHLLLFLARFLDL 379
DFHLL +L R D+
Sbjct: 547 DFHLLFYLYRMEDM 560
>gi|383861559|ref|XP_003706253.1| PREDICTED: nuclear protein localization protein 4 homolog
[Megachile rotundata]
Length = 687
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 81/313 (25%), Positives = 140/313 (44%), Gaps = 64/313 (20%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
+ + RQ H ++V F+ ++++N +R G++YG V R V
Sbjct: 234 ITLNRQTYRHVDNVMFEN--PSLVERFLNYWRNTGHQRIGYLYGRYEIHTDVPLGIRAVV 291
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
IYEPPQ+ T++ + +L DE + LVD +A L ++KVG+IFT I ++
Sbjct: 292 AAIYEPPQESTKDTIRLLPDERK-ALVDELARLLNLRKVGWIFTDLIADDVKKGTVKHVR 350
Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
N + + LS +E + A F E C + + V + V+ + +H E +Q+S+
Sbjct: 351 NIESHFLSAQECIMAGYFQNEYPNPCRFSPSNYFGSKFVTVCVTGDDKNQVHMEGYQVSN 410
Query: 273 MCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKD---------VKEVDND 315
C+ L ++G + D P+L + DV KD + + +
Sbjct: 411 QCMALVRDGCL---VPTKDAPELGYVIESTDKQYVPDVFYKEKDSYGNEVPRLARPLPVE 467
Query: 316 FFLVVVKILDHQGPLS-----------STFPIENRTTQVTMR---ALKSHLNRSPSLPLV 361
+ LV V PL+ + FPIENR ++ +L S++ + +
Sbjct: 468 YLLVDVPA---STPLTPQFTFYVSDNITPFPIENRLVDGQVQEFSSLCSYMQQFNQDEFL 524
Query: 362 KRISDFHLLLFLA 374
+ +SDFHLLLF+A
Sbjct: 525 EAVSDFHLLLFIA 537
>gi|42761319|dbj|BAD11562.1| hypothetical protein [Oryza sativa Japonica Group]
gi|45735811|dbj|BAD12847.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 178
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/53 (66%), Positives = 44/53 (83%)
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
++LFK+G +T+I + DDP+LSKM+K+VV GGKD EVDNDFFLV VKI DHQ
Sbjct: 124 LQLFKDGVLQTKIRDKDDPRLSKMRKEVVAGGKDAMEVDNDFFLVPVKISDHQ 176
>gi|195453878|ref|XP_002073983.1| GK14396 [Drosophila willistoni]
gi|194170068|gb|EDW84969.1| GK14396 [Drosophila willistoni]
Length = 660
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 83/334 (24%), Positives = 146/334 (43%), Gaps = 66/334 (19%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT + R V
Sbjct: 253 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHTDVPLGIRARV 310
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ + + L IL DE V+A+AA LG+KK+G+IFT I D T+ +
Sbjct: 311 AAIYEPPQESSRDSLNILPDEGSAD-VEAVAAALGLKKIGWIFTDLIT---ADATVGTVK 366
Query: 231 VLQAVEFH----AEC-----------NMEEWVT------AVVKLEVSEEGSAAIHFEAFQ 269
++ ++ H EC N ++ T V + V+ + + +H E +
Sbjct: 367 QIRGIDTHFLTAQECITAGELQNRHPNPCKYATNGVFGSKFVTICVTGDQTKQVHMEGYA 426
Query: 270 MSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV---------KEV 312
+S C+ L ++ + D P+L +++ DV KD + +
Sbjct: 427 VSAQCMALVRDNCL---LPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLGRPL 483
Query: 313 DNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPLV 361
++ LV V P + FP+ENR ++ AL ++L+ +
Sbjct: 484 PVEYLLVDVPASTPLQPQYTFTKYDQRQPFPVENRYIDGHLQDFNALSNYLSAWGDEEFL 543
Query: 362 KRISDFHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
+ ISDFHLL+FL + L L + L + V+ +
Sbjct: 544 EAISDFHLLIFLYKMDMLPLRQHMAPLMEAVRTK 577
>gi|156349269|ref|XP_001621988.1| hypothetical protein NEMVEDRAFT_v1g176363 [Nematostella vectensis]
gi|156208367|gb|EDO29888.1| predicted protein [Nematostella vectensis]
Length = 614
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 144/315 (45%), Gaps = 66/315 (20%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ+ H +++ F+ Q +++ +R GF+YG E K R V
Sbjct: 209 ITLKRQDYRHVDNILFEN--PSIVQPFLDYWRRTGNQRLGFLYGRYEEHKDVPLGIRAVV 266
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT----- 225
IYEPPQ+GT + L +L + + V+ +AA LG++ VG+IF+ + + K T
Sbjct: 267 AAIYEPPQEGTPDSLELLPNPNLDT-VNKLAADLGLRIVGWIFSDLVPDDVKKGTVKHLR 325
Query: 226 ------LSNREVLQAVEFH----AECNM-------EEWVTAVVKLEVSEEGSAAIHFEAF 268
L+ E + A +F + C ++VT VV E ++E I F+ F
Sbjct: 326 DINTHFLTAEECIMAGDFQNQYVSPCRYASSGKFGSKFVTVVVS-EGNDENQ--IGFDGF 382
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDV----KE 311
Q+S+ C+ L ++ + + DDP L ++ KD G +V +
Sbjct: 383 QVSNQCMALVRD---DCLVPTLDDPGLGYIRESSSEQYVPDVFYKDTDSYGNEVTLLARP 439
Query: 312 VDNDFFLVVVK-------ILDHQGPLS--STFPIENRTTQVTMR---ALKSHLNRSPSLP 359
+ ++ LV V + P S + FPIENR ++ A++ HL++ P
Sbjct: 440 MPIEYLLVQVSAAFPVQPVYTFSPPPSNGAVFPIENRENIGELQDFDAVRRHLSQYPPGN 499
Query: 360 LVKRISDFHLLLFLA 374
+ +S+FHLL+FLA
Sbjct: 500 FLSAMSNFHLLVFLA 514
>gi|57899187|dbj|BAD87239.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 196
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/53 (66%), Positives = 44/53 (83%)
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQ 327
++LFK+G +T+I + DDP+LSKM+K+VV GGKD EVDNDFFLV VKI DHQ
Sbjct: 100 LQLFKDGVLQTKIRDKDDPRLSKMRKEVVAGGKDTMEVDNDFFLVPVKISDHQ 152
>gi|195112190|ref|XP_002000657.1| GI10357 [Drosophila mojavensis]
gi|193917251|gb|EDW16118.1| GI10357 [Drosophila mojavensis]
Length = 653
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/316 (25%), Positives = 138/316 (43%), Gaps = 72/316 (22%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNF 172
+ RQ H ++V F+ ++++N +R G++YGT + R V
Sbjct: 252 LNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGWLYGTYEQHTDVPLGIRARVAA 309
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL 232
IYEPPQ+ + + + I DE ++ VDA+A LG+KK+G+IFT I ++ T+ +
Sbjct: 310 IYEPPQESSRDSINICPDENADE-VDAVAKALGLKKIGWIFTDLITEDASAGTVKQ---I 365
Query: 233 QAVEFH----AEC-----------NMEEWVT------AVVKLEVSEEGSAAIHFEAFQMS 271
+ +E H EC N ++ T V + V+ + + +H E + +S
Sbjct: 366 RGIETHFLTAQECITAGELQNRHPNPCKYATNGTFGSKFVTICVTGDQTKQVHMEGYAVS 425
Query: 272 DMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDN 314
C+ L ++ I D P+L +++ DV KD + +
Sbjct: 426 AQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARPLPV 482
Query: 315 DFFLVVVKI------------LDHQGPLSSTFPIENRTTQVTMR---ALKSHLNRSPSLP 359
++ LV V D + P FPIENR ++ AL ++L+
Sbjct: 483 EYLLVDVPASTPLQPQFTFTEYDKRQP----FPIENRYIDGHLQDFNALSNYLSAWGEED 538
Query: 360 LVKRISDFHLLLFLAR 375
++ ISDFHLL++L +
Sbjct: 539 FLEAISDFHLLVYLYK 554
>gi|218200668|gb|EEC83095.1| hypothetical protein OsI_28230 [Oryza sativa Indica Group]
Length = 163
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/92 (44%), Positives = 56/92 (60%), Gaps = 2/92 (2%)
Query: 277 LFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPI 336
LFK+G +T+I + DDP+LSKM+K+VV GGKD EVDNDFFLV VKI DHQ L ++
Sbjct: 64 LFKDGVLQTKIRDKDDPRLSKMRKEVVAGGKDAMEVDNDFFLVPVKISDHQ--LQTSSQN 121
Query: 337 ENRTTQVTMRALKSHLNRSPSLPLVKRISDFH 368
+ + R+ H + + + R D H
Sbjct: 122 KKKDKAAVFRSCPFHFYNAKNTAEIFRAHDEH 153
>gi|361131707|gb|EHL03359.1| putative Nuclear protein localization protein 4 [Glarea lozoyensis
74030]
Length = 477
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 156/379 (41%), Gaps = 73/379 (19%)
Query: 98 NPAGSFGRKMTMDDLIAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG-- 155
NP + D L K R+ R+ +P V F ADA + + L F K G
Sbjct: 57 NPWEVVKQHPVDDRLDKKDGRIPRKRDPKMYHVEF----ADA--SLIEKLLDFWRKSGAQ 110
Query: 156 --GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLGM 206
G++YG E V V IYEPPQ +E+ + L + E EK VD +A G+
Sbjct: 111 RLGYLYGRYTEYTEVPLGIKAVVEAIYEPPQ--VDEIDGVTLNEWENEKDVDEVARLCGL 168
Query: 207 KKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWVT 248
++VG I+T + D Y LS++E++ A A+ + ++ +
Sbjct: 169 QRVGVIWTDLLDSGVGDGSVVCKRHIDSYFLSSQEIMFAARQQAQHPKPTKWSDTGKFGS 228
Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKMK 299
A V +S + S I A+Q S+ V + + E + E DD +S+ +
Sbjct: 229 AFVTCVISGDESGQISISAYQASNSAVEMVRAELVEPSVDPSVMLVRNEEEDDGSISRTR 288
Query: 300 -------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST---------FPIENRTT 341
+ + G V+E F V ++ L H P + FPIENR
Sbjct: 289 YIPEVFYRKINEYGAQVQENAKPSFPVEYLLITLTHGFPSNPMPQFIADPPGFPIENRAV 348
Query: 342 QVTMRALKS-----HLNRSPSL----PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQA 392
+ LK L+R+ L +K ISDFHLL FL + L D AL V
Sbjct: 349 IGVDQDLKDLSKRLQLDRNGELTNQDAAIKSISDFHLLCFLHGYGVLGKDEEALLCKVAT 408
Query: 393 QTAVPEGYKLLINSMAETV 411
Q + +G++LL S T+
Sbjct: 409 QHDLADGFQLLSTSGWATL 427
>gi|326434890|gb|EGD80460.1| hypothetical protein PTSG_11104 [Salpingoeca sp. ATCC 50818]
Length = 550
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 139/330 (42%), Gaps = 64/330 (19%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNF 172
+ RQE H + V F+ + ++++ +R GFMYG+ E +V V
Sbjct: 202 LNRQEYRHVDYVEFESH--EILDKFLDAWRRSGAQRAGFMYGSYEEYDQVPLGVKAVVKA 259
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFT-------------QTIMQ 219
IYEPPQ + L I + + + VD +A LG+K+VG+IFT + I
Sbjct: 260 IYEPPQSNSALGLEIALESPQIQQVDQMAKQLGLKRVGWIFTDLEPDANGLAKYKRYIQP 319
Query: 220 NKKD-YTLSNREVLQAVEF---HAECNMEEWVTAVVK------LEVSEEGSAAIHFEAFQ 269
+D Y +S E+L A F H + T V + VS + I A+Q
Sbjct: 320 GDEDSYVISAEEILNAAMFQNQHPNPVPPRFSTTGVHGSKFTTVVVSGDSEHQIVPRAYQ 379
Query: 270 MSDMCVRLFKEGWFETEIAEGDDPKLSKMK--------KDVVVGGKD-----VKEVDNDF 316
S + L ++ D P L ++ +V+ +D +K +
Sbjct: 380 ASTQAMNLVRDDILRP----SDSPALVAVRPADDTTFVPEVLFTQRDKYNNEIKHKADPL 435
Query: 317 FLVVVKILDHQG---------PLSSTFPIENRT---TQVTMRALKSHLNRSPSLPLVKRI 364
F V +D S +FPIENR ++ ALK HL + S ++++
Sbjct: 436 FPVAYLAVDLPAGGRMDATSTTFSVSFPIENREALGAPASLSALKQHLQDT-SKSFLQKV 494
Query: 365 SDFHLLLFLAR---FLDLNSDVPALAQCVQ 391
SDFHLL++L F+ L +P+L + V+
Sbjct: 495 SDFHLLVYLMTDDMFVSLKESLPSLLRAVR 524
>gi|321464097|gb|EFX75107.1| hypothetical protein DAPPUDRAFT_306878 [Daphnia pulex]
Length = 620
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 140/313 (44%), Gaps = 61/313 (19%)
Query: 115 KQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF-- 172
K + + Q H ++V F+ + +Q+++ +R GF+YG D ++V
Sbjct: 225 KAITLNSQAYRHVDNVVFEN--PNLVEQFLHYWRVTGRQRIGFLYGRF--DIHLDVPLGI 280
Query: 173 ------IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI--------- 217
IYEPPQ+ T + + +L + + +VD +AA LG++ +G+IFT +
Sbjct: 281 KGIVAAIYEPPQESTRDSVRLLLPDNRQSVVDRVAAKLGLQCIGWIFTDLVAEDVSKGTV 340
Query: 218 --MQNKKDYTLSNREVLQAVEFHA----ECNMEE---WVTAVVKLEVSEEGSAAIHFEAF 268
+N + + LS +E + A F + C + + + V + V+ + +H E +
Sbjct: 341 RHFRNAETHFLSAQECIMAGNFQSLHPNPCRLSSDGYFGSKFVTVCVTGDSQNQVHMEGY 400
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV--KEVDND--- 315
Q+S+ C L ++ E + D P+L+ ++ DV KD EV D
Sbjct: 401 QVSNQCQALVRD---ECLVPTKDAPELAYARQSTAEKYIPDVYYKEKDSYGNEVTRDARP 457
Query: 316 ----FFLVVVKILDHQGPL-------SSTFPIENRTTQVTMR---ALKSHLNRSPSLPLV 361
+ LV V + P+ FPIENR ++ AL S+L ++ +
Sbjct: 458 LPVEYLLVDVPVSTPLEPIFTFSVGSRKPFPIENRLIDGHLQDLSALDSYLKQN-RQNFL 516
Query: 362 KRISDFHLLLFLA 374
+ SDFH L+++A
Sbjct: 517 EAASDFHFLIYVA 529
>gi|225677845|gb|EEH16129.1| nuclear protein localization 4 [Paracoccidioides brasiliensis Pb03]
Length = 667
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 92/320 (28%), Positives = 135/320 (42%), Gaps = 59/320 (18%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L +
Sbjct: 286 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWSN 344
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
EK +D +A G++KVG IFT + D Y LS+ E++ A + A
Sbjct: 345 EKDIDEVAKLCGLEKVGVIFTDLLDSGAGDGTVICKRHIDSYYLSSLEIVFAAQLQARYP 404
Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
+W + V +S + AI A+Q S+ V + K E G
Sbjct: 405 KASKWSETGRFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 464
Query: 291 --DDPKLSKMK---------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------- 330
++P+ K + V G +V+E F V ++ L H P
Sbjct: 465 QENEPENDSSKIRYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPNAVFK 524
Query: 331 SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPAL 386
+STFPIENR + LKS ++ S + P +++ +SDFHLL FL L+ D +L
Sbjct: 525 NSTFPIENRAAIGESQELKSLFDQLVSHNDPHQVLRAVSDFHLLCFLCGLGILSKDEESL 584
Query: 387 AQCVQAQTAVPEGYKLLINS 406
C A+T P L+N+
Sbjct: 585 L-CTVARTHDPADGMQLVNT 603
>gi|226287417|gb|EEH42930.1| endoplasmic reticulum and nuclear membrane proteinc Npl4
[Paracoccidioides brasiliensis Pb18]
Length = 662
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 138/320 (43%), Gaps = 59/320 (18%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L +
Sbjct: 281 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWSN 339
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
EK +D +A G++KVG IFT + D Y LS+ E++ A + A
Sbjct: 340 EKDIDEVAKLCGLEKVGVIFTDLLDSGAGDGTVICKRHIDSYYLSSLEIVFAAQLQARYP 399
Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
+W + V +S + AI A+Q S+ V + K E G
Sbjct: 400 KASKWSETGRFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 459
Query: 291 --DDPK--LSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------- 330
++P+ SK++ + V G +V+E F V ++ L H P
Sbjct: 460 QENEPEDDSSKIRYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPNAVFK 519
Query: 331 SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPAL 386
+STFPIENR + LKS ++ S + P +++ +SDFHLL FL L+ D +L
Sbjct: 520 NSTFPIENRAAIGESQELKSLFDQLVSHNDPHQVLRAVSDFHLLCFLCGLGILSKDEESL 579
Query: 387 AQCVQAQTAVPEGYKLLINS 406
C A+T P L+N+
Sbjct: 580 L-CTVARTHDPADGMQLVNT 598
>gi|198453799|ref|XP_001359342.2| GA18345, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132518|gb|EAL28487.2| GA18345, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 659
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 144/333 (43%), Gaps = 68/333 (20%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
+ RQ H ++V F+ ++++N +R G++YGT V R V
Sbjct: 255 LNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEPHTDVPLGIRATVAA 312
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL 232
IYEPPQ+ T + + + DE + + VDA+A LG+KK+G+IFT I D T + +
Sbjct: 313 IYEPPQESTRDSINMFPDEGDAE-VDAVANALGLKKIGWIFTDLITD---DATAGTVKQI 368
Query: 233 QAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAFQM 270
+ +E H +E +TA V + V+ + + +H E + +
Sbjct: 369 RGIETHF-LTAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGYAV 427
Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVD 313
S C+ L ++ + I + P+L +++ DV KD + +
Sbjct: 428 SAQCMALVRD---DCLIPTKEAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARPLP 484
Query: 314 NDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVK 362
++ LV V P + FPIENR ++ AL +L+ ++
Sbjct: 485 VEYLLVDVPASTPVKPQYTFTNYDKRQAFPIENRYIDGHLQDFSALSGYLSAWGEEEFLE 544
Query: 363 RISDFHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
ISDFHLL++L + L L + +L + V+ +
Sbjct: 545 AISDFHLLVYLYKMDMLPLRQHMASLLEAVRTK 577
>gi|156094424|ref|XP_001613249.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802123|gb|EDL43522.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 531
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 128/285 (44%), Gaps = 46/285 (16%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
F QY + +R G+MYG ED R IYEPPQ + + +L D+
Sbjct: 242 FVQYWYTNGSMCEQRVGWMYGYYREDPHYTLGIRAVCECIYEPPQINEVDKVKLLPDDFL 301
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF----------HAECNM 243
VD IA LG++++G+IFT +K++ L++ EV+Q + ++
Sbjct: 302 PS-VDVIANRLGLERIGWIFTHL---PRKEH-LTSEEVVQIANLQLANLKKNMHYTNYSV 356
Query: 244 EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD-----DPKLSKM 298
++T + + + AF +SDM + L + ++ + +P +++
Sbjct: 357 SNFITCTISPDPMLSNEPVTN--AFMVSDMGMALIRSSLVDSTQTDPSHIQLRNPVRNEL 414
Query: 299 KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSH 351
++ GGK+ + D D+F+V + + P + FP ENR T +K +
Sbjct: 415 LPLILEGGKETNKFDTDWFIVRI---NESAPKVVRSLFKNFHFPRENRLHSPTAYDVKEY 471
Query: 352 LNRSPSLPL----VKRISDFHLLLFLARFLDLNSDVPALAQCVQA 392
SP L + R SDFHL+LF+ R LD+ + ALA C A
Sbjct: 472 F-ASPKLERGTNGMHRCSDFHLILFVCRVLDMET---ALALCDAA 512
>gi|390178809|ref|XP_003736733.1| GA18345, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859598|gb|EIM52806.1| GA18345, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 630
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 82/335 (24%), Positives = 145/335 (43%), Gaps = 68/335 (20%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT V R V
Sbjct: 224 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEPHTDVPLGIRATV 281
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + + DE + + VDA+A LG+KK+G+IFT I D T +
Sbjct: 282 AAIYEPPQESTRDSINMFPDEGDAE-VDAVANALGLKKIGWIFTDLITD---DATAGTVK 337
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 338 QIRGIETHF-LTAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 396
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
+S C+ L ++ + I + P+L +++ DV KD +
Sbjct: 397 AVSAQCMALVRD---DCLIPTKEAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 453
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P + FPIENR ++ AL +L+
Sbjct: 454 LPVEYLLVDVPASTPVKPQYTFTNYDKRQAFPIENRYIDGHLQDFSALSGYLSAWGEEEF 513
Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
++ ISDFHLL++L + L L + +L + V+ +
Sbjct: 514 LEAISDFHLLVYLYKMDMLPLRQHMASLLEAVRTK 548
>gi|390178811|ref|XP_003736734.1| GA18345, isoform C [Drosophila pseudoobscura pseudoobscura]
gi|388859599|gb|EIM52807.1| GA18345, isoform C [Drosophila pseudoobscura pseudoobscura]
Length = 560
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 82/335 (24%), Positives = 145/335 (43%), Gaps = 68/335 (20%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
+ + RQ H ++V F+ ++++N +R G++YGT V R V
Sbjct: 154 ITLNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEPHTDVPLGIRATV 211
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + + DE + + VDA+A LG+KK+G+IFT I D T +
Sbjct: 212 AAIYEPPQESTRDSINMFPDEGDAE-VDAVANALGLKKIGWIFTDLITD---DATAGTVK 267
Query: 231 VLQAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAF 268
++ +E H +E +TA V + V+ + + +H E +
Sbjct: 268 QIRGIETHF-LTAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDKTKQVHMEGY 326
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
+S C+ L ++ + I + P+L +++ DV KD +
Sbjct: 327 AVSAQCMALVRD---DCLIPTKEAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARP 383
Query: 312 VDNDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPL 360
+ ++ LV V P + FPIENR ++ AL +L+
Sbjct: 384 LPVEYLLVDVPASTPVKPQYTFTNYDKRQAFPIENRYIDGHLQDFSALSGYLSAWGEEEF 443
Query: 361 VKRISDFHLLLFLAR--FLDLNSDVPALAQCVQAQ 393
++ ISDFHLL++L + L L + +L + V+ +
Sbjct: 444 LEAISDFHLLVYLYKMDMLPLRQHMASLLEAVRTK 478
>gi|295663611|ref|XP_002792358.1| endoplasmic reticulum and nuclear membrane proteinc Npl4
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226279028|gb|EEH34594.1| endoplasmic reticulum and nuclear membrane proteinc Npl4
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 906
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 94/321 (29%), Positives = 134/321 (41%), Gaps = 61/321 (19%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L +
Sbjct: 525 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWSN 583
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
EK +D +A ++KVG IFT + D Y LS+ E+ A + A
Sbjct: 584 EKDIDEVARLCALEKVGVIFTDLLDSGAGDGTVICKRHIDSYYLSSLEIAFAAQLQARYP 643
Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
+W + V +S + AI A+Q S+ V + K E G
Sbjct: 644 KASKWSETGRFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 703
Query: 291 -----DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------ 330
DD SK++ + V G +V+E F V ++ L H P
Sbjct: 704 QENELDD-DSSKIRYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPNAVF 762
Query: 331 -SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPA 385
+STFPIENR + LKS N+ S + P +++ +SDFHLL FL L+ D +
Sbjct: 763 KNSTFPIENRAAIGESQELKSLFNKLVSHNDPHQVIRAVSDFHLLCFLCGLGILSKDEES 822
Query: 386 LAQCVQAQTAVPEGYKLLINS 406
L C A+T P L+N+
Sbjct: 823 LL-CTVARTHDPADGMQLVNT 842
>gi|345485482|ref|XP_001606190.2| PREDICTED: nuclear protein localization protein 4 homolog [Nasonia
vitripennis]
Length = 652
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/321 (24%), Positives = 137/321 (42%), Gaps = 71/321 (22%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H ++V F+ A ++++N +R G++YG + R V
Sbjct: 237 ITLNRQPYRHVDNVMFEN--AHLVERFLNYWRITGHQRIGYLYGRYEQHSDVPLGIRTVV 294
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ+ T + + +L DE+E LVD +A L +KKVG+IFT I + K T+ +
Sbjct: 295 AAIYEPPQESTRDSIVLLPDEKE-ALVDELAQSLNLKKVGWIFTDLIADDMKKGTVKH-- 351
Query: 231 VLQAVEFH----AECNMEEWV-----------------TAVVKLEVSEEGSAAIHFEAFQ 269
++ +E H EC M + + V + V+ + +H E +Q
Sbjct: 352 -VRNIESHFLSAQECIMAGHIQNKLPNPCRFSPNGHFGSKAVTVCVTGDDKNQVHMEGYQ 410
Query: 270 MSDMCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKDV---------KEV 312
++ C+ L ++G + D P+L + DV KD + +
Sbjct: 411 VTYQCMALVRDGCL---VPTKDAPELGYVIESTDKQYVPDVYYKEKDTYGNEVSRLARPL 467
Query: 313 DNDFFLVVVKILDHQGPLS-----------STFPIENRTTQVTMRALKSHLNRSPSLP-- 359
++ LV V PL+ + FP+E+R ++ S N
Sbjct: 468 PVEYLLVDVPA---STPLTPQFTFYANENITPFPVEHRLIDGQVQEFGSLCNYMQQFTPD 524
Query: 360 -LVKRISDFHLLLFLARFLDL 379
+ +SDFHLL+F A F+D+
Sbjct: 525 QFLDAVSDFHLLIFFA-FMDV 544
>gi|270011854|gb|EFA08302.1| hypothetical protein TcasGA2_TC005937 [Tribolium castaneum]
Length = 483
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 104/206 (50%), Gaps = 30/206 (14%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
+ RQ H ++V+F+ D ++++N + +R GF+YG V R V
Sbjct: 237 LNRQVYRHVDNVAFEN--KDLVERFLNYWRSTGHQRIGFLYGNYEIHNDVPLGIRANVVA 294
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
IYEPPQ+ + + +++LRD++EE +V+ +A LG+++VG+IFT I +++
Sbjct: 295 IYEPPQESSRDSIHLLRDDKEE-IVEELAQNLGLRRVGWIFTDLIPEDVQKGTVKHLRHI 353
Query: 222 KDYTLSNREVLQAVEFH----AECNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
+ LS E + A F C + + +V + V+ + + +H E +Q+S+ C
Sbjct: 354 NSHFLSAEECIMAGHFQNMYPNPCRFASGGYFGSKIVTVCVTGDKTNQVHMEGYQVSNQC 413
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK 300
+ L ++ + D P+L +++
Sbjct: 414 MALVRDNCL---LPTKDAPELGYVRE 436
>gi|357606887|gb|EHJ65269.1| putative nuclear protein localization [Danaus plexippus]
Length = 608
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 134/308 (43%), Gaps = 62/308 (20%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
+TRQ H ++V + A +++++ A +R GF+YG V R V
Sbjct: 228 LTRQPYRHVDNVLLEH--AAPVERFLSYWRATGHQRVGFLYGRYELHPDVPLGIRARVAA 285
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
+YEPPQ+ + + + + D+ L+D +AA LG+++VG+IFT + ++
Sbjct: 286 VYEPPQECSRDAVRLASDDHA-ALLDRLAARLGLERVGWIFTDLLPLDLVSGTVQCLRGV 344
Query: 222 KDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
+ LS +E + A F E C + + V + V+ + IH E +Q+S C
Sbjct: 345 DTHFLSAQECITAGHFQNEHPNACRHASSGYFGSKFVTVCVTGDADNHIHLEGYQVSGQC 404
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMK--------KDVVVGGKD---------VKEVDNDFF 317
L ++G + D P+L ++ DV KD K + +
Sbjct: 405 AALVRDGIL---LPTRDAPELGYIRDCSPEQYVPDVYYKEKDAYGNEVGVSAKRLPVAYL 461
Query: 318 LVVVKILDHQGPLS--------STFPIENRTTQ---VTMRALKSHLNRSPSLPLVKRISD 366
LV V + P + ++FP +R Q ++ L +H+ R+ S + SD
Sbjct: 462 LVDVPV--GVAPAAGEPTFDPRASFPPAHRPLQQHVQSLSGLHAHVERAES--FLAAASD 517
Query: 367 FHLLLFLA 374
H+LLFLA
Sbjct: 518 LHVLLFLA 525
>gi|261196446|ref|XP_002624626.1| nuclear protein localization protein 4 [Ajellomyces dermatitidis
SLH14081]
gi|239595871|gb|EEQ78452.1| nuclear protein localization protein 4 [Ajellomyces dermatitidis
SLH14081]
Length = 660
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 131/320 (40%), Gaps = 59/320 (18%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L +
Sbjct: 279 INCLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQLGEIDGV-TLHEWSN 337
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
EK VD +A G++K+G IFT + D Y LS+ E++ A + A
Sbjct: 338 EKDVDEVAKLCGLEKIGVIFTDLMDARAGDGTVICKRHIDSYYLSSLEIIFAAQLQARYP 397
Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
+W + V +S + AI A+Q S+ V + K E G
Sbjct: 398 KATKWSETGQFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 457
Query: 291 ----DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------- 330
D LSK + + V G +V+E F V ++ L H P
Sbjct: 458 QENEPDDDLSKARYIPEVFYRRVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFN 517
Query: 331 SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPAL 386
STFPIENR + L++ N+ S P + +SDFHLL FL LN D +L
Sbjct: 518 DSTFPIENREVIGESQDLRTLANKLVSHGDPNKAICAVSDFHLLCFLHGLGILNKDEESL 577
Query: 387 AQCVQAQTAVPEGYKLLINS 406
C A+T P L+N+
Sbjct: 578 L-CTVARTHDPADGMQLVNT 596
>gi|239609444|gb|EEQ86431.1| nuclear protein localization protein 4 [Ajellomyces dermatitidis
ER-3]
Length = 660
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 131/320 (40%), Gaps = 59/320 (18%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L +
Sbjct: 279 INCLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQLGEIDGV-TLHEWSN 337
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
EK VD +A G++K+G IFT + D Y LS+ E++ A + A
Sbjct: 338 EKDVDEVAKLCGLEKIGVIFTDLMDARAGDGTVICKRHIDSYYLSSLEIIFAAQLQARYP 397
Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
+W + V +S + AI A+Q S+ V + K E G
Sbjct: 398 KATKWSETGQFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 457
Query: 291 ----DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------- 330
D LSK + + V G +V+E F V ++ L H P
Sbjct: 458 QENEPDDDLSKARYIPEVFYRRVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFN 517
Query: 331 SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPAL 386
STFPIENR + L++ N+ S P + +SDFHLL FL LN D +L
Sbjct: 518 DSTFPIENREVIGESQDLRTLANKLVSHGDPNKAICAVSDFHLLCFLHGLGILNKDEESL 577
Query: 387 AQCVQAQTAVPEGYKLLINS 406
C A+T P L+N+
Sbjct: 578 L-CTVARTHDPADGMQLVNT 596
>gi|327357005|gb|EGE85862.1| nuclear protein localization protein 4 [Ajellomyces dermatitidis
ATCC 18188]
Length = 660
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 131/320 (40%), Gaps = 59/320 (18%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L +
Sbjct: 279 INCLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQLGEIDGV-TLHEWSN 337
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
EK VD +A G++K+G IFT + D Y LS+ E++ A + A
Sbjct: 338 EKDVDEVAKLCGLEKIGVIFTDLMDARAGDGTVICKRHIDSYYLSSLEIIFAAQLQARYP 397
Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
+W + V +S + AI A+Q S+ V + K E G
Sbjct: 398 KATKWSETGQFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 457
Query: 291 ----DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL------- 330
D LSK + + V G +V+E F V ++ L H P
Sbjct: 458 QENEPDDDLSKARYIPEVFYRRVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFN 517
Query: 331 SSTFPIENRTTQVTMRALKSHLNR--SPSLP--LVKRISDFHLLLFLARFLDLNSDVPAL 386
STFPIENR + L++ N+ S P + +SDFHLL FL LN D +L
Sbjct: 518 DSTFPIENREVIGESQDLRTLANKLVSHGDPNKAICAVSDFHLLCFLHGLGILNKDEESL 577
Query: 387 AQCVQAQTAVPEGYKLLINS 406
C A+T P L+N+
Sbjct: 578 L-CTVARTHDPADGMQLVNT 596
>gi|82594712|ref|XP_725542.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23480584|gb|EAA17107.1| Drosophila melanogaster GH03617p [Plasmodium yoelii yoelii]
Length = 520
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 121/271 (44%), Gaps = 50/271 (18%)
Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R G+MYG ED R IYEPPQ + + +L D+ VD IA LG+
Sbjct: 244 QRVGWMYGYYKEDSHYNLGIRAVCECIYEPPQFCEGDKVRLLPDDFMNN-VDLIAEKLGL 302
Query: 207 KKVGFIFTQTIMQNKKDYTLSNREVLQAVEF----------HAECNMEEWVTAVVKLEVS 256
+K+G IFT +K+Y L++ E+L + + N+ ++T + + S
Sbjct: 303 EKIGIIFTHLP---RKEY-LTSYELLNIAKLQLANLKNNTHYTNYNISNFITCTISPDPS 358
Query: 257 EEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD 308
+ AF +SD+ + L K EI DDP K++ ++ GGK+
Sbjct: 359 LNNEPITN--AFMVSDLGMSLVKYNLI-NEIQ--DDPSHIKLRTPNKYELLPQILEGGKE 413
Query: 309 VKEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSHLNRSP---SL 358
+ D D+ ++ V + P + FP ENR T+ +K + + S +
Sbjct: 414 TNKFDTDWLIIRV---NDSAPKVVRSIFKNYHFPRENRNKPQTVYDVKEYFSSSKLNRGV 470
Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
R SDFHL++F A+ LD+ + AL C
Sbjct: 471 NGNSRCSDFHLIIFAAKVLDIET---ALVLC 498
>gi|72090152|ref|XP_789485.1| PREDICTED: nuclear protein localization protein 4 homolog
[Strongylocentrotus purpuratus]
Length = 627
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 80/306 (26%), Positives = 132/306 (43%), Gaps = 58/306 (18%)
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF------IY 174
RQ H ++V F+ F + ++ +R GF+YG K V + IY
Sbjct: 231 RQHYRHVDNVMFENPMM--FDRLLDFWRKCGNQRIGFLYGKYEHHKDVPLGIKATVLAIY 288
Query: 175 EPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNKKD 223
EPPQ T L +L D E VD +A+ LG+ KVG+I+T + ++N
Sbjct: 289 EPPQNSTSNSLELLEDPFAEA-VDYVASKLGLCKVGWIYTDLMADDLTKGTVKHVRNMDA 347
Query: 224 YTLSNREVLQAVEFH----AECNME---EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVR 276
+ +S E + A EF + C + + + V V+ + + IH EA+Q+S+ C+
Sbjct: 348 HFMSAEECIMAGEFQNQHPSPCKLATEGHFGSKFVTCIVTGDSTNQIHTEAYQVSNQCMA 407
Query: 277 LFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDNDFFLV 319
L ++ I D P+L +++ DV D + + +F LV
Sbjct: 408 LVRDDCLVPTI---DAPELGYIRESTNEQYVPDVFYSETDSYGNSSKQLARPLPVEFLLV 464
Query: 320 VVK--------ILDHQGPLSSTFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISDFH 368
+ + P FPIENR ++ AL +L + S L++ + DFH
Sbjct: 465 DLPAGFPKDATFAFNSQPGLKPFPIENRAGVGEVQGIEALTQYLGQFESGQLMEAMLDFH 524
Query: 369 LLLFLA 374
LL++LA
Sbjct: 525 LLIYLA 530
>gi|193613344|ref|XP_001951828.1| PREDICTED: nuclear protein localization protein 4 homolog
[Acyrthosiphon pisum]
Length = 630
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 73/310 (23%), Positives = 140/310 (45%), Gaps = 58/310 (18%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQ H +++ F+ ++ ++++N +R GF+YG R V
Sbjct: 235 ITLNRQTYRHVDNIMFEN--SELVERFLNYWRITGHQRIGFLYGRYETHPDIPLGIRAIV 292
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
IYEPPQ+ + + + +L DE+++ +V+ +A LG+ VG+IFT + ++
Sbjct: 293 AAIYEPPQESSRDHVKLLPDEKQD-IVNELARCLGLSCVGWIFTDLLTEDMQTGTVKYVR 351
Query: 220 NKKDYTLSNREVLQAVEFH----AECNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
N K + LS +E + A F C + + V + V+ + +H E + +S+
Sbjct: 352 NIKSHFLSAQECIMAGYFQNLHPNPCRYASNGYFGSKFVTVCVTGDEKCQVHMEGYSVSN 411
Query: 273 MCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVDND 315
C+ L K+ + + D P+L +++ DV KD + + +
Sbjct: 412 QCMALVKD---DCLVPTKDVPELGFVRESSDKQYVPDVYYKEKDSYGNEVSKLARPLPVE 468
Query: 316 FFLVVVKILDHQGPLS--------STFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRI 364
+ L+ V PL+ + FP+ENR ++ +L +L + L + +
Sbjct: 469 YLLLDVPASTPLTPLNTFTSIKDITKFPVENRLIDGHIQDFDSLCKYLRQFTPLQFYESV 528
Query: 365 SDFHLLLFLA 374
SDFHLLL++A
Sbjct: 529 SDFHLLLYIA 538
>gi|350406695|ref|XP_003487852.1| PREDICTED: nuclear protein localization protein 4 homolog [Bombus
impatiens]
Length = 647
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 78/310 (25%), Positives = 138/310 (44%), Gaps = 58/310 (18%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
+ + RQ H ++V F+ ++++N + +R GF+YG V R V
Sbjct: 231 ITLNRQTYRHVDNVMFEN--PSLVERFLNYWRSTGHQRIGFLYGRYEIHTDVPLGIRAVV 288
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
IYEPPQ+ T+ + +L DE+E +VD +A L +K+VG+IFT I ++
Sbjct: 289 AAIYEPPQESTKNSIRLLPDEKE-VVVDELAHLLNLKRVGWIFTDLIVDDIKKGTVKHVR 347
Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
N + + LS +E + A F + C + + V + V+ + IH E +Q+S+
Sbjct: 348 NIESHFLSAQECIMAGYFQNKYPNPCRFSPNNYFGSKFVTVCVTGDEKNQIHMEGYQVSN 407
Query: 273 MCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKD---------VKEVDND 315
C+ L ++G + D P+L + DV KD + + +
Sbjct: 408 QCMALVRDGCL---VPTKDAPELGYVIESTDKQYVPDVFYKEKDSYGNEVSRLARPLPVE 464
Query: 316 FFLVVVKILDHQGPL--------SSTFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRI 364
+ LV + P + F IENR ++ +L S++ + ++ +
Sbjct: 465 YLLVDIPASTPLNPQFTFYVSNNITPFLIENRLIDGQIQEFSSLCSYMQQFSKEQFLEAV 524
Query: 365 SDFHLLLFLA 374
SDFHLL+F+A
Sbjct: 525 SDFHLLIFIA 534
>gi|340721568|ref|XP_003399190.1| PREDICTED: nuclear protein localization protein 4 homolog [Bombus
terrestris]
Length = 647
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 140/313 (44%), Gaps = 64/313 (20%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
+ + RQ H ++V F+ ++++N + +R GF+YG V R V
Sbjct: 231 ITLNRQTYRHVDNVMFEN--PSLVERFLNYWRSTGHQRIGFLYGRYEIHTDVPLGIRAVV 288
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
IYEPPQ+ T+ + +L DE+E +VD +A L +K+VG+IFT I ++
Sbjct: 289 AAIYEPPQESTKNSIRLLPDEKE-VVVDELAHLLNLKRVGWIFTDLIVDDIKKGTVKHVR 347
Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
N + + LS +E + A F + C + + V + V+ + IH E +Q+S+
Sbjct: 348 NIESHFLSAQECIMAGYFQNKYPNPCRFSPNNYFGSKFVTVCVTGDEKNQIHMEGYQVSN 407
Query: 273 MCVRLFKEGWFETEIAEGDDPKLS--------KMKKDVVVGGKD---------VKEVDND 315
C+ L ++ F + D P+L + DV KD + + +
Sbjct: 408 QCMALVRDSCF---VPTKDAPELGYVIESTDKQYVPDVFYKEKDSYGNEVSRLARPLPVE 464
Query: 316 FFLVVVKILDHQGPLS-----------STFPIENRTTQVTMR---ALKSHLNRSPSLPLV 361
+ LV + PL+ + F IENR ++ +L S++ + +
Sbjct: 465 YLLVDIPA---STPLNPQFTFYVSNNITPFLIENRLIDGQIQEFSSLCSYMQQFSKEQFL 521
Query: 362 KRISDFHLLLFLA 374
+ +SDFHLL+F+A
Sbjct: 522 EAVSDFHLLIFIA 534
>gi|67624425|ref|XP_668495.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54659695|gb|EAL38261.1| hypothetical protein Chro.70227 [Cryptosporidium hominis]
Length = 490
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 112/503 (22%), Positives = 189/503 (37%), Gaps = 126/503 (25%)
Query: 2 LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLS--TNQNLLLAKSPSDLL 59
++R+ S G RVT+ ++ LK+ I L++P E Q LS ++ N LL +L
Sbjct: 6 IIRVFSSSGRSRVTIPST-CNLSMLKSKIAQSLKVPEEQQLLSLDSHGNTLLKGDSLSML 64
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNV--------------------------- 92
Q +S+GSIV+L D +
Sbjct: 65 QH-------------GLSNGSIVYLITDATPQIATQSIQRPKHMTTSDLPQLEKKPSNSN 111
Query: 93 -------------------RGPSFNPAGSF--GRKMTMDDLIAKQ--------------- 116
GP F SF R DDL KQ
Sbjct: 112 YSSSTSADSKDMNNSVEKSEGPHFKSFDSFLSERNFVTDDLPLKQSYKSFFISKGVMNKI 171
Query: 117 ---MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------ 166
+ + Q H + + + A F Y L +R G+MYG ED
Sbjct: 172 PPSVTLRHQAYRHVDHLEMMNLSEAMQFVDYWRSKLGMMKQRVGWMYGYYREDSTYPMGI 231
Query: 167 RVEVNFIYEPPQ-QGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT 225
R + IYEPPQ + E IL ++ + VD +A LG++ +G +FT N++D
Sbjct: 232 RAVMEAIYEPPQDEKAEPGKLILENDSFKSKVDRVAQSLGLECLGLVFT----HNERDEV 287
Query: 226 LSNREVLQAVEFHAEC---------NMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVR 276
L++ E++ + E + ++ + S +G I AF +SD+ +
Sbjct: 288 LTSNEIVTLGKLQLEALKTLHYTGYPVSSFICCTIAPCKSIQGGDPIP-NAFAVSDLGLA 346
Query: 277 LFKEGWFETEIAEGD------DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL 330
L ++ + + + D + + ++ ++ G ++ D + +V + + P+
Sbjct: 347 LLRDKIIDEKGLKDDTHIVIREEEKGEILPQILEKGVSTRKFDAHWLIVRI---NESAPI 403
Query: 331 -------SSTFPIENRTTQVTMRALKSHL-NRSPSLPLV--KRISDFHLLLFLARFLDLN 380
SS FP ENR + + +R S+PL ++DFHLLL+LA+ D
Sbjct: 404 QPKPFFSSSQFPRENRIVPQKPSDVSEFIKSRLTSVPLTSCNLLNDFHLLLYLAKLFD-- 461
Query: 381 SDVPALAQCVQAQTAVPEGYKLL 403
+ A++ C P LL
Sbjct: 462 -EATAISICESIINKTPVDQHLL 483
>gi|258566089|ref|XP_002583789.1| protein NPL4 [Uncinocarpus reesii 1704]
gi|237907490|gb|EEP81891.1| protein NPL4 [Uncinocarpus reesii 1704]
Length = 669
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 89/311 (28%), Positives = 128/311 (41%), Gaps = 58/311 (18%)
Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
+R GF+YGT E V V IYEPPQ +EV I LR+ E E+ VD +A
Sbjct: 298 GAQRLGFLYGTYEEYSEVPLGIKAVVQAIYEPPQ--VDEVDGITLREWENEQDVDQVAKL 355
Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEWV---- 247
G++KVG IFT + D Y LS+ EV+ A + A +W
Sbjct: 356 CGLEKVGVIFTDLLDSGHGDGTVICRRHIDSYYLSSLEVIFAAQLQARYPKPSKWSETGR 415
Query: 248 --TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEG-------------WFETEIAEGDD 292
+ V +S + + AI A+Q S+ V + + E+E+ D
Sbjct: 416 FGSNFVTCILSGDENGAISISAYQASNSAVEMVRADIVEPSADPSVMLVQLESELENADT 475
Query: 293 PKLSKMK----KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSS-------TFPIENR 339
+ + + V G +V+E F V + L H P S TFPIENR
Sbjct: 476 GTVRYIPEVFYRKVNEYGANVQENAKPAFPVEYLFVTLTHGFPTGSSPLFTDNTFPIENR 535
Query: 340 ----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
+Q R L+R+ ++ +S+FHLL F+ L D L C A+T
Sbjct: 536 EVIGESQDIRRVANKLLSRTDPDSAIRAVSNFHLLCFMHGLGILTKDEETLL-CTVARTH 594
Query: 396 VPEGYKLLINS 406
P LIN+
Sbjct: 595 DPADGVQLINT 605
>gi|66362816|ref|XP_628374.1| nuclear pore associated protein (NLP4) with N-terminal ubiquitin
domain [Cryptosporidium parvum Iowa II]
gi|46229793|gb|EAK90611.1| nuclear pore associated protein (NLP4) with N-terminal ubiquitin
domain [Cryptosporidium parvum Iowa II]
Length = 491
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 112/497 (22%), Positives = 190/497 (38%), Gaps = 126/497 (25%)
Query: 2 LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLS--TNQNLLLAKSPSDLL 59
++R+ S G RVT+ ++ LK+ I L++P E Q LS ++ N LL +L
Sbjct: 7 IIRVFSSSGRSRVTIPST-CNLSMLKSKIAQSLKVPEEQQLLSLDSHGNTLLKGDSLSVL 65
Query: 60 QFTDMANPDRPLSSLNISHGSIVFLSYDGERNV--------------------------- 92
Q + +GSIV+L D +
Sbjct: 66 QH-------------GLFNGSIVYLITDATPQIATQSIQRPKHMTTSDLPPLEKKPSNSN 112
Query: 93 -------------------RGPSFNPAGSF--GRKMTMDDLIAKQ--------------- 116
GP F SF R DDL KQ
Sbjct: 113 YPSSSSADSKDMNNSVEKSEGPHFKSFDSFLSERNFVTDDLPLKQSYKSFFISKGVMNKI 172
Query: 117 ---MRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------ 166
+ + Q H + + + A F Y L +R G+MYG ED
Sbjct: 173 PPSVTLRHQAYRHVDHLEMMNLSEAMQFVDYWRSKLGMMKQRVGWMYGYYREDSTYPMGI 232
Query: 167 RVEVNFIYEPPQ-QGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT 225
R + IYEPPQ + E IL ++ + VD +A LG++ +G +FT N++D
Sbjct: 233 RAVMEAIYEPPQDEKAEPGKLILENDSFKSTVDRVAQSLGLECLGLVFT----HNERDEV 288
Query: 226 LSNREVLQAVEFHAEC---------NMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVR 276
L++ E++ + E + ++ + S +G I AF +SD+ +
Sbjct: 289 LTSNEIVTLGKLQLEALKTPHYTGYPVSSFICCTIAPCKSIQGGDPIP-NAFAVSDLGLA 347
Query: 277 LFKEGWFETEIAEGDDPKLSKMKKD-------VVVGGKDVKEVDNDFFLVVVKILDHQGP 329
L ++ + E DD + ++D ++ G ++ D + +V + + P
Sbjct: 348 LLRDKIID-EKGLKDDTHIVIREEDKGEILPQILEKGVSTRKFDAHWLIVRI---NESAP 403
Query: 330 L-------SSTFPIENRTTQVTMRALKSHL-NRSPSLPLV--KRISDFHLLLFLARFLDL 379
+ SS FP ENR + + +R S+PL ++DFHLLL+LA+ D
Sbjct: 404 IQPKPFFSSSQFPRENRIVPQKPSDVSEFIKSRLTSVPLTSCNLLNDFHLLLYLAKLFDE 463
Query: 380 NSDVPALAQCVQAQTAV 396
+ + ++ + + +TAV
Sbjct: 464 ATAI-SICESIINKTAV 479
>gi|294925935|ref|XP_002779039.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239887885|gb|EER10834.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 363
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 137/330 (41%), Gaps = 83/330 (25%)
Query: 126 HCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQ 179
HCE ++ + +F QY +R G++YG L D R+ V IYEPP+Q
Sbjct: 69 HCEWMNTEE--IKSFVQYWQFDKEMLQQRCGWLYGYYLSDPNYDDGCRIVVEGIYEPPKQ 126
Query: 180 GT----------EEVLYILRDEEEEK----------------LVDAIAAGLGMKKVGFIF 213
E+L +++ + K VD + + LG+++VG IF
Sbjct: 127 EVYNAATGGIRMNEMLVAMQEASKGKPLDPGVLGGMVDPALQKVDVVMSKLGLERVGCIF 186
Query: 214 TQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDM 273
T +DY +S+ E+L A KL + SD
Sbjct: 187 TAL----PRDYEMSSGELLAAARLQ-------------KL-------------VWMASDQ 216
Query: 274 CVRLFKEGWFET-EIAEGD------DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDH 326
+ ++G F+ + AE +P +M V+ G +V E D D+ LV V +
Sbjct: 217 AEGMLQDGLFDVKKTAETPTRVQLREPFSQEMMPPVLASGSEVTEFDPDWLLVKV---ND 273
Query: 327 QGPLS-------STFPIENRTTQVTMRALKSHLNRSPS-LPLVKRISDFHLLLFLARFLD 378
PL S FP ENR+ Q + +K ++ + P+ P R +DFHLL+++ LD
Sbjct: 274 GVPLKKRSMFRFSHFPRENRSRQQSPDDIKQYMRQIPAGTPSWARYADFHLLVYITLLLD 333
Query: 379 LNSDVPALAQCVQAQTAVPEGYKLLINSMA 408
++ A+A + + + + L+ +MA
Sbjct: 334 EDT-AGAIAGSISREEEIDKAMDELLTNMA 362
>gi|70952299|ref|XP_745327.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56525615|emb|CAH80963.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 511
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 121/270 (44%), Gaps = 48/270 (17%)
Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R G+MYG ED R IYEPPQ + + +L D+ VD IA LG+
Sbjct: 235 QRIGWMYGYYKEDTHYNLGIRAVCECIYEPPQFCEGDQVRLLPDDFMSN-VDLIAEKLGL 293
Query: 207 KKVGFIFTQTIMQNKKDYTLSNREV---------LQAVEFHAECNMEEWVTAVVKLEVSE 257
+K+G IFT +K+Y S+ V L+ + ++ ++T + +
Sbjct: 294 EKIGIIFTHLP---RKEYLASHELVNIAKLQLSNLKKNTHYTNYSISNFITCTISPDPLL 350
Query: 258 EGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV 309
+ AF +SD+ + L ++ + EI DDP K++ ++ GGK+
Sbjct: 351 NNEPITN--AFMVSDLGMSLIRDNLID-EIQ--DDPSHIKLRSPNKHELLPQILEGGKET 405
Query: 310 KEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSHLNRSP---SLP 359
+ D D+F++ V + P + FP ENR T+ +K + + S +
Sbjct: 406 NKFDTDWFIIRV---NDSAPKVVRSIFKNYHFPRENRNKPQTVYDVKEYFSSSKLDRGVN 462
Query: 360 LVKRISDFHLLLFLARFLDLNSDVPALAQC 389
R SDFHL++F A+ LD+ + AL C
Sbjct: 463 GKSRCSDFHLIIFAAKVLDIET---ALVLC 489
>gi|221057203|ref|XP_002259739.1| Nuclear pore associated protein (NLP4) [Plasmodium knowlesi strain
H]
gi|193809811|emb|CAQ40515.1| Nuclear pore associated protein (NLP4),putative [Plasmodium
knowlesi strain H]
Length = 523
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 126/284 (44%), Gaps = 44/284 (15%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
F QY + +R G+MYG ED R IYEPPQ + + +L D+
Sbjct: 234 FVQYWYTNGSMCEQRVGWMYGYYREDPHYTMGIRAVCECIYEPPQINEVDKVKLLPDDFL 293
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF----------HAECNM 243
+ VD IA LG++++G+IFT +K++ L++ EV+Q + + ++
Sbjct: 294 DS-VDVIANRLGLERIGWIFTHL---PRKEH-LTSEEVVQIAKLQLANLKKNVHYTNYSV 348
Query: 244 EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET-----EIAEGDDPKLSKM 298
++T + + + AF +SD+ + L + + + + +P +++
Sbjct: 349 SNFITCTISPDPMLSNEPVTN--AFMVSDLGMSLIQSNLVDPVQTDPSVIQLRNPMPNEL 406
Query: 299 KKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSH 351
++ GGK+ + D D+F+V + + P + FP ENR +K +
Sbjct: 407 LPLILEGGKETNKFDTDWFIVRI---NESAPKVVRSLFKNFHFPRENRLRSPNAYDVKEY 463
Query: 352 LNRSP---SLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQA 392
+ + R SDFHL+LF R LD+ + ALA C A
Sbjct: 464 FASTKLERGTNGMHRCSDFHLILFACRVLDMET---ALALCDAA 504
>gi|399217022|emb|CCF73709.1| unnamed protein product [Babesia microti strain RI]
Length = 450
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 110/458 (24%), Positives = 195/458 (42%), Gaps = 98/458 (21%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
+++RI G+ RVT+ + +++L + I + +P+E Q LS +PSD
Sbjct: 3 IVVRIFCPTGIIRVTLPTDSM-LSDLMSEIAERTGVPIEKQCLSI--------APSDT-- 51
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGE--------------RNVRGPSFNPAGSFGRK 106
+ N + L + I HG+I+ L+ D ++V+ SF+ S R
Sbjct: 52 -SIDPNVNVKLDKIGIVHGTILNLNSDYPVEFQDTSKEIPEFMQDVKFKSFDSFLS-ERG 109
Query: 107 MTMDDL------------------IAKQMRVTRQENPHCESVSFDRDCADAFQQYVN--- 145
+ DL I + + + RQ H + + + + +VN
Sbjct: 110 YNVSDLPLKNEYLPVKIEVGKMNKIPQSIALKRQPYRHVDHLEIMN--TEEIRNFVNFWM 167
Query: 146 ETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDA 199
+R G+MYG +ED R IYEP Q G E + I D +VD
Sbjct: 168 HDCEMLTQRCGWMYGYYIEDVHYPMGIRAVCEAIYEPIQTGDIESVRIEPDNFL-SVVDT 226
Query: 200 IAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF---------HAECNMEEWVTAV 250
IA LG++++GFIF+ N + L+ +E+ + + N VT
Sbjct: 227 IAERLGLERIGFIFSHLPRDN---FFLTAQEIYDIAKIQNSRIDTGHYTGYNASHHVTCT 283
Query: 251 VKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKD-------VV 303
+ +V+ G +++ AF +SDM + L ++G + + + D+ K+ + + V+
Sbjct: 284 ISPDVNNNGECSLN--AFMVSDMGMALVRDGLIDPK--QDDNLKIHLVNPNEYVIFPGVL 339
Query: 304 VGGKDVKEVDNDFFLVVVKILDHQGPLS-------STFPIENRTTQVTMRA--LKSH-LN 353
G + D + LV V + P++ S F ENR ++A +K++ N
Sbjct: 340 EEGTETTSFDTSWVLVRV---NESAPITPNSFFKNSLFQRENRGKAPNIQAYFMKANGAN 396
Query: 354 RSPSLPLVKRI--SDFHLLLFLARFLDLNSDVPALAQC 389
+P + SDFHLLL +A+ +D+++ ALA C
Sbjct: 397 DNPKDGFDSHVLYSDFHLLLHIAKNVDIDT---ALAIC 431
>gi|124506075|ref|XP_001351635.1| Nuclear pore associated protein (NLP4), putative [Plasmodium
falciparum 3D7]
gi|23504562|emb|CAD51442.1| Nuclear pore associated protein (NLP4), putative [Plasmodium
falciparum 3D7]
Length = 531
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 121/266 (45%), Gaps = 40/266 (15%)
Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R G+MYG ED R IYEPPQ + +++L D+ VD IA LG+
Sbjct: 255 QRIGWMYGYYREDNHYNLGIRAVCECIYEPPQFCEDNKIHLLEDDFLPT-VDLIAERLGL 313
Query: 207 KKVGFIFTQTIMQNKKDYTLSNREV----LQAVEFHAECNMEEW-VTAVVKLEVSEEG-- 259
+++G+IFT Q +Y S+ V +Q + + ++ + +S +
Sbjct: 314 ERIGWIFTHLPRQ---EYLTSDEVVNIAKMQLTNIKKNMHYTNYPISNFITCTISPDPLL 370
Query: 260 SAAIHFEAFQMSDMCVRLFKEGWFETEIAEGD-----DPKLSKMKKDVVVGGKDVKEVDN 314
S AF +SD+ + L ++ E + +P +++ ++ GK+ + D
Sbjct: 371 SNEPVTNAFMVSDLGMALMRDNLIEENQIDPSHIQLRNPNKNELLPQILESGKETNKFDT 430
Query: 315 DFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSHLNRSPSLPL----VKR 363
D+F+V V + P + FP ENR + +K + RS L + +
Sbjct: 431 DWFIVRV---NESAPKVVRSIFKNFHFPRENRQHSQSAFHVKEYF-RSNKLDRGTKGIHQ 486
Query: 364 ISDFHLLLFLARFLDLNSDVPALAQC 389
SDFHL+LF+A+ LD+ + ALA C
Sbjct: 487 CSDFHLILFVAKVLDIET---ALALC 509
>gi|240279728|gb|EER43233.1| endoplasmic reticulum and nuclear membrane protein [Ajellomyces
capsulatus H143]
gi|325092857|gb|EGC46167.1| endoplasmic reticulum and nuclear membrane protein [Ajellomyces
capsulatus H88]
Length = 658
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 91/325 (28%), Positives = 130/325 (40%), Gaps = 59/325 (18%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L +
Sbjct: 278 INSLLDFWRKSGTQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWSN 336
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
EK VD +A G++K+G IFT + D Y LS+ E++ A + A
Sbjct: 337 EKDVDEVAKFCGLEKIGVIFTDLLDAGAGDGTVICKRHIDSYYLSSLEIVFAAQLQARYP 396
Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
+W + V +S + AI A+Q S+ V + K E G
Sbjct: 397 KATKWSETGRFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 456
Query: 291 -----DDPKLSKMKKDVVVG-----GKDVKEVDNDFFLV--VVKILDHQGPL-------S 331
DD ++ +V G +V+E F V ++ L H P +
Sbjct: 457 QENQPDDASKARYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFNN 516
Query: 332 STFPIENR-----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPAL 386
STFPIENR + + M A K + P+ + +SDFHLL FL LN D L
Sbjct: 517 STFPIENREVIGESQDLRMLADKLVSHGDPNKTICG-VSDFHLLCFLNSLGILNKDEEFL 575
Query: 387 AQCVQAQTAVPEGYKLLINSMAETV 411
V +G +L+ S T+
Sbjct: 576 LCTVARSHDTADGMQLINTSGWATL 600
>gi|225562903|gb|EEH11182.1| endoplasmic reticulum and nuclear membrane protein [Ajellomyces
capsulatus G186AR]
Length = 659
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 91/326 (27%), Positives = 130/326 (39%), Gaps = 60/326 (18%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L +
Sbjct: 278 INSLLDFWRKSGTQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWSN 336
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC- 241
EK VD +A G++K+G IFT + D Y LS+ E++ A + A
Sbjct: 337 EKDVDEVAKFCGLEKIGVIFTDLLDAGAGDGTVICKRHIDSYYLSSLEIVFAAQLQARYP 396
Query: 242 NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG----- 290
+W + V +S + AI A+Q S+ V + K E G
Sbjct: 397 KATKWSETGRFGSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQ 456
Query: 291 ------DDPKLSKMKKDVVVG-----GKDVKEVDNDFFLV--VVKILDHQGPL------- 330
DD ++ +V G +V+E F V ++ L H P
Sbjct: 457 QENQLDDDASKARYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFN 516
Query: 331 SSTFPIENR-----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPA 385
+STFPIENR + + M A K + P+ + +SDFHLL FL LN D
Sbjct: 517 NSTFPIENREVIGESQDLRMLADKLVSHGDPNKTICG-VSDFHLLCFLNSLGILNKDEEF 575
Query: 386 LAQCVQAQTAVPEGYKLLINSMAETV 411
L V +G +L+ S T+
Sbjct: 576 LLCTVARSHDTADGMQLINTSGWATL 601
>gi|296810816|ref|XP_002845746.1| NPL4 [Arthroderma otae CBS 113480]
gi|238843134|gb|EEQ32796.1| NPL4 [Arthroderma otae CBS 113480]
Length = 652
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 134/318 (42%), Gaps = 61/318 (19%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV + L + E
Sbjct: 278 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWE 335
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A GM+K+G IFT I D Y LS+ EV+ A + A
Sbjct: 336 NEKDVDQVARLCGMEKIGVIFTDLIDSGAGDGTVICKRHIDSYYLSSLEVVFASQLQARN 395
Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------- 287
C E + + V +S + + AI A+Q S+ V + + E
Sbjct: 396 PKPCKWSETGQFGSNFVTCVLSGDENGAISISAYQASNSAVEMVRADIIEPSADPSVMLV 455
Query: 288 ---AEGDDPKLSKMKKDVVVGGKD-----VKEVDNDFFLV--VVKILDHQGP-------L 330
+E D +++ +V + V+E N F V + L H P
Sbjct: 456 QQESELDSANITRYIPEVFYRRINEYKVSVQENANPSFPVEYLFVTLTHGFPTMPTTVFT 515
Query: 331 SSTFPIENRTT---QVTMRALKSHLNRS--PSLPLVKRISDFHLLLFLARFLDLNSDVPA 385
S+FPIENR +R + S L S P++ +K +S+FHLL FL L+ D A
Sbjct: 516 DSSFPIENREIIGESQDLRRVASKLTGSSDPNV-AIKAVSNFHLLAFLHGMGILSEDEEA 574
Query: 386 LAQCVQAQTAVPEGYKLL 403
L V + +G +L+
Sbjct: 575 LLCNVATKHDPADGVQLI 592
>gi|242007387|ref|XP_002424522.1| nuclear protein localization, putative [Pediculus humanus corporis]
gi|212507948|gb|EEB11784.1| nuclear protein localization, putative [Pediculus humanus corporis]
Length = 451
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 100/203 (49%), Gaps = 30/203 (14%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEV 170
+ + RQ H ++V F+ A ++++N + ++R GF+YG V R V
Sbjct: 220 ITLNRQVYRHTDNVMFEN--AHLVERFLNYWRSSGLQRVGFLYGKYEVHSDVPLGIRATV 277
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQ 219
IYEPPQ+ +++ + IL D+ E+ +V+ +A LG+ +VG+IFT I ++
Sbjct: 278 AAIYEPPQESSKDSINILPDDREQ-IVEELAQHLGLVRVGWIFTDLIADDVQKGTVKHVR 336
Query: 220 NKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSD 272
N + + LS +E + A + C + + V + V+ + S +H E +Q+S
Sbjct: 337 NIESHFLSAQECITAGYLQNQRPNPCRFSSTGFFGSKFVTVCVTGDSSKQVHMEGYQVSG 396
Query: 273 MCVRLFKEGWFETEIAEGDDPKL 295
C+ L ++G + D P+L
Sbjct: 397 QCMALVRDGCL---VPTKDAPEL 416
>gi|332020704|gb|EGI61109.1| Nuclear protein localization protein 4-like protein [Acromyrmex
echinatior]
Length = 515
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 107/229 (46%), Gaps = 35/229 (15%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
+ RQ H ++V F+ A ++++N + +R G++YG V R V
Sbjct: 248 LNRQTYRHVDNVMFEN--ASLVERFLNYWRSTGHQRIGYLYGRYEVHSDVPLGIRAVVAA 305
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
IYEPPQ+ T++ + +L D E E LV+ + L +++VG+IFT I ++N
Sbjct: 306 IYEPPQESTKDTIRLLPD-EREALVEELGRTLNLRRVGWIFTDLIADDVKKGTVKHVRNI 364
Query: 222 KDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
+ + LS +E + A F + C + + V + V+ + +H E +Q+S+ C
Sbjct: 365 ESHFLSAQECIMAGYFQNQYPNPCRFSPNGYFGSKFVTVCVTGDDKNQVHMEGYQVSNQC 424
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKI 323
+ L ++G + D P+L V+ D + V + F+ V I
Sbjct: 425 MALVRDGCL---VPTKDAPELG-----YVIESTDKQYVPDVFYKVSSNI 465
>gi|358366858|dbj|GAA83478.1| endoplasmic reticulum and nuclear membrane proteinc Npl4
[Aspergillus kawachii IFO 4308]
Length = 667
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 94/330 (28%), Positives = 135/330 (40%), Gaps = 70/330 (21%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E K V V IYEPPQ +EV + L +
Sbjct: 277 INSLLDFWRKSGAQRLGFLYGTYEEYKEVPLGVKAVVQAIYEPPQ--VDEVDGVTLHEWP 334
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A G+++VG IFT + + D Y LS+ E+ A A+
Sbjct: 335 NEKEVDEVAHLCGLERVGVIFTDLLDAGRGDGSVICKRHIDSYYLSSLEIAFAARLQAQH 394
Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG+ AI A+Q S V + + E
Sbjct: 395 PKATKWSRTGRFGSNFVTCV--LSGDEEGAIAI--SAYQASVAAVEMVRADIVEPSAEPS 450
Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
+E DD P++ K+ + + ++ K +L+V L H P
Sbjct: 451 VMLVQSEDDDTENKSRYIPEVFYRKINEYGISAQQNAKPSFPVEYLLVT--LTHGFPTDA 508
Query: 330 ----LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
S+FPIENR +Q + K + ++ +SDFH+L FL N
Sbjct: 509 SPLFTDSSFPIENREVIGESQELIHVAKKLVTHGDPDKAIRAVSDFHMLCFLHSLSTFNK 568
Query: 382 DVPALAQCVQAQTAVPEGYKLLINSMAETV 411
D AL V Q A +G +L+ S T+
Sbjct: 569 DEEALLCRVATQRAPADGLQLINTSGWATL 598
>gi|196000646|ref|XP_002110191.1| hypothetical protein TRIADDRAFT_63700 [Trichoplax adhaerens]
gi|190588315|gb|EDV28357.1| hypothetical protein TRIADDRAFT_63700 [Trichoplax adhaerens]
Length = 606
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 80/338 (23%), Positives = 143/338 (42%), Gaps = 71/338 (21%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF---- 172
+ + RQ H + V F + + ++N + +R G++YG K V +
Sbjct: 223 ITLNRQSYRHIDYVMFQN--SKIMENFLNYWRSCGCQRLGYLYGYYDPHKDVPLGIKTVV 280
Query: 173 --IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT----- 225
IYEPPQ+ T+ + +L D + + VDA+A LG++KVG+I T + ++ T
Sbjct: 281 VAIYEPPQKNTKNSIQLLEDAQADT-VDAVAKILGLRKVGWIVTDLLPKDVHKGTVQHTR 339
Query: 226 ------LSNREVLQAVEF-HAECNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSD 272
+S E + A F +A N ++ + V + VS + + +I +Q+S+
Sbjct: 340 GVDMHLMSAEECIMAAAFQNAHPNPCKYARDKYFGSKFVTVIVSGDCNDSIDIHGYQVSN 399
Query: 273 MCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------------- 308
C+ L ++ + DDP L+ ++ DV KD
Sbjct: 400 QCMSLVRDNCL---VPTRDDPTLAYVRNSTNIQYVPDVFYKMKDEYNNEKLALGRPLPVE 456
Query: 309 --VKEVDNDFFLVVVKILDHQGPLSSTFPIENRTT-------QVTMRALKSHLNRSPSLP 359
+ +V N F + +LD L FPIENR V + + H++
Sbjct: 457 YLLIDVPNGFPKNMKFLLDK---LDDQFPIENRDNIGQPQNFNVLAKYAEQHMDGK---- 509
Query: 360 LVKRISDFHLLLFLARFLDLN-SDVPALAQCVQAQTAV 396
++ SDFH L +L + + D+ LA V+ ++A+
Sbjct: 510 FLEMASDFHFLTYLVKTEIIKLEDLQPLATAVKQESAL 547
>gi|21360364|gb|AAM47486.1| unknown [Squalus mitsukurii]
Length = 167
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 76/152 (50%), Gaps = 22/152 (14%)
Query: 167 RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI--------- 217
R EV IYEPPQ GT L L D + E +VD IAA LG++KVG+IFT +
Sbjct: 14 RAEVAAIYEPPQVGTANSLEFLEDPKAE-VVDEIAAKLGLRKVGWIFTDLLSEDTRKGTV 72
Query: 218 --MQNKKDYTLSNREVLQAVEFHAE----CNM---EEWVTAVVKLEVSEEGSAAIHFEAF 268
++NK + LS E + A +F + C + + + V + + +HFE +
Sbjct: 73 KFIRNKDAHFLSAEECITAGDFQNKHPNVCRLSPVNHFGSKFVTVLATGGPDNQVHFEGY 132
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
Q+S+ C+ L + F + D P+L +K+
Sbjct: 133 QVSNQCMALVGDDCF---LPCRDAPELGYVKE 161
>gi|440291930|gb|ELP85172.1| nuclear protein localization, putative [Entamoeba invadens IP1]
Length = 375
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 91/184 (49%), Gaps = 13/184 (7%)
Query: 138 DAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLV 197
+ FQ+Y+ E F V R G ++G ++ V V IYEPPQ + ++ +D+ V
Sbjct: 194 NRFQKYILER-NFYVNRIGILFGEFAKNNNVRVEAIYEPPQVQEDNGDFVFKDQAIIANV 252
Query: 198 DAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVSE 257
D + A LG+ KVG I++ K+ L++RE + A + +W + L VS
Sbjct: 253 DKMMAELGLSKVGVIYSHN---GKRKEILTSREAIAAAQRQI-----DWNPNCLTLVVSP 304
Query: 258 EGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFF 317
+ +A+ E +Q+S+ V L K+ + A+ + L K+ V + K++ EV+ F
Sbjct: 305 DKNAS-KVECYQVSEQLVDLVKKALVQPTQADNNSILL---KESVYLSQKEIMEVEPVVF 360
Query: 318 LVVV 321
+ V
Sbjct: 361 VCNV 364
>gi|68071893|ref|XP_677860.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56498134|emb|CAH99046.1| conserved hypothetical protein [Plasmodium berghei]
Length = 516
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 120/272 (44%), Gaps = 52/272 (19%)
Query: 153 KRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R G+MYG ED R IYEPPQ + + +L D VD IA LG+
Sbjct: 240 QRVGWMYGYYKEDSHYNLGIRAVCECIYEPPQFCEGDKVRLLPDYFMNN-VDLIAEKLGL 298
Query: 207 KKVGFIFTQTIMQNKKDYTLSNREV---------LQAVEFHAECNMEEWVTAVVKLEVSE 257
+K+G IFT +K+Y S V L+ + ++ ++T + + S
Sbjct: 299 EKIGIIFTHLP---RKEYLTSEELVNIAKLQLSNLKKNTHYTNYSISNFITCTISPDPSL 355
Query: 258 EGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKDV 309
+ AF +SD+ + L ++ + EI DDP K++ ++ GGK+
Sbjct: 356 NNEPITN--AFMVSDLGMSLIRDNLID-EIQ--DDPSHIKLRTPNKYELLPQILEGGKET 410
Query: 310 KEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALK-----SHLNRSPS 357
+ D D+F++ V + P + F ENR T+ +K S LNR +
Sbjct: 411 NKFDTDWFIIRV---NDSAPKIVRSIFKNYHFSRENRNKPQTVYDVKEYFSSSKLNRGVN 467
Query: 358 LPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
R SDFHL++F A+ LD+ + AL C
Sbjct: 468 GN--SRCSDFHLIIFAAKVLDIET---ALVLC 494
>gi|303323093|ref|XP_003071538.1| NPL4 family protein [Coccidioides posadasii C735 delta SOWgp]
gi|240111240|gb|EER29393.1| NPL4 family protein [Coccidioides posadasii C735 delta SOWgp]
gi|320033362|gb|EFW15310.1| endoplasmic reticulum and nuclear membrane protein c Npl4
[Coccidioides posadasii str. Silveira]
Length = 657
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 93/322 (28%), Positives = 130/322 (40%), Gaps = 62/322 (19%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV I LR+ E
Sbjct: 276 INSLLDFWRKSGSQRLGFLYGTYEEYSEVPLGVKAVVQAIYEPPQ--VDEVDGITLREWE 333
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
E+ VD +A G++KVG IFT + D Y LS+ EV A A
Sbjct: 334 NERDVDEVAKLCGLEKVGVIFTDLLDSGLGDGTVISRRHIDSYYLSSLEVAFAARLQARY 393
Query: 242 -NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWF----------- 283
+W + V +S + + AI A+Q S+ V + +
Sbjct: 394 PKPSKWSETGRFGSNFVTCILSGDENGAISISAYQASNSAVEMVRADIVEPSADPSVMLV 453
Query: 284 --ETEIAEGDDPKLSKMK----KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL----- 330
E E+ D + + V G +V+E F V + L H P
Sbjct: 454 QRENELENADTGNARYIPEVFYRKVNEYGANVQENAKPAFPVEYLFVTLTHGFPTEPSAL 513
Query: 331 --SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
+TFPIENR +Q K L+RS ++ +S+FHLL F+ L+ +
Sbjct: 514 FADTTFPIENREVIGESQDIRNVAKKLLSRSDPDGAIRAVSNFHLLCFMHGLGILSKEEE 573
Query: 385 ALAQCVQAQTAVPEGYKLLINS 406
AL C A+T P LIN+
Sbjct: 574 ALL-CTVARTHDPADGVQLINT 594
>gi|159487106|ref|XP_001701576.1| predicted protein [Chlamydomonas reinhardtii]
gi|158271517|gb|EDO97334.1| predicted protein [Chlamydomonas reinhardtii]
Length = 102
Score = 67.0 bits (162), Expect = 2e-08, Method: Composition-based stats.
Identities = 29/71 (40%), Positives = 44/71 (61%), Gaps = 9/71 (12%)
Query: 113 IAKQMRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF 172
+AK +++ RQ+ PH ++SFDR+ A+ FQ Y+ F++KRGG +YGTVLE+
Sbjct: 1 MAKVVKIERQDKPHVSAISFDRNAANVFQSYLQGAFNFSIKRGGILYGTVLEE------- 53
Query: 173 IYEPPQQGTEE 183
E P+ G E
Sbjct: 54 --EGPEPGKTE 62
>gi|47217659|emb|CAG03056.1| unnamed protein product [Tetraodon nigroviridis]
Length = 584
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 147/376 (39%), Gaps = 125/376 (33%)
Query: 117 MRVTRQENPHCESVSFDRDC-ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVE 169
+ + RQ+ H +++ F+ D AD F + +T +R G++YG E K R E
Sbjct: 119 ITLNRQKYRHVDNIMFENDTIADRFLDFWRKT---GNQRMGYLYGRYTEHKDIPLGIRAE 175
Query: 170 VNFIYEPPQQG--TEEVLYILR-----------------------DEEEEKLVDAIAAGL 204
V IYEPPQ + +V +I+ D+ + + VD IAA L
Sbjct: 176 VAAIYEPPQVNLFSHKVSFIMSTVCMVTLSQYLFQNATQNSLELIDDPKAEAVDEIAAKL 235
Query: 205 GMKKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEF---HAE-CNMEE---W 246
G+ KVG+IFT + +NK + LS E + A F H+ C + +
Sbjct: 236 GLCKVGWIFTDLLSEDTRIGTVRYTRNKDSHYLSAEECITAGYFQNLHSNPCRLSRDGYF 295
Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK------ 300
+ V + + +HFE +Q+S+ C+ L ++ E + D P+L K+
Sbjct: 296 GSKFVTVVATGGPDNQVHFEGYQVSNQCMALVRD---ECLLPCKDAPELGFAKESSPEQY 352
Query: 301 --DVVVGGKDVKEVDNDFF-------LVVVKIL-----DHQGPLSST--FPIENR----T 340
DV KD D F +++ I D Q SS FPIENR
Sbjct: 353 VPDVFYKDKDKFGNDVTFLARPLPVEYLIIDITTTFPKDPQYTFSSAQRFPIENRDILGE 412
Query: 341 TQVT-------------------------------------------MRALKSHLNRSPS 357
TQ++ +L ++L++ S
Sbjct: 413 TQLSQLFSSRYRSSPPSAVMCRCNNKYSEELLLESSRPVTLEPLLNNFHSLSTYLSQCTS 472
Query: 358 LPLVKRISDFHLLLFL 373
++ +SDFHLLLFL
Sbjct: 473 TSFLEIVSDFHLLLFL 488
>gi|119189423|ref|XP_001245318.1| hypothetical protein CIMG_04759 [Coccidioides immitis RS]
gi|121755658|sp|Q1DY54.1|NPL4_COCIM RecName: Full=Nuclear protein localization protein 4
Length = 657
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/322 (28%), Positives = 130/322 (40%), Gaps = 62/322 (19%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV I LR+ E
Sbjct: 276 INSLLDFWRKSGSQRLGFLYGTYEEYSEVPLGVKAVVQAIYEPPQ--VDEVDGITLREWE 333
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
E+ VD +A G++KVG IFT + D Y LS+ EV A A
Sbjct: 334 NERDVDEVAKLCGLEKVGVIFTDLLDSGLGDGTVICRRHIDSYYLSSLEVAFAARLQARY 393
Query: 242 -NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWF----------- 283
+W + V +S + + AI A+Q S+ V + +
Sbjct: 394 PKPSKWSETGRFGSNFVTCILSGDENGAISISAYQASNSAVEMVRADIVEPSADPSVMLV 453
Query: 284 --ETEIAEGDDPKLSKMK----KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL----- 330
E E+ D + + V G +V+E F V + L H P
Sbjct: 454 QRENELENADTGNARYIPEVFYRKVNEYGANVQENAKPAFPVEYLFVTLTHGFPTEPSAL 513
Query: 331 --SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
+TFPIENR +Q K L+RS ++ +S+FHLL F+ L+ +
Sbjct: 514 FADTTFPIENREVIGESQDIRNVAKKLLSRSDPDEAIRAVSNFHLLCFMHGLGILSKEEE 573
Query: 385 ALAQCVQAQTAVPEGYKLLINS 406
AL C A+T P LIN+
Sbjct: 574 ALL-CTVARTHDPADGVQLINT 594
>gi|392868223|gb|EAS33974.2| nuclear protein localization protein 4 [Coccidioides immitis RS]
Length = 608
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/322 (28%), Positives = 130/322 (40%), Gaps = 62/322 (19%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV I LR+ E
Sbjct: 276 INSLLDFWRKSGSQRLGFLYGTYEEYSEVPLGVKAVVQAIYEPPQ--VDEVDGITLREWE 333
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
E+ VD +A G++KVG IFT + D Y LS+ EV A A
Sbjct: 334 NERDVDEVAKLCGLEKVGVIFTDLLDSGLGDGTVICRRHIDSYYLSSLEVAFAARLQARY 393
Query: 242 -NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWF----------- 283
+W + V +S + + AI A+Q S+ V + +
Sbjct: 394 PKPSKWSETGRFGSNFVTCILSGDENGAISISAYQASNSAVEMVRADIVEPSADPSVMLV 453
Query: 284 --ETEIAEGDDPKLSKMK----KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL----- 330
E E+ D + + V G +V+E F V + L H P
Sbjct: 454 QRENELENADTGNARYIPEVFYRKVNEYGANVQENAKPAFPVEYLFVTLTHGFPTEPSAL 513
Query: 331 --SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
+TFPIENR +Q K L+RS ++ +S+FHLL F+ L+ +
Sbjct: 514 FADTTFPIENREVIGESQDIRNVAKKLLSRSDPDEAIRAVSNFHLLCFMHGLGILSKEEE 573
Query: 385 ALAQCVQAQTAVPEGYKLLINS 406
AL C A+T P LIN+
Sbjct: 574 ALL-CTVARTHDPADGVQLINT 594
>gi|307175902|gb|EFN65715.1| Nuclear protein localization protein 4-like protein [Camponotus
floridanus]
Length = 513
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 96/202 (47%), Gaps = 30/202 (14%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
+ RQ H ++V F+ A ++++N + +R GF+YG V R V
Sbjct: 287 LNRQTYRHVDNVMFEN--ASLVERFLNYWRSTGHQRIGFLYGRYEVHSDVPLGIRAVVAA 344
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
IYEPPQ+ T++ + +L D E E LV+ + L ++++G+IFT I ++N
Sbjct: 345 IYEPPQESTKDTIRLLPD-EREALVEELGRTLNLRRIGWIFTDLIADDVKKGTVKHVRNI 403
Query: 222 KDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
+ + LS +E + A F C + + V + V+ + +H E +Q+S+ C
Sbjct: 404 ESHFLSAQECIMAGYFQNRYPNLCRFSPNGLFGSKFVTVCVTGDDKNQVHMEGYQVSNQC 463
Query: 275 VRLFKEGWFETEIAEGDDPKLS 296
+ L ++G + D P+L
Sbjct: 464 MALVRDGCL---VPTKDAPELG 482
>gi|407406442|gb|EKF30802.1| hypothetical protein MOQ_005377 [Trypanosoma cruzi marinkellei]
Length = 393
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/278 (23%), Positives = 123/278 (44%), Gaps = 31/278 (11%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLED-KRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVD 198
FQ +V + L ++++R G +YG +EV+ IYEP Q G + L D+ VD
Sbjct: 104 FQSFVRQ-LDWSLQRYGILYGKYDASLNLIEVHTIYEPEQHGDAYTFHYLPDKRLPD-VD 161
Query: 199 AIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL----QAVEFHAECNM----EEWVTAV 250
IAA LG+++VG + T ++ ++ LS+RE+L + F EC + T
Sbjct: 162 RIAALLGLRRVGAVCTHP-PRDIEEMVLSSREILLCAREQSRFGDECVLLTMSPNLTTGR 220
Query: 251 VKLEVSEEGSAAIHF--------EAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDV 302
++ + + A+HF ++ SD+ + E+A+ + + +
Sbjct: 221 IECQAWQTSPQAVHFYRLGALREKSDDYSDLESAKYVHSSIPLEVAQQETDEKGHPR--- 277
Query: 303 VVGGKDVKEVDNDFFLVVVKILDHQGPLSSTF------PIENRTTQVTMRALKSHLNRSP 356
VV ++D +F V + + P+ P + +R R
Sbjct: 278 VVTRAPSHDIDTRWFTSYVAVQQFESPVVRNLFMRVSRPGMEPPAMINLRNYMEDPKRR- 336
Query: 357 SLPLVKRISDFHLLLFLAR-FLDLNSDVPALAQCVQAQ 393
++PL+++++DFH+L+FLA L D+P + + Q
Sbjct: 337 NVPLIEKLADFHVLIFLAESVFSLADDMPVIIGAITKQ 374
>gi|350638102|gb|EHA26458.1| hypothetical protein ASPNIDRAFT_46653 [Aspergillus niger ATCC 1015]
Length = 666
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 93/322 (28%), Positives = 131/322 (40%), Gaps = 70/322 (21%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E K V V IYEPPQ +EV + L +
Sbjct: 277 INSLLDFWRKSGAQRLGFLYGTYEEYKEVPLGVKAVVQAIYEPPQ--VDEVDGVTLHEWP 334
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A G+++VG IFT + + D Y LS+ E+ A A+
Sbjct: 335 NEKEVDEVAHLCGLERVGVIFTDLLDAGRGDGSVICKRHIDSYYLSSLEIAFAARMQAQH 394
Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG AI A+Q S V + + E
Sbjct: 395 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 450
Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
+E DD P++ K+ + V ++ K +L+V L H P
Sbjct: 451 VMLVQSEDDDTENKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEYLLVT--LTHGFPTVA 508
Query: 330 ----LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
S+FPIENR +Q + K + ++ +SDFH+L FL N
Sbjct: 509 SPLFTDSSFPIENREVIGESQELVHVAKKLVTHGDPDKAIQAVSDFHMLCFLHSLSTFNK 568
Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
D AL V Q A +G +L+
Sbjct: 569 DEEALLCRVATQRAPADGLQLI 590
>gi|317025379|ref|XP_001388958.2| nuclear protein localization protein 4 [Aspergillus niger CBS
513.88]
Length = 606
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 131/322 (40%), Gaps = 70/322 (21%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E K V V IYEPPQ +EV + L +
Sbjct: 277 INSLLDFWRKSGAQRLGFLYGTYEEYKEVPLGVKAVVQAIYEPPQ--VDEVDGVTLHEWP 334
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A G++++G IFT + + D Y LS+ E+ A A+
Sbjct: 335 NEKEVDEVAHLCGLERIGVIFTDLLDAGRGDGSVICKRHIDSYYLSSLEIAFAARMQAQH 394
Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG AI A+Q S V + + E
Sbjct: 395 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 450
Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
+E DD P++ K+ + V ++ K +L+V L H P
Sbjct: 451 VMLVQSEDDDTENKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEYLLVT--LTHGFPTDA 508
Query: 330 ----LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
S+FPIENR +Q + K + ++ +SDFH+L FL N
Sbjct: 509 SPLFTDSSFPIENREVIGESQELVHVAKKLVTHGDPDKAIQAVSDFHMLCFLHSLSTFNK 568
Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
D AL V Q A +G +L+
Sbjct: 569 DEEALLCRVATQRAPADGLQLI 590
>gi|194741672|ref|XP_001953313.1| GF17266 [Drosophila ananassae]
gi|190626372|gb|EDV41896.1| GF17266 [Drosophila ananassae]
Length = 657
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 135/313 (43%), Gaps = 66/313 (21%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNF 172
+ RQ H ++V F+ ++++N +R G++YGT + R +V
Sbjct: 254 LNRQTYRHVDNVMFEN--TKIVERFLNYWRTTGHQRMGYLYGTYEQHADVPLGIRAKVAA 311
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL 232
IYEPPQ+ T + + I DE +++ AA LG+KK+G+IFT I + D ++ + +
Sbjct: 312 IYEPPQESTRDSINIQPDEGADEVEAVAAA-LGLKKIGWIFTDLITE---DASVGTVKQI 367
Query: 233 QAVEFHAECNMEEWVTA----------------------VVKLEVSEEGSAAIHFEAFQM 270
+ +E H +E +TA V + V+ + + +H E + +
Sbjct: 368 RGIETHF-LTAQECITAGELQNRHPNPCKYASNGVFGSKFVTICVTGDNTKQVHMEGYAV 426
Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKEVD 313
S C+ L ++ I D P+L +++ DV KD + +
Sbjct: 427 SAQCMALVRDNCL---IPTKDAPELGYVRESTDKQYVPDVFYKEKDQYGNEVQRLARPLP 483
Query: 314 NDFFLVVVKILDHQGPLSS--------TFPIENRTTQVTMR---ALKSHLNRSPSLPLVK 362
++ LV V P + FPIENR ++ AL +L+ ++
Sbjct: 484 VEYLLVDVPASTPLQPQYTFTEYDKRQPFPIENRYIDGHLQDFNALSCYLSAWGEEEFLE 543
Query: 363 RISDFHLLLFLAR 375
+SDFHLL++L +
Sbjct: 544 AVSDFHLLVYLYK 556
>gi|307193797|gb|EFN76470.1| Nuclear protein localization protein 4-like protein [Harpegnathos
saltator]
Length = 432
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 106/226 (46%), Gaps = 35/226 (15%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
+ RQ H ++V F+ A ++++N ++R G++YG V R +
Sbjct: 206 LNRQTYRHVDNVMFEN--ASLVERFLNYWRGTGLQRIGYLYGRYEVHSDVPLGVRAVIAA 263
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
IYEPPQ+ T++ + +L D E E LV+ + L + +VG+IFT I ++N
Sbjct: 264 IYEPPQESTKDTIRLLPD-EREALVEELGRMLNLWRVGWIFTDLIADDVKKGTVKHVRNI 322
Query: 222 KDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
+ + LS +E + A F + C + + V + V+ + +H E +Q+S+ C
Sbjct: 323 ESHFLSAQECIMAGYFQNQYPNPCRFSPNGYFGSKFVTVCVTGDDKNQVHMEGYQVSNQC 382
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVV 320
+ L ++G + D P+L V+ D + V + F+ ++
Sbjct: 383 MALVRDGCL---VPTKDAPELG-----YVIESTDKQYVPDVFYKLI 420
>gi|190359911|sp|A2Q8R9.1|NPL4_ASPNC RecName: Full=Nuclear protein localization protein 4
gi|134055061|emb|CAK43702.1| unnamed protein product [Aspergillus niger]
Length = 654
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 131/322 (40%), Gaps = 70/322 (21%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E K V V IYEPPQ +EV + L +
Sbjct: 265 INSLLDFWRKSGAQRLGFLYGTYEEYKEVPLGVKAVVQAIYEPPQ--VDEVDGVTLHEWP 322
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A G++++G IFT + + D Y LS+ E+ A A+
Sbjct: 323 NEKEVDEVAHLCGLERIGVIFTDLLDAGRGDGSVICKRHIDSYYLSSLEIAFAARMQAQH 382
Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG AI A+Q S V + + E
Sbjct: 383 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 438
Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
+E DD P++ K+ + V ++ K +L+V L H P
Sbjct: 439 VMLVQSEDDDTENKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEYLLVT--LTHGFPTDA 496
Query: 330 ----LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
S+FPIENR +Q + K + ++ +SDFH+L FL N
Sbjct: 497 SPLFTDSSFPIENREVIGESQELVHVAKKLVTHGDPDKAIQAVSDFHMLCFLHSLSTFNK 556
Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
D AL V Q A +G +L+
Sbjct: 557 DEEALLCRVATQRAPADGLQLI 578
>gi|407847213|gb|EKG03039.1| hypothetical protein TCSYLVIO_005923 [Trypanosoma cruzi]
Length = 393
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 122/278 (43%), Gaps = 31/278 (11%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLED-KRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVD 198
FQ +V + L ++++R G +YG +EV+ IYEP Q G + L D+ VD
Sbjct: 104 FQSFVRQ-LDWSLQRYGILYGKYDASLNLIEVHAIYEPEQHGDAYTFHYLPDKRLPD-VD 161
Query: 199 AIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL----QAVEFHAECNM----EEWVTAV 250
IAA LG+++VG + T ++ ++ LS+RE+L + F EC + T
Sbjct: 162 RIAALLGLRRVGAVCTHP-PRDVEEMVLSSREILLCAREQSRFGDECVLLTMSPSLTTGR 220
Query: 251 VKLEVSEEGSAAIHF--------EAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDV 302
++ + + A+HF ++ SD+ + E+A+ + + +
Sbjct: 221 IECQAWQTSPQAVHFYRLGVLREKSDNYSDLESAKYVHSSIPLEVAQQETDEKGHPR--- 277
Query: 303 VVGGKDVKEVDNDFFLVVVKILDHQGPLSSTF------PIENRTTQVTMRALKSHLNRSP 356
VV ++D +F V + + P+ P + +R R
Sbjct: 278 VVTRAPSHDIDTRWFTSYVAVQQFESPVVRNLFMRVSRPGMEPPAMINLRNYMEDPKRR- 336
Query: 357 SLPLVKRISDFHLLLFLAR-FLDLNSDVPALAQCVQAQ 393
+PL+++++DFH+L+FLA + D+P + + Q
Sbjct: 337 KVPLIEKLADFHVLIFLAESVFSVADDMPVIIGAITKQ 374
>gi|154280186|ref|XP_001540906.1| hypothetical protein HCAG_04746 [Ajellomyces capsulatus NAm1]
gi|190359909|sp|A6R538.1|NPL4_AJECN RecName: Full=Nuclear protein localization protein 4
gi|150412849|gb|EDN08236.1| hypothetical protein HCAG_04746 [Ajellomyces capsulatus NAm1]
Length = 642
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 89/315 (28%), Positives = 126/315 (40%), Gaps = 55/315 (17%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L +
Sbjct: 278 INSLLDFWRKSGTQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQVGEVDGV-TLHEWGN 336
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAEC-NMEEWV----- 247
EK VD +A G++K+G IFT + D E++ A + A +W
Sbjct: 337 EKDVDEVAKFCGLEKIGVIFTDLLDAGAGD------EIVFAAQLQARYPKATKWSETGRF 390
Query: 248 -TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG---------DDPKLSK 297
+ V +S + AI A+Q S+ V + K E G D SK
Sbjct: 391 GSNFVTCVLSGDEDGAISISAYQASNSAVEMVKADIIEPSADPGVMLVQQENHSDDDASK 450
Query: 298 MK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL-------SSTFPIENR-- 339
+ + V G +V+E F V ++ L H P +STFPIENR
Sbjct: 451 ARYIPEVFYRKVNEYGANVQENAKPSFPVEYLLVTLTHGFPTDPDVMFNNSTFPIENREV 510
Query: 340 ---TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAV 396
+ + M A K + P+ + +SDFHLL FL LN D L V
Sbjct: 511 IGESQDLRMLADKLVSHGDPNKTICG-VSDFHLLCFLNSLGILNKDEEFLLCTVARSHDT 569
Query: 397 PEGYKLLINSMAETV 411
+G +L+ S T+
Sbjct: 570 ADGMQLINTSGWATL 584
>gi|71408582|ref|XP_806685.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870504|gb|EAN84834.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 393
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 122/278 (43%), Gaps = 31/278 (11%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLED-KRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVD 198
FQ +V + L ++++R G +YG +EV+ IYEP Q G + L D+ VD
Sbjct: 104 FQSFVRQ-LDWSLQRYGILYGKYDASLNLIEVHAIYEPEQHGDAYTFHYLPDKRLPN-VD 161
Query: 199 AIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL----QAVEFHAECNM----EEWVTAV 250
IAA LG+++VG + T ++ ++ LS+RE+L + F EC + T
Sbjct: 162 RIAALLGLRRVGAVCTHP-PRDIEEMVLSSREILLCAREQSRFGDECVLLTMSPSLTTGR 220
Query: 251 VKLEVSEEGSAAIHF--------EAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDV 302
++ + + A+HF ++ SD+ + E+A+ + + +
Sbjct: 221 IECQAWQTSPQAVHFYRLGVLREKSDDYSDLESAKYVHSSIPLEVAQQETDEKGHPR--- 277
Query: 303 VVGGKDVKEVDNDFFLVVVKILDHQGPLSSTF------PIENRTTQVTMRALKSHLNRSP 356
VV ++D +F V + + P+ P + +R R
Sbjct: 278 VVTMAPSHDIDTRWFTSYVAVQQFESPVVRNLFMRVSRPGMEPPAMINLRNYMEDPKRR- 336
Query: 357 SLPLVKRISDFHLLLFLAR-FLDLNSDVPALAQCVQAQ 393
+PL+++++DFH+L+FLA + D+P + + Q
Sbjct: 337 KVPLIEKLADFHVLIFLAESVFSVADDMPVIIGAITKQ 374
>gi|391343438|ref|XP_003746017.1| PREDICTED: nuclear protein localization protein 4 homolog
[Metaseiulus occidentalis]
Length = 600
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 142/337 (42%), Gaps = 78/337 (23%)
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFA----VKRGGFMYGTVLEDKRV------EV 170
+Q+ H ++V F+ Q VN L F ++R G + G V V
Sbjct: 216 QQKYRHVDAVLFEN------SQIVNRFLDFWRQSLLQRVGILLGRYETYNEVPLGICARV 269
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
IYEPPQ + +L D EE VD + A LG+++VG+IF + + + T+
Sbjct: 270 AAIYEPPQNSARDGFELLDDPNEET-VDELCAKLGLRRVGWIFCDLLPLDVQKGTV---H 325
Query: 231 VLQAVEFHAECNMEEWVTAV----------------------VKLEVSEEGSAAIHFEAF 268
L+ V+F + +E +TA V + V+ + + IH EA+
Sbjct: 326 YLRDVDFSYLLSAQECITAAALQNKYPNPCKLSSDGYSGSKFVTVCVTGDKNNQIHMEAY 385
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD---------VKE 311
Q+S+ C+ L K+ I D P+L+ +K+ DV KD +
Sbjct: 386 QVSNQCMALVKDDCL---IPTQDAPELAWVKESTPKKFVPDVFYKTKDRYGLEVQKVARP 442
Query: 312 VDNDFFLV---------VVKILDHQGPLSSTFPIENR--TTQVT-MRALKSHL-NRSPSL 358
+ +F L+ V +GP F +ENR QV + A+ +L N + S
Sbjct: 443 LPVEFLLIDLPVFSPNDSVNTFRSEGP---HFTVENRAFANQVQDLPAVNQYLRNFTASK 499
Query: 359 PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
L+ DFH LL+LA ++ ++ L + V+ + A
Sbjct: 500 DLLAACRDFHFLLYLATLDVVHLEMDKLLEAVKTKNA 536
>gi|357528824|sp|Q5BGN5.2|NPL4_EMENI RecName: Full=Nuclear protein localization protein 4
gi|259489478|tpe|CBF89782.1| TPA: Nuclear protein localization protein 4
[Source:UniProtKB/Swiss-Prot;Acc:Q5BGN5] [Aspergillus
nidulans FGSC A4]
Length = 652
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 97/330 (29%), Positives = 130/330 (39%), Gaps = 82/330 (24%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G G++YGT E V V IYEPPQ +EV + L + E
Sbjct: 276 INSLLDFWRKSGTQRLGYLYGTYEEYDEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWE 333
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFH-AE 240
EK VD IA G++KVG IFT + + D Y LS+ E+ A +
Sbjct: 334 NEKDVDEIARLCGLEKVGVIFTDLLDAGRGDGSVLCKRHIDSYYLSSLEIAFASRLQMQQ 393
Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
W VT V L EEG AI ++Q S V + + E
Sbjct: 394 PKATRWSRTGYFGSNFVTCV--LSGDEEG--AITISSYQASVAAVEMIRADIIEPSAEPS 449
Query: 288 -----AEGDDPKLSKMKKDVV------------VGGKDVKEVDNDFFLVVVKILDHQGP- 329
+E DD S+ +V V K V+ +L+V L H P
Sbjct: 450 VMLVQSEDDDTNKSRYIPEVFYRKINEYGVSAQVNAKPAFPVE---YLLVT--LTHGFPT 504
Query: 330 ------LSSTFPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARF 376
+ ST+PIENR R L SH + + ++ +SDFHLL FL F
Sbjct: 505 ESKPLFIDSTYPIENREVIGEGQEFHSLARKLVSHGDPQKA---IRAVSDFHLLCFLRTF 561
Query: 377 LDLNSDVPALAQCVQAQTAVPEGYKLLINS 406
+ + AL C A T P LIN+
Sbjct: 562 STFSKEEEALL-CRVATTQNPTDGLQLINT 590
>gi|67516027|ref|XP_657899.1| hypothetical protein AN0295.2 [Aspergillus nidulans FGSC A4]
gi|40746545|gb|EAA65701.1| hypothetical protein AN0295.2 [Aspergillus nidulans FGSC A4]
Length = 649
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 97/330 (29%), Positives = 130/330 (39%), Gaps = 82/330 (24%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G G++YGT E V V IYEPPQ +EV + L + E
Sbjct: 273 INSLLDFWRKSGTQRLGYLYGTYEEYDEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWE 330
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFH-AE 240
EK VD IA G++KVG IFT + + D Y LS+ E+ A +
Sbjct: 331 NEKDVDEIARLCGLEKVGVIFTDLLDAGRGDGSVLCKRHIDSYYLSSLEIAFASRLQMQQ 390
Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
W VT V L EEG AI ++Q S V + + E
Sbjct: 391 PKATRWSRTGYFGSNFVTCV--LSGDEEG--AITISSYQASVAAVEMIRADIIEPSAEPS 446
Query: 288 -----AEGDDPKLSKMKKDVV------------VGGKDVKEVDNDFFLVVVKILDHQGP- 329
+E DD S+ +V V K V+ +L+V L H P
Sbjct: 447 VMLVQSEDDDTNKSRYIPEVFYRKINEYGVSAQVNAKPAFPVE---YLLVT--LTHGFPT 501
Query: 330 ------LSSTFPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARF 376
+ ST+PIENR R L SH + + ++ +SDFHLL FL F
Sbjct: 502 ESKPLFIDSTYPIENREVIGEGQEFHSLARKLVSHGDPQKA---IRAVSDFHLLCFLRTF 558
Query: 377 LDLNSDVPALAQCVQAQTAVPEGYKLLINS 406
+ + AL C A T P LIN+
Sbjct: 559 STFSKEEEALL-CRVATTQNPTDGLQLINT 587
>gi|149245254|ref|XP_001527151.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|190359912|sp|A5DX93.1|NPL4_LODEL RecName: Full=Nuclear protein localization protein 4
gi|146449545|gb|EDK43801.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 600
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/161 (30%), Positives = 80/161 (49%), Gaps = 25/161 (15%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKL 196
++N + V+R G++YG + +V V IYEPPQ + L +L D E+E +
Sbjct: 276 FINVWRSTGVQRFGYLYGRYEKFDKVPMGIKAVVEAIYEPPQHDELDGLTLL-DWEDEPI 334
Query: 197 VDAIAAGLGMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAVEFHAE----- 240
VDAIAA LG++KVG +FT T++ ++K Y L+N E++ A +F +
Sbjct: 335 VDAIAAKLGLQKVGIVFTDLTDSGNRDGTVLCKRHKDSYFLTNLEIIMAAKFQIKNPNIT 394
Query: 241 --CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFK 279
N+ E+ + V +S I ++Q+S L K
Sbjct: 395 KYANLGEFSSKFVTCVISGGLQGEIEPRSYQVSSSAEGLVK 435
>gi|327296223|ref|XP_003232806.1| nuclear protein localization protein 4 [Trichophyton rubrum CBS
118892]
gi|326465117|gb|EGD90570.1| nuclear protein localization protein 4 [Trichophyton rubrum CBS
118892]
Length = 652
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/321 (26%), Positives = 128/321 (39%), Gaps = 67/321 (20%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV + L +
Sbjct: 278 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWN 335
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A GM+K+G IFT I D Y LS+ EV+ A + A
Sbjct: 336 NEKDVDHVAKLCGMEKIGVIFTDLIDAGAGDGTVVCKRHIDSYYLSSLEVVFASQLQARN 395
Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
C E + + V +S + + AI A+Q S+ V + + E DP
Sbjct: 396 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 451
Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
+ ++++ + ++ + F V + ++ P
Sbjct: 452 VMLVQQEDELNNANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 511
Query: 330 ---LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
++FPIENR +Q R + S ++K +S+FHLL FL LN D
Sbjct: 512 TVFTDTSFPIENREFIGESQDLRRVASKLTDSSDPNTIIKAVSNFHLLTFLHGMGILNED 571
Query: 383 VPALAQCVQAQTAVPEGYKLL 403
AL V + +G +LL
Sbjct: 572 EEALLCTVATKHDPADGIQLL 592
>gi|326477582|gb|EGE01592.1| nuclear protein localization protein 4 [Trichophyton equinum CBS
127.97]
Length = 652
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/321 (26%), Positives = 127/321 (39%), Gaps = 67/321 (20%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV + L +
Sbjct: 278 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWN 335
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A GM+K+G IFT I D Y LS+ EV+ A + A
Sbjct: 336 NEKDVDYVAKLCGMEKIGVIFTDLIDAGAGDGTVVCKRHIDSYYLSSLEVVFASQLQARN 395
Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
C E + + V +S + + AI A+Q S+ V + + E DP
Sbjct: 396 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 451
Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
+ ++++ + ++ + F V + ++ P
Sbjct: 452 IMLVQQEDELDNANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 511
Query: 330 ---LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
+FPIENR +Q R + S ++K +S+FHLL FL LN D
Sbjct: 512 TVFTDPSFPIENREIIGESQDLRRVASKLTDSSDPNAVIKAVSNFHLLTFLHGMGILNED 571
Query: 383 VPALAQCVQAQTAVPEGYKLL 403
AL V + +G +LL
Sbjct: 572 EEALLCAVATKHDPADGIQLL 592
>gi|302503091|ref|XP_003013506.1| hypothetical protein ARB_00324 [Arthroderma benhamiae CBS 112371]
gi|291177070|gb|EFE32866.1| hypothetical protein ARB_00324 [Arthroderma benhamiae CBS 112371]
Length = 652
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 89/322 (27%), Positives = 131/322 (40%), Gaps = 69/322 (21%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV + L +
Sbjct: 278 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWN 335
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD IA GM+K+G IFT I D Y LS+ EV+ A + A
Sbjct: 336 NEKDVDHIAKLCGMEKIGVIFTDLIDAGAGDGTVVCKRHIDSYYLSSLEVVFASQLQARN 395
Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
C E + + V +S + + AI A+Q S+ V + + E DP
Sbjct: 396 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 451
Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
+ ++++ + ++ + F V + ++ P
Sbjct: 452 VMLVQQEDELDNANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 511
Query: 330 ---LSSTFPIENRTT---QVTMRALKSHLNRSPSLP--LVKRISDFHLLLFLARFLDLNS 381
S+FPIENR +R + S L S S P ++K +S+FHLL FL L+
Sbjct: 512 TVFTDSSFPIENREVIGESQDLRRVASKLTDS-SDPNAVIKAVSNFHLLTFLHGMGILSE 570
Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
D AL V + +G +LL
Sbjct: 571 DEEALLCTVATKHDPADGIQLL 592
>gi|452001809|gb|EMD94268.1| hypothetical protein COCHEDRAFT_1028190 [Cochliobolus
heterostrophus C5]
Length = 1071
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 86/324 (26%), Positives = 137/324 (42%), Gaps = 66/324 (20%)
Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRD 190
Q V++ + F +R GF+YG E V V IYEPPQ EV I L D
Sbjct: 259 QVVDDLINFWRQTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQ--INEVDGISLGD 316
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHA 239
+ E+++D IAA G+++VG IFT + +K D Y LS+ EV A + A
Sbjct: 317 WDNEQVIDDIAAQCGLQRVGVIFTDLLDADKGDGSVICKRHIDSYYLSSLEVCFAARYQA 376
Query: 240 EC-NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI----- 287
+ +W + V +S + I ++Q+S+ V + + E
Sbjct: 377 KYPRPSKWSDTGRHGSNFVTCVISGDDQGQIGISSYQVSNDAVEMVRADIIEPSAEPKVM 436
Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP------ 329
+E D+ L++ + + + G +V+E+ F V ++ L H P
Sbjct: 437 LVQSEEDNEALNRTRYIPEVFYRKINEHGANVQEIAKPDFPVEYLLLTLTHGFPTQPNPL 496
Query: 330 -LSSTFPIENRTTQVTMRALKSHLN-----RSPSLPL-----VKRISDFHLLLFLARFLD 378
FPIENR M + +HL+ ++ L L +K IS+FH+L F+
Sbjct: 497 FTGGKFPIENREAMGEMPEI-THLSKALNAKANGLALNTASGLKAISNFHMLCFIHNLGI 555
Query: 379 LNSDVPALAQCVQAQTAVPEGYKL 402
L+ D +L V + EG L
Sbjct: 556 LSKDEESLLFKVASTHDTSEGSAL 579
>gi|451849988|gb|EMD63291.1| hypothetical protein COCSADRAFT_161800 [Cochliobolus sativus
ND90Pr]
Length = 655
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 86/324 (26%), Positives = 137/324 (42%), Gaps = 66/324 (20%)
Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRD 190
Q V++ + F +R GF+YG E V V IYEPPQ EV I L D
Sbjct: 259 QVVDDLINFWRQTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQ--INEVDGISLGD 316
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHA 239
+ E+++D IAA G+++VG IFT + +K D Y LS+ EV A + A
Sbjct: 317 WDNEQVIDDIAAQCGLQRVGVIFTDLLDADKGDGSVICKRHIDSYYLSSLEVCFAARYQA 376
Query: 240 EC-NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI----- 287
+ +W + V +S + I ++Q+S+ V + + E
Sbjct: 377 KYPRPSKWSDTGRHGSNFVTCVISGDDQGQIGISSYQVSNDAVEMVRADIIEPSAEPKVM 436
Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP------ 329
+E D+ L++ + + + G +V+E+ F V ++ L H P
Sbjct: 437 LVQSEEDNEALNRTRYIPEVFYRKINEHGANVQEIAKPDFPVEYLLLTLTHGFPTQPNPL 496
Query: 330 -LSSTFPIENRTTQVTMRALKSHLN-----RSPSLPL-----VKRISDFHLLLFLARFLD 378
FPIENR M + +HL+ ++ L L +K IS+FH+L F+
Sbjct: 497 FTGGKFPIENREAMGEMPEI-THLSKALNAKANGLALNTASGLKAISNFHMLCFIHNLGI 555
Query: 379 LNSDVPALAQCVQAQTAVPEGYKL 402
L+ D +L V + EG L
Sbjct: 556 LSKDEESLLFKVASTHDTSEGSAL 579
>gi|326474161|gb|EGD98170.1| nuclear protein localization protein 4 [Trichophyton tonsurans CBS
112818]
Length = 641
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/321 (26%), Positives = 127/321 (39%), Gaps = 67/321 (20%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV + L +
Sbjct: 267 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWN 324
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A GM+K+G IFT I D Y LS+ EV+ A + A
Sbjct: 325 NEKDVDYVAKLCGMEKIGVIFTDLIDAGAGDGTVVCKRHIDSYYLSSLEVVFASQLQARN 384
Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
C E + + V +S + + AI A+Q S+ V + + E DP
Sbjct: 385 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 440
Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
+ ++++ + ++ + F V + ++ P
Sbjct: 441 VMLVQQEDELDNANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 500
Query: 330 ---LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
+FPIENR +Q R + S ++K +S+FHLL FL LN D
Sbjct: 501 TVFTDPSFPIENREIIGESQDLRRVASKLTDSSDPNAVIKAVSNFHLLTFLHGMGILNED 560
Query: 383 VPALAQCVQAQTAVPEGYKLL 403
AL V + +G +LL
Sbjct: 561 EEALLCAVATKHDPADGIQLL 581
>gi|218511719|sp|Q6BRJ9.2|NPL4_DEBHA RecName: Full=Nuclear protein localization protein 4
Length = 566
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 121/287 (42%), Gaps = 47/287 (16%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
++++ V+R G +YG + +V V IYEPPQ G + L +L E
Sbjct: 246 LNKFIDSWRTTGVQRFGILYGRYEQFDKVPLGIKAVVEAIYEPPQSGELDGLTLL-PWEN 304
Query: 194 EKLVDAIAAGLGMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAVEFHA--- 239
E+ VD IAA LG+ KVG FT T++ ++K Y LS EV+ A ++
Sbjct: 305 ERQVDEIAASLGLYKVGMTFTDLTDSGAKNGTVLCKRHKNSYFLSCLEVIMAAKYQVSNP 364
Query: 240 ----ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE----GWFETEIAEGD 291
N + + V +S + I ++Q+S+ L K G + + +
Sbjct: 365 NITKHSNSGRFSSKFVTCVISGGMNGEIEPRSYQVSNSAEALIKADIITGSTQPSMLYIN 424
Query: 292 DPKLSKMKKDVVVG-----GKDVKEVDN-----DFFLVVVK---ILDHQGPLSSTFPIEN 338
D + + DV G +VK +F LV + LD FPIEN
Sbjct: 425 DSEGKRYVPDVFYSKINEYGLEVKTNAKPAFPVEFLLVSLSDSFPLDPSPMFRENFPIEN 484
Query: 339 RTTQVTMRALKS---HLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
R ++ LKS +LN P ++ DFH L ++A+ L+ D
Sbjct: 485 RDFMGDLQDLKSAYNYLNADPG--DGSQLFDFHFLTYIAKTGILSHD 529
>gi|440640577|gb|ELR10496.1| nuclear protein localization protein 4, variant [Geomyces
destructans 20631-21]
Length = 612
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 124/314 (39%), Gaps = 67/314 (21%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G+MYG E +V V IYEPPQ +E+ I L + EK VD +A G
Sbjct: 287 QRLGYMYGRYEEYTKVPLGIKAVVEAIYEPPQH--DEIDGITLNEWTNEKDVDGVARLCG 344
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
++KVG I+T + D Y LS+ E+ A+ + +
Sbjct: 345 LQKVGVIWTDLLDSGGGDGSVVCKRHIDSYFLSSLEIAFISRLQAQHPKPTKWSDTGRFG 404
Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG---------DDPKLSKM 298
+ V V+ + I A+Q S+ V + + E G DD +S+
Sbjct: 405 SNFVTCVVTGDEQGQISISAYQASNSAVEMIRADIVEPSADPGVMIVWAEEEDDASVSRT 464
Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP---------LSSTFPIENRT 340
+ + + G V+E F V ++ L H P S FP+ENR
Sbjct: 465 RYIPEVFFRKINEYGAAVQENAKPSFPVEYLLVTLTHGFPQEPKPAFTAPPSAFPVENRE 524
Query: 341 TQVTMRALKS-----------HLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
+ KS L RS + ++ +SDFHLL FL LN D AL
Sbjct: 525 AIGESQEYKSVSQAIGLRPDGQLKRSDTG--IEAVSDFHLLCFLRNMAILNKDEEALLCR 582
Query: 390 VQAQTAVPEGYKLL 403
V Q + +GY+LL
Sbjct: 583 VATQHDLADGYQLL 596
>gi|294656853|ref|XP_459171.2| DEHA2D15818p [Debaryomyces hansenii CBS767]
gi|199431790|emb|CAG87342.2| DEHA2D15818p [Debaryomyces hansenii CBS767]
Length = 576
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 121/287 (42%), Gaps = 47/287 (16%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
++++ V+R G +YG + +V V IYEPPQ G + L +L E
Sbjct: 256 LNKFIDSWRTTGVQRFGILYGRYEQFDKVPLGIKAVVEAIYEPPQSGELDGLTLL-PWEN 314
Query: 194 EKLVDAIAAGLGMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAVEFHA--- 239
E+ VD IAA LG+ KVG FT T++ ++K Y LS EV+ A ++
Sbjct: 315 ERQVDEIAASLGLYKVGMTFTDLTDSGAKNGTVLCKRHKNSYFLSCLEVIMAAKYQVSNP 374
Query: 240 ----ECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE----GWFETEIAEGD 291
N + + V +S + I ++Q+S+ L K G + + +
Sbjct: 375 NITKHSNSGRFSSKFVTCVISGGMNGEIEPRSYQVSNSAEALIKADIITGSTQPSMLYIN 434
Query: 292 DPKLSKMKKDVVVG-----GKDVKEVDN-----DFFLVVVK---ILDHQGPLSSTFPIEN 338
D + + DV G +VK +F LV + LD FPIEN
Sbjct: 435 DSEGKRYVPDVFYSKINEYGLEVKTNAKPAFPVEFLLVSLSDSFPLDPSPMFRENFPIEN 494
Query: 339 RTTQVTMRALKS---HLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
R ++ LKS +LN P ++ DFH L ++A+ L+ D
Sbjct: 495 RDFMGDLQDLKSAYNYLNADPG--DGSQLFDFHFLTYIAKTGILSHD 539
>gi|315051616|ref|XP_003175182.1| NPL4 [Arthroderma gypseum CBS 118893]
gi|311340497|gb|EFQ99699.1| NPL4 [Arthroderma gypseum CBS 118893]
Length = 652
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 86/322 (26%), Positives = 131/322 (40%), Gaps = 69/322 (21%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV + L + E
Sbjct: 278 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWE 335
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A GM+K+G IFT I D Y LS+ EV+ A +F A
Sbjct: 336 NEKDVDHVAKLCGMQKIGVIFTDLIDAGAGDGTVICKRHIDSYYLSSLEVVFASQFQARN 395
Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
C E + + V +S + + AI A+Q S+ V + + E DP
Sbjct: 396 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 451
Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
+ ++++ + ++ + F V + ++ P
Sbjct: 452 VMLVQQEDELNDANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 511
Query: 330 ---LSSTFPIENRTT---QVTMRALKSHLNRS--PSLPLVKRISDFHLLLFLARFLDLNS 381
S+FPIENR +R + + L S P + +K +S+FHLL FL L+
Sbjct: 512 TVFTDSSFPIENREIIGESQDLRRVATKLTGSNNPDV-AIKAVSNFHLLTFLHGMGILSQ 570
Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
D L V + +G +L+
Sbjct: 571 DEETLLCTVATKHDPADGIQLI 592
>gi|212539570|ref|XP_002149940.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Talaromyces marneffei ATCC 18224]
gi|210067239|gb|EEA21331.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Talaromyces marneffei ATCC 18224]
Length = 606
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 139/329 (42%), Gaps = 67/329 (20%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G G++YGT E V V IYEPPQ +EV I L +
Sbjct: 276 INGLLDFWRKSGTQRLGYLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGITLNEWT 333
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
E +D +A G+++VG IFT + K D Y LS+ E++ A A+
Sbjct: 334 NEAEIDEVAQLCGLQRVGVIFTDLLDTGKGDGSVICKRHADSYYLSSLEIVFASRQQAQH 393
Query: 241 CNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------- 287
+W + V +S + + AI ++Q S+ V + + E
Sbjct: 394 PKATKWSETGRFGSNFVTCVLSGDENGAIAVSSYQASNSAVEMVRADIIEPSADPSVMLV 453
Query: 288 -AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------L 330
+E DD SK++ + + G +V+E F V ++ L H P +
Sbjct: 454 QSEDDDDLGSKLRYIPEVFYRRINEYGANVQENAKPSFPVEYLLVTLTHGFPTEESPTFV 513
Query: 331 SSTFPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDV 383
STFPIENR + + L +H + + + ++ +SDFHLL FL N++
Sbjct: 514 DSTFPIENREVIGESQELRTLAKKLVAHGDPNKA---IRAVSDFHLLAFLRGLGTFNAEE 570
Query: 384 PALAQCVQAQTAVP-EGYKLLINSMAETV 411
L C A T P EG +L+ S T+
Sbjct: 571 EKLL-CRVATTHDPAEGMQLVNTSGWATL 598
>gi|440640578|gb|ELR10497.1| nuclear protein localization protein 4 [Geomyces destructans
20631-21]
Length = 640
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 124/314 (39%), Gaps = 67/314 (21%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G+MYG E +V V IYEPPQ +E+ I L + EK VD +A G
Sbjct: 287 QRLGYMYGRYEEYTKVPLGIKAVVEAIYEPPQH--DEIDGITLNEWTNEKDVDGVARLCG 344
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
++KVG I+T + D Y LS+ E+ A+ + +
Sbjct: 345 LQKVGVIWTDLLDSGGGDGSVVCKRHIDSYFLSSLEIAFISRLQAQHPKPTKWSDTGRFG 404
Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG---------DDPKLSKM 298
+ V V+ + I A+Q S+ V + + E G DD +S+
Sbjct: 405 SNFVTCVVTGDEQGQISISAYQASNSAVEMIRADIVEPSADPGVMIVWAEEEDDASVSRT 464
Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP---------LSSTFPIENRT 340
+ + + G V+E F V ++ L H P S FP+ENR
Sbjct: 465 RYIPEVFFRKINEYGAAVQENAKPSFPVEYLLVTLTHGFPQEPKPAFTAPPSAFPVENRE 524
Query: 341 TQVTMRALKS-----------HLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
+ KS L RS + ++ +SDFHLL FL LN D AL
Sbjct: 525 AIGESQEYKSVSQAIGLRPDGQLKRSDTG--IEAVSDFHLLCFLRNMAILNKDEEALLCR 582
Query: 390 VQAQTAVPEGYKLL 403
V Q + +GY+LL
Sbjct: 583 VATQHDLADGYQLL 596
>gi|198418032|ref|XP_002123974.1| PREDICTED: similar to Nuclear protein localization protein 4
homolog (Protein NPL4) [Ciona intestinalis]
Length = 565
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 80/323 (24%), Positives = 142/323 (43%), Gaps = 59/323 (18%)
Query: 102 SFGRKMTMDDLIAKQMRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYG 160
S +K ++ DL + + + RQ+ H +++ F +R+ D F Y ++ +R G++YG
Sbjct: 176 SNSKKHSVVDLPS-AVTLNRQKFRHVDNIMFENREIVDRFLNYWRKS---GNQRMGYLYG 231
Query: 161 TVLEDKRV------EVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFT 214
V EV IYEPPQ+ T+ L D EE V+ IA L ++KVG+I +
Sbjct: 232 RYSPHSEVPLGIKAEVCAIYEPPQESTKNWLQFHPDPNEEA-VEYIAKKLNLRKVGWILS 290
Query: 215 QTIMQNKKDYTLSN-----------REVLQAVEFHAE----CNME---EWVTAVVKLEVS 256
++++ T+ N E + A F C + ++ + + VS
Sbjct: 291 DLTVEDRSKGTVKNTRHSDTHFLTAEECITAATFQNNYPNPCKLASDGKYGSKFATVIVS 350
Query: 257 EEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--------DVVVGGKD 308
+ I FE +Q+S+ + L + E + D P+L +++ DV KD
Sbjct: 351 GDNDCQISFEGYQVSNQGMSLVAD---ECLLPTLDAPELGYIRESSPELFVSDVYYKEKD 407
Query: 309 ---------VKEVDNDFFLVVVKI---LDHQGPLSST---FPIENRTTQVTMRALKS--- 350
+ + ++ L V L+ Q S+ FP+ R T +++L S
Sbjct: 408 KYGNEITKLARPLPLEYLLTDVPTAFSLNSQYSCSNAVAGFPVPYRDTIGEVQSLSSLSK 467
Query: 351 HLNRSPSLPLVKRISDFHLLLFL 373
+LN+ V +SDF+LL++L
Sbjct: 468 YLNQFKPDQFVDAVSDFNLLVYL 490
>gi|242804178|ref|XP_002484322.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Talaromyces stipitatus ATCC 10500]
gi|218717667|gb|EED17088.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Talaromyces stipitatus ATCC 10500]
Length = 607
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 87/328 (26%), Positives = 136/328 (41%), Gaps = 65/328 (19%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G G++YGT E V V IYEPPQ +EV I L +
Sbjct: 277 INGLLDFWRKSGAQRLGYLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGITLNEWT 334
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
E +D +A G+++VG IFT + K D Y LS+ E++ A A+
Sbjct: 335 NEAEIDEVARLCGLQRVGVIFTDLLDAGKGDGTVVCKRHVDSYYLSSLEIVFASRQQAQH 394
Query: 241 CNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------- 287
+W + V +S + + AI ++Q S+ V + + E
Sbjct: 395 PKATKWSETGRFGSNFVTCVLSGDENGAITVSSYQASNSAVEMVRADIIEPSADPSVMLV 454
Query: 288 -AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------L 330
+E DD SK++ + + G +V+E F V ++ L H P +
Sbjct: 455 QSEDDDDPGSKLRYIPEVFYRRINEYGANVQENAKPSFPVEYLLVTLTHGFPTEESPMFI 514
Query: 331 SSTFPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDV 383
STFPIENR + + L +H + + + + +SDFHLL FL N++
Sbjct: 515 DSTFPIENREVIGESQELRTLAKKLVAHGDPNKA---ISAVSDFHLLAFLRSLGTFNAEE 571
Query: 384 PALAQCVQAQTAVPEGYKLLINSMAETV 411
L V + EG +L+ S T+
Sbjct: 572 ERLLYRVASTHDPAEGMQLVNTSGWATL 599
>gi|407929695|gb|EKG22507.1| NPL4 zinc-binding putative, partial [Macrophomina phaseolina MS6]
Length = 598
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 90/326 (27%), Positives = 133/326 (40%), Gaps = 74/326 (22%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YG E V V IYEPPQ G + + L D +
Sbjct: 264 INTLLDFWRKSGCQRLGFLYGRYEEYTEVPLGTKAVVEAIYEPPQVGEVDGIS-LNDWDN 322
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-C 241
EK +D +A G+++VG IFT + D Y LS+ E A + +
Sbjct: 323 EKEIDEVARMCGLERVGVIFTDLLDAGAGDGSVVCKRHADSYYLSSLETCFAARYQGKFP 382
Query: 242 NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---- 287
+W VT +V + EEG I A+Q S+ V + + E
Sbjct: 383 RPSKWSETGRFGSNFVTCIVSGD--EEGQIGI--SAYQASNAAVEMVRADIIEPSADPTM 438
Query: 288 ----AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPL---- 330
E ++ +L + + + V G +V+E F V + L H P
Sbjct: 439 MLVQNEEEESELGRRRYIPEVFYRKVNEYGANVQESAKPAFPVEYLFVTLTHGFPTNPQP 498
Query: 331 ---SSTFPIENR-----TTQVTMRALKSHLNRSPSLPLVK------RISDFHLLLFLARF 376
+++FPIENR + +VT AL LN S P + ++DFHLL F+
Sbjct: 499 KFPNTSFPIENREVLGISQEVT--ALAKQLNAKSSGPKLSDATGLAAVADFHLLTFIHGL 556
Query: 377 LDLNSDVPALAQCVQAQTAVPEGYKL 402
L+ D AL V A+ V +G +L
Sbjct: 557 GILHKDEEALLCRVAAEHNVSDGAQL 582
>gi|330921786|ref|XP_003299562.1| hypothetical protein PTT_10586 [Pyrenophora teres f. teres 0-1]
gi|311326697|gb|EFQ92338.1| hypothetical protein PTT_10586 [Pyrenophora teres f. teres 0-1]
Length = 655
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 133/323 (41%), Gaps = 64/323 (19%)
Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDE 191
Q V++ + F +R GF+YG E V V IYEPPQ + + L +
Sbjct: 259 QVVDDLINFWRQTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQVNEADGIS-LGEW 317
Query: 192 EEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE 240
+ EK +D IAA G+++VG IFT + N D Y L++ EV A + A+
Sbjct: 318 DNEKEIDEIAAQCGLQRVGVIFTDLLDANNGDGSVICKRHMDSYYLASLEVCFAARYQAK 377
Query: 241 C-NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
+W + V +S + I ++Q+S+ V + + E
Sbjct: 378 YPRPSKWSDTGRHGSNFVTCVISGDDKGQIAISSYQVSNDAVEMVRADIIEPSAEPSVML 437
Query: 288 --AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST--- 333
+E DD L + + + + G +V+EV F V ++ L H P +
Sbjct: 438 VQSEEDDEALGRARYIPEVFYRKINEHGANVQEVAKPAFPVEYLLLTLTHGFPTQPSPLF 497
Query: 334 ----FPIENRTTQVTMRALKSHLN-----RSPSLPL-----VKRISDFHLLLFLARFLDL 379
FPIENR + + SHL+ ++ L L + IS+FH+L F+ L
Sbjct: 498 TGGKFPIENREAMGEVPEV-SHLSKALNAKANGLALNTASDLNAISNFHMLCFIHNLGVL 556
Query: 380 NSDVPALAQCVQAQTAVPEGYKL 402
+ D AL V + EG L
Sbjct: 557 SKDEEALLFKVASSHDTSEGSAL 579
>gi|302659770|ref|XP_003021572.1| hypothetical protein TRV_04314 [Trichophyton verrucosum HKI 0517]
gi|291185476|gb|EFE40954.1| hypothetical protein TRV_04314 [Trichophyton verrucosum HKI 0517]
Length = 582
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 85/321 (26%), Positives = 127/321 (39%), Gaps = 67/321 (20%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV + L +
Sbjct: 208 INSLLDFWRKTGSQRIGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGVTLHEWN 265
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD IA GM+K+G IFT I D Y LS+ EV+ A + A
Sbjct: 266 NEKDVDHIAKLCGMEKIGVIFTDLIDAGAGDGTVVCKRHIDSYYLSSLEVVFASQLQARN 325
Query: 241 ---CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
C E + + V +S + + AI A+Q S+ V + + E DP
Sbjct: 326 PKPCKWSETGQFGSNFVTCVLSGDENGAISVCAYQASNSAVEMVRADIIEPSA----DPS 381
Query: 295 LSKMKKDVVVGGKDVKEVDNDFFL-----VVVKILDHQGP-------------------- 329
+ ++++ + ++ + F V + ++ P
Sbjct: 382 VMLVQQEDELDNANITRYIPEVFYRRINEYKVSVQENAKPSFPVEYLFVTLTHGFPTTPT 441
Query: 330 ---LSSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSD 382
+FPIENR +Q R + S ++K +S+FHLL FL L+ D
Sbjct: 442 TVFTDPSFPIENREVIGESQDLRRVASKLTDSSDPNAVIKAVSNFHLLTFLHGMGILSED 501
Query: 383 VPALAQCVQAQTAVPEGYKLL 403
AL V + +G +LL
Sbjct: 502 EEALLCTVATKHDPADGIQLL 522
>gi|195152455|ref|XP_002017152.1| GL21676 [Drosophila persimilis]
gi|194112209|gb|EDW34252.1| GL21676 [Drosophila persimilis]
Length = 443
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 101/226 (44%), Gaps = 47/226 (20%)
Query: 85 SYDGERNVRGPSFNPAGSFGRKMT--MDDLIAKQMRVTRQENPHCESVSFDRDCADAFQQ 142
+Y E N++ SF+ S+ RK T MD R H ++V F+ ++
Sbjct: 192 AYLKENNIKHLSFH---SYIRKQTSGMDHGKYFTYR-------HVDNVMFEN--TKIVER 239
Query: 143 YVNETLAFAVKRGGFMYGT------VLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKL 196
++N +R G++YGT V R V IYEPPQ+ T + + + DE + +
Sbjct: 240 FLNYWRTTGHQRMGYLYGTYEPHTDVPLGIRATVAAIYEPPQESTRDSINMFPDEGDAE- 298
Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTA------- 249
VDA+A LG+KK+G+IFT I D T + ++ +E H +E +TA
Sbjct: 299 VDAVANALGLKKIGWIFTDLITD---DATAGTVKQIRGIETHF-LTAQECITAGELQNRH 354
Query: 250 ---------------VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE 280
V + V+ + + +H E + +S C+ L ++
Sbjct: 355 PNPCKYASNGVFGSKFVTICVTGDQTKQVHMEGYAVSAQCMALVRD 400
>gi|119497211|ref|XP_001265368.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Neosartorya fischeri NRRL 181]
gi|190359914|sp|A1D4X8.1|NPL4_NEOFI RecName: Full=Nuclear protein localization protein 4
gi|119413530|gb|EAW23471.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Neosartorya fischeri NRRL 181]
Length = 652
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 132/323 (40%), Gaps = 72/323 (22%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +E+ I L +
Sbjct: 273 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEIDGITLHEWP 330
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A G++KVG IFT + + D Y LS+ E+ A A+
Sbjct: 331 NEKEVDEVARQCGLEKVGVIFTDLLDAGRGDGSVVCKRHIDSYYLSSLEIAFASRMQAKH 390
Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG AI ++Q S V + + E
Sbjct: 391 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AITVSSYQASISAVEMVRADIVEPSAEPS 446
Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-- 330
+E DD P++ K+ + V ++ K +L+V L H P
Sbjct: 447 VMLVQSEDDDTDNKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEYLLVT--LTHGFPTEA 504
Query: 331 -----SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
+STFPIENR +Q K ++ ++ +SDFHLL FL +
Sbjct: 505 SPMFSASTFPIENREVIGESQELRHVAKKLVSHGDPDKAIREVSDFHLLCFLHSLSMFSK 564
Query: 382 DVPALAQCVQAQTAVP-EGYKLL 403
+ AL C A T P EG KLL
Sbjct: 565 EEEALL-CRVATTHDPTEGLKLL 586
>gi|396471846|ref|XP_003838966.1| hypothetical protein LEMA_P026390.1 [Leptosphaeria maculans JN3]
gi|312215535|emb|CBX95487.1| hypothetical protein LEMA_P026390.1 [Leptosphaeria maculans JN3]
Length = 759
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 133/322 (41%), Gaps = 62/322 (19%)
Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDE 191
Q V++ + F +R GF+YG E V V IYEPPQ ++ + L D
Sbjct: 357 QVVDDLINFWRNTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQVDEQDGIS-LTDW 415
Query: 192 EEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE 240
EK +D IAA G+++VG IFT + + D Y LS+ E++ A + A+
Sbjct: 416 NNEKEIDDIAAQCGLQRVGVIFTDLLDAGQGDGSVVCKRHFDSYYLSSLEIVFAARYQAK 475
Query: 241 -------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
+ ++ + V +S + I ++Q+S+ V + + E
Sbjct: 476 YPRPSKWSDTGKFGSNFVTCVISGDDKGQIGISSYQVSNDAVEMVRADIIEPSAEPSVML 535
Query: 288 --AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP------- 329
+E D+ L++ + + + G +V+E+ F V ++ L H P
Sbjct: 536 VQSEEDNEALNRARYIPEVFYRRINEHGANVQEIAKPDFPVEYLLLTLTHGFPTQPNPLF 595
Query: 330 LSSTFPIENRTTQVTM---RALKSHLN-RSPSLPL-----VKRISDFHLLLFLARFLDLN 380
FPIENR M L LN ++ L L V IS+FHLL F+ L+
Sbjct: 596 TGGKFPIENREIMGEMADVSTLGKTLNAKANGLALNTSSGVNAISNFHLLCFIHNLGILS 655
Query: 381 SDVPALAQCVQAQTAVPEGYKL 402
D +L V + EG L
Sbjct: 656 KDEESLLFKVASTHDTSEGSAL 677
>gi|255951286|ref|XP_002566410.1| Pc22g25230 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211593427|emb|CAP99811.1| Pc22g25230 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 665
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/292 (27%), Positives = 117/292 (40%), Gaps = 69/292 (23%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L D
Sbjct: 269 INSLLNFWRKSGAQRLGFLYGTYEEYAEVPLGVKAVVEAIYEPPQTGEIDGV-TLHDWHN 327
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-- 240
EK VD +A G++KVG IFT I + D Y LS+ E+ A A+
Sbjct: 328 EKEVDEVARLCGLQKVGVIFTDLIDAGRGDGSVVCKRHIDSYYLSSLEIAFAARLQAQNP 387
Query: 241 -----CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI-------- 287
++ + V +S + + AI ++Q S V + + E
Sbjct: 388 KSTKWSRTGQFGSDFVTCVLSGDEAGAISVSSYQASVSAVEMVRGDIVEPSADPTVMLVQ 447
Query: 288 AEGDDPKLSKMK-------KDVVVGGKDVKEVDN-----DFFLVVVKILDHQGPLSST-- 333
+E D+ S+ + + + G V+E DF LV L H P S+
Sbjct: 448 SEHDEDLGSRTRYIPEVFYRRINEYGASVQESAEPSFPVDFLLVT---LTHGFPTESSPL 504
Query: 334 -----FPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
FPIENR + + L SH + + ++ +SDFH+L FL
Sbjct: 505 FTDSRFPIENREVIGESQELRHVAQKLMSHGDPDKA---IQGVSDFHILCFL 553
>gi|238483481|ref|XP_002372979.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Aspergillus flavus NRRL3357]
gi|220701029|gb|EED57367.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Aspergillus flavus NRRL3357]
gi|391871001|gb|EIT80170.1| nuclear pore complex, rNpl4 component [Aspergillus oryzae 3.042]
Length = 603
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 89/322 (27%), Positives = 128/322 (39%), Gaps = 70/322 (21%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +E+ + L +
Sbjct: 273 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGVKAVVQAIYEPPQ--VDEIDGVTLHEWH 330
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
EK VD +A G++KVG IFT + + D Y LS+ E++ A A+
Sbjct: 331 NEKEVDEVARLCGLEKVGVIFTDLLDAGQGDGSVICKRHIDSYFLSSLEIIFAARLQAQY 390
Query: 242 -NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG AI A+Q S V + + E
Sbjct: 391 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 446
Query: 288 -----AEGDDPK----------LSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSS 332
+E DD + K+ + V ++ K +L+V L H P S
Sbjct: 447 VMLVQSEEDDSENKSRYIPEVFYRKINEYGVSAQQNAKPAFPVEYLLVT--LTHGFPTES 504
Query: 333 T-------FPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
+ FPIENR +Q K ++ ++ +SDFHLL FL +
Sbjct: 505 SPLFVEGNFPIENREVIGESQELRHVAKKLVSHGDPDKAIRAVSDFHLLCFLHSLSTFSK 564
Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
D AL V + +G +L+
Sbjct: 565 DEEALLGRVATKHDPADGVQLI 586
>gi|317139892|ref|XP_001817829.2| nuclear protein localization protein 4 [Aspergillus oryzae RIB40]
Length = 603
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 89/322 (27%), Positives = 127/322 (39%), Gaps = 70/322 (21%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +E+ + L +
Sbjct: 273 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGVKAVVQAIYEPPQ--VDEIDGVTLHEWH 330
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
EK VD +A G++KVG IFT + + D Y LS+ E+ A A+
Sbjct: 331 NEKEVDEVARLCGLEKVGVIFTDLLDAGQGDGSVICKRHIDSYFLSSLEITFAARLQAQY 390
Query: 242 -NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG AI A+Q S V + + E
Sbjct: 391 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 446
Query: 288 -----AEGDDPK----------LSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSS 332
+E DD + K+ + V ++ K +L+V L H P S
Sbjct: 447 VMLVQSEEDDSENKSRYIPEVFYRKINEYGVSAQQNAKPAFPVEYLLVT--LTHGFPTES 504
Query: 333 T-------FPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
+ FPIENR +Q K ++ ++ +SDFHLL FL +
Sbjct: 505 SPLFVEGNFPIENREVIGESQELRHVAKKLVSHGDPDKAIRAVSDFHLLCFLHSLSTFSK 564
Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
D AL V + +G +L+
Sbjct: 565 DEEALLGRVATKHDPADGVQLI 586
>gi|123448649|ref|XP_001313052.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121894921|gb|EAY00123.1| hypothetical protein TVAG_471060 [Trichomonas vaginalis G3]
Length = 403
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 96/423 (22%), Positives = 183/423 (43%), Gaps = 48/423 (11%)
Query: 2 LLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQN-LLLAKSPSDLLQ 60
++RIR + G E+V V+ H T+ L + ++ I E TL ++ +L K
Sbjct: 6 VVRIRWKKGTEKVPVN-PHDTIGTLIQYMVNKFEIKEEISTLIQDKTKTVLQKRIK---- 60
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSYDGERNVRGPSFN-----PAGSFGRKMTMDDLIAK 115
LSSL++ +G + LS + N S P R + D
Sbjct: 61 ----------LSSLDMENGELFELSLPSDANPNKDSKTSKPDTPIDDHQRNVYTLDYFKD 110
Query: 116 Q---MRVTRQENPHCESVSFDRDCADAFQQY--VNETLAFAVKRGGFMYGTVLEDKRVEV 170
++ + EN + V F + Q+ ++ L F + +YGT + +
Sbjct: 111 HNPTIQFQKYENSILKRVLFP---SAELQRIGRISNALHFNSTQIFLLYGTTPNPAAIRI 167
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNRE 230
+ + P Q+ T E L I +E K +A G+K +GF+ I+ K + ++
Sbjct: 168 HALSLPSQECTGEELLI--NESHIKDSHHVALACGLKFLGFL----IISPKTNVPIAPHL 221
Query: 231 VLQAVEFHAECNMEEWV--TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIA 288
+++ + ++ ++E ++ ++ L+ S S + + EA+ +S + +++ G F
Sbjct: 222 LIKLMPLISD-DLESFIFIRSLPTLK-SMVDSVSCNLEAYSVSKQFIDMYRRGLFT---G 276
Query: 289 EGDDPKLSKMKKDVVVGGKDVKEVDNDFFL--VVVKILDHQGPLSSTFPIENRTTQVTMR 346
E D L K V+V K +KEVD +F+ + K+ + P S++FP + + T+
Sbjct: 277 ESTDTSLI-TKSTVLVSAKSLKEVDVSYFVTPLATKLRESWFP-SNSFPYQ--SFYPTIA 332
Query: 347 ALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLINS 406
+ + +P R DF+LLLF D +VP +A+ A+ +P+ + I
Sbjct: 333 DFVAICYKFIDVPNYIRFLDFNLLLFCCTIFDREEEVPLIARSCIARDDIPDKVLVKIEE 392
Query: 407 MAE 409
M +
Sbjct: 393 MID 395
>gi|209879848|ref|XP_002141364.1| NPL4 family protein [Cryptosporidium muris RN66]
gi|209556970|gb|EEA07015.1| NPL4 family protein [Cryptosporidium muris RN66]
Length = 489
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 73/290 (25%), Positives = 126/290 (43%), Gaps = 43/290 (14%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYI-LRDEE 192
F +Y L +R G+MYG ED R + IYEPPQ + + L ++
Sbjct: 198 FVKYWTCNLQLQKQRIGWMYGYYKEDNTYPMGIRAVMEAIYEPPQTNNHKFGNLELEFDD 257
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQ---------AVEFHAECNM 243
+ VD IA LG++ +G IFT ++D L++RE++ + +
Sbjct: 258 FKSTVDTIAHSLGLECIGLIFT----HKERDELLTSREIISLGKLQLDYLKYNHYTGYPV 313
Query: 244 EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDP-KLSKMKK-- 300
++ + E SA + AF +SD+ + ++ + + + D KL +K
Sbjct: 314 SNFIACTIAPCKKSEDSAPVP-NAFAVSDLGLAFIRDHLLDEDNLDDDLHMKLRAEQKGE 372
Query: 301 ---DVVVGGKDVKEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKS 350
V+ GG++ D +F+V V + P+ + FP ENR + T + +
Sbjct: 373 LLPQVLEGGQETSRFDAHWFVVRV---NESAPIRPRSLFNCNQFPRENRISTQTPADVAT 429
Query: 351 HL-NR--SPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVP 397
+ NR + S + ++DFHLLL LA+ D ++ AL+ C P
Sbjct: 430 FIKNRLVNISSTSYQLLNDFHLLLHLAKLFDEST---ALSICNSIANKTP 476
>gi|189203729|ref|XP_001938200.1| endoplasmic reticulum and nuclear membrane protein Npl4
[Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985299|gb|EDU50787.1| endoplasmic reticulum and nuclear membrane protein Npl4
[Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 655
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 131/323 (40%), Gaps = 64/323 (19%)
Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDE 191
Q V++ + F +R GF+YG E V V IYEPPQ + + L +
Sbjct: 259 QVVDDLINFWRQTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQVNEADGIS-LGEW 317
Query: 192 EEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE 240
+ EK +D IAA G+++VG IFT + N D Y L++ EV A + A+
Sbjct: 318 DNEKEIDEIAAQCGLQRVGVIFTDLLDTNNGDGSVICKRHMDSYYLASLEVCFAARYQAK 377
Query: 241 C-NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
+W + V +S + I ++Q+S+ V + + E
Sbjct: 378 YPRPSKWSDTGRHGSNFVTCVISGDDKGQIGISSYQVSNDAVEMVRADIIEPSAEPSVML 437
Query: 288 --AEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP------- 329
+E DD L + + + + G +V+EV F V ++ L H P
Sbjct: 438 VQSEEDDEALGRARYIPEVFYRRINEHGANVQEVAKPAFPVEYLLLTLTHGFPTQPNPLF 497
Query: 330 LSSTFPIENRTTQVTMRALKSHLN-----RSPSLPL-----VKRISDFHLLLFLARFLDL 379
FPIENR + + SHL+ ++ L L + IS+FH+L F+ L
Sbjct: 498 TGGKFPIENREAMGEVPEV-SHLSKVLNAKANGLALNTTSDLNAISNFHMLCFIHNLGVL 556
Query: 380 NSDVPALAQCVQAQTAVPEGYKL 402
+ D L V + EG L
Sbjct: 557 SKDEETLLFKVASSHDTSEGSAL 579
>gi|121805628|sp|Q2URI8.1|NPL4_ASPOR RecName: Full=Nuclear protein localization protein 4
gi|83765684|dbj|BAE55827.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 652
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 89/322 (27%), Positives = 127/322 (39%), Gaps = 70/322 (21%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +E+ + L +
Sbjct: 273 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGVKAVVQAIYEPPQ--VDEIDGVTLHEWH 330
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
EK VD +A G++KVG IFT + + D Y LS+ E+ A A+
Sbjct: 331 NEKEVDEVARLCGLEKVGVIFTDLLDAGQGDGSVICKRHIDSYFLSSLEITFAARLQAQY 390
Query: 242 -NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG AI A+Q S V + + E
Sbjct: 391 PKATKWSRTGRFGSNFVTCV--LSGDEEG--AISVSAYQASVAAVEMVRADIVEPSAEPS 446
Query: 288 -----AEGDDPK----------LSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSS 332
+E DD + K+ + V ++ K +L+V L H P S
Sbjct: 447 VMLVQSEEDDSENKSRYIPEVFYRKINEYGVSAQQNAKPAFPVEYLLVT--LTHGFPTES 504
Query: 333 T-------FPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
+ FPIENR +Q K ++ ++ +SDFHLL FL +
Sbjct: 505 SPLFVEGNFPIENREVIGESQELRHVAKKLVSHGDPDKAIRAVSDFHLLCFLHSLSTFSK 564
Query: 382 DVPALAQCVQAQTAVPEGYKLL 403
D AL V + +G +L+
Sbjct: 565 DEEALLGRVATKHDPADGVQLI 586
>gi|70990368|ref|XP_750033.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Aspergillus fumigatus Af293]
gi|74669989|sp|Q4WKD7.1|NPL4_ASPFU RecName: Full=Nuclear protein localization protein 4
gi|66847665|gb|EAL87995.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Aspergillus fumigatus Af293]
gi|159130511|gb|EDP55624.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Aspergillus fumigatus A1163]
Length = 652
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 132/323 (40%), Gaps = 72/323 (22%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +E+ I L +
Sbjct: 273 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEIDGITLHEWP 330
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE- 240
EK VD +A G++KVG IFT + + D Y LS+ E+ A A+
Sbjct: 331 NEKEVDEVARQCGLEKVGVIFTDLLDAGRGDGSVVCKRHIDSYYLSSLEIAFASRMQAKH 390
Query: 241 CNMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG AI ++Q S V + + E
Sbjct: 391 PKPTKWSRTGRFGSNFVTCV--LSGDEEG--AITVSSYQASISAVEMVRADIIEPSAEPS 446
Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-- 330
+E +D P++ K+ + V ++ K +L+V L H P
Sbjct: 447 VMLVQSEDEDTDNKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEYLLVT--LTHGFPTEA 504
Query: 331 -----SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNS 381
+STFPIENR +Q K ++ ++ +SDFHLL FL +
Sbjct: 505 SPIFPASTFPIENREVIGESQELRHVAKKLVSHGDPDKAIREVSDFHLLCFLHSLSMFSK 564
Query: 382 DVPALAQCVQAQTAVP-EGYKLL 403
+ AL C A T P EG KLL
Sbjct: 565 EEEALL-CRVATTHDPTEGLKLL 586
>gi|403222165|dbj|BAM40297.1| uncharacterized protein TOT_020000556 [Theileria orientalis strain
Shintoku]
Length = 540
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 74/343 (21%), Positives = 139/343 (40%), Gaps = 88/343 (25%)
Query: 139 AFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEV-------- 184
+F Y L + +R G+MYG ED+ R IYEPPQ V
Sbjct: 206 SFANYWLADLEMSFQRIGWMYGYYTEDQHYPYGIRAVCEAIYEPPQMSQNVVKVASPRNK 265
Query: 185 LYILRDEE----------EEKL---VDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREV 231
+Y D + E+ L VD IA LG++++G IFT+ + L+ +E+
Sbjct: 266 VYKHNDSDNVQVNYVSLMEDNLLPNVDKIANRLGLERIGLIFTRL----PNECFLTAKEL 321
Query: 232 LQAVEFH----AECNMEEW-VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE 286
++ +E + + V+ +V +S + + +AF SDM + LF++ +
Sbjct: 322 MRTIEMQYNSLKHIHYTGYPVSTMVTCTMSPDDNGQPVLDAFMASDMALALFRDELLDNN 381
Query: 287 IAEGDDPKLSKMKKD------VVVGGKDVKEVDNDFFLVVVKILDHQGPL---------- 330
+ K+ +++ ++ GK+V D + +V + + P+
Sbjct: 382 QDHDNIIKMREVEHSQEIMPPIIESGKEVTSFDTSWLMVRI---NESAPIKPNSFFGNNG 438
Query: 331 -SSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRIS------------------------ 365
+ FP+ENRT T K+ ++ S R++
Sbjct: 439 RNHKFPVENRTYSSTA---KTRMHESLVSDCSARVTNSNERFRKWDKKILTDYFNTHDPE 495
Query: 366 ----DFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEGYKLLI 404
DFHLL+++ + LD+++ + A+ C+ + V + K L+
Sbjct: 496 TLWNDFHLLIYVCQVLDVDTAL-AICDCIVRKEQVDDLVKDLL 537
>gi|340992720|gb|EGS23275.1| hypothetical protein CTHT_0009420 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 659
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 124/318 (38%), Gaps = 64/318 (20%)
Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
V+R G +YG LE V V IYEPPQ +E+ I L E E+ V+ +A
Sbjct: 279 GVQRLGILYGRYLEYDAVPLGIKAVVEAIYEPPQ--VDEIDGITLNPWENEQEVNQVAKY 336
Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNMEEWV---- 247
G+++VG I+T + K D Y L+ +E++ A A+ +W
Sbjct: 337 CGLEQVGVIWTDLLDAGKGDGSVVCKRHADSYFLAAQEIVFAARLQAQHPKPSKWSDTGR 396
Query: 248 --TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLS 296
+ V VS I A+QMS+ V + + E E DD S
Sbjct: 397 FGSNFVTCVVSGNEQGEISISAYQMSNDAVEMVRADIIEPSADPTLMLVREEEEDDGSPS 456
Query: 297 KMK--------------KDVVVGGKDVKEVDNDFFLVVVKILDHQGPL--SSTFPIENRT 340
K + +V+ K V+ F + D PL + FPIENR
Sbjct: 457 KTRYIPEVFYRKINEYGANVLENAKPAFPVEYLFVTLTHGFPDSPSPLFTDNIFPIENRE 516
Query: 341 -------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQ 393
+ALK H +P +SDFHLL F+ + L+ + AL V
Sbjct: 517 YVGEAQEVSAVAKALKVHEADAP-----MNVSDFHLLCFIHQMSVLSKEEEALLCRVATL 571
Query: 394 TAVPEGYKLLINSMAETV 411
+ E ++L + +T+
Sbjct: 572 HDLAESFQLRSTTGWQTL 589
>gi|339252344|ref|XP_003371395.1| eukaryotic initiation factor 4A-III [Trichinella spiralis]
gi|316968374|gb|EFV52655.1| eukaryotic initiation factor 4A-III [Trichinella spiralis]
Length = 970
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 130/282 (46%), Gaps = 48/282 (17%)
Query: 153 KRGGFMYGT------VLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R GF++G V R V IYEPPQ + + IL D +E +VD++A LG+
Sbjct: 562 QRMGFLFGRYEPMTDVPLGIRAVVTVIYEPPQITYTDGIDILEDPHDE-IVDSLAECLGL 620
Query: 207 KKVGFIFTQ--------TIMQNKKD---YTLSNREVLQAVEFHAE----CNM-------E 244
K++G+IFT T++ N ++ Y LS E L A ++ + C
Sbjct: 621 KRIGWIFTDFLRDPDDPTLLVNTRNANTYMLSAAECLIAAKYQNKYPNACKYSLDGTFGS 680
Query: 245 EWVTAVVK-LEVSEEGSAAIHFEAFQMSD-----MCVRLFKEGWFETEIAE-GDDPKLSK 297
++VT V+ +VS +GSA + E S CVR + ++ D +K
Sbjct: 681 KFVTLVLSAYQVSNQGSALVAAECLLPSRSSPNMACVRRSTPTHYVPDVYYMCHDEYNNK 740
Query: 298 MKKDVVVGGKDVKEVDNDFFLVVVKIL-DHQGPLSST---FPIENRT---TQVTMRALKS 350
++K + V+ F + V + + + L+S FPIENR+ + +R +
Sbjct: 741 IRKTA-----GLLPVEYLFVTIPVGLAKEMRYTLTSQKKPFPIENRSSIGSAQNIRVVGE 795
Query: 351 HLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQA 392
++ + +K IS+FH L++L ++ ++ LA+ + A
Sbjct: 796 YVKQFEESEFLKMISNFHFLVYLLTNTEVTFELDDLAELLDA 837
>gi|345312165|ref|XP_001519304.2| PREDICTED: nuclear protein localization protein 4 homolog, partial
[Ornithorhynchus anatinus]
Length = 502
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 93/208 (44%), Gaps = 53/208 (25%)
Query: 177 PQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQ-----------NKKDYT 225
PQ GT+ L +L D + E +VD +AA LG++KVG+IFT + + NK Y
Sbjct: 296 PQIGTQNSLELLEDPKAE-VVDEMAAKLGLRKVGWIFTDLVSEDTRKGTVRYSRNKDTYF 354
Query: 226 LSNREVLQAVEFHAE----CNM-------EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
LS E + A F + C + ++VTAV + +HFE +Q+S+ C
Sbjct: 355 LSAEECITAGNFQNKHPNMCRLSPDGHFGSKFVTAVATGGPDNQ----VHFEGYQVSNQC 410
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMK-------------KDVVVGGKDVKEVDN----DFF 317
+ L ++ E + D P+L K KD+ G ++ ++ ++
Sbjct: 411 MALVRD---ECLLPCRDAPELGYAKESSSEQYVPDVFYKDIDKFGNEITQLARPLPVEYL 467
Query: 318 LVVVKILDHQGPL------SSTFPIENR 339
++ + + P+ S FPIENR
Sbjct: 468 IIDITTTFPKDPVYTFSISQSPFPIENR 495
>gi|347829803|emb|CCD45500.1| similar to nuclear protein localization protein 4 [Botryotinia
fuckeliana]
Length = 640
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 129/316 (40%), Gaps = 65/316 (20%)
Query: 149 AFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIA 201
A +R G++YG E V V IYEPPQ +E+ + L + E EK +D IA
Sbjct: 284 ATGAQRFGYLYGRYEEYTEVPLGVKAVVEAIYEPPQ--VDELDGVTLNEWESEKDIDEIA 341
Query: 202 AGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNM 243
GM++VG I+T I D Y LS+ E+ A A+ +
Sbjct: 342 RLCGMERVGVIWTDLIDSGVGDGTVLCKRHIDSYYLSSLEIAFAARLQAKHPKPTKWSDT 401
Query: 244 EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--------AEGDDPKL 295
++ + V V+ + + I A+Q+S+ V + + E +EGDD
Sbjct: 402 GKFGSNFVTCVVTADENGGIAISAYQVSNSAVEMVRADIVEPSADPAVMIVRSEGDDDSD 461
Query: 296 SKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP--------LSSTFPIEN 338
+ + + + G+ V+E F V ++ L H P S+F IEN
Sbjct: 462 TSSRYIPEVFFRKINEYGRSVQENAKPSFPVEYLLVTLTHGFPSDPKPAFMAKSSFTIEN 521
Query: 339 RTTQVTMRALK-----------SHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALA 387
R + LK L +SP L +SDFHLL F+ L+ + AL
Sbjct: 522 RLVIGQEQDLKDVGKQLGLDKSGQLTQSPDGVLA--VSDFHLLSFIYSMGILSKEEMALL 579
Query: 388 QCVQAQTAVPEGYKLL 403
V Q + +GY+L+
Sbjct: 580 CRVATQHDLSDGYQLI 595
>gi|425766761|gb|EKV05359.1| Endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Penicillium digitatum Pd1]
gi|425775414|gb|EKV13686.1| Endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Penicillium digitatum PHI26]
Length = 675
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 118/292 (40%), Gaps = 69/292 (23%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+N L F K G GF+YGT E V V IYEPPQ G + + L + +
Sbjct: 269 INSLLDFWRKSGAQRLGFLYGTYEEYTEVPLGVKAVVEAIYEPPQTGEVDGV-TLHEWQN 327
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-- 240
EK VD +A G++KVG IFT I + D Y LS+ E+ A A+
Sbjct: 328 EKEVDDVARLCGLQKVGVIFTDLIDGGRGDGSVVCKRHIDSYFLSSLEIAFAARLQAQNP 387
Query: 241 -----CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI-------- 287
++ + V +S + + AI ++Q S V + + E
Sbjct: 388 KSTKWSRTGQFGSDFVTCVLSGDETGAIAISSYQASVSAVEMVRGDIVEPSADPTVMLVQ 447
Query: 288 AEGDDPKLSKMK-------KDVVVGGKDVKEVDN-----DFFLVVVKILDHQGPLSST-- 333
+E D+ S+ + + + G V+E DF LV L H P S+
Sbjct: 448 SEHDEDLGSRTRYIPEVFYRRINEYGASVQESAEPSFPVDFLLVT---LTHGFPTESSPL 504
Query: 334 -----FPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
FPIENR + + L SH + + ++ +SDFH+L FL
Sbjct: 505 FTNSRFPIENREVIGESQELRHVAQKLMSHGDPDKA---IQGVSDFHILCFL 553
>gi|169602695|ref|XP_001794769.1| hypothetical protein SNOG_04350 [Phaeosphaeria nodorum SN15]
gi|160706232|gb|EAT88110.2| hypothetical protein SNOG_04350 [Phaeosphaeria nodorum SN15]
Length = 1017
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 85/324 (26%), Positives = 132/324 (40%), Gaps = 66/324 (20%)
Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDE 191
Q V++ + F +R GF+YG E V V IYEPPQ + + L D
Sbjct: 259 QVVDDLINFWRNTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQVNELDGIS-LGDW 317
Query: 192 EEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE 240
+ EK +D IAA G+++VG IFT + +K D Y LS+ E+ A + A+
Sbjct: 318 DNEKEIDEIAAQCGLQRVGVIFTDLLDADKGDGSVICKRHIDSYYLSSLEIAFAARYQAK 377
Query: 241 -------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
++ + V +S + I ++Q S+ V + + E
Sbjct: 378 YPRPTKWSETGKFGSNFVTCVISGDDQGQIGISSYQASNDAVEMVRADIIEPSAEPSVML 437
Query: 288 --AEGDDPKLSKMK-------KDVVVGGKDVKE-VDNDF---FLVVVKILDHQGP----- 329
+E D+ L++ + + + G +V+E DF +L V L H P
Sbjct: 438 VQSEDDNEALNRARYIPEVFYRRINEHGANVQENAKPDFPVEYLFVT--LTHGFPTQPNP 495
Query: 330 --LSSTFPIENRTTQVTM---RALKSHLN-RSPSLPL-----VKRISDFHLLLFLARFLD 378
FPIENR M AL LN ++ L L + IS+FH+L F+
Sbjct: 496 LFTGGKFPIENREIMGEMPDVSALGKSLNAKANGLALNTTSGLNAISNFHMLCFIHNLGI 555
Query: 379 LNSDVPALAQCVQAQTAVPEGYKL 402
L+ D +L V + EG L
Sbjct: 556 LSKDEESLLFKVASTHDTSEGSAL 579
>gi|448113826|ref|XP_004202429.1| Piso0_001263 [Millerozyma farinosa CBS 7064]
gi|359383297|emb|CCE79213.1| Piso0_001263 [Millerozyma farinosa CBS 7064]
Length = 566
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 59/121 (48%), Gaps = 20/121 (16%)
Query: 137 ADA--FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
ADA ++++ A V+R G MYGT ++V V IYEPPQ G + L +L
Sbjct: 241 ADATILNKFLDSWRASGVQRFGIMYGTYEPFEKVPLGTKAVVQAIYEPPQAGEVDGLTLL 300
Query: 189 RDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEF 237
E E + VD++AA LG+ KVG +FT +D Y LS EVL A F
Sbjct: 301 EWENESQ-VDSVAAALGLYKVGMVFTDLTDSGARDGSVICKRHTDTYFLSCLEVLMASRF 359
Query: 238 H 238
Sbjct: 360 Q 360
>gi|71030974|ref|XP_765129.1| hypothetical protein [Theileria parva strain Muguga]
gi|68352085|gb|EAN32846.1| hypothetical protein, conserved [Theileria parva]
Length = 531
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 50/188 (26%), Positives = 84/188 (44%), Gaps = 17/188 (9%)
Query: 113 IAKQMRVTRQENPHCESVSF-DRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK----- 166
+ K + + Q+ H + + + D F + L +++R G+MYG ED
Sbjct: 167 LPKSITIRHQKYRHVDHLEMMNVDEVRGFANFWLSELQMSLQRIGWMYGYYTEDHHYPYG 226
Query: 167 -RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT 225
R IYEPPQ T + +L DE VD IA LG+++VG IFT+ +
Sbjct: 227 IRAVCEAIYEPPQFSTINDVVMLEDEMLCS-VDKIAEKLGLERVGMIFTRL----PNECF 281
Query: 226 LSNREVLQAVEFHAECNMEEW-----VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE 280
L+ RE+++ +E V+ +V +S + S AF SD+ + L+++
Sbjct: 282 LTARELMRVIEMQYNSLRHTHYTGYPVSTMVTCTMSPDDSGQPVLNAFMASDLALALYRD 341
Query: 281 GWFETEIA 288
F + A
Sbjct: 342 ELFTSSQA 349
>gi|72386595|ref|XP_843722.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175416|gb|AAX69558.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70800254|gb|AAZ10163.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 408
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 67/284 (23%), Positives = 121/284 (42%), Gaps = 47/284 (16%)
Query: 130 VSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIYEPPQQGTEEVLYIL 188
V D + FQ +V ++ +AV+R G +YG + +EV+ +YEP Q+G + L
Sbjct: 94 VRVDVTSLNLFQSFVRQS-GWAVQRCGILYGKYDSAESTIEVHAVYEPEQEGGLQKFICL 152
Query: 189 RDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL----QAVEFHAECNM- 243
RD VD +A LG+++VG + T K LS E+L + F EC +
Sbjct: 153 RD-SRVGTVDRLAERLGLRRVGMVCTHQARDPDK-MVLSGYELLLCAREQSRFGDECVLL 210
Query: 244 ---EEWVTAVVKLEVSEEGSAAIHF----------EAFQMSDMCVRLFKEGWFETEIA-- 288
T ++ + + A+HF +A + V +G+ EI
Sbjct: 211 TMSPSITTGRIECQAWQASPQAVHFYRLGTLTEKLDADPRNSTVVTGTADGFCGDEIPSH 270
Query: 289 --------EGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRT 340
E + + V+ KE+D +F V + P S ++N
Sbjct: 271 FVYSSISLEVAQEQTDDKGRPQVITKAPSKEIDTRWFTSYVAV----DPFDSHV-VKNLF 325
Query: 341 TQV--------TMRALKSHLN--RSPSLPLVKRISDFHLLLFLA 374
++ T+ L+++L + +P++++++DFH+L+FLA
Sbjct: 326 VRISRPGMDPPTITNLRNYLKDPKRADVPILQKLADFHVLIFLA 369
>gi|121703177|ref|XP_001269853.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Aspergillus clavatus NRRL 1]
gi|190359910|sp|A1CS06.1|NPL4_ASPCL RecName: Full=Nuclear protein localization protein 4
gi|119397996|gb|EAW08427.1| endoplasmic reticulum and nuclear membrane proteinc Npl4, putative
[Aspergillus clavatus NRRL 1]
Length = 674
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 120/295 (40%), Gaps = 76/295 (25%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV + L +
Sbjct: 273 INSLLDFWRKSGSQRLGFLYGTYEEYTEVPLGVKAVVQAIYEPPQ--VDEVDGVTLHEWP 330
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
EK VD +A G++KVG IFT + + D Y LS+ E+ A A+
Sbjct: 331 NEKEVDEVARLCGLEKVGVIFTDLLDAGRGDGSVVCKRHIDSYYLSSLEIAFASRLQAQY 390
Query: 242 -NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG AI ++Q S V + + E
Sbjct: 391 PKTTKWSRTGRFGSNFVTCV--LSGDEEG--AITISSYQASVSAVEMVRADIVEPSAEPS 446
Query: 288 -----AEGDD--------PKL--SKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP--- 329
+E DD P++ K+ + V ++ K FL+V L H P
Sbjct: 447 VMLVQSEDDDSDNKSRYIPEVFYRKINEYGVSAQQNAKPSFPVEFLLVT--LTHGFPTES 504
Query: 330 ----LSSTFPIENRTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
STFPIENR + + L SH + + +P +SDFHLL +L
Sbjct: 505 NPLFTKSTFPIENREVIGESQDLRSVAKKLVSHRDSNEVIP---EVSDFHLLCYL 556
>gi|190359915|sp|P0C7N6.1|NPL4_PHANO RecName: Full=Nuclear protein localization protein 4
Length = 653
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/325 (26%), Positives = 133/325 (40%), Gaps = 68/325 (20%)
Query: 142 QYVNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRD 190
Q V++ + F +R GF+YG E V V IYEPPQ E+ I L D
Sbjct: 259 QVVDDLINFWRNTGCQRLGFLYGRYEEYTEVPLGTKAVVETIYEPPQ--VNELDGISLGD 316
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHA 239
+ EK +D IAA G+++VG IFT + +K D Y LS+ E+ A + A
Sbjct: 317 WDNEKEIDEIAAQCGLQRVGVIFTDLLDADKGDGSVICKRHIDSYYLSSLEIAFAARYQA 376
Query: 240 E-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI----- 287
+ ++ + V +S + I ++Q S+ V + + E
Sbjct: 377 KYPRPTKWSETGKFGSNFVTCVISGDDQGQIGISSYQASNDAVEMVRADIIEPSAEPSVM 436
Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKE-VDNDF---FLVVVKILDHQGP---- 329
+E D+ L++ + + + G +V+E DF +L V L H P
Sbjct: 437 LVQSEDDNEALNRARYIPEVFYRRINEHGANVQENAKPDFPVEYLFVT--LTHGFPTQPN 494
Query: 330 ---LSSTFPIENRTTQVTM---RALKSHLN-RSPSLPL-----VKRISDFHLLLFLARFL 377
FPIENR M AL LN ++ L L + IS+FH+L F+
Sbjct: 495 PLFTGGKFPIENREIMGEMPDVSALGKSLNAKANGLALNTTSGLNAISNFHMLCFIHNLG 554
Query: 378 DLNSDVPALAQCVQAQTAVPEGYKL 402
L+ D +L V + EG L
Sbjct: 555 ILSKDEESLLFKVASTHDTSEGSAL 579
>gi|261326791|emb|CBH09764.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 500
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/284 (23%), Positives = 122/284 (42%), Gaps = 47/284 (16%)
Query: 130 VSFDRDCADAFQQYVNETLAFAVKRGGFMYGTV-LEDKRVEVNFIYEPPQQGTEEVLYIL 188
V D + FQ +V ++ +AV+R G +YG + +EV+ +YEP Q+G + L
Sbjct: 186 VRVDVTSLNLFQSFVRQS-GWAVQRCGILYGKYDSAESTIEVHAVYEPEQEGGLQKFVCL 244
Query: 189 RDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVL----QAVEFHAECNM- 243
RD VD +A LG+++VG + T ++ LS E+L + F EC +
Sbjct: 245 RD-SRVGTVDRLAERLGLRRVGMVCTHQ-ARDPDKMVLSGYELLLCAREQSRFGDECVLL 302
Query: 244 ---EEWVTAVVKLEVSEEGSAAIHF----------EAFQMSDMCVRLFKEGWFETEIA-- 288
T ++ + + A+HF +A + V +G+ EI
Sbjct: 303 TMSPSITTGRIECQAWQASPQAVHFYRLGTLTEKLDADPRNSTVVTGTADGFCGDEIPSH 362
Query: 289 --------EGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRT 340
E + + V+ KE+D +F V + P S ++N
Sbjct: 363 FVYSSISLEVAQEQTDDKGRPQVITKAPSKEIDTRWFTSYVAV----DPFDSHV-VKNLF 417
Query: 341 TQV--------TMRALKSHLN--RSPSLPLVKRISDFHLLLFLA 374
++ T+ L+++L + +P++++++DFH+L+FLA
Sbjct: 418 VRISRPGMDPPTITNLRNYLKDPKRADVPILQKLADFHVLIFLA 461
>gi|389584256|dbj|GAB66989.1| nuclear pore associated protein [Plasmodium cynomolgi strain B]
Length = 507
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/275 (25%), Positives = 108/275 (39%), Gaps = 76/275 (27%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
F QY + R G+MYG ED R IYEPPQ + + +L D+
Sbjct: 268 FVQYWYTNGCMSEHRVGWMYGYYREDPHYNLGIRAVCECIYEPPQINEVDKVKLLPDDFL 327
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKL 253
+ VD IA LG++++ D LSN V A
Sbjct: 328 DS-VDVIANRLGLERI-------------DPMLSNEPVTNA------------------- 354
Query: 254 EVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET-----EIAEGDDPKLSKMKKDVVVGGKD 308
F +SD+ + L + ++ + + +P +++ ++ GGK+
Sbjct: 355 --------------FMVSDLGMALIRSNLVDSTQTDPSVIQLRNPVRNELLPLILEGGKE 400
Query: 309 VKEVDNDFFLVVVKILDHQGPL-------SSTFPIENRTTQVTMRALKSHLNRSPSLPL- 360
+ D D+F+V + + P + FP ENR T +K + SP L
Sbjct: 401 TNKFDADWFIVRI---NESAPKVVRSLFKNFHFPRENRLHSPTAYDVKEYF-ASPKLERG 456
Query: 361 ---VKRISDFHLLLFLARFLDLNSDVPALAQCVQA 392
+ R SDFHL+LF R LD+ + ALA C A
Sbjct: 457 TNGMHRCSDFHLILFACRVLDMET---ALALCDAA 488
>gi|115387239|ref|XP_001211125.1| protein NPL4 [Aspergillus terreus NIH2624]
gi|114195209|gb|EAU36909.1| protein NPL4 [Aspergillus terreus NIH2624]
Length = 566
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 113/297 (38%), Gaps = 66/297 (22%)
Query: 144 VNETLAFAVKRG----GFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
+N L F K G GF+YGT E V V IYEPPQ +EV I L +
Sbjct: 273 INSLLDFWRKSGAQRMGFLYGTYEEYTEVPLGIKAVVQAIYEPPQ--VDEVDGITLHEWP 330
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC 241
E+ VD +A G++KVG IFT + + D Y LS+ EV A A+
Sbjct: 331 NEREVDEVARLCGLEKVGVIFTDLLDAGQGDGSVVCKRHIDSYYLSSLEVAFASRLQAQY 390
Query: 242 -NMEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--- 287
+W VT V L EEG AI A+Q S V + + E
Sbjct: 391 PKATKWSRNGRFGSNFVTCV--LSGDEEG--AITVSAYQASVAAVEMVRADIVEPSAEPS 446
Query: 288 -----AEGDDP-------------KLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP 329
+E DD K+++ K V+ F + P
Sbjct: 447 VMLVQSEEDDTDNKSRYIPEVFYRKINEYGVSAQENAKPAFPVEYLFVTLTHGFPTESSP 506
Query: 330 L--SSTFPIENR----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLN 380
L S+FPIENR +Q K ++ ++ +SDFHLL FL N
Sbjct: 507 LFTDSSFPIENREVIGESQELRHVAKKLVSHGDPDKAIQAVSDFHLLCFLHSLSTFN 563
>gi|123477189|ref|XP_001321763.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121904596|gb|EAY09540.1| hypothetical protein TVAG_276130 [Trichomonas vaginalis G3]
Length = 430
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 111/240 (46%), Gaps = 21/240 (8%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK---RVEVNFIYEPPQQGTEEVLYILRDEEEEKL 196
FQQ T + R F++G + DK +V V+ EP Q+ + + IL +E L
Sbjct: 169 FQQKALMT-GYKTHRIMFLFGRI--DKYTGKVTVHCSCEPQQKNFVDHVEIL-EEFNMDL 224
Query: 197 VDAIAAGLGMKKVGFIFTQT-IMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEV 255
IA GM+ VG ++ + +DY E + + + + E+ T +V V
Sbjct: 225 DRNIAHEFGMECVGMAISRPEPTEKSEDYKHEMTEYMVRLAAKYQVEITEYFTTIV---V 281
Query: 256 SEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVVG-GKDVKEVDN 314
+E+ H E +Q SD ++ +++G F + P L K K +V +G K +KE+D+
Sbjct: 282 TEK-----HAEPYQASDAAMKAYQQGLF----LQSSSPTLLKFKSEVKLGTAKHLKELDS 332
Query: 315 DFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLA 374
++ + ++I + T P + ++ +K + + P+ R+ +F+LLL+
Sbjct: 333 NYAVATLRIRNANSHFPDTEPFPPLSEYPSVMDVKKYFDDMQYCPMWYRLFNFNLLLYFC 392
>gi|241949039|ref|XP_002417242.1| nuclear-envelope, endoplasmic-reticulum-associated-degradation
network protein NPL4, putative [Candida dubliniensis
CD36]
gi|223640580|emb|CAX44837.1| nuclear-envelope, endoplasmic-reticulum-associated-degradation
network protein NPL4, putative [Candida dubliniensis
CD36]
Length = 580
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 62/123 (50%), Gaps = 22/123 (17%)
Query: 137 ADAF--QQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
AD+F ++N V+R G++YG + + V I EPPQ + + +L
Sbjct: 247 ADSFILNDFINVWRISGVQRFGYLYGKYAKSDKTPLGIKAIVEMIIEPPQHDELDGITLL 306
Query: 189 ---RDEEEEKLVDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQA 234
++EEEEKLVD +A G+ KVG IFT + + + KD Y L+N E++ A
Sbjct: 307 DWNKEEEEEKLVDNLANKFGLYKVGMIFTDLTDDGNKNGKVLCKRHKDSYFLTNLEIIMA 366
Query: 235 VEF 237
+F
Sbjct: 367 AKF 369
>gi|68474832|ref|XP_718457.1| hypothetical protein CaO19.2434 [Candida albicans SC5314]
gi|74590800|sp|Q5AA50.1|NPL4_CANAL RecName: Full=Nuclear protein localization protein 4
gi|46440223|gb|EAK99531.1| hypothetical protein CaO19.2434 [Candida albicans SC5314]
Length = 598
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 69/134 (51%), Gaps = 21/134 (15%)
Query: 137 ADAF--QQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
AD+F ++N V+R G++YG + ++ V I EPPQ + + +L
Sbjct: 267 ADSFILNDFINVWRVSGVQRFGYLYGRYAKSEKTPLGIKAIVETIIEPPQHDELDGITLL 326
Query: 189 -RDEEEEKLVDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQAVE 236
D++EEK+VD +A G+ KVG IFT + + + KD Y L+N E++ A +
Sbjct: 327 DWDQQEEKMVDQVANKFGLYKVGIIFTDLTDAGTKNGKVLCKRHKDSYFLTNLEIIMAAK 386
Query: 237 FHAEC-NMEEWVTA 249
F + N+ ++ TA
Sbjct: 387 FQLKYPNISKYSTA 400
>gi|384490701|gb|EIE81923.1| hypothetical protein RO3G_06628 [Rhizopus delemar RA 99-880]
Length = 565
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 121/289 (41%), Gaps = 57/289 (19%)
Query: 137 ADAFQQYVNETLAFAVKRGGFMYG------TVLEDKRVEVNFIYEPPQQGTEEVLYILRD 190
A ++N A +R G++YG V + V IYEPPQQ + + +
Sbjct: 197 ASIIDSFLNYWRASGSQRFGYLYGRYEPYLEVPLGVKAVVEAIYEPPQQDHSDGIQLTLP 256
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEF-- 237
EEE V+ +A G+ +VG I++ + ++ Y LS+ E A E
Sbjct: 257 WEEENRVNEVAEACGLVRVGMIYSDLLDDGSGSGKVIAKRHVNSYFLSSVECAFAAEMQR 316
Query: 238 -HAECNME----EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDD 292
H + + ++ + V +S + I A+Q+SD V L + + E +
Sbjct: 317 RHPNASKQSISGKFSSKFVTCVISGDTEGNIDVNAYQVSDTLVALQEANFTEP----SRN 372
Query: 293 PKLSKMKKDVVVGGKDVKEVDNDF---FLVVVK-------------------ILDHQGPL 330
P L ++ KD + + V EV F + VVVK + PL
Sbjct: 373 PSLMRV-KDSIPHERYVPEVFYKFKNEYNVVVKQSAKPTFPVEYLLINVTHGFPHNPSPL 431
Query: 331 ---SSTFPIENRT--TQVTMRALKSHLNRSPSLPLVKR-ISDFHLLLFL 373
+S+FP+ENR + +M L +N + + +K ++DFHLL F+
Sbjct: 432 FNSTSSFPVENREGLSHQSMSELAKVINSADNEEALKNALADFHLLCFI 480
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 8/86 (9%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
MLLRIRS++G+ RV VD T L I L+I + T++ ++ +P+
Sbjct: 1 MLLRIRSKEGMHRVQVDNGE-TFWTLAQKISELLKIE-DPSTIAMGRD----PNPATAAS 54
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY 86
+ +A D+ + S N+ HG IV++SY
Sbjct: 55 LSQLA--DKTIESANLKHGDIVYVSY 78
>gi|68474665|ref|XP_718540.1| hypothetical protein CaO19.9970 [Candida albicans SC5314]
gi|46440312|gb|EAK99619.1| hypothetical protein CaO19.9970 [Candida albicans SC5314]
Length = 631
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 69/134 (51%), Gaps = 21/134 (15%)
Query: 137 ADAF--QQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
AD+F ++N V+R G++YG + ++ V I EPPQ + + +L
Sbjct: 300 ADSFILNDFINVWRVSGVQRFGYLYGRYAKSEKTPLGIKAIVETIIEPPQHDELDGITLL 359
Query: 189 -RDEEEEKLVDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQAVE 236
D++EEK+VD +A G+ KVG IFT + + + KD Y L+N E++ A +
Sbjct: 360 DWDQQEEKMVDQVANKFGLYKVGIIFTDLTDAGTKNGKVLCKRHKDSYFLTNLEIIMAAK 419
Query: 237 FHAEC-NMEEWVTA 249
F + N+ ++ TA
Sbjct: 420 FQLKYPNISKYSTA 433
>gi|238878941|gb|EEQ42579.1| protein NPL4 [Candida albicans WO-1]
Length = 593
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 69/134 (51%), Gaps = 21/134 (15%)
Query: 137 ADAF--QQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
AD+F ++N V+R G++YG + ++ V I EPPQ + + +L
Sbjct: 262 ADSFILNDFINVWRVSGVQRFGYLYGRYAKSEKTPLGIKAIVETIIEPPQHDELDGITLL 321
Query: 189 -RDEEEEKLVDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQAVE 236
D++EEK+VD +A G+ KVG IFT + + + KD Y L+N E++ A +
Sbjct: 322 DWDQQEEKMVDQVANKFGLYKVGIIFTDLTDAGTKNGKVLCKRHKDSYFLTNLEIIMAAK 381
Query: 237 FHAEC-NMEEWVTA 249
F + N+ ++ TA
Sbjct: 382 FQLKYPNISKYSTA 395
>gi|328770163|gb|EGF80205.1| hypothetical protein BATDEDRAFT_25045 [Batrachochytrium
dendrobatidis JAM81]
Length = 587
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 119/292 (40%), Gaps = 63/292 (21%)
Query: 138 DAFQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVL------ 185
D F Q+ + A +R G +YG V V+ IYEP Q+ + + +
Sbjct: 214 DNFLQFWRSSGA---QRVGILYGKYEPYAEVPLGVKAVVSAIYEPLQEDSHDTIQLAVTS 270
Query: 186 -YILRDEEEEKLVDAIAAGLGMKKVGFIFTQ-----------TIMQNKKDYTLSNREVLQ 233
+ + + VDA+A LG+ +VG I+T ++ Y LS+ E++
Sbjct: 271 SHSILNNATSATVDAVATSLGLVRVGIIYTDLTDDGSGNGTVVCKRHADSYFLSSAEIVF 330
Query: 234 AVEFHAECNMEEWVTAV-------VKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE 286
A + M +A+ V +S I ++Q+S+MC + ++ E
Sbjct: 331 AGVNQEQHPMVTPYSALHTFGSRFVTCVISGNSEGGIDIFSYQVSNMCAAMVRDNIIEAS 390
Query: 287 -------IAEGDDPKLS-------KMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSS 332
+AE D + K + +++V ++ LV V H P +
Sbjct: 391 VEPSLMRVAESTDKQYVPEVFYKFKNEYNIMVKNAAKPTFPVEYLLVTVT---HGFPSTP 447
Query: 333 T--------FPIENRT---TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
+ FPIENR + M ALK HL + + L+ +SDFH+LLF+
Sbjct: 448 SPTFTNTKAFPIENRVEMGSAQDMHALKQHLTFTENH-LLDLLSDFHVLLFI 498
>gi|448101102|ref|XP_004199483.1| Piso0_001263 [Millerozyma farinosa CBS 7064]
gi|359380905|emb|CCE81364.1| Piso0_001263 [Millerozyma farinosa CBS 7064]
Length = 566
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 57/121 (47%), Gaps = 20/121 (16%)
Query: 137 ADA--FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYIL 188
ADA ++++ A V+R G MYGT ++V V IYEPPQ G + L +L
Sbjct: 241 ADATILNKFLDSWRASGVQRFGIMYGTYEPFEKVPLGTKAVVQAIYEPPQAGEVDGLTLL 300
Query: 189 RDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEF 237
E E + VD+ AA LG+ KVG +FT D Y LS EVL A F
Sbjct: 301 EWENESQ-VDSAAAALGLYKVGMVFTDLTDSGTGDGSVICKRHTDTYFLSCLEVLMASRF 359
Query: 238 H 238
Sbjct: 360 Q 360
>gi|367042586|ref|XP_003651673.1| hypothetical protein THITE_2112228 [Thielavia terrestris NRRL 8126]
gi|346998935|gb|AEO65337.1| hypothetical protein THITE_2112228 [Thielavia terrestris NRRL 8126]
Length = 603
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 128/316 (40%), Gaps = 60/316 (18%)
Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
+R G +YG LE V V IYEPPQ +EV I L + E+ V+ +A
Sbjct: 285 GTQRLGILYGKYLEYDVVPLGIKAVVEAIYEPPQ--IDEVDGISLNPWDNEQEVNQVAKF 342
Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNMEEWV---- 247
G+++VG I+T + K D Y L+++E+ A A+ +W
Sbjct: 343 CGLEQVGVIWTDLLDAGKGDGSAVCKRHADSYFLASQEIAFAARMQAQHPKPSKWSDTGR 402
Query: 248 --TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG---------DDPKLS 296
+ V VS I A+QMS+ V + + G E G DD +S
Sbjct: 403 FGSNFVTCVVSGNEKGEISIAAYQMSNDAVEMVRAGIVEPSADPGLMLVREEEEDDGSVS 462
Query: 297 KMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLS-------STFPIENR- 339
+ + + + G +V+E F V + L H P + S FPIENR
Sbjct: 463 RTRYIPEVFYRRINEYGANVQENAKPAFPVEYLFVTLTHGFPEAPKPLFQDSGFPIENRE 522
Query: 340 ----TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
+V+ A +N+ +SDFHLL F+ + L D AL V Q
Sbjct: 523 HVGEAQEVSAVAKTLKVNQKDD---NLHLSDFHLLCFIHQMGVLAKDEEALLCRVATQHD 579
Query: 396 VPEGYKLLINSMAETV 411
+ E ++L + +T+
Sbjct: 580 LAESFQLRSTTGWQTL 595
>gi|342180158|emb|CCC89634.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 422
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 62/279 (22%), Positives = 115/279 (41%), Gaps = 49/279 (17%)
Query: 138 DAFQQYVNETLAFAVKRGGFMYGTVLEDK-RVEVNFIYEPPQQGTEEVLYILRDEEEEKL 196
+ F+ +V + ++++R G +YGT +EV+ IYEP Q+G + +L+D +
Sbjct: 102 NVFESFVRQ-CNWSLQRYGVLYGTYNATTFTIEVHAIYEPEQEGNTQTFTVLQD-DRIAT 159
Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFHAECNMEEWVTAVVKLEVS 256
VD +A LG+++VG + T ++ + LS RE+L + +E V V V+
Sbjct: 160 VDLVAKQLGLRRVGVVCTHG-PRDTSEMALSARELLLCAREQSRFG-DECVILTVSPNVT 217
Query: 257 EEGSAAIHFEAFQMSDMCVRLFKEGWF-ETEIAEGDDPKLSKMKKDVVVGGKDVKEVDND 315
+ +A+Q S V L+ G E +A+ D + + G
Sbjct: 218 ---TGRTECQAWQASPQAVHLYSLGLLAENPVADVKDDNRGETSEAGTSGSTAAASGATG 274
Query: 316 FFL--------------------VVVKILDHQGPLSSTFPIENRT--------------- 340
F+ VV K H+G + + R
Sbjct: 275 RFVYSSIPLEVAQEQRDASGHPQVVTKAPSHKGRNLLVYRLHRRAVISPVVRNSFVRISR 334
Query: 341 ---TQVTMRALKSHLN--RSPSLPLVKRISDFHLLLFLA 374
+ T L+++L + + +++R++DFH+L+FLA
Sbjct: 335 PGMSPPTRENLRTYLEDPKRKGMKMLQRLADFHVLVFLA 373
>gi|299116579|emb|CBN74767.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 478
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 63/143 (44%), Gaps = 22/143 (15%)
Query: 266 EAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVVV--GGKDVKEVDNDFFLVVVKI 323
EAF +S+ V++F EG E P ++ V G K+ + + V I
Sbjct: 319 EAFSVSNQTVQMFSEGVLGHHQPE---PTAGRVATTAAVKEGSKETTTPETLGLICNVAI 375
Query: 324 LDHQGPLSSTFPIENRTTQVTMRALKSHL---------------NRSPSLPLVKRISDFH 368
+ H+G L + FP NR+ ++ LK L + PS +KR+SDF
Sbjct: 376 VQHEGVLQTRFPSANRSKAASLSDLKECLFGKIGSKAAAKNQKKQKQPSF--LKRLSDFQ 433
Query: 369 LLLFLARFLDLNSDVPALAQCVQ 391
LLLFL+R + + D L VQ
Sbjct: 434 LLLFLSRQMGFSRDFEELCSVVQ 456
>gi|171680155|ref|XP_001905023.1| hypothetical protein [Podospora anserina S mat+]
gi|170939704|emb|CAP64930.1| unnamed protein product [Podospora anserina S mat+]
Length = 547
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 123/309 (39%), Gaps = 63/309 (20%)
Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
+R G +YG +E + V V IYEPPQ +EV + L E E+ ++ IA
Sbjct: 228 GAQRLGILYGRYMEYEVVPLGIKAVVEAIYEPPQ--IDEVDGVSLNAWENEQEINQIAKF 285
Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNMEEWVTA-- 249
G+++VG I+T + K D Y L+++EV A A+ +W A
Sbjct: 286 CGLEQVGVIWTDLLDAGKGDGSAVCKRHADSYFLASQEVCFAARMQAQHPKPSKWSDAGR 345
Query: 250 ----VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLS 296
V VS I A+QMS+ V + + E E DD +S
Sbjct: 346 FGSNFVTCVVSGNEQGEISISAYQMSNDAVEMVRADIIEPSADPTQMLVREEEEDDGSVS 405
Query: 297 KMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENR- 339
+ + + + G +V+E F V + L H P S FPIENR
Sbjct: 406 RTRYIPEVFYRRINEYGVNVQENAKPAFPVEYLFVTLTHGFPESPKPLFTDEGFPIENRE 465
Query: 340 ------TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQ 393
+AL+ + LP +SDFHLL ++ + L+ D AL V
Sbjct: 466 YVGEAQQPSAVAKALRVNQKGDNRLP----VSDFHLLCYIHQMSVLSKDEEALLCKVATH 521
Query: 394 TAVPEGYKL 402
+ E ++L
Sbjct: 522 QDLAEAFQL 530
>gi|167516094|ref|XP_001742388.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163779012|gb|EDQ92626.1| predicted protein [Monosiga brevicollis MX1]
Length = 932
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 125/302 (41%), Gaps = 77/302 (25%)
Query: 138 DAFQQYVNETLAFAVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRDE 191
D +++ +R G++YG ++V + IYEPPQ GT + L L+D+
Sbjct: 238 DILNRFLEPYRKTGAQRVGYLYGRYAPYEQVPLGIQAVVAAIYEPPQVGTADGLE-LQDD 296
Query: 192 EEEKLVDAIAAGLGMKKVGFIFT---------------QTIMQNKKDYT--LSNREVLQA 234
E + LV+ +AA LG+++VG+IFT + + D T L+ E+L A
Sbjct: 297 EHQMLVEQMAAALGLQRVGWIFTDLEAEDLAAGTVKHKRYVTHEPDDDTTVLTAPEILTA 356
Query: 235 VEFHAEC-------------NMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEG 281
F + ++VT V VS + + I +A+Q+S + L ++
Sbjct: 357 AMFQQSYPNPVPLRYNDTGFHGSKFVTVV----VSGDDTNMISPKAYQVSTQAMALTRDD 412
Query: 282 WFE-TEIAE-------GDD---PKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILD----- 325
T+ A+ D+ P + ++KD G +K N F V +D
Sbjct: 413 ILRPTDRADMMAVKPRTDEVYVPDIFYVEKDKY--GNQIKTPANPVFPVEYLFVDMEAGA 470
Query: 326 --HQGPL----SSTFPIENR-------TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLF 372
QG S+ FPIE+R Q R + R +L + DFHLL F
Sbjct: 471 GTGQGTFVNNPSAPFPIEHREAIGEGQNMQALGRYFAAGGARDGNL-----LRDFHLLFF 525
Query: 373 LA 374
LA
Sbjct: 526 LA 527
>gi|340052592|emb|CCC46874.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 416
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 70/293 (23%), Positives = 122/293 (41%), Gaps = 50/293 (17%)
Query: 121 RQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLE-DKRVEVNFIYEPPQQ 179
R E V D+ + FQ + T + V+R +YG K VEV+ IYEP Q
Sbjct: 92 RWEKNRTNLVLVDQTSINIFQTFCR-TCGWCVQRYALLYGRYDNATKTVEVHAIYEPKQH 150
Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVE--- 236
G + +D E + VD +A GL +++VG I T ++ L+ RE+L
Sbjct: 151 GDARTFHFEKD-ENIRTVDMVAEGLELRRVGAICTHP-KRDSNAVVLTAREMLLCAREQS 208
Query: 237 -FHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWF-ETEIAEG--DD 292
F EC V+ ++ H A+Q+S + L++ G+ E+ A G D+
Sbjct: 209 IFGDEC--------VLLTVYHDDVEDRTHCSAWQVSTQAIELYRNGFLSESNGARGPDDE 260
Query: 293 P---------------------KLSKMKKDVVVGGKDV-----KEVDNDFFLVVVKILDH 326
P ++++ KD G + V E+D +F+ + I +
Sbjct: 261 PATTDEQTMWEREARLCSEIELEVAETVKDDKGGCRSVIKKMHHEIDVTWFVSYIGIKEF 320
Query: 327 QGPLSSTFPIENRTTQV---TMRALKSHLN--RSPSLPLVKRISDFHLLLFLA 374
+ + + T LK +L+ + + +++ +DFH+L+FLA
Sbjct: 321 DSNVVRNLFVRLSRPGMDPPTYENLKIYLDDPKRRDMRRLEKYADFHVLVFLA 373
>gi|154338814|ref|XP_001565629.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134062681|emb|CAM39124.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 427
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 72/341 (21%), Positives = 140/341 (41%), Gaps = 68/341 (19%)
Query: 130 VSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLED--KRVEVNFIYEPPQQGTEEVLYI 187
V D + FQ +V + + + ++R +YG + +D +EV+ +YEP Q G+
Sbjct: 95 VKVDAQSINIFQSFVAQ-MDWKLQRYAVLYG-LYDDATHTIEVHAVYEPEQHGSTYAFDP 152
Query: 188 LRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVE----FHAECNM 243
L D +K V+ IA LG+++VG T M++ + L+ RE+L + + EC +
Sbjct: 153 LPDAHMDK-VEKIAKALGLRRVGVACTHP-MRDPEHILLNYRELLLCTKEQSRYGDECAL 210
Query: 244 --------------------EEWVT--AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEG 281
E +T A + E + + +A+Q S CV L++ G
Sbjct: 211 LTVAPAAMPSTESSGNTSAPGELLTDSAAAASGATRESTIVVSCQAWQTSPQCVHLYRLG 270
Query: 282 WFET-----------------------EIAEGD-DPKLSKMKKDVVVGGKDVKEVDNDFF 317
+ E+A+ + DP + V E+D +F
Sbjct: 271 VLQKPPGGEEALQDVEKARQVHCTMPLEVAQTETDPSGHRR----FVTKSPSTEIDTRWF 326
Query: 318 LVVVKILDHQGPL-SSTFPIENRT--TQVTMRALKSHLN--RSPSLPLVKRISDFHLLLF 372
+ + P+ F +R ++ L++++N + +P +RI+DFH+L++
Sbjct: 327 TSYIAVQQFVSPIVRGAFMRLSRPGMPPPALQNLRNYMNDPKRKGMPFAERIADFHVLVY 386
Query: 373 -LARFLDLNSDVPALAQCVQAQTAVPEG--YKLLINSMAET 410
L + + ++ AL + + E Y+ ++ M T
Sbjct: 387 LLTQIFTTDDELRALCSVARTKMMTEEAANYQAILQGMMPT 427
>gi|354543615|emb|CCE40336.1| hypothetical protein CPAR2_103740 [Candida parapsilosis]
Length = 577
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 56/115 (48%), Gaps = 18/115 (15%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
++N V+R G++YG + ++V V I EPPQ + + L D +
Sbjct: 254 LNDFINVWRVSGVQRFGYLYGRYEKFEKVPMGIKAVVEAIVEPPQSDELDGI-TLSDWPD 312
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEF 237
E+LVD IAA LG+ KVG +FT +KD Y L+N E+L A +F
Sbjct: 313 EQLVDEIAAKLGIYKVGIVFTDLTDSGQKDGTVLCKRHKDSYFLTNLEILMAAKF 367
>gi|367020574|ref|XP_003659572.1| hypothetical protein MYCTH_2296798 [Myceliophthora thermophila ATCC
42464]
gi|347006839|gb|AEO54327.1| hypothetical protein MYCTH_2296798 [Myceliophthora thermophila ATCC
42464]
Length = 597
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 86/322 (26%), Positives = 134/322 (41%), Gaps = 72/322 (22%)
Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
+R G +YG LE V V IYEPPQ +E+ + L E E+ V+A+A
Sbjct: 279 GTQRLGILYGRYLEYDVVPLGIKAVVEAIYEPPQ--VDEIDGVTLGAWENEQEVNAVAKL 336
Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHA-ECNMEEW----- 246
G+++VG I+T + D Y L+++EV+ A A +W
Sbjct: 337 CGLEQVGVIWTDLLDAGNGDGTAICKRHADSYFLASQEVVFAARMQARHPKPSKWSDTGR 396
Query: 247 -----VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDD 292
VT VV +EEG +I A+QMS+ V + + E E +D
Sbjct: 397 FGSNFVTCVVS--GNEEGEISI--SAYQMSNDAVEMVRADIIEPSADPTLMLVREEEEED 452
Query: 293 PKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPI 336
+S+ + + + G +V+E F V + L H P S FPI
Sbjct: 453 GSVSRTRYIPEVFYRRINEYGANVQENAKPAFPVEYLFVTLTHGFPETPKPVFTDSGFPI 512
Query: 337 ENRT-------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
ENR +ALK + + +LP +SDFHLL F+ + L+ + AL
Sbjct: 513 ENREYLGEAQEVSAVSKALKVN-QKGDALP----VSDFHLLCFIHQMSVLSKEEEALLCR 567
Query: 390 VQAQTAVPEGYKLLINSMAETV 411
V Q + E ++L + +T+
Sbjct: 568 VATQHDLAESFQLRSTTGWQTL 589
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 50/94 (53%), Gaps = 11/94 (11%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQ--TLSTNQNLLLAKSPSDL 58
MLLR+RS DG+ R++ D + A+L + QL V+ Q TLS + + AK ++
Sbjct: 1 MLLRLRSPDGMFRLSADKDE-SFADLIGKLVPQLPSTVDPQSITLSNHPSGGDAKKIGEI 59
Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSYDGERNV 92
+F +S + +SHG ++F+SY + V
Sbjct: 60 ARFK--------ISQVGLSHGDLIFVSYKHQDTV 85
>gi|378728298|gb|EHY54757.1| hypothetical protein HMPREF1120_02922 [Exophiala dermatitidis
NIH/UT8656]
Length = 597
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 80/292 (27%), Positives = 118/292 (40%), Gaps = 63/292 (21%)
Query: 137 ADAFQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRD 190
AD + ++ +R GF+YG E V V IYEPPQ + L +L D
Sbjct: 267 ADMVDKVIDFWRKSGCQRLGFLYGRYEEYTEVPLGIKAVVEAIYEPPQMCEVDGLTLL-D 325
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQAVEFHA 239
+ EK VD +A G+++VG IFT Q + + D Y LS+ E+ A F A
Sbjct: 326 WDNEKDVDEMARLCGLERVGVIFTDLMSPEDGEGQAVCKRHIDSYFLSSLEIAFASRFQA 385
Query: 240 EC-NMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDD 292
+ +W + V V+ + +I A+Q S+ V + + E +
Sbjct: 386 KYPKPSKWSDTGRFGSNFVTCVVTGDEDQSITINAYQASNTAVEMVRADIVEPSA----E 441
Query: 293 PK--LSKMKKDVVVG-----------------GKDVKEVDNDFFLVVVKILD-------H 326
P L + ++D VVG G +V+E F V ++
Sbjct: 442 PSVMLVQSEEDEVVGARTRYIPEVFYRRINEYGANVQENAKPAFPVEYLLVSLTAGMPIK 501
Query: 327 QGPL--SSTFPIENRTTQVT---MRALKSHLNRSPSLPLVKRISDFHLLLFL 373
PL ++ FP+ENR R + L S S V ISDFHLL FL
Sbjct: 502 PNPLFKNNKFPVENREAVAEGQDPREVAKLLRSSQS---VDAISDFHLLCFL 550
>gi|149603388|ref|XP_001513178.1| PREDICTED: nuclear protein localization protein 4 homolog, partial
[Ornithorhynchus anatinus]
Length = 333
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 11/104 (10%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEE 193
F + L +R G+MYG +ED R IYEPPQ+ IL+D+
Sbjct: 196 FANFWMRDLQMITQRHGWMYGYYIEDAHYGRGIRAVCEAIYEPPQRDINGYGEILQDDFL 255
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEF 237
+VD IA+ LG++++G IFT +D L+ EVLQA +
Sbjct: 256 -NVVDTIASKLGLERIGQIFTHL----PRDSYLTPTEVLQAAKL 294
>gi|344305066|gb|EGW35298.1| nuclear protein localization protein 4 [Spathaspora passalidarum
NRRL Y-27907]
Length = 563
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 64/128 (50%), Gaps = 19/128 (14%)
Query: 140 FQQYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEE 193
+++N + V+R G++YG +V V IYEPPQ G + + +L ++E
Sbjct: 239 LNEFLNGWRSTGVQRFGYLYGRYEAFDKVPLGIKAVVESIYEPPQVGEFDGVTLLNWDDE 298
Query: 194 EKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEF-HAEC 241
++ +DAIA+ LG+ KVG +T +KD Y L+N E + A F H
Sbjct: 299 QQ-IDAIASKLGLTKVGIAYTDLTDSGRKDGTVLCKRHKDSYFLTNLECIMAARFQHKYP 357
Query: 242 NMEEWVTA 249
N+ ++ ++
Sbjct: 358 NVTKYTSS 365
>gi|296421545|ref|XP_002840325.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636540|emb|CAZ84516.1| unnamed protein product [Tuber melanosporum]
Length = 675
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 78/312 (25%), Positives = 126/312 (40%), Gaps = 53/312 (16%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R G++YG + V V IYEPPQ G + + + EK V +A+ G+
Sbjct: 284 QRIGYLYGRLAPYPEVPLGIKAIVEAIYEPPQNGESDGVTLQLPWTGEKDVAEVASLCGL 343
Query: 207 KKVGFIF-----------TQTIMQNKKDYTLSNREVLQAVEFHAE-CNMEEWV------T 248
+K+G IF T ++ Y LS+ E++ A AE + +W +
Sbjct: 344 RKIGVIFSDLLDDGTGQGTVVCKRHIDSYYLSSLEIVFAARLQAEHPHPSKWSETGQFGS 403
Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE------IAEGDDPKLSKMK--- 299
+ V VS I A+Q+S+ V + + E + +D K +
Sbjct: 404 SFVTCVVSGNEDNGIDVAAYQVSNTAVEMVRADIIEPSADPSVMMVHEEDGKTRYVPEVF 463
Query: 300 -KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSS---------TFPIENR---TTQVT 344
+ + G +VKE F V ++ L H P + FPI NR
Sbjct: 464 FRHINEYGLNVKENAKPSFPVEYLLVTLTHGFPNDTKPIFTAPLGKFPIGNREIMGISQD 523
Query: 345 MRALKSHL---NRSPSLPL--VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEG 399
RA+ L ++S + P + +SDFHLL F+ + L+ + L V V +G
Sbjct: 524 HRAVAKQLGITSQSANRPTDGIDVVSDFHLLTFIKGWGVLDKNEEELLAKVAVGHDVNDG 583
Query: 400 YKLLINSMAETV 411
YKLL +T+
Sbjct: 584 YKLLQTPGWQTL 595
>gi|312375034|gb|EFR22482.1| hypothetical protein AND_15202 [Anopheles darlingi]
Length = 333
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/206 (23%), Positives = 95/206 (46%), Gaps = 30/206 (14%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
+ RQ+ H ++V F+ ++++N +R G++YGT V R V
Sbjct: 92 LNRQQYRHVDNVMFEN--TGIVERFLNYWRTTGHQRIGYLYGTYEIHPDVPLGIRARVAA 149
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNK 221
IYEP Q+ + + +L D ++ VD +A LG+++VG+IFT + ++N
Sbjct: 150 IYEPQQESNRDTIRLL-DGADDGDVDEMAKLLGLQRVGWIFTDLLSENLANGTVKHVRNI 208
Query: 222 KDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
K + L+++E + A + C + + V + V+ + +H E + +S C
Sbjct: 209 KTHFLTSQECILAGHLQNKHPNRCKYSSNGYFGSKFVTVCVTGDEKKQVHMEGYAVSAQC 268
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKK 300
+ L ++ I D P+L +++
Sbjct: 269 MALVRDNCL---IPTKDAPELGYIRE 291
>gi|401423393|ref|XP_003876183.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322492424|emb|CBZ27698.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 631
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 69/301 (22%), Positives = 126/301 (41%), Gaps = 65/301 (21%)
Query: 130 VSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLED--KRVEVNFIYEPPQQGTEEVLYI 187
V D + FQ +V + + + ++R +YG + +D +EV+ +YEP Q G+
Sbjct: 299 VKVDAQSINIFQSFVAQ-MDWRLQRYAILYG-LYDDATHTIEVHAVYEPEQHGSTYTFEP 356
Query: 188 LRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVE----FHAECNM 243
L D +K V+ IA LG+++VG T M++ +D L+ RE+L + + EC +
Sbjct: 357 LPDAHLDK-VEKIAKALGLRRVGVACTHP-MRDPEDILLNFRELLLCAKEQSRYGDECAL 414
Query: 244 ---------------------EEWVTAVVKLE-VSEEGSAAIHFEAFQMSDMCVRLFKEG 281
E ++V L + E + + +A+Q S CV L++ G
Sbjct: 415 LTVAPAAMPSTKSGDTTSAPGEFPASSVAALSGATPEPTIVVSCQAWQASPQCVHLYRLG 474
Query: 282 WFET-----------------------EIAEGD-DPKLSKMKKDVVVGGKDVKEVDNDFF 317
+ E+A+ D DP S ++ V E+D +F
Sbjct: 475 VLQESPGGEAALQGVEQARQVHCTMPLEVAQTDTDP--SGHQRFVTTAPS--TEIDTRWF 530
Query: 318 LVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLN--RSP---SLPLVKRISDFHLLLF 372
+ + + P+ + + AL++ N R P S+ + I+DFH+L++
Sbjct: 531 TSYIAVQQFESPIVRGAFMRLSRPGMPPPALQNLRNYIRDPKRKSMSFAETIADFHVLVY 590
Query: 373 L 373
L
Sbjct: 591 L 591
>gi|324508917|gb|ADY43759.1| Nuclear protein localization protein 4 [Ascaris suum]
Length = 507
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 74/316 (23%), Positives = 135/316 (42%), Gaps = 70/316 (22%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG----GFMYGT------VLEDK 166
+ + RQ H +++S + ++ VN LAF K G G++ G V
Sbjct: 117 LTLNRQRFRHVDNISIEN------KEVVNRFLAFWRKSGNQRVGYLIGRYEPFPDVPLGI 170
Query: 167 RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQ----------- 215
+ V IYEPPQ + E + ++ EKLVD + + L +++VG+IFT
Sbjct: 171 KATVAAIYEPPQSSSSESVK-FEEDPNEKLVDELCSYLNLRRVGWIFTDLWSADSAKGTV 229
Query: 216 TIMQNKKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAF 268
++K + LS +E + A F + C+ + + L S + S I+F +
Sbjct: 230 HCTRHKDSFFLSAKECITAGYFQNKYPNVTTFCSDRYFGSKFTTLVASGDQSEQINFNGY 289
Query: 269 QMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK----------DV---------VVGGKDV 309
Q+S+ C + + + P+L+ +++ DV V KD
Sbjct: 290 QVSNQCAAMVEANI----LCPTSHPELAYLRQTPLYPAHYITDVHFMDKNEYGVETMKDG 345
Query: 310 KEVDNDFFLVVVKILDHQGP------LSST---FPIENRTTQVTMR---ALKSHLNRSPS 357
+ + ++ LV V + P LSS FP ENR ++ A+ +++++ +
Sbjct: 346 RPMPVEYLLVDVPAGMPKEPHATFHILSSKGYPFPNENRDIIGELQDTNAIGAYISQFSA 405
Query: 358 LPLVKRISDFHLLLFL 373
++ ++FH LLFL
Sbjct: 406 NQFLELATNFHFLLFL 421
>gi|302925668|ref|XP_003054140.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256735081|gb|EEU48427.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 645
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 76/304 (25%), Positives = 121/304 (39%), Gaps = 56/304 (18%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G++YG E V V IYEPPQ +E+ I + E +K +DA+A G
Sbjct: 270 QRLGYLYGKYAEYTEVPLGVKAVVEAIYEPPQ--VDELDGITMNAWESQKEIDAVAKHCG 327
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEWV------ 247
++ VG I+T + D Y LS+ E+ + A+ +W
Sbjct: 328 LEPVGVIWTDLLDAGNGDGSVICRRHADSYFLSSLEITFSSRLQAQYPKSSKWSDTGSFG 387
Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
+ V ++ I ++QMS+ V + + E E DD +S+
Sbjct: 388 SNFVTCIITGNEDGEISISSYQMSNESVEMVRADIVEPSADPNVMLVRDEEEDDGSVSRT 447
Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENR--- 339
+ + V G +V+E F V + L H P FPIENR
Sbjct: 448 RYIPEVFYRRVNEYGANVQENAKPSFPVEYLFVTLTHGFPAEPKPMFTQQGFPIENREYI 507
Query: 340 TTQVTMRALKSHLNRSPS-LPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPE 398
AL + SP P + +S+FHLL F+ + L+ D AL V +Q + +
Sbjct: 508 GESQEHSALSKFVKSSPGKQPTGQEVSNFHLLCFIHQMSVLSKDEEALLCRVASQHDLAD 567
Query: 399 GYKL 402
++L
Sbjct: 568 SFQL 571
Score = 38.5 bits (88), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 11/88 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQT--LSTNQNLLLAKSPSDL 58
MLLR+R DG+ R+TV+ T +L + QL V+ T LS N +K +D+
Sbjct: 1 MLLRLRGPDGMARLTVEPT-TTFGDLGKQLLDQLPKTVDPTTISLSNAPNGGDSKRLADI 59
Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
+ F + + + HG ++FL+Y
Sbjct: 60 INFK--------VGQIGLKHGDLIFLTY 79
>gi|385303973|gb|EIF48012.1| nuclear protein localization protein [Dekkera bruxellensis
AWRI1499]
Length = 508
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 72/285 (25%), Positives = 123/285 (43%), Gaps = 50/285 (17%)
Query: 137 ADAFQQYVNETLAFAVKRGGFMYGTVLEDKRV------EVNFIYEPPQQGTEEVLYILRD 190
++ ++N ++R G M+G+ +V +V IYE PQ E+ + + +
Sbjct: 196 SEIVNNFINAWRLSGMQRIGLMFGSYARYDKVPLGIKAKVQTIYELPQVDREDGVMLQKW 255
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHA 239
++E+K++ +A+ L + VG IFT + K D + LS+ E++ AV++
Sbjct: 256 DDEDKVL-KLASELNLVPVGIIFTDLMDAGKGDGSVICKRHKSSFFLSSLELIFAVKWQL 314
Query: 240 EC-NMEEW------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE------ 286
+ N+ W + V +S S I EA+Q S+ L K
Sbjct: 315 KFPNICHWSSSGKFSSKFVTCVISGNPSGEIDMEAYQASESAEGLVKADLISPSTHPSQV 374
Query: 287 -IAEGDDPKLSK--MKKDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------- 333
+ E +B + M V G +VK+ F V ++ L H P S+
Sbjct: 375 FVNEQNBKRYVPDIMYNMVNEYGLEVKKNGKPTFPVEYLLVSLTHGFPEKSSPFFKDTEK 434
Query: 334 FPIENRTT---QVTMRALKSHLNRSPSLP--LVKRISDFHLLLFL 373
FPIENR TM +K H N P+ P + R+S+FHLL ++
Sbjct: 435 FPIENRNYIDEVPTMSDIKEHFN-IPNDPAVFISRLSNFHLLXYI 478
>gi|84994858|ref|XP_952151.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65302312|emb|CAI74419.1| hypothetical protein, conserved [Theileria annulata]
Length = 549
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 76/154 (49%), Gaps = 17/154 (11%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRDEEEEKL 196
++NE L +++R G+MYG ED+ R IYEPPQ + ++ DE
Sbjct: 207 WLNE-LQMSLQRIGWMYGYYTEDQHYPYGIRAVCEAIYEPPQYSNINDVIMIEDEMLMN- 264
Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVEFH----AECNMEEW-VTAVV 251
VD IA LG++++G IFT+ + L+ RE+++ +E + + ++ +V
Sbjct: 265 VDKIAERLGLERIGMIFTRL----PNECFLTARELIRVIEMQYNSLKHIHYTGYPISTMV 320
Query: 252 KLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET 285
+S + + A+ SD+ + LF++ F +
Sbjct: 321 TCTMSPDDNGQPVLNAYMASDLALALFRDELFTS 354
>gi|170042588|ref|XP_001849002.1| nuclear protein localization [Culex quinquefasciatus]
gi|167866115|gb|EDS29498.1| nuclear protein localization [Culex quinquefasciatus]
Length = 464
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 52/230 (22%), Positives = 101/230 (43%), Gaps = 35/230 (15%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGT------VLEDKRVEVNF 172
+ RQ H ++V F+ ++++N +R G ++GT V R V
Sbjct: 244 LNRQTYRHVDNVMFEN--TGIVERFLNYWRTTGHQRIGMLFGTYEVHPDVPLGIRARVAA 301
Query: 173 IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNK----------- 221
IYEPPQ+ + + +L DE + ++ +A LG+++VG+IFT + +N
Sbjct: 302 IYEPPQESNRDSIRLLEDEHAVE-IEELARQLGLQRVGWIFTDLLAENAAAGTVKHVRGI 360
Query: 222 KDYTLSNREVLQAV----EFHAECNMEE---WVTAVVKLEVSEEGSAAIHFEAFQMSDMC 274
K + L+ E + A ++ C + + V + V+ + +H E + +S C
Sbjct: 361 KTHFLTAHECILAGHLQNKYPNRCKHASKGLFGSKFVTVCVTGDDKKQVHMEGYAVSAQC 420
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKIL 324
+ L ++ I D P+L +++ D + V + ++ VV L
Sbjct: 421 MALVRDNCL---IPTKDAPELGYIRES-----SDKQYVPDVYYKTVVPNL 462
>gi|156057431|ref|XP_001594639.1| hypothetical protein SS1G_04446 [Sclerotinia sclerotiorum 1980]
gi|190359918|sp|A7EGK5.1|NPL4_SCLS1 RecName: Full=Nuclear protein localization protein 4
gi|154702232|gb|EDO01971.1| hypothetical protein SS1G_04446 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 611
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 78/312 (25%), Positives = 128/312 (41%), Gaps = 61/312 (19%)
Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
+R G++YG E V V IYEPPQ +E+ I L E EK VD +A
Sbjct: 286 GAQRFGYLYGRYEEYTEVPLGVKAVVEAIYEPPQ--VDELDGITLNKWESEKDVDEMARL 343
Query: 204 LGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEE 245
GM++VG I+T + D Y LS+ E+ A A+ + +
Sbjct: 344 CGMERVGVIWTDLLDSGAGDGTVICKRHIDSYYLSSLEIAFAARLQAKHPKPTKWSDTGK 403
Query: 246 WVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--------AEGDDPKLSK 297
+ + V V+ + + I A+Q+S+ V + + E +EGDD +
Sbjct: 404 FGSNFVTCVVTADETGGIAISAYQVSNTAVEMVRADIVEPSADPAVMIVRSEGDDDSDTS 463
Query: 298 MK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP--------LSSTFPIENRT 340
+ + + G+ V+E F V ++ L H P ++F IENR
Sbjct: 464 ARYIPEVFYRKINEYGRSVQENAKPSFPVEYLLVTLTHGFPSDPKPAFMAKNSFTIENRL 523
Query: 341 TQVTMRALKS-----HLNRSPSLPL----VKRISDFHLLLFLARFLDLNSDVPALAQCVQ 391
+ +K L+++ L V +SDFHLL ++ L+ + AL V
Sbjct: 524 VIGQEQDIKDVGKQLGLDKNGQLTQSSDGVLAVSDFHLLCYIYSMGILSKEEMALLCRVA 583
Query: 392 AQTAVPEGYKLL 403
Q + +GY+L+
Sbjct: 584 TQHDLADGYQLI 595
>gi|219116761|ref|XP_002179175.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409066|gb|EEC48998.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 501
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 56/221 (25%), Positives = 98/221 (44%), Gaps = 18/221 (8%)
Query: 168 VEVNFIYEPPQQGTEEVLY---ILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDY 224
+V+ I+EPPQ Y LR L IAA LG+K VG+IFT + ++K
Sbjct: 241 AKVHAIWEPPQNSQPGTAYDGKALRKIPSNVL--RIAAHLGLKPVGWIFTYSDDRHKDQD 298
Query: 225 TLS--NREVLQAVEFHAECNMEEWVTA-VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEG 281
L +++V + A NM+ A + + EAFQ+SD+ V++ E
Sbjct: 299 ALPVWSKDVQHGAQLQA-FNMQRTDRAEGARFATLAMDAKVGTTEAFQLSDVAVQIVAED 357
Query: 282 WFETEIAEGD---DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPI-- 336
+ + D + ++V G++ +D LV +L H+G S +
Sbjct: 358 MLLIQNGKKDVDGGGRYVTTSHAILVDGQETTTLDCVLCLVNTALLSHEGSFSGGSAVAA 417
Query: 337 ----ENRTTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
+ T T +A+ ++++ S L++ +S+F +LL L
Sbjct: 418 VKKKDGGLTSKTRKAILAYIDGSDDAALLEILSNFAVLLAL 458
>gi|150951498|ref|XP_001387827.2| nuclear protein localization factor and ER translocation component
[Scheffersomyces stipitis CBS 6054]
gi|284018106|sp|A3GFS1.2|NPL4_PICST RecName: Full=Nuclear protein localization protein 4
gi|149388644|gb|EAZ63804.2| nuclear protein localization factor and ER translocation component
[Scheffersomyces stipitis CBS 6054]
Length = 577
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 18/102 (17%)
Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
V+R G MYG +V V IYEPPQ G + + +L E E + VDAIA+ L
Sbjct: 265 GVQRFGVMYGRYEPFDKVPLGIKAVVEAIYEPPQSGELDGITMLPWENEAE-VDAIASEL 323
Query: 205 GMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAV 235
G+ KVG +FT T++ ++K Y LSN E+L A
Sbjct: 324 GIYKVGVVFTDLTDSGQKNGTVLCKRHKDSYFLSNLEILMAA 365
>gi|430814351|emb|CCJ28392.1| unnamed protein product [Pneumocystis jirovecii]
Length = 530
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 73/299 (24%), Positives = 121/299 (40%), Gaps = 57/299 (19%)
Query: 153 KRGGFMYG------TVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R G++YG V + V IYEPPQ+ + ++ EE V+ +A G+
Sbjct: 223 QRFGYLYGRYEPYDAVPLGIKAVVEAIYEPPQENEPNGISLILPWNEESTVNEVAHLFGL 282
Query: 207 KKVGFIFTQ-TIMQNKKDYTLSNREVLQAVEFHAEC-NMEEWV------TAVVKLEVSEE 258
KKVG IFT T + K + R V+ A + N+ +W + V VS
Sbjct: 283 KKVGMIFTDLTDDDSGKGTVICKRHVVFAATMQLKHKNVTKWSKTGEFGSKFVTCVVSAN 342
Query: 259 GSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK--DVVVGGKDVKEVDN-- 314
I A+Q+S+ + L E + DP ++K DV + ++++
Sbjct: 343 KEGGIDIFAYQVSNTAMALVDAKIIEPSV----DPNTMLIRKEHDVYIPEVFFQKINQYK 398
Query: 315 -------------DFFLVVVKILDHQGPL---------SSTFPIENRTTQVTMRALKSHL 352
++ LV L H P+ S TF IENR + ++ + +
Sbjct: 399 TQVQENAKSSFPIEYLLVT---LTHGFPIKSESIFSSASGTFLIENRASIGQVQNINALA 455
Query: 353 NR------SPSLPLVKRISDFHLLLFLARFLDL--NSDVPALAQCVQAQTAVPEGYKLL 403
R P +K + DFH L +L LD+ D L +C++ + + + Y LL
Sbjct: 456 ERIKPILEQPYGDAIKALCDFHFLTYLWS-LDILDKEDGTLLIKCIK-EWQINDLYNLL 512
>gi|297798574|ref|XP_002867171.1| hypothetical protein ARALYDRAFT_913062 [Arabidopsis lyrata subsp.
lyrata]
gi|297313007|gb|EFH43430.1| hypothetical protein ARALYDRAFT_913062 [Arabidopsis lyrata subsp.
lyrata]
Length = 50
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/45 (62%), Positives = 34/45 (75%), Gaps = 8/45 (17%)
Query: 345 MRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQC 389
MR LK+H LPL SDFHLLLF+A+FLD++SDVPALA+C
Sbjct: 1 MRTLKTH----SELPL----SDFHLLLFVAQFLDVSSDVPALAEC 37
>gi|448516776|ref|XP_003867639.1| Npl4 ubiquitin-binding protein [Candida orthopsilosis Co 90-125]
gi|380351978|emb|CCG22202.1| Npl4 ubiquitin-binding protein [Candida orthopsilosis]
Length = 589
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 56/113 (49%), Gaps = 18/113 (15%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKL 196
++N V+R G++YG + +V V I EPPQ + + +L + +E+L
Sbjct: 269 FINVWRVSGVQRFGYLYGRYEKFDKVPMGIKAVVEAIVEPPQSDELDGITLL-EWPDEQL 327
Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFH 238
VD IAA LG+ KVG +FT +KD Y L+N E+L A +F
Sbjct: 328 VDEIAAKLGIYKVGVVFTDLTDSGQKDGTVLCKRHKDSYFLTNLEILMAAKFQ 380
>gi|320582084|gb|EFW96302.1| Nuclear protein localization protein [Ogataea parapolymorpha DL-1]
Length = 578
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 140/326 (42%), Gaps = 62/326 (19%)
Query: 137 ADAFQQYVNETLAFAVKRGGFMYGTVLEDKRV------EVNFIYEPPQQGTEEVLYILRD 190
++ +++N +R G M G+ ++V +V IYE PQ E+ + IL++
Sbjct: 240 SEIINEFINAWRLSGTQRIGLMLGSYDRYEKVPLGIKAKVEAIYELPQVDQEDGI-ILQN 298
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEFHA 239
EEE+ + ++ L ++ VG IFT + ++K + LS+ E + A+++
Sbjct: 299 WEEEQQILSLIRKLDLQPVGIIFTDLLDAGSGNGSVICKRHKDSFFLSSLEAIFAIKWQL 358
Query: 240 EC-NMEEW------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWF-------ET 285
+ N+ +W + V +S S I EA+Q+S+ L K E
Sbjct: 359 KFPNICKWSDSGIFSSKFVTCVISGNTSGEIDIEAYQISESGEGLVKADLISPSTHPNEV 418
Query: 286 EIAEGDD----PKLSKMKKDVVVGGKDVKEVDNDFFLV--VVKILDHQGP---------- 329
I E +D P++ K + G VK+ F V ++ L H P
Sbjct: 419 YINEQNDERYVPEIFYQKINEY--GLQVKQHATPSFPVEYLLVSLTHGFPERSSPFFKAG 476
Query: 330 LSSTFPIENRT---TQVTMRALKSHL---NRSPSLPLVKRISDFHLLLFLARFLDLNSD- 382
S+ FPIENR +M LK++L N L +S+FHLL++ D+ S
Sbjct: 477 ASNKFPIENRAYIGESASMPDLKNYLSSINGDDMSVLATLLSNFHLLVYFTLNQDILSGH 536
Query: 383 -----VPALAQCVQAQTAVPEGYKLL 403
V + + + A+P+ YKL+
Sbjct: 537 EYELLVEITRKIGKGEEALPDCYKLV 562
>gi|221054019|ref|XP_002261757.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193808217|emb|CAQ38920.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 529
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 58/119 (48%), Gaps = 5/119 (4%)
Query: 297 KMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRSP 356
K K DV++ VK +D F + V IL H+ ++ FP N + T+ ++++RS
Sbjct: 406 KCKSDVLIEKIQVKHIDVLFCVNNVPILSHKSVYNTFFPYPNNSNYYTILQRFNYMSRSL 465
Query: 357 SLP--LVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQT---AVPEGYKLLINSMAET 410
P +V DF+ LLFL D+P + + + +P+ Y ++ S++++
Sbjct: 466 HNPKDMVSIFRDFNFLLFLTNIFSFEHDIPPICKAINDSNNSLGIPDQYVSILRSLSQS 524
>gi|255730104|ref|XP_002549977.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240133046|gb|EER32603.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 581
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 53/112 (47%), Gaps = 18/112 (16%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKL 196
++N V+R G++YG + ++ V I EPPQ + + +L D EEEKL
Sbjct: 265 FINIWRLSGVQRFGYLYGHYEKSEKSPLGIKAVVETIVEPPQHDELDGITLL-DWEEEKL 323
Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEF 237
VD +AA G+ KVG FT KD Y L+N E++ A F
Sbjct: 324 VDELAAQFGIYKVGITFTDLTDSGNKDGTVLCKRHKDSYFLTNLEIIMASRF 375
>gi|406603385|emb|CCH45063.1| Nuclear protein localization protein 4 [Wickerhamomyces ciferrii]
Length = 566
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/310 (24%), Positives = 125/310 (40%), Gaps = 67/310 (21%)
Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
+R GF+YGT + +V V IYEP Q + L + E+E+ +D +A
Sbjct: 251 GTQRIGFLYGTYQKYDKVPLGIKAVVEAIYEPTQHDENDGLTLGLPWEDEERIDNLAKSA 310
Query: 205 GMKKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEFHAE-------CNMEEW 246
G+ K+G IF+ I ++K Y LS+ EV+ A + ++ ++
Sbjct: 311 GLYKIGVIFSDLIDAGNGDGSVICKRHKDSYFLSSLEVIFAAKLQSKYPNYTQFSQSGKF 370
Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG-------------DD- 292
+ V VS I ++Q+S+ EG + +I G DD
Sbjct: 371 SSKFVTCVVSGNLQGEIDIASYQVSEGA-----EGLVKADILTGSTHPSFAYINETNDDR 425
Query: 293 -------PKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPL-------SSTFPIEN 338
K+++ K V K VD +L+V L H P S+ FPIEN
Sbjct: 426 YVPDIFYTKMNEYKLQVKSNAKPAFPVD---YLLVT--LSHGFPQEPNPVFKSNKFPIEN 480
Query: 339 R----TTQVTMRALKS-HLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQ 393
R +Q + K ++ + ++ ISDFHLL +L L+ L + +
Sbjct: 481 RQAAGISQDFLEIKKQLKIDSFDTNDVINSISDFHLLAYLFSLDILSPSEQKLVVKIAVE 540
Query: 394 TAVPEGYKLL 403
+ + Y+LL
Sbjct: 541 HRLEDAYRLL 550
>gi|406867683|gb|EKD20721.1| NPL4 family protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 709
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 129/318 (40%), Gaps = 67/318 (21%)
Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
+R G++YG E V V IYEPPQ + + +L D +EE LV+ IA
Sbjct: 271 GAQRIGYLYGRYTEYTEVPLGIKAVVEAIYEPPQINEVDGVTLL-DWQEESLVEGIARQC 329
Query: 205 GMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEW 246
G++KVG I+T + D Y LS+ E+ A A+ + +
Sbjct: 330 GLQKVGVIWTDLLDAGAGDGSVVCKRHIDSYYLSSLEIAFAARLQAQHPKPTKWSDTGRF 389
Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSK 297
+ V VS + I ++Q+S+ V + + E E DD +S+
Sbjct: 390 GSNFVTCVVSGDEKGQIAISSYQVSNSAVEMVRANIVEPSADPTVMLVRNEEEDDGSISR 449
Query: 298 MK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP--------LSSTFPIENRT 340
+ + + G V+E F V ++ L H P + F I NR
Sbjct: 450 TRYIPEVFYRRINEYGASVQENAKPSFPVEYLLVTLTHGFPDAPKPMFIAAPGFTIANRL 509
Query: 341 T------QVTMRALKSHLNRSPSLP----LVKRISDFHLLLFLAR-----FLDLNSDVPA 385
T + T+ A + ++R+ + ++ +SDFHLL +L + F + + A
Sbjct: 510 TLGEEQDKKTL-AKRIGVDRAGEVKNADGALRAVSDFHLLCYLKKEYVNLFEKASLEDEA 568
Query: 386 LAQCVQAQTAVPEGYKLL 403
L V + + EGY+LL
Sbjct: 569 LLYRVATKHDLAEGYQLL 586
>gi|342876146|gb|EGU77804.1| hypothetical protein FOXB_11668 [Fusarium oxysporum Fo5176]
Length = 646
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 80/308 (25%), Positives = 125/308 (40%), Gaps = 64/308 (20%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G++YG E V V IYEPPQ +E+ I + E +K VD +A G
Sbjct: 271 QRLGYLYGKYTEYTEVPLGVKAVVEAIYEPPQ--VDELDGITMNAWESQKEVDRVAKQCG 328
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEW------- 246
++ VG I+T + D Y LS+ E+ + A+ +W
Sbjct: 329 LEPVGVIWTDLLDAGNGDGTVVCKRHADSYFLSSLEICFSSRLQAQYPKPSKWSDTGSFG 388
Query: 247 ---VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPK 294
VT ++ +E+G +I ++QMS+ V + + E E DD
Sbjct: 389 SNFVTCIIT--GNEQGEISI--SSYQMSNESVEMVRADIVEPSADPNVMLVRDEEEDDGS 444
Query: 295 LSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIEN 338
+S+ + + V G +V+E F V + L H P FPIEN
Sbjct: 445 VSRTRYIPEVFYRRVNEYGANVQENAKPSFPVEYLFVTLTHGFPADPKPMFTQQGFPIEN 504
Query: 339 R---TTQVTMRALKSHLNRSPSL-PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQT 394
R AL L SP P + +S+FHLL F+ + L+ D AL V +Q
Sbjct: 505 REYVGESQEHSALSKFLKLSPGQKPTGQEVSNFHLLCFIQQMGVLSKDEEALLCRVASQH 564
Query: 395 AVPEGYKL 402
+ + ++L
Sbjct: 565 DLADTFQL 572
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 45/86 (52%), Gaps = 7/86 (8%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
MLLR+R DG+ R+ V+ T +L ++ QL V+ +TL+ L A + D +
Sbjct: 1 MLLRLRGPDGMARLNVE-PTTTFGDLGKLLLDQLPKTVDPKTLA----LSNAPNGGDSKR 55
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY 86
D+ N + + + HG ++FL+Y
Sbjct: 56 LGDIVN--FKVGQIGLKHGDLIFLTY 79
>gi|340517387|gb|EGR47631.1| nuclear pore complex, npl4 component [Trichoderma reesei QM6a]
Length = 663
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 75/317 (23%), Positives = 123/317 (38%), Gaps = 61/317 (19%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G++YG E +V V IYEPPQ +E+ I + E +K VD IA G
Sbjct: 273 QRLGYLYGRYAEYTKVPLGVKAIVEAIYEPPQ--VDEIDGITMNAWENQKEVDHIAKLCG 330
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
++ VG I+T + + D Y LS+ E+ + A+ + ++
Sbjct: 331 LEPVGVIWTDLLDSGRGDGSVVCKRHLESYYLSSLEICFSSRLQAQYPKPSKWSDSGQFG 390
Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
+ V ++ + I A+QMS+ V + + E E DD +S+
Sbjct: 391 SNFVTCVITGDEDGNIGISAYQMSNEAVEMVRADIVEPSADPNSMLVREEEEDDGSVSRT 450
Query: 299 K--------------KDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSST---FPIENRTT 341
+ V K V+ F + D P +T FPIENR
Sbjct: 451 RYIPEVFYSKINEYGAKVQQNAKPAFPVEYLFVTLTHGFPDEPKPTFATQTGFPIENREY 510
Query: 342 -------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQT 394
RAL + PS + +S+FH+L F+ + L+ D AL V Q
Sbjct: 511 IGESQEHSALARALNLRPGQKPS-DNGQEVSNFHILCFIHQLGVLSKDEEALLCRVATQH 569
Query: 395 AVPEGYKLLINSMAETV 411
+ + ++L +T+
Sbjct: 570 DLADSFQLRSTGGWQTL 586
>gi|336272553|ref|XP_003351033.1| hypothetical protein SMAC_04337 [Sordaria macrospora k-hell]
Length = 661
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 129/322 (40%), Gaps = 72/322 (22%)
Query: 142 QYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEE 194
+++N +R G +YG LE V V IYEPPQ +E+ + L E E
Sbjct: 274 KFLNPWRMTGCQRLGILYGKYLEYDVVPLGVKAVVEAIYEPPQ--VDEIDGVTLNVWENE 331
Query: 195 KLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CN 242
K V+ +A G++ VG I+T + K D Y L+ +E+ A F A+
Sbjct: 332 KDVNEVARLCGLEPVGVIWTDLLDSGKGDGSAICKRHTDSYFLAAQEICFAARFQAQHPK 391
Query: 243 MEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG-- 290
+W VT V+ +EEG +I A+QMS+ V + + E G
Sbjct: 392 ATKWSDTGRFGSNFVTCVIS--GNEEGEISI--SAYQMSNDAVEMVRADIIEPSTDPGQM 447
Query: 291 -------DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST- 333
DD +S+ + + + G +V+E F V + L H P S
Sbjct: 448 LVREEEEDDGSVSRTRYIPEVFYRKINEYGANVQENAKPAFPVEYLFVTLTHGFPESPKP 507
Query: 334 ------FPIENRT-------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLN 380
FPI NR + LK + +S +S+FHLL F+ + L+
Sbjct: 508 VFTNDGFPIANREFVGEAQEASAVAKILKVN-QKSDQF----DVSNFHLLCFIHQMSVLS 562
Query: 381 SDVPALAQCVQAQTAVPEGYKL 402
D AL V Q + + ++L
Sbjct: 563 KDEEALLCRVATQHDLADAFQL 584
>gi|46108668|ref|XP_381392.1| hypothetical protein FG01216.1 [Gibberella zeae PH-1]
Length = 648
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 75/304 (24%), Positives = 121/304 (39%), Gaps = 56/304 (18%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G++YG +E V V IYEPPQ +E+ I + E +K +D +A G
Sbjct: 272 QRLGYLYGKYVEYTEVPLGVKAVVEAIYEPPQ--VDELDGITMNAWESQKEIDKVAKQCG 329
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEWVTA---- 249
++ VG I+T + D Y LS+ E+ + A+ +W +
Sbjct: 330 LEPVGVIWTDLLDSGNGDGSVVCKRHADSYFLSSLEICFSSRLQAQYPKASKWSDSGNFG 389
Query: 250 --VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
V ++ I ++QMS+ V + + E E DD +S+
Sbjct: 390 SNFVTCIITGNEDGEISISSYQMSNESVEMVRADIVEPSADPNVMLVRDEEEDDGSVSRT 449
Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENR--- 339
+ + V G +V+E F V + L H P FPIENR
Sbjct: 450 RYIPEVFYRRVNEYGANVQENAKPSFPVEYLFVTLTHGFPADPKPMFTQQGFPIENREYV 509
Query: 340 TTQVTMRALKSHLNRSPSL-PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPE 398
AL + SP+ P + +S+FHLL F+ + L+ D AL V +Q +
Sbjct: 510 GESQEHSALSKFVKISPNQKPTGQEVSNFHLLCFIQQMGVLSKDEEALLCRVASQHDFAD 569
Query: 399 GYKL 402
++L
Sbjct: 570 TFQL 573
Score = 42.4 bits (98), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 11/88 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQT--LSTNQNLLLAKSPSDL 58
MLLR+R DG+ R+TV+ A T +L ++ QL V+ +T LS N +K D+
Sbjct: 1 MLLRLRGPDGMARLTVE-ATTTFGDLGKLLLDQLPKTVDPKTIALSNAPNGGDSKRLGDI 59
Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
+ F + + + HG ++FL+Y
Sbjct: 60 INFK--------VGQIGLKHGDLIFLTY 79
>gi|380090800|emb|CCC04970.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 601
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 129/322 (40%), Gaps = 72/322 (22%)
Query: 142 QYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEE 194
+++N +R G +YG LE V V IYEPPQ +E+ + L E E
Sbjct: 274 KFLNPWRMTGCQRLGILYGKYLEYDVVPLGVKAVVEAIYEPPQ--VDEIDGVTLNVWENE 331
Query: 195 KLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CN 242
K V+ +A G++ VG I+T + K D Y L+ +E+ A F A+
Sbjct: 332 KDVNEVARLCGLEPVGVIWTDLLDSGKGDGSAICKRHTDSYFLAAQEICFAARFQAQHPK 391
Query: 243 MEEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG-- 290
+W VT V+ +EEG +I A+QMS+ V + + E G
Sbjct: 392 ATKWSDTGRFGSNFVTCVIS--GNEEGEISI--SAYQMSNDAVEMVRADIIEPSTDPGQM 447
Query: 291 -------DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST- 333
DD +S+ + + + G +V+E F V + L H P S
Sbjct: 448 LVREEEEDDGSVSRTRYIPEVFYRKINEYGANVQENAKPAFPVEYLFVTLTHGFPESPKP 507
Query: 334 ------FPIENRT-------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLN 380
FPI NR + LK + +S +S+FHLL F+ + L+
Sbjct: 508 VFTNDGFPIANREFVGEAQEASAVAKILKVN-QKSDQF----DVSNFHLLCFIHQMSVLS 562
Query: 381 SDVPALAQCVQAQTAVPEGYKL 402
D AL V Q + + ++L
Sbjct: 563 KDEEALLCRVATQHDLADAFQL 584
>gi|157870668|ref|XP_001683884.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68126951|emb|CAJ05219.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 627
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 67/301 (22%), Positives = 124/301 (41%), Gaps = 65/301 (21%)
Query: 130 VSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLED--KRVEVNFIYEPPQQGTEEVLYI 187
V D + FQ +V + + + ++R +YG + +D +EV+ +YEP Q G+
Sbjct: 295 VKVDAQSINIFQSFVAQ-MDWRLQRYAILYG-LYDDATHTIEVHAVYEPEQHGSTYTFEP 352
Query: 188 LRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTLSNREVLQAVE----FHAECNM 243
L D +K V+ IA LG+++VG T M++ + L+ RE+L + + EC +
Sbjct: 353 LPDAHLDK-VERIAKALGLRRVGVACTHP-MRDPEHILLNFRELLLCAKEQSRYGDECAL 410
Query: 244 EEWVTAVVKLEVS----------EEGSAA------------IHFEAFQMSDMCVRLFKEG 281
A + S GSAA + +A+Q S CV L++ G
Sbjct: 411 LTVAPAAMPSTKSGGATSAPGELPTGSAAASSGATLEPTIVVSCQAWQTSPQCVHLYRLG 470
Query: 282 WFET-----------------------EIAEGD-DPKLSKMKKDVVVGGKDVKEVDNDFF 317
+ E+A+ D DP V E+D +F
Sbjct: 471 VLQESPSGEAALQDVEQARQVHCTMPLEVAQTDTDPS----GHQRFVTKAPSTEIDTRWF 526
Query: 318 LVVVKILDHQGPL-SSTFPIENRT--TQVTMRALKSHLN--RSPSLPLVKRISDFHLLLF 372
+ + + P+ F +R ++ L+++++ + S+ + I+DFH+L++
Sbjct: 527 TSYIAVQQFESPIVRGAFMRLSRPGMPPPVLQNLRNYISDPKRKSMSFAETIADFHVLVY 586
Query: 373 L 373
L
Sbjct: 587 L 587
>gi|408398882|gb|EKJ78008.1| hypothetical protein FPSE_01796 [Fusarium pseudograminearum CS3096]
Length = 648
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 75/304 (24%), Positives = 121/304 (39%), Gaps = 56/304 (18%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G++YG +E V V IYEPPQ +E+ I + E +K +D +A G
Sbjct: 272 QRLGYLYGKYVEYTEVPLGVKAVVEAIYEPPQ--VDELDGITMNAWESQKEIDKVAKQCG 329
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEWVTA---- 249
++ VG I+T + D Y LS+ E+ + A+ +W +
Sbjct: 330 LEPVGVIWTDLLDSGNGDGSVVCKRHADSYFLSSLEICFSSRLQAQYPKASKWSDSGNFG 389
Query: 250 --VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
V ++ I ++QMS+ V + + E E DD +S+
Sbjct: 390 SNFVTCIITGNEDGEISISSYQMSNESVEMVRADIVEPSADPNVMLVRDEEEDDGSVSRT 449
Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENR--- 339
+ + V G +V+E F V + L H P FPIENR
Sbjct: 450 RYIPEVFYRRVNEYGANVQENAKPSFPVEYLFVTLTHGFPADPKPMFTQQGFPIENREYV 509
Query: 340 TTQVTMRALKSHLNRSPSL-PLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPE 398
AL + SP+ P + +S+FHLL F+ + L+ D AL V +Q +
Sbjct: 510 GESQEHSALSKFVKISPNQKPTGQEVSNFHLLCFIQQMGVLSKDEEALLCRVASQHDFAD 569
Query: 399 GYKL 402
++L
Sbjct: 570 TFQL 573
Score = 42.4 bits (98), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 11/88 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQT--LSTNQNLLLAKSPSDL 58
MLLR+R DG+ R+TV+ A T +L ++ QL V+ +T LS N +K D+
Sbjct: 1 MLLRLRGPDGMARLTVE-ATTTFGDLGKLLLDQLPKTVDPKTIALSNAPNGGDSKRLGDI 59
Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
+ F + + + HG ++FL+Y
Sbjct: 60 INFK--------VGQIGLKHGDLIFLTY 79
>gi|322705619|gb|EFY97203.1| NPL4 family protein [Metarhizium anisopliae ARSEF 23]
Length = 644
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 121/303 (39%), Gaps = 58/303 (19%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R GF+YG E V V IYEPPQ +E+ I L +K V+ IA G
Sbjct: 274 QRIGFLYGRYAEYTEVPLGVKAVVEAIYEPPQ--VDELDGITLNAWGNQKEVEHIAQLCG 331
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
++ VG I+T + + D Y LS+ E+ + A+ + +
Sbjct: 332 LEPVGVIWTDLLDAGRGDGSVVCKRHADSYFLSSLEICFSSRLQAQHPKPSKWSDTGRYG 391
Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
+ V ++ I ++QMS+ V + + E E DD +S+
Sbjct: 392 SNFVTCIITGNEQGEIGISSYQMSNEAVEMVRADIVEPSADPTVMLVRDEEEDDGSVSRT 451
Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPIENR--- 339
+ + V G +V+E F V + L H P + FPIENR
Sbjct: 452 RYIPEVFYRKVNEYGANVQENAKPSFPVEYLFVTLTHGFPADPKPTFTQTGFPIENREYI 511
Query: 340 TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVPEG 399
AL LN S + + +S+FHLL FL + L+ D AL V +Q + E
Sbjct: 512 GESQEHSALSKALNLSDT---GQEVSNFHLLCFLHQMGILSKDEEALLCRVASQHDLAES 568
Query: 400 YKL 402
++L
Sbjct: 569 FQL 571
>gi|312067660|ref|XP_003136847.1| NPL4 family protein [Loa loa]
Length = 503
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 69/322 (21%), Positives = 130/322 (40%), Gaps = 73/322 (22%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG----GFMYGTVLEDKRVEVNF-- 172
+ RQ H +++S + ++ VN LAF K G G+M G + + +
Sbjct: 147 LNRQRFRHVDNISIEN------EEVVNRFLAFWRKTGHQRMGYMIGRYEPFQEIPLGIKA 200
Query: 173 ----IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQ-----------TI 217
IYEPPQ + + + L + +EK+VD + + L +K+VG+IFT
Sbjct: 201 TVAAIYEPPQSCSADSV-CLEADPQEKMVDELCSYLNLKRVGWIFTDLWSADSSKGTVCC 259
Query: 218 MQNKKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQM 270
++K + L+ +E + A + C + + + S I F +Q+
Sbjct: 260 TRHKDSFFLTAQECITAGWLQNKYPNITTFCTDGYFGSKFTTVVASGNEWNQIDFSGYQV 319
Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK----------DVVVGGKDVKEVDN------ 314
S+ C L + + P+L+ +++ DV K+ V+
Sbjct: 320 SNQCASLVEANL----LCPTSHPELAYLREVPLTPSQYITDVYYMEKNEYGVETRKNGRP 375
Query: 315 ---DFFLVVVKILDHQGPLSS-------TFPIENRTTQVTM---RALKSHLNRSPSLPLV 361
++ LV V + P ++ FP ENR + +A+ +++ +
Sbjct: 376 MPVEYLLVDVPAGMPKEPHATFNISKKCNFPTENRVLIGELQDSKAVGRYISEFSQNQFI 435
Query: 362 KRISDFHLLLFLARFLDLNSDV 383
+ ++FH LLFL LN+D+
Sbjct: 436 ELATNFHFLLFL-----LNNDI 452
>gi|393910799|gb|EFO27216.2| NPL4 family protein [Loa loa]
Length = 548
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 69/322 (21%), Positives = 130/322 (40%), Gaps = 73/322 (22%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG----GFMYGTVLEDKRVEVNF-- 172
+ RQ H +++S + ++ VN LAF K G G+M G + + +
Sbjct: 161 LNRQRFRHVDNISIEN------EEVVNRFLAFWRKTGHQRMGYMIGRYEPFQEIPLGIKA 214
Query: 173 ----IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQ-----------TI 217
IYEPPQ + + + L + +EK+VD + + L +K+VG+IFT
Sbjct: 215 TVAAIYEPPQSCSADSV-CLEADPQEKMVDELCSYLNLKRVGWIFTDLWSADSSKGTVCC 273
Query: 218 MQNKKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQM 270
++K + L+ +E + A + C + + + S I F +Q+
Sbjct: 274 TRHKDSFFLTAQECITAGWLQNKYPNITTFCTDGYFGSKFTTVVASGNEWNQIDFSGYQV 333
Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK----------DVVVGGKDVKEVDN------ 314
S+ C L + + P+L+ +++ DV K+ V+
Sbjct: 334 SNQCASLVEANL----LCPTSHPELAYLREVPLTPSQYITDVYYMEKNEYGVETRKNGRP 389
Query: 315 ---DFFLVVVKILDHQGPLSS-------TFPIENRTTQVTM---RALKSHLNRSPSLPLV 361
++ LV V + P ++ FP ENR + +A+ +++ +
Sbjct: 390 MPVEYLLVDVPAGMPKEPHATFNISKKCNFPTENRVLIGELQDSKAVGRYISEFSQNQFI 449
Query: 362 KRISDFHLLLFLARFLDLNSDV 383
+ ++FH LLFL LN+D+
Sbjct: 450 ELATNFHFLLFL-----LNNDI 466
>gi|322697947|gb|EFY89721.1| NPL4 family protein [Metarhizium acridum CQMa 102]
Length = 650
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 121/306 (39%), Gaps = 58/306 (18%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R GF+YG E V V IYEPPQ +E+ I L +K V+ IA G
Sbjct: 274 QRIGFLYGRYAEYTEVPLGVKAVVEAIYEPPQ--VDELDGITLNAWGNQKEVEHIAQLCG 331
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
++ VG I+T + + D Y LS+ E+ + A+ + +
Sbjct: 332 LEPVGVIWTDLLDAGRGDGSVICKRHADSYFLSSLEICFSSRLQAQHPKPSKWSDTGRYG 391
Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
+ V ++ I ++QMS+ V + + E E DD +S+
Sbjct: 392 SNFVTCIITGNEQGEIGISSYQMSNEAVEMVRADIVEPSADPTVMLVRDEEEDDGSVSRT 451
Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPIENR--- 339
+ + V G +V+E F V + L H P + FPIENR
Sbjct: 452 RYIPEVFYRKVNEYGANVQENAKPSFPVEYLFVTLTHGFPAEPKPTFTQTGFPIENREYI 511
Query: 340 TTQVTMRALKSHLNRSP---SLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAV 396
AL LN S S + +S+FHLL FL + L+ D AL V +Q +
Sbjct: 512 GESQEHSALSKALNLSSAQKSSDTGQEVSNFHLLCFLHKMGILSKDEEALLCRVASQHDL 571
Query: 397 PEGYKL 402
E ++L
Sbjct: 572 AESFQL 577
>gi|389631719|ref|XP_003713512.1| hypothetical protein MGG_04413 [Magnaporthe oryzae 70-15]
gi|374095430|sp|A4RN19.2|NPL4_MAGO7 RecName: Full=Nuclear protein localization protein 4
gi|351645845|gb|EHA53705.1| hypothetical protein MGG_04413 [Magnaporthe oryzae 70-15]
Length = 669
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 133/332 (40%), Gaps = 69/332 (20%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEK 195
++N +R G MYG E + V V IYEPPQ +EV + L + EK
Sbjct: 274 FINTWRRTGGQRYGIMYGKYSEYEEVPLGIKAVVQAIYEPPQ--VDEVDGVSLNSWDNEK 331
Query: 196 LVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNM 243
V+ +A G++ VG I+T + D Y LS+ EV A A+
Sbjct: 332 DVNQVARLCGLEPVGAIWTDLLDAGAGDGSVVCKRHADSYFLSSLEVCFAARLQAQHPKP 391
Query: 244 EEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
+W VT ++ +E+G AI ++Q+S+ V + + E
Sbjct: 392 SKWSDTGRFGSNFVTCIIS--GNEQGEIAI--SSYQVSNEAVEMVRADIMEPSADPTVML 447
Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKE-------VDNDFFLVVVKILDHQGPL 330
E DD S+ + + + G +V+E V+ F + D P+
Sbjct: 448 VREEEEDDGSTSRTRYIPDVFYRRINEYGANVQENAKPSFPVEYLFVTLTHGFPDVAKPM 507
Query: 331 SS---TFPIENRTT-------QVTMRALKSHLNRSP-SLPLVKRISDFHLLLFLARFLDL 379
S FPIENR +ALK H S S ++S+FHLL FL + L
Sbjct: 508 FSDEGAFPIENREYMGESQEHSAAAKALKVHEKASSGSSKDGMKVSNFHLLCFLHQMSVL 567
Query: 380 NSDVPALAQCVQAQTAVPEGYKLLINSMAETV 411
+ D +L V Q + + ++L + +T+
Sbjct: 568 SKDEESLLCRVATQHDLADAFQLRSTTGWQTL 599
>gi|402588870|gb|EJW82803.1| NPL4 family protein, partial [Wuchereria bancrofti]
Length = 492
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 69/322 (21%), Positives = 131/322 (40%), Gaps = 73/322 (22%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG----GFMYGTVLEDKRVEVNF-- 172
+ RQ H +++S + ++ VN LAF K G G+M G + + +
Sbjct: 101 LNRQRFRHVDNISIEN------EEVVNRFLAFWRKTGHQRIGYMIGRYESFQEIPLGIKA 154
Query: 173 ----IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQ-----------TI 217
IYEPPQ + + + L + +EK+VD + + L +K+VG+IFT
Sbjct: 155 TVAAIYEPPQSCSADSV-CLEADPQEKVVDELCSYLNLKRVGWIFTDLWSADSSKGTVYC 213
Query: 218 MQNKKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQM 270
++K + L+ +E + A + C + + + S I F +Q+
Sbjct: 214 TRHKDSFFLTAQECITAGWLQNKYPNITTFCTDGYFGSKFTTVVASGNEWNQIDFSGYQV 273
Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK----------DVVVGGKDVKEVDN------ 314
S+ C L + + P+L+ +++ DV K+ V+
Sbjct: 274 SNQCASLVE----ANLLCPTSHPELAYLREVPLTPSQYITDVYYMEKNEYGVETRKNGRP 329
Query: 315 ---DFFLVVVKILDHQGPLSS-------TFPIENRTTQVTM---RALKSHLNRSPSLPLV 361
++ LV V + P ++ FP ENR + +A+ ++++ +
Sbjct: 330 MPVEYLLVDVPAGMPKEPHATFNISKKCYFPTENRILIGELQDSKAVGRYISKFSQNQFI 389
Query: 362 KRISDFHLLLFLARFLDLNSDV 383
+ ++FH LLFL LN+D+
Sbjct: 390 ELATNFHFLLFL-----LNNDI 406
>gi|310790815|gb|EFQ26348.1| NPL4 family protein [Glomerella graminicola M1.001]
Length = 649
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 122/307 (39%), Gaps = 60/307 (19%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G MYG E + V V IYEPPQ +EV I L E + VD +A G
Sbjct: 280 QRLGIMYGRYTEYEVVPLGIKAVVEAIYEPPQ--VDEVDGISLNAWENQNDVDEVARLCG 337
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
+++VG IFT + D Y LS++E+ A+ + ++
Sbjct: 338 LQQVGVIFTDLLDAGAGDGSVVCKRHADSYFLSSQEICFISRLQAQHPKPTKWSDTGKFG 397
Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
+ V +S S I A+Q S+ V + + E E DD S+
Sbjct: 398 SNFVTCIISGNESGEISISAYQASNDAVEMVRADIVEPSADPNAMLVREEEEDDGSTSRT 457
Query: 299 K-------KDVVVGGKDVKE-------VDNDFFLVVVKILDHQGPLSST--FPIENRTT- 341
+ + + G +V+E V+ F + DH P + FPIENR
Sbjct: 458 RYIPEVFYRRINEYGANVQENAKPTFPVEYLFVTLTHGFPDHSKPTFTQLGFPIENREYM 517
Query: 342 ------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
R LK + P+ ++ S+FHLL F+ + L+ D +L V +Q
Sbjct: 518 GESQEHSAASRLLKFSGGQKPAEGGLE-TSNFHLLCFIHQMGILSKDEESLLCRVASQHD 576
Query: 396 VPEGYKL 402
+ + Y+L
Sbjct: 577 LADTYQL 583
Score = 41.2 bits (95), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 48/88 (54%), Gaps = 11/88 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPS--DL 58
MLLR+R DG+ R+T+D + EL +L +P +T+ N + L+ +P+ D
Sbjct: 1 MLLRLRGPDGMSRLTID-KDASFGEL-----GRLLLPTLPKTVDPN-TITLSNAPTGGDA 53
Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
TD+A+ + + + HG ++F+SY
Sbjct: 54 KNLTDIAS--FKVGQIGLKHGDLIFISY 79
>gi|389631717|ref|XP_003713511.1| hypothetical protein, variant [Magnaporthe oryzae 70-15]
gi|351645844|gb|EHA53704.1| hypothetical protein, variant [Magnaporthe oryzae 70-15]
Length = 607
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 133/332 (40%), Gaps = 69/332 (20%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEK 195
++N +R G MYG E + V V IYEPPQ +EV + L + EK
Sbjct: 274 FINTWRRTGGQRYGIMYGKYSEYEEVPLGIKAVVQAIYEPPQ--VDEVDGVSLNSWDNEK 331
Query: 196 LVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNM 243
V+ +A G++ VG I+T + D Y LS+ EV A A+
Sbjct: 332 DVNQVARLCGLEPVGAIWTDLLDAGAGDGSVVCKRHADSYFLSSLEVCFAARLQAQHPKP 391
Query: 244 EEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
+W VT ++ +E+G AI ++Q+S+ V + + E
Sbjct: 392 SKWSDTGRFGSNFVTCIIS--GNEQGEIAI--SSYQVSNEAVEMVRADIMEPSADPTVML 447
Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKE-------VDNDFFLVVVKILDHQGPL 330
E DD S+ + + + G +V+E V+ F + D P+
Sbjct: 448 VREEEEDDGSTSRTRYIPDVFYRRINEYGANVQENAKPSFPVEYLFVTLTHGFPDVAKPM 507
Query: 331 SS---TFPIENRTT-------QVTMRALKSHLNRSP-SLPLVKRISDFHLLLFLARFLDL 379
S FPIENR +ALK H S S ++S+FHLL FL + L
Sbjct: 508 FSDEGAFPIENREYMGESQEHSAAAKALKVHEKASSGSSKDGMKVSNFHLLCFLHQMSVL 567
Query: 380 NSDVPALAQCVQAQTAVPEGYKLLINSMAETV 411
+ D +L V Q + + ++L + +T+
Sbjct: 568 SKDEESLLCRVATQHDLADAFQLRSTTGWQTL 599
>gi|400601268|gb|EJP68911.1| NPL4 family protein [Beauveria bassiana ARSEF 2860]
Length = 662
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 75/306 (24%), Positives = 118/306 (38%), Gaps = 58/306 (18%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R G++YG E V V IYEPPQ + + L E +K +D IA G+
Sbjct: 272 QRLGYLYGRYAEYTEVPLGTKAVVEAIYEPPQVDEMDGV-TLNAWENQKDIDHIAKLCGL 330
Query: 207 KKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEFHAEC-NMEEWV------T 248
+ VG I+T + ++ Y LS+ E A A+ +W +
Sbjct: 331 EPVGVIWTDLLDAGQGNGSVVCKRHADSYFLSSLETCFAARLQAQYPKATKWSDTGRFGS 390
Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKMK 299
+ V VS I A+Q S+ V + + E E DD S+ +
Sbjct: 391 SFVTCIVSGNEDGEIAISAYQASNDAVEMVRADIIEPSADPTQMLVRSEEEDDGSTSRTR 450
Query: 300 -------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPIENRTT-- 341
+ + G +V+E F V + L H P + FPIENR
Sbjct: 451 YIPEVFFRKINEYGANVQENAKPAFPVEYLFVTLTHGFPADPKPTFTQTDFPIENREYIG 510
Query: 342 -----QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAV 396
+AL + PS + +S+FHLL FL + ++ D AL V +Q +
Sbjct: 511 ESQEHTAVAKALGMKPGQKPS-DNCQEVSNFHLLCFLHQMGTMSKDEEALLCKVASQHDL 569
Query: 397 PEGYKL 402
+ ++L
Sbjct: 570 ADSFQL 575
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 48/88 (54%), Gaps = 11/88 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPS--DL 58
MLLR+R DG+ R+TV+ T E+ + S L V+ +T+S ++ SP+ D
Sbjct: 1 MLLRVRGPDGMARLTVEPT-TTFGEMGQQLLSHLPPTVDPKTIS------MSNSPTGGDA 53
Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
+ D+ N + + +SHG ++F++Y
Sbjct: 54 KRLGDIVN--FKIGQIGLSHGDLIFITY 79
>gi|358395872|gb|EHK45259.1| hypothetical protein TRIATDRAFT_265090 [Trichoderma atroviride IMI
206040]
Length = 667
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 123/311 (39%), Gaps = 68/311 (21%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G++YG E +V V IYEPPQ +E+ I ++ E +K +D IA G
Sbjct: 272 QRLGYLYGRYAEYTKVPLGVKAVVEAIYEPPQ--VDEIDGISMKAWENQKEIDHIAKLCG 329
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEW------- 246
++ VG I+T + D Y LS+ E+ + A+ +W
Sbjct: 330 LEPVGAIWTDLLDSGHGDGSVVCKRHLDSYFLSSLEICFSSRLQAQYPKPSKWSDSGQFG 389
Query: 247 ---VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPK 294
VT V+ + EEG+ AI A+QMS+ V + + E E DD
Sbjct: 390 SNFVTCVITGD--EEGNIAI--SAYQMSNDAVEMVRADIVEPSADPNSMLVREEEEDDGS 445
Query: 295 LSKMK--------------KDVVVGGKDVKEVDNDFFLVVVKILDHQGPL--SSTFPIEN 338
LS+ + V K V+ F + D P + FPIEN
Sbjct: 446 LSRTRYIPEVFYSKVNEYGAKVQQNAKPAFPVEYLFVTLTHGFPDESKPTFTQTGFPIEN 505
Query: 339 RTTQVTMRALKSHLNRSPSLPLVKRISD-------FHLLLFLARFLDLNSDVPALAQCVQ 391
R + S L R+ +L ++ SD FH+L F+ L+ D AL V
Sbjct: 506 RE-YIGESQEHSALARALNLRSGQKASDNGQEVSNFHILCFIHTLGVLSKDEEALLCRVA 564
Query: 392 AQTAVPEGYKL 402
Q + + ++L
Sbjct: 565 TQHDLADSFQL 575
Score = 41.6 bits (96), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 11/91 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQT--LSTNQNLLLAKSPSDL 58
ML+R+R DG+ R+T+D T +L + +QL ++ +T LS + AK D+
Sbjct: 1 MLIRLRGPDGMIRMTLDPT-TTFGDLGQQLLAQLPSTIDPKTIVLSNSPTGRDAKRLGDI 59
Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSYDGE 89
+QF + + + HG ++FLSY E
Sbjct: 60 VQFQ--------IGQIGLKHGDLIFLSYQHE 82
>gi|170579617|ref|XP_001894909.1| NPL4 family protein [Brugia malayi]
gi|158598324|gb|EDP36238.1| NPL4 family protein [Brugia malayi]
Length = 532
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 69/322 (21%), Positives = 130/322 (40%), Gaps = 73/322 (22%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRG----GFMYGTVLEDKRVEVNF-- 172
+ RQ H +++S + + + VN LAF K G G+M G + + +
Sbjct: 135 LNRQRFRHVDNISIENE------EVVNRFLAFWRKTGHQRIGYMIGRYESFQEIPLGIKA 188
Query: 173 ----IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQ-----------TI 217
IYEPPQ + + + L + +EK+VD + + L +K+VG+IFT
Sbjct: 189 TVAAIYEPPQSCSADSV-CLEADPQEKVVDELCSYLNLKRVGWIFTDLWSADSSKGTVYC 247
Query: 218 MQNKKDYTLSNREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQM 270
++K + L+ +E + A + C + + + S I F +Q+
Sbjct: 248 TRHKDSFFLTAQECITAGWLQNKYPNITTFCTDGYFGSKFTTVVASGNEWNQIDFSGYQV 307
Query: 271 SDMCVRLFKEGWFETEIAEGDDPKLSKMKK----------DVVVGGKDVKEVDN------ 314
S+ C L + + P+L+ +++ DV K+ V+
Sbjct: 308 SNQCASLVE----ANLLCPTSHPELAYLREVPLTPSQYITDVYYMEKNEYGVETRKNGRP 363
Query: 315 ---DFFLVVVKILDHQGPLSS-------TFPIENRTTQVTM---RALKSHLNRSPSLPLV 361
++ LV V + P ++ FP ENR + +A+ +++ +
Sbjct: 364 MPVEYLLVDVPAGMPKEPHATFNISKKCYFPTENRILIGELQDSKAVGRYISEFSQNQFI 423
Query: 362 KRISDFHLLLFLARFLDLNSDV 383
+ ++FH LLFL LN+D+
Sbjct: 424 ELATNFHFLLFL-----LNNDI 440
>gi|344234048|gb|EGV65918.1| polyubiquitin-tagged protein recognition complex, Npl4 component
[Candida tenuis ATCC 10573]
Length = 568
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 71/270 (26%), Positives = 112/270 (41%), Gaps = 52/270 (19%)
Query: 151 AVKRGGFMYGT------VLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
+R G++YG+ V + +V FIYEPPQ + + ++ E EE +DA+AA L
Sbjct: 258 GTQRFGYLYGSYEPFDQVPLGIKAKVEFIYEPPQACELDGITLIPWENEEA-IDALAAEL 316
Query: 205 GMKKVGFIFTQ-----------TIMQNKKDYTLSNREVLQAVEFHAE-------CNMEEW 246
+ KVG IFT ++K Y L+N EV A + N ++
Sbjct: 317 NLYKVGVIFTDLTDSGLKNGSVLCKRHKDSYFLTNIEVAMAARNQLKYAYSTKYSNDRKF 376
Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE-------IAEGDD------- 292
+ V VS + I ++Q+S L + T I E +D
Sbjct: 377 SSRFVTCVVSGGLNGEIEPRSYQVSVNAEALLEADIVTTSTQPSMMYINESNDTRYVPDV 436
Query: 293 --PKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGP----LSSTFPIENRT---TQV 343
K+++ +V K V D+ LV + + P S++F IE+R
Sbjct: 437 FYSKINEYGLEVKTNAKPAFPV--DYLLVTLSDAMPKEPKPFFTSNSFVIEHRDFLGELQ 494
Query: 344 TMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
++AL +HLN V + DFHLL +L
Sbjct: 495 NLKALFNHLNNDIGDGSV--LLDFHLLAYL 522
>gi|268533954|ref|XP_002632107.1| Hypothetical protein CBG06960 [Caenorhabditis briggsae]
Length = 527
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
Q+ VN+ L + +R G++ G V + IYEPPQ E+ + L D
Sbjct: 134 QELVNQFLDYWRVSGHQRVGYLIGQYQPHPEVPLGIKATVAAIYEPPQHCREDGIEFLED 193
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQT-----------IMQNKKDYTLSNREVLQAVEF-H 238
+ E K+VD + LG+++VG+IFT ++K + LS E + A +
Sbjct: 194 KNE-KIVDKLLEMLGLQRVGWIFTDCWTAARAEGTVHYTRHKDSFFLSAEECITAGMLQN 252
Query: 239 AECNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
A N+ ++ + V + S + S ++F +Q+S+ C +
Sbjct: 253 AHPNVTDYSMDRRYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 297
>gi|341896803|gb|EGT52738.1| hypothetical protein CAEBREN_20908 [Caenorhabditis brenneri]
Length = 524
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 75/165 (45%), Gaps = 29/165 (17%)
Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
Q+ VN+ L + +R G++ G V + IYEPPQ E+ + L D
Sbjct: 132 QELVNQFLDYWRLSGHQRVGYLIGQYQPHPEVPLGIKATVAAIYEPPQHCREDGVEFLED 191
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT-----------LSNREVLQAVEF-H 238
+ E K+VD + LG+++VG+IFT N+ + T LS E + A +
Sbjct: 192 KNE-KVVDKLLEMLGLQRVGWIFTDCWTANRAEGTVHYTRHKDSFFLSAEECITAGMLQN 250
Query: 239 AECNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
+ N+ ++ + V + S + S ++F +Q+S+ C +
Sbjct: 251 SHPNVTDYSMDRRYGSKFVTVVASGDDSMHVNFHGYQVSNQCAAM 295
>gi|17534585|ref|NP_495093.1| Protein NPL-4.2, isoform b [Caenorhabditis elegans]
gi|351050428|emb|CCD64971.1| Protein NPL-4.2, isoform b [Caenorhabditis elegans]
Length = 529
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 29/165 (17%)
Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
Q+ VN+ L + +R GF+ G V + IYEPPQ E+ + L D
Sbjct: 134 QELVNQFLDYWRLSGHQRVGFLIGQYQPHLEVPLGIKATVAAIYEPPQHCREDGIEFLED 193
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQN-----------KKDYTLSNREVLQAVEFH- 238
+ + K +D + LG+++VG+IFT N K + LS E + A
Sbjct: 194 KNQ-KTIDNLLEMLGLQRVGWIFTDCWTANSAEGTVHYTRHKDSFFLSAEECITAAMLQN 252
Query: 239 -----AECNME-EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
E +M+ + + V + S + S ++F +Q+S+ C +
Sbjct: 253 QHPNITEYSMDRHYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 297
>gi|17534583|ref|NP_495094.1| Protein NPL-4.2, isoform a [Caenorhabditis elegans]
gi|351050427|emb|CCD64970.1| Protein NPL-4.2, isoform a [Caenorhabditis elegans]
Length = 527
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 29/165 (17%)
Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
Q+ VN+ L + +R GF+ G V + IYEPPQ E+ + L D
Sbjct: 134 QELVNQFLDYWRLSGHQRVGFLIGQYQPHLEVPLGIKATVAAIYEPPQHCREDGIEFLED 193
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQN-----------KKDYTLSNREVLQAVEFH- 238
+ + K +D + LG+++VG+IFT N K + LS E + A
Sbjct: 194 KNQ-KTIDNLLEMLGLQRVGWIFTDCWTANSAEGTVHYTRHKDSFFLSAEECITAAMLQN 252
Query: 239 -----AECNME-EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
E +M+ + + V + S + S ++F +Q+S+ C +
Sbjct: 253 QHPNITEYSMDRHYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 297
>gi|429856201|gb|ELA31125.1| endoplasmic reticulum and nuclear membrane proteinc [Colletotrichum
gloeosporioides Nara gc5]
Length = 654
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 77/307 (25%), Positives = 124/307 (40%), Gaps = 60/307 (19%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G+MYG E + V V IYEPPQ +E+ I L E +K +D +A G
Sbjct: 283 QRLGYMYGRYTEYEVVPLGIKAVVEAIYEPPQ--VDELDGISLNGWENQKDIDDVARLCG 340
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNMEEWVTA---- 249
++K+G I+T + D Y LS++E+ A+ +W +
Sbjct: 341 LQKIGVIWTDLLDAGAGDGSVVCKRHADSYFLSSQEICFISRLQAQHPKPTKWSDSGRFG 400
Query: 250 --VVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
V +S S I ++Q S+ V + + E E DD +S+
Sbjct: 401 SNFVTCIISGNESGEISISSYQASNDAVEMVRADIIEPSADPNVMLVREEEEDDGSVSRT 460
Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENRTT- 341
+ + + G +V+E F V + L H P S FPIENR
Sbjct: 461 RYIPEVFYRRINEYGANVQENAKPSFPVEYLFVTLTHGFPDESKPTFTQLGFPIENREYM 520
Query: 342 ------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
R LK + P+ ++ +S+FHLL F+ + L+ D +L V +Q
Sbjct: 521 GESQEHSAVARILKFSGGQKPAEGGLE-MSNFHLLCFIHQMDILSKDEESLLCRVASQHD 579
Query: 396 VPEGYKL 402
+ + Y+L
Sbjct: 580 LADTYQL 586
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 52/112 (46%), Gaps = 14/112 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
MLLR+R DG+ R+T+D T EL + L V+ T++ L A + D +
Sbjct: 1 MLLRLRGPDGMSRLTIDKDE-TFGELGRQLLPNLPKTVDPSTIT----LSNAPTGGDAKK 55
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY-------DGERNVRGPSFNPAGSFGR 105
D+ N + + + HG ++F++Y +G N P+ +P S R
Sbjct: 56 LVDITN--FKIGQIGLKHGDLIFINYQHQDSLSNGATNGAQPTSHPLASSNR 105
>gi|17534581|ref|NP_495096.1| Protein NPL-4.1, isoform b [Caenorhabditis elegans]
gi|351050426|emb|CCD64969.1| Protein NPL-4.1, isoform b [Caenorhabditis elegans]
Length = 529
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 29/165 (17%)
Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
Q+ VN+ L + +R GF+ G V + IYEPPQ E+ + L D
Sbjct: 134 QELVNQFLDYWRLSGHQRVGFLIGQYQPHLEVPLGIKATVAAIYEPPQHCREDGIEFLED 193
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQN-----------KKDYTLSNREVLQAVEFH- 238
+ + K +D + LG+++VG+IFT N K + LS E + A
Sbjct: 194 KNQ-KTIDNLLEMLGLQRVGWIFTDCWTANSAEGTVHYTRHKDSFFLSAEECITAAMLQN 252
Query: 239 -----AECNME-EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
E +M+ + + V + S + S ++F +Q+S+ C +
Sbjct: 253 QHPNITEYSMDRHYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 297
>gi|17534579|ref|NP_495097.1| Protein NPL-4.1, isoform a [Caenorhabditis elegans]
gi|351050425|emb|CCD64968.1| Protein NPL-4.1, isoform a [Caenorhabditis elegans]
Length = 527
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 29/165 (17%)
Query: 141 QQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRD 190
Q+ VN+ L + +R GF+ G V + IYEPPQ E+ + L D
Sbjct: 134 QELVNQFLDYWRLSGHQRVGFLIGQYQPHLEVPLGIKATVAAIYEPPQHCREDGIEFLED 193
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQTIMQN-----------KKDYTLSNREVLQAVEFH- 238
+ + K +D + LG+++VG+IFT N K + LS E + A
Sbjct: 194 KNQ-KTIDNLLEMLGLQRVGWIFTDCWTANSAEGTVHYTRHKDSFFLSAEECITAAMLQN 252
Query: 239 -----AECNME-EWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
E +M+ + + V + S + S ++F +Q+S+ C +
Sbjct: 253 QHPNITEYSMDRHYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 297
>gi|85118482|ref|XP_965453.1| hypothetical protein NCU02680 [Neurospora crassa OR74A]
gi|74618839|sp|Q7SH49.1|NPL4_NEUCR RecName: Full=Nuclear protein localization protein 4
gi|28927262|gb|EAA36217.1| hypothetical protein NCU02680 [Neurospora crassa OR74A]
gi|38567196|emb|CAE76489.1| probable nuclear protein localization factor and ER translocation
component NPL4 [Neurospora crassa]
Length = 651
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 72/313 (23%), Positives = 120/313 (38%), Gaps = 54/313 (17%)
Query: 142 QYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEE 194
+++N +R G +YG LE V V IYEPPQ +E+ + L E E
Sbjct: 264 KFLNPWRMTGCQRLGILYGKYLEYDVVPLGVKAVVEAIYEPPQ--VDEIDGVTLNAWENE 321
Query: 195 KLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE--- 240
K V+ +A G++ VG I+T + K D Y L+ +E+ A A+
Sbjct: 322 KDVNEVARLCGLEPVGVIWTDLLDAGKGDGSAICKRHSDSYFLAAQEICFAARLQAQHPK 381
Query: 241 ----CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG------ 290
+ + + V +S I A+QMS+ V + + E G
Sbjct: 382 PTKWSDTGRFGSNFVTCVISGNEQGEISISAYQMSNDAVEMVRADIIEPSTDPGQMLVRE 441
Query: 291 ---DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSSTFPIEN 338
DD +S+ + + + G +V+E F V + L H P S N
Sbjct: 442 EEEDDGSVSRTRYIPEVFYRKINEYGANVQENAKPAFPVEYLFVTLTHGFPESPRPVFTN 501
Query: 339 RTTQVTMRALKSHLNRSPSLPLVKRI---------SDFHLLLFLARFLDLNSDVPALAQC 389
+ R + S+ + ++ S+FHLL F+ + L+ D AL
Sbjct: 502 DGFPIANREFVGEAQEASSVAKILKVNQKSDQFDVSNFHLLCFIRQMSVLSKDEEALLCR 561
Query: 390 VQAQTAVPEGYKL 402
V Q + + ++L
Sbjct: 562 VATQHDLADAFQL 574
>gi|350296839|gb|EGZ77816.1| nuclear protein localization protein 4 [Neurospora tetrasperma FGSC
2509]
Length = 651
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 123/318 (38%), Gaps = 64/318 (20%)
Query: 142 QYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEE 194
+++N +R G +YG LE V V IYEPPQ +E+ + L E E
Sbjct: 264 KFLNPWRMTGCQRLGILYGKYLEYDVVPLGVKAVVEAIYEPPQ--VDEIDGVTLNAWENE 321
Query: 195 KLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE--- 240
K V+ +A G++ VG I+T + K D Y L+ +E+ A A+
Sbjct: 322 KDVNEVARLCGLEPVGVIWTDLLDAGKGDGSAVCKRHTDSYFLAAQEICFAARLQAQHPK 381
Query: 241 ----CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG------ 290
+ + + V +S I A+QMS+ V + + E G
Sbjct: 382 PTKWSDTGRFGSNFVTCVISGNEQGEISISAYQMSNDAVEMVRADIIEPSTDPGQMLVRE 441
Query: 291 ---DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST----- 333
DD +S+ + + + G +V+E F V + L H P S
Sbjct: 442 EEEDDGSVSRTRYIPEVFYRKINEYGANVQENAKPAFPVEYLFVTLTHGFPESPRPVFTN 501
Query: 334 --FPIENRT-------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
FPI NR + LK + +S +S+FHLL F+ + L+ D
Sbjct: 502 DGFPIANREFVGEAQEASAVAKILKVN-QKSDQF----DVSNFHLLCFIRQMSVLSKDEE 556
Query: 385 ALAQCVQAQTAVPEGYKL 402
AL V Q + + ++L
Sbjct: 557 ALLCRVATQHDLADAFQL 574
>gi|358388811|gb|EHK26404.1| hypothetical protein TRIVIDRAFT_167022 [Trichoderma virens Gv29-8]
Length = 593
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 125/311 (40%), Gaps = 68/311 (21%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G++YG E V V IYEPPQ +EV I + +K +D IA G
Sbjct: 273 QRLGYLYGRYTEYTEVPLGVKAVVEAIYEPPQ--VDEVDGISMNAWGNQKDIDHIAKLCG 330
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAEC-NMEEW------- 246
++ VG I+T + D Y LS+ E+ + A+ N +W
Sbjct: 331 LEPVGVIWTDLLDSGLGDGSVVCKRHLESYYLSSLEICFSSRLQAQYPNPSKWSDSGQFG 390
Query: 247 ---VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPK 294
VT V+ + E+G+ A+ A+QMS+ V + + E E DD
Sbjct: 391 SKFVTCVITGD--EDGNIAV--SAYQMSNESVEMVRADIVEPSADPNSMLVREEEEDDGS 446
Query: 295 LSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPIEN 338
+S+ + + V G V+E F V + L H P + FPIEN
Sbjct: 447 VSRTRYIPEVFYRKVNEYGAKVQENAKPAFPVEYLFVTLTHGFPADPKPTFTQTGFPIEN 506
Query: 339 RTT-------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQ 391
R +AL + PS + +S+FH+L F+ + L+ D AL V
Sbjct: 507 REYIGESQEHSALAKALNLRKGQKPS-DNGQEVSNFHILCFIHQLGVLSKDEEALLCRVA 565
Query: 392 AQTAVPEGYKL 402
Q + + ++L
Sbjct: 566 TQHDLADSFQL 576
>gi|7504790|pir||T15260 hypothetical protein F59E12.4 - Caenorhabditis elegans
Length = 601
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 81/187 (43%), Gaps = 35/187 (18%)
Query: 119 VTRQENPHCESVSFDRDCADAFQQYVNETLAF----AVKRGGFMYGTVLEDKRVEVNF-- 172
+ RQ+ H +++ + Q+ VN+ L + +R GF+ G V +
Sbjct: 192 LNRQKFRHVDNIQIEN------QELVNQFLDYWRLSGHQRVGFLIGQYQPHLEVPLGIKA 245
Query: 173 ----IYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQN-------- 220
IYEPPQ E+ + L D+ + K +D + LG+++VG+IFT N
Sbjct: 246 TVAAIYEPPQHCREDGIEFLEDKNQ-KTIDNLLEMLGLQRVGWIFTDCWTANSAEGTVHY 304
Query: 221 ---KKDYTLSNREVLQAVEFH------AECNME-EWVTAVVKLEVSEEGSAAIHFEAFQM 270
K + LS E + A E +M+ + + V + S + S ++F +Q+
Sbjct: 305 TRHKDSFFLSAEECITAAMLQNQHPNITEYSMDRHYGSKFVTVVASGDESMHVNFHGYQV 364
Query: 271 SDMCVRL 277
S+ C +
Sbjct: 365 SNQCAAM 371
>gi|336464738|gb|EGO52978.1| hypothetical protein NEUTE1DRAFT_105928 [Neurospora tetrasperma
FGSC 2508]
Length = 653
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 123/318 (38%), Gaps = 64/318 (20%)
Query: 142 QYVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEE 194
+++N +R G +YG LE V V IYEPPQ +E+ + L E E
Sbjct: 266 KFLNPWRMTGCQRLGILYGKYLEYDVVPLGVKAVVEAIYEPPQ--VDEIDGVTLNAWENE 323
Query: 195 KLVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE--- 240
K V+ +A G++ VG I+T + K D Y L+ +E+ A A+
Sbjct: 324 KDVNEVARLCGLEPVGVIWTDLLDAGKGDGSAICKRHTDSYFLAAQEICFAARLQAQHPK 383
Query: 241 ----CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEG------ 290
+ + + V +S I A+QMS+ V + + E G
Sbjct: 384 PTKWSDTGRFGSNFVTCVISGNEQGEISISAYQMSNDAVEMVRADIIEPSTDPGQMLVRE 443
Query: 291 ---DDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST----- 333
DD +S+ + + + G +V+E F V + L H P S
Sbjct: 444 EEDDDGSVSRTRYIPEVFYRKINEYGANVQENAKPAFPVEYLFVTLTHGFPESPRPVFTN 503
Query: 334 --FPIENRT-------TQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVP 384
FPI NR + LK + +S +S+FHLL F+ + L+ D
Sbjct: 504 DGFPIANREFVGEAQEASAVAKILKVN-QKSDQF----DVSNFHLLCFIRQMSVLSKDEE 558
Query: 385 ALAQCVQAQTAVPEGYKL 402
AL V Q + + ++L
Sbjct: 559 ALLCRVATQHDLADAFQL 576
>gi|403213522|emb|CCK68024.1| hypothetical protein KNAG_0A03410 [Kazachstania naganishii CBS
8797]
Length = 609
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 85/321 (26%), Positives = 128/321 (39%), Gaps = 67/321 (20%)
Query: 142 QYVNETLAFAVKRGGFMYGTV-------LEDKRVEVNFIYEPPQQGTEEVLYILRDE--E 192
Q++ + ++R +MYGT L K V V IYEPPQ ++ L + D+ +
Sbjct: 259 QFIESWRSTGMQRFAYMYGTYSRYDSTPLGIKAV-VEAIYEPPQHDEQDGLTMDMDQVTK 317
Query: 193 EEKLVDAIAAGLGMKKVGFIFTQT-----------IMQNKKDYTLSNREVLQAVEF---H 238
E VDA+A +G+ +VGFIFT ++K Y L++ E + A H
Sbjct: 318 EMDAVDALAFQMGLFRVGFIFTDLTDAGQGDGSVFCKRHKDSYFLTSFETIMAARHQMAH 377
Query: 239 AE-CNMEE-------WVTAVVKLEVSEE---GSAAIHFEAFQMSDMCVRLFKEGWFETEI 287
A C E +VT VV + E S + +A + D + I
Sbjct: 378 ANACKFSERGTFSSKFVTCVVSGNLEGEIDIASYQVSIDAEALVDAQMISGSTHPSMAYI 437
Query: 288 AEGDD----PKLSKMKKDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------- 333
E D P+L MKK+ G VKE F V ++ L H P S
Sbjct: 438 NETTDERYVPELFYMKKNEY--GLTVKENAKPAFPVDYLIVSLTHGFPKDSAAAAAAGAT 495
Query: 334 ---------FPIENRTT---QVTMRALKSHLNRSPSLP----LVKRISDFHLLLFLARFL 377
FP NR + LK +L + + L K++S+FH+LL++
Sbjct: 496 AGKFATTMGFPWANRQIMGHSQDYQELKKYLEAATTAGDYNLLRKKLSNFHVLLYIHSLQ 555
Query: 378 DLNSDVPALAQCVQAQTAVPE 398
L+ + +L A A PE
Sbjct: 556 ILSHEEWSLLIQAAASGASPE 576
>gi|156082071|ref|XP_001608528.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148801099|gb|EDL42504.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 505
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 65/133 (48%), Gaps = 6/133 (4%)
Query: 283 FETEIAEGDDPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQ 342
F TE+ ++ L K K D+++ VK++D F + V IL H+ +S FP +
Sbjct: 369 FNTEVDLMNELYL-KCKSDILIEKIQVKQIDVLFCVNNVPILSHKSLYNSFFPYPKNSNY 427
Query: 343 VTMRALKSHLNRS--PSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQT---AVP 397
+ ++++RS +V DF+ LLFL + D+P + + V + +P
Sbjct: 428 YAILQRFNYMSRSLRNQKDMVSIFRDFNFLLFLTNIFSFDRDIPPICKAVNDASNAVGIP 487
Query: 398 EGYKLLINSMAET 410
+ Y ++ S++++
Sbjct: 488 DQYVSILRSLSQS 500
>gi|308487254|ref|XP_003105823.1| hypothetical protein CRE_17909 [Caenorhabditis remanei]
gi|308255279|gb|EFO99231.1| hypothetical protein CRE_17909 [Caenorhabditis remanei]
Length = 523
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 77/165 (46%), Gaps = 29/165 (17%)
Query: 141 QQYVNETLAF----AVKRGGFM---YGTVLE---DKRVEVNFIYEPPQQGTEEVLYILRD 190
Q+ VN+ L + +R G++ Y T LE + V IYEPPQ E+ + L D
Sbjct: 132 QELVNQFLDYWRLSGHQRVGYLIGQYQTHLEVPLGIKATVAAIYEPPQHCREDGIEFLED 191
Query: 191 EEEEKLVDAIAAGLGMKKVGFIFTQT-----------IMQNKKDYTLSNREVLQAVEF-H 238
+ K+VD + LG+++VG+IFT ++K + LS E + A +
Sbjct: 192 KNA-KIVDKLLEMLGLQRVGWIFTDCWTAARAEGTVHYTRHKDSFFLSAEECITAGMLQN 250
Query: 239 AECNMEEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
A N+ ++ + V + S + S ++F +Q+S+ C +
Sbjct: 251 AHPNITDYSMDRHYGSKFVTVVASGDESMHVNFHGYQVSNQCAAM 295
>gi|156836568|ref|XP_001642339.1| hypothetical protein Kpol_242p4 [Vanderwaltozyma polyspora DSM
70294]
gi|190359919|sp|A7TTC4.1|NPL4_VANPO RecName: Full=Nuclear protein localization protein 4
gi|156112844|gb|EDO14481.1| hypothetical protein Kpol_242p4 [Vanderwaltozyma polyspora DSM
70294]
Length = 568
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 49/87 (56%), Gaps = 10/87 (11%)
Query: 137 ADAFQQYVNETLAFAVKRGGFMYG-------TVLEDKRVEVNFIYEPPQQGTEEVLYILR 189
+D Q++ A ++R G++YG T L K V V+ IYEPPQ ++ L +
Sbjct: 231 SDLINQFIESWRATGMQRFGYLYGSYEKYDSTPLGVKAV-VHAIYEPPQHDEQDGLTMDL 289
Query: 190 DEEEEKL--VDAIAAGLGMKKVGFIFT 214
++ EE++ VD IA +G+ +VG IF+
Sbjct: 290 EQVEEEMQKVDQIAMSMGLLRVGLIFS 316
>gi|380478569|emb|CCF43518.1| nuclear protein localization protein 4 [Colletotrichum
higginsianum]
Length = 649
Score = 45.8 bits (107), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 123/307 (40%), Gaps = 60/307 (19%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R G MYG E + V V IYEPPQ +E+ I L E +K VD IA G
Sbjct: 280 QRLGVMYGRYTEYEVVPLGIKAVVEAIYEPPQ--VDEMDGISLNAWENQKDVDEIARLCG 337
Query: 206 MKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNMEEWV 247
+++VG IFT + D Y LS++E+ A+ + ++
Sbjct: 338 LQQVGVIFTDLLDAGAGDGSVVCKRHADSYFLSSQEICFISRLQAQHPKPTKWSDTGKFG 397
Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
+ V +S S I ++Q S+ V + + E E DD S+
Sbjct: 398 SNFVTCIISGNESGEISISSYQASNDAVEMVRADIVEPSADPNAMLVREEEEDDGSTSRT 457
Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENRTT- 341
+ + + G +V+E F V + L H P + FPIENR
Sbjct: 458 RYIPEVFYRRINEYGANVQENAKPSFPVEYLFVTLTHGFPDEAKPAFTQLGFPIENREYM 517
Query: 342 ------QVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTA 395
R LK + P+ ++ +S+FHLL F+ + L+ D +L V +Q
Sbjct: 518 GESQEHSAAARILKFSGGQKPAEGGLE-MSNFHLLCFIRQMGILSKDEESLLCRVASQHD 576
Query: 396 VPEGYKL 402
+ + Y+L
Sbjct: 577 LADTYQL 583
>gi|440465534|gb|ELQ34853.1| hypothetical protein OOU_Y34scaffold00744g17 [Magnaporthe oryzae
Y34]
gi|440481024|gb|ELQ61651.1| hypothetical protein OOW_P131scaffold01166g17 [Magnaporthe oryzae
P131]
Length = 683
Score = 45.8 bits (107), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 84/320 (26%), Positives = 125/320 (39%), Gaps = 82/320 (25%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEK 195
++N +R G MYG E + V V IYEPPQ +EV + L + EK
Sbjct: 274 FINTWRRTGGQRYGIMYGKYSEYEEVPLGIKAVVQAIYEPPQ--VDEVDGVSLNSWDNEK 331
Query: 196 LVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNM 243
V+ +A G++ VG I+T + D Y LS+ EV A A+
Sbjct: 332 DVNQVARLCGLEPVGAIWTDLLDAGAGDGSVVCKRHADSYFLSSLEVCFAARLQAQHPKP 391
Query: 244 EEW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI------ 287
+W VT ++ +E+G AI ++Q+S+ V + + E
Sbjct: 392 SKWSDTGRFGSNFVTCII--SGNEQGEIAI--SSYQVSNEAVEMVRADIMEPSADPTVML 447
Query: 288 ---AEGDDPKLSKMK-------KDVVVGGKDVKE-------VDNDFFLVVVKILDHQGPL 330
E DD S+ + + + G +V+E V+ F + D P+
Sbjct: 448 VREEEEDDGSTSRTRYIPDVFYRRINEYGANVQENAKPSFPVEYLFVTLTHGFPDVAKPM 507
Query: 331 SS---TFPIENRTT-------QVTMRALKSHLNRSP-SLPLVKRISDFHLLLFL------ 373
S FPIENR +ALK H S S ++S+FHLL FL
Sbjct: 508 FSDEGAFPIENREYMGESQEHSAAAKALKVHEKASSGSSKDGMKVSNFHLLCFLHQMSVL 567
Query: 374 --ARFLDLNSDVPALAQCVQ 391
A FL L+ L +C Q
Sbjct: 568 SKASFLRLD-----LLRCFQ 582
>gi|154324156|ref|XP_001561392.1| hypothetical protein BC1G_00477 [Botryotinia fuckeliana B05.10]
Length = 608
Score = 45.4 bits (106), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 75/177 (42%), Gaps = 35/177 (19%)
Query: 149 AFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIA 201
A +R G++YG E V V IYEPPQ +E+ + L + E EK +D IA
Sbjct: 270 ATGAQRFGYLYGRYEEYTEVPLGVKAVVEAIYEPPQ--VDELDGVTLNEWESEKDIDEIA 327
Query: 202 AGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-------CNM 243
GM++VG I+T I D Y LS+ E+ A A+ +
Sbjct: 328 RLCGMERVGVIWTDLIDSGVGDGTVLCKRHIDSYYLSSLEIAFAARLQAKHPKPTKWSDT 387
Query: 244 EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI--------AEGDD 292
++ + V V+ + + I A+Q+S+ V + + E +EGDD
Sbjct: 388 GKFGSNFVTCVVTADENGGIAISAYQVSNSAVEMVRADIVEPSADPAVMIVRSEGDD 444
>gi|402078068|gb|EJT73417.1| hypothetical protein GGTG_10255 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 677
Score = 45.4 bits (106), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 123/325 (37%), Gaps = 60/325 (18%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKL 196
++N +R G +YG E V V IYEPPQ E L + EK
Sbjct: 275 FINAWRVSGGQRFGILYGKYTEYDVVPLGIKAVVEAIYEPPQV-CETDGVSLNPWDNEKD 333
Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNME 244
V +A G++ VG ++T + D Y LS+ EV A A+
Sbjct: 334 VHEVARFCGLEPVGAVWTDLVDAGAGDGSVICKRHADSYYLSSLEVCFAARMQAQHPKSS 393
Query: 245 EWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AE 289
+W + + +S I A+Q+S+ V + + E E
Sbjct: 394 KWSDTGRFGSNFITCVISGNEQGEISISAYQVSNDAVEMVRADIVEPSADPNVMLVREEE 453
Query: 290 GDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSST 333
DD S+ + + + G +V+E F V + L H P S
Sbjct: 454 EDDGSQSRTRYIPEVFYRRINEYGANVQENAKPSFPVEYLYVTLTHGFPDAAKPIFTDSG 513
Query: 334 FPIENRTTQVTMRALKSHLNRSPSLPLVKR-------ISDFHLLLFLARFLDLNSDVPAL 386
FPIENR M + H + S +L + ++ +SDFHLL FL + L+ D L
Sbjct: 514 FPIENREY---MGESQEHSSASKALKVHQKADTGGLQVSDFHLLCFLHQMGVLSKDEENL 570
Query: 387 AQCVQAQTAVPEGYKLLINSMAETV 411
V Q + + ++L + +T+
Sbjct: 571 LCRVATQRDLADAFQLRSTTGWQTL 595
>gi|223590104|sp|A5DBC9.2|NPL4_PICGU RecName: Full=Nuclear protein localization protein 4
gi|190344745|gb|EDK36486.2| hypothetical protein PGUG_00584 [Meyerozyma guilliermondii ATCC
6260]
Length = 564
Score = 45.4 bits (106), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 109/272 (40%), Gaps = 57/272 (20%)
Query: 151 AVKRGGFMYGTV-LEDK-----RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
V+R GFM+G + DK + V IYEPPQ G + + +L E +E LV +A L
Sbjct: 285 GVQRFGFMFGRYEVFDKVPLGIKAVVEAIYEPPQAGETDGITLLPWENQE-LVLKVAEKL 343
Query: 205 GMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAVEFHAE-------CNMEEW 246
+ VG FT T++ ++K Y LS EVL A F E N +
Sbjct: 344 NLYPVGIAFTDLTDSGARNGTVLCKRHKDTYFLSCLEVLMAARFQIEHPNITKHSNSGRF 403
Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMS---------DMCVRLFKEGWFETEIAEGDD----- 292
+ V VS I +FQ+S D+ + ++G
Sbjct: 404 SSKFVTCVVSGGLEGEIEPRSFQVSTNAEALVRADIITGSTQPSMLYINSSQGKRYVPDV 463
Query: 293 --PKLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSST------FPIENRTTQVT 344
K+++ +V K VD FL+V L PL+ T F IENR
Sbjct: 464 FYSKINEYGLEVKTNAKPAFPVD---FLLVT--LSDAFPLNPTPRFTNGFTIENRDFMGN 518
Query: 345 ---MRALKSHLNRSPSLPLVKRISDFHLLLFL 373
+RA ++N P +S+FH +++L
Sbjct: 519 LQDLRAAYRYINSDPGNGSC--LSNFHFIVYL 548
>gi|402078067|gb|EJT73416.1| hypothetical protein, variant [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 603
Score = 45.4 bits (106), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 123/325 (37%), Gaps = 60/325 (18%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKL 196
++N +R G +YG E V V IYEPPQ E L + EK
Sbjct: 275 FINAWRVSGGQRFGILYGKYTEYDVVPLGIKAVVEAIYEPPQV-CETDGVSLNPWDNEKD 333
Query: 197 VDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNME 244
V +A G++ VG ++T + D Y LS+ EV A A+
Sbjct: 334 VHEVARFCGLEPVGAVWTDLVDAGAGDGSVICKRHADSYYLSSLEVCFAARMQAQHPKSS 393
Query: 245 EWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AE 289
+W + + +S I A+Q+S+ V + + E E
Sbjct: 394 KWSDTGRFGSNFITCVISGNEQGEISISAYQVSNDAVEMVRADIVEPSADPNVMLVREEE 453
Query: 290 GDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSST 333
DD S+ + + + G +V+E F V + L H P S
Sbjct: 454 EDDGSQSRTRYIPEVFYRRINEYGANVQENAKPSFPVEYLYVTLTHGFPDAAKPIFTDSG 513
Query: 334 FPIENRTTQVTMRALKSHLNRSPSLPLVKR-------ISDFHLLLFLARFLDLNSDVPAL 386
FPIENR M + H + S +L + ++ +SDFHLL FL + L+ D L
Sbjct: 514 FPIENREY---MGESQEHSSASKALKVHQKADTGGLQVSDFHLLCFLHQMGVLSKDEENL 570
Query: 387 AQCVQAQTAVPEGYKLLINSMAETV 411
V Q + + ++L + +T+
Sbjct: 571 LCRVATQRDLADAFQLRSTTGWQTL 595
>gi|320588312|gb|EFX00781.1| endoplasmic reticulum and nuclear membrane proteinc [Grosmannia
clavigera kw1407]
Length = 649
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 76/322 (23%), Positives = 123/322 (38%), Gaps = 68/322 (21%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDE-EEEK 195
++N +R G++YG + E V V IYEPPQ T+E+ + D +E+
Sbjct: 281 FINAWRKSGSQRFGYLYGRLDEYDEVPLGVKAVVEAIYEPPQ--TDELDGLTLDAWSQEQ 338
Query: 196 LVDAIAAGLGMKKVGFIFTQTIMQNKKD-----------YTLSNREVLQAVEFHAE-CNM 243
VD +A G++K+G ++T D Y LS+ EV+ A A+
Sbjct: 339 AVDGVARLCGLEKLGVVWTDLTDAGAGDGTVVCKRHADSYYLSSLEVVFAARLQAQHPKA 398
Query: 244 EEWV------TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFE---------TEIA 288
W + V +S + I A+QMS+ V + + E
Sbjct: 399 SRWSDTGRFGSDFVTCLISGNEAGEIGIAAYQMSNEAVEMVRADIVEPSAEPETMLVRAE 458
Query: 289 EGDDPKLSKMK-------KDVVVGGKDVKE-------VDNDFFLVVVKILDHQGPLSS-- 332
E DD S+ + + + G +V+E V+ F + D PL +
Sbjct: 459 EEDDGASSRTRYIPEVFYRHINEYGANVQENAKPAFPVEYLFVTLTHGFPDQARPLFTDN 518
Query: 333 --TFPIENR----------TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARFLDLN 380
FPIENR T++A L +S+FHLL L + L+
Sbjct: 519 LAGFPIENREHIGESQEHTAVARTLKATSGGQTTDGDL----SVSNFHLLCALHQMEVLS 574
Query: 381 SDVPALAQCVQAQTAVPEGYKL 402
D +L V Q + + ++L
Sbjct: 575 KDEESLLCRVATQHDLADKFQL 596
>gi|224002495|ref|XP_002290919.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972695|gb|EED91026.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 569
Score = 45.1 bits (105), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 53/215 (24%), Positives = 88/215 (40%), Gaps = 42/215 (19%)
Query: 189 RDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---YTLSNREVLQAVEFHAECNMEE 245
R E + +A LGM+ +G+IF+ +D + R+ + + E
Sbjct: 324 RKESATERAVRVANYLGMQPIGWIFSYADENRHEDGDALPVHGRDAIVGSKLQIE----- 378
Query: 246 WVTAVVKLEVSEEGSAAIHF---------EAFQMSDMCVRLFKEGWFETEIAEGDDPKLS 296
+KL E+G I EAFQ+SD+CV++ E + D KL+
Sbjct: 379 ----TMKLLGREDGGKFITLTLDGRLGATEAFQLSDVCVQMVAEDVLCVPVV---DEKLT 431
Query: 297 -----KMKKDVVVGGKDVKEVDNDFFLVVVKILDHQG--------PLSSTFPIENRTTQV 343
K+K V+V G++ K +D+ LV +L H G P+ + + T V
Sbjct: 432 STRFIKLKDPVLVSGEETKRLDSVLLLVNTAMLSHVGLYSGGEDAPVGGSVKKGSGTLLV 491
Query: 344 TMR-----ALKSHLNRSPSLPLVKRISDFHLLLFL 373
R AL++ L++++ DF LL L
Sbjct: 492 KTRKRILSALENSNGSGSDGKLLQQLCDFDTLLAL 526
>gi|346321034|gb|EGX90634.1| protein NPL4 [Cordyceps militaris CM01]
Length = 660
Score = 45.1 bits (105), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 118/306 (38%), Gaps = 58/306 (18%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAGLG 205
+R GF+YG E V V IYEPPQ +E+ + L E +K VD IA G
Sbjct: 272 QRLGFLYGRYAEYTEVPLGIKAVVEAIYEPPQ--VDEMDGVSLSAWENQKDVDYIAGLCG 329
Query: 206 MKKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEFHAE-------CNMEEWV 247
++ VG I+T + ++ Y LS+ E + A+ + ++
Sbjct: 330 LEPVGVIWTDLLDAGQSNGSVVCKRHADSYFLSSLETCFSARMQAQYPKATKWSDTGKFG 389
Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEI---------AEGDDPKLSKM 298
++ V VS I A+Q S+ V + + E E DD S+
Sbjct: 390 SSFVTCIVSGNEDGEIAISAYQASNDAVEMVRADIVEPSADPTQMLVRSEEEDDGSTSRT 449
Query: 299 K-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST-------FPIENRT-- 340
+ + V G +V+E F V + L H P FPIENR
Sbjct: 450 RYIPEVFFRRVNEYGANVQENAKPAFPVEYLFVTLTHGFPADPKPTFTQPGFPIENRQYI 509
Query: 341 -TQVTMRALKSHLNRSPSLPL---VKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAV 396
A+ L P + +S+FHLL FL + ++ D AL V +Q +
Sbjct: 510 GESQEHTAVAKALGMRPGQRTSENCQEVSNFHLLCFLHQMGTMSKDEEALLCKVASQHDL 569
Query: 397 PEGYKL 402
+ ++L
Sbjct: 570 ADSFQL 575
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 49/88 (55%), Gaps = 11/88 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPS--DL 58
MLLRIR DG+ R+TV+ +T E+ + + L V+ +T+S ++ SPS D
Sbjct: 1 MLLRIRGPDGMVRLTVEPT-MTFGEMGQQLLTHLPPTVDPKTIS------MSNSPSGGDA 53
Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
+ D+ N + + + + HG ++F++Y
Sbjct: 54 KRLGDIVNFE--IGRIGLKHGDLIFITY 79
>gi|146422541|ref|XP_001487207.1| hypothetical protein PGUG_00584 [Meyerozyma guilliermondii ATCC
6260]
Length = 564
Score = 44.7 bits (104), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 72/272 (26%), Positives = 110/272 (40%), Gaps = 57/272 (20%)
Query: 151 AVKRGGFMYGTV-LEDK-----RVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGL 204
V+R GFM+G + DK + V IYEPPQ G + + +L E +E LV +A L
Sbjct: 285 GVQRFGFMFGRYEVFDKVPLGIKAVVEAIYEPPQAGETDGITLLPWENQE-LVLKVAEKL 343
Query: 205 GMKKVGFIFTQ---------TIM--QNKKDYTLSNREVLQAVEFHAE-------CNMEEW 246
+ VG FT T++ ++K Y LS EVL A F E N +
Sbjct: 344 NLYPVGIAFTDLTDSGARNGTVLCKRHKDTYFLSCLEVLMAARFQIEHPNITKHSNSGRF 403
Query: 247 VTAVVKLEVSEEGSAAIHFEAFQMS---------DMCVRLFKEGWFETEIAEGDDP---- 293
+ V VS I +FQ+S D+ + ++G
Sbjct: 404 SSKFVTCVVSGGLEGEIEPRSFQVSTNAEALVRADIITGSTQPSMLYINSSQGKRYVPDV 463
Query: 294 ---KLSKMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSST------FPIENRTTQVT 344
K+++ +V K VD ++V +LD PL+ T F IENR
Sbjct: 464 FYLKINEYGLEVKTNAKPAFPVD----FLLVTLLD-AFPLNPTPRFTNGFTIENRDFMGN 518
Query: 345 ---MRALKSHLNRSPSLPLVKRISDFHLLLFL 373
+RA ++N P +S+FH +++L
Sbjct: 519 LQDLRAAYRYINSDPGNGSC--LSNFHFIVYL 548
>gi|452979305|gb|EME79067.1| hypothetical protein MYCFIDRAFT_79805 [Pseudocercospora fijiensis
CIRAD86]
Length = 638
Score = 44.7 bits (104), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 74/179 (41%), Gaps = 31/179 (17%)
Query: 126 HCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYG------TVLEDKRVEVNFIYEPPQQ 179
H E SFD ++++ V+R GF+YG V + V IYEPPQ
Sbjct: 251 HVEFQSFD-----IVNRFIDFWRKSGVQRIGFLYGRYEAYDQVPLGTKAVVEAIYEPPQ- 304
Query: 180 GTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD------------YTLS 227
G E + + E+E +D +A GM+++G +FT I + Y LS
Sbjct: 305 GDEVDGVTMTEWEKEAQIDKVAELTGMQRLGVVFTDLISPENPEEGAAICKRHVDSYFLS 364
Query: 228 NREVLQAVEFHAE-------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFK 279
+ EV A + A+ ++ + V +S + I A+Q S+ V + +
Sbjct: 365 SLEVCFASRYQAKYPRPSKWSETGQFGSNFVTCVISGDSEGQIGIAAYQASNAAVEMVR 423
>gi|389582714|dbj|GAB65451.1| hypothetical protein PCYB_061830, partial [Plasmodium cynomolgi
strain B]
Length = 521
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 28/119 (23%), Positives = 59/119 (49%), Gaps = 5/119 (4%)
Query: 297 KMKKDVVVGGKDVKEVDNDFFLVVVKILDHQGPLSSTFPIENRTTQVTMRALKSHLNRS- 355
K K V++ +VK++D F + + I+ H+ ++ FP N + T+ ++++RS
Sbjct: 398 KCKSAVLIEKIEVKQIDVLFCVNNMPIVSHKSFYNTFFPYPNNSNYYTILQRFNYMSRSL 457
Query: 356 -PSLPLVKRISDFHLLLFLARFLDLNSDVPALAQCV---QAQTAVPEGYKLLINSMAET 410
+V DF+ LLFL D+P + + V + +P+ Y ++ S++++
Sbjct: 458 HNQKDMVNIFRDFNFLLFLTNIFSFERDIPPICKAVNDSKNSVGIPDQYVSILRSLSQS 516
>gi|449299195|gb|EMC95209.1| hypothetical protein BAUCODRAFT_35205 [Baudoinia compniacensis UAMH
10762]
Length = 596
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 120/292 (41%), Gaps = 75/292 (25%)
Query: 151 AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEEEEKLVDAIAAG 203
+R G++YG E + V V IYEPPQ +E+ + L + + E +D +A+
Sbjct: 273 GAQRVGYLYGHYEEYEVVPLGTKAVVEAIYEPPQ--ADEIDGVTLGEWDNEDSIDELASK 330
Query: 204 LGMKKVGFIFTQTIMQNKK------------DYTLSNREVLQAVEFHAEC-NMEEW---- 246
G+++VG IFT + + + LS+ E+L A + A+ +W
Sbjct: 331 CGLQRVGVIFTDLVRPDDPAQGNAICKRHIDSFFLSSLEILFASRYQAKYPRPSKWSETG 390
Query: 247 ------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIA--------EGDD 292
VT VV + EEG AI A+Q S+ V + + E E +D
Sbjct: 391 KFGSNFVTCVVSGD--EEGQIAI--SAYQASNSAVEMVRADIIEPSAEPSVMLVQDEDED 446
Query: 293 PKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------LSSTFPI 336
+L +++ + + G+ V+E F V ++ L H P + + F I
Sbjct: 447 NELGRVRYIPEVFYRRINEYGRSVQENAKPSFPVEYLLVTLTHGFPNEAAPVFVQNKFAI 506
Query: 337 ENR------------TTQVTMRALKSHLNRSPSLPLVKRISDFHLLLFLARF 376
ENR TQ+ + K L+ V +SDFHLL F+ +
Sbjct: 507 ENREAIGEAQEPRDLQTQLKAGSNKVALDTPQG---VLAVSDFHLLCFVHQL 555
>gi|256077991|ref|XP_002575282.1| nuclear protein localization [Schistosoma mansoni]
gi|256077993|ref|XP_002575283.1| nuclear protein localization [Schistosoma mansoni]
gi|350644612|emb|CCD60660.1| nuclear protein localization, putative [Schistosoma mansoni]
gi|350644613|emb|CCD60661.1| nuclear protein localization, putative [Schistosoma mansoni]
Length = 692
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 68/157 (43%), Gaps = 33/157 (21%)
Query: 170 VNFIYEPPQQGTEEVLYILR--DEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD---- 223
V IYEP Q+ T + + ++ D K V +A +G++ VG+IFT + QN
Sbjct: 277 VAVIYEPAQESTPKSVKLIGPLDSVLPKHVMEVAQRIGLQPVGWIFTDLVAQNSNGNGVV 336
Query: 224 ---------YTLSNREVLQAVEF---HAE--------CNMEEWVTAVVKLEVSEEGSAAI 263
+ LS E + A H C ++ T VV + + S I
Sbjct: 337 KHFRGTVDTFFLSAEECITAAHLQNLHPNICRLSPDGCFGSKFTTVVV----TGDASNHI 392
Query: 264 HFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKK 300
HFEA+Q+S+ + L K+ + D P+L +K+
Sbjct: 393 HFEAYQVSNQAMALVKDNII---VPTYDAPELGYVKE 426
>gi|365985301|ref|XP_003669483.1| hypothetical protein NDAI_0C05810 [Naumovozyma dairenensis CBS 421]
gi|343768251|emb|CCD24240.1| hypothetical protein NDAI_0C05810 [Naumovozyma dairenensis CBS 421]
Length = 587
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 74/293 (25%), Positives = 119/293 (40%), Gaps = 60/293 (20%)
Query: 151 AVKRGGFMYG-------TVLEDKRVEVNFIYEPPQQGTEE--VLYILRDEEEEKLVDAIA 201
++R FMYG T L K + + IYEPPQ ++ L I + +E +D +A
Sbjct: 263 GMQRFAFMYGSYEKYDSTPLGIKAI-IEAIYEPPQHDEQDGLTLDIEQTRQEMDEIDKLA 321
Query: 202 AGLGMKKVGFIFTQT-----------IMQNKKDYTLSNREVLQAVEFHA-ECNMEEW--- 246
+G+ +VG IFT ++K + LS+ EV+ A ++ N+ ++
Sbjct: 322 KTMGLSRVGVIFTDLTDAGNGDGSVFCKRHKDSFFLSSLEVIMAAKYQKRHPNVSKYSEQ 381
Query: 247 ---VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFK----EGWFETEIAEGDD------- 292
+ V VS I ++Q+S L G +A ++
Sbjct: 382 GIFSSKFVTCVVSGNTEGEIDISSYQVSTAAEALVDATMISGSTHPSMAYINETTNDRYV 441
Query: 293 PKLSKMKKDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST----------FPIENRT 340
P++ MKK+ G VKE F V ++ L H P+ T FP NR
Sbjct: 442 PEIFYMKKNEY--GLTVKENAKPAFPVDYLLVSLTHGFPIDDTVSGKFRTIQGFPWANRQ 499
Query: 341 T---QVTMRALKSHLNRSPSLP----LVKRISDFHLLLFLARFLDLNSDVPAL 386
+ + LK ++ +S L RIS+FH LL++ L+ D AL
Sbjct: 500 SMGQSQDYQELKKYIYQSAIAGDFNVLHDRISNFHFLLYVHTLQILSEDEWAL 552
>gi|429240358|ref|NP_595883.2| Cdc48-Npl4-Ufd1 complex component Npl4 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|384872595|sp|Q9P780.2|NPL4_SCHPO RecName: Full=Nuclear protein localization protein 4
gi|347834321|emb|CAB88240.2| Cdc48-Npl4-Ufd1 complex component Npl4 (predicted)
[Schizosaccharomyces pombe]
Length = 572
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 43/187 (22%), Positives = 82/187 (43%), Gaps = 37/187 (19%)
Query: 143 YVNETLAFAVKRGGFMYGTVLEDKRVEVNF------IYEPPQQGTEEVLYILRDEEEEKL 196
++N+ +R G+ YG + V + IYEPPQ + + L + +E L
Sbjct: 253 FLNKWRQSGFQRIGYTYGHFEQYNNVPLGIKGVIEAIYEPPQVSEADGV-TLEEWADEAL 311
Query: 197 VDAIAAGLGMKKVGFIFT----------QTIMQNKKD-YTLSNREVLQAVEFHAEC-NME 244
V+ +A G++++G IFT + + + D Y LS+ EV + F + N
Sbjct: 312 VEQVATACGLRRIGIIFTDLTDDGSNSGKVLCKRHSDSYFLSSLEVYNSANFQTKFKNPC 371
Query: 245 EW----------VTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPK 294
+W VT+V+ ++ E I ++Q+S++ L++ + + DP
Sbjct: 372 KWSRSGYFGSKFVTSVISGNLNGE----IEVMSYQVSNIGTALYQADLIQPSV----DPD 423
Query: 295 LSKMKKD 301
+KK+
Sbjct: 424 RMLVKKE 430
>gi|300120945|emb|CBK21187.2| unnamed protein product [Blastocystis hominis]
Length = 444
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 37/169 (21%), Positives = 77/169 (45%), Gaps = 17/169 (10%)
Query: 169 EVNFIYE---PPQQGTEEVLYILRDEEEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKDYT 225
E+ F+Y P Q+ + +L + K DA+A L ++ +G I+T+ K+ T
Sbjct: 185 EIAFVYSLYIPKQRFINGKIELLASDPYRKAADAVARRLDLRIIGAIYTRP--SQKERIT 242
Query: 226 LSNREVLQAVEFHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFET 285
+ +++ +++ E ++ + V + V S I A+ +SD C+ ++++G E+
Sbjct: 243 AGDVDLIAKLQW-GERTRGDFCSRFVSVVVERNESREIEPFAYMLSDQCMCMYRDGMIES 301
Query: 286 EIAEGDDPKLSK-----------MKKDVVVGGKDVKEVDNDFFLVVVKI 323
D + ++ D +G ++V E + FFLV + I
Sbjct: 302 PRKGSDLVHVRNRTPAGELLSVVIRNDAKLGSQEVTEFEAIFFLVELTI 350
>gi|254584574|ref|XP_002497855.1| ZYRO0F15048p [Zygosaccharomyces rouxii]
gi|238940748|emb|CAR28922.1| ZYRO0F15048p [Zygosaccharomyces rouxii]
Length = 572
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 50/101 (49%), Gaps = 11/101 (10%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQ 60
M+LRIRS++G++RV+ D + ++ I+ S L V T++ A P L
Sbjct: 1 MILRIRSKEGMQRVSCDSGDLFGTVVEKIL-SHLSPSVNPATITVGSQESTANEPIAGLV 59
Query: 61 FTDMANPDRPLSSLNISHGSIVFLSY---DGERNVRGPSFN 98
+ R ++ L ++HG IVF+ Y +GE+ S N
Sbjct: 60 Y-------RSVADLGLNHGDIVFVKYQEKEGEQPKEAASVN 93
>gi|367009150|ref|XP_003679076.1| hypothetical protein TDEL_0A05330 [Torulaspora delbrueckii]
gi|359746733|emb|CCE89865.1| hypothetical protein TDEL_0A05330 [Torulaspora delbrueckii]
Length = 583
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 137 ADAFQQYVNETLAFAVKRGGFMYG-------TVLEDKRVEVNFIYEPPQQGTEEVLYILR 189
+D +++ +R G+MYG T L K V V IYEPPQ ++ L +
Sbjct: 243 SDLINEFIEAWRDTGTQRFGYMYGSYQTYDATPLGTKAV-VEAIYEPPQHDEQDGLTMDM 301
Query: 190 DEEEEKL--VDAIAAGLGMKKVGFIFT 214
++ E+++ VD +A +G++++G IFT
Sbjct: 302 EQVEKEMDHVDNLAQQMGLQRLGLIFT 328
>gi|213403230|ref|XP_002172387.1| NPL4 [Schizosaccharomyces japonicus yFS275]
gi|212000434|gb|EEB06094.1| NPL4 [Schizosaccharomyces japonicus yFS275]
Length = 583
Score = 42.4 bits (98), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 37/151 (24%), Positives = 63/151 (41%), Gaps = 24/151 (15%)
Query: 153 KRGGFMYG------TVLEDKRVEVNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R G+MYG VL + V I +P Q + + + + E VD +AA G+
Sbjct: 273 QRIGYMYGRYQKYENVLLGIKAVVELIIDPAQVSESDGVTLQNPWDREAEVDKLAAACGL 332
Query: 207 KKVGFIFTQTI-----------MQNKKDYTLSNREVLQAVEFHAEC-NMEEWV------T 248
++VG IFT + ++ Y LS+ E + F ++ N +W +
Sbjct: 333 RRVGIIFTDLVDDGSGKGTVLCKRHAGSYFLSSLETYNSAYFQSQTPNPSKWSKSGHFGS 392
Query: 249 AVVKLEVSEEGSAAIHFEAFQMSDMCVRLFK 279
V +S I A+Q+S+ V L +
Sbjct: 393 KFVTCVLSGNEKGEIEVMAYQVSNTAVALLQ 423
>gi|366989755|ref|XP_003674645.1| hypothetical protein NCAS_0B01870 [Naumovozyma castellii CBS 4309]
gi|342300509|emb|CCC68271.1| hypothetical protein NCAS_0B01870 [Naumovozyma castellii CBS 4309]
Length = 579
Score = 42.0 bits (97), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 69/298 (23%), Positives = 127/298 (42%), Gaps = 67/298 (22%)
Query: 137 ADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEVNFIYEPPQQGTEEVLYILRD 190
++ Q++ ++R +MYG+ +E + V IYEPPQ ++ L + +
Sbjct: 239 SELINQFIESWRYTGMQRFAYMYGSYVEYDSTPLGIKALVEVIYEPPQHDEQDGLTMDME 298
Query: 191 E--EEEKLVDAIAAGLGMKKVGFIFTQT-----------IMQNKKDYTLSNREVLQAVEF 237
+ +E + D +AA +G+ ++G IFT ++K + LS+ EV+ + +
Sbjct: 299 QVTKEMEQCDKMAAQMGLSRIGLIFTDLTDAGNGDGSVFCKRHKDSFFLSSLEVIMSAKH 358
Query: 238 ---------HAECNM--EEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKE----GW 282
++E + ++VT VV + E I ++Q+S L G
Sbjct: 359 QLRHPNTSKYSEQGIFSSKFVTCVVSGNLEGE----IDISSYQVSTDAEALVDASMIGGS 414
Query: 283 FETEIAEGDD-------PKLSKMKKDVVVGGKDVKEVDNDFFLV--VVKILDHQGPLSST 333
+A ++ P++ M+K+ G VKE F V ++ L H P ++
Sbjct: 415 THPSMAYINETTKDRYVPEIFYMRKNEY--GLTVKENAKPAFPVDYLLVSLTHGFPRDAS 472
Query: 334 -----------FPIENRTT---QVTMRALKSHLNRSPSL----PLVKRISDFHLLLFL 373
FP NR + + LK +L R+ + L K+IS+FHLLL++
Sbjct: 473 ETTGKFLTVGGFPWANRQSMGQSQDYQELKKYLYRAATSGDFNELHKKISNFHLLLYI 530
>gi|397634455|gb|EJK71430.1| hypothetical protein THAOC_07131 [Thalassiosira oceanica]
Length = 540
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 58/119 (48%), Gaps = 13/119 (10%)
Query: 266 EAFQMSDMCVRLFKEGWFETEIAEGD--DPKLSKMKKDVVVGGKDVKEVDNDFFLVVVKI 323
EAFQ+SD+CV++ EG + + + ++ VV+ G++ ++D LV +
Sbjct: 381 EAFQLSDVCVQMVAEGALKAPTVDESLQGTRSMELTDPVVISGEETTKLDTVLLLVNTAM 440
Query: 324 LDHQG--------PLSSTFPIENRTTQV-TMRALKSHLNRSPSLPLVKRISDFHLLLFL 373
L H G P+ + + V T + L S L+RS S ++K + DF +LL L
Sbjct: 441 LSHVGLYSGGEDAPVGGSVKKSSGALLVKTRKRLLSALDRSNS--VLKELCDFDVLLAL 497
>gi|398397621|ref|XP_003852268.1| hypothetical protein MYCGRDRAFT_109593 [Zymoseptoria tritici
IPO323]
gi|339472149|gb|EGP87244.1| hypothetical protein MYCGRDRAFT_109593 [Zymoseptoria tritici
IPO323]
Length = 613
Score = 41.2 bits (95), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 122/293 (41%), Gaps = 66/293 (22%)
Query: 144 VNETLAF----AVKRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYI-LRDEE 192
VN+ L F +R GF+YG E +V V IYEPPQ +EV + L +
Sbjct: 262 VNKFLNFWRGGGAQRIGFLYGHYEEYDQVPLGTKAVVEAIYEPPQ--VDEVDGVTLSEWP 319
Query: 193 EEKLVDAIAAGLGMKKVGFIFT---------QTIMQNKK--DYTLSNREVLQAVEFHAE- 240
E ++ +A G++++G IFT QT + + Y LS+ E++ A + A+
Sbjct: 320 NEAQIEKLAGFCGLQRLGVIFTDLLRPEEGEQTAVCKRHIDSYFLSSLEIMFAARYQAQY 379
Query: 241 ------CNMEEWVTAVVKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETE-------- 286
++ + V +S + I A+Q S+ V + E
Sbjct: 380 PRPSKWSETGQFGSNFVTCVISGDAEGNIGISAYQASNAAVEMVTADIIEASADPSVMLV 439
Query: 287 IAEGDDPKLSKMK-------KDVVVGGKDVKEVDNDFFLV--VVKILDHQGP-------L 330
I E ++ + + + + + G++V+E F V ++ L H P +
Sbjct: 440 IDEDEENAIGRKRYIPEVFYRRLNEYGRNVQENAKPSFPVEYLLVTLTHGFPNEVNPKFV 499
Query: 331 SSTFPIENRTTQVTM---RALKSHL----NRSPSLPLVK---RISDFHLLLFL 373
F IENR + R L+ HL NR +L V+ +SDFHLL ++
Sbjct: 500 KCDFAIENRDALGELQEQRDLQKHLRAGANRV-ALDTVEGTLAVSDFHLLGYV 551
>gi|452841970|gb|EME43906.1| hypothetical protein DOTSEDRAFT_71645 [Dothistroma septosporum
NZE10]
Length = 596
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 52/210 (24%), Positives = 85/210 (40%), Gaps = 34/210 (16%)
Query: 98 NPAGSFGRKMTMDDLIAK----QMRVTRQENPHCESVSFDR-DCADAFQQYVNETLAFAV 152
NP G D + +K + + QE + V F+ D D F + + A
Sbjct: 215 NPECPGGHAPFPDGICSKCQPSAISLVPQEYRMVDHVEFETFDMVDKFIGFWRQGGA--- 271
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R GF+YG E +V V IYEPPQ + + L D E ++ +A G+
Sbjct: 272 QRMGFLYGHYEEYDQVPLGTKAVVEAIYEPPQVDEPDGV-TLGDWNNENEIEQLAELSGL 330
Query: 207 KKVGFIFTQTIMQNKKD------------YTLSNREVLQAVEFHAEC-NMEEWV------ 247
++VG I+T + Q + + Y LS+ EV A + A+ +W
Sbjct: 331 QRVGVIYTDLVQQTEPEKGLAVCKRHADSYYLSSLEVCFASRYQAKYPRPSKWSETGKFG 390
Query: 248 TAVVKLEVSEEGSAAIHFEAFQMSDMCVRL 277
+ V +S + I A+Q S+ V +
Sbjct: 391 SNFVTCVISGDSDGNIAISAYQASNAAVEM 420
>gi|312378145|gb|EFR24796.1| hypothetical protein AND_10383 [Anopheles darlingi]
Length = 194
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 31/49 (63%), Gaps = 3/49 (6%)
Query: 334 FPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISDFHLLLFLARFLDL 379
FP+ENR ++ AL S+L +S S+ + +SDFHLLLFL R D+
Sbjct: 49 FPVENRLIDGHIQDFSALASYLAKSRSMDFLDVVSDFHLLLFLYRMEDM 97
>gi|302653597|ref|XP_003018622.1| hypothetical protein TRV_07382 [Trichophyton verrucosum HKI 0517]
gi|291182280|gb|EFE37977.1| hypothetical protein TRV_07382 [Trichophyton verrucosum HKI 0517]
Length = 773
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 46/91 (50%), Gaps = 16/91 (17%)
Query: 14 VTVD-GAHVTVAELKTIIQSQLRIPVESQTLSTNQNLLLAKSPSDLLQFTDMANPDRPLS 72
VT+D G +T+A+LK +IQS+++IP +SQ L N L A+ +PL
Sbjct: 315 VTLDVGEDMTLADLKAVIQSEIQIPPQSQHLFHNNRPL--------------ADESKPLG 360
Query: 73 SLNISHGSIVFLSYDGERNVRGPS-FNPAGS 102
L IS G ++ + GP NP+G+
Sbjct: 361 QLGISEGDMLGMHIRVPTPASGPGQGNPSGA 391
>gi|297725717|ref|NP_001175222.1| Os07g0514801 [Oryza sativa Japonica Group]
gi|255677805|dbj|BAH93950.1| Os07g0514801 [Oryza sativa Japonica Group]
Length = 182
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%), Gaps = 1/38 (2%)
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVV-GGKDVKE 311
V LFK+G + EI + +DP LSKM+K+VVV GGKD E
Sbjct: 109 VALFKDGVLQAEIGDKEDPLLSKMRKEVVVAGGKDTME 146
>gi|453085771|gb|EMF13814.1| endoplasmic reticulum and nuclear membrane protein Npl4
[Mycosphaerella populorum SO2202]
Length = 644
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 23/121 (19%)
Query: 142 QYVNETLAFAVKRG----GFMYGTVLE------DKRVEVNFIYEPPQQGTEEVLYILRDE 191
Q V+ L F K G GF+YG E + V IYEPPQ + + + +
Sbjct: 259 QIVDRFLDFWRKSGAQRIGFLYGRYEEYDLVPLGTKAVVEAIYEPPQVNEPDGV-TMTEW 317
Query: 192 EEEKLVDAIAAGLGMKKVGFIFTQTIMQNKKD------------YTLSNREVLQAVEFHA 239
E E VD +A G+++VG IFT + + + Y LS+ EV A + A
Sbjct: 318 ENEAAVDKLAEFSGLQRVGIIFTDLLAPDSPEQGTAVCKRHVDSYFLSSLEVCLAARYQA 377
Query: 240 E 240
+
Sbjct: 378 K 378
>gi|363755074|ref|XP_003647752.1| hypothetical protein Ecym_7082 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891788|gb|AET40935.1| hypothetical protein Ecym_7082 [Eremothecium cymbalariae
DBVPG#7215]
Length = 573
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 78/338 (23%), Positives = 134/338 (39%), Gaps = 63/338 (18%)
Query: 117 MRVTRQENPHCESVSFDRDCADAFQQYVNETLAFAVKRGGFMYGTVLEDK------RVEV 170
+ + RQE + V F ++ ++++ ++R G++YGT + + V
Sbjct: 210 ISLQRQEFRMVDHVEFQH--SELVNEFIDSWRTTGMQRFGYLYGTYSKYTNTPLGIKAVV 267
Query: 171 NFIYEPPQQGTEEVLYILRDEEEEKL----VDAIAAGLGMKKVGFIFTQT---------- 216
I+EPPQ ++ L + D E+ L VD +A+ +G+ +VG IFT
Sbjct: 268 EAIWEPPQHDEQDGLTM--DIEQVALEVGHVDNLASEMGLMRVGMIFTDLTDSGNGDGTV 325
Query: 217 -IMQNKKDYTLSNREVLQAVEFHAE----CNMEE---WVTAVVKLEVSEEGSAAIHFEAF 268
++K + LS+ EV+ A + C + + + VS I A+
Sbjct: 326 FCKRHKDSFFLSSLEVIMAAKHQLNAPNACRFSQQSRFSSKFFTCVVSGNLEGEIDISAY 385
Query: 269 QMSDMCVRLFK----EGWFETEIAEGDDPKLSKMKKDVVVGGKD-----VKEVDNDFFLV 319
Q+S L G +A ++ ++ D+ K+ VKE F V
Sbjct: 386 QVSTDAEALVDSDIISGSTHPSMAYINETTHNRYVPDIFYMRKNEYNITVKENAKPAFPV 445
Query: 320 --VVKILDHQGPLSST-----------FPIENRTT---QVTMRALKSHL----NRSPSLP 359
++ L H P+ S FP NR LK ++ N S
Sbjct: 446 DYLIVSLTHGFPVPSASNAPKFHRHTGFPWTNRQALGQSQDYLELKRYILPTANSSDLFK 505
Query: 360 LVKRISDFHLLLFLARFLDLNSDVPALAQCVQAQTAVP 397
L +++S+FHLLL++ LNS +L + A T P
Sbjct: 506 LQQQLSNFHLLLYIHSLHILNSKEWSL--LLAATTCTP 541
>gi|170042583|ref|XP_001849000.1| nuclear protein localization 4 [Culex quinquefasciatus]
gi|167866113|gb|EDS29496.1| nuclear protein localization 4 [Culex quinquefasciatus]
Length = 192
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 330 LSSTFPIENRTTQVTMR---ALKSHLNRSPSLPLVKRISDFHLLLFLAR 375
+S FP+ENR ++ AL +L RS S+ + +SDFHLLLFL R
Sbjct: 45 VSQYFPVENRLIDGHIQDFSALSDYLARSRSMDFLDAMSDFHLLLFLYR 93
>gi|222637133|gb|EEE67265.1| hypothetical protein OsJ_24441 [Oryza sativa Japonica Group]
Length = 178
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%), Gaps = 1/38 (2%)
Query: 275 VRLFKEGWFETEIAEGDDPKLSKMKKDVVV-GGKDVKE 311
V LFK+G + EI + +DP LSKM+K+VVV GGKD E
Sbjct: 109 VALFKDGVLQAEIGDKEDPLLSKMRKEVVVAGGKDTME 146
>gi|302414408|ref|XP_003005036.1| NPL4 [Verticillium albo-atrum VaMs.102]
gi|261356105|gb|EEY18533.1| NPL4 [Verticillium albo-atrum VaMs.102]
Length = 250
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 11/88 (12%)
Query: 1 MLLRIRSRDGLERVTVDGAHVTVAELKTIIQSQLRIPVESQ--TLSTNQNLLLAKSPSDL 58
MLLR+R DG+ R+TVD + +L + L V+ TLS N N +K D+
Sbjct: 1 MLLRVRGPDGMSRLTVD-QNDNFGDLGRQLLPNLPTTVDPTTITLSNNPNGNDSKRLGDI 59
Query: 59 LQFTDMANPDRPLSSLNISHGSIVFLSY 86
F L + + HG ++F++Y
Sbjct: 60 ANFK--------LGQIGLKHGDLIFITY 79
>gi|401826979|ref|XP_003887582.1| nuclear pore protein [Encephalitozoon hellem ATCC 50504]
gi|392998588|gb|AFM98601.1| nuclear pore protein [Encephalitozoon hellem ATCC 50504]
Length = 505
Score = 38.9 bits (89), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 38/173 (21%), Positives = 74/173 (42%), Gaps = 32/173 (18%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R GF+ G +E + + V+ I+EP Q+ + I ++ + G G+
Sbjct: 201 QRFGFLIGKYMEHEMIPLGTKAVVSGIWEPDQEDYPDGFVITGP------IEDLFTGTGL 254
Query: 207 KKVGFIFTQTIMQNK--------KDYTLSNREV--------LQAVEFHAECNMEEWVTAV 250
VG I+T M+N +DY LS+ E+ + N E+ +
Sbjct: 255 GIVGMIYTDISMENGSVTSDKIIRDYFLSSLEIEFIAKMQLMHPYVLKDGGNEMEFGSRF 314
Query: 251 VKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDVV 303
V + + + I + +Q+S+ C+ L K W I ++PK+ ++D++
Sbjct: 315 VTIIATVDKDGNIGLQEYQVSNQCMALVKGSW----ILPTENPKMLLARRDIL 363
>gi|52081992|ref|YP_080783.1| transcriptional regulator Slr [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|319647856|ref|ZP_08002074.1| slr protein [Bacillus sp. BT1B_CT2]
gi|404490872|ref|YP_006714978.1| HTH-type transcriptional regulator SlrR [Bacillus licheniformis DSM
13 = ATCC 14580]
gi|423683996|ref|ZP_17658835.1| transcriptional regulator Slr [Bacillus licheniformis WX-02]
gi|52005203|gb|AAU25145.1| transcriptional regulator Slr [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|52349881|gb|AAU42515.1| HTH-type transcriptional regulator SlrR [Bacillus licheniformis DSM
13 = ATCC 14580]
gi|317390197|gb|EFV71006.1| slr protein [Bacillus sp. BT1B_CT2]
gi|383440770|gb|EID48545.1| transcriptional regulator Slr [Bacillus licheniformis WX-02]
Length = 152
Score = 38.9 bits (89), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 14/94 (14%)
Query: 181 TEEVLYILRDEEEE---KLVDAIAAGLGMKKVGFIFTQTIMQNKKDYTL-SNREVLQAVE 236
TE +LY ++E+E LV A+ AG+ +++ F FT + Q KK+ L NRE+ +A
Sbjct: 64 TEVILYRRTEQEDEWRTHLVQAVQAGMNQEEL-FQFTHRLKQKKKERALYRNRELTRA-- 120
Query: 237 FHAECNMEEWVTAVVKLEVSEEGSAAIHFEAFQM 270
N++EW + LE E G EAF M
Sbjct: 121 -----NIDEW--RALMLEAKEMGLTVKEVEAFLM 147
>gi|449329529|gb|AGE95800.1| protein involved in er translocation [Encephalitozoon cuniculi]
Length = 505
Score = 38.5 bits (88), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 79/172 (45%), Gaps = 32/172 (18%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R GF+ G ++ + + V+ I+EP Q+ + I +DA +G G+
Sbjct: 201 QRFGFLVGRYMDHEMIPLGTKAVVSGIWEPEQEDYPDGFVITGS------MDAPFSGTGL 254
Query: 207 KKVGFIFTQTIMQN--------KKDYTLSNREV-----LQAVEFHA--ECNME-EWVTAV 250
+ VG I+T +M+ K Y LS+ E+ +Q + H + E E+ + +
Sbjct: 255 EIVGMIYTDILMERGSVTSDKVAKGYFLSSLEIEFIAKMQLMHPHTVRDGGREMEFGSRL 314
Query: 251 VKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDV 302
+ V+ E +I + +Q+S+ C+ L K G+ I +DP+ +D+
Sbjct: 315 ATIVVTAEKDGSIGLQEYQVSNQCMALVKGGY----ILPTEDPRRFLATRDI 362
>gi|19074499|ref|NP_586005.1| PROTEIN INVOLVED IN ER TRANSLOCATION [Encephalitozoon cuniculi
GB-M1]
gi|19069141|emb|CAD25609.1| PROTEIN INVOLVED IN ER TRANSLOCATION [Encephalitozoon cuniculi
GB-M1]
Length = 505
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 79/172 (45%), Gaps = 32/172 (18%)
Query: 153 KRGGFMYGTVLEDKRVE------VNFIYEPPQQGTEEVLYILRDEEEEKLVDAIAAGLGM 206
+R GF+ G ++ + + V+ I+EP Q+ + I +DA +G G+
Sbjct: 201 QRFGFLVGRYMDHEMIPLGTKAVVSGIWEPEQEDYPDGFVITGS------MDAPFSGTGL 254
Query: 207 KKVGFIFTQTIMQN--------KKDYTLSNREV-----LQAVEFHA--ECNME-EWVTAV 250
+ VG I+T +M+ K Y LS+ E+ +Q + H + E E+ + +
Sbjct: 255 EIVGMIYTDILMERGSVTSDKVAKGYFLSSLEIEFIAKMQLMHPHTVRDGGREMEFGSRL 314
Query: 251 VKLEVSEEGSAAIHFEAFQMSDMCVRLFKEGWFETEIAEGDDPKLSKMKKDV 302
+ V+ E +I + +Q+S+ C+ L K G+ I +DP+ +D+
Sbjct: 315 ATIVVTAEKDGSIGLQEYQVSNQCMALVKGGY----ILPTEDPRRFLATRDI 362
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.134 0.380
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,918,132,237
Number of Sequences: 23463169
Number of extensions: 236421254
Number of successful extensions: 562097
Number of sequences better than 100.0: 404
Number of HSP's better than 100.0 without gapping: 85
Number of HSP's successfully gapped in prelim test: 319
Number of HSP's that attempted gapping in prelim test: 561067
Number of HSP's gapped (non-prelim): 709
length of query: 411
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 266
effective length of database: 8,957,035,862
effective search space: 2382571539292
effective search space used: 2382571539292
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)