BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016587
(386 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255545672|ref|XP_002513896.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
gi|223546982|gb|EEF48479.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
Length = 386
Score = 744 bits (1920), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/386 (90%), Positives = 374/386 (96%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M+ IMNK+R+LDAYPKINEDFYSRT SGGVITL SSI+MLLLF SELRLY++AVTETKL
Sbjct: 1 MEGIMNKLRNLDAYPKINEDFYSRTLSGGVITLASSILMLLLFISELRLYIHAVTETKLA 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPALPCSILS+DAMDISGEQHLDVKHDI KKRLDS GNVIE+RQ
Sbjct: 61 VDTSRGETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDSHGNVIEARQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGAPKI+ PLQRHGGRLEHNETYCGSCYGAE+SDEDCCN+CE+VREAYRKKGWALSNP
Sbjct: 121 DGIGAPKIENPLQRHGGRLEHNETYCGSCYGAEASDEDCCNSCEDVREAYRKKGWALSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGFLQRIK+EEGEGCNIYGFLEVNKVAGNFHFAPGKSF QS VHVHD+LAFQ
Sbjct: 181 DLIDQCKREGFLQRIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSNVHVHDLLAFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFNISHKIN+LAFG++FPGVVNPLDGV WTQETPSGMYQYFIKVVPTVYTDVSG+TIQ
Sbjct: 241 KDSFNISHKINRLAFGDYFPGVVNPLDGVHWTQETPSGMYQYFIKVVPTVYTDVSGYTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFRS+E GRLQ+LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT
Sbjct: 301 SNQFSVTEHFRSAEAGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+D+FIYHGQ+AIKKK+EIGKFS
Sbjct: 361 VSGILDSFIYHGQKAIKKKMEIGKFS 386
>gi|224082148|ref|XP_002306582.1| predicted protein [Populus trichocarpa]
gi|222856031|gb|EEE93578.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 737 bits (1903), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/386 (88%), Positives = 374/386 (96%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M+ +M+K+R+LDAYPKINEDFYSRT SGGVITL SS+VM LLFFSELRLYL+AVTETKL+
Sbjct: 1 MEGLMSKLRNLDAYPKINEDFYSRTLSGGVITLASSVVMFLLFFSELRLYLHAVTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPALPCSILS+DAMDISGEQHLDVKHDI KKRLD GNVIE+RQ
Sbjct: 61 VDTSRGETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDFHGNVIEARQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGAPKI+KPLQRHGGRLEHNETYCGSCYGAE+SDEDCCN+CE+VREAYRKKGWA++NP
Sbjct: 121 DGIGAPKIEKPLQRHGGRLEHNETYCGSCYGAEASDEDCCNSCEDVREAYRKKGWAVTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DL+DQCKREGFLQ+IK+EEGEGCNIYGFLEVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ
Sbjct: 181 DLMDQCKREGFLQKIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFNI+HKIN+L FGE+FPGVVNPLDGV+WTQETPSGMYQYFIKVVPTVYTDVSGHTIQ
Sbjct: 241 KDSFNITHKINRLTFGEYFPGVVNPLDGVQWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR ++ GRLQ+LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT
Sbjct: 301 SNQFSVTEHFRGTDIGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+D FIYHGQ+AIKKK+EIGKFS
Sbjct: 361 VSGILDTFIYHGQKAIKKKMEIGKFS 386
>gi|356552872|ref|XP_003544786.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 386
Score = 735 bits (1898), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/386 (88%), Positives = 374/386 (96%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD+IM+K+R+LDAYPKINEDFYSRT SGGVITL SSI+MLLLFFSELRLYL+AVTETKL+
Sbjct: 1 MDSIMSKLRNLDAYPKINEDFYSRTLSGGVITLASSILMLLLFFSELRLYLHAVTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSR ETLRINFDVTFPALPCSILS+DAMDISGEQHLDVKHDI KKRLDS GNVIE+RQ
Sbjct: 61 VDTSRAETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDSHGNVIETRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIGAPKI+KPLQRHGGRLEHNETYCGSCYGAE SD+DCCN+CE+VREAYRKKGWALSNP
Sbjct: 121 EGIGAPKIEKPLQRHGGRLEHNETYCGSCYGAEESDDDCCNSCEDVREAYRKKGWALSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGFLQRIK+EEGEGCN+YGFLEVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ
Sbjct: 181 DLIDQCKREGFLQRIKDEEGEGCNVYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFN+SH IN+LAFGE+FPGVVNPLD V WTQETPSGMYQYFIKVVPTVYTDVSGHTIQ
Sbjct: 241 KDSFNLSHHINRLAFGEYFPGVVNPLDNVHWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR+ + GRLQ+LPGVFFFYDLSPIKVTFTEE+VSFLHFLTNVCAIVGG+FT
Sbjct: 301 SNQFSVTEHFRTGDVGRLQSLPGVFFFYDLSPIKVTFTEENVSFLHFLTNVCAIVGGIFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+D+FIYHGQRAIKKK+E+GKF+
Sbjct: 361 VSGILDSFIYHGQRAIKKKMELGKFN 386
>gi|225459342|ref|XP_002285801.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Vitis vinifera]
gi|302141938|emb|CBI19141.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 728 bits (1879), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/386 (87%), Positives = 371/386 (96%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD I+NK+R+LDAYPKINEDFYSRT SGGVITL SSI MLLLF SELRLYL+AVTETKL+
Sbjct: 1 MDNIINKLRNLDAYPKINEDFYSRTLSGGVITLASSIFMLLLFISELRLYLHAVTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPALPCSILS+DAMDISGEQHLDV+HDI KKR+D+ G+VIE+RQ
Sbjct: 61 VDTSRGETLRINFDVTFPALPCSILSLDAMDISGEQHLDVRHDIIKKRIDAHGSVIEARQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIG+PKI+KPLQ+HGGRLEHNETYCGSCYGAE+SD+DCCNNCEEVREAYRKKGWA+SNP
Sbjct: 121 DGIGSPKIEKPLQKHGGRLEHNETYCGSCYGAEASDDDCCNNCEEVREAYRKKGWAMSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGFLQRIK+EEGEGCNIYGFLEVNKVAGNFHFAPGKSF QS +HVHD+LAFQ
Sbjct: 181 DLIDQCKREGFLQRIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSNIHVHDLLAFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFNISHKIN+LAFG++FPGVVNPLDGV+W Q TPSGMYQYFIKVVPTVYT VSGHTI
Sbjct: 241 KDSFNISHKINRLAFGDYFPGVVNPLDGVQWIQATPSGMYQYFIKVVPTVYTHVSGHTIS 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+NQFSVTEHFR++E GRLQ+LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT
Sbjct: 301 TNQFSVTEHFRNAELGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+D+FIYH Q+AIKKKIEIGKFS
Sbjct: 361 VSGILDSFIYHSQKAIKKKIEIGKFS 386
>gi|356548103|ref|XP_003542443.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 386
Score = 726 bits (1873), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/386 (87%), Positives = 373/386 (96%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M++I++K+R+LDAYPKINEDFYSRT SGGVITL SSI+MLLLF+SELRLYL+AVTETKL+
Sbjct: 1 MESIISKLRNLDAYPKINEDFYSRTLSGGVITLASSILMLLLFYSELRLYLHAVTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSR ETLRINFDVTFPALPCSILS+DAMDISGEQ LDVKHDI KKRLDS+GNVIE+RQ
Sbjct: 61 VDTSRAETLRINFDVTFPALPCSILSLDAMDISGEQRLDVKHDIIKKRLDSRGNVIETRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIGAPKI+KPLQRHGGRLEHNETYCGSCYG+E SD+DCCN+CE+VREAYRKKGWALSNP
Sbjct: 121 EGIGAPKIEKPLQRHGGRLEHNETYCGSCYGSEVSDDDCCNSCEDVREAYRKKGWALSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGFLQRIK+EEGEGCN+YGFLEVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ
Sbjct: 181 DLIDQCKREGFLQRIKDEEGEGCNVYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFN+SH IN+L FGE+FPGVVNPLD V WTQETPSGMYQYFIKVVPTVYTDVSGHTIQ
Sbjct: 241 KDSFNLSHHINRLTFGEYFPGVVNPLDNVHWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR+ + GRLQ+LPGVFFFYDLSPIKVTFTEE+VSFLHFLTNVCAIVGG+FT
Sbjct: 301 SNQFSVTEHFRTGDMGRLQSLPGVFFFYDLSPIKVTFTEENVSFLHFLTNVCAIVGGIFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+D+FIYHGQRAIKKK+E+GKF+
Sbjct: 361 VSGILDSFIYHGQRAIKKKMELGKFN 386
>gi|357489473|ref|XP_003615024.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355516359|gb|AES97982.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 386
Score = 722 bits (1864), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/386 (86%), Positives = 374/386 (96%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD+IMNK+R+LDAYPKINEDFYSRT SGG+IT+VSSI+MLLLFFSELRLYL+A TETKL+
Sbjct: 1 MDSIMNKLRNLDAYPKINEDFYSRTLSGGLITIVSSILMLLLFFSELRLYLHAATETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPAL CSI+S+DAMDISGEQHLDV+HDI KKR+DS GNVIE+RQ
Sbjct: 61 VDTSRGETLRINFDVTFPALACSIVSLDAMDISGEQHLDVRHDIIKKRIDSHGNVIETRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIG+P I+KPLQRHGGRLEHNETYCGSCYGAE+SDE+CCN+CEEVREAYRKKGWALS+P
Sbjct: 121 DGIGSPNIEKPLQRHGGRLEHNETYCGSCYGAEASDEECCNSCEEVREAYRKKGWALSSP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D IDQCKREGFL+RIKEEEGEGCN+YGFLEVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ
Sbjct: 181 DSIDQCKREGFLERIKEEEGEGCNVYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
++SFN+SH IN++AFG++FPGVVNPLD V WTQETPSGMYQYFIKVVPT+YTDVSG+TIQ
Sbjct: 241 KESFNLSHHINRIAFGDYFPGVVNPLDRVHWTQETPSGMYQYFIKVVPTMYTDVSGNTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR+++ GRLQ+LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG+FT
Sbjct: 301 SNQFSVTEHFRTADVGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGIFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+D+FIYHGQ+AIKKK+E+GKFS
Sbjct: 361 VSGILDSFIYHGQKAIKKKMELGKFS 386
>gi|224066933|ref|XP_002302286.1| predicted protein [Populus trichocarpa]
gi|222844012|gb|EEE81559.1| predicted protein [Populus trichocarpa]
Length = 377
Score = 718 bits (1853), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/386 (88%), Positives = 366/386 (94%), Gaps = 9/386 (2%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD +M+K+R+ DAYPKINEDFYSRT SGGVITL SSIVM LLFFSELRLYL+AVTETKL+
Sbjct: 1 MDGLMSKLRNFDAYPKINEDFYSRTLSGGVITLASSIVMFLLFFSELRLYLHAVTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPALPCSILS+DAMDISGEQHLDVKHDI KKRLDS GNVIESRQ
Sbjct: 61 VDTSRGETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDSHGNVIESRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGAPKI+KPLQRHGGRLEHNETYC DEDCCN+CEEVREAY+KKGWA++NP
Sbjct: 121 DGIGAPKIEKPLQRHGGRLEHNETYC---------DEDCCNSCEEVREAYQKKGWAVTNP 171
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DL+DQCKREGFLQRIK+EEGEGCNIYGFLEVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ
Sbjct: 172 DLMDQCKREGFLQRIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQ 231
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFN SHKIN+LAFGE+FPGVVNPLDGV+WTQETPSGMYQYFIKVVPTVYTDVSGHTIQ
Sbjct: 232 KDSFNTSHKINRLAFGEYFPGVVNPLDGVQWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 291
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR ++ GRLQ+LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT
Sbjct: 292 SNQFSVTEHFRGADIGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 351
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+D+FIYHGQ+AIKKK+EIGKFS
Sbjct: 352 VSGILDSFIYHGQKAIKKKMEIGKFS 377
>gi|297846654|ref|XP_002891208.1| hypothetical protein ARALYDRAFT_891247 [Arabidopsis lyrata subsp.
lyrata]
gi|297337050|gb|EFH67467.1| hypothetical protein ARALYDRAFT_891247 [Arabidopsis lyrata subsp.
lyrata]
Length = 386
Score = 714 bits (1842), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/386 (85%), Positives = 365/386 (94%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M I+NK+R+LDAYPKINEDFYSRT SGGVITL+SS+VM LLFFSELRLYL+ VTETKL+
Sbjct: 1 MAGILNKLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRLYLHTVTETKLI 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFD+TFPAL CSILSVDAMDISGE HLDVKHDI K+RLDS GN IE+RQ
Sbjct: 61 VDTSRGETLRINFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTIEARQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGA KI+KPLQ+HGGRLEHNETYCGSCYGAE+ + DCCN+CE+VREAYRKKGW ++NP
Sbjct: 121 DGIGATKIEKPLQKHGGRLEHNETYCGSCYGAEAEEHDCCNSCEDVREAYRKKGWGVTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGFLQR+K+EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ
Sbjct: 181 DLIDQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFNISHKIN+L +G++FPGVVNPLD V W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQ
Sbjct: 241 KDSFNISHKINRLTYGDYFPGVVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEH +SSE G+LQ+LPGVFFFYDLSPIKVTFTEEH+SFLHFLTNVCAIVGGVFT
Sbjct: 301 SNQFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 361 VSGIIDAFIYHGQKAIKKKMEIGKFS 386
>gi|238478737|ref|NP_001154394.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|12324714|gb|AAG52317.1|AC021666_6 unknown protein; 24499-21911 [Arabidopsis thaliana]
gi|27808598|gb|AAO24579.1| At1g36050 [Arabidopsis thaliana]
gi|110736190|dbj|BAF00066.1| hypothetical protein [Arabidopsis thaliana]
gi|332193720|gb|AEE31841.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 386
Score = 708 bits (1828), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/386 (84%), Positives = 363/386 (94%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M I+NK+R+LDAYPKINEDFYSRT SGGVITL+SS+VM LLFFSELRLYL+ VTETKL+
Sbjct: 1 MAGILNKLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRLYLHTVTETKLI 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFD+TFPAL CSILSVDAMDISGE HLDVKHDI K+RLDS GN IE+RQ
Sbjct: 61 VDTSRGETLRINFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTIEARQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGA KI+ PLQ+HGGRL HNETYCGSCYGAE+ + DCCN+CE+VREAYRKKGW ++NP
Sbjct: 121 DGIGATKIENPLQKHGGRLGHNETYCGSCYGAEAEEHDCCNSCEDVREAYRKKGWGVTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGFLQR+K+EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ
Sbjct: 181 DLIDQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFNISHKIN+L +G++FPGVVNPLD V W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQ
Sbjct: 241 KDSFNISHKINRLTYGDYFPGVVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEH +SSE G+LQ+LPGVFFFYDLSPIKVTFTEEH+SFLHFLTNVCAIVGGVFT
Sbjct: 301 SNQFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 361 VSGIIDAFIYHGQKAIKKKMEIGKFS 386
>gi|240254210|ref|NP_564467.5| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|332193719|gb|AEE31840.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 489
Score = 701 bits (1809), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/382 (84%), Positives = 359/382 (93%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M I+NK+R+LDAYPKINEDFYSRT SGGVITL+SS+VM LLFFSELRLYL+ VTETKL+
Sbjct: 1 MAGILNKLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRLYLHTVTETKLI 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFD+TFPAL CSILSVDAMDISGE HLDVKHDI K+RLDS GN IE+RQ
Sbjct: 61 VDTSRGETLRINFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTIEARQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGA KI+ PLQ+HGGRL HNETYCGSCYGAE+ + DCCN+CE+VREAYRKKGW ++NP
Sbjct: 121 DGIGATKIENPLQKHGGRLGHNETYCGSCYGAEAEEHDCCNSCEDVREAYRKKGWGVTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGFLQR+K+EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ
Sbjct: 181 DLIDQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFNISHKIN+L +G++FPGVVNPLD V W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQ
Sbjct: 241 KDSFNISHKINRLTYGDYFPGVVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEH +SSE G+LQ+LPGVFFFYDLSPIKVTFTEEH+SFLHFLTNVCAIVGGVFT
Sbjct: 301 SNQFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEI 382
VSGIIDAFIYHGQ+AIKKK+EI
Sbjct: 361 VSGIIDAFIYHGQKAIKKKMEI 382
>gi|449465886|ref|XP_004150658.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
gi|449518819|ref|XP_004166433.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 386
Score = 699 bits (1804), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/386 (87%), Positives = 366/386 (94%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD I++K+R+LDAYPKINEDFYSRT SGGVITL SSI+MLLLF SELRLYL+AVTETKL+
Sbjct: 1 MDNIISKLRNLDAYPKINEDFYSRTLSGGVITLSSSILMLLLFISELRLYLHAVTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPALPCS+LS+DAMDISGEQHLDVKHDI KKRLDS GN IE+R
Sbjct: 61 VDTSRGETLRINFDVTFPALPCSLLSLDAMDISGEQHLDVKHDIIKKRLDSHGNAIEARP 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGAPKI+KPLQRHGGRLEHNETYCGSC+GAES+D+DCCN+CEEVREAYRKKGWALSNP
Sbjct: 121 DGIGAPKIEKPLQRHGGRLEHNETYCGSCFGAESADDDCCNSCEEVREAYRKKGWALSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGFLQRIK+E+GEGCNIYGFLEVNKVAGNFHFAPGKSF QS VHVHD+LAFQ
Sbjct: 181 DLIDQCKREGFLQRIKDEDGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSNVHVHDLLAFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFNISHKIN+LAFGE+FPGVVNPLD V+W QETPS YQYFIKVVPTVY VSG+TIQ
Sbjct: 241 KDSFNISHKINRLAFGEYFPGVVNPLDSVQWKQETPSATYQYFIKVVPTVYNSVSGYTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEH R++E GRLQ+LP VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT
Sbjct: 301 SNQFSVTEHVRTAEVGRLQSLPAVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+D+FIYHGQ+ IKKK+EIGKFS
Sbjct: 361 VSGILDSFIYHGQKVIKKKMEIGKFS 386
>gi|38347102|emb|CAE02574.2| OSJNBa0006M15.17 [Oryza sativa Japonica Group]
gi|116309990|emb|CAH67017.1| H0523F07.5 [Oryza sativa Indica Group]
gi|218194960|gb|EEC77387.1| hypothetical protein OsI_16129 [Oryza sativa Indica Group]
Length = 386
Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/386 (80%), Positives = 355/386 (91%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M+ +++K+RSLDAYPK+NEDFYSRT SGG+ITL SS+VMLLLF SELRLYL+AVTET L
Sbjct: 1 MEGLLSKLRSLDAYPKVNEDFYSRTLSGGIITLASSVVMLLLFVSELRLYLHAVTETTLR 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPAL CSI+S+DAMDISG++HLDVKHDIFK+R+D GNVI ++Q
Sbjct: 61 VDTSRGETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDIFKQRIDVHGNVIATKQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D +G K+++PLQRHGGRLEHNETYCGSCYGAE SDE CCN+CE+VREAYRKKGW +SNP
Sbjct: 121 DAVGGMKVEQPLQRHGGRLEHNETYCGSCYGAEESDEQCCNSCEDVREAYRKKGWGVSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGFLQ IK+EEGEGCNIYGFLEVNKVAGNFHFAPGKSF ++ VHVHD+L FQ
Sbjct: 181 DLIDQCKREGFLQSIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQKANVHVHDLLPFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFN+SHKINKL+FG+ FPGVVNPLDG +W Q + GMYQYFIKVVPTVYTD++ H I
Sbjct: 241 KDSFNVSHKINKLSFGQRFPGVVNPLDGAQWMQHSSYGMYQYFIKVVPTVYTDINEHIIL 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFRSSE GR+Q +PGVFFFYDLSPIKVTFTE+HVSFLHFLTNVCAIVGGVFT
Sbjct: 301 SNQFSVTEHFRSSESGRIQAVPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGIID+F+YHGQRAIKKK+EIGKF+
Sbjct: 361 VSGIIDSFVYHGQRAIKKKMEIGKFN 386
>gi|242076030|ref|XP_002447951.1| hypothetical protein SORBIDRAFT_06g018670 [Sorghum bicolor]
gi|241939134|gb|EES12279.1| hypothetical protein SORBIDRAFT_06g018670 [Sorghum bicolor]
Length = 386
Score = 676 bits (1745), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/386 (80%), Positives = 351/386 (90%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD +++K+RSLDAYPK+NEDFYSRT SGGVITL SS++MLLLF SELRLYL+AVTET L
Sbjct: 1 MDGLLSKLRSLDAYPKVNEDFYSRTLSGGVITLASSVIMLLLFVSELRLYLHAVTETTLR 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPAL CSI+S+DAMDISG++HLDVKHD+FK+R+D+ GNVI +RQ
Sbjct: 61 VDTSRGETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D +G K++ PLQ HGGRLEHNETYCGSCYGA+ SD CCN+CE+VREAYRKKGW +SNP
Sbjct: 121 DAVGGMKMEAPLQHHGGRLEHNETYCGSCYGAQESDGQCCNSCEDVREAYRKKGWGVSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DL+DQCKREGFLQ IK+EEGEGCNIYGF+EVNKVAGNFHFAPGKSF QS VHVHD+L FQ
Sbjct: 181 DLLDQCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFN+SHKIN+L+FGE+FPGVVNPLDG W Q + GMYQYFIKVVPTVYTD++ H I
Sbjct: 241 KDSFNVSHKINRLSFGEYFPGVVNPLDGASWVQHSSYGMYQYFIKVVPTVYTDINEHIIL 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFRS E GR+Q LPGVFFFYDLSPIKVTFTE+HVSFLHFLTNVCAIVGGVFT
Sbjct: 301 SNQFSVTEHFRSGESGRMQALPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGIID+F+YH QRAIKKK+EIGKF+
Sbjct: 361 VSGIIDSFVYHSQRAIKKKMEIGKFN 386
>gi|226494692|ref|NP_001148795.1| LOC100282412 [Zea mays]
gi|194696974|gb|ACF82571.1| unknown [Zea mays]
gi|195622210|gb|ACG32935.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|414586929|tpg|DAA37500.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 386
Score = 676 bits (1743), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/386 (80%), Positives = 351/386 (90%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD +++K+RSLDAYPK+NEDFYSRT SGG+ITLVSS VMLLLF SELRLYL+AVTET L
Sbjct: 1 MDGLLSKLRSLDAYPKVNEDFYSRTLSGGIITLVSSAVMLLLFVSELRLYLHAVTETTLR 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPAL CSI+S+DAMDISG++HLDVKHD+FK+R+D+ GNVI +RQ
Sbjct: 61 VDTSRGETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D +G K++ PLQ HGGRLEHNETYCGSCYGA+ SD+ CCN CE+VREAYRKKGW +SNP
Sbjct: 121 DVVGGMKMEAPLQHHGGRLEHNETYCGSCYGAQESDDQCCNTCEDVREAYRKKGWGVSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DL+DQCKREGFLQ IK+EEGEGCNIYGF+EVNKVAGNFHFAPGKSF QS VHVHD+L FQ
Sbjct: 181 DLLDQCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFN+SHKIN+L+FGE+FPGVVNPLDG W Q + GMYQYFIKVVPTVYTD++ H I
Sbjct: 241 KDSFNVSHKINRLSFGEYFPGVVNPLDGANWVQHSSYGMYQYFIKVVPTVYTDINEHIIL 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFRS E GR+Q LPGVFFFYDLSPIKVTFTE+HVSFLHFLTNVCAIVGGVFT
Sbjct: 301 SNQFSVTEHFRSGESGRMQALPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGIID+F+YH QRAIKKK+EIGKF+
Sbjct: 361 VSGIIDSFVYHSQRAIKKKMEIGKFN 386
>gi|225448309|ref|XP_002264644.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Vitis vinifera]
gi|296085664|emb|CBI29463.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 674 bits (1740), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 303/386 (78%), Positives = 354/386 (91%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD + ++R+LDAYPKINEDFYSRTFSGG+ITL+SSIVML LFFSELRLYL+ VTETKL+
Sbjct: 1 MDRVFQRLRNLDAYPKINEDFYSRTFSGGLITLISSIVMLFLFFSELRLYLHTVTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRG TLRINFDVTFPA+PCS+L++DAMDISGEQH D+KHDI KKR+D+ GNV+ RQ
Sbjct: 61 VDTSRGGTLRINFDVTFPAVPCSVLTLDAMDISGEQHHDIKHDIVKKRIDAHGNVVAVRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIG P+I+KPLQRHGGRLEHNE YCGSCYGAE +D+DCCN+C+EVREAYRKKGW ++NP
Sbjct: 121 DGIGGPQIEKPLQRHGGRLEHNEKYCGSCYGAEVTDDDCCNSCDEVREAYRKKGWGMTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGF+Q++KEEEGEGCN+YGFLEVNKVAGNFHF+PGK F+QS +HV+D+LA
Sbjct: 181 DLIDQCKREGFVQKVKEEEGEGCNVYGFLEVNKVAGNFHFSPGKGFYQSNIHVNDLLAIS 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+D +NISH+INKLAFG+HFPGVVNPLDG +W Q+ P GMYQYFIKVVPT+YTD+ GHTIQ
Sbjct: 241 KDGYNISHRINKLAFGDHFPGVVNPLDGAQWFQDAPDGMYQYFIKVVPTIYTDIRGHTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFRS+E GR +LPGV+FFYDLSPIKVT EEH SFLHF+TN+CAIVGG+FT
Sbjct: 301 SNQFSVTEHFRSAEPGRPHSLPGVYFFYDLSPIKVTSKEEHSSFLHFMTNICAIVGGIFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGIID+F+YHG RAIKKK+E+GKFS
Sbjct: 361 VSGIIDSFVYHGHRAIKKKMELGKFS 386
>gi|224032113|gb|ACN35132.1| unknown [Zea mays]
gi|414586931|tpg|DAA37502.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 391
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/391 (79%), Positives = 351/391 (89%), Gaps = 5/391 (1%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD +++K+RSLDAYPK+NEDFYSRT SGG+ITLVSS VMLLLF SELRLYL+AVTET L
Sbjct: 1 MDGLLSKLRSLDAYPKVNEDFYSRTLSGGIITLVSSAVMLLLFVSELRLYLHAVTETTLR 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPAL CSI+S+DAMDISG++HLDVKHD+FK+R+D+ GNVI +RQ
Sbjct: 61 VDTSRGETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D +G K++ PLQ HGGRLEHNETYCGSCYGA+ SD+ CCN CE+VREAYRKKGW +SNP
Sbjct: 121 DVVGGMKMEAPLQHHGGRLEHNETYCGSCYGAQESDDQCCNTCEDVREAYRKKGWGVSNP 180
Query: 181 DLIDQ-----CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
DL+DQ     CKREGFLQ IK+EEGEGCNIYGF+EVNKVAGNFHFAPGKSF QS VHVHD
Sbjct: 181 DLLDQVEPSDCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHD 240
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+L FQ+DSFN+SHKIN+L+FGE+FPGVVNPLDG W Q + GMYQYFIKVVPTVYTD++
Sbjct: 241 LLPFQKDSFNVSHKINRLSFGEYFPGVVNPLDGANWVQHSSYGMYQYFIKVVPTVYTDIN 300
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
H I SNQFSVTEHFRS E GR+Q LPGVFFFYDLSPIKVTFTE+HVSFLHFLTNVCAIV
Sbjct: 301 EHIILSNQFSVTEHFRSGESGRMQALPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIV 360
Query: 356 GGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
GGVFTVSGIID+F+YH QRAIKKK+EIGKF+
Sbjct: 361 GGVFTVSGIIDSFVYHSQRAIKKKMEIGKFN 391
>gi|357163897|ref|XP_003579883.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Brachypodium distachyon]
Length = 386
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 303/386 (78%), Positives = 342/386 (88%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD +M+K+R+LDAYPK+NEDFYSRT SGGVITL SS VMLLLF SELRLYL+AVTET L
Sbjct: 1 MDGLMSKLRNLDAYPKVNEDFYSRTLSGGVITLASSFVMLLLFVSELRLYLHAVTETTLR 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LRINFD+TFPAL CSI+S+D MDISG++HLDVKHD+FK+R+D+ GNVI ++Q
Sbjct: 61 VDTSRGEKLRINFDITFPALQCSIISIDVMDISGQEHLDVKHDVFKQRIDANGNVIATKQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D +G K++KPLQ HGGRLEHNETYCGSCYGAE E CCN+CE+VREAYRKKGW +SNP
Sbjct: 121 DAVGGMKVEKPLQMHGGRLEHNETYCGSCYGAEEPGEQCCNSCEDVREAYRKKGWGVSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D IDQCKREGFLQ IK+EEGEGCNIYGF+E+NKVAGNFHFAPGKSF QS VHVHD+L FQ
Sbjct: 181 DSIDQCKREGFLQTIKDEEGEGCNIYGFVEINKVAGNFHFAPGKSFQQSNVHVHDLLPFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DSFN+SHKINKL+FGE FPGVVNPLDG W Q +P GMYQYF+KVVPTVY+ ++ I
Sbjct: 241 KDSFNVSHKINKLSFGEPFPGVVNPLDGAHWFQHSPYGMYQYFVKVVPTVYSHINEQIIL 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEH RSSE R+Q LPGVFFFYDLSPIKVTFTE HVSFLHFLTNVCAIVGGVFT
Sbjct: 301 SNQFSVTEHARSSESVRMQALPGVFFFYDLSPIKVTFTERHVSFLHFLTNVCAIVGGVFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGIID+F+YHGQRAI KK EIGKF+
Sbjct: 361 VSGIIDSFVYHGQRAITKKREIGKFN 386
>gi|18395087|ref|NP_564162.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|9454530|gb|AAF87853.1|AC073942_7 Contains similarity to a PR00989 protein from Homo sapiens
gi|7959731. EST gb|AI995648 comes from this gene
[Arabidopsis thaliana]
gi|13878151|gb|AAK44153.1|AF370338_1 unknown protein [Arabidopsis thaliana]
gi|21281042|gb|AAM44956.1| unknown protein [Arabidopsis thaliana]
gi|21553754|gb|AAM62847.1| unknown [Arabidopsis thaliana]
gi|332192089|gb|AEE30210.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 386
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 297/386 (76%), Positives = 352/386 (91%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +MN++R+LDAYPKINEDFY RT SGGVITL SSIVML+LFFSEL+LY++ VTET+L
Sbjct: 1 MVGVMNRLRNLDAYPKINEDFYRRTLSGGVITLASSIVMLILFFSELQLYIHPVTETQLR 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LRINFDVTFPAL CSI+S+D+MDISGE+HLDV+HDI K+RLDS GNVIE++Q
Sbjct: 61 VDTSRGEKLRINFDVTFPALQCSIISLDSMDISGERHLDVRHDIIKRRLDSSGNVIEAKQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIG KI+KPLQ+HGGRLEHNETYCGSC+GAE+SD+ CCN+CEEVREAYRKKGWALS+P
Sbjct: 121 DGIGHTKIEKPLQKHGGRLEHNETYCGSCFGAEASDDACCNSCEEVREAYRKKGWALSDP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ IDQCKREGF+Q++K+EEGEGCN++GFLEVNKVAGNFHF PG+SFHQSG HD+L FQ
Sbjct: 181 ESIDQCKREGFVQKVKDEEGEGCNVHGFLEVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ ++NISHK+N+LAFG+ FPGVVNPLDGV+W Q SG+YQYFIKVVP++YTDV +TIQ
Sbjct: 241 QGNYNISHKVNRLAFGDFFPGVVNPLDGVQWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHF++ E GR+Q+ PGVFF+YDLSPIKV F E+HV FLHFLTNVCAIVGG+FT
Sbjct: 301 SNQFSVTEHFQNMEAGRMQSPPGVFFYYDLSPIKVIFEEQHVEFLHFLTNVCAIVGGIFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+D+FIYHGQRAIKKK+EIGKF+
Sbjct: 361 VSGIVDSFIYHGQRAIKKKMEIGKFN 386
>gi|297850670|ref|XP_002893216.1| hypothetical protein ARALYDRAFT_472456 [Arabidopsis lyrata subsp.
lyrata]
gi|297339058|gb|EFH69475.1| hypothetical protein ARALYDRAFT_472456 [Arabidopsis lyrata subsp.
lyrata]
Length = 386
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 296/386 (76%), Positives = 351/386 (90%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +MN++R+LDAYPKINEDFY RT SGGVITLVSS VML+LFFSEL+LY++ VTET+L
Sbjct: 1 MVGVMNRLRNLDAYPKINEDFYRRTLSGGVITLVSSFVMLILFFSELQLYIHPVTETQLR 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LRINFDVTFPAL CSI+S+D+MDISGE+HLDV+HDI K+RLDS GNVIE++Q
Sbjct: 61 VDTSRGEKLRINFDVTFPALQCSIISLDSMDISGERHLDVRHDIIKRRLDSSGNVIEAKQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIG KI+KPLQ+HGGRLEHNETYCGSC+GAE+SD+ CCN+CEEVREAYRKKGWALS+P
Sbjct: 121 DGIGHTKIEKPLQKHGGRLEHNETYCGSCFGAEASDDACCNSCEEVREAYRKKGWALSDP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ IDQCKREGF+Q++K+EEGEGCN++GFLEVNKVAGNFHF PG+SFHQSG HD+L FQ
Sbjct: 181 ESIDQCKREGFVQKVKDEEGEGCNVHGFLEVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ ++NISH +N+LAFG+ FPGVVNPLDGV+W Q SG+YQYFIKVVP++YTDV +TIQ
Sbjct: 241 QGNYNISHTVNRLAFGDFFPGVVNPLDGVQWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHF++ E GR+Q+ PGVFF+YDLSPIKV F E+HV FLHFLTNVCAIVGG+FT
Sbjct: 301 SNQFSVTEHFQNMEAGRMQSPPGVFFYYDLSPIKVIFEEQHVEFLHFLTNVCAIVGGIFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+D+FIYHGQRAIKKK+EIGKF+
Sbjct: 361 VSGIVDSFIYHGQRAIKKKMEIGKFN 386
>gi|6598578|gb|AAF18633.1|AC006228_4 F5J5.4 [Arabidopsis thaliana]
Length = 440
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 311/440 (70%), Positives = 349/440 (79%), Gaps = 54/440 (12%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVT---ET 57
M I+NK+R+LDAYPKINEDFYSRT SGGVITL+SS+VM LLFFSELR L++ + E
Sbjct: 1 MAGILNKLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRTSLSSYSHRDEA 60
Query: 58 KLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE 117
R T + NFD+TFPAL CSILSVDAMDISGE HLDVKHDI K+RLDS GN IE
Sbjct: 61 YSRYFKGRDVTHQRNFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTIE 120
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAES----------------------- 154
+RQDGIGA KI+ PLQ+HGGRL HNETYCGSCYGAE+
Sbjct: 121 ARQDGIGATKIENPLQKHGGRLGHNETYCGSCYGAEAVIVLSLYLTLWSMVSQLSSEVCF 180
Query: 155 ----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 210
+ DCCN+CE+VREAYRKKGW ++NPDLIDQCKREGFLQR+K+EEGEGCNIYGFLE
Sbjct: 181 FPVQEEHDCCNSCEDVREAYRKKGWGVTNPDLIDQCKREGFLQRVKDEEGEGCNIYGFLE 240
Query: 211 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 270
VNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ+DSFNISHKIN+L +G++FPGVVNPLD V
Sbjct: 241 VNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDSFNISHKINRLTYGDYFPGVVNPLDKVE 300
Query: 271 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDL 330
W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQSNQFSVTEH +SSE G+LQ+LPGVFFFYDL
Sbjct: 301 WSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSNQFSVTEHVKSSEAGQLQSLPGVFFFYDL 360
Query: 331 SPIKVTFTEEHVSFLHFLTNVCAIVGG------------------------VFTVSGIID 366
SPIKVTFTEEH+SFLHFLTNVCAIVGG VFTVSGIID
Sbjct: 361 SPIKVTFTEEHISFLHFLTNVCAIVGGISLISIYHNNTCWLTHIKIRNETCVFTVSGIID 420
Query: 367 AFIYHGQRAIKKKIEIGKFS 386
AFIYHGQ+AIKKK+EIGKFS
Sbjct: 421 AFIYHGQKAIKKKMEIGKFS 440
>gi|224059030|ref|XP_002299683.1| predicted protein [Populus trichocarpa]
gi|222846941|gb|EEE84488.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 620 bits (1599), Expect = e-175, Method: Compositional matrix adjust.
Identities = 282/386 (73%), Positives = 341/386 (88%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M+ I K+R+LDAYPKINEDFYSRT SGG+ITL+SSI+ML LFFSE LYL+AVTETKLL
Sbjct: 1 MEGIYQKLRNLDAYPKINEDFYSRTLSGGLITLISSIIMLFLFFSEFSLYLHAVTETKLL 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDT+RG+TLRINFD+TFPA+ CS+LSVDA+DISGEQH D++HDI KKR+++ G+VIE RQ
Sbjct: 61 VDTTRGQTLRINFDITFPAIRCSLLSVDAIDISGEQHHDIRHDITKKRINAHGDVIEVRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGAPKIDKPLQ+HGGRLEHNE YCGSC+GAE SD+ CCN+C+EVREAYRKKGWAL+N
Sbjct: 121 DGIGAPKIDKPLQKHGGRLEHNEEYCGSCFGAEMSDDHCCNSCDEVREAYRKKGWALTNM 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC REGF+Q IK+EEGEGCNI G LEVN+VAGNFHF PGKSFHQS + D+L Q
Sbjct: 181 DLIDQCIREGFVQMIKDEEGEGCNINGSLEVNRVAGNFHFVPGKSFHQSNFQLLDLLDMQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
++S+NISH+IN+LAFG++FPGVVNPLDG++ T +G+ Q+FIKVVPT+YTD+ G T+
Sbjct: 241 KESYNISHRINRLAFGDYFPGVVNPLDGIQLMHGTQNGVQQFFIKVVPTIYTDIRGRTVH 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQ+SVTEHF SE RL +LPGV+F YD SPIKVTF EEH SFLHF+T++CAI+GG+FT
Sbjct: 301 SNQYSVTEHFTKSELMRLDSLPGVYFIYDFSPIKVTFKEEHTSFLHFMTSICAIIGGIFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
++GI+D+FIYHG+RAIKKK+EIGKFS
Sbjct: 361 IAGIVDSFIYHGRRAIKKKMEIGKFS 386
>gi|449449715|ref|XP_004142610.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 385
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 290/387 (74%), Positives = 337/387 (87%), Gaps = 3/387 (0%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M+++MNKIR LDAYPKI+EDFY+RT SGG IT+ SSI+M LLFFSELRLY++ TETKL+
Sbjct: 1 MESLMNKIRKLDAYPKISEDFYNRTLSGGFITIASSIIMFLLFFSELRLYVHTATETKLI 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LRINFDVTFPALPCS+LS+ AMDISGEQHLDVKHDI KKR+D QGNVI+SR
Sbjct: 61 VDTSRGEHLRINFDVTFPALPCSVLSLHAMDISGEQHLDVKHDIVKKRIDYQGNVIDSRP 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIG+ +I++PLQ+HGGRL+ NETYCGSCYGA S EDCCN+C++VREAY +KGWALS+P
Sbjct: 121 DGIGSTEIERPLQKHGGRLKQNETYCGSCYGA--SGEDCCNSCQDVREAYHRKGWALSHP 178
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA-F 239
DLIDQCKREGF QR+K EEGEGCNIYGFLEVNKVAGNFHFAPG+ F S +H+ LA F
Sbjct: 179 DLIDQCKREGFFQRVKNEEGEGCNIYGFLEVNKVAGNFHFAPGRGFQLSYFQIHNPLASF 238
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
Q D+FNISH+IN+L FG+ FPGVVNPLDGV+W Q T SGM+QYFIKVVPTVY V+G I
Sbjct: 239 QWDAFNISHRINRLTFGDDFPGVVNPLDGVQWNQGTLSGMFQYFIKVVPTVYKAVNGKAI 298
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+SNQFSVT+H R + Q L GVFFFYDLSPIKVTFTEEH+SF HFLTNVCAIVGGVF
Sbjct: 299 KSNQFSVTQHLRGIDGESFQALHGVFFFYDLSPIKVTFTEEHISFFHFLTNVCAIVGGVF 358
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGKFS 386
T+SGI+D+ IYHGQ+AIKKK+ +GKF+
Sbjct: 359 TISGILDSIIYHGQKAIKKKMALGKFT 385
>gi|449510462|ref|XP_004163672.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 3-like [Cucumis
sativus]
Length = 385
Score = 615 bits (1587), Expect = e-174, Method: Compositional matrix adjust.
Identities = 289/387 (74%), Positives = 336/387 (86%), Gaps = 3/387 (0%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M+++MNKIR LDAYPKI+EDFY+RT SGG IT+ SSI+M LLFFSELRLY++ TETKL+
Sbjct: 1 MESLMNKIRKLDAYPKISEDFYNRTLSGGFITIASSIIMFLLFFSELRLYVHTATETKLI 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LRINFDVTFPALPCS+LS+ AMDISGEQHLDVKHDI KKR+D QGNVI+SR
Sbjct: 61 VDTSRGEHLRINFDVTFPALPCSVLSLHAMDISGEQHLDVKHDIVKKRIDYQGNVIDSRP 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIG+ +I++PLQ+HGGRL+ NETYCGSCYGA S EDCCN+C++VREAY +KGWALS+P
Sbjct: 121 DGIGSTEIERPLQKHGGRLKQNETYCGSCYGA--SGEDCCNSCQDVREAYHRKGWALSHP 178
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA-F 239
DLIDQCKREGF QR+K EEGEGCNIYGFLEVNKVAGNFHFAPG+ F S +H+ LA F
Sbjct: 179 DLIDQCKREGFFQRVKNEEGEGCNIYGFLEVNKVAGNFHFAPGRGFQLSYFQIHNPLASF 238
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
Q D+FNISH+IN+L FG+ FPGVVNPLDGV+W Q T SGM+QYFIKVVPTVY V+G I
Sbjct: 239 QWDAFNISHRINRLTFGDDFPGVVNPLDGVQWNQGTLSGMFQYFIKVVPTVYKAVNGKAI 298
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+SNQFSVT+H R + Q L G FFFYDLSPIKVTFTEEH+SF HFLTNVCAIVGGVF
Sbjct: 299 KSNQFSVTQHLRGIDGESFQALHGXFFFYDLSPIKVTFTEEHISFFHFLTNVCAIVGGVF 358
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGKFS 386
T+SGI+D+ IYHGQ+AIKKK+ +GKF+
Sbjct: 359 TISGILDSIIYHGQKAIKKKMALGKFT 385
>gi|356512071|ref|XP_003524744.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 431
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 280/386 (72%), Positives = 338/386 (87%), Gaps = 2/386 (0%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD + NK+R+LDAYPK+NEDFY+RT +GGV+T+VS+ VML LFFSEL LYL VTE+KLL
Sbjct: 48 MDKVFNKLRNLDAYPKVNEDFYNRTLAGGVVTVVSAAVMLFLFFSELSLYLYTVTESKLL 107
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRG+TL INFDVTFPA+ CSILS+DAMDISGEQHLD++H+I KKR+D+ GNVIE R+
Sbjct: 108 VDTSRGDTLHINFDVTFPAVRCSILSLDAMDISGEQHLDIRHNIVKKRIDANGNVIEERK 167
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGAPKI++PLQ+HGGRL H+E YCGSC+GAE SDE CCN+CEEVREAYRKKGWA++N
Sbjct: 168 DGIGAPKIERPLQKHGGRLGHDEKYCGSCFGAEESDEHCCNSCEEVREAYRKKGWAMTNM 227
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC+REG++QR+K+EEGEGCN+ G LEVNKVAGNFHFA GKSF QS + + D+LA Q
Sbjct: 228 DLIDQCQREGYVQRVKDEEGEGCNLQGSLEVNKVAGNFHFATGKSFLQSAIFLADLLALQ 287
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ +NISH+INKL+FG HFPG+VNPLDGV+W Q GMYQYFIKVVPT+YTD+ G I
Sbjct: 288 DNHYNISHRINKLSFGHHFPGLVNPLDGVKWVQGPAHGMYQYFIKVVPTIYTDIRGRVIH 347
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQ+SVTEHF+SSE G +PGVFFFYD+SPIKV F EEH+ FLHFLTN+CAI+GGVFT
Sbjct: 348 SNQYSVTEHFKSSELG--VAVPGVFFFYDISPIKVNFKEEHIPFLHFLTNICAIIGGVFT 405
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
V+GIID+ IY+GQR IK+K+E+GKF+
Sbjct: 406 VAGIIDSSIYYGQRTIKRKMELGKFT 431
>gi|363806898|ref|NP_001242045.1| uncharacterized protein LOC100781612 [Glycine max]
gi|255644390|gb|ACU22700.1| unknown [Glycine max]
Length = 384
Score = 610 bits (1574), Expect = e-172, Method: Compositional matrix adjust.
Identities = 279/386 (72%), Positives = 334/386 (86%), Gaps = 2/386 (0%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD + NK+R+LDAYPK+NEDFY+RT +GGV+T+VS+ VML LFFSEL L L VTE+KLL
Sbjct: 1 MDKVFNKLRNLDAYPKVNEDFYNRTLAGGVVTVVSAAVMLFLFFSELSLCLYTVTESKLL 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRG+TL INFDVTFPA+ CSILS+DAMDISGEQHLD++H+I KKR+D+ GNVIE R+
Sbjct: 61 VDTSRGDTLHINFDVTFPAVRCSILSLDAMDISGEQHLDIRHNIVKKRIDANGNVIEERK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGAPKI+KPLQ+HGGRL H+E YCGSC+GAE SDE CCN+CEEVREAYRKKGWA++N
Sbjct: 121 DGIGAPKIEKPLQKHGGRLGHDEKYCGSCFGAEESDEHCCNSCEEVREAYRKKGWAMTNM 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC+REG++QR+K+EEGEGCN+ G LEVNKVAGNFHFA GKSF QS + + D+LA Q
Sbjct: 181 DLIDQCQREGYVQRVKDEEGEGCNLQGSLEVNKVAGNFHFATGKSFLQSAIFLADVLALQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ +NISH+INKL+FG HFPG+VNPLDGVRW Q GMYQYFIKVVPT+YTD+ G I
Sbjct: 241 DNHYNISHRINKLSFGHHFPGLVNPLDGVRWVQGPTHGMYQYFIKVVPTIYTDIRGRVIH 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQ+SVTEHF+SSE G +PGVFFFYD+SPIKV F EEH FLHFLTN+CAI+GGV
Sbjct: 301 SNQYSVTEHFKSSELG--VAVPGVFFFYDISPIKVNFKEEHTPFLHFLTNICAIIGGVLA 358
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
V+GIID+ IY+GQR IK+K+E+GKF+
Sbjct: 359 VAGIIDSSIYYGQRTIKRKMELGKFT 384
>gi|224073341|ref|XP_002304080.1| predicted protein [Populus trichocarpa]
gi|222841512|gb|EEE79059.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 605 bits (1560), Expect = e-170, Method: Compositional matrix adjust.
Identities = 274/385 (71%), Positives = 336/385 (87%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD I K+R+LDAYPKINEDFYSRT SGG+ITL+SS+++L LFFSEL LYL+ VTETKLL
Sbjct: 1 MDRIYQKVRNLDAYPKINEDFYSRTLSGGLITLISSVLILFLFFSELSLYLHKVTETKLL 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRG++LRINFDVTFPA+ CS+LSVDA+DISGEQHLD++HDI KKR+++ G+VIE RQ
Sbjct: 61 VDTSRGQSLRINFDVTFPAIRCSLLSVDAIDISGEQHLDIRHDISKKRINAHGDVIEVRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIGAPKID+PLQ HGGRL HNE YCGSC+G E S +DCCN CEEVREAYR+KGWA++N
Sbjct: 121 EGIGAPKIDRPLQSHGGRLGHNEEYCGSCFGGEMSHDDCCNTCEEVREAYRRKGWAMTNM 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGF+Q IK+EEGEGCNI G LEVN+VAG+FHFAP KSFH S + D+L Q
Sbjct: 181 DLIDQCKREGFIQMIKDEEGEGCNINGSLEVNRVAGSFHFAPWKSFHLSNFLIQDLLDLQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+DS+NISH+IN+LAFG++FPGVVNPL G++ +TP+G+ Q+FIKVVPT+YTD+ G T+
Sbjct: 241 KDSYNISHRINRLAFGDYFPGVVNPLAGIQLMHDTPNGVQQFFIKVVPTIYTDIRGRTVH 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQ+S TEHF+ SE L +LPGV+FFYD SPIKV F EEH+SFLHF+T++CAI+GG+FT
Sbjct: 301 SNQYSATEHFKKSELTPLDSLPGVYFFYDFSPIKVIFKEEHISFLHFMTSICAIIGGIFT 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
++GIID+FIY+GQRAI KK+ IGKF
Sbjct: 361 IAGIIDSFIYYGQRAITKKVGIGKF 385
>gi|217071774|gb|ACJ84247.1| unknown [Medicago truncatula]
Length = 384
Score = 602 bits (1553), Expect = e-170, Method: Compositional matrix adjust.
Identities = 272/385 (70%), Positives = 335/385 (87%), Gaps = 2/385 (0%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD + NK+R+LDAYPK+NEDFY+RT +GGV+T+VS+ VML LF SELRLYL VTE+KLL
Sbjct: 1 MDKVFNKLRNLDAYPKVNEDFYNRTLAGGVVTVVSAAVMLFLFISELRLYLYTVTESKLL 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETL INFDVTFPA+ CSILS+D MDISGE+H D+ H+I K+R+D+ G VIE+R+
Sbjct: 61 VDTSRGETLNINFDVTFPAVRCSILSLDTMDISGERHHDILHNIMKQRIDANGKVIEARK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIGAPKI++PLQ+HGGRLEH+E YCGSC+GAE SD+ CCNNCEEVREAYRKKGWAL+N
Sbjct: 121 EGIGAPKIERPLQKHGGRLEHDEKYCGSCFGAEESDDHCCNNCEEVREAYRKKGWALTNI 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC+REGF+Q++K+EEGEGCNI+G LEVNKVAGNFHFA G+SF QS + + D+LA Q
Sbjct: 181 DLIDQCQREGFVQKVKDEEGEGCNIHGSLEVNKVAGNFHFATGQSFLQSAIFLTDLLALQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ +NISH+INKL+FG H+PG+VNPLDG++W Q GM QYFIKVVPTVYTD+ G I
Sbjct: 241 DNHYNISHQINKLSFGHHYPGLVNPLDGIKWVQGNDHGMCQYFIKVVPTVYTDIRGRVIH 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQ+SVTEHF+SSE G +PGVFFFYD+SPIKV F EEH+ FLHFLTN+CAI+GG+FT
Sbjct: 301 SNQYSVTEHFKSSELG--AAVPGVFFFYDISPIKVNFKEEHIPFLHFLTNICAIIGGIFT 358
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
++GI+D+ IY+GQ+ IKKK+EIGK+
Sbjct: 359 IAGIVDSSIYYGQKTIKKKMEIGKY 383
>gi|302790744|ref|XP_002977139.1| hypothetical protein SELMODRAFT_271242 [Selaginella moellendorffii]
gi|302820940|ref|XP_002992135.1| hypothetical protein SELMODRAFT_162191 [Selaginella moellendorffii]
gi|300140061|gb|EFJ06790.1| hypothetical protein SELMODRAFT_162191 [Selaginella moellendorffii]
gi|300155115|gb|EFJ21748.1| hypothetical protein SELMODRAFT_271242 [Selaginella moellendorffii]
Length = 386
Score = 596 bits (1536), Expect = e-168, Method: Compositional matrix adjust.
Identities = 277/383 (72%), Positives = 330/383 (86%), Gaps = 1/383 (0%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
++ K++ LDAYPKINEDF+SRT SGGVIT+VSSI M +LF +EL+L+L T ++LLVDT
Sbjct: 3 MLKKLQQLDAYPKINEDFHSRTLSGGVITVVSSIFMAILFITELKLFLLPGTTSELLVDT 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESR-QDG 122
SRGETL+INFD+TFPAL CS++S+DAMD+SGEQHLDVKH+IFKKRLD G V++ Q+
Sbjct: 63 SRGETLQINFDITFPALACSVISLDAMDVSGEQHLDVKHNIFKKRLDPSGKVVQPPVQED 122
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
IG PKIDKPLQ+HGGRLEHNETYCGSC+GAE SD++CCN+CEEVREAYRK+GWA+ N DL
Sbjct: 123 IGGPKIDKPLQKHGGRLEHNETYCGSCFGAEQSDDECCNSCEEVREAYRKRGWAIHNADL 182
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
IDQCKREG+L +IKEEEGEGCNIYG LEVNKVAGNFHFAPGKSF Q VHVHD+ + ++
Sbjct: 183 IDQCKREGWLTKIKEEEGEGCNIYGSLEVNKVAGNFHFAPGKSFSQQHVHVHDVQSLHKE 242
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
FN+SH IN+L+FG FPGVVNPLD + Q+ PS MYQYFIKVVPT YTD++GH I +N
Sbjct: 243 KFNVSHYINELSFGARFPGVVNPLDKEKRIQKFPSAMYQYFIKVVPTAYTDMTGHKIVTN 302
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
QFSVT+HF++ E ++LPGVFFFY+LSPIKV FTE SFLHFLTNVCAI+GGVFTVS
Sbjct: 303 QFSVTDHFKAVEGLNGRSLPGVFFFYELSPIKVLFTERKTSFLHFLTNVCAIIGGVFTVS 362
Query: 363 GIIDAFIYHGQRAIKKKIEIGKF 385
GIID+FIYHG RAIKKK+EIGK+
Sbjct: 363 GIIDSFIYHGHRAIKKKMEIGKY 385
>gi|357112459|ref|XP_003558026.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Brachypodium distachyon]
Length = 387
Score = 596 bits (1536), Expect = e-168, Method: Compositional matrix adjust.
Identities = 273/385 (70%), Positives = 332/385 (86%), Gaps = 4/385 (1%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ NK+RSLDAYPK+NEDFYSRT SGG+IT+ SS+ +LLLFFSE+RLYL + TE+KL VDT
Sbjct: 3 LWNKLRSLDAYPKVNEDFYSRTLSGGLITIASSLAILLLFFSEIRLYLYSATESKLTVDT 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
SRGE L INFDVTFPALPCS++++D MD+SGEQH D++HDIFKKR+D GNVIESR+DG+
Sbjct: 63 SRGERLHINFDVTFPALPCSLVAIDTMDVSGEQHYDIRHDIFKKRIDHLGNVIESRKDGV 122
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G+PKI++PLQ HGGRL+HNE YCGSCYG+E SD+ CCN+CEEVR+AYRKKGWAL+N + I
Sbjct: 123 GSPKIERPLQNHGGRLDHNEAYCGSCYGSEESDDQCCNSCEEVRDAYRKKGWALTNVESI 182
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
DQCKREGF+QR+K+E+GEGCNI+GF++VNKVAGNFHFAPGK QS + D+L FQ ++
Sbjct: 183 DQCKREGFVQRLKDEQGEGCNIHGFVDVNKVAGNFHFAPGKHLDQSFNFLQDMLNFQPEN 242
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQ 300
+NISHKINKL+FG+ FPGVVNPLDGV W QE +GMYQYF+KVVPT+YTD+ G I
Sbjct: 243 YNISHKINKLSFGKEFPGVVNPLDGVEWKQEQATGLTGMYQYFVKVVPTIYTDIRGRKIH 302
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR + G + PGV+FFY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FT
Sbjct: 303 SNQFSVTEHFREA-IGFPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFT 361
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
V+GIID+F+YHG RAIKKK+EIGK
Sbjct: 362 VAGIIDSFVYHGHRAIKKKMEIGKL 386
>gi|168024878|ref|XP_001764962.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683771|gb|EDQ70178.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 595 bits (1534), Expect = e-167, Method: Compositional matrix adjust.
Identities = 273/385 (70%), Positives = 331/385 (85%), Gaps = 2/385 (0%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A+ NK++ LDAYPKI+EDFYSRT SGGVITLVS++ M +LF +E+ LYL+A T+ +L+VD
Sbjct: 2 AVFNKLKQLDAYPKISEDFYSRTLSGGVITLVSTVFMFVLFVTEISLYLSAQTQNQLVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES-RQD 121
TSRGETL+IN D+TFPAL CS++S+DAMDISGEQHL+V+H+IFKKRLD G V+ + + D
Sbjct: 62 TSRGETLQINLDITFPALACSMVSLDAMDISGEQHLNVRHNIFKKRLDVHGKVVNAPKPD 121
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
I APK+ KPLQ+HGGRLEHNETYCGSC+GAESSD++CCNNCEEVREAYRKKGWAL+N D
Sbjct: 122 AINAPKVQKPLQKHGGRLEHNETYCGSCFGAESSDDECCNNCEEVREAYRKKGWALTNAD 181
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
LIDQC REGF++R+KEE GEGCNIYG LEVNKVAGNFHFAPGKSF QS +H+ D++ F
Sbjct: 182 LIDQCHREGFIERVKEEAGEGCNIYGKLEVNKVAGNFHFAPGKSFQQSAMHLLDLMGFIT 241
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
DSFN+SH IN+L+FG HFPG VNPLD V Q+ +GMYQYFIKVVPTVYTD+ G I +
Sbjct: 242 DSFNVSHTINELSFGAHFPGAVNPLDKVTNIQKDLNGMYQYFIKVVPTVYTDIKGRKIST 301
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQFSVTEH+ + + G + +PGVFFFYDLSPIKV F+EE SFLHFLTNVCAIVGGV+++
Sbjct: 302 NQFSVTEHYTAGDHGP-RFVPGVFFFYDLSPIKVKFSEERPSFLHFLTNVCAIVGGVYSI 360
Query: 362 SGIIDAFIYHGQRAIKKKIEIGKFS 386
+GIID+F+YHG RAIKKK+E+GK S
Sbjct: 361 AGIIDSFVYHGHRAIKKKMELGKLS 385
>gi|168014180|ref|XP_001759631.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689170|gb|EDQ75543.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 382
Score = 592 bits (1526), Expect = e-166, Method: Compositional matrix adjust.
Identities = 270/378 (71%), Positives = 322/378 (85%), Gaps = 1/378 (0%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
++ K++SLDAYPKINEDFYSRT SGG+IT++S+ M+LLFFSEL+LYL A L+VDT
Sbjct: 1 MIQKLKSLDAYPKINEDFYSRTLSGGIITIISATFMVLLFFSELKLYLAAQVANDLVVDT 60
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE-SRQDG 122
RG T++IN DVTFPAL CS++S+DAMDISGE HLDVKH+IFKKRLD G VIE +RQ+
Sbjct: 61 ERGGTIQINLDVTFPALACSVVSLDAMDISGEAHLDVKHNIFKKRLDVNGKVIEPARQES 120
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
I PK+DKPLQ+HGGRLEHNETYCGSC+GAE+ ++ CCNNCEEVREAYRKKGWAL+NPDL
Sbjct: 121 INQPKLDKPLQKHGGRLEHNETYCGSCFGAETEEDHCCNNCEEVREAYRKKGWALNNPDL 180
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
IDQCKREGFLQ+IK+E+GEGCN+YG LE NKVAGNFHFAPGKSF Q+ +HVHD++AF +D
Sbjct: 181 IDQCKREGFLQKIKDEDGEGCNVYGTLEANKVAGNFHFAPGKSFQQANMHVHDLMAFGKD 240
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
SFN+SHKIN+++FG +PG VNPLD + Q T GMYQYFIKVVPTVYTD G I +N
Sbjct: 241 SFNVSHKINEISFGVRYPGAVNPLDKLERIQTTTHGMYQYFIKVVPTVYTDTRGRKISTN 300
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
QF+VT+HF+ G LPGVFFFYDLSPIKV FTE+ +SF HFLTNVCAIVGGVF+VS
Sbjct: 301 QFAVTDHFKGVGPGEDHALPGVFFFYDLSPIKVKFTEKRMSFFHFLTNVCAIVGGVFSVS 360
Query: 363 GIIDAFIYHGQRAIKKKI 380
GIIDAF+YHGQ+ IKK++
Sbjct: 361 GIIDAFVYHGQKQIKKRL 378
>gi|108707873|gb|ABF95668.1| Serologically defined breast cancer antigen NY-BR-84, putative,
expressed [Oryza sativa Japonica Group]
Length = 387
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 271/385 (70%), Positives = 332/385 (86%), Gaps = 4/385 (1%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ NK+RSLDAYPK+NEDFYSRT SGG+IT+ SS+ +LLLF SE+RLYL + T++KL VDT
Sbjct: 3 LWNKLRSLDAYPKVNEDFYSRTLSGGLITIASSLAILLLFLSEIRLYLYSATDSKLTVDT 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
SRGE L INFDVTFPALPCS+++VD MD+SGEQH D++HDI KKR+D+ GNVIESR+DG+
Sbjct: 63 SRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDIIKKRIDNLGNVIESRKDGV 122
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
GAPKI++PLQ+HGGRL+HNE YCGSCYG+E SD+ CCN+CE+VR+AYRKKGWAL+N + I
Sbjct: 123 GAPKIERPLQKHGGRLDHNEVYCGSCYGSEESDDQCCNSCEDVRDAYRKKGWALTNIEEI 182
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
DQCKREGF+QR+K+E+GEGC+I+GF+ VNKVAGNFHFAPGKS QS + D+L FQ+++
Sbjct: 183 DQCKREGFVQRLKDEQGEGCSIHGFVNVNKVAGNFHFAPGKSLDQSFNFLQDLLNFQQEN 242
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQ 300
+NISHKINKL+FG FPGVVNPLDGV W QE +GMYQYF+KVVPT+YTD+ G I
Sbjct: 243 YNISHKINKLSFGVEFPGVVNPLDGVEWIQEHTNGLTGMYQYFVKVVPTIYTDIRGRKIN 302
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR + G + PGV+FFY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FT
Sbjct: 303 SNQFSVTEHFREA-IGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFT 361
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
V+GIID+F+YHG RAIKKK+EIGK
Sbjct: 362 VAGIIDSFVYHGHRAIKKKMEIGKL 386
>gi|449438787|ref|XP_004137169.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 386
Score = 586 bits (1510), Expect = e-165, Method: Compositional matrix adjust.
Identities = 275/385 (71%), Positives = 337/385 (87%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDAI NK+R+LDAYPKINEDFY RTFSGG+ITL SS ML LFFSELR+YL+A TET+L+
Sbjct: 1 MDAIFNKLRNLDAYPKINEDFYRRTFSGGLITLASSFFMLFLFFSELRMYLHAKTETQLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRG L INFD++FPA+PCSILS+DA+DISGEQHLD++H+I KKR+D G VIE+R
Sbjct: 61 VDTSRGGELHINFDLSFPAIPCSILSLDAIDISGEQHLDIRHNIIKKRIDHLGTVIEARP 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIGAPKI+KPLQ+HGGRLEHNETYCGSC+GAE+SD+DCCN+CEEVREAYRKKGWA++N
Sbjct: 121 DGIGAPKIEKPLQKHGGRLEHNETYCGSCFGAEASDDDCCNSCEEVREAYRKKGWAITNQ 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC+RE F+Q++K+EEGEGCNI G LEVNKVAG+FHF PGKSF+QS + +LA Q
Sbjct: 181 DLIDQCQREDFIQKVKDEEGEGCNIEGSLEVNKVAGSFHFVPGKSFYQSSFNFLGLLALQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+N+SH+IN+LAFG H+ G+VNPLDGV W + M+QYF+KVVPT+Y ++ G T+
Sbjct: 241 TSDYNVSHRINRLAFGNHYDGLVNPLDGVHWEYNEQNVMHQYFVKVVPTIYKNIRGRTVH 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQ+SVTEHF+S E G Q++PGVFF+YDLSP+KVT+TEEHV FLHF+T++CAI+GGVF+
Sbjct: 301 SNQYSVTEHFKSVEFGSSQSIPGVFFYYDLSPVKVTYTEEHVPFLHFMTHICAIIGGVFS 360
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
V+GIIDAFIYHGQR +KKK+EIGKF
Sbjct: 361 VAGIIDAFIYHGQRKMKKKVEIGKF 385
>gi|168004517|ref|XP_001754958.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694062|gb|EDQ80412.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 580 bits (1494), Expect = e-163, Method: Compositional matrix adjust.
Identities = 266/385 (69%), Positives = 325/385 (84%), Gaps = 2/385 (0%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
AI NK++ LDA+PKI+EDFYSRT SGGVITLVSSI M LLF +E R+YL+A T+ +L+VD
Sbjct: 2 AIFNKLKQLDAHPKISEDFYSRTLSGGVITLVSSIFMFLLFVTEFRIYLSAQTQNQLVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES-RQD 121
TSRGETL+IN D+TFPAL CS++S+DAMDISGE HLDV+H+I+KKRLD G +++ + D
Sbjct: 62 TSRGETLQINLDITFPALACSVVSLDAMDISGELHLDVRHNIYKKRLDVHGKAVDAPKPD 121
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
I APK+ KPLQ+HGGRLE +ETYCGSC+GAESSD+ CCN+CEEVREAYRKKGWAL+N D
Sbjct: 122 AINAPKVQKPLQKHGGRLEDHETYCGSCFGAESSDDQCCNSCEEVREAYRKKGWALTNTD 181
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
LIDQC REGF++RIKEE GEGCNIYG LEVNKVAGNF APGKSF QS +H+ D++ F
Sbjct: 182 LIDQCHREGFIERIKEEAGEGCNIYGKLEVNKVAGNFQIAPGKSFQQSAMHLLDLMGFVT 241
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
DSFN+SH IN+L+FG +FPG VNPLD V Q+ +GM+QYFIKVVPTVYTD+ G I +
Sbjct: 242 DSFNVSHTINELSFGAYFPGAVNPLDKVTSIQKDQNGMFQYFIKVVPTVYTDIKGRKIST 301
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQFSV EH+ + + G + +PGVFFFYDL+PIKV FTEE SFLHFLTNVCAI+GG++T+
Sbjct: 302 NQFSVMEHYTAGDHGP-RVIPGVFFFYDLTPIKVKFTEERPSFLHFLTNVCAIIGGIYTI 360
Query: 362 SGIIDAFIYHGQRAIKKKIEIGKFS 386
+GI+D+FIYHG RAIKKK+E+GK S
Sbjct: 361 AGIVDSFIYHGHRAIKKKMELGKLS 385
>gi|226498912|ref|NP_001150650.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|194699894|gb|ACF84031.1| unknown [Zea mays]
gi|195640862|gb|ACG39899.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
Length = 387
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/385 (70%), Positives = 332/385 (86%), Gaps = 4/385 (1%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ +K+R+LDAYPK+NEDFYSRT SGG+IT++SS+ +LLLFFSE+RLYL + TE+KL VDT
Sbjct: 3 LWSKLRNLDAYPKVNEDFYSRTLSGGLITILSSLAILLLFFSEIRLYLYSATESKLTVDT 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
SRGE L INFDVTFPALPCS+++VD MD+SGEQH D++HDI KKR+D GNVIESR+DG+
Sbjct: 63 SRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDIIKKRIDHLGNVIESRKDGV 122
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
GAPKI++PLQ+HGGRL+HNE YCGSCYGAE SD+ CCN+CEEVR+AYRKKGWA++N +LI
Sbjct: 123 GAPKIERPLQKHGGRLDHNEVYCGSCYGAEESDDQCCNSCEEVRDAYRKKGWAVNNVELI 182
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
DQCKREG++QR+K+E+GEGC I+GF+ VNKVAGNFHFAPGKS QS + D+L Q ++
Sbjct: 183 DQCKREGYVQRLKDEQGEGCTIHGFVNVNKVAGNFHFAPGKSLDQSFNFLQDLLNLQPET 242
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQ 300
+NISHKINKL+FGE FPGVVNPLDGV W Q+ +GMYQYF+KVVPT+YTD+ G I
Sbjct: 243 YNISHKINKLSFGEEFPGVVNPLDGVEWIQDNSNGLTGMYQYFVKVVPTIYTDIRGRKIH 302
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR + G + PGV+FFY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FT
Sbjct: 303 SNQFSVTEHFREA-IGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFT 361
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
V+GIID+F+YHG RAIKKK+E+GK
Sbjct: 362 VAGIIDSFVYHGHRAIKKKMELGKL 386
>gi|242088319|ref|XP_002439992.1| hypothetical protein SORBIDRAFT_09g023960 [Sorghum bicolor]
gi|241945277|gb|EES18422.1| hypothetical protein SORBIDRAFT_09g023960 [Sorghum bicolor]
Length = 384
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 263/387 (67%), Positives = 325/387 (83%), Gaps = 6/387 (1%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA + +++ LDAYPK+NEDFY RT SGG++TLV+++VMLLLF SE R Y + TETKL+
Sbjct: 1 MDAFLQRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSATETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LR+NFD+TFP++PC++LSVD MDISGEQH D++HDI K+RLDS GNVIE+R+
Sbjct: 61 VDTSRGERLRVNFDITFPSIPCTLLSVDTMDISGEQHHDIRHDIEKRRLDSHGNVIEARK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIG KI++PLQ+HGGRL+ E YCG+CYGAE SDE CCN+CEEVREAY+KKGWAL+NP
Sbjct: 121 EGIGGAKIERPLQKHGGRLDKGEQYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC RE F++R+K ++ EGCN++GFL+V+KVAGNFHFAPGK F++S + V + L+
Sbjct: 181 DLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNFHFAPGKGFYESNIDVPE-LSVL 239
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
FNI+HKINKL+FG FPGVVNPLDG +W Q G YQYFIKVVPT+YTD+ GH I
Sbjct: 240 EGGFNITHKINKLSFGTEFPGVVNPLDGAQWIQPASDGTYQYFIKVVPTIYTDIRGHNIH 299
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
SNQFSVTEHFR G + + PGVFFFYD SPIKV FTEE+ S LH+LTN+CAIVGGV
Sbjct: 300 SNQFSVTEHFRD---GNILPKPQPGVFFFYDFSPIKVIFTEENRSLLHYLTNLCAIVGGV 356
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGKF 385
FTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 357 FTVSGIIDSFIYHGQKALKKKMELGKY 383
>gi|357133202|ref|XP_003568216.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Brachypodium distachyon]
Length = 384
Score = 576 bits (1485), Expect = e-162, Method: Compositional matrix adjust.
Identities = 266/386 (68%), Positives = 323/386 (83%), Gaps = 4/386 (1%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD + K++ LDAYPK+NEDFY RT SGGV+TLVS++VMLLLF SE YLN+ TETKL+
Sbjct: 1 MDGFLQKLKGLDAYPKVNEDFYKRTLSGGVVTLVSAVVMLLLFISETSSYLNSATETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LR+NFD+TFP++PC++LSVD DISGEQH D++HDI KKRL+S GNVIESR+
Sbjct: 61 VDTSRGERLRVNFDITFPSIPCTLLSVDTRDISGEQHQDIRHDIEKKRLNSHGNVIESRK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIG KI++PLQ+HGGRL+ E YCG+CYGAE SDE CCN+C+EVREAY+KKGWAL+NP
Sbjct: 121 EGIGGAKIERPLQKHGGRLDKGEQYCGTCYGAEESDEQCCNSCDEVREAYKKKGWALTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC RE F++R+K + GEGC+++GFL+V+KVAGNFHFAPG+ F++S V V ++ + +
Sbjct: 181 DLIDQCAREDFVERVKTQHGEGCSVHGFLDVSKVAGNFHFAPGRGFYESNVDVPELSSLE 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
FNI+HKINKL+FG FPGVVNPLDG +WTQ G YQYFIKVVPT YTD G I
Sbjct: 241 -GGFNITHKINKLSFGTEFPGVVNPLDGAQWTQPASDGTYQYFIKVVPTNYTDTRGRKID 299
Query: 301 SNQFSVTEHFRSSE-QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
SNQFSVTEHFR R Q PGVFFFYD SPIKV FTEE+ SFLH+LTN+CAIVGG+F
Sbjct: 300 SNQFSVTEHFRDGNVHPRPQ--PGVFFFYDFSPIKVIFTEENKSFLHYLTNLCAIVGGIF 357
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGKF 385
TVSGIID+FIYHGQ+A+KKK+EIGK+
Sbjct: 358 TVSGIIDSFIYHGQKALKKKMEIGKY 383
>gi|212721670|ref|NP_001132255.1| uncharacterized protein LOC100193691 [Zea mays]
gi|194693892|gb|ACF81030.1| unknown [Zea mays]
gi|223949235|gb|ACN28701.1| unknown [Zea mays]
gi|413949703|gb|AFW82352.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 384
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 261/385 (67%), Positives = 323/385 (83%), Gaps = 2/385 (0%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA +++++ LDAYPK+NEDFY RT SGG++TLV+++VMLLLF SE R Y + TETKL+
Sbjct: 1 MDAFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LR+NFD+TFP++PC++LSVD DISGEQH D++HDI K+RL+S GNVIE+R+
Sbjct: 61 VDTSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVIEARK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIG K+++PLQ+HGGRL+ E YCG+CYGAE SDE CCN+CEEVREAY+KKGWAL+NP
Sbjct: 121 EGIGGAKVERPLQKHGGRLDKGEQYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC RE F+ R+K ++ EGCN+ GFL+V+KVAGNFHFAPGK F++S + V + L+
Sbjct: 181 DLIDQCAREDFIDRVKTQQDEGCNVLGFLDVSKVAGNFHFAPGKGFYESNIDVPE-LSLL 239
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
FNISHKINKL+FG FPGVVNPLDG +WTQ G YQYFIKVVPT+YTD+ G I
Sbjct: 240 EGGFNISHKINKLSFGTEFPGVVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDIRGRGIH 299
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR R ++ PGVFFFYD SPIKV FTEE+ S LH+LTN+CAIVGGVFT
Sbjct: 300 SNQFSVTEHFRDGNV-RPKSQPGVFFFYDFSPIKVIFTEENRSLLHYLTNLCAIVGGVFT 358
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
VSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 359 VSGIIDSFIYHGQKALKKKMELGKY 383
>gi|326510689|dbj|BAJ87561.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514988|dbj|BAJ99855.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326533080|dbj|BAJ93512.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 383
Score = 573 bits (1478), Expect = e-161, Method: Compositional matrix adjust.
Identities = 264/386 (68%), Positives = 322/386 (83%), Gaps = 5/386 (1%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD +++K++ LDAYPK+NEDFY RT SGGV+TL+S+ VMLLLF SE + Y + TETKL+
Sbjct: 1 MDGLLSKLKGLDAYPKVNEDFYKRTLSGGVVTLLSAFVMLLLFVSETKSYFYSATETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LR+NFD+TFP++PC++LSVD DISGEQH D++HDI KKRLDS GNVIESR+
Sbjct: 61 VDTSRGERLRVNFDITFPSIPCTLLSVDTRDISGEQHQDIRHDIEKKRLDSHGNVIESRK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIG KI+KPLQ+HGGRL E YCG+CYGAE SDE CCN+CEEVREAY+KKGWAL+NP
Sbjct: 121 EGIGGTKIEKPLQKHGGRLGKGEEYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC RE F++R+K + GEGC+++GFL+V+KVAGNFHFAPGK +++S V + ++ A
Sbjct: 181 DLIDQCAREDFVERVKTQHGEGCSVHGFLDVSKVAGNFHFAPGKGYYESNVDMPELSA-- 238
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
FNI+HKINKL+FG FPG VNPLDG +WTQ G YQYFIKVVPT+Y D+ G I
Sbjct: 239 EGGFNITHKINKLSFGTEFPGAVNPLDGAQWTQPASDGTYQYFIKVVPTIYNDIRGRKID 298
Query: 301 SNQFSVTEHFRSSE-QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
SNQFSVTEHFR Q R Q PGVFFFYD SPIKV FTEE+ SFLH+LTN+CAIVGG+F
Sbjct: 299 SNQFSVTEHFRDGNVQPRPQ--PGVFFFYDFSPIKVIFTEENRSFLHYLTNLCAIVGGIF 356
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGKF 385
TV+GIID+FIYHGQ+A+KKK+EIGK+
Sbjct: 357 TVAGIIDSFIYHGQKALKKKMEIGKY 382
>gi|226494401|ref|NP_001141198.1| uncharacterized protein LOC100273285 [Zea mays]
gi|194703210|gb|ACF85689.1| unknown [Zea mays]
gi|238011828|gb|ACR36949.1| unknown [Zea mays]
gi|413945823|gb|AFW78472.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 384
Score = 573 bits (1477), Expect = e-161, Method: Compositional matrix adjust.
Identities = 261/385 (67%), Positives = 320/385 (83%), Gaps = 2/385 (0%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA + +++ LDAYPK+NEDFY RT SGG++TLV+++VMLLLF SE R Y + TETKL+
Sbjct: 1 MDAFLQRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSATETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LR+NFD+TF ++PC++LSVD MDISGEQH D++HDI K RLD+ GNVIE+R+
Sbjct: 61 VDTSRGERLRVNFDITFLSIPCTLLSVDTMDISGEQHQDIRHDIEKIRLDAHGNVIEARK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
IG KI++PLQ+HGGRL+ E YCG+CYGAE SDE CCN+CEEVREAY+KKGWAL+NP
Sbjct: 121 VSIGGAKIERPLQKHGGRLDKGEQYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC RE F++R+K ++ EGCN++GFL+V+KVAGNFHFAPGK F++S + V + L+
Sbjct: 181 DLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNFHFAPGKGFYESNIDVPE-LSLL 239
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
FNI+HKINKL+FG FPGVVNPLDG +WTQ G YQYFIKVVPT+YTD+ GH I
Sbjct: 240 EGGFNITHKINKLSFGTEFPGVVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDIRGHNIH 299
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR R + PGVFFFYD SPIKV FTEE S LH+LTN+CAIVGGVFT
Sbjct: 300 SNQFSVTEHFRDGNV-RPKPQPGVFFFYDFSPIKVIFTEESRSLLHYLTNLCAIVGGVFT 358
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
VSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 359 VSGIIDSFIYHGQKALKKKMELGKY 383
>gi|222628979|gb|EEE61111.1| hypothetical protein OsJ_15023 [Oryza sativa Japonica Group]
Length = 369
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 274/395 (69%), Positives = 317/395 (80%), Gaps = 35/395 (8%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M+ +++K+RSLDAYPK+NEDFYSRT SGG+ITL SS+VMLLLF SELR L
Sbjct: 1 MEGLLSKLRSLDAYPKVNEDFYSRTLSGGIITLASSVVMLLLFVSELRHTLT-------- 52
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
+ G L++ FDVTFPAL CSI+S+DAMDISG++HLDVKHDIFK+R+D GNVI ++Q
Sbjct: 53 --YTFGMILKMQFDVTFPALQCSIISLDAMDISGQEHLDVKHDIFKQRIDVHGNVIATKQ 110
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAES---------SDEDCCNNCEEVREAYR 171
D +G N Y G G + SDE CCN+CE+VREAYR
Sbjct: 111 DAVGG----------------NGPYSGMAAGLNTMRPIVALVMSDEQCCNSCEDVREAYR 154
Query: 172 KKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
KKGW +SNPDLIDQCKREGFLQ IK+EEGEGCNIYGFLEVNKVAGNFHFAPGKSF ++ V
Sbjct: 155 KKGWGVSNPDLIDQCKREGFLQSIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQKANV 214
Query: 232 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 291
HVHD+L FQ+DSFN+SHKINKL+FG+ FPGVVNPLDG +W Q + GMYQYFIKVVPTVY
Sbjct: 215 HVHDLLPFQKDSFNVSHKINKLSFGQRFPGVVNPLDGAQWMQHSSYGMYQYFIKVVPTVY 274
Query: 292 TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
TD++ H I SNQFSVTEHFRSSE GR+Q +PGVFFFYDLSPIKVTFTE+HVSFLHFLTNV
Sbjct: 275 TDINEHIILSNQFSVTEHFRSSESGRIQAVPGVFFFYDLSPIKVTFTEQHVSFLHFLTNV 334
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
CAIVGGVFTVSGIID+F+YHGQRAIKKK+EIGKF+
Sbjct: 335 CAIVGGVFTVSGIIDSFVYHGQRAIKKKMEIGKFN 369
>gi|242035905|ref|XP_002465347.1| hypothetical protein SORBIDRAFT_01g036890 [Sorghum bicolor]
gi|241919201|gb|EER92345.1| hypothetical protein SORBIDRAFT_01g036890 [Sorghum bicolor]
Length = 387
Score = 570 bits (1468), Expect = e-160, Method: Compositional matrix adjust.
Identities = 267/385 (69%), Positives = 329/385 (85%), Gaps = 4/385 (1%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ +K+R+LDAYPK+NEDFYSRT SGG+IT++SS+ +LLLFFSE+RLYL + TE+KL VDT
Sbjct: 3 LWSKLRNLDAYPKVNEDFYSRTLSGGLITILSSLAILLLFFSEIRLYLYSATESKLTVDT 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
SRGE L INFDVTFPALPCS+++VD MD+SGEQH D++HDI KKR+D GNVIESR+D +
Sbjct: 63 SRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDITKKRIDHLGNVIESRKDRV 122
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
GAPKI++PLQ+HGGRL+HNE YCGSCYGAE +D+ CCN+CEEVR+ YRKKGWA++N +LI
Sbjct: 123 GAPKIERPLQKHGGRLDHNEVYCGSCYGAEETDDQCCNSCEEVRDVYRKKGWAINNVELI 182
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
DQCKREG++QR+K+E GEGC I+GF+ VNKVAGNFHFAPGKS QS + D+L Q ++
Sbjct: 183 DQCKREGYVQRLKDETGEGCTIHGFVNVNKVAGNFHFAPGKSLDQSFNFLQDLLNIQPET 242
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQ 300
+NISHKINKL+FGE FPGVVNPLDGV W Q+ +GMYQYF+KVVPT+YTD+ G I
Sbjct: 243 YNISHKINKLSFGEEFPGVVNPLDGVEWIQDNSNGLTGMYQYFVKVVPTIYTDIRGRKIY 302
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR + G + PGV+FFY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FT
Sbjct: 303 SNQFSVTEHFREA-IGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFT 361
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
V+GIID+F+YHG RAIKKK+E+GK
Sbjct: 362 VAGIIDSFVYHGHRAIKKKMELGKL 386
>gi|326497521|dbj|BAK05850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 391
Score = 569 bits (1467), Expect = e-160, Method: Compositional matrix adjust.
Identities = 261/338 (77%), Positives = 298/338 (88%)
Query: 49 LYLNAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKR 108
LYL+AVTET L VDTSRGE LRINFD+TFPAL CSI+SVD MDISG++HLDVKHD+FK+R
Sbjct: 54 LYLHAVTETTLRVDTSRGEKLRINFDITFPALQCSIISVDVMDISGQEHLDVKHDVFKQR 113
Query: 109 LDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVRE 168
+D+ GNVI ++QD +G K++KPLQ HGGRLEHNETYCGSCYGA+ S E CCN+CE+VRE
Sbjct: 114 IDAHGNVIATKQDAVGGMKVEKPLQHHGGRLEHNETYCGSCYGAQESPEQCCNSCEDVRE 173
Query: 169 AYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQ 228
AYRKKGW +SNPD IDQCK EGFLQ IK+EEGEGCNIYGFLE+NKVAGNFHFAPGKSF Q
Sbjct: 174 AYRKKGWGVSNPDSIDQCKSEGFLQTIKDEEGEGCNIYGFLEINKVAGNFHFAPGKSFQQ 233
Query: 229 SGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVP 288
S VHVHD+L FQ+DSFN+SHKINKL+FGE FPGV+NPLDG +W Q + GM QYF+KVVP
Sbjct: 234 SNVHVHDLLPFQKDSFNLSHKINKLSFGEPFPGVINPLDGAQWIQHSSYGMAQYFVKVVP 293
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
TVY+ ++ I SNQFSVTEH RS + GR+Q LPGVFFFYDLSPIKVTFTE HVSFLHFL
Sbjct: 294 TVYSHINEQIILSNQFSVTEHSRSGDSGRVQALPGVFFFYDLSPIKVTFTERHVSFLHFL 353
Query: 349 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
TNVCAIVGGVFTVSGIID+F+YHGQRAI KK E+GKF+
Sbjct: 354 TNVCAIVGGVFTVSGIIDSFVYHGQRAITKKRELGKFT 391
>gi|115464597|ref|NP_001055898.1| Os05g0490200 [Oryza sativa Japonica Group]
gi|50080302|gb|AAT69636.1| unknown protein [Oryza sativa Japonica Group]
gi|113579449|dbj|BAF17812.1| Os05g0490200 [Oryza sativa Japonica Group]
gi|218197014|gb|EEC79441.1| hypothetical protein OsI_20422 [Oryza sativa Indica Group]
gi|222632053|gb|EEE64185.1| hypothetical protein OsJ_19017 [Oryza sativa Japonica Group]
Length = 384
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 263/385 (68%), Positives = 323/385 (83%), Gaps = 2/385 (0%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M+ + K++ LDAYPK+NEDFY RT SGGV+T+V+S+VMLLLF SE R Y + TETKL+
Sbjct: 1 MEGFLQKLKGLDAYPKVNEDFYKRTLSGGVVTVVASVVMLLLFVSETRSYFYSATETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LR+NFDVTFP++PC++LSVD MDISGEQH D++HDI K+RLD+ GNVIE+R+
Sbjct: 61 VDTSRGERLRVNFDVTFPSVPCTLLSVDTMDISGEQHHDIRHDIEKRRLDAHGNVIEARK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIG KI+ PLQ+HGGRL E YCG+CYGAE SDE CCN+CEEVREAY+KKGWAL+NP
Sbjct: 121 EGIGGAKIESPLQKHGGRLSKGEEYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC RE F++R+K ++GEGCN++GFL+V+KVAGN HFAPGK F++S ++V ++ A +
Sbjct: 181 DLIDQCTREDFVERVKTQQGEGCNVHGFLDVSKVAGNLHFAPGKGFYESNINVPELSALE 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
FNI+HKINKL+FG FPGVVNPLDG +WTQ G YQYFIKVVPT+YTD+ G I
Sbjct: 241 H-GFNITHKINKLSFGTEFPGVVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDLRGRKIH 299
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR R + PGVFFFYD SPIKV FTEE+ S LH+LTN+CAIVGGVFT
Sbjct: 300 SNQFSVTEHFRDGNI-RPKPQPGVFFFYDFSPIKVIFTEENSSLLHYLTNLCAIVGGVFT 358
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
VSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 359 VSGIIDSFIYHGQKALKKKMELGKY 383
>gi|168019656|ref|XP_001762360.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686438|gb|EDQ72827.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 380
Score = 560 bits (1444), Expect = e-157, Method: Compositional matrix adjust.
Identities = 261/385 (67%), Positives = 319/385 (82%), Gaps = 7/385 (1%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ NK++ LDAYPKI+EDFYSRT SGG+ITLVSS+ M LLF +E R+YL+A T+ +L+VD
Sbjct: 2 SFFNKLKHLDAYPKISEDFYSRTLSGGLITLVSSVFMTLLFITEFRIYLSAQTQNQLVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES-RQD 121
TSRGETL+IN D+TF AL CS++S+DAMDISGEQHL+V+H+IFKKRLD G I++ + D
Sbjct: 62 TSRGETLQINLDITFSALACSVVSLDAMDISGEQHLNVRHNIFKKRLDVHGKAIDAPKPD 121
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
I APK+ +PLQ+HGGRLEHNETYCGSC+GA SSD++CCN+CEEVREAYRKKGWAL N D
Sbjct: 122 AINAPKVQRPLQKHGGRLEHNETYCGSCFGAASSDDECCNSCEEVREAYRKKGWALINID 181
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
+IDQC REGF++R+KEE GEGCNIYG LEVNKVAGNFH APGK F QS +H+ D+L +
Sbjct: 182 IIDQCHREGFIERVKEEAGEGCNIYGKLEVNKVAGNFHIAPGKLFQQSAMHLLDLLGIRS 241
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
DSFN+SH +N+L+FG HFPG VNPLD + Q+ +GMYQYFIKVVPTVYTD+ G I +
Sbjct: 242 DSFNVSHIVNELSFGAHFPGRVNPLDKITSIQKDQNGMYQYFIKVVPTVYTDIRGSEIAT 301
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQFSVTEH+ + + G + +PGVFFFYDLSPIKV FTE+ SFLHFLT VCAIVG
Sbjct: 302 NQFSVTEHYTAGDHGP-RVVPGVFFFYDLSPIKVKFTEKRPSFLHFLTTVCAIVG----- 355
Query: 362 SGIIDAFIYHGQRAIKKKIEIGKFS 386
+ IID+FIYHG RA+KKK+E+GKFS
Sbjct: 356 ASIIDSFIYHGHRAVKKKMELGKFS 380
>gi|79318328|ref|NP_001031077.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|332192090|gb|AEE30211.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 338
Score = 559 bits (1441), Expect = e-157, Method: Compositional matrix adjust.
Identities = 255/335 (76%), Positives = 304/335 (90%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +MN++R+LDAYPKINEDFY RT SGGVITL SSIVML+LFFSEL+LY++ VTET+L
Sbjct: 1 MVGVMNRLRNLDAYPKINEDFYRRTLSGGVITLASSIVMLILFFSELQLYIHPVTETQLR 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LRINFDVTFPAL CSI+S+D+MDISGE+HLDV+HDI K+RLDS GNVIE++Q
Sbjct: 61 VDTSRGEKLRINFDVTFPALQCSIISLDSMDISGERHLDVRHDIIKRRLDSSGNVIEAKQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
DGIG KI+KPLQ+HGGRLEHNETYCGSC+GAE+SD+ CCN+CEEVREAYRKKGWALS+P
Sbjct: 121 DGIGHTKIEKPLQKHGGRLEHNETYCGSCFGAEASDDACCNSCEEVREAYRKKGWALSDP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ IDQCKREGF+Q++K+EEGEGCN++GFLEVNKVAGNFHF PG+SFHQSG HD+L FQ
Sbjct: 181 ESIDQCKREGFVQKVKDEEGEGCNVHGFLEVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ ++NISHK+N+LAFG+ FPGVVNPLDGV+W Q SG+YQYFIKVVP++YTDV +TIQ
Sbjct: 241 QGNYNISHKVNRLAFGDFFPGVVNPLDGVQWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQ 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKV 335
SNQFSVTEHF++ E GR+Q+ PGVFF+YDLSPIKV
Sbjct: 301 SNQFSVTEHFQNMEAGRMQSPPGVFFYYDLSPIKV 335
>gi|326506194|dbj|BAJ86415.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 363
Score = 540 bits (1392), Expect = e-151, Method: Compositional matrix adjust.
Identities = 250/367 (68%), Positives = 303/367 (82%), Gaps = 5/367 (1%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD +++K++ LDAYPK+NEDFY RT SGGV+TL+S+ VMLLLF SE + Y + TETKL+
Sbjct: 1 MDGLLSKLKGLDAYPKVNEDFYKRTLSGGVVTLLSAFVMLLLFVSETKSYFYSATETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LR+NFD+TFP++PC++LSVD DISGEQH D++HDI KKRLDS GNVIESR+
Sbjct: 61 VDTSRGERLRVNFDITFPSIPCTLLSVDTRDISGEQHQDIRHDIEKKRLDSHGNVIESRK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIG KI+KPLQ+HGGRL E YCG+CYGAE SDE CCN+CEEVREAY+KKGWAL+NP
Sbjct: 121 EGIGGTKIEKPLQKHGGRLGKGEEYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC RE F++R+K + GEGC+++GFL+V+KVAGNFHFAPGK +++S V + ++ A
Sbjct: 181 DLIDQCAREDFVERVKTQHGEGCSVHGFLDVSKVAGNFHFAPGKGYYESNVDMPELSA-- 238
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
FNI+HKINKL+FG FPG VNPLDG +WTQ G YQYFIKVVPT+Y D+ G I
Sbjct: 239 EGGFNITHKINKLSFGTEFPGAVNPLDGAQWTQPASDGTYQYFIKVVPTIYNDIRGRKID 298
Query: 301 SNQFSVTEHFRSSE-QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
SNQFSVTEHFR Q R Q PGVFFFYD SPIKV FTEE+ SFLH+LTN+CAIVGG+F
Sbjct: 299 SNQFSVTEHFRDGNVQPRPQ--PGVFFFYDFSPIKVIFTEENRSFLHYLTNLCAIVGGIF 356
Query: 360 TVSGIID 366
TV+GIID
Sbjct: 357 TVAGIID 363
>gi|218192721|gb|EEC75148.1| hypothetical protein OsI_11348 [Oryza sativa Indica Group]
gi|222624836|gb|EEE58968.1| hypothetical protein OsJ_10656 [Oryza sativa Japonica Group]
Length = 355
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 252/385 (65%), Positives = 307/385 (79%), Gaps = 36/385 (9%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ NK+RSLDAYPK+NEDFYSRT SGG+IT+ SS+ +LLLF SE+RLYL + T++KL VDT
Sbjct: 3 LWNKLRSLDAYPKVNEDFYSRTLSGGLITIASSLAILLLFLSEIRLYLYSATDSKLTVDT 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
SRGE L INFDVTFPALPCS+++VD MD+SGEQH D++HDI KKR+D+ GNVIESR+DG+
Sbjct: 63 SRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDIIKKRIDNLGNVIESRKDGV 122
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
GAPKI++PLQ+HGGRL+HNE YCGSCYG+E SD+ CCN+CE+VR+AYRKKGWAL+N + I
Sbjct: 123 GAPKIERPLQKHGGRLDHNEVYCGSCYGSEESDDQCCNSCEDVRDAYRKKGWALTNIEEI 182
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
DQCKREGF+QR+K+E+GEGC+I+GF+ VNK
Sbjct: 183 DQCKREGFVQRLKDEQGEGCSIHGFVNVNK------------------------------ 212
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQ 300
ISHKINKL+FG FPGVVNPLDGV W QE +GMYQYF+KVVPT+YTD+ G I
Sbjct: 213 --ISHKINKLSFGVEFPGVVNPLDGVEWIQEHTNGLTGMYQYFVKVVPTIYTDIRGRKIN 270
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
SNQFSVTEHFR + G + PGV+FFY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FT
Sbjct: 271 SNQFSVTEHFREA-IGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFT 329
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKF 385
V+GIID+F+YHG RAIKKK+EIGK
Sbjct: 330 VAGIIDSFVYHGHRAIKKKMEIGKL 354
>gi|413949704|gb|AFW82353.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 398
Score = 525 bits (1353), Expect = e-146, Method: Compositional matrix adjust.
Identities = 238/356 (66%), Positives = 294/356 (82%), Gaps = 2/356 (0%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA +++++ LDAYPK+NEDFY RT SGG++TLV+++VMLLLF SE R Y + TETKL+
Sbjct: 1 MDAFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LR+NFD+TFP++PC++LSVD DISGEQH D++HDI K+RL+S GNVIE+R+
Sbjct: 61 VDTSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVIEARK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIG K+++PLQ+HGGRL+ E YCG+CYGAE SDE CCN+CEEVREAY+KKGWAL+NP
Sbjct: 121 EGIGGAKVERPLQKHGGRLDKGEQYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQC RE F+ R+K ++ EGCN+ GFL+V+KVAGNFHFAPGK F++S + V + L+
Sbjct: 181 DLIDQCAREDFIDRVKTQQDEGCNVLGFLDVSKVAGNFHFAPGKGFYESNIDVPE-LSLL 239
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
FNISHKINKL+FG FPGVVNPLDG +WTQ G YQYFIKVVPT+YTD+ G I
Sbjct: 240 EGGFNISHKINKLSFGTEFPGVVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDIRGRGIH 299
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
SNQFSVTEHFR R ++ PGVFFFYD SPIKV FTEE+ S LH+LTN+CAIVG
Sbjct: 300 SNQFSVTEHFRDGNV-RPKSQPGVFFFYDFSPIKVIFTEENRSLLHYLTNLCAIVG 354
>gi|414586930|tpg|DAA37501.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 268
Score = 473 bits (1216), Expect = e-131, Method: Compositional matrix adjust.
Identities = 214/268 (79%), Positives = 246/268 (91%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD +++K+RSLDAYPK+NEDFYSRT SGG+ITLVSS VMLLLF SELRLYL+AVTET L
Sbjct: 1 MDGLLSKLRSLDAYPKVNEDFYSRTLSGGIITLVSSAVMLLLFVSELRLYLHAVTETTLR 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPAL CSI+S+DAMDISG++HLDVKHD+FK+R+D+ GNVI +RQ
Sbjct: 61 VDTSRGETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D +G K++ PLQ HGGRLEHNETYCGSCYGA+ SD+ CCN CE+VREAYRKKGW +SNP
Sbjct: 121 DVVGGMKMEAPLQHHGGRLEHNETYCGSCYGAQESDDQCCNTCEDVREAYRKKGWGVSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DL+DQCKREGFLQ IK+EEGEGCNIYGF+EVNKVAGNFHFAPGKSF QS VHVHD+L FQ
Sbjct: 181 DLLDQCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQ 240
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDG 268
+DSFN+SHKIN+L+FGE+FPGVVNPLDG
Sbjct: 241 KDSFNVSHKINRLSFGEYFPGVVNPLDG 268
>gi|384252531|gb|EIE26007.1| DUF1692-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 386
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 229/391 (58%), Positives = 302/391 (77%), Gaps = 14/391 (3%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M+ I++K+++LDAYPK+NEDF+ RT SGG+IT+ SSI+ML LF SEL L++ T +L
Sbjct: 1 MEGIVSKLKNLDAYPKVNEDFFQRTLSGGIITIGSSIIMLCLFLSELSLFMKITTTNELS 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-ESR 119
VDT+RG+ L INFD+TFPALPC +S+D MDISGE HLDV HD++K+RLDS G VI +S
Sbjct: 61 VDTTRGDQLSINFDMTFPALPCEWISLDLMDISGEMHLDVDHDVYKRRLDSNGVVIPDSI 120
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
+ P++D L NET CGSCYGA + DE+CCNNCEEVR AYR+KGW ++
Sbjct: 121 EKHQVGPELDDTLLHKA-----NETECGSCYGA-APDEECCNNCEEVRAAYRRKGWGFTD 174
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
P I QC +EGF+++++ +EGEGC+++G L VNKVAGNFHFAPGKSF Q +HVHD++ F
Sbjct: 175 PQQISQCAKEGFVEKLRAQEGEGCHMWGSLAVNKVAGNFHFAPGKSFQQGPMHVHDLVPF 234
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGV---RWTQETPSGM---YQYFIKVVPTVYTD 293
Q +F++SH+I+KL+FG +PG+ NPLD V ++ P G+ YQYF+KVVPT+Y +
Sbjct: 235 QGVTFDLSHRIDKLSFGHEYPGMTNPLDRVNLPKFNTRNPQGLPGAYQYFLKVVPTIYVN 294
Query: 294 VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
HTI SNQ+SVTEHF+ S+ + Q LPGVFF+YDLSPIKV + E +SFLHFLT+VCA
Sbjct: 295 SHNHTINSNQYSVTEHFKGSQDFQAQ-LPGVFFYYDLSPIKVKYHETRMSFLHFLTSVCA 353
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
IVGG+FTV+GI+DAFIYHG +AIKKK+++GK
Sbjct: 354 IVGGIFTVAGIVDAFIYHGHQAIKKKVDLGK 384
>gi|148222292|ref|NP_001091124.1| ERGIC and golgi 3 [Xenopus laevis]
gi|120538715|gb|AAI29573.1| LOC100036873 protein [Xenopus laevis]
Length = 384
Score = 446 bits (1147), Expect = e-123, Method: Compositional matrix adjust.
Identities = 210/382 (54%), Positives = 277/382 (72%), Gaps = 5/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++++R DAYPK EDF +T G V+T++S ++ML+LFFSEL+ YL +L VD S
Sbjct: 4 LHRLRQFDAYPKTLEDFRVKTCGGAVVTVISGLIMLILFFSELQYYLTKEIYPELFVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD + S D
Sbjct: 64 RGDKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDKKPVTSEADKHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K+++ + L+ N C SCYGAE+ D CCN+C++VREAYR+KGWA PD I+
Sbjct: 124 LGKLEEHVVLDPKTLDPNR--CESCYGAETEDFSCCNSCDDVREAYRRKGWAFKTPDSIE 181
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QCKREGF Q+++E++ EGC IYGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 182 QCKREGFSQKMQEQKNEGCQIYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 241
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H+I L+FG +PG+VNPLDG S M+QYF+K+VPTVY V G +++NQF
Sbjct: 242 NMTHEIKHLSFGRDYPGLVNPLDGTSIVAMQSSMMFQYFVKIVPTVYVKVDGEVLRTNQF 301
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GGVFTV+
Sbjct: 302 SVTRHEKMT-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVA 360
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
+IDA IYH RAI+KKIE+GK
Sbjct: 361 SLIDALIYHSTRAIQKKIELGK 382
>gi|159470839|ref|XP_001693564.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283067|gb|EDP08818.1| predicted protein [Chlamydomonas reinhardtii]
Length = 388
Score = 446 bits (1147), Expect = e-123, Method: Compositional matrix adjust.
Identities = 218/387 (56%), Positives = 280/387 (72%), Gaps = 10/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K+++LDAYPKINEDF+++T SGG+IT+VSS+VM+LLF SELRL+L + +L VD
Sbjct: 7 LGKLKALDAYPKINEDFFTKTMSGGIITIVSSVVMVLLFLSELRLFLTTSSAHELSVDVG 66
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RGE ++I+FDVTFP +PC+ LS+DAMDISGE HLD+ +++ + E + GIG
Sbjct: 67 RGEKIKIHFDVTFPKVPCAWLSLDAMDISGELHLDLVVELYTLWRRGAAGLTEGKGGGIG 126
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ R+ L + CGSCYGAE DCCN C+EVR AYR+KGWALSN D I+
Sbjct: 127 VLSVSVSRSRNATALANG---CGSCYGAEDKQGDCCNTCDEVRAAYRRKGWALSNVDHIE 183
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC + + + IKE+ GEGC+I +EVNKVAGNFHFAPG+S+ Q +HVHDI F
Sbjct: 184 QCAHDLYTEAIKEQAGEGCHIG--VEVNKVAGNFHFAPGRSYQQGSMHVHDIAPFGDAVI 241
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETP-----SGMYQYFIKVVPTVYTDVSGHTI 299
+ H I+KL+FGE +PG+ NPLDG + Q +GM+QYF+KVVPT YTD+S T+
Sbjct: 242 DFRHVIHKLSFGEPYPGMKNPLDGAKAGQAAAAAAAATGMFQYFLKVVPTSYTDLSNKTL 301
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+NQFSVTE+FR ++ G +TLPGVFFFYDLSPIKV E SFL FLT+VCAIVGGVF
Sbjct: 302 STNQFSVTENFREAQGGAGRTLPGVFFFYDLSPIKVKIVEHGSSFLSFLTSVCAIVGGVF 361
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGKFS 386
TVSGI+DAF+Y G R IKKK+E+GKFS
Sbjct: 362 TVSGIVDAFVYTGTRMIKKKMELGKFS 388
>gi|302834369|ref|XP_002948747.1| hypothetical protein VOLCADRAFT_80399 [Volvox carteri f.
nagariensis]
gi|300265938|gb|EFJ50127.1| hypothetical protein VOLCADRAFT_80399 [Volvox carteri f.
nagariensis]
Length = 392
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 220/386 (56%), Positives = 285/386 (73%), Gaps = 6/386 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++K+++LDAYPKINEDF+++T SGG+IT+V+S+VM+LLF SELRLY+ + +L VD
Sbjct: 9 LSKLKALDAYPKINEDFFTKTMSGGIITIVASVVMVLLFLSELRLYMTTQSVHELSVDVG 68
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN-VIESRQDGI 123
RGE ++I+FD+TFP +PCS LS+DAMDISGE HLD+ HD++K+RL + G+ V E + +
Sbjct: 69 RGEKIQIHFDLTFPKVPCSWLSLDAMDISGELHLDLDHDVYKQRLSANGSPVKEVEKHNV 128
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
A K P+ +G CGSCYGAE DCCN C+EVR AYR+KGWAL+N D I
Sbjct: 129 EATKKVVPV--NGTENSTATPVCGSCYGAEDRQGDCCNTCDEVRAAYRRKGWALANVDHI 186
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
+QC + + + IKE+ GEGC+++G LEVNKVAGNFHFAPG+S+ Q +HVHDI F
Sbjct: 187 EQCAHDLYTESIKEQTGEGCHMWGMLEVNKVAGNFHFAPGRSYQQGSMHVHDIAPFGDAV 246
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVR--WTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
+ H +NKL+FG +PG+ NPLD + + +GMYQYF+KVVPT YT + T+ +
Sbjct: 247 IDFRHTVNKLSFGAPYPGMKNPLDNAKAGYKSAAATGMYQYFLKVVPTSYTGIDNKTLAT 306
Query: 302 NQFSVTEHFRSSEQGRL-QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
NQFSVTE+FR S QG +TLPGVFFFYDLSPIKV E SFL FLT+VCAIVGGVFT
Sbjct: 307 NQFSVTENFRESSQGGAGKTLPGVFFFYDLSPIKVRIVEHSSSFLSFLTSVCAIVGGVFT 366
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGKFS 386
VSGI+DAFIY R I+KK+E+GKFS
Sbjct: 367 VSGIVDAFIYTSTRLIRKKMELGKFS 392
>gi|440797665|gb|ELR18746.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
Length = 383
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 212/381 (55%), Positives = 271/381 (71%), Gaps = 8/381 (2%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
K++S DAYPK EDF RT SG ++++S +++ LFFSEL YL+ + +L VDTSR
Sbjct: 7 KKLKSFDAYPKTLEDFRVRTVSGAAVSIISGLIITWLFFSELSFYLSTDVQPELFVDTSR 66
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE LRIN DVTFP LPC LSVDAMD+SGE LDV+H+IFKKRL + G + + + A
Sbjct: 67 GEKLRINMDVTFPDLPCGYLSVDAMDVSGEHQLDVEHNIFKKRLAADGRPLGIEKGELEA 126
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
P G LE E CGSCYG+E CCN C EVRE+YRKKGWA ++P+ I+Q
Sbjct: 127 AATPSP----GQELEPIE--CGSCYGSEQEPGQCCNTCAEVRESYRKKGWAFAHPESIEQ 180
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
C REGF + +++++GEGC +YG + VNKVAGNFHFAPGKSF +HVHD+ F+ S+N
Sbjct: 181 CAREGFSENLEKQKGEGCQVYGHILVNKVAGNFHFAPGKSFQAHHMHVHDLQPFRMSSWN 240
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG--MYQYFIKVVPTVYTDVSGHTIQSNQ 303
ISH+IN+++FG+ FPGV+NPLDGV T + +G MYQYF+K+VPT+Y + G+ I +NQ
Sbjct: 241 ISHRINRISFGKEFPGVINPLDGVEKTTDPGAGSAMYQYFVKIVPTIYESLDGNVINTNQ 300
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
FSVTEH R G LPG+F YDLSPI V FTE SF HFLT VCAI+GGVFTV+G
Sbjct: 301 FSVTEHTRMLPPGDKSGLPGLFVMYDLSPIMVKFTERTKSFAHFLTGVCAIIGGVFTVAG 360
Query: 364 IIDAFIYHGQRAIKKKIEIGK 384
IID+ IY+ R + KK+E+GK
Sbjct: 361 IIDSLIYNSLRTLGKKMELGK 381
>gi|363741418|ref|XP_003642491.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Gallus gallus]
gi|363741445|ref|XP_003642499.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Gallus gallus]
Length = 383
Score = 442 bits (1138), Expect = e-122, Method: Compositional matrix adjust.
Identities = 211/384 (54%), Positives = 274/384 (71%), Gaps = 14/384 (3%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+++ DA+PK EDF +T G ++T+VS ++M+LLFFSEL+ YL +L VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTVVSGLIMVLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDG 122
+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD GN + E + G
Sbjct: 66 DKLKINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELG 125
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
K+ P R C SCYGAES D CCN C++VREAYR++GWA NPD
Sbjct: 126 KEEEKVFDPNSLDADR-------CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDT 178
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D
Sbjct: 179 IEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
+ N++H I L+FG +PG+VNPLDG T + S M+QYF+KVVPTVY V G +++N
Sbjct: 239 NINMTHYIKHLSFGRDYPGIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRTN 298
Query: 303 QFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
QFSVT H + + G + Q LPGVF Y+LSP+ V TE+H F HFLT VCAIVGG+FT
Sbjct: 299 QFSVTRHEKIA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIFT 357
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
V+G ID+ IYH RAI+KKIE+GK
Sbjct: 358 VAGFIDSLIYHSARAIQKKIELGK 381
>gi|41055991|ref|NP_957309.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform 2 [Danio rerio]
gi|82210123|sp|Q803I2.1|ERGI3_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|28278376|gb|AAH44474.1| ERGIC and golgi 3 [Danio rerio]
gi|182890166|gb|AAI64701.1| Ergic3 protein [Danio rerio]
Length = 383
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 211/397 (53%), Positives = 278/397 (70%), Gaps = 25/397 (6%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA +NK++ DAYPK EDF +T G +T++S ++ML+LFFSEL+ YL +L
Sbjct: 1 MDA-LNKLKQFDAYPKTLEDFRIKTCGGATVTIISGLIMLILFFSELQYYLTKEVHPELF 59
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESR- 119
VDTSRG+ LRIN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + +
Sbjct: 60 VDTSRGDKLRINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGQPVTTEA 119
Query: 120 --------QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYR 171
++G+ P P + C SCYGAE+ D CCN C++VREAYR
Sbjct: 120 EKHDLGKEEEGVFDPSTLDPDR------------CESCYGAETDDLKCCNTCDDVREAYR 167
Query: 172 KKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
++GWA PD I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS V
Sbjct: 168 RRGWAFKTPDTIEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHV 227
Query: 232 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 291
HVHD+ +F D+ N++H I L+FG+ +PG+VNPLD S MYQYF+K+VPT+Y
Sbjct: 228 HVHDLQSFGLDNINMTHFIKHLSFGKDYPGIVNPLDDTNVAAPQASMMYQYFVKIVPTIY 287
Query: 292 TDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V FTE+ SF HFLT
Sbjct: 288 VKGDGEVVKTNQFSVTRHEKIA-NGLIGDQGLPGVFVLYELSPMMVKFTEKQRSFTHFLT 346
Query: 350 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
VCAI+GGVFTV+G+ID+ IYH RAI+KKIE+GK S
Sbjct: 347 GVCAIIGGVFTVAGLIDSLIYHSARAIQKKIELGKAS 383
>gi|224077228|ref|XP_002191084.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Taeniopygia guttata]
Length = 383
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 211/384 (54%), Positives = 275/384 (71%), Gaps = 14/384 (3%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+++ DA+PK EDF +T G ++T VS ++M+LLFFSEL+ YL +L VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTAVSGLIMVLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDG 122
+ L+IN DV FP +PC+ LS+DAMD++G+Q LDV+H++FK+RLD GN + E + G
Sbjct: 66 DKLKINLDVIFPHMPCAYLSIDAMDVAGDQQLDVEHNLFKQRLDKAGNRVTPEAERHELG 125
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
K+ P R C SCYGAES D CCN C++VREAYR++GWA NPD
Sbjct: 126 KEEEKVFDPNSLDADR-------CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDS 178
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D
Sbjct: 179 IEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
+ N++H I L+FG +PG+VNPLDG T + S M+QYF+KVVPTVY V G +++N
Sbjct: 239 NINMTHYIKHLSFGRDYPGIVNPLDGTAVTAQQASMMFQYFVKVVPTVYRKVDGEVVRTN 298
Query: 303 QFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
QFSVT+H + + G L Q LPGVF Y+LSP+ V TE+H SF HF+T VCAIVGG+FT
Sbjct: 299 QFSVTQHEKIA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFVTGVCAIVGGIFT 357
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
V+G ID+ IYH RAI+KKIE+GK
Sbjct: 358 VAGFIDSLIYHSARAIQKKIELGK 381
>gi|348521804|ref|XP_003448416.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Oreochromis niloticus]
Length = 384
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 205/384 (53%), Positives = 275/384 (71%), Gaps = 5/384 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+NK++ DAYPK EDF +T+ G +T++S ++ML+LF SEL+ YL +L VDTS
Sbjct: 4 LNKLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYYLTKEVHPELYVDTS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN D+ FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD + + +
Sbjct: 64 RGDKLKINIDIIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKEFKPVTQEAEKHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K D L+ + C SCYGAE+ D CCN C++VREAYR++GWA + D I+
Sbjct: 124 LGKADDGEVFDPSTLDPDR--CESCYGAETEDLKCCNTCDDVREAYRRRGWAFKSADTIE 181
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 182 QCKREGFTQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 241
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FG+ +PG+VNPLDG T S MYQYF+K+VPT+Y G +++NQF
Sbjct: 242 NMTHLIKHLSFGKDYPGLVNPLDGTDVTAPQASMMYQYFVKIVPTIYMKTDGEVVKTNQF 301
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G + Q LPGVF Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+
Sbjct: 302 SVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVA 360
Query: 363 GIIDAFIYHGQRAIKKKIEIGKFS 386
G+ID+ IYH R I+KKIE+GK S
Sbjct: 361 GLIDSLIYHSARVIQKKIELGKTS 384
>gi|260815243|ref|XP_002602383.1| hypothetical protein BRAFLDRAFT_63528 [Branchiostoma floridae]
gi|229287692|gb|EEN58395.1| hypothetical protein BRAFLDRAFT_63528 [Branchiostoma floridae]
Length = 397
Score = 440 bits (1131), Expect = e-121, Method: Compositional matrix adjust.
Identities = 211/394 (53%), Positives = 278/394 (70%), Gaps = 19/394 (4%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+R DAYPK +DF +TF G +T++S M+LLF SEL+ YL +L VDTSRG
Sbjct: 9 KLRRFDAYPKTLDDFRVKTFGGAAVTIISGFFMILLFVSELQYYLTLEVTEELFVDTSRG 68
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGA 125
E +RIN D+ F +PC+ LS+DAMDI+GEQ +DV H++FK+R+D QGN++ E ++ +G
Sbjct: 69 EKMRINIDILFHKVPCAYLSIDAMDIAGEQQIDVDHNLFKRRMDLQGNILDEPEKEDLGD 128
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
P D+ +Q C SCYGAE+ D CCN CE+VREAYR+KGWA +NPD I+Q
Sbjct: 129 PS-DEFMQAIKKLENKTADVCESCYGAETEDLKCCNTCEDVREAYRRKGWAFNNPDTIEQ 187
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH--------VHDIL 237
CKREG+ +++K+++ EGC +YG+LEVNKVAGNFHFAPGKSF Q VH VHD+
Sbjct: 188 CKREGWSEKLKQQKNEGCQVYGYLEVNKVAGNFHFAPGKSFQQHHVHVSCFYHPIVHDLQ 247
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
F + FN+SH +N L+FG PG VNPLDG + S MYQYF+K+VPT+Y +SG
Sbjct: 248 PFGGEKFNLSHHVNHLSFGTDIPGRVNPLDGHMVAAKQGSMMYQYFVKIVPTIYKKISGQ 307
Query: 298 TIQSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+++NQFSVT+H + S EQG LPGVF Y+LSP+ V FTE+ SF+HFLT VC
Sbjct: 308 EVRTNQFSVTKHQKQVTASSGEQG----LPGVFVLYELSPMMVQFTEKQRSFMHFLTGVC 363
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
AIVGGVFTV+G+ID+ IYH RAI++KI++GK S
Sbjct: 364 AIVGGVFTVAGLIDSLIYHSARAIQQKIDLGKAS 397
>gi|47575764|ref|NP_001001226.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Xenopus (Silurana) tropicalis]
gi|82185697|sp|Q6NVS2.1|ERGI3_XENTR RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|45708932|gb|AAH67932.1| ERGIC and golgi 3 [Xenopus (Silurana) tropicalis]
Length = 384
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 205/382 (53%), Positives = 276/382 (72%), Gaps = 5/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++++R DAYPK EDF +T G ++T++S ++ML+LFFSEL+ YL +L VD S
Sbjct: 4 LHRLRQFDAYPKTLEDFRVKTCGGALVTVISGLIMLILFFSELQYYLTKEIYPELFVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD + S D
Sbjct: 64 RGDKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDKKPVTSEADRHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K ++ + L+ N C SCYGAE+ D CCN C++VREAYR++GWA PD I+
Sbjct: 124 LGKSEEHVVFDPKSLDPNR--CESCYGAETDDFSCCNTCDDVREAYRRRGWAFKTPDSIE 181
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 182 QCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 241
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H+I L+FG +PG+VNPLDG S M+QYF+K+VPTVY V G +++NQF
Sbjct: 242 NMTHEIRHLSFGRDYPGLVNPLDGSSVAAMQSSMMFQYFVKIVPTVYVKVDGEVLRTNQF 301
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GGVFTV+
Sbjct: 302 SVTRHEKMT-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVA 360
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ +Y+ RAI+KKIE+GK
Sbjct: 361 GLIDSLVYYSTRAIQKKIELGK 382
>gi|148225661|ref|NP_001087591.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Xenopus laevis]
gi|82181499|sp|Q66KH2.1|ERGI3_XENLA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|51513379|gb|AAH80394.1| MGC83277 protein [Xenopus laevis]
Length = 389
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/387 (54%), Positives = 279/387 (72%), Gaps = 10/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++++R DAYPK EDF +T G V+T++S ++ML+LFFSEL+ YL +L VD S
Sbjct: 4 LHRLRQFDAYPKTLEDFRVKTCGGAVVTVISGLIMLILFFSELQYYLTKEVYPELFVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD + S D
Sbjct: 64 RGDKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDLDKKPVTSEADRHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K ++ + L+ N C SCYGAE+ D CCN+C++VREAYR+KGWA PD I+
Sbjct: 124 LGKSEEQVVFDPKTLDPNR--CESCYGAETDDFSCCNSCDDVREAYRRKGWAFKTPDSIE 181
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 182 QCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 241
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H+I L+FG+ +PG+VNPLDG S M+QYF+K+VPTVY V G +
Sbjct: 242 GLDNINMTHEIKHLSFGKDYPGLVNPLDGTSIVAMQSSMMFQYFVKIVPTVYVKVDGEVL 301
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G + Q LPGVF Y+LSP+ V FTE+H SF HFLT VCAI+GG
Sbjct: 302 RTNQFSVTRHEKMT-NGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGG 360
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
VFTV+G+ID+ IY+ RAI+KKIE+GK
Sbjct: 361 VFTVAGLIDSLIYYSTRAIQKKIELGK 387
>gi|259155256|ref|NP_001158869.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Salmo salar]
gi|223647782|gb|ACN10649.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Salmo salar]
Length = 388
Score = 437 bits (1124), Expect = e-120, Method: Compositional matrix adjust.
Identities = 211/393 (53%), Positives = 279/393 (70%), Gaps = 19/393 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++K++ DAYPK EDF +T G +T++S ++ML+LFFSEL+ YL +L VDTS
Sbjct: 4 LHKLKQFDAYPKTLEDFRIKTCGGATVTIISGLIMLILFFSELQYYLTKEVHPELFVDTS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDG 122
RG+ L+IN +V FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD GN + E+ +
Sbjct: 64 RGDKLKININVIFPNMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGNPVTTEAEKHD 123
Query: 123 IGAPK--IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+G + I P + R C SCYGAE+ D CCN C++VREAYR++GWA NP
Sbjct: 124 LGQEEGEIFDPSKLDPER-------CESCYGAETEDLKCCNTCDDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QCKREGF Q+++E++ EGC IYGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 177 DTIEQCKREGFSQKMQEQKNEGCQIYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 236
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ +F D+ N++H I L+FG +PG+VNPLDG S MYQYF+K+VPT+Y
Sbjct: 237 LQSFGLDNINMTHLIKHLSFGRDYPGIVNPLDGTDVAAPQASMMYQYFVKIVPTIYVKWD 296
Query: 296 GHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V FTE+ SF HFLT VCA
Sbjct: 297 GEVVKTNQFSVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCA 355
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
IVGGVFTV+G+ID+ IYH +AI+KKIE+GK S
Sbjct: 356 IVGGVFTVAGLIDSLIYHSAKAIQKKIELGKAS 388
>gi|363741420|ref|XP_003642492.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Gallus gallus]
gi|363741447|ref|XP_003642500.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Gallus gallus]
Length = 388
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 211/389 (54%), Positives = 274/389 (70%), Gaps = 19/389 (4%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+++ DA+PK EDF +T G ++T+VS ++M+LLFFSEL+ YL +L VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTVVSGLIMVLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDG 122
+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD GN + E + G
Sbjct: 66 DKLKINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELG 125
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
K+ P R C SCYGAES D CCN C++VREAYR++GWA NPD
Sbjct: 126 KEEEKVFDPNSLDADR-------CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDT 178
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDIL 237
I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+
Sbjct: 179 IEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
+F D+ N++H I L+FG +PG+VNPLDG T + S M+QYF+KVVPTVY V G
Sbjct: 239 SFGLDNINMTHYIKHLSFGRDYPGIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGE 298
Query: 298 TIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H F HFLT VCAIV
Sbjct: 299 VVRTNQFSVTRHEKIA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIV 357
Query: 356 GGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
GG+FTV+G ID+ IYH RAI+KKIE+GK
Sbjct: 358 GGIFTVAGFIDSLIYHSARAIQKKIELGK 386
>gi|327271489|ref|XP_003220520.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Anolis carolinensis]
Length = 383
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 208/384 (54%), Positives = 272/384 (70%), Gaps = 14/384 (3%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+++ DA+PK EDF +T G ++T++S ++M LLFFSEL+ YL +L VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTVISGLIMFLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDG 122
+ LRIN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + E + G
Sbjct: 66 DKLRINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGKHVTPEAERHELG 125
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
I P R C SCYGAES D CCN C++VREAYR++GWA NPD
Sbjct: 126 KEEETIFDPNSLDPDR-------CESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDT 178
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D
Sbjct: 179 IEQCKREGFSQKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
+ N++H I L+FG +PG+VNPLDG + + S M+QYF+KVVPT+Y V G +++N
Sbjct: 239 NINMTHIIKHLSFGRDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPTIYMKVDGEVVRTN 298
Query: 303 QFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
QFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GGVFT
Sbjct: 299 QFSVTRHEKIA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFT 357
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
V+G+ID+ IYH R I+KKIE+GK
Sbjct: 358 VAGLIDSLIYHSARVIQKKIELGK 381
>gi|74315943|ref|NP_001028277.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform 1 [Danio rerio]
gi|72679324|gb|AAI00126.1| ERGIC and golgi 3 [Danio rerio]
Length = 388
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 211/402 (52%), Positives = 278/402 (69%), Gaps = 30/402 (7%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA +NK++ DAYPK EDF +T G +T++S ++ML+LFFSEL+ YL +L
Sbjct: 1 MDA-LNKLKQFDAYPKTLEDFRIKTCGGATVTIISGLIMLILFFSELQYYLTKEVHPELF 59
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESR- 119
VDTSRG+ LRIN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + +
Sbjct: 60 VDTSRGDKLRINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGQPVTTEA 119
Query: 120 --------QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYR 171
++G+ P P + C SCYGAE+ D CCN C++VREAYR
Sbjct: 120 EKHDLGKEEEGVFDPSTLDPDR------------CESCYGAETDDLKCCNTCDDVREAYR 167
Query: 172 KKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
++GWA PD I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS V
Sbjct: 168 RRGWAFKTPDTIEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHV 227
Query: 232 HV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKV 286
HV HD+ +F D+ N++H I L+FG+ +PG+VNPLD S MYQYF+K+
Sbjct: 228 HVHAVEIHDLQSFGLDNINMTHFIKHLSFGKDYPGIVNPLDDTNVAAPQASMMYQYFVKI 287
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSF 344
VPT+Y G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V FTE+ SF
Sbjct: 288 VPTIYVKGDGEVVKTNQFSVTRHEKIA-NGLIGDQGLPGVFVLYELSPMMVKFTEKQRSF 346
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
HFLT VCAI+GGVFTV+G+ID+ IYH RAI+KKIE+GK S
Sbjct: 347 THFLTGVCAIIGGVFTVAGLIDSLIYHSARAIQKKIELGKAS 388
>gi|348521802|ref|XP_003448415.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Oreochromis niloticus]
Length = 389
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 205/389 (52%), Positives = 275/389 (70%), Gaps = 10/389 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+NK++ DAYPK EDF +T+ G +T++S ++ML+LF SEL+ YL +L VDTS
Sbjct: 4 LNKLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYYLTKEVHPELYVDTS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN D+ FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD + + +
Sbjct: 64 RGDKLKINIDIIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKEFKPVTQEAEKHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K D L+ + C SCYGAE+ D CCN C++VREAYR++GWA + D I+
Sbjct: 124 LGKADDGEVFDPSTLDPDR--CESCYGAETEDLKCCNTCDDVREAYRRRGWAFKSADTIE 181
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 182 QCKREGFTQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 241
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H I L+FG+ +PG+VNPLDG T S MYQYF+K+VPT+Y G +
Sbjct: 242 GLDNINMTHLIKHLSFGKDYPGLVNPLDGTDVTAPQASMMYQYFVKIVPTIYMKTDGEVV 301
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G + Q LPGVF Y+LSP+ V FTE+H SF HFLT VCAI+GG
Sbjct: 302 KTNQFSVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGG 360
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
VFTV+G+ID+ IYH R I+KKIE+GK S
Sbjct: 361 VFTVAGLIDSLIYHSARVIQKKIELGKTS 389
>gi|297602842|ref|NP_001052965.2| Os04g0455900 [Oryza sativa Japonica Group]
gi|255675519|dbj|BAF14879.2| Os04g0455900 [Oryza sativa Japonica Group]
Length = 253
Score = 434 bits (1115), Expect = e-119, Method: Compositional matrix adjust.
Identities = 196/246 (79%), Positives = 226/246 (91%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M+ +++K+RSLDAYPK+NEDFYSRT SGG+ITL SS+VMLLLF SELRLYL+AVTET L
Sbjct: 1 MEGLLSKLRSLDAYPKVNEDFYSRTLSGGIITLASSVVMLLLFVSELRLYLHAVTETTLR 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGETLRINFDVTFPAL CSI+S+DAMDISG++HLDVKHDIFK+R+D GNVI ++Q
Sbjct: 61 VDTSRGETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDIFKQRIDVHGNVIATKQ 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D +G K+++PLQRHGGRLEHNETYCGSCYGAE SDE CCN+CE+VREAYRKKGW +SNP
Sbjct: 121 DAVGGMKVEQPLQRHGGRLEHNETYCGSCYGAEESDEQCCNSCEDVREAYRKKGWGVSNP 180
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DLIDQCKREGFLQ IK+EEGEGCNIYGFLEVNKVAGNFHFAPGKSF ++ VHVHD+L FQ
Sbjct: 181 DLIDQCKREGFLQSIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQKANVHVHDLLPFQ 240
Query: 241 RDSFNI 246
+DSFN+
Sbjct: 241 KDSFNV 246
>gi|196008679|ref|XP_002114205.1| hypothetical protein TRIADDRAFT_37998 [Trichoplax adhaerens]
gi|190583224|gb|EDV23295.1| hypothetical protein TRIADDRAFT_37998 [Trichoplax adhaerens]
Length = 369
Score = 433 bits (1114), Expect = e-119, Method: Compositional matrix adjust.
Identities = 206/390 (52%), Positives = 275/390 (70%), Gaps = 31/390 (7%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ +R DA+PK EDF RTF G IT+VS+++MLLLF SE+ YL+ ++L VD
Sbjct: 5 SVLTSLRRYDAFPKTLEDFRIRTFGGATITIVSAVIMLLLFVSEMNYYLSVEVTSELFVD 64
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
TSRGE ++I +VTFP + C+ILSVD MD++G Q LD+K ++ K+R+D G
Sbjct: 65 TSRGEKIKIYMNVTFPKMACAILSVDTMDVAGMQQLDIKQNLMKRRIDENG--------- 115
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
KP G ++ N+T CGSCYGAE+++ CCN+CE+VREAYRKKGWAL++P+
Sbjct: 116 -------KPT---GDAVQKNKTKCGSCYGAENAEMKCCNSCEDVREAYRKKGWALTSPEG 165
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKV-AGNFHFAPGKSFHQSGVHVHDILAFQR 241
I+QC+ EG+ Q +KE+E EGCN++G+LEVNKV AGNFHFAPGKSF Q VHVHD+ +F
Sbjct: 166 IEQCQEEGWAQMLKEQEKEGCNVFGYLEVNKVVAGNFHFAPGKSFQQHRVHVHDLQSFGS 225
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
FN SH I+KL+FGE FPG++NPLDG R + + S MYQYFIKVVPTVY + G ++S
Sbjct: 226 RKFNTSHTIHKLSFGEEFPGIINPLDGHRMSSDQDSAMYQYFIKVVPTVYKKLKGEEVKS 285
Query: 302 NQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
NQ+SVT+H + EQG LPGVF Y+LSP+ + + E SF HFLT VCAI+G
Sbjct: 286 NQYSVTKHLKYIKLSMGEQG----LPGVFISYELSPMIIRYAERRKSFAHFLTGVCAIIG 341
Query: 357 GVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
GVFTV+ +IDA +YH + + KIE+GK S
Sbjct: 342 GVFTVASLIDAMVYHSAKML--KIELGKAS 369
>gi|387015776|gb|AFJ50007.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3-like
[Crotalus adamanteus]
Length = 372
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 207/380 (54%), Positives = 274/380 (72%), Gaps = 17/380 (4%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+++ DA+PK EDF +T G +T++S ++M LFFSEL+ YL +L VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGAFVTVISGLIMFFLFFSELQYYLTKEIHPELYVDKSRG 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ LRIN D+ FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD +D +G
Sbjct: 66 DKLRINIDIAFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLD---------KDELGK- 115
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
++ L + L+ C SCYGAES D CCNNC++VREAYR++GWA NPD I+QC
Sbjct: 116 --EEELFFNPNSLDPER--CESCYGAESEDIKCCNNCDDVREAYRRRGWAFKNPDTIEQC 171
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
KREGF ++++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ ++ D+ NI
Sbjct: 172 KREGFSEKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSYGLDNINI 231
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
+H I L+FG+ +PG+VNPLDG T S M+QYF+KVVPTVY V G +++NQFSV
Sbjct: 232 THFIRHLSFGKDYPGLVNPLDGTIVTAHQASMMFQYFVKVVPTVYMKVDGEMVRTNQFSV 291
Query: 307 TEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
T H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GGVFTV+G+
Sbjct: 292 TRHEKIA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGL 350
Query: 365 IDAFIYHGQRAIKKKIEIGK 384
ID+ IYH RAI+KKIE+GK
Sbjct: 351 IDSLIYHSARAIQKKIELGK 370
>gi|405966014|gb|EKC31342.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Crassostrea gigas]
Length = 397
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 200/387 (51%), Positives = 273/387 (70%), Gaps = 9/387 (2%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
++R DAYPK EDF +TF G ++T++SS++M++LF SEL YL + +L VDT+RG
Sbjct: 9 RLRQFDAYPKTLEDFRVKTFGGALVTVISSLLMVILFISELNYYLTKDVQPELFVDTTRG 68
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI---ESRQDGI 123
+ LRIN D+ FP +PC+ LS+DAMD+SGEQ LDV H +FK+RL++ G I E ++G
Sbjct: 69 QKLRINIDIDFPKVPCAYLSIDAMDVSGEQQLDVDHHLFKQRLNADGEKIKDTEPEKEGT 128
Query: 124 GAPKIDKPLQRHGGRLEH-----NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
I + + +E + C SCYGAE+ D CCN CE+VREAYRKKGWA +
Sbjct: 129 MYEPIFELGDKSKDAVEAVTKKLDPDRCESCYGAETGDLKCCNTCEDVREAYRKKGWAFN 188
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+P+ I+QC REG+ ++K ++ EGC +YG+LEVNKV GNFHFAPGKSF Q VHVHD+ A
Sbjct: 189 SPEGIEQCNREGWTAKMKAQQKEGCQVYGYLEVNKVQGNFHFAPGKSFQQHHVHVHDLQA 248
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
F FN+SH I L+FG+ +PG++NPLD E M+QY++KVVPT Y DV G T
Sbjct: 249 FGGQKFNLSHAIRHLSFGQDYPGIINPLDQTSQISEDEQTMFQYYVKVVPTTYVDVKGKT 308
Query: 299 IQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+ +NQ+SV +H ++ G + LPGVFF Y+LSP+ V +TE+ SF+HFLT VCAI+GG
Sbjct: 309 LYTNQYSVNKHSKTVGNGMGDSGLPGVFFIYELSPMMVKYTEKQRSFMHFLTGVCAIIGG 368
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RA++KKIE+GK
Sbjct: 369 IFTVAGLIDSMIYHSSRALQKKIELGK 395
>gi|327271491|ref|XP_003220521.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Anolis carolinensis]
Length = 388
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 208/389 (53%), Positives = 272/389 (69%), Gaps = 19/389 (4%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+++ DA+PK EDF +T G ++T++S ++M LLFFSEL+ YL +L VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTVISGLIMFLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDG 122
+ LRIN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + E + G
Sbjct: 66 DKLRINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGKHVTPEAERHELG 125
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
I P R C SCYGAES D CCN C++VREAYR++GWA NPD
Sbjct: 126 KEEETIFDPNSLDPDR-------CESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDT 178
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDIL 237
I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+
Sbjct: 179 IEQCKREGFSQKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
+F D+ N++H I L+FG +PG+VNPLDG + + S M+QYF+KVVPT+Y V G
Sbjct: 239 SFGLDNINMTHIIKHLSFGRDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPTIYMKVDGE 298
Query: 298 TIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+
Sbjct: 299 VVRTNQFSVTRHEKIA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAII 357
Query: 356 GGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
GGVFTV+G+ID+ IYH R I+KKIE+GK
Sbjct: 358 GGVFTVAGLIDSLIYHSARVIQKKIELGK 386
>gi|410926566|ref|XP_003976749.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Takifugu rubripes]
Length = 384
Score = 429 bits (1104), Expect = e-118, Method: Compositional matrix adjust.
Identities = 203/385 (52%), Positives = 274/385 (71%), Gaps = 9/385 (2%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+K++ DAYPK EDF +T+ G +T++S ++ML+LF SEL+ YL +L VDTSR
Sbjct: 5 SKLKQFDAYPKTLEDFRVKTWGGATVTIISGVLMLILFVSELQYYLTKEVHPELYVDTSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDS--QGNVIESRQDGI 123
G+ L+IN ++ FP +PC LS+DAMD++GEQ LDV+H++FK+RLD Q E+ + +
Sbjct: 65 GDKLKININIVFPHMPCVYLSIDAMDVAGEQQLDVEHNLFKQRLDKNLQPVSTEAEKHEL 124
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G D P+ + C SCYGAE+ D CCN+C++VREAYR++GWA N D I
Sbjct: 125 GGED-DVPVFDPSTL---DPERCESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADTI 180
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
+QCKREGF Q+++E++ EGC +YG LEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 EQCKREGFTQKMQEQKNEGCQVYGVLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 240
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
N++H I L+FG+ +PG++NPLD T S MYQYF+K+VPT+Y G +++NQ
Sbjct: 241 INMTHLIRHLSFGQDYPGLINPLDDTNITAPQASMMYQYFVKIVPTIYVKTDGEVLKTNQ 300
Query: 304 FSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
FSVT H + + G + Q LPGVF Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV
Sbjct: 301 FSVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTV 359
Query: 362 SGIIDAFIYHGQRAIKKKIEIGKFS 386
+G+ID+ IYH R I+KKIE+GK S
Sbjct: 360 AGLIDSLIYHSARVIQKKIELGKAS 384
>gi|126291179|ref|XP_001371602.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Monodelphis domestica]
Length = 383
Score = 427 bits (1098), Expect = e-117, Method: Compositional matrix adjust.
Identities = 208/382 (54%), Positives = 277/382 (72%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T++S ++MLLLF SEL+ YL A +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTAEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN D+ FP +PC+ LS+DAMD++GEQ LDV+H+++K+RLD G + + +
Sbjct: 64 RGDKLKINIDILFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGRPVTTEAE--- 120
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
++ K ++ + C SCYGAES D CCN CE+VREAYR++GWA NPD I+
Sbjct: 121 RHELGKEEEKAFDPSSLDPERCESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I +L+FGE +PG+VNPLD T S M+QYF+KVVPTVY VSG ++SNQF
Sbjct: 241 NMTHYIRRLSFGEDYPGIVNPLDDTNITAPQASMMFQYFVKVVPTVYMKVSGEVLRSNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKIE+GK
Sbjct: 360 GLIDSLIYHSARAIQKKIELGK 381
>gi|327271493|ref|XP_003220522.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 3 [Anolis carolinensis]
Length = 394
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 208/395 (52%), Positives = 272/395 (68%), Gaps = 25/395 (6%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+++ DA+PK EDF +T G ++T++S ++M LLFFSEL+ YL +L VD SRG
Sbjct: 6 RLKRFDAFPKTLEDFRVKTCGGALVTVISGLIMFLLFFSELQYYLTKEVHPELYVDKSRG 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDG 122
+ LRIN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + E + G
Sbjct: 66 DKLRINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGKHVTPEAERHELG 125
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
I P R C SCYGAES D CCN C++VREAYR++GWA NPD
Sbjct: 126 KEEETIFDPNSLDPDR-------CESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDT 178
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDIL 237
I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+
Sbjct: 179 IEQCKREGFSQKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238
Query: 238 AFQRDS------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 291
+F D+ N++H I L+FG +PG+VNPLDG + + S M+QYF+KVVPT+Y
Sbjct: 239 SFGLDNVSILGKINMTHIIKHLSFGRDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPTIY 298
Query: 292 TDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
V G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT
Sbjct: 299 MKVDGEVVRTNQFSVTRHEKIA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLT 357
Query: 350 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
VCAI+GGVFTV+G+ID+ IYH R I+KKIE+GK
Sbjct: 358 GVCAIIGGVFTVAGLIDSLIYHSARVIQKKIELGK 392
>gi|410926568|ref|XP_003976750.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Takifugu rubripes]
Length = 389
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 203/390 (52%), Positives = 274/390 (70%), Gaps = 14/390 (3%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+K++ DAYPK EDF +T+ G +T++S ++ML+LF SEL+ YL +L VDTSR
Sbjct: 5 SKLKQFDAYPKTLEDFRVKTWGGATVTIISGVLMLILFVSELQYYLTKEVHPELYVDTSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDS--QGNVIESRQDGI 123
G+ L+IN ++ FP +PC LS+DAMD++GEQ LDV+H++FK+RLD Q E+ + +
Sbjct: 65 GDKLKININIVFPHMPCVYLSIDAMDVAGEQQLDVEHNLFKQRLDKNLQPVSTEAEKHEL 124
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G D P+ + C SCYGAE+ D CCN+C++VREAYR++GWA N D I
Sbjct: 125 GGED-DVPVFDPSTL---DPERCESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADTI 180
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILA 238
+QCKREGF Q+++E++ EGC +YG LEVNKVAGNFHFAPGKSF QS VHV HD+ +
Sbjct: 181 EQCKREGFTQKMQEQKNEGCQVYGVLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 240
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
F D+ N++H I L+FG+ +PG++NPLD T S MYQYF+K+VPT+Y G
Sbjct: 241 FGLDNINMTHLIRHLSFGQDYPGLINPLDDTNITAPQASMMYQYFVKIVPTIYVKTDGEV 300
Query: 299 IQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+++NQFSVT H + + G + Q LPGVF Y+LSP+ V FTE+H SF HFLT VCAI+G
Sbjct: 301 LKTNQFSVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIG 359
Query: 357 GVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
GVFTV+G+ID+ IYH R I+KKIE+GK S
Sbjct: 360 GVFTVAGLIDSLIYHSARVIQKKIELGKAS 389
>gi|354477966|ref|XP_003501188.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Cricetulus griseus]
gi|344246673|gb|EGW02777.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Cricetulus griseus]
Length = 383
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 212/382 (55%), Positives = 276/382 (72%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + L+ N C SCYGAES D CCN+CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVAV-FDPNSLDPNR--CESCYGAESDDIKCCNSCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIKHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|13384938|ref|NP_079792.1| endoplasmic reticulum-Golgi intermediate compartment protein 3 [Mus
musculus]
gi|37999778|sp|Q9CQE7.1|ERGI3_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3; AltName: Full=Serologically defined breast
cancer antigen NY-BR-84 homolog
gi|12844094|dbj|BAB26233.1| unnamed protein product [Mus musculus]
gi|12851518|dbj|BAB29073.1| unnamed protein product [Mus musculus]
gi|26341008|dbj|BAC34166.1| unnamed protein product [Mus musculus]
gi|27882157|gb|AAH43720.1| ERGIC and golgi 3 [Mus musculus]
gi|148674217|gb|EDL06164.1| ERGIC and golgi 3, isoform CRA_d [Mus musculus]
Length = 383
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 212/382 (55%), Positives = 276/382 (72%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + L+ N C SCYGAES D CCN+CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTV-FDPNSLDPNR--CESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIKHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|157820783|ref|NP_001100003.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Rattus norvegicus]
gi|149030853|gb|EDL85880.1| ERGIC and golgi 3 (predicted) [Rattus norvegicus]
Length = 383
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 212/382 (55%), Positives = 276/382 (72%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + L+ N C SCYGAES D CCN+CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTV-FDPDSLDPNR--CESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIKHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|348564091|ref|XP_003467839.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cavia porcellus]
Length = 383
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 211/382 (55%), Positives = 273/382 (71%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAES D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAESEDLKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKIE+GK
Sbjct: 360 GLIDSLIYHSARAIQKKIELGK 381
>gi|413945824|gb|AFW78473.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein,
partial [Zea mays]
Length = 284
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 195/285 (68%), Positives = 235/285 (82%), Gaps = 2/285 (0%)
Query: 101 KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCC 160
+HDI K RLD+ GNVIE+R+ IG KI++PLQ+HGGRL+ E YCG+CYGAE SDE CC
Sbjct: 1 RHDIEKIRLDAHGNVIEARKVSIGGAKIERPLQKHGGRLDKGEQYCGTCYGAEESDEQCC 60
Query: 161 NNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF 220
N+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++ EGCN++GFL+V+KVAGNFHF
Sbjct: 61 NSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNFHF 120
Query: 221 APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMY 280
APGK F++S + V + L+ FNI+HKINKL+FG FPGVVNPLDG +WTQ G Y
Sbjct: 121 APGKGFYESNIDVPE-LSLLEGGFNITHKINKLSFGTEFPGVVNPLDGAQWTQPASDGTY 179
Query: 281 QYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEE 340
QYFIKVVPT+YTD+ GH I SNQFSVTEHFR R + PGVFFFYD SPIKV FTEE
Sbjct: 180 QYFIKVVPTIYTDIRGHNIHSNQFSVTEHFRDGNV-RPKPQPGVFFFYDFSPIKVIFTEE 238
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 239 SRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 283
>gi|284004911|ref|NP_001164802.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Oryctolagus cuniculus]
gi|217038333|gb|ACJ76626.1| serologically defined breast cancer antigen 84 isoform b
(predicted) [Oryctolagus cuniculus]
Length = 383
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 210/382 (54%), Positives = 273/382 (71%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAES D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFNPDSL---DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|126291176|ref|XP_001371575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Monodelphis domestica]
Length = 388
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 208/387 (53%), Positives = 277/387 (71%), Gaps = 11/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T++S ++MLLLF SEL+ YL A +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTAEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN D+ FP +PC+ LS+DAMD++GEQ LDV+H+++K+RLD G + + +
Sbjct: 64 RGDKLKINIDILFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGRPVTTEAE--- 120
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
++ K ++ + C SCYGAES D CCN CE+VREAYR++GWA NPD I+
Sbjct: 121 RHELGKEEEKAFDPSSLDPERCESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H I +L+FGE +PG+VNPLD T S M+QYF+KVVPTVY VSG +
Sbjct: 241 GLDNINMTHYIRRLSFGEDYPGIVNPLDDTNITAPQASMMFQYFVKVVPTVYMKVSGEVL 300
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+SNQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG
Sbjct: 301 RSNQFSVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGG 359
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RAI+KKIE+GK
Sbjct: 360 MFTVAGLIDSLIYHSARAIQKKIELGK 386
>gi|359322740|ref|XP_864582.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 3 [Canis lupus familiaris]
Length = 383
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 209/382 (54%), Positives = 274/382 (71%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + N C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVKVFDPDSL---NPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT+VCAIVGG+FTV+
Sbjct: 301 SVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTSVCAIVGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|296199725|ref|XP_002747286.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Callithrix jacchus]
gi|403281165|ref|XP_003932068.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Saimiri boliviensis boliviensis]
Length = 383
Score = 420 bits (1079), Expect = e-115, Method: Compositional matrix adjust.
Identities = 209/382 (54%), Positives = 273/382 (71%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAES D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|395830112|ref|XP_003788179.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Otolemur garnettii]
Length = 383
Score = 419 bits (1078), Expect = e-115, Method: Compositional matrix adjust.
Identities = 208/382 (54%), Positives = 273/382 (71%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T++S ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAES D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFNPDSL---DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|109092202|ref|XP_001098982.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 3 [Macaca mulatta]
Length = 383
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 208/382 (54%), Positives = 273/382 (71%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLKTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|410262554|gb|JAA19243.1| ERGIC and golgi 3 [Pan troglodytes]
Length = 383
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 208/382 (54%), Positives = 273/382 (71%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|194044515|ref|XP_001929457.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Sus scrofa]
gi|350594868|ref|XP_003483992.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Sus scrofa]
Length = 383
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/386 (54%), Positives = 274/386 (70%), Gaps = 14/386 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN+CE+VREAYR++GWA NP
Sbjct: 124 LGKVEIKVFDPDSLDPDR-------CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 236
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G ++
Sbjct: 237 LDNINMTHYIQHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+
Sbjct: 297 TNQFSVTRHEKVAS-GLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|7706278|ref|NP_057050.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Homo sapiens]
gi|332858219|ref|XP_003316930.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Pan troglodytes]
gi|397523795|ref|XP_003831904.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Pan paniscus]
gi|37999823|sp|Q9Y282.1|ERGI3_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3; AltName: Full=Serologically defined breast
cancer antigen NY-BR-84
gi|4689108|gb|AAD27763.1|AF077030_1 hypothetical 43.2 kDa protein [Homo sapiens]
gi|4929577|gb|AAD34049.1|AF151812_1 CGI-54 protein [Homo sapiens]
gi|7671663|emb|CAB89412.1| ERGIC and golgi 3 [Homo sapiens]
gi|14602515|gb|AAH09765.1| ERGIC and golgi 3 [Homo sapiens]
gi|15559308|gb|AAH14014.1| ERGIC and golgi 3 [Homo sapiens]
gi|119596605|gb|EAW76199.1| ERGIC and golgi 3, isoform CRA_a [Homo sapiens]
gi|124249802|gb|ABM92879.1| endoplasmic reticulum-localized protein ERp43 [Homo sapiens]
gi|312152490|gb|ADQ32757.1| ERGIC and golgi 3 [synthetic construct]
gi|380785591|gb|AFE64671.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Macaca mulatta]
gi|383419067|gb|AFH32747.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Macaca mulatta]
gi|384947602|gb|AFI37406.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Macaca mulatta]
gi|410342895|gb|JAA40394.1| ERGIC and golgi 3 [Pan troglodytes]
Length = 383
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 208/382 (54%), Positives = 273/382 (71%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|344279905|ref|XP_003411726.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Loxodonta africana]
Length = 386
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/386 (54%), Positives = 273/386 (70%), Gaps = 14/386 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 7 LGKLKQFDAYPKTLEDFRIKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 66
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 67 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 126
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 127 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 179
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 180 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 239
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G ++
Sbjct: 240 LDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 299
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+
Sbjct: 300 TNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 358
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 359 FTVAGLIDSLIYHSARAIQKKIDLGK 384
>gi|410953936|ref|XP_003983624.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Felis catus]
Length = 383
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/386 (54%), Positives = 273/386 (70%), Gaps = 14/386 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 236
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G ++
Sbjct: 237 LDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+
Sbjct: 297 TNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|301762088|ref|XP_002916455.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Ailuropoda melanoleuca]
Length = 383
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/386 (54%), Positives = 273/386 (70%), Gaps = 14/386 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 236
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G ++
Sbjct: 237 LDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+
Sbjct: 297 TNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|426241390|ref|XP_004014574.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Ovis aries]
Length = 383
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/386 (54%), Positives = 274/386 (70%), Gaps = 14/386 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN +V FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN+CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 236
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G ++
Sbjct: 237 LDNINMTHYIRHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+
Sbjct: 297 TNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|417399979|gb|JAA46966.1| Putative copii vesicle protein [Desmodus rotundus]
Length = 383
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 209/386 (54%), Positives = 273/386 (70%), Gaps = 14/386 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T++S ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPRMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKAEMKVFDPNSLDPER-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 236
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY + G ++
Sbjct: 237 LDNINMTHYIRHLSFGEDYPGIVNPLDHTNVTALQASMMFQYFVKVVPTVYMKLDGEVLR 296
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+
Sbjct: 297 TNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|431894341|gb|ELK04141.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pteropus alecto]
Length = 383
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/386 (54%), Positives = 273/386 (70%), Gaps = 14/386 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 177 DTIEQCRREGFTQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 236
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY + G ++
Sbjct: 237 LDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKLDGEVLR 296
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+
Sbjct: 297 TNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMVVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|197100234|ref|NP_001126130.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pongo abelii]
gi|75041559|sp|Q5R8G3.1|ERGI3_PONAB RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|55730450|emb|CAH91947.1| hypothetical protein [Pongo abelii]
Length = 383
Score = 417 bits (1072), Expect = e-114, Method: Compositional matrix adjust.
Identities = 207/382 (54%), Positives = 272/382 (71%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VRE YR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVRETYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|164448602|ref|NP_001029525.2| endoplasmic reticulum-Golgi intermediate compartment protein 3 [Bos
taurus]
gi|75057944|sp|Q5EAE0.1|ERGI3_BOVIN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|59857621|gb|AAX08645.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
gi|59857623|gb|AAX08646.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
gi|59857741|gb|AAX08705.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
gi|110665562|gb|ABG81427.1| serologically defined breast cancer antigen 84 [Bos taurus]
Length = 383
Score = 417 bits (1072), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/386 (54%), Positives = 273/386 (70%), Gaps = 14/386 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN +V FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE D CCN+CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 236
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G ++
Sbjct: 237 LDNINMTHYIRHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+
Sbjct: 297 TNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|354477968|ref|XP_003501189.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Cricetulus griseus]
Length = 388
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 212/387 (54%), Positives = 276/387 (71%), Gaps = 11/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + L+ N C SCYGAES D CCN+CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVAV-FDPNSLDPNR--CESCYGAESDDIKCCNSCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +
Sbjct: 241 GLDNINMTHYIKHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 300
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG
Sbjct: 301 RTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGG 359
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 MFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|410218732|gb|JAA06585.1| ERGIC and golgi 3 [Pan troglodytes]
Length = 383
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 207/382 (54%), Positives = 272/382 (71%), Gaps = 6/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++F +RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFNQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 240
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 241 NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 300
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 301 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 359
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 360 GLIDSLIYHSARAIQKKIDLGK 381
>gi|95767625|gb|ABF57320.1| serologically defined breast cancer antigen 84 [Bos taurus]
Length = 380
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 210/386 (54%), Positives = 273/386 (70%), Gaps = 14/386 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 1 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN +V FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 61 RGDKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHE 120
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE D CCN+CE+VREAYR++GWA NP
Sbjct: 121 LGKVEVKVFDPDSLDPDR-------CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNP 173
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 174 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 233
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G ++
Sbjct: 234 LDNINMTHYIRHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 293
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+
Sbjct: 294 TNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 352
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 353 FTVAGLIDSLIYHSARAIQKKIDLGK 378
>gi|359322742|ref|XP_851879.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Canis lupus familiaris]
Length = 388
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 209/387 (54%), Positives = 274/387 (70%), Gaps = 11/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + N C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVKVFDPDSL---NPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +
Sbjct: 241 GLDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 300
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT+VCAIVGG
Sbjct: 301 RTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTSVCAIVGG 359
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 MFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|296199723|ref|XP_002747285.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Callithrix jacchus]
gi|403281167|ref|XP_003932069.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Saimiri boliviensis boliviensis]
gi|166831592|gb|ABY90117.1| serologically defined breast cancer antigen 84 isoform a
(predicted) [Callithrix jacchus]
Length = 388
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 209/387 (54%), Positives = 273/387 (70%), Gaps = 11/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAES D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +
Sbjct: 241 GLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 300
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG
Sbjct: 301 RTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGG 359
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 MFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|395830114|ref|XP_003788180.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Otolemur garnettii]
gi|197215642|gb|ACH53034.1| ERGIC and golgi 3 (predicted) [Otolemur garnettii]
Length = 388
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 208/387 (53%), Positives = 273/387 (70%), Gaps = 11/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T++S ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAES D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFNPDSL---DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +
Sbjct: 241 GLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 300
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG
Sbjct: 301 RTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGG 359
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 MFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|334310895|ref|XP_003339551.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Monodelphis domestica]
Length = 396
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 208/395 (52%), Positives = 277/395 (70%), Gaps = 19/395 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T++S ++MLLLF SEL+ YL A +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTAEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN D+ FP +PC+ LS+DAMD++GEQ LDV+H+++K+RLD G + + +
Sbjct: 64 RGDKLKINIDILFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGRPVTTEAE--- 120
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
++ K ++ + C SCYGAES D CCN CE+VREAYR++GWA NPD I+
Sbjct: 121 RHELGKEEEKAFDPSSLDPERCESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDS--------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 291
D+ N++H I +L+FGE +PG+VNPLD T S M+QYF+KVVPTVY
Sbjct: 241 GLDNVVLCWYLQINMTHYIRRLSFGEDYPGIVNPLDDTNITAPQASMMFQYFVKVVPTVY 300
Query: 292 TDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
VSG ++SNQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT
Sbjct: 301 MKVSGEVLRSNQFSVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLT 359
Query: 350 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
VCAI+GG+FTV+G+ID+ IYH RAI+KKIE+GK
Sbjct: 360 GVCAIIGGMFTVAGLIDSLIYHSARAIQKKIELGK 394
>gi|190402265|gb|ACE77675.1| ERGIC and golgi 3 (predicted) [Sorex araneus]
Length = 388
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 210/387 (54%), Positives = 275/387 (71%), Gaps = 11/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGVPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
KI+ + L+ N C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKIEVKV-FDPDSLDPNR--CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCQREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +
Sbjct: 241 GLDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 300
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG
Sbjct: 301 RTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGG 359
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 MFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|95767501|gb|ABF57305.1| serologically defined breast cancer antigen 84 [Bos taurus]
Length = 376
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 209/382 (54%), Positives = 270/382 (70%), Gaps = 14/382 (3%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
+ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD SRG+
Sbjct: 1 KQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGDK 60
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIG 124
L+IN +V FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S + G
Sbjct: 61 LKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKV 120
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K+ P R C SCYGAE D CCN+CE+VREAYR++GWA NPD I+
Sbjct: 121 EVKVFDPDSLDPDR-------CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIE 173
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 174 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 233
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 234 NMTHYIRHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 293
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 294 SVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 352
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 353 GLIDSLIYHSARAIQKKIDLGK 374
>gi|344279907|ref|XP_003411727.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Loxodonta africana]
Length = 391
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 210/391 (53%), Positives = 273/391 (69%), Gaps = 19/391 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 7 LGKLKQFDAYPKTLEDFRIKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 66
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 67 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 126
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 127 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 179
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 180 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 239
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V
Sbjct: 240 LQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVD 299
Query: 296 GHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCA
Sbjct: 300 GEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 358
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
I+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 359 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 389
>gi|194044517|ref|XP_001929458.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Sus scrofa]
gi|350594870|ref|XP_003483993.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Sus scrofa]
Length = 388
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/391 (53%), Positives = 274/391 (70%), Gaps = 19/391 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN+CE+VREAYR++GWA NP
Sbjct: 124 LGKVEIKVFDPDSLDPDR-------CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 236
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V
Sbjct: 237 LQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 296 GHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCA
Sbjct: 297 GEVLRTNQFSVTRHEKVA-SGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
I+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|109092200|ref|XP_001098885.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Macaca mulatta]
Length = 388
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 208/387 (53%), Positives = 273/387 (70%), Gaps = 11/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +
Sbjct: 241 GLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 300
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG
Sbjct: 301 KTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGG 359
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 MFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|229368723|gb|ACQ63006.1| serologically defined breast cancer antigen 84 isoform a
(predicted) [Dasypus novemcinctus]
Length = 388
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/391 (53%), Positives = 273/391 (69%), Gaps = 19/391 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 236
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V
Sbjct: 237 LQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 296 GHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCA
Sbjct: 297 GEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
I+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|38327615|ref|NP_938408.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform a [Homo sapiens]
gi|281182526|ref|NP_001162565.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Papio anubis]
gi|397523797|ref|XP_003831905.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Pan paniscus]
gi|410055053|ref|XP_003953764.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Pan troglodytes]
gi|57208593|emb|CAI42842.1| ERGIC and golgi 3 [Homo sapiens]
gi|164623746|gb|ABY64672.1| ERGIC and golgi 3, isoform 1 (predicted) [Papio anubis]
gi|380785589|gb|AFE64670.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform a [Macaca mulatta]
Length = 388
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 208/387 (53%), Positives = 273/387 (70%), Gaps = 11/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +
Sbjct: 241 GLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 300
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG
Sbjct: 301 RTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGG 359
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 MFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|75077200|sp|Q4R8X1.1|ERGI3_MACFA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|67967936|dbj|BAE00450.1| unnamed protein product [Macaca fascicularis]
Length = 382
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 208/382 (54%), Positives = 273/382 (71%), Gaps = 7/382 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGTPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + G + C SCYGAE+ D CCN CE+VREAYR++G A NPD I+
Sbjct: 124 LGKVEVTV---FGPDSLDPDRCESCYGAEAEDIKCCNTCEDVREAYRRRG-AFKNPDTIE 179
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 180 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 239
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQF
Sbjct: 240 NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQF 299
Query: 305 SVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
SVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+
Sbjct: 300 SVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVA 358
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
G+ID+ IYH RAI+KKI++GK
Sbjct: 359 GLIDSLIYHSARAIQKKIDLGK 380
>gi|22760064|dbj|BAC11054.1| unnamed protein product [Homo sapiens]
Length = 388
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 208/387 (53%), Positives = 272/387 (70%), Gaps = 11/387 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +
Sbjct: 241 GLDDINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 300
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG
Sbjct: 301 RTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGG 359
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 MFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|426241392|ref|XP_004014575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Ovis aries]
Length = 388
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/391 (53%), Positives = 274/391 (70%), Gaps = 19/391 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN +V FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN+CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 236
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V
Sbjct: 237 LQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 296 GHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCA
Sbjct: 297 GEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
I+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|281346059|gb|EFB21643.1| hypothetical protein PANDA_004535 [Ailuropoda melanoleuca]
Length = 387
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/391 (53%), Positives = 273/391 (69%), Gaps = 19/391 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 236
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V
Sbjct: 237 LQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 296 GHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCA
Sbjct: 297 GEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
I+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|301762086|ref|XP_002916454.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Ailuropoda melanoleuca]
Length = 388
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/391 (53%), Positives = 273/391 (69%), Gaps = 19/391 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 236
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V
Sbjct: 237 LQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 296 GHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCA
Sbjct: 297 GEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
I+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|410953938|ref|XP_003983625.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Felis catus]
Length = 388
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/391 (53%), Positives = 273/391 (69%), Gaps = 19/391 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 236
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V
Sbjct: 237 LQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVD 296
Query: 296 GHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCA
Sbjct: 297 GEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
I+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|184185558|gb|ACC68956.1| serologically defined breast cancer antigen 84 isoform a
(predicted) [Rhinolophus ferrumequinum]
Length = 388
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/391 (53%), Positives = 273/391 (69%), Gaps = 19/391 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 236
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY +
Sbjct: 237 LQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKLD 296
Query: 296 GHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCA
Sbjct: 297 GEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
I+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 386
>gi|432101449|gb|ELK29631.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Myotis davidii]
Length = 391
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 208/390 (53%), Positives = 273/390 (70%), Gaps = 14/390 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + H C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEMKVFDPDSLDPHR---CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS- 243
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNV 240
Query: 244 -------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY + G
Sbjct: 241 CTRCCLQINMTHYIRHLSFGEDYPGIVNPLDRTNVTALQASMMFQYFVKVVPTVYMKLDG 300
Query: 297 HTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI
Sbjct: 301 QVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 359
Query: 355 VGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 IGGMFTVAGLIDSLIYHSARAIQKKIDLGK 389
>gi|330790779|ref|XP_003283473.1| hypothetical protein DICPUDRAFT_52316 [Dictyostelium purpureum]
gi|325086583|gb|EGC39970.1| hypothetical protein DICPUDRAFT_52316 [Dictyostelium purpureum]
Length = 383
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 204/385 (52%), Positives = 271/385 (70%), Gaps = 10/385 (2%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
++++++ DAYPK +DF +TF+G ++++V I +L LFFS++ LY + +L VDT
Sbjct: 3 MVSQLKKFDAYPKTVDDFRVKTFTGAIVSIVGGIFILWLFFSQVTLYFSTDIHHELFVDT 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
+RGE L+IN D+TF LPC+ LS+DAMD+SGE DV H+IFKKRL S G I Q I
Sbjct: 63 TRGEKLKINMDITFHHLPCAYLSLDAMDVSGEHQFDVAHNIFKKRLSSTGQPI-IEQPPI 121
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE--DCCNNCEEVREAYRKKGWALSNPD 181
+I+K + ++ E++ CGSCYGAE CCN CEEVR AY KKGW L +P
Sbjct: 122 REEEINKKIVKN----ENDVQGCGSCYGAEDPARGIPCCNTCEEVRNAYSKKGWGL-DPS 176
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
+ QC REGF + I E+ GEGC +YGF+ VNKVAGNFHFAPGKSF Q +HVHD+ F+
Sbjct: 177 TVSQCLREGFTKNIVEQNGEGCQVYGFILVNKVAGNFHFAPGKSFQQHHMHVHDLQPFKD 236
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
FN+SH INKLA G FPG+ NPLD V T+ GM+QYFIK+VPT+Y ++G+ I +
Sbjct: 237 GQFNMSHTINKLAVGNEFPGIKNPLDEVTKTEVAGVGMFQYFIKIVPTIYEGLNGNRIAT 296
Query: 302 NQFSVTEHFR-SSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
NQ+SVTEH+R +++G T LPG+FF YDLSPI + +E+ SF FLTNVCAI+GGVF
Sbjct: 297 NQYSVTEHYRLLAKKGEEPTGLPGLFFMYDLSPIMMKVSEKGKSFASFLTNVCAIIGGVF 356
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGK 384
TV GI D+FIY+ + +KKKI++GK
Sbjct: 357 TVFGIFDSFIYYSTKNLKKKIDLGK 381
>gi|291231388|ref|XP_002735646.1| PREDICTED: serologically defined breast cancer antigen 84-like,
partial [Saccoglossus kowalevskii]
Length = 358
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 192/359 (53%), Positives = 253/359 (70%), Gaps = 9/359 (2%)
Query: 32 TLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMD 91
T++S I+M +LF SEL YL +L VDT+RGE +RIN D+TFP LPC LS+DAMD
Sbjct: 1 TIISGILMFILFISELNYYLTKEVTPELYVDTTRGEKMRINLDITFPTLPCGYLSIDAMD 60
Query: 92 ISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYG 151
++GEQ LDV H+I K R+D G + + + K ++ +L+ + C SCYG
Sbjct: 61 VAGEQQLDVDHNIMKSRIDKNGKPVATPEKEDIGDKSEEAKDFDVNKLDPDR--CESCYG 118
Query: 152 AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEV 211
AES D CCN CE+VREAYR+KGWA +N D I QC REG+ ++K + GEGC +YG LEV
Sbjct: 119 AESKDLKCCNTCEDVREAYRRKGWAFNNADGIAQCSREGWSDKLKSQSGEGCQVYGHLEV 178
Query: 212 NKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRW 271
NKVAGNFHFAPGKSF Q VHVHD+ AF + FN+SH+IN L+FG +PG+ NPLD +
Sbjct: 179 NKVAGNFHFAPGKSFQQHHVHVHDLQAFSGEKFNLSHRINHLSFGHKYPGMENPLDNSKV 238
Query: 272 TQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR------SSEQGRLQTLPGVF 325
T + S MYQYF+K+VPT YT ++G T +SNQ+SVT+H + +S G LPGVF
Sbjct: 239 TSQKASIMYQYFVKIVPTTYTKLNGATTRSNQYSVTKHEKVVSTSLASAAGE-HGLPGVF 297
Query: 326 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
Y+ +P+ V +TE+H SF+HF+T VCAI+GGVFTV+G+ID+ IYH +AIKKKI++GK
Sbjct: 298 ILYEFAPLMVKYTEKHRSFMHFMTGVCAIIGGVFTVAGLIDSMIYHSSKAIKKKIDLGK 356
>gi|34849462|gb|AAH57130.1| Ergic3 protein [Mus musculus]
Length = 394
Score = 410 bits (1054), Expect = e-112, Method: Compositional matrix adjust.
Identities = 212/393 (53%), Positives = 276/393 (70%), Gaps = 17/393 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + L+ N C SCYGAES D CCN+CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTV-FDPNSLDPNR--CESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 240
Query: 240 QRDS------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY
Sbjct: 241 GLDNPSDCLQINMTHYIKHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMK 300
Query: 294 VSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT V
Sbjct: 301 VDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGV 359
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
CAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 CAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 392
>gi|255578837|ref|XP_002530273.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
gi|223530205|gb|EEF32113.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
Length = 265
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 183/246 (74%), Positives = 217/246 (88%)
Query: 90 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 149
MDI GEQH D+KH+I KKR+++ G+VIE R++GIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 1 MDIMGEQHFDIKHNITKKRINAHGDVIEVRKEGIGAPKIEKPLQRHGGRLEHNETYCGSC 60
Query: 150 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 209
YGAE SD+DCCN+C+EVREAYRKKGWAL+ DLIDQCKREGF+Q++K+EEGEGCNIYG L
Sbjct: 61 YGAEMSDDDCCNSCDEVREAYRKKGWALTGVDLIDQCKREGFIQKVKDEEGEGCNIYGSL 120
Query: 210 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 269
EVNKVAGNFHF+PGK HQS + D+L FQ DS+NISH IN+LAFG++FPGVVNPLDGV
Sbjct: 121 EVNKVAGNFHFSPGKGLHQSSFFIQDLLVFQGDSYNISHTINRLAFGDYFPGVVNPLDGV 180
Query: 270 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 329
W ETP+GM+QYF+KVVPT+YTD+ G T++SNQ+SVTEHF+ SE RL + PGVFFFYD
Sbjct: 181 PWVHETPNGMHQYFLKVVPTIYTDIRGRTVRSNQYSVTEHFKKSEFARLDSPPGVFFFYD 240
Query: 330 LSPIKV 335
SPIKV
Sbjct: 241 FSPIKV 246
>gi|57208594|emb|CAI42843.1| ERGIC and golgi 3 [Homo sapiens]
Length = 396
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 208/397 (52%), Positives = 273/397 (68%), Gaps = 21/397 (5%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 2 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 61
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 62 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 121
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 122 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 178
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH------------ 232
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VH
Sbjct: 179 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHGCVCRLKMIARS 238
Query: 233 ---VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT 289
VHD+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPT
Sbjct: 239 LACVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPT 298
Query: 290 VYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
VY V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HF
Sbjct: 299 VYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHF 357
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
LT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 358 LTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 394
>gi|426391505|ref|XP_004062113.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Gorilla gorilla gorilla]
gi|7959731|gb|AAF71038.1|AF116721_14 PRO0989 [Homo sapiens]
Length = 346
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 194/348 (55%), Positives = 251/348 (72%), Gaps = 6/348 (1%)
Query: 39 MLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
MLLLF SEL+ YL +L VD SRG+ L+IN DV FP +PC+ LS+DAMD++GEQ L
Sbjct: 1 MLLLFLSELQYYLTTEVHPELYVDKSRGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQL 60
Query: 99 DVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDED 158
DV+H++FK+RLD G + S + K++ + + C SCYGAE+ D
Sbjct: 61 DVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESCYGAEAEDIK 117
Query: 159 CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 218
CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNF
Sbjct: 118 CCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNF 177
Query: 219 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG 278
HFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD T S
Sbjct: 178 HFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASM 237
Query: 279 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVT 336
M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V
Sbjct: 238 MFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVK 296
Query: 337 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 297 LTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 344
>gi|156389237|ref|XP_001634898.1| predicted protein [Nematostella vectensis]
gi|156221986|gb|EDO42835.1| predicted protein [Nematostella vectensis]
Length = 386
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 195/388 (50%), Positives = 257/388 (66%), Gaps = 16/388 (4%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
I N +R DAYPK EDF +TF G +T +S +M +LF SEL YL +L VDT
Sbjct: 6 IFNTLRRFDAYPKTLEDFRIKTFGGAAVTFISGFLMFILFVSELNYYLTTEVNPELFVDT 65
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
+R + LRIN ++ FP LPC LS+DAMD+SGEQ +DV +I K+R+D G +I+
Sbjct: 66 TRAQKLRINVEIVFPKLPCVYLSIDAMDVSGEQQIDVSSNILKRRVDLDGKIIDE----- 120
Query: 124 GAPKIDKPLQRHGGR--LEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
A K D + H + L+ + C SCYGAE+ D+ CCN C++VREAYR+KGWALSN D
Sbjct: 121 NAEKGDLGDKSHEAKELLDLDPNRCESCYGAETPDKKCCNTCDDVREAYRRKGWALSNVD 180
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
+ QC REG+ +++E++ EGC + G+LEVNKVAGNFHFAPGKSF Q VHVHD+ F
Sbjct: 181 DVKQCMREGWKDKLQEQKNEGCEVTGYLEVNKVAGNFHFAPGKSFQQHHVHVHDLQPFGS 240
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
FN++H I L+FG +PG PLD MYQYF+K+VPT Y +SG + +
Sbjct: 241 TQFNLTHNIKHLSFGHDYPGKTYPLDNTFVPAMEAGSMYQYFVKIVPTTYRKLSGEILHT 300
Query: 302 NQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+QFSVT+H R S E G LPGVF Y+ SP+ V +TE SF+HFLT VCAIVG
Sbjct: 301 HQFSVTKHKRVIRQMSGEHG----LPGVFVLYEFSPMMVQYTESRRSFMHFLTGVCAIVG 356
Query: 357 GVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
G+FTV+G++D+ IYH RA++KKI++GK
Sbjct: 357 GIFTVAGLVDSMIYHSSRALQKKIDLGK 384
>gi|61555014|gb|AAX46646.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
Length = 346
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 196/352 (55%), Positives = 251/352 (71%), Gaps = 14/352 (3%)
Query: 39 MLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
MLLLF SEL+ YL +L VD SRG+ L+IN +V FP +PC+ LS+DAMD++GEQ L
Sbjct: 1 MLLLFLSELQYYLTTEVHPELYVDKSRGDKLKININVLFPHMPCAYLSIDAMDVAGEQQL 60
Query: 99 DVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAES 154
DV+H++FKKRLD G + S + G K+ P R C SCYGAE
Sbjct: 61 DVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR-------CESCYGAEM 113
Query: 155 SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKV 214
D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFLEVNKV
Sbjct: 114 EDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKV 173
Query: 215 AGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE 274
AGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD T
Sbjct: 174 AGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDHTNVTAP 233
Query: 275 TPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSP 332
S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPGVF Y+LSP
Sbjct: 234 QASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSP 292
Query: 333 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 293 MMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 344
>gi|449528843|ref|XP_004171412.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Cucumis sativus]
Length = 355
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 180/259 (69%), Positives = 226/259 (87%)
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+I+KPLQ+HGGRLEHNETYCGSC+GAE+SD+DCCN+CEEVREAYRKKGWA++N DLIDQC
Sbjct: 96 EIEKPLQKHGGRLEHNETYCGSCFGAEASDDDCCNSCEEVREAYRKKGWAITNQDLIDQC 155
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
+RE F+Q++K+EEGEGCNI G LEVNKVAG+FHF PGKSF+QS + +LA Q +N+
Sbjct: 156 QREDFIQKVKDEEGEGCNIEGSLEVNKVAGSFHFVPGKSFYQSSFNFLGLLALQTSDYNV 215
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH+IN+LAFG H+ G+VNPLDGV W + M+QYF+KVVPT+Y ++ G T+ SNQ+SV
Sbjct: 216 SHRINRLAFGNHYDGLVNPLDGVHWEYNEQNVMHQYFVKVVPTIYKNIRGRTVHSNQYSV 275
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
TEHF+S E G Q++PGVFF+YDLSP+KVT+TEEHV FLHF+T++CAI+GGVF+V+GIID
Sbjct: 276 TEHFKSVEFGSSQSIPGVFFYYDLSPVKVTYTEEHVPFLHFMTHICAIIGGVFSVAGIID 335
Query: 367 AFIYHGQRAIKKKIEIGKF 385
AFIYHGQR +KKK+EIGKF
Sbjct: 336 AFIYHGQRKMKKKVEIGKF 354
>gi|335304738|ref|XP_003360010.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Sus scrofa]
gi|350594872|ref|XP_003134465.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Sus scrofa]
Length = 398
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 210/401 (52%), Positives = 274/401 (68%), Gaps = 29/401 (7%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN+CE+VREAYR++GWA NP
Sbjct: 124 LGKVEIKVFDPDSLDPDR-------CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 236
Query: 236 ILAFQRDS----------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIK 285
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+K
Sbjct: 237 LQSFGLDNVSTGHRCCLQINMTHYIQHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVK 296
Query: 286 VVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVS 343
VVPTVY V G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H S
Sbjct: 297 VVPTVYMKVDGEVLRTNQFSVTRHEKVAS-GLMGDQGLPGVFVLYELSPMMVKLTEKHRS 355
Query: 344 FLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
F HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 FTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 396
>gi|351702542|gb|EHB05461.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Heterocephalus glaber]
Length = 378
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 209/386 (54%), Positives = 267/386 (69%), Gaps = 19/386 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHE 123
Query: 125 APKID----KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
K++ P R C SCYGAES D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVTVFDPESLDPDR-------CESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS HVH Q
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQS--HVHGWCCLQ 234
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G ++
Sbjct: 235 ---INMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 291
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+
Sbjct: 292 TNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 350
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 351 FTVAGLIDSLIYHSARAIQKKIDLGK 376
>gi|410953940|ref|XP_003983626.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 3 [Felis catus]
Length = 399
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 210/402 (52%), Positives = 273/402 (67%), Gaps = 30/402 (7%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HD 235
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHD 236
Query: 236 ILAFQRDS-----------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFI 284
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+
Sbjct: 237 LQSFGLDNRSRLRCWYCLQINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFV 296
Query: 285 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHV 342
KVVPTVY V G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H
Sbjct: 297 KVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHR 355
Query: 343 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 SFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 397
>gi|355563183|gb|EHH19745.1| Serologically defined breast cancer antigen NY-BR-84 [Macaca
mulatta]
gi|355784539|gb|EHH65390.1| Serologically defined breast cancer antigen NY-BR-84 [Macaca
fascicularis]
Length = 401
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 207/400 (51%), Positives = 273/400 (68%), Gaps = 24/400 (6%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS-GVH----------- 232
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS G +
Sbjct: 181 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHGTYLTGCVCRLKMI 240
Query: 233 ------VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKV 286
VHD+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KV
Sbjct: 241 ARSLACVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKV 300
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSF 344
VPTVY V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF
Sbjct: 301 VPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSF 359
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 360 THFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 399
>gi|66801671|ref|XP_629760.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Dictyostelium discoideum AX4]
gi|74851212|sp|Q54DW2.1|ERGI3_DICDI RecName: Full=Probable endoplasmic reticulum-Golgi intermediate
compartment protein 3
gi|60463164|gb|EAL61357.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Dictyostelium discoideum AX4]
Length = 383
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 197/387 (50%), Positives = 269/387 (69%), Gaps = 13/387 (3%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
++++++ DAYPK +DF +T++G +++++ + +L LFFS++ LY + +L VDT
Sbjct: 2 LISQLKKFDAYPKTVDDFRVKTYTGAIVSIIGGVFILWLFFSQVTLYFSTDIHHELFVDT 61
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
+RGE L+IN D+TF LPC+ LS+DAMD+SGE DV H+IFKKRL G I I
Sbjct: 62 TRGEKLKINMDITFHHLPCAYLSLDAMDVSGEHQFDVAHNIFKKRLSPTGQPI------I 115
Query: 124 GAPKI-DKPLQRHGGRLEHNETY-CGSCYGAESSDED--CCNNCEEVREAYRKKGWALSN 179
AP I ++ + + ++N+ CGSCYGAE + CCN CEEVR AY KKGW L +
Sbjct: 116 EAPPIREEEINKKESVKDNNDVVGCGSCYGAEDPSKGIGCCNTCEEVRVAYSKKGWGL-D 174
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
P I QC REGF + + E+ GEGC +YGF+ VNKVAGNFHFAPGKSF Q +HVHD+ F
Sbjct: 175 PSGIPQCIREGFTKNLVEQNGEGCQVYGFILVNKVAGNFHFAPGKSFQQHHMHVHDLQPF 234
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
+ SFN+SH IN+L+FG FPG+ NPLD V T+ GM+QYF+KVVPT+Y ++G+ I
Sbjct: 235 KDGSFNVSHTINRLSFGNDFPGIKNPLDDVTKTEMVGVGMFQYFVKVVPTIYEGLNGNRI 294
Query: 300 QSNQFSVTEHFR--SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+NQ+SVTEH+R + + LPG+FF YDLSPI + +E SF FLTNVCAI+GG
Sbjct: 295 ATNQYSVTEHYRLLAKKGEEPSGLPGLFFMYDLSPIMMKVSERGKSFASFLTNVCAIIGG 354
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
VFTV GI D+FIY+ + ++KKI++GK
Sbjct: 355 VFTVFGIFDSFIYYSTKNLQKKIDLGK 381
>gi|390359988|ref|XP_792057.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Strongylocentrotus purpuratus]
Length = 400
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 198/397 (49%), Positives = 272/397 (68%), Gaps = 20/397 (5%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ N++R DAYPK EDF +TF G +T++SSI+M+ LF SEL YL +L VD
Sbjct: 6 VWNRLREFDAYPKTLEDFRVKTFGGAAVTIISSIIMITLFISELNFYLTKEVIPELYVDA 65
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDG 122
+RGE L+IN ++ FP +PC+ LS+DAMDISGEQ LDV H+I+K+R+D G I E ++
Sbjct: 66 TRGEKLKINMEIVFPKMPCAYLSIDAMDISGEQQLDVDHNIYKRRIDKTGTPISEPEKEE 125
Query: 123 IGAPKIDKPLQRHG-------GRLE-HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 174
+G + + + ++E + C SCYGAE+ CCN+CE V+EAYR+KG
Sbjct: 126 LGKKEDQEKKEEEDSEQEDEKKKMEVLDPNRCESCYGAETPGLKCCNDCEGVQEAYRRKG 185
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
WA S+P I+QCKREGF ++++ ++ EGC +YG+LEVNKVAGNFHFAPGKSF Q VHVH
Sbjct: 186 WAFSDPTSIEQCKREGFSEKMQSQKEEGCELYGYLEVNKVAGNFHFAPGKSFQQHHVHVH 245
Query: 235 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV 294
D+ A FN++H + L+FG +PG+ NPLD ++ S M+QYF+K+VPT YT +
Sbjct: 246 DLQAIAGAKFNMTHHVKTLSFGMEYPGMENPLDNMKTIDVKGSSMFQYFVKIVPTTYTKL 305
Query: 295 SGHTIQSNQFSVTEH-------FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
++NQ+SVT+H F + E G LPGVF Y+LSP+ V FTE+H SF+HF
Sbjct: 306 DKSITRTNQYSVTKHEKQVTTSFSTGEHG----LPGVFVLYELSPLMVKFTEKHRSFMHF 361
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
LT VCAI+GGVFTV+G+ID+ IYH +AI+KKI++GK
Sbjct: 362 LTGVCAIIGGVFTVAGLIDSLIYHSAKAIQKKIDLGK 398
>gi|449265747|gb|EMC76893.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3,
partial [Columba livia]
Length = 330
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 191/333 (57%), Positives = 241/333 (72%), Gaps = 14/333 (4%)
Query: 58 KLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI- 116
+L VD SRG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD GN +
Sbjct: 4 ELYVDKSRGDKLKINLDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKAGNRVT 63
Query: 117 ---ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK 173
E + G K+ P R C SCYGAES D CCN C++VREAYR++
Sbjct: 64 PEAERHELGKEEEKVFDPNSLDADR-------CESCYGAESEDIRCCNTCDDVREAYRRR 116
Query: 174 GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
GWA NPD I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV
Sbjct: 117 GWAFKNPDTIEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHV 176
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
HD+ +F D+ N++H I L+FG +PG+VNPLDG T + S M+QYF+KVVPTVY
Sbjct: 177 HDLQSFGLDNINMTHYIKHLSFGRDYPGIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMK 236
Query: 294 VSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT V
Sbjct: 237 VDGEVVRTNQFSVTRHEKIA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGV 295
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
CAIVGG+FTV+G ID+ IYH RAI+KKIE+GK
Sbjct: 296 CAIVGGIFTVAGFIDSLIYHSARAIQKKIELGK 328
>gi|428183328|gb|EKX52186.1| hypothetical protein GUITHDRAFT_65491 [Guillardia theta CCMP2712]
Length = 425
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 205/388 (52%), Positives = 260/388 (67%), Gaps = 32/388 (8%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
++R D YPK +DF RT +G V++++ ++M +L E+ LYL T+ +L VDTSRG
Sbjct: 39 RLREFDIYPKTIQDFQVRTLAGAVVSILGFLIMFVLILGEINLYLTIQTDHELSVDTSRG 98
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQ---- 120
E L+INF++TF A+PC+I+S+D MDISGEQH+DV H+++K+RLD GNVI SR
Sbjct: 99 EKLQINFNITFHAMPCTIISLDTMDISGEQHIDVHHEVYKQRLDVDGNVILLLSRACLNV 158
Query: 121 -DGIGA-------PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRK 172
+G G D PL GG CGSCYGAE S ++CCN C+ VREAYR+
Sbjct: 159 TNGSGDFTTLRAHAGFDAPLT--GGE-------CGSCYGAEESPDECCNTCDSVREAYRR 209
Query: 173 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL-------EVNKVAGNFHFAPGKS 225
+GWA N D I QCK EGFL +++EE EGC + G L +VNKVAGNFHF+PGKS
Sbjct: 210 RGWAFVNSDGIVQCKTEGFLLKMQEERHEGCRVVGTLQARLTREQVNKVAGNFHFSPGKS 269
Query: 226 F-HQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFI 284
F Q GVH D+L ++ +N+SH IN L+FG +PG VNPLDGV E S MYQYF+
Sbjct: 270 FSQQVGVHFQDLLVLRKTDYNVSHAINHLSFGRKYPGRVNPLDGVVRICEFRSAMYQYFV 329
Query: 285 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 344
KVVPT Y +G + +NQFS TE+ R E G + LPGVFFFYDLSPIK T E + SF
Sbjct: 330 KVVPTQYQYRNGTILSTNQFSTTENTRQLE-GFTRGLPGVFFFYDLSPIKATLAERNNSF 388
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHG 372
LHFLT +CAI+GGVFTV GIID+ IY G
Sbjct: 389 LHFLTGLCAIIGGVFTVMGIIDSTIYTG 416
>gi|326434226|gb|EGD79796.1| intermediate compartment protein 3 [Salpingoeca sp. ATCC 50818]
Length = 396
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 199/398 (50%), Positives = 267/398 (67%), Gaps = 24/398 (6%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ +K+R+LDAYPK EDF +TFSG I++V+ +++++LF SEL YL+ E +L VDT
Sbjct: 6 VWSKLRNLDAYPKTLEDFRVKTFSGAAISIVAILLIVVLFTSELVYYLSTEVEPELFVDT 65
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI------- 116
SR E +RIN DVTF + C+ L +D MD+SGE LDV+HDIFK+RL G I
Sbjct: 66 SRDEKMRINVDVTFHKMACAFLHLDIMDVSGENELDVEHDIFKQRLTETGTPIYEEPEEV 125
Query: 117 ----ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRK 172
+ +GA K+ K L+ N C SCYGAES CCN CE VREAYR+
Sbjct: 126 DDLGDESDSAVGALKMMKE------GLDPNR--CESCYGAESEQNKCCNTCEAVREAYRR 177
Query: 173 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
KGWAL++ I+QC+REG+ +++K + EGC IYG LEVNKVAGNFH APGKSF Q +H
Sbjct: 178 KGWALTDIQGIEQCEREGWTEKLKAQAKEGCRIYGHLEVNKVAGNFHIAPGKSFQQHSIH 237
Query: 233 VHDILAFQRDS---FNISHKINKLAFGEHFPGVVNPLDGVRWTQET-PSGMYQYFIKVVP 288
HD+ +F R++ FN+SH IN L+FG +PGVVNPLDG T + + MYQY++K+VP
Sbjct: 238 FHDLNSFGREALGKFNMSHTINHLSFGIEYPGVVNPLDGHSETADKLGATMYQYYVKIVP 297
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHF 347
T Y G + +NQ+SVT H R + QT LPG+F +++SPI V +E SF HF
Sbjct: 298 TRYRKARGQELNTNQYSVTMHQRHIDHKAGQTGLPGMFVMFEISPILVQLSERTHSFFHF 357
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
LT V AI+GG+F+V+G+ID+F+YHG R++KKK E+GK
Sbjct: 358 LTGVLAIIGGIFSVAGMIDSFVYHGLRSLKKKQELGKL 395
>gi|335774962|gb|AEH58414.1| endoplasmic reticulum-golgi intermediat compartment protein 3-like
protein [Equus caballus]
Length = 354
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 199/360 (55%), Positives = 258/360 (71%), Gaps = 14/360 (3%)
Query: 31 ITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAM 90
+T+VS ++MLLLF SEL+ YL +L VD SRG+ L+IN DV FP +PC+ LS+DAM
Sbjct: 1 VTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGDKLKINIDVFFPHMPCAYLSIDAM 60
Query: 91 DISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETYC 146
D++GEQ LDV+H++FK+RLD G + S + G K+ P R C
Sbjct: 61 DVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR-------C 113
Query: 147 GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 206
SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +Y
Sbjct: 114 ESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVY 173
Query: 207 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 266
GFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPL
Sbjct: 174 GFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNPL 233
Query: 267 DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGV 324
D T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPGV
Sbjct: 234 DRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPGV 292
Query: 325 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
F Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 293 FVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 352
>gi|281211641|gb|EFA85803.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Polysphondylium pallidum PN500]
Length = 388
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 194/400 (48%), Positives = 270/400 (67%), Gaps = 29/400 (7%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ K++S DAYPK +DF +T++G ++++VSSI ++ LF S++ +Y+ T +L VDT
Sbjct: 1 MFQKLKSFDAYPKTVDDFRVKTYAGAIVSIVSSIFIIWLFLSQISIYMTTETHHELFVDT 60
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI---ESRQ 120
+R E L+IN DV F LPC+ LS+DAMD+SGE DV H+IFK+RL G I R+
Sbjct: 61 NRAEKLKINIDVVFHHLPCAYLSLDAMDVSGEHQFDVAHNIFKRRLSPTGEFIPDAPKRE 120
Query: 121 DGIG-APKIDKPLQRHGGRLEHNETYCGSCYGAESSDE--DCCNNCEEVREAYRKKGWAL 177
D + PK++ E++ CGSC GAE+ + +CCN CEEVR AY+K GW
Sbjct: 121 DNVNIKPKVN----------ENDRPECGSCMGAENPSKGINCCNTCEEVRVAYQKMGWGF 170
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+P QC REGF + + E+ GEGC +YGFL VNKVAGNFHFAPGKSF Q +HVHD+
Sbjct: 171 -DPSDTPQCVREGFTKNVVEQNGEGCQVYGFLLVNKVAGNFHFAPGKSFQQHHMHVHDLQ 229
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETP---------SGMYQYFIKVVP 288
+F + FN+SH I++L+FG FPG+ NPLDGV T+ SGM+QY++K+VP
Sbjct: 230 SF-KGQFNLSHTISRLSFGNDFPGIKNPLDGVSKTEANQYQYHNLVVGSGMFQYYVKIVP 288
Query: 289 TVYTDVSGHTIQSNQFSVTEHFR--SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
T+Y ++G+ I +NQ+SVTEH+R + + + LPG+FF YDLSPI + E SF
Sbjct: 289 TIYEGLNGNLINTNQYSVTEHYRLLAKKGEEMTGLPGLFFMYDLSPIMMKVVERSKSFAS 348
Query: 347 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
F+T+VCAIVGGVFTV+GI D+FIY +++K+KI++GK S
Sbjct: 349 FITSVCAIVGGVFTVAGIFDSFIYQTTKSLKRKIDLGKAS 388
>gi|440902508|gb|ELR53293.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3 [Bos
grunniens mutus]
Length = 395
Score = 393 bits (1010), Expect = e-107, Method: Compositional matrix adjust.
Identities = 205/398 (51%), Positives = 267/398 (67%), Gaps = 26/398 (6%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN +V FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE D CCN+CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH-------- 232
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VH
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHGCREEVRV 236
Query: 233 ----VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVP 288
+ + N++H I L+FGE +PG+VNPLD T S M+QYF+KVVP
Sbjct: 237 TGARCSEAQGWCCLQINMTHYIRHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVP 296
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
TVY V G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF H
Sbjct: 297 TVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTH 355
Query: 347 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
FLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 356 FLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 393
>gi|320167013|gb|EFW43912.1| Ergic3 protein [Capsaspora owczarzaki ATCC 30864]
Length = 392
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 188/394 (47%), Positives = 264/394 (67%), Gaps = 20/394 (5%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ +++ LDAY K ED +T+ G ++++V +++M LF SEL +L T +LLVD
Sbjct: 5 SVLGRLKQLDAYAKTTEDVRIKTYGGAIVSIVCALIMAALFVSELNYFLTTETHHELLVD 64
Query: 63 TSRG--ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE--S 118
T+R + LRIN +VTFP LPC+ +S+D MD++GE LDV H + K RL + G V+ +
Sbjct: 65 TTRAGEQKLRININVTFPRLPCAYMSIDVMDVAGEHQLDVLHTLVKTRLSASGEVVREPT 124
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
+ +G +P R + + + CG CYGA++ CCN+CEEV+ AYR+KGW +
Sbjct: 125 PVEALG----QQPPSDAAERRDLDNSKCGDCYGAQTEKRPCCNSCEEVQAAYREKGWGMM 180
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+PD I+QC++EGF +R++ EGC + GF+ VNKVAGNFHFAPGKS VHVHD+
Sbjct: 181 DPDSIEQCRQEGFSERMRSIANEGCKVQGFMYVNKVAGNFHFAPGKSSQHQHVHVHDLQQ 240
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWT--QETP-SGMYQYFIKVVPTVYTDVS 295
F+ +F+++H I+ L+FG +PG VNPLD V + TP S M+QYFIKVVPT Y ++
Sbjct: 241 FKTTTFDMTHTIHLLSFGTEYPGQVNPLDAVSKVPPENTPGSAMFQYFIKVVPTEYVKLN 300
Query: 296 GHTIQSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 350
G T Q++QFS T H + + E G LPGVFF Y+ SP+ V TE SF+HFLT
Sbjct: 301 GETEQTSQFSATSHVKMINHAAGENG----LPGVFFMYEPSPMLVKITERRKSFMHFLTG 356
Query: 351 VCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
VCAIVGGVFTV+G++DA IYH R+IKKK+E+GK
Sbjct: 357 VCAIVGGVFTVAGLVDATIYHSYRSIKKKMELGK 390
>gi|340373749|ref|XP_003385402.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Amphimedon queenslandica]
Length = 386
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 210/395 (53%), Positives = 274/395 (69%), Gaps = 18/395 (4%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +++ ++++LDAY K EDF +TFSG ITLVSSI++LLLF SEL +L+ + +L
Sbjct: 1 MASMLGRLKNLDAYSKTLEDFKIKTFSGATITLVSSIIILLLFLSELLYFLSTDVKQELY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-ESR 119
VDTSRGE L+IN D+ F PC LS+D MD+SGE LDV+H ++K+RL G VI ES
Sbjct: 61 VDTSRGEKLQINVDIIFHRAPCLYLSIDVMDVSGEHQLDVEHTMYKQRLTLDGEVINESP 120
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
+ A + G+ CGSCYGAE+ + CCN CE+VREAYRKKGWA S+
Sbjct: 121 TKSVLARD-----ETQDGKAGAANKTCGSCYGAETPELSCCNTCEQVREAYRKKGWAFSD 175
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
P I+QC++EG+ +IKE+ EGC +YG ++V+KVAGNFHFAPGKSF Q VHVHD+ F
Sbjct: 176 PSSIEQCEKEGWTTQIKEQMNEGCRVYGLIDVSKVAGNFHFAPGKSFQQHSVHVHDLQPF 235
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVR-WTQETPSG--MYQYFIKVVPTVYTDVSG 296
FN+SH + KL+FG+ +PG++NPLDG + + ET G MYQYFIKVVPT+Y ++
Sbjct: 236 GVKHFNMSHTVLKLSFGQEYPGIINPLDGHKAFDVETTHGGIMYQYFIKVVPTLYRRLNN 295
Query: 297 HTIQSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
T+ +NQF+VT+H R S E G LPGVFF YD+SPI V TE S HFLT+V
Sbjct: 296 ETMGTNQFAVTKHQRPVRSASGEHG----LPGVFFIYDISPILVYLTEYRHSLTHFLTSV 351
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
CAIVGGVFTV+G+ID +YH R +KKK+E+GK S
Sbjct: 352 CAIVGGVFTVAGMIDKLLYHSGRVLKKKMELGKLS 386
>gi|395510083|ref|XP_003759313.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3, partial [Sarcophilus harrisii]
Length = 335
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 187/338 (55%), Positives = 243/338 (71%), Gaps = 19/338 (5%)
Query: 58 KLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI- 116
+L VD SRG+ L+IN D+ FP +PC+ LS+DAMD++GEQ LDV+H+++K+RLD G+ +
Sbjct: 4 ELYVDKSRGDKLKINIDIFFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGHPVT 63
Query: 117 ---ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK 173
E + G K+ P R C SCYGAES D CCN CE+VREAYR++
Sbjct: 64 TEAERHELGKEEEKVFDPSSLDPER-------CESCYGAESEDSKCCNTCEDVREAYRRR 116
Query: 174 GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
GWA NPD I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV
Sbjct: 117 GWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHV 176
Query: 234 -----HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVP 288
HD+ +F D+ N++H I +L+FGE +PG+VNPLD T S M+QYF+KVVP
Sbjct: 177 HAVEIHDLQSFGLDNINMTHYIRRLSFGEDYPGIVNPLDDTNITAPQASMMFQYFVKVVP 236
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
TVY V+G ++SNQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF H
Sbjct: 237 TVYMKVNGEVLRSNQFSVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTH 295
Query: 347 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
FLT VCAI+GG+FTV+G+ID+ IYH RAI+KKIE+GK
Sbjct: 296 FLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIELGK 333
>gi|383864675|ref|XP_003707803.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Megachile rotundata]
Length = 385
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 186/395 (47%), Positives = 254/395 (64%), Gaps = 23/395 (5%)
Query: 5 MNKIRSLDAYPKINE--DFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
M +R LD +PK+ E D RTFSG V+T++S+I+M +LF +EL YL +L VD
Sbjct: 1 MQILRQLDVHPKVREEADILVRTFSGAVVTIISTIIMAILFLTELNYYLTPTLSEELFVD 60
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
TSRG LRIN D+ P + C +LS+DAMD +GEQHL ++H+I+K+RLD QG IE Q
Sbjct: 61 TSRGSKLRINLDIVVPTISCDLLSIDAMDTTGEQHLQIEHNIYKRRLDLQGKPIEDPQ-- 118
Query: 123 IGAPKID----KPLQRHGGRLEHNETY--CGSCYGAESSDEDCCNNCEEVREAYRKKGWA 176
K D K L + + + T CG CYGA S CCN CE+VR+AY K WA
Sbjct: 119 ----KTDITDTKALSKTTAKSVESTTVETCGDCYGAASEKIKCCNTCEDVRKAYSDKNWA 174
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+P I QC+ + ++++K +GC IYG++EVN+V G+FH APG SF + VHVHD+
Sbjct: 175 PPDPGSIKQCQNDKSVEKMKTAFTQGCQIYGYMEVNRVGGSFHIAPGNSFSVNHVHVHDV 234
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
+ FN++HKI L+FG + PG NP+D + M+ ++IK+VPT Y G
Sbjct: 235 QPYMSTQFNMTHKIRHLSFGLNIPGKTNPIDDTTMVAMEGAMMFYHYIKIVPTTYVRADG 294
Query: 297 HTIQSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
T+ +NQFSVT H R S E G +PG+FF Y+LSP+ V +TE+ SF HF TN+
Sbjct: 295 STLLTNQFSVTRHARQVSLLSGESG----MPGIFFSYELSPLMVKYTEKAKSFGHFATNM 350
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
CAI+GGVFTV+G+ID+F+YH RAI+KKIE+GK+S
Sbjct: 351 CAIIGGVFTVAGLIDSFLYHSVRAIQKKIELGKYS 385
>gi|332024433|gb|EGI64631.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Acromyrmex echinatior]
Length = 386
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 178/390 (45%), Positives = 250/390 (64%), Gaps = 12/390 (3%)
Query: 5 MNKIRSLDAYPKINE--DFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
M +R LD +PK+ E D RTFSG ++T++S+I+M +LF SE+ YL +L VD
Sbjct: 1 MQMLRQLDVHPKVREEADILVRTFSGAIVTIISTIIMGILFLSEINYYLTPTMSEELFVD 60
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ-D 121
TSRG LRIN D+ P++ C +LS+DAMD +GEQHL ++H+IFK+RLD GN IE Q
Sbjct: 61 TSRGSKLRINLDIIVPSISCDLLSLDAMDTTGEQHLHIEHNIFKRRLDLNGNPIEDPQRT 120
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
I K + CG CYGA + CCN CE+V EAYR+K WA +P
Sbjct: 121 NITDAKAMSKTTEKAVEIGSTTELCGDCYGATTDTMKCCNTCEDVWEAYRRKKWAPPDPA 180
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
+ QC+ + + ++K +GC IYG++EVN+V G+FH APG SF + VHVHD+ +
Sbjct: 181 DVKQCQNDKSMDKLKHAFTQGCQIYGYMEVNRVGGSFHIAPGASFSVNHVHVHDVQPYTS 240
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
FN++HKI L+FG + PG NP+DG+ + M+ ++IK+VPT Y G T+ +
Sbjct: 241 SHFNMTHKIRHLSFGLNIPGKTNPMDGMTVVDMDAAMMFYHYIKIVPTTYVRADGSTLLT 300
Query: 302 NQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
NQFSVT H + + E G +PG+FF Y+LSP+ V +TE+ SF HF TN CAI+G
Sbjct: 301 NQFSVTRHSKKVSLLTGESG----MPGIFFNYELSPLMVKYTEKANSFGHFATNTCAIIG 356
Query: 357 GVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
GVFTV+G+ID+ +YH RAI++KIE+GK++
Sbjct: 357 GVFTVAGLIDSLLYHSVRAIQRKIELGKYN 386
>gi|350404831|ref|XP_003487234.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Bombus impatiens]
Length = 385
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 182/389 (46%), Positives = 245/389 (62%), Gaps = 11/389 (2%)
Query: 5 MNKIRSLDAYPKINE--DFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
M +R LD +PK+ E D RTFSG V+T++S+I+M +LF SE+ YL +L VD
Sbjct: 1 MQILRQLDVHPKVREEADILVRTFSGAVVTIISTIIMGILFLSEVNYYLTPTLSEELFVD 60
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
TSRG LRIN D+ P + C +LS+DAMD +GEQHL ++H+IFK+RLD G IE Q
Sbjct: 61 TSRGSKLRINLDIIVPTISCDVLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRT 120
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+ E CG CYGA CCN CE+VREAYR K WAL +
Sbjct: 121 DITDTKARSKTTTKTVESTTEKACGDCYGAAGDIIKCCNTCEDVREAYRLKNWALPALGM 180
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
I QCK + ++++K +GC IYG++EVN+V G+FH APG SF + VHVHD+ +
Sbjct: 181 IKQCKNDKSVEKMKTAFIQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVKPYTST 240
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
FN++HKI L+FG + PG NP+D + M+ ++IK+VPT Y G T+ +N
Sbjct: 241 QFNMTHKIRHLSFGLNIPGKTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTLLTN 300
Query: 303 QFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
QFSVT H R S E G +PG+FF Y+LSP+ V +TE+ SF HF TN CAI+GG
Sbjct: 301 QFSVTRHARQVSLFSGESG----MPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGG 356
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
VFTV+G+ID+ +YH RAI+KKIE+GK++
Sbjct: 357 VFTVAGLIDSLLYHSVRAIQKKIELGKYN 385
>gi|328786822|ref|XP_393819.4| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Apis mellifera]
Length = 383
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 187/393 (47%), Positives = 251/393 (63%), Gaps = 21/393 (5%)
Query: 5 MNKIRSLDAYPKINE--DFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
M +R LD +PK+ E D RTFSG V+T++S+I+M +LF SE+ YL +L VD
Sbjct: 1 MQILRQLDVHPKVREEADILVRTFSGAVVTIISTIIMGILFLSEMNYYLTPTLSEELFVD 60
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE--SRQ 120
TSRG LRIN D+ P + C +LS+DAMD +GEQHL ++H+IFK+RLD G IE R
Sbjct: 61 TSRGSKLRINLDIIVPTISCDLLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRT 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHN-ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWA-LS 178
D + K + LE E CG CYGA S CCN CE+VREAYR K WA L
Sbjct: 121 DITDTKALSKTTAK---TLESTTEKICGDCYGAASEIIKCCNTCEDVREAYRLKNWAVLG 177
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
N I QC+ + ++++K +GC IYG++EVN+V G+FH APG SF + VHVHD+
Sbjct: 178 N---IKQCQNDKSVEKMKTAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQP 234
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
+ FN++HKI L+FG + PG NP+D + M+ ++IK+VPT Y G T
Sbjct: 235 YTSTQFNMTHKIRHLSFGLNIPGKTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGST 294
Query: 299 IQSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
+ +NQFSVT H R S E G +PG+FF Y+LSP+ V +TE+ SF HF TN CA
Sbjct: 295 LLTNQFSVTRHARQVSLFSGESG----MPGIFFNYELSPLMVKYTEKAKSFGHFATNACA 350
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
I+GGVFTV+G+ID+ +YH RAI+KKIE+GK++
Sbjct: 351 IIGGVFTVAGLIDSLLYHSLRAIQKKIELGKYN 383
>gi|380016121|ref|XP_003692037.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Apis florea]
Length = 385
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 185/392 (47%), Positives = 249/392 (63%), Gaps = 17/392 (4%)
Query: 5 MNKIRSLDAYPKINE--DFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
M +R LD +PK+ E D RTFSG V+T++S+I+M +LF SE+ YL +L VD
Sbjct: 1 MQILRQLDVHPKVREEADILVRTFSGAVVTIISTIIMGILFLSEVNYYLTPTLSEELFVD 60
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE--SRQ 120
TSRG LRIN D+ P + C +LS+DAMD +GEQHL ++H+IFK+RLD G IE R
Sbjct: 61 TSRGSKLRINLDIIVPTISCDLLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRT 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHN-ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
D + K + LE E CG CYGA S CCN CE+VREAYR K WA
Sbjct: 121 DITDTKALSKTTAK---TLESTTEKICGDCYGAASEIIKCCNTCEDVREAYRLKNWAPPV 177
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
I QC+ + ++++K +GC IYG++EVN+V G+FH APG SF + VHVHD+ +
Sbjct: 178 LGNIKQCQNDKSVEKMKTAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPY 237
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
FN++HKI L+FG + PG NP+D + M+ ++IK+VPT Y G T+
Sbjct: 238 TSTQFNMTHKIRHLSFGLNIPGKTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTL 297
Query: 300 QSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+NQFSVT H R S E G +PG+FF Y+LSP+ V +TE+ SF HF TN CAI
Sbjct: 298 LTNQFSVTRHARQVSLFSGESG----MPGIFFNYELSPLMVKYTEKAKSFGHFATNACAI 353
Query: 355 VGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
+GGVFTV+G+ID+ +YH RAI+KKIE+GK++
Sbjct: 354 IGGVFTVAGLIDSLLYHSLRAIQKKIELGKYN 385
>gi|332248939|ref|XP_003273622.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Nomascus leucogenys]
Length = 380
Score = 373 bits (957), Expect = e-101, Method: Compositional matrix adjust.
Identities = 197/387 (50%), Positives = 259/387 (66%), Gaps = 19/387 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC G LQR + E C++ +VAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCPARG-LQRTQPENERECSL-------QVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 232
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +
Sbjct: 233 GLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 292
Query: 300 QSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG
Sbjct: 293 RTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGG 351
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 352 MFTVAGLIDSLIYHSARAIQKKIDLGK 378
>gi|307179776|gb|EFN67966.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Camponotus floridanus]
Length = 385
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 182/391 (46%), Positives = 252/391 (64%), Gaps = 17/391 (4%)
Query: 5 MNKIRSLDAYPKINE--DFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
M +R LD +PK+ E D RTFSG ++T++S+I+M +L SE+ YL +L VD
Sbjct: 1 MQILRQLDVHPKVREEADILVRTFSGAIVTVISTIIMGILLMSEINYYLTPSMSEELFVD 60
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE--SRQ 120
TSRG LRIN D+ P + C +LS+DAMD +GEQHL ++H+IFK+RLD G IE R
Sbjct: 61 TSRGSKLRINLDIIVPVISCDLLSIDAMDTTGEQHLHIEHNIFKRRLDLNGKPIEDPQRT 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNET-YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
+ + ++K ++ LE T CG CYGA + CCN CEEVREAY+ K WA +
Sbjct: 121 NITDSKAVNKTAEK---ALEIGSTESCGDCYGAATETLRCCNTCEEVREAYKLKKWAPPD 177
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
P I QCK + +++IK +GC IYG++EVN+V G+FH APG SF + VHVHD+ +
Sbjct: 178 PANIKQCKDDKSMEKIKHAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPY 237
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
FN++HKI L+FG + PG NP+D + M+ ++IK+VPT Y G T+
Sbjct: 238 TSTHFNMTHKIRHLSFGLNIPGKTNPMDDTTVIATEGAMMFYHYIKIVPTTYVRTDGSTL 297
Query: 300 QSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+NQFSVT H + + E G +PG+FF Y+LSP+ V +TE+ SF HF TN CAI
Sbjct: 298 FTNQFSVTRHAKQVSLFTGESG----MPGIFFSYELSPLMVKYTEKAKSFGHFATNTCAI 353
Query: 355 VGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
+GGVFTV+G+ID+ +YH RAI+KKIE+GK+
Sbjct: 354 IGGVFTVAGLIDSLLYHSVRAIQKKIELGKY 384
>gi|167535515|ref|XP_001749431.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772059|gb|EDQ85716.1| predicted protein [Monosiga brevicollis MX1]
Length = 394
Score = 368 bits (945), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 190/393 (48%), Positives = 261/393 (66%), Gaps = 11/393 (2%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
AI + ++ DAYPK +DF +TFSG +++++ I+M++LF SEL +L+ +L VD
Sbjct: 2 AIFDNLKRFDAYPKTLDDFRVKTFSGAAVSIIAIIIMVILFSSELVYFLSTDVHEELFVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ-- 120
T+R E LRIN D+TFP +PC LS+D MDISGE ++ HD+F++RLD+ GN I + Q
Sbjct: 62 TARNEKLRINLDITFPKMPCVYLSLDVMDISGENEQNIDHDVFRQRLDASGNKIYNGQEE 121
Query: 121 -DGIGAPKIDKPLQRH-GGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
D +G D + G + + C SCYGAE ++ CCN C +V+EAYRKKGWA
Sbjct: 122 IDELGESHADNVADKALDGLKDLDPNRCESCYGAEDTEGQCCNTCAQVQEAYRKKGWAFR 181
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ I QC+REG+ ++ +E EGC +YG LEVNKVAGNFH APG+SF Q +H+HD+ +
Sbjct: 182 SGQGIAQCEREGYDAMMEAQEREGCQLYGHLEVNKVAGNFHIAPGRSFEQHNMHIHDMQS 241
Query: 239 FQRD---SFNISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDV 294
F R+ FN++H IN L+FG +P VN LDG V E + MYQYF+KVVPT Y +
Sbjct: 242 FGREKLAKFNLTHVINHLSFGIDYPDRVNSLDGHVEVPNEYGAIMYQYFLKVVPTRYRFL 301
Query: 295 SGHTIQSNQFSVTEHFRS--SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
S I +NQ+SVT H R +QG LPG+FF YD+SP+K+ T+ SF HFLT +C
Sbjct: 302 SQTEIDTNQYSVTMHQREIRPDQG-TSGLPGLFFMYDISPMKIQLTQSSRSFFHFLTGLC 360
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
AI+GGV+TV+G+ID F+YHG R +K K +GK
Sbjct: 361 AIIGGVYTVAGMIDGFLYHGIRTLKAKQNMGKL 393
>gi|215704311|dbj|BAG93745.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 261
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 168/259 (64%), Positives = 206/259 (79%), Gaps = 2/259 (0%)
Query: 90 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 149
MDISGEQH D++HDI K+RLD+ GNVIE+R++GIG KI+ PLQ+HGGRL E YCG+C
Sbjct: 1 MDISGEQHHDIRHDIEKRRLDAHGNVIEARKEGIGGAKIESPLQKHGGRLSKGEEYCGTC 60
Query: 150 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 209
YGAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++GEGCN++GFL
Sbjct: 61 YGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCTREDFVERVKTQQGEGCNVHGFL 120
Query: 210 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 269
+V+KVAGN HFAPGK F++S ++V ++ A + FNI+HKINKL+FG FPGVVNPLDG
Sbjct: 121 DVSKVAGNLHFAPGKGFYESNINVPELSALEH-GFNITHKINKLSFGTEFPGVVNPLDGA 179
Query: 270 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 329
+WTQ G YQYFIKVVPT+YTD+ G I SNQFSVTEHFR R + PGVFFFYD
Sbjct: 180 QWTQPASDGTYQYFIKVVPTIYTDLRGRKIHSNQFSVTEHFRDGNI-RPKPQPGVFFFYD 238
Query: 330 LSPIKVTFTEEHVSFLHFL 348
SPIKV E + + F+
Sbjct: 239 FSPIKVVTMERNSYVVMFI 257
>gi|328868763|gb|EGG17141.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Dictyostelium fasciculatum]
Length = 335
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 184/342 (53%), Positives = 235/342 (68%), Gaps = 17/342 (4%)
Query: 51 LNAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
+ T +L VDT+RGE LRIN DV F LPC+ LS+DAMD+SG+ DV H+IFKKRL
Sbjct: 1 MTTETHHELFVDTTRGEKLRINMDVVFHHLPCAFLSLDAMDVSGDHQFDVAHNIFKKRLS 60
Query: 111 SQGNVIE----SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE--DCCNNCE 164
G I R+D I +R E+++ CGSCYGAE CC+ CE
Sbjct: 61 PTGMPIADASPQREDTIN--------KRVPAGNENDKVDCGSCYGAEDPSRGISCCSTCE 112
Query: 165 EVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGK 224
EVR AY+KKGW++ I QC REGF + I E+ GEGC +YGF+ VNKVAGNFHFAPGK
Sbjct: 113 EVRTAYQKKGWSIQEYSGIAQCVREGFTKNIVEQNGEGCQVYGFINVNKVAGNFHFAPGK 172
Query: 225 SFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFI 284
SF Q +HVHD+ AF + SFN+SH IN+L+FG FPG+ NPLDGV T+ SGM+QY+I
Sbjct: 173 SFQQHHMHVHDLQAF-KGSFNLSHSINRLSFGNDFPGIKNPLDGVTKTEMVGSGMFQYYI 231
Query: 285 KVVPTVYTDVSGHTIQSNQFSVTEHFR--SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 342
KVVPT+Y ++G+ I +NQFSVTEH+R + + LPG+FF YDLSPI + +E+
Sbjct: 232 KVVPTLYEGLNGNRISTNQFSVTEHYRLLAKKDEEPSGLPGLFFMYDLSPIMMKVSEQGK 291
Query: 343 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF FLT+VCAIVGGVFTV+GI+D+ IY + +KKKI++GK
Sbjct: 292 SFASFLTSVCAIVGGVFTVAGILDSMIYKTTKNLKKKIDLGK 333
>gi|307193219|gb|EFN76110.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Harpegnathos saltator]
Length = 386
Score = 364 bits (934), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 174/391 (44%), Positives = 247/391 (63%), Gaps = 14/391 (3%)
Query: 5 MNKIRSLDAYPKINE--DFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
M +R LD +PK+ E D RTFSG ++T++S+I+M +LF SE+ YL +L VD
Sbjct: 1 MQILRQLDVHPKVREEADILVRTFSGAIVTIISTIIMGILFMSEINYYLTPTMSEELFVD 60
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE--SRQ 120
TSRG LRIN DV P + C +LSVDAMD +G Q+L ++H+IF++RLD G IE R
Sbjct: 61 TSRGSKLRINLDVIVPTISCDLLSVDAMDTTGVQYLQIEHNIFQRRLDLNGKPIEDPQRT 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+ + KP ++ CG CYGA + +CCN C++V+ AYR K WA+ +
Sbjct: 121 NITKTKAVVKPTDEET-QISSTTKVCGDCYGAATETLECCNTCDDVQMAYRLKKWAMPDL 179
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
I QC+ + + K +GC IYG++EVN+V G+FH APG S+ + VHVHD+ +
Sbjct: 180 AKIKQCQNDKSADKYKHAFTQGCQIYGYMEVNRVGGSFHIAPGDSYSVNHVHVHDVQPYN 239
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ FN++HKI L+FG + PG NP+D + M+ Y+IK+VPT Y G T+
Sbjct: 240 SNHFNMTHKIRHLSFGLNIPGKTNPMDDTTTVATEGAMMFYYYIKIVPTTYVRADGSTLL 299
Query: 301 SNQFSVTEHFRS-----SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+NQFSVT H + S+ G +PG+FF Y+LSP+ V +TE+ SF HF TN CAI+
Sbjct: 300 TNQFSVTRHSKRMPLYMSDSG----MPGIFFSYELSPLMVKYTEKAKSFGHFATNTCAII 355
Query: 356 GGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
GGVFTV+G+ID+ +YH RAI+KKIE+GK++
Sbjct: 356 GGVFTVAGLIDSLLYHSVRAIQKKIELGKYN 386
>gi|270007946|gb|EFA04394.1| hypothetical protein TcasGA2_TC014693 [Tribolium castaneum]
Length = 385
Score = 363 bits (932), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 187/394 (47%), Positives = 247/394 (62%), Gaps = 19/394 (4%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M I K+R DAYPK ED +T+ G V+T++S +M LLF+ EL YL +L
Sbjct: 1 MFNIFEKLRRFDAYPKTLEDVRIKTYGGAVVTIISLTIMTLLFWVELVDYLTPNVSEELF 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSR +++IN D+ P + C L++DAMD SGEQHL + H+I+K+RLD QG IE +
Sbjct: 61 VDTSRSPSIQINLDIIVPTISCDFLALDAMDSSGEQHLQIDHNIYKRRLDLQGQPIEEPK 120
Query: 121 DGIGAPKIDKPLQRHGGR--LEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL- 177
K D ++R N+T CGSCYGA + CCN CE+VREAYR++ WA
Sbjct: 121 ------KEDITIKRKNSTEVATVNKTECGSCYGASFDPKRCCNTCEDVREAYRERRWAFP 174
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
NP+ I QCK E F +++K +GC IYG L VN+V+G+FH APGKSF + VHVHD+
Sbjct: 175 ENPENITQCKEERFSEKLKTAFAQGCQIYGSLTVNRVSGSFHIAPGKSFSINHVHVHDVQ 234
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVV-NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
F FN +HKI L+FG NPL E + M+QY IK+VPT Y + G
Sbjct: 235 PFSSTEFNTTHKIRHLSFGASIDSDTHNPLKDTVGLAEEGASMFQYHIKIVPTAYVKLDG 294
Query: 297 HTIQSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
I +NQFSVT+H R S E G +PG+FF Y+LSP+ V +TE+ SF HF TNV
Sbjct: 295 QFISANQFSVTKHRRVISLMSGESG----MPGIFFQYELSPLMVKYTEQSRSFGHFATNV 350
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
CAI+GGV+TV+G+ID +YH + I+KKIE+GKF
Sbjct: 351 CAIIGGVYTVAGLIDTMLYHSVKLIQKKIELGKF 384
>gi|441638772|ref|XP_004090166.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Nomascus leucogenys]
Length = 393
Score = 363 bits (932), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 197/400 (49%), Positives = 259/400 (64%), Gaps = 32/400 (8%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+
Sbjct: 124 LGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIE 180
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAF 239
QC G LQR + E C++ +VAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 181 QCPARG-LQRTQPENERECSL-------QVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 232
Query: 240 QRDS-------------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKV 286
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KV
Sbjct: 233 GLDNVQLWMSSGWCCLQINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKV 292
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSF 344
VPTVY V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF
Sbjct: 293 VPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSF 351
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 352 THFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 391
>gi|326931697|ref|XP_003211962.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Meleagris gallopavo]
Length = 411
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 179/326 (54%), Positives = 226/326 (69%), Gaps = 19/326 (5%)
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGA 125
+ F FP L S LS+DAMD++GEQ LDV+H++FK+RLD GN + E + G
Sbjct: 92 KCAFTDRFPHLLVSDLSIDAMDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEE 151
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
K+ P R C SCYGAES D CCN C++VREAYR++GWA NPD I+Q
Sbjct: 152 EKVFDPNSLDADR-------CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQ 204
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQ 240
CKREGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F
Sbjct: 205 CKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFG 264
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FG +PG+VNPLDG T + S M+QYF+KVVPTVY V G ++
Sbjct: 265 LDNINMTHYIKHLSFGRDYPGIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVR 324
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H F HFLT VCAIVGG+
Sbjct: 325 TNQFSVTRHEKIA-NGLIGDQGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGI 383
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
FTV+G ID+ IYH RAI+KKIE+GK
Sbjct: 384 FTVAGFIDSLIYHSARAIQKKIELGK 409
>gi|189237821|ref|XP_974331.2| PREDICTED: similar to AGAP012144-PA [Tribolium castaneum]
Length = 395
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 185/392 (47%), Positives = 246/392 (62%), Gaps = 21/392 (5%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K+R DAYPK ED +T+ G V+T++S +M LLF+ EL YL +L VDTS
Sbjct: 13 LGKLRRFDAYPKTLEDVRIKTYGGAVVTIISLTIMTLLFWVELVDYLTPNVSEELFVDTS 72
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
R +++IN D+ P + C L++DAMD SGEQHL + H+I+K+RLD QG IE +
Sbjct: 73 RSPSIQINLDIIVPTISCDFLALDAMDSSGEQHLQIDHNIYKRRLDLQGQPIEEPK---- 128
Query: 125 APKIDKPLQRHGGR----LEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL-SN 179
K D ++R N+T CGSCYGA + CCN CE+VREAYR++ WA N
Sbjct: 129 --KEDITIKRKNSTEVSVATVNKTECGSCYGASFDPKRCCNTCEDVREAYRERRWAFPEN 186
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
P+ I QCK E F +++K +GC IYG L VN+V+G+FH APGKSF + VHVHD+ F
Sbjct: 187 PENITQCKEERFSEKLKTAFAQGCQIYGSLTVNRVSGSFHIAPGKSFSINHVHVHDVQPF 246
Query: 240 QRDSFNISHKINKLAFGEHFPGVV-NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
FN +HKI L+FG NPL E + M+QY IK+VPT Y + G
Sbjct: 247 SSTEFNTTHKIRHLSFGASIDSDTHNPLKDTVGLAEEGASMFQYHIKIVPTAYVKLDGQF 306
Query: 299 IQSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
I +NQFSVT+H R S E G +PG+FF Y+LSP+ V +TE+ SF HF TNVCA
Sbjct: 307 ISANQFSVTKHRRVISLMSGESG----MPGIFFQYELSPLMVKYTEQSRSFGHFATNVCA 362
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
I+GGV+TV+G+ID +YH + I+KKIE+GKF
Sbjct: 363 IIGGVYTVAGLIDTMLYHSVKLIQKKIELGKF 394
>gi|198425065|ref|XP_002127888.1| PREDICTED: similar to ERGIC and golgi 3 [Ciona intestinalis]
Length = 385
Score = 360 bits (925), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 177/384 (46%), Positives = 248/384 (64%), Gaps = 16/384 (4%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+K++ DAYPK EDF +T SG +TL+S +MLLLF SEL+ YL ++L VD SR
Sbjct: 11 SKVKDFDAYPKTLEDFRIKTISGATVTLISGTIMLLLFLSELKYYLTTEVNSELFVDMSR 70
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
G L IN +VTFP +PC LS+D +D+SG++ +DV+H + K+ L+S G+ + A
Sbjct: 71 GNKLSINMNVTFPLVPCEFLSLDMIDVSGQRDIDVQHTLVKQPLNSDGSWVAE-----AA 125
Query: 126 PKID----KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
K+D KP+ YCGSC+GAE+ D CCN C +++EAYR+KGWA
Sbjct: 126 EKVDLVGTKPVLN--ATEPPPADYCGSCFGAETKDMTCCNTCSDIKEAYRRKGWAFPRDG 183
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
I C E KE G GC ++G LEVN+VAGNFH +PGKS+ +HVHD+ +
Sbjct: 184 SITPCIGE---DDDKEPVGSGCYLHGHLEVNRVAGNFHISPGKSYEVGHMHVHDMARMGK 240
Query: 242 -DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
N+SH N L+FG +PG V+PLD + S +QY++K+VPT Y +SG T
Sbjct: 241 YKESNVSHVFNHLSFGSTYPGQVHPLDNLEVIASESSVAFQYYVKIVPTTYEKLSGDTFH 300
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+NQFSVT H + ++ R ++LPG+F Y+LSP+ V + E SF+HFLT+VCAI+GG+FT
Sbjct: 301 TNQFSVTRHQKRNKDSR-ESLPGMFVSYELSPMMVRYVERRRSFVHFLTSVCAIIGGIFT 359
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
V+G+ D+FIYHG +A++KKIE+GK
Sbjct: 360 VAGLFDSFIYHGSKALQKKIELGK 383
>gi|340721521|ref|XP_003399168.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Bombus terrestris]
Length = 385
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 181/389 (46%), Positives = 243/389 (62%), Gaps = 11/389 (2%)
Query: 5 MNKIRSLDAYPKINE--DFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
M +R LD +PK+ E D RTFSG V+T++S+I+M +LF SE+ YL +L VD
Sbjct: 1 MQILRQLDVHPKVREEADILVRTFSGAVVTIISTIIMSILFLSEVNYYLTPTLSEELFVD 60
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
TSR LRIN D+ P + C +LS+DAMD +GEQHL ++H+IFK+RLD G IE Q
Sbjct: 61 TSRDSKLRINLDIIVPTISCDVLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRT 120
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+ E CG CYGA CCN CE+VREAYR K WA +
Sbjct: 121 DITDTKARSKTTEKTVESTTEKACGDCYGAAGDIIKCCNTCEDVREAYRLKNWAPPALGM 180
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
I QCK + +++IK +GC IYG++EVN+V G+FH APG SF + VHVHD+ +
Sbjct: 181 IKQCKNDKSVEKIKTAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVKPYTST 240
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
FN++HKI L+FG + PG NP+D + M+ ++IK+VPT Y G T+ +N
Sbjct: 241 QFNMTHKIRHLSFGLNIPGKTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTLLTN 300
Query: 303 QFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
QFSVT H R S E G +PG+FF Y+LSP+ V +TE+ SF HF TN CAI+GG
Sbjct: 301 QFSVTRHARQVSLFSGESG----MPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGG 356
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
VFTV+G+ID+ +YH RAI+KKIE+GK++
Sbjct: 357 VFTVAGLIDSLLYHSVRAIQKKIELGKYN 385
>gi|242007856|ref|XP_002424735.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
gi|212508228|gb|EEB11997.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
Length = 376
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 190/393 (48%), Positives = 252/393 (64%), Gaps = 26/393 (6%)
Query: 1 MDA--IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETK 58
MD+ I+N ++ D YPK +D+ RT GG +T+VS I+M LLF SEL YL +
Sbjct: 1 MDSAKIINTLKDFDGYPKTLDDYRIRTLGGGAVTVVSYIIMTLLFISELNTYLTPDISEE 60
Query: 59 LLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES 118
L VDT+R L+IN ++T P + C LS+DAMD SGEQHL ++H+I+K LD G I+
Sbjct: 61 LFVDTTREPKLQINLNITVPEISCKYLSLDAMDSSGEQHLQIEHNIYKVSLDKNGIPIKE 120
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWA 176
+ KP+ E E CGSCYGAES + CCN C +V++AY K+GW
Sbjct: 121 PE----KETFVKPVN------ETKEKKCGSCYGAESETLNITCCNTCADVKDAYMKRGWG 170
Query: 177 LSNPDLIDQCKREGFLQRIKEEE--GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
L+N +LI+QCK + + EGC IYG +EVN+V G+FH APG+SF + VHVH
Sbjct: 171 LNNLELIEQCK------NLSQNNIFNEGCFIYGTMEVNRVGGSFHIAPGQSFSINHVHVH 224
Query: 235 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTV--YT 292
D+ F +FN SHKI+ L+FG + PG NPLDG+ + M+QY+IK+VPT+ Y
Sbjct: 225 DVQPFSSKAFNTSHKIDHLSFGYNIPGKTNPLDGIVALTHEGATMFQYYIKIVPTIYYYY 284
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
D SG TI +NQFSVT H +S + + PG+FF Y+L+PI V +TE SF HF TNVC
Sbjct: 285 DKSG-TILTNQFSVTRHQKSGSE-TIGVPPGIFFNYELAPIMVKYTERKRSFGHFATNVC 342
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
AI+GGVFTV+ +IDAF+Y +A KKKIEIGKF
Sbjct: 343 AIIGGVFTVASLIDAFLYRSVQAFKKKIEIGKF 375
>gi|412994036|emb|CCO14547.1| predicted protein [Bathycoccus prasinos]
Length = 436
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 193/417 (46%), Positives = 248/417 (59%), Gaps = 41/417 (9%)
Query: 8 IRSLDAYPK-INEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+R DA+PK ++ DFYSR+F GG+IT+V+ IV + L +E +LYL + L VD RG
Sbjct: 17 LRKFDAFPKFVDVDFYSRSFGGGIITVVTYIVAVSLLLAETKLYLKTHVKHDLYVDNGRG 76
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDV-KHDIFKKRLDSQG-------NVIES 118
ET+RIN DV FP L C L +D MD+SGE HLDV H++ K R D G N
Sbjct: 77 ETMRINVDVFFPNLSCGSLGLDVMDVSGETHLDVVDHEMRKIRYDRYGVKLADALNDEHG 136
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNET------------------YCGSCYGAESS----- 155
+++ + D L N+T YCGSCYGA+ S
Sbjct: 137 KEEVVNEKAFDSNETETASSLRKNKTKKTAKELIPRYMEDGKTKYCGSCYGADVSGANRG 196
Query: 156 -DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKV 214
++ CC CEEVREAY + GWA + ++QCKREGF + + EGC GFL+VNKV
Sbjct: 197 REQRCCQTCEEVREAYIEVGWAFTGASSMEQCKREGFSEVLGNVHEEGCEFKGFLDVNKV 256
Query: 215 AGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE 274
GNFH APGKSF Q HVHD+ F FN SH++ L+FGE +PG V+PLDG + T +
Sbjct: 257 QGNFHIAPGKSFQQGEQHVHDLSPFPDGKFNFSHEVRHLSFGEGYPGKVDPLDGTKRTLK 316
Query: 275 TP--SGMYQYFIKVVPTVYTDVS--GHTIQSNQFSVTEHFR----SSEQGRLQTLPGVFF 326
P +G+YQYF ++VPT YT ++ I +NQ+SV +HF+ +S QG LPGVFF
Sbjct: 317 LPAETGVYQYFFRIVPTTYTYLNPFKKDISTNQYSVVDHFKPVDAASIQGGSSDLPGVFF 376
Query: 327 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 383
FYDLSPIKV E S FL VCA VGGVF VSGI+D +Y G AIKKKI++G
Sbjct: 377 FYDLSPIKVDIAEYRTSVWKFLAEVCASVGGVFAVSGIVDKVVYKGSLAIKKKIQLG 433
>gi|355686517|gb|AER98082.1| ERGIC and golgi 3 [Mustela putorius furo]
Length = 304
Score = 354 bits (908), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 171/309 (55%), Positives = 219/309 (70%), Gaps = 14/309 (4%)
Query: 58 KLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE 117
+L VD SRG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G +
Sbjct: 4 ELYVDKSRGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVS 63
Query: 118 SRQD----GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK 173
S + G K+ P R C SCYGAE+ D CCN CE+VREAYR++
Sbjct: 64 SEAERHELGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKCCNTCEDVREAYRRR 116
Query: 174 GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
GWA NPD I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV
Sbjct: 117 GWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHV 176
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
HD+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY
Sbjct: 177 HDLQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMK 236
Query: 294 VSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
V G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT V
Sbjct: 237 VDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGV 295
Query: 352 CAIVGGVFT 360
CAI+GG+FT
Sbjct: 296 CAIIGGMFT 304
>gi|157118753|ref|XP_001653244.1| ptx1 protein [Aedes aegypti]
gi|108875623|gb|EAT39848.1| AAEL008391-PA [Aedes aegypti]
Length = 384
Score = 351 bits (900), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 179/392 (45%), Positives = 255/392 (65%), Gaps = 17/392 (4%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ +R DAYPKI+++F RT G +T +S ++++L +SEL YL V +L VD
Sbjct: 2 TLLDSLRRFDAYPKIDKEFSIRTVGGATLTFISGTIIVVLIYSELIAYLTPVVTDELFVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQD 121
++RG+ L+IN D P + C +S+DA D +GEQHL ++H I+K+R+D QGN I E++++
Sbjct: 62 STRGQKLKINLDFYIPRISCDYVSLDAQDATGEQHLHIEHTIYKRRMDLQGNPIEEAKKE 121
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
I APK L++ E N C SCYGAE + CC C++V +AYR+K W NP+
Sbjct: 122 DISAPK--PRLEKK----EENVKKCRSCYGAEKNSTHCCETCQDVIDAYREKQW---NPN 172
Query: 182 LID--QCKREGFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
L D QC+ E L + E EGC IYG ++VN+V G+FH APGKSF S +HVHD+
Sbjct: 173 LDDFEQCQNEVLLGKKSLESKAFSEGCQIYGSMQVNRVGGSFHIAPGKSFSISHIHVHDV 232
Query: 237 LAFQRDSFNISHKINKLAFGEHFP-GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
F FN SH+IN L+FGE F G PLD T + M+QY+IK+VPT + ++
Sbjct: 233 QPFSSSRFNTSHRINTLSFGEEFGYGQTRPLDFTEKTAHEGAIMFQYYIKIVPTEFVPLN 292
Query: 296 GHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
G T+ +NQFSVT+H +S S +PG+F Y+LSP+ V FTE+ SF HF TN+CAI
Sbjct: 293 GPTLHTNQFSVTKHQKSVSVMSGESGMPGIFVNYELSPLMVRFTEKRNSFSHFATNLCAI 352
Query: 355 VGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
+GG+FTV+GIID+ ++ A+K+KIE+GKFS
Sbjct: 353 IGGIFTVAGIIDSLLFTSIHALKRKIELGKFS 384
>gi|444729170|gb|ELW69597.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Tupaia chinensis]
Length = 393
Score = 350 bits (898), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 190/419 (45%), Positives = 254/419 (60%), Gaps = 70/419 (16%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T++S ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + + +
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSTEAERHE 123
Query: 125 APKID-------------------------KPLQRHG----GRLE--------HNETYCG 147
KI+ KP G++E + C
Sbjct: 124 LGKIEVKVFDPNSLDPDRCESCYGAESEDIKPCLEAADLELGKIEVKVFDPNSLDPDRCE 183
Query: 148 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 207
SCYGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YG
Sbjct: 184 SCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYG 243
Query: 208 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 267
FLEVNK+ N++H I L+FGE +PG+VNPLD
Sbjct: 244 FLEVNKI------------------------------NMTHYIQHLSFGEDYPGIVNPLD 273
Query: 268 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVF 325
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 274 HTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVF 332
Query: 326 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 333 VLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 391
>gi|170031960|ref|XP_001843851.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Culex quinquefasciatus]
gi|167871431|gb|EDS34814.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Culex quinquefasciatus]
Length = 391
Score = 350 bits (898), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 178/394 (45%), Positives = 253/394 (64%), Gaps = 16/394 (4%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+++ R LDAYPKI+++F +T G +T +S +++ L +SE +L E +L VD
Sbjct: 3 LIDSFRRLDAYPKIDKEFSIKTIGGAALTTISGTIIVFLIYSEFVAFLTPTIEDQLFVDA 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES-RQDG 122
+RG+ LRIN D P + C +S+DA D +GEQHL + H+IFK+RLD +GN IE+ +++
Sbjct: 63 TRGQKLRINLDFVVPRVSCDYVSLDAQDATGEQHLHIDHNIFKRRLDLKGNPIEAPKKED 122
Query: 123 IGAPKIDKPLQRHGGRLEHNETY---CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
I APK K + ++ T CGSCYGA+ + CCN C++V +AYR+K W N
Sbjct: 123 IQAPKPRKDATE--APVVNSSTTANPCGSCYGAQKNSSHCCNTCQDVIDAYREKQW---N 177
Query: 180 PDL--IDQCKREGFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
P L +QCK E + ++ E EGC IYG++EVN+V G+FH APGKSF S +HVH
Sbjct: 178 PTLEEFEQCKTEVAIGKLSLEAKAFNEGCQIYGYMEVNRVGGSFHIAPGKSFSISHIHVH 237
Query: 235 DILAFQRDSFNISHKINKLAFGEHFP-GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
D+ F FN++H IN L+FGE F G +PLDG E + M+QY+IK+VPT +
Sbjct: 238 DVQPFSSSRFNMTHHINTLSFGEEFGFGQTSPLDGTDVIAEEGAMMFQYYIKIVPTEFVP 297
Query: 294 VSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+SG + +NQFSVT H +S S +PG+F Y+LSP+ V FTE+ SF HF TN+C
Sbjct: 298 LSGPKLHTNQFSVTTHRKSVSLMSGDSGMPGIFVNYELSPLMVKFTEKRSSFSHFATNLC 357
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
AI+GG+FTVSGI+D ++ A+K+KIE+GK S
Sbjct: 358 AIIGGIFTVSGIVDTLLFTSIHALKRKIELGKAS 391
>gi|158300475|ref|XP_320382.3| AGAP012144-PA [Anopheles gambiae str. PEST]
gi|157013177|gb|EAA00591.3| AGAP012144-PA [Anopheles gambiae str. PEST]
Length = 386
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 178/394 (45%), Positives = 249/394 (63%), Gaps = 23/394 (5%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ +R LDAYPKI+ +F RT SG +TL+SSIV++ L E+ YL+ +L VDT+
Sbjct: 4 LDSLRRLDAYPKIDNEFSIRTVSGAALTLISSIVIVTLVIGEINAYLSPNVSEELFVDTT 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG L+IN D T P + C +S+DA D +GEQHL ++H+I+K+RLD QGN IE
Sbjct: 64 RGHKLKINLDFTIPRISCDYVSLDAQDSTGEQHLHIEHNIYKRRLDLQGNQIEE------ 117
Query: 125 APKIDKPLQRHGGRLEHNET--------YCGSCYGAESSDEDCCNNCEEVREAYRKKGWA 176
PK + +Q R+ E CGSCYGA + CCN C+EV +AYR++ W
Sbjct: 118 -PK-KEDIQASTKRISSTEAPATTTVKPACGSCYGAAKNASQCCNTCQEVIDAYRERKW- 174
Query: 177 LSNPDLID--QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
NP++ D QCK + EGC+IYG +EVN+V G FH APGKSF + +HVH
Sbjct: 175 --NPNVEDFEQCKNGNGGSVEGKAFSEGCHIYGTMEVNRVEGRFHIAPGKSFSINHIHVH 232
Query: 235 DILAFQRDSFNISHKINKLAFGEHFP-GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
D+ + FN +H+IN L+FGE F G PLDG+ + M+QY+IK+VPT++
Sbjct: 233 DVQPYSSSRFNTTHRINTLSFGEQFGFGTTRPLDGLMVEATEGAMMFQYYIKIVPTMFVP 292
Query: 294 VSGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
++G T+ +NQFSVT+H +S +T +PG+F Y+LSP+ V FTE+ S HF TNVC
Sbjct: 293 LNGPTLYTNQFSVTKHQKSVTAMSGETGMPGIFVNYELSPLMVKFTEKRNSLGHFATNVC 352
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
AI+GG+FTV+GIID+ ++ IK+KIE+GK S
Sbjct: 353 AIIGGIFTVAGIIDSLLFTSIHVIKRKIELGKAS 386
>gi|321463520|gb|EFX74535.1| hypothetical protein DAPPUDRAFT_226626 [Daphnia pulex]
Length = 381
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 170/378 (44%), Positives = 245/378 (64%), Gaps = 10/378 (2%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
+++DAYPK EDF RT +G ++T+ SSI+M LF E R +L+ +L VDT+R
Sbjct: 10 KTIDAYPKTLEDFTIRTATGAMVTVFSSIIMAFLFVIEFRDFLSINVSEQLYVDTTRIPN 69
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI 128
++INFDVTFP + CS LSVDA+D SGEQ V+H+IFK+RL+ G +++ + +I
Sbjct: 70 MKINFDVTFPTISCSYLSVDAVDSSGEQQFGVEHNIFKQRLNLLGEPLQAAE----LEEI 125
Query: 129 DKPLQRHGGRLEHNETY-CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+K + E + + C SCYGA+ E CC C EVREAYR+K WA P+ +QC+
Sbjct: 126 NKTHNKTETSTEESASKPCNSCYGAK---EGCCETCAEVREAYRQKNWAF-RPEEFEQCR 181
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
E L R EGC +YG+LEVN+V+G+FH APGKS+ + VHVHD+ + + FN++
Sbjct: 182 NEKNLTRDYSAFKEGCKLYGYLEVNRVSGSFHIAPGKSYAINHVHVHDVQPYSSEDFNVT 241
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
H IN L+FG G NPLDG T + + M+QY+IKVVPT Y + G +NQ+SVT
Sbjct: 242 HHINSLSFGTSLIGKENPLDGFLTTADKGAMMFQYYIKVVPTWYVKLDGEEFHTNQYSVT 301
Query: 308 EHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
H + S G +PGVFF Y++SP+++++ E S HF T+VC I+GGVFTV+GIID
Sbjct: 302 RHQKVVSSYGGESGVPGVFFTYEMSPLQISYKESKRSIGHFATDVCTIIGGVFTVAGIID 361
Query: 367 AFIYHGQRAIKKKIEIGK 384
+ +Y + +++K+++GK
Sbjct: 362 SLLYRSSKLLQQKLQLGK 379
>gi|357612408|gb|EHJ67977.1| hypothetical protein KGM_08440 [Danaus plexippus]
Length = 385
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 172/384 (44%), Positives = 234/384 (60%), Gaps = 8/384 (2%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
I+ K + LDAY K EDF +T +G +IT+ + VM+LL EL Y++ +L VDT
Sbjct: 5 IIGKFKQLDAYAKTLEDFRVKTATGAIITVTGAFVMILLIVLELHTYMSPNISEELFVDT 64
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDG 122
SRG LRINFD+ P + C L +DAMD SGEQHL + H++ K+RLD G I E ++
Sbjct: 65 SRGHKLRINFDIVVPRISCDYLVLDAMDSSGEQHLQMDHNVHKRRLDLDGVPIKEPIKED 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
I K E CGSCYGA +D CCN CE+V+EAYR + WAL +
Sbjct: 125 ISLSSTVKQ-----NSSEIAIVTCGSCYGAAFNDSQCCNTCEDVKEAYRLRRWALPDLAT 179
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
++QCK + L+R EGC IYG++EVN+V G+FH APGKSF + VHVHD+ F
Sbjct: 180 VEQCKDDDSLERTNLALKEGCQIYGYMEVNRVGGSFHIAPGKSFTINHVHVHDVQPFSSS 239
Query: 243 SFNISHKINKLAFGEHFPGV-VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
FN +H I L+FG PLDG+ + + M+QY++K+VPT+Y + G + +
Sbjct: 240 VFNTTHIIRHLSFGSDIESANTAPLDGITGLAKEGAVMFQYYLKIVPTMYVKLDGTILHT 299
Query: 302 NQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
NQFSVT H +S +++ +PG FF Y+LSP+ V +T + S HF TNVCAIVGGVFT
Sbjct: 300 NQFSVTRHQKSVSNINVESGMPGAFFSYELSPLMVKYTAKGRSIGHFATNVCAIVGGVFT 359
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
V+GI D +YH A + K+ +GK
Sbjct: 360 VAGIFDTLLYHSLNAFQNKVVLGK 383
>gi|328770814|gb|EGF80855.1| hypothetical protein BATDEDRAFT_19389 [Batrachochytrium
dendrobatidis JAM81]
Length = 409
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 173/388 (44%), Positives = 247/388 (63%), Gaps = 9/388 (2%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I++ ++ DAY K +DF RT SG ++T+VS++V+L L FSE + L VD
Sbjct: 23 GILSDLKKYDAYAKPLDDFRIRTISGALVTVVSTLVILFLTFSEFTDWYQKEMLPSLEVD 82
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
R E + IN +VTF +PC +LSVD MD+SGE ++ H + K R+D GN++E +Q
Sbjct: 83 KGRKEKMNINLNVTFYHMPCYLLSVDVMDVSGEHQNNLPHSMHKVRIDQLGNLLE-KQKK 141
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+G +++ + + YCGSCYG + + CCN CE+V+EAY + GW+ ++PD
Sbjct: 142 LGNTN-SSGVKKEIRDMALDPKYCGSCYGGVAPESKCCNTCEQVQEAYERSGWSFTDPDS 200
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ-- 240
I+QC REG+ +R++ + E CNIYG +EVNKV GN HFAPG SF Q+ +HVHD+ +
Sbjct: 201 IEQCVREGWSKRMETQINEACNIYGHIEVNKVQGNIHFAPGHSFQQNALHVHDLHDYNAP 260
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
SFN H I++L+FGE VNPLD V T T YQY+IKVV T + ++G +
Sbjct: 261 NGSFNFKHTIHELSFGES-SSFVNPLDTVTKTPPTKYFSYQYYIKVVGTDISYLNGSQLT 319
Query: 301 SNQFSVTEHFRSSEQ--GRLQT-LPGVFFF-YDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+NQFSVTEH + G L +PG FF +++SP+ V F E F HFLT++CAI+G
Sbjct: 320 TNQFSVTEHEQDVTPLFGALPIGMPGKLFFNFEISPMLVKFKEFRKPFTHFLTDLCAIIG 379
Query: 357 GVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
GVFTV+G+IDA ++ QR+I+ K+EIGK
Sbjct: 380 GVFTVAGMIDALLFATQRSIQAKVEIGK 407
>gi|296481082|tpg|DAA23197.1| TPA: endoplasmic reticulum-Golgi intermediate compartment protein 3
[Bos taurus]
Length = 306
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 166/309 (53%), Positives = 215/309 (69%), Gaps = 11/309 (3%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN +V FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE D CCN+CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 236
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G ++
Sbjct: 237 LDNINMTHYIRHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296
Query: 301 SNQFSVTEH 309
+NQFSVT H
Sbjct: 297 TNQFSVTRH 305
>gi|195378906|ref|XP_002048222.1| GJ11466 [Drosophila virilis]
gi|194155380|gb|EDW70564.1| GJ11466 [Drosophila virilis]
Length = 372
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 175/384 (45%), Positives = 239/384 (62%), Gaps = 23/384 (5%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++S+ ++ LL F E Y+ + +L VDT+RG
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLVFLEFLNYMKPMLSEELFVDTTRGH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HD+FK RLD +G P
Sbjct: 67 KLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLEGQ-----------PL 115
Query: 128 IDKPLQRHGGRLEHNE-TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ + CGSCYGAE + CCN CE+V +AYR + W + D I+QC
Sbjct: 116 KETPIKEIVAVSPPNKNSTCGSCYGAEHNATHCCNTCEDVLDAYRVRKWNM-QVDKIEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRTDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFTNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSN 302
+SH IN L+FGE +PLDG+R QE+ S M+ Y++K+VPT+Y S G I +N
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGLRVEVQESKSEMFNYYLKIVPTLYERHSDGQPIYTN 289
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
QFSVT H R R + +PG+FF Y+LSP+ V + E HVSF HF TN C+IVGGVFTV+
Sbjct: 290 QFSVTRH-RKDLTDRERGMPGIFFSYELSPLMVKYAERHVSFGHFATNCCSIVGGVFTVA 348
Query: 363 GIIDAFIYHGQRAIKKKIEIGKFS 386
GI+ + + A+++K+E+GK S
Sbjct: 349 GILAVLLNNSWEALQRKLEVGKLS 372
>gi|150036309|emb|CAO03349.1| ERGIC and golgi 3 [Homo sapiens]
Length = 325
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 169/329 (51%), Positives = 220/329 (66%), Gaps = 18/329 (5%)
Query: 13 AYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRIN 72
AYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD SRG+ L+IN
Sbjct: 1 AYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGDKLKIN 60
Query: 73 FDVTFPALPCSI------------LSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S
Sbjct: 61 IDVLFPHMPCAWSQYLSLIFLLPDLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEA 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+ K++ + + C SCYGAE+ D CCN CE+VREAYR++GWA NP
Sbjct: 121 ERHELGKVEVTVFDPDSL---DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNP 177
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 178 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 237
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G ++
Sbjct: 238 LDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 297
Query: 301 SNQFSVTEHFRSSEQGRL--QTLPGVFFF 327
+NQFSVT H + + G L Q LPGVF
Sbjct: 298 TNQFSVTRHEKVA-NGLLGDQGLPGVFVL 325
>gi|307105810|gb|EFN54058.1| hypothetical protein CHLNCDRAFT_25376, partial [Chlorella
variabilis]
Length = 312
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 160/315 (50%), Positives = 215/315 (68%), Gaps = 15/315 (4%)
Query: 82 CSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID--KPLQRHGGRL 139
CS LS+DAMDISGE L+V HD++K+RL G + D G P+ KP+ +
Sbjct: 1 CSWLSIDAMDISGEVQLEVDHDVYKRRLSPDGTPL----DEGGCPRAGWLKPVPGNDSEA 56
Query: 140 EHNET--YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 197
+ + YCGSCYG+ES CCN C EVR+AYR KGWAL + + ++QC EG+ + I E
Sbjct: 57 DPTKAPGYCGSCYGSESRAGQCCNTCAEVRDAYRTKGWALLDVEKVEQCHHEGYKEEIDE 116
Query: 198 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 257
++GEGC+++G L++NKVAGNFH APG+S+ Q +H+HD+ F +F+ SH I+KLAFG
Sbjct: 117 QKGEGCHVWGELQINKVAGNFHIAPGRSYQQGNMHIHDLSPFAGQAFDFSHTIHKLAFGR 176
Query: 258 HFPG----VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-- 311
+PG ++ T+ G+YQYF+KVVPT Y+D+ +TI +NQFSVTEHFR
Sbjct: 177 EYPGTRGQALSTFCLSVGTRRERMGLYQYFLKVVPTSYSDLRNNTIYTNQFSVTEHFRET 236
Query: 312 SSEQGRLQTLPGVFFFYDLSPIKVTFT-EEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
+S LPGVF FYDLSPIK + +SFL FLT++CAI+GGVFTVSGIIDA +Y
Sbjct: 237 ASPTAGGGQLPGVFLFYDLSPIKASLEGRARLSFLSFLTSLCAIIGGVFTVSGIIDATVY 296
Query: 371 HGQRAIKKKIEIGKF 385
HGQ+AIKKK+++GK
Sbjct: 297 HGQQAIKKKLDLGKL 311
>gi|225708964|gb|ACO10328.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Caligus rogercresseyi]
Length = 385
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 168/384 (43%), Positives = 236/384 (61%), Gaps = 15/384 (3%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYPK EDF +T SGG ITL+S ++M+ LF SE+R YL + +L VDTS+G
Sbjct: 8 LRRLDAYPKTLEDFRIQTLSGGAITLLSGVLMVFLFASEIREYLTPRVQEELFVDTSKGG 67
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGA 125
L+IN DV F ++ C L +DAMD+SGE H+D+ H+I+K+RL +G+ +E R+ +G
Sbjct: 68 KLKINLDVVFNSVSCDFLVLDAMDVSGESHVDIVHNIYKRRLSLEGSPMEEPRRETEVGQ 127
Query: 126 PKID-KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K P ++ E + CGSCYGAE+ CCN+C EV+EAYR+KGW + +
Sbjct: 128 KKTTHAPSPKN----ETSTPPCGSCYGAETPGSPCCNSCGEVKEAYRRKGWTIVAAKF-E 182
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+ + + I+ EGC IYG L VN+V G+FH PGKSF + +H+HD+ F F
Sbjct: 183 QCEMD--TEGIERVYKEGCQIYGSLLVNRVGGSFHIVPGKSFTLNHLHIHDLQPFSSGEF 240
Query: 245 NISHKINKLAFGEHF---PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N SH+I L+FG PG N LD V MYQY++K+VPT Y+ G T
Sbjct: 241 NTSHRIRHLSFGSKTALDPG-GNALDAVSALSPKGGLMYQYYLKIVPTTYSRSDGGTFTG 299
Query: 302 NQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
NQ+SVT + S +PGVFF Y+L+P+ V ++E+ SF HF T +CAI+GGVFT
Sbjct: 300 NQYSVTRLEKDVSSSLDSGGMPGVFFNYELAPLMVKYSEKEKSFGHFATGLCAIIGGVFT 359
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
++ D FIY + +++K +GK
Sbjct: 360 LASAFDKFIYSSSKILEEKFGLGK 383
>gi|194751543|ref|XP_001958085.1| GF10736 [Drosophila ananassae]
gi|190625367|gb|EDV40891.1| GF10736 [Drosophila ananassae]
Length = 372
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 174/384 (45%), Positives = 235/384 (61%), Gaps = 23/384 (5%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++S+ ++ LL F E Y+ +L VDT+RG
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLNYMQPTMNEELFVDTTRGH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HDIFK RLD +G P
Sbjct: 67 KLRINLDVTLHNLGCNYVSLDAMDSSGDTHLRVDHDIFKHRLDLKGE-----------PL 115
Query: 128 IDKPLQRHGGRLEHNETY-CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ CGSCYGAE + CCN CEEV +AYR + W + D I+QC
Sbjct: 116 KETPIKEIVAVSPPNKNVTCGSCYGAEHNSTHCCNTCEEVLDAYRLRKWNV-QVDKIEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRTDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYT-DVSGHTIQSN 302
+SH IN L+FGE +PLDG+ +E S M+ Y++K+VPT+Y D G I +N
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGMHVEVEEKKSEMFNYYLKIVPTLYMRDSDGKPIYTN 289
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
QFSVT H R R + +PG+FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV+
Sbjct: 290 QFSVTRH-RKDLSDRERGMPGIFFSYELSPLMVKYAEKHSSFGHFATNCCSIIGGVFTVA 348
Query: 363 GIIDAFIYHGQRAIKKKIEIGKFS 386
GI+ + + AI++K+E+GK S
Sbjct: 349 GILAVLLNNSLEAIQRKLEVGKLS 372
>gi|145340712|ref|XP_001415464.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575687|gb|ABO93756.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 379
Score = 317 bits (813), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 167/385 (43%), Positives = 239/385 (62%), Gaps = 18/385 (4%)
Query: 12 DAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS-RGETLR 70
D +PKI++DF RT +GG I + +M++LF + + T L VD G T +
Sbjct: 1 DLFPKISDDFARRTATGGAIATIGLALMVILFLQQTAELMRTTTAYDLRVDDGVAGATKK 60
Query: 71 I--NFDVTFPALPCSILSVDAMDISGEQHLDV-KHDIFKKRLDSQGNVIE--SRQDGIGA 125
I N D+T A+ C+ +S+DAMD++GE LDV + ++ R+D++G I S + + A
Sbjct: 61 IVINVDLTLRAMHCAQVSLDAMDVTGETRLDVSRSEVRTTRVDARGRAIAMTSERTAVNA 120
Query: 126 PKIDKPLQRH--GGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
+R GGR + CG CYGA + CC++C+ VREAYR KGWAL + +
Sbjct: 121 KTEAGEREREATGGR-----SACGDCYGAAEAGT-CCDDCDSVREAYRVKGWALPDLRRV 174
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-D 242
QC +E + ++ E EGC+ G EVNKVAGNFH APGKS++ G HVHD+ F +
Sbjct: 175 TQCTKEYDVVAMRNEHKEGCHFSGHFEVNKVAGNFHIAPGKSYNNLGQHVHDLSPFAGVE 234
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVS--GHTI 299
SFN SH I+KL+FGE FPGVVNPLDGV R + +G+YQY + VVP Y + +
Sbjct: 235 SFNFSHIIHKLSFGEEFPGVVNPLDGVTRTMDDANAGVYQYRLSVVPARYKYLGFRARVV 294
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+SN +SVT+HFR + + LPG+FFFYDLSP++V + E + F +L+NV AI+GGV
Sbjct: 295 ESNDYSVTDHFRGFDVTKNPGLPGLFFFYDLSPLRVEYEERRIGFFQYLSNVAAIIGGVS 354
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGK 384
V I+D +Y GQRA+++K+++GK
Sbjct: 355 AVVNIVDGLVYRGQRALREKVDLGK 379
>gi|195327731|ref|XP_002030571.1| GM24497 [Drosophila sechellia]
gi|195590409|ref|XP_002084938.1| GD12569 [Drosophila simulans]
gi|194119514|gb|EDW41557.1| GM24497 [Drosophila sechellia]
gi|194196947|gb|EDX10523.1| GD12569 [Drosophila simulans]
Length = 373
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 172/385 (44%), Positives = 237/385 (61%), Gaps = 24/385 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++S+ ++ LL F E+ Y+ +L VDT+RG
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVLNYMQPTLNEELFVDTTRGH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HD+FK RLD G P
Sbjct: 67 KLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGE-----------PL 115
Query: 128 IDKPLQRHGGRLEHNETY-CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ CGSCYGAE + CCN CE+V +AYR + W ++ D I+QC
Sbjct: 116 KETPIKEIVAVSPPNKNVTCGSCYGAEHNATHCCNTCEDVLDAYRLRKWTVA-VDKIEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRSDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQS 301
+SH IN L+FGE +PLDG+R ET S M+ Y++K+VPT+Y + G I +
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYT 289
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQFSVT +R R + +PG+FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV
Sbjct: 290 NQFSVTR-YRKDLSDRERGMPGIFFSYELSPLMVKYAEKHSSFGHFATNCCSIIGGVFTV 348
Query: 362 SGIIDAFIYHGQRAIKKKIEIGKFS 386
+GI+ + + AI++K+E+GK S
Sbjct: 349 AGILAVLLNNSWEAIQRKLEVGKLS 373
>gi|194374867|dbj|BAG62548.1| unnamed protein product [Homo sapiens]
Length = 321
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 162/309 (52%), Positives = 211/309 (68%), Gaps = 21/309 (6%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S G
Sbjct: 64 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSS-----G 118
Query: 125 APKIDKPLQRHG-GRLE--------HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW 175
A +RH G++E + C SCYGAE+ D CCN CE+VREAYR++GW
Sbjct: 119 A-------ERHELGKVEVTVFDPDSLDPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGW 171
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
A NPD I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD
Sbjct: 172 AFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHD 231
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V
Sbjct: 232 LQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 291
Query: 296 GHTIQSNQF 304
G Q +
Sbjct: 292 GEVSQGAPY 300
>gi|195495133|ref|XP_002095138.1| GE19855 [Drosophila yakuba]
gi|194181239|gb|EDW94850.1| GE19855 [Drosophila yakuba]
Length = 373
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 171/385 (44%), Positives = 237/385 (61%), Gaps = 24/385 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++S+ ++ LL F E+ Y+ +L VDT+RG
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVINYMQPTLNEELFVDTTRGH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HD+FK RLD G P
Sbjct: 67 KLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGE-----------PL 115
Query: 128 IDKPLQRHGGRLEHNETY-CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ CGSCYGAE + CCN CE+V +AYR + W ++ D I+QC
Sbjct: 116 KETPIKEIVAVSPPNKNVTCGSCYGAEHNATHCCNTCEDVLDAYRLRKWNVA-VDKIEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRSDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQS 301
+SH IN L+FGE +PLDG+R ET S M+ Y++K+VPT+Y + G I +
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYT 289
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQFSVT +R R + +PG+FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV
Sbjct: 290 NQFSVTR-YRKDLSDRERGMPGIFFSYELSPLMVKYAEKHSSFGHFATNCCSIIGGVFTV 348
Query: 362 SGIIDAFIYHGQRAIKKKIEIGKFS 386
+GI+ + + A+++K+E+GK S
Sbjct: 349 AGILAVLLNNSWEALQRKLEVGKLS 373
>gi|125978263|ref|XP_001353164.1| GA20029 [Drosophila pseudoobscura pseudoobscura]
gi|54641917|gb|EAL30666.1| GA20029 [Drosophila pseudoobscura pseudoobscura]
Length = 372
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 173/384 (45%), Positives = 235/384 (61%), Gaps = 23/384 (5%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++S+ ++ LL F E Y+ +L VDT+RG
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLSYMQPALNEELFVDTTRGH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HDIFK RLD +G P
Sbjct: 67 KLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDIFKHRLDLKGE-----------PL 115
Query: 128 IDKPLQRHGGRLEHNETY-CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ CGSCYGAE + CCN CE+V +AYR W + D I+QC
Sbjct: 116 KETPIKEIVAVSPPNKNVTCGSCYGAEHNATHCCNTCEDVLDAYRLHKWNV-QVDKIEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRTDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSN 302
+SH IN L+FGE +PLDG+R ET S M+ Y++K+VPT+Y S G I +N
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMRQSDGQPIYTN 289
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
QFSVT +R R + +PG+FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV+
Sbjct: 290 QFSVTR-YRKDLTDRERGMPGIFFSYELSPLMVKYAEKHNSFGHFATNCCSIIGGVFTVA 348
Query: 363 GIIDAFIYHGQRAIKKKIEIGKFS 386
GI+ + + AI++K+++GK S
Sbjct: 349 GILAVLLNNSWEAIQRKLDVGKLS 372
>gi|21357439|ref|NP_648758.1| CG7011 [Drosophila melanogaster]
gi|7294304|gb|AAF49653.1| CG7011 [Drosophila melanogaster]
gi|16768234|gb|AAL28336.1| GH25868p [Drosophila melanogaster]
gi|220946650|gb|ACL85868.1| CG7011-PA [synthetic construct]
Length = 373
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 171/385 (44%), Positives = 235/385 (61%), Gaps = 24/385 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++S+ ++ LL F E+ Y+ +L VDT+R
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVLNYMQPTLNEELFVDTTRDH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HD+FK RLD G P
Sbjct: 67 KLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGE-----------PL 115
Query: 128 IDKPLQRHGGRLEHNETY-CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ CGSCYGAE + CCN CE+V +AYR + W ++ D I+QC
Sbjct: 116 KETPIKEIVAVSPPNKNVTCGSCYGAEHNATHCCNTCEDVLDAYRLRKWTVA-VDKIEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRSDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQS 301
+SH IN L+FGE +PLDG+R ET S M+ Y++K+VPT+Y + G I +
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYT 289
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQFSVT +R R + +PG+FF Y+LSP+ V + E H SF HF TN C+I+GGVFTV
Sbjct: 290 NQFSVTR-YRKDLSDRERGMPGIFFSYELSPLMVKYAERHSSFGHFATNCCSIIGGVFTV 348
Query: 362 SGIIDAFIYHGQRAIKKKIEIGKFS 386
+GI+ + + AI++K+E+GK S
Sbjct: 349 AGILAVLLNNSWEAIQRKLEVGKLS 373
>gi|194872681|ref|XP_001973062.1| GG13555 [Drosophila erecta]
gi|190654845|gb|EDV52088.1| GG13555 [Drosophila erecta]
Length = 373
Score = 313 bits (803), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 171/385 (44%), Positives = 236/385 (61%), Gaps = 24/385 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++S+ ++ LL F E+ Y+ +L VDT+RG
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVLNYMQPTLNEELFVDTTRGH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HD+FK RLD G P
Sbjct: 67 KLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGE-----------PL 115
Query: 128 IDKPLQRHGGRLEHNETY-CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ CGSCYGAE + CCN CEEV +AYR + W ++ D I+QC
Sbjct: 116 KETPIKEIVAVSPPNKNVTCGSCYGAEHNATHCCNTCEEVLDAYRLRKWNVA-VDKIEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRSDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQS 301
+SH IN L+FGE +PLDG+R ET S M+ Y++K+VPT+Y + G I +
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGLRVEVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYT 289
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQFSVT +R R + +PG+FF Y+LSP+ V + E+ SF HF TN C+I+GGVFTV
Sbjct: 290 NQFSVTR-YRKDLSDRERGMPGIFFSYELSPLMVKYAEKRSSFGHFATNCCSIIGGVFTV 348
Query: 362 SGIIDAFIYHGQRAIKKKIEIGKFS 386
+GI+ + + A+++K+E+GK S
Sbjct: 349 AGILAVLLNNSWEALQRKLEVGKLS 373
>gi|195441336|ref|XP_002068468.1| GK20487 [Drosophila willistoni]
gi|194164553|gb|EDW79454.1| GK20487 [Drosophila willistoni]
Length = 372
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 171/384 (44%), Positives = 236/384 (61%), Gaps = 23/384 (5%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++S+ ++ LL F E Y+ +L VDT+R
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLNYMRPTLNEELFVDTTRNH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HD+FK RLD +G P
Sbjct: 67 KLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLKGE-----------PL 115
Query: 128 IDKPLQRHGGRLEHNE-TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ + CGSCYGAE + CCN CE+V +AY K W++ D ++QC
Sbjct: 116 KETPIKEIVAVSPANKNSTCGSCYGAEHNATHCCNTCEDVLDAYHLKKWSVQ-VDKLEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRTDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSN 302
+SH IN L+FGE +PLDG+R +E+ S M+ Y+IK+VPT+Y S G I +N
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGLRVNVEESKSEMFNYYIKIVPTLYERNSDGQPIYTN 289
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
QFSVT +R R + +PG+FF Y+LSP+ V + E H SF HF TN C+I+GGVFTV+
Sbjct: 290 QFSVTR-YRKDLTDRERGMPGIFFSYELSPLMVKYAERHNSFGHFATNCCSIIGGVFTVA 348
Query: 363 GIIDAFIYHGQRAIKKKIEIGKFS 386
GI+ + + AI++K+E+GK S
Sbjct: 349 GILAVLLNNSWEAIQRKLEVGKLS 372
>gi|156552683|ref|XP_001599365.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Nasonia vitripennis]
Length = 328
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 215/335 (64%), Gaps = 16/335 (4%)
Query: 58 KLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE 117
+L VDTSRG L+IN D+ ++ C +LS+DAMD +GE HL+++H+IFK+RLD G IE
Sbjct: 4 ELFVDTSRGSKLKINLDIVISSIACDMLSIDAMDTTGETHLEIQHNIFKRRLDLDGKPIE 63
Query: 118 S-RQDGIGAPK--IDKPLQRHGGRLEHNETYCGSCYGAESSDE--DCCNNCEEVREAYRK 172
++ GI PK +KP E+ CG CYGA S + CCN CEEV+EAYRK
Sbjct: 64 DPKKTGIADPKKTTEKPA-------ENATAKCGDCYGAASEELGIKCCNTCEEVKEAYRK 116
Query: 173 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
+ WA+ + QCK + + +E GC IYGF+EVN+V G+FH APG S +H
Sbjct: 117 RKWAVHDTSRFAQCKNDKSREMTFKE---GCQIYGFMEVNRVGGSFHIAPGDSITIDHLH 173
Query: 233 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
VHD+ + FN++H+I L+FG + PG NP+D + M+ ++IK+VPT +
Sbjct: 174 VHDVQPYSSSQFNLTHRIRHLSFGTNIPGKTNPIDNTTVIASEGATMFHHYIKIVPTTFM 233
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
+ G + +NQFS+T+H RS +Q ++ +PG+FF Y+LSP+ V +T+ S H +TN
Sbjct: 234 RLDGSILHTNQFSLTKHSRSIKQYSGESGMPGLFFSYELSPLMVKYTQTVKSLGHLMTNT 293
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
CAI+GG FTV+ IIDAF+YH RAI+KK+E+GK S
Sbjct: 294 CAIIGGTFTVASIIDAFLYHSVRAIQKKMELGKLS 328
>gi|331241265|ref|XP_003333281.1| hypothetical protein PGTG_14201 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309312271|gb|EFP88862.1| hypothetical protein PGTG_14201 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 421
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 167/404 (41%), Positives = 235/404 (58%), Gaps = 46/404 (11%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
+ LD + K ED +T GG++T+ S+ ++ L E R Y + +LVD SRGE
Sbjct: 12 KGLDGFSKTMEDVKVKTGFGGMLTMASAALIFTLILVEFRDYRQIHVQPSILVDKSRGEK 71
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI 128
L ++ ++TFP +PC +LSVD MDISGE DV HD+ K RL G P
Sbjct: 72 LLVHMNITFPRVPCYLLSVDVMDISGEHQNDVAHDLAKTRLGLDG-----------VPLS 120
Query: 129 DKPLQRHGGRLE-----HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
Q+ G LE + YCGSCYG E CCN+CEEVRE+Y ++GW+ +NPD I
Sbjct: 121 TNTTQKLQGELETIIASRAKDYCGSCYGGEPGPSGCCNSCEEVRESYVRRGWSFNNPDGI 180
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
+QC +E + +RIKE+ EGCNI G L+VNKV GNFH +PG+SF VHVHD++ + +DS
Sbjct: 181 EQCVQEHWSERIKEQSKEGCNINGVLKVNKVIGNFHLSPGRSFQTHQVHVHDLVPYLQDS 240
Query: 244 --FNISHKINKLAFGE--------------HFPGVVNPLDGVRWTQETPSGMYQYFIKVV 287
+ H I+ AF + G+VNPLDGV+ E + M+QYF+KVV
Sbjct: 241 NLHDFGHVIHNFAFMDANQPTETAHTLRLKKTLGIVNPLDGVKAHTEASNYMFQYFLKVV 300
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFR---------SSEQGRLQT-----LPGVFFFYDLSPI 333
T + + G +++Q+SVT++ R + E G L + +PGVFF Y++SP+
Sbjct: 301 GTQFQLLDGQVAKTHQYSVTQYERDLDNSDKSDADELGHLTSHGHSGVPGVFFNYEISPM 360
Query: 334 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIK 377
+V E SF HF T+ CAIVGGV TV+G++D+F+Y Q +K
Sbjct: 361 QVVHQEYRQSFAHFATSTCAIVGGVLTVAGLLDSFVYGAQNRMK 404
>gi|358054679|dbj|GAA99605.1| hypothetical protein E5Q_06306 [Mixia osmundae IAM 14324]
Length = 424
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 161/399 (40%), Positives = 232/399 (58%), Gaps = 35/399 (8%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
+ LDA+ K ED +T GG++TLVS ++ L E Y ++VD SRGE
Sbjct: 12 KGLDAFGKTLEDVKIKTGFGGILTLVSFTLIAALTLMEFVDYRRVHLHPSIVVDKSRGEK 71
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI 128
L ++ ++TFP +PC +LSVD MDISGE D+ HDI K RLD G ++++ +D +
Sbjct: 72 LVVHLNITFPRVPCYLLSVDIMDISGEHQNDIHHDILKNRLDKSGALVQATRD----STL 127
Query: 129 DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKR 188
L+R G ++ YCGSCYG D CCN C+EVRE+Y ++GW+ NPD IDQC R
Sbjct: 128 KGELERAVG-VKREPGYCGSCYGGAPGDSGCCNTCDEVRESYVRRGWSFVNPDGIDQCVR 186
Query: 189 EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNI 246
EGF ++IKE+ EGCN+ G ++VNKV GNFH +PGKSF + HVHD++ + +
Sbjct: 187 EGFSEKIKEQSEEGCNVAGQVKVNKVIGNFHLSPGKSFQSNMHHVHDLVPYLAAGQQHDF 246
Query: 247 SHKINKLAFG--------------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
H IN+ +F + + +PL GVR E + M+QYF+KVV T +
Sbjct: 247 GHIINRFSFAAEGDDGFNRETARLKQSLNIEDPLTGVRAHTEQSNYMFQYFVKVVSTKFK 306
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFT 338
+ G T+ S+Q+SVT++ R +G +PG+FF Y++SP+ V
Sbjct: 307 TLDGRTLSSHQYSVTQYERDLSKGNKPGKDEDGHQTSHGYAGVPGLFFNYEISPMLVVHR 366
Query: 339 EEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIK 377
EE SF HF+T+ CAIVGG+ TV+G+ID +Y Q ++
Sbjct: 367 EERQSFAHFITSTCAIVGGILTVAGLIDTLVYSSQTRLQ 405
>gi|195126511|ref|XP_002007714.1| GI12235 [Drosophila mojavensis]
gi|193919323|gb|EDW18190.1| GI12235 [Drosophila mojavensis]
Length = 372
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 176/384 (45%), Positives = 238/384 (61%), Gaps = 23/384 (5%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++SS ++ LL F E Y+ +L VDT+RG
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISSSIISLLIFLECLNYMRPTLTEELFVDTTRGH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HD+FK RLD GN P
Sbjct: 67 KLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLDGN-----------PL 115
Query: 128 IDKPLQRHGGRLEHNE-TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ + CGSCYGAE + CCN CE+V +AYR + W + D I+QC
Sbjct: 116 KETPIKEIVAVSPPNKNSTCGSCYGAEHNSTHCCNTCEDVLDAYRIRKWNM-QVDKIEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRTDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFTNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSN 302
+SH IN L+FGE +PLDG+R +E+ S M+ Y++K+VPT+Y S G I +N
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGLRVDVEESKSEMFNYYLKIVPTLYERHSDGKPIYTN 289
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
QFSVT H R R + +PG+FF Y+LSP+ V + E HVSF HF TN C+I+GGVFTV+
Sbjct: 290 QFSVTRH-RKDLTDRERGMPGIFFSYELSPLMVKYAERHVSFGHFATNCCSIIGGVFTVA 348
Query: 363 GIIDAFIYHGQRAIKKKIEIGKFS 386
GI+ + + AI++K+E+GK S
Sbjct: 349 GILAVVLNNSLEAIQRKLEVGKLS 372
>gi|291000812|ref|XP_002682973.1| predicted protein [Naegleria gruberi]
gi|284096601|gb|EFC50229.1| predicted protein [Naegleria gruberi]
Length = 416
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 170/408 (41%), Positives = 246/408 (60%), Gaps = 40/408 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++S D YPK +DF +T GG+I+++S +V+L+L E LYL +L VDT +
Sbjct: 3 LKSFDFYPKTQDDFRVKTLGGGLISIISLLVILILVLGEFYLYLQVERFDQLYVDTQQER 62
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVK-HDIFKKRLDSQGN-VIESRQDGIGA 125
+ I ++TFPA+ C L++D MD+SGE H+ + H ++K RL G +IE + + +
Sbjct: 63 KIPIYINITFPAVSCDALNLDVMDVSGEHHVHLDYHTVYKMRLTLDGKPIIEQQAEQVSD 122
Query: 126 PKIDKP----LQRHGGRLEHN--------------------ETYCGSCYGAESSDEDCCN 161
DKP L+ G ++H+ YCGSCYG+ CCN
Sbjct: 123 ---DKPTLDILKPPPGAVKHDLVNNAELDKIRAERAKKVKDPKYCGSCYGSNRDANQCCN 179
Query: 162 NCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFA 221
C++VRE+YR+ GWA S + I+QC E +++K + EGCN++G+ VNKVAGNFHFA
Sbjct: 180 TCDDVRESYRRVGWAFSPNEDIEQCYEEILERKMKYSKQEGCNLHGYFLVNKVAGNFHFA 239
Query: 222 PGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG----VRWTQET-- 275
PGKSF ++ H+HD ++ D FN SH IN L FGE PG++NPLDG + + ET
Sbjct: 240 PGKSFVRAQQHMHDYTNYEVDHFNTSHIINYLGFGEKIPGLINPLDGTSKIIGYNAETGQ 299
Query: 276 ----PSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDL 330
S ++QYF+KVVPT+Y S ++I +NQ+SVT+H R + +PGVFF YDL
Sbjct: 300 RVEGESALFQYFVKVVPTIYEKYGSSNSIITNQYSVTQHSRPKNRLHPNVVPGVFFIYDL 359
Query: 331 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
SPI V TE SF+ FLT++CAI+GGVFTVS ++D IY ++ + +
Sbjct: 360 SPIMVHITENKKSFVQFLTSLCAIIGGVFTVSALLDRVIYGVEKKMNR 407
>gi|195021391|ref|XP_001985385.1| GH17030 [Drosophila grimshawi]
gi|193898867|gb|EDV97733.1| GH17030 [Drosophila grimshawi]
Length = 372
Score = 306 bits (785), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 175/384 (45%), Positives = 236/384 (61%), Gaps = 23/384 (5%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++SS ++ LL E Y+ +L VDT+RG
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISSSIISLLVLLEFLNYMKPTMTEELFVDTTRGH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HD+FK RLD QG P
Sbjct: 67 KLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLQGE-----------PL 115
Query: 128 IDKPLQRHGGRLEHNE-TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ + CGSCYGAE + CCN CE+V +AYR + W + D I+QC
Sbjct: 116 KETPIKEIVAVSPPNKNSTCGSCYGAEHNSTHCCNTCEDVLDAYRIRKWNM-QVDKIEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRTDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFTNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSN 302
+SH IN L+FGE +PLDG+R +E+ S M+ Y++K+VPT+Y S G I +N
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGIRVDVEESKSEMFNYYLKIVPTLYERHSDGEPIYTN 289
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
QFSVT H R R + +PG+FF Y+LSP+ V + E H SF HF TN C+IVGGVFTV+
Sbjct: 290 QFSVTRH-RKDLTDRERGMPGIFFSYELSPLMVKYAERHNSFGHFATNCCSIVGGVFTVA 348
Query: 363 GIIDAFIYHGQRAIKKKIEIGKFS 386
GI+ + + AI++K+E+GK S
Sbjct: 349 GILAVLLNNSWEAIQRKLEVGKLS 372
>gi|449684240|ref|XP_002157414.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Hydra magnipapillata]
Length = 311
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 202/310 (65%), Gaps = 19/310 (6%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
I +++ DAYPK EDF +T+ G +IT +SSI+M LF SE YL +L VDT
Sbjct: 3 ISTRLKQFDAYPKTLEDFRVKTYGGALITGISSIIMFALFLSEFNYYLTTEVHPELFVDT 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES--RQD 121
+R + LRIN DV FP + C+ LS+DAMD+SGEQ D++H+IFKKR D +GN I++ +++
Sbjct: 63 TRHQKLRINIDVYFPNIGCAYLSIDAMDVSGEQQTDLEHNIFKKRYDEKGNPIDTVEKKE 122
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+G K ++ ++ L+ ++ C SCYGAE++D CCN CE+VR AYRKKGW +PD
Sbjct: 123 ELGD-KSEEAVKVLNSTLD-DKPKCESCYGAETTDHPCCNTCEDVRVAYRKKGWGFHDPD 180
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-------- 233
I+QCKRE + +++ EGC IYG++EV+KVAGNFH APGKSF Q +HV
Sbjct: 181 SIEQCKREHWKDTFQQQSNEGCQIYGYIEVSKVAGNFHIAPGKSFQQQHIHVQTIRFGKD 240
Query: 234 -------HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKV 286
HD+ F FN+SH I L+FGE PGV NPLDG + E S MYQYF+K+
Sbjct: 241 GTISLNMHDLQPFGAKQFNVSHNIWSLSFGEPIPGVENPLDGTNVSAEAGSLMYQYFVKI 300
Query: 287 VPTVYTDVSG 296
VPTVY +SG
Sbjct: 301 VPTVYKKLSG 310
>gi|226486462|emb|CAX74360.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Schistosoma japonicum]
Length = 379
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 155/387 (40%), Positives = 229/387 (59%), Gaps = 20/387 (5%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +N +R+ DA+ K +DF +T SG +++++SS ++ +LF SE ++ + +++
Sbjct: 1 MVVTINYLRNFDAFAKPLKDFRIKTMSGAMVSIISSFIIGILFTSEFISFMRTQNKQEII 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN-----V 115
VD +RGE + I D+T +PC+ L +D MD +G Q L+V H+++K + GN V
Sbjct: 61 VDINRGEKMSIYLDITINFIPCAFLRLDTMDTTGAQQLNVMHEVYKTSVSISGNPLSNSV 120
Query: 116 IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW 175
+ D P YCGSCYGA+S CCN CEEV+ AY + W
Sbjct: 121 RHTVNDDSALTTTRDP------------NYCGSCYGADSPTRKCCNTCEEVQMAYHEMQW 168
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
N +QC+ E + + EGC I+G L VN+V G FH APG S+ ++ HVH
Sbjct: 169 VFGNASEFEQCRNENWDGMKRNIGNEGCRIHGSLTVNRVGGGFHIAPGHSYTENHAHVHS 228
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
I + FN+SH I +L FG+ +PG +N LDG + T + PS M+ Y++K+VPT+YT VS
Sbjct: 229 IRSLGHVQFNVSHSITELRFGDAYPGQINSLDGTKMTVDKPSQMFNYYLKLVPTMYTSVS 288
Query: 296 GH--TIQSNQFSVTEHFRSSE-QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+ T+ +NQ+S T H R S G Q LPGVFF Y+++P+ V TEE SF+HFLTN C
Sbjct: 289 NNESTLITNQYSATWHSRGSPLSGDGQGLPGVFFNYEIAPLLVKITEERKSFVHFLTNTC 348
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKK 379
AI+GGVFTV+ ++DAFIY ++ +
Sbjct: 349 AIIGGVFTVASLLDAFIYQSSCVLRNR 375
>gi|56753075|gb|AAW24747.1| SJCHGC09363 protein [Schistosoma japonicum]
gi|226486460|emb|CAX74359.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Schistosoma japonicum]
gi|226486464|emb|CAX74361.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Schistosoma japonicum]
Length = 379
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 155/387 (40%), Positives = 229/387 (59%), Gaps = 20/387 (5%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +N +R+ DA+ K +DF +T SG +++++SS ++ +LF SE ++ + +++
Sbjct: 1 MVVTINYLRNFDAFAKPLKDFRIKTMSGAMVSIISSFIIGILFTSEFISFMRTQNKQEII 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN-----V 115
VD +RGE + I D+T +PC+ L +D MD +G Q L+V H+++K + GN V
Sbjct: 61 VDINRGEKMSIYLDITINFIPCAFLRLDTMDTTGAQQLNVMHEVYKTSVSISGNPLSNSV 120
Query: 116 IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW 175
+ D P YCGSCYGA+S CCN CEEV+ AY + W
Sbjct: 121 RHTVNDDSALTTTRDP------------NYCGSCYGADSPTRKCCNTCEEVQMAYHEMQW 168
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
N +QC+ E + + EGC I+G L VN+V G FH APG S+ ++ HVH
Sbjct: 169 VFGNASEFEQCRNENWDGMKRNIGNEGCRIHGSLTVNRVGGGFHIAPGHSYTENHAHVHS 228
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
I + FN+SH I +L FG+ +PG +N LDG + T + PS M+ Y++K+VPT+YT VS
Sbjct: 229 IRSLGHVQFNVSHSITELRFGDAYPGQINSLDGTKMTVDKPSQMFNYYLKLVPTMYTSVS 288
Query: 296 GH--TIQSNQFSVTEHFRSSE-QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+ T+ +NQ+S T H R S G Q LPGVFF Y+++P+ V TEE SF+HFLTN C
Sbjct: 289 NNESTLITNQYSATWHSRGSPLSGDGQGLPGVFFNYEIAPLLVKITEERKSFVHFLTNTC 348
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKK 379
AI+GGVFTV+ ++DAFIY ++ +
Sbjct: 349 AIIGGVFTVASLLDAFIYQSSCVLRNR 375
>gi|324511490|gb|ADY44781.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Ascaris suum]
Length = 382
Score = 303 bits (777), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 165/389 (42%), Positives = 246/389 (63%), Gaps = 13/389 (3%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ ++R LDAY K +DF +TF+GG +TL+S++V+++LF SE +L+ +L VD
Sbjct: 2 SLLARLRDLDAYTKPLDDFRVKTFTGGAVTLLSTLVIVVLFVSETISFLSTDVVEQLFVD 61
Query: 63 -TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
TS + L +NFDVTF LPC++++VD MD+SG+ DV+ D++K+RLD QGN I
Sbjct: 62 STSADQRLDVNFDVTFTKLPCAMVTVDVMDVSGDNQDDVQDDVYKQRLDQQGNNIT---- 117
Query: 122 GIGAPKIDKPLQRHGGRLE-HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G A ++ + + E CGSCYGA + CCN CE+V+EAY +GW + +
Sbjct: 118 GQAAVRLGVNVNTSTPASQLTTEPKCGSCYGAS---DRCCNTCEDVKEAYSARGWQMLDI 174
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ ++QCK + +++ I + +GEGC +YG ++V KVAGNFH APG H HD+ +
Sbjct: 175 ESVEQCKSDAWVRTINDFKGEGCRVYGKVQVAKVAGNFHIAPGDPLRSLRSHFHDLHSIA 234
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRW-TQETPSG-MYQYFIKVVPTVYTDV-SGH 297
F+ +H IN L+FG FPG PLDG + T + SG M+QY++KVVPT+Y + S +
Sbjct: 235 PAKFDTAHIINHLSFGTPFPGKNYPLDGKSFGTNKDSSGIMFQYYMKVVPTMYEFLDSSN 294
Query: 298 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
I S+QFSVT H + G LPG F Y+ SP+ V + E FL ++CAI+GG
Sbjct: 295 NIFSHQFSVTTHQKDIGMGA-SGLPGFFVQYEFSPLMVKYEERRQPLSTFLVSLCAIIGG 353
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
VFTV+ +ID+ IYH RAI+ K+E+ K++
Sbjct: 354 VFTVASLIDSLIYHSSRAIQHKVEMNKYN 382
>gi|312075860|ref|XP_003140604.1| hypothetical protein LOAG_05019 [Loa loa]
Length = 365
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 156/388 (40%), Positives = 233/388 (60%), Gaps = 28/388 (7%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ +++ DAY K +DF RTF+GG +TLVSS V++ +F SE +L+ +L VD
Sbjct: 2 SLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYVD 61
Query: 63 TSRGET-LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
++ E + +NFD+TFP LPCS++++D MD+SG+ D++ D++K ++ N+ S
Sbjct: 62 STPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIRDDVYKIKV----NINTSTAS 117
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ A ++ CGSCYGA+ E CCN CEEV+EAY +KGW L N +
Sbjct: 118 SVPASQV----------------LCGSCYGAK---EGCCNTCEEVKEAYMRKGWELINIE 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
++QCK + +++++ E + EGC +YG ++V KVAGNFH APG H HD+ +
Sbjct: 159 TVEQCKSDLWVKKMSEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLRAHRSHFHDLHSLSP 218
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG--MYQYFIKVVPTVYTDV-SGHT 298
F+ SH +N +FG FPG V PLDG + S MYQY +K+VPT Y + S
Sbjct: 219 SKFDTSHTVNHFSFGNSFPGKVYPLDGKFFGSARNSDGIMYQYHLKLVPTSYVFLDSTRN 278
Query: 299 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
I S+ FSVT + + QG LPG F Y+ SP+ V + E S FL ++CAI+GG+
Sbjct: 279 IFSHLFSVTTYQKDISQGA-SGLPGFFVQYEFSPLMVKYEERQQSLSTFLVSICAIIGGI 337
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
FTV+ +IDAFIY R I +KI + K++
Sbjct: 338 FTVASLIDAFIYRSGRIISQKIALNKYT 365
>gi|58264656|ref|XP_569484.1| ER to Golgi transport-related protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|134109945|ref|XP_776358.1| hypothetical protein CNBC5750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259032|gb|EAL21711.1| hypothetical protein CNBC5750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57225716|gb|AAW42177.1| ER to Golgi transport-related protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 422
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 157/414 (37%), Positives = 238/414 (57%), Gaps = 40/414 (9%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+ + + DA+ K ED +T +G ++T +S ++L E Y E ++V
Sbjct: 4 NGMFGSFQGFDAFGKTMEDVKIKTRTGALLTFISLSIILTSVMLEFIDYRRIHMEPSIIV 63
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D SRGE L I+FD+ FP +PC +LS+D MDISGE + +H + K R++ GNVI Q
Sbjct: 64 DRSRGEKLVIDFDIEFPRVPCYLLSLDVMDISGEHQTEFEHQVTKTRMNKDGNVISKVQG 123
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
G ++ ++R L + YCGSCYGA + CCN+CEEVR+AY +KGW+ S+P+
Sbjct: 124 G----QLKGDVER--ANLNQDPNYCGSCYGALPPESGCCNSCEEVRQAYGRKGWSFSDPE 177
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
I+QC EG++ ++KE+ EGC I G + VNKV GN HF+PG+SF + + + +++ + R
Sbjct: 178 GIEQCVEEGWMDKMKEQNEEGCRIDGHIRVNKVIGNLHFSPGRSFQNNMMQMLELVPYLR 237
Query: 242 DS--FNISHKINKLAFGEHFP------------------GVVNPLDGVRWTQETPSGMYQ 281
D + H ++K FG G+ +PL G++ E + M+Q
Sbjct: 238 DKNHHDFGHIVHKFRFGADMTKAEELTVLPKEQRWRDKLGLRDPLQGIKAHTEVSNYMFQ 297
Query: 282 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR--------------LQTLPGVFFF 327
YF+KVV T + +SG I S+Q+SVT++ R G + +PGVFF
Sbjct: 298 YFLKVVSTNFISLSGEEISSHQYSVTQYERDLRTGNAPGKDAHGHMTSHGMMGVPGVFFN 357
Query: 328 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 381
Y++SP+KV TEE SF HFLT+ CAIVGGV TV+ ++D+ I++ + +KKK E
Sbjct: 358 YEISPMKVIHTEERQSFAHFLTSTCAIVGGVLTVASLVDSLIFNSSKRLKKKSE 411
>gi|393907059|gb|EFO23462.2| hypothetical protein LOAG_05019 [Loa loa]
Length = 378
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 156/388 (40%), Positives = 233/388 (60%), Gaps = 15/388 (3%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ +++ DAY K +DF RTF+GG +TLVSS V++ +F SE +L+ +L VD
Sbjct: 2 SLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYVD 61
Query: 63 TSRGET-LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
++ E + +NFD+TFP LPCS++++D MD+SG+ D++ D++K L ++
Sbjct: 62 STPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIRDDVYKISL-------LDGKE 114
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
G G + + ++ CGSCYGA+ E CCN CEEV+EAY +KGW L N +
Sbjct: 115 GNGVRQEVNINTSTASSVPASQVLCGSCYGAK---EGCCNTCEEVKEAYMRKGWELINIE 171
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
++QCK + +++++ E + EGC +YG ++V KVAGNFH APG H HD+ +
Sbjct: 172 TVEQCKSDLWVKKMSEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLRAHRSHFHDLHSLSP 231
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG--MYQYFIKVVPTVYTDV-SGHT 298
F+ SH +N +FG FPG V PLDG + S MYQY +K+VPT Y + S
Sbjct: 232 SKFDTSHTVNHFSFGNSFPGKVYPLDGKFFGSARNSDGIMYQYHLKLVPTSYVFLDSTRN 291
Query: 299 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
I S+ FSVT + + QG LPG F Y+ SP+ V + E S FL ++CAI+GG+
Sbjct: 292 IFSHLFSVTTYQKDISQGA-SGLPGFFVQYEFSPLMVKYEERQQSLSTFLVSICAIIGGI 350
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
FTV+ +IDAFIY R I +KI + K++
Sbjct: 351 FTVASLIDAFIYRSGRIISQKIALNKYT 378
>gi|348680250|gb|EGZ20066.1| CopII vesicle protein [Phytophthora sojae]
Length = 409
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 153/383 (39%), Positives = 232/383 (60%), Gaps = 10/383 (2%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ +D YPK++ +F +T G +++V+ IVM +LF SEL Y + T ++VD+S GE
Sbjct: 31 LKKVDVYPKMHREFKVQTEFGATVSIVAGIVMAILFLSELSAYWSLNTHEHMVVDSSLGE 90
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L++N DV+F A+ C ++AMD++GE +++ + K RLD+ GN I G
Sbjct: 91 KLQVNLDVSFLAVNCRDAHINAMDVAGELQVNMHQTVVKTRLDADGNTI-----GRPISM 145
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAE-SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
I + E YCGSC+GA+ + ++CCN CE+V+EA+ ++L + + +QC
Sbjct: 146 ITDEGAEEQAKTALPEGYCGSCHGAQHPAGKECCNTCEDVKEAFIYSDFSLEDAEQKEQC 205
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
RE ++GEGC G + VN+VAGNFH A G++FH+ G VH Q ++N
Sbjct: 206 VREIMEAEKLAQDGEGCRFTGKMFVNRVAGNFHVALGRTFHRQGRLVHQFRPGQEHTYNS 265
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I+ L+FGE PGV PLDGV E G++QY+IK+VPT+Y+D+ +TI S QFSV
Sbjct: 266 SHIIHSLSFGEPMPGVAGPLDGVSKIAEQSGGVFQYYIKIVPTIYSDIDENTIHSYQFSV 325
Query: 307 TEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
T+ + +G++ +LPG FF +DLSP V + + F HFLT VCAIVGGV +++G +
Sbjct: 326 TQQGNYLNPRGQMTSLPGTFFVFDLSPFMVKVENDRMPFTHFLTKVCAIVGGVISIAGFV 385
Query: 366 DAFIY---HGQRAIKKKIEIGKF 385
D+F+Y H +R + KF
Sbjct: 386 DSFMYNSLHVRRRVSTNSGATKF 408
>gi|71021625|ref|XP_761043.1| hypothetical protein UM04896.1 [Ustilago maydis 521]
gi|46100607|gb|EAK85840.1| hypothetical protein UM04896.1 [Ustilago maydis 521]
Length = 435
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 155/417 (37%), Positives = 241/417 (57%), Gaps = 51/417 (12%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+ I ++R +DA+ K +D RT +G +ITL+S++++ +L E Y + L V
Sbjct: 4 NGIFGQLRGIDAFSKTMDDVRIRTNAGALITLISALLIAVLTIGEFIDYRTVHVKPALEV 63
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D SRGE L +N ++TFP +PC +LS+D MDISGE D++HD+ + R++ G +IE +
Sbjct: 64 DRSRGEKLTVNMNITFPRVPCYLLSLDVMDISGEHVNDIQHDVERTRINHDGKIIEQGK- 122
Query: 122 GIGAPKIDKPLQRHGGRLEHNE--TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
K L+ R+ + + YCG CYG + CCN C+EVREAY +KGW+ ++
Sbjct: 123 --------KSLKGDAARIANTKGKDYCGDCYGGQPPASKCCNTCDEVREAYVRKGWSFAD 174
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
PD +DQC EG+ ++IKE+ EGC I G L VNKV G+FH +PGK+F ++ +H+HD++ +
Sbjct: 175 PDHVDQCVAEGWSEKIKEQNKEGCRISGKLHVNKVVGSFHLSPGKAFQRNSMHIHDLVPY 234
Query: 240 ----QRDSFNISHKINKLAFGEHFP----------------GVVNPLDGVRWTQETPSGM 279
+ + H I++ +FG GV +PL+GVR + M
Sbjct: 235 LSGTGSEHHDFGHIIHEFSFGSEQEYHGLTSAKERAVKAKLGVKDPLEGVRAQTQQSQFM 294
Query: 280 YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-------------SEQGR-------LQ 319
+QYF+KVV T + +SG T+++ Q+SVT + R S +G
Sbjct: 295 FQYFVKVVSTEFRPLSGETLKTQQYSVTTYERDLSPGANAAALAGLSNEGSGAHISHGFA 354
Query: 320 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+PGVFF Y++SP+K +E S HFLT+ CAIVGG+ TV+GI+D+ +Y+ +R +
Sbjct: 355 GVPGVFFNYEISPLKTIHSEYRQSLSHFLTSTCAIVGGILTVAGILDSLVYNSRRRL 411
>gi|321253192|ref|XP_003192660.1| ER to Golgi transport-related protein [Cryptococcus gattii WM276]
gi|317459129|gb|ADV20873.1| ER to Golgi transport-related protein, putative [Cryptococcus
gattii WM276]
Length = 435
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 156/415 (37%), Positives = 238/415 (57%), Gaps = 40/415 (9%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+ + + DA+ K ED +T +G ++T +S ++L E Y E ++V
Sbjct: 4 NGMFGAFQGFDAFGKTMEDVKVKTRTGALLTFISLSIILTSVMLEFIDYRRIHLEPSIIV 63
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D SRGE L I+FD+ FP +PC +LS+D MDISGE + +H + K R+D G +I Q
Sbjct: 64 DRSRGEKLVIDFDIEFPRVPCYLLSLDVMDISGEHQTEFEHQVTKTRIDKNGKIISKVQG 123
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
G ++ L+R L + YCGSCYGA + CCN+CEEVR+AY +KGW+ S+P+
Sbjct: 124 G----QLKGDLER--ANLNQDPNYCGSCYGAPPPESGCCNSCEEVRQAYGRKGWSFSDPE 177
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
I+QC EG++ ++KE+ EGC I G + VNKV GN HF+PG+SF + + + +++ + R
Sbjct: 178 GIEQCVEEGWMDKMKEQNEEGCRIGGHIRVNKVIGNLHFSPGRSFQNNMMQMLELVPYLR 237
Query: 242 DS--FNISHKINKLAFGEHFP------------------GVVNPLDGVRWTQETPSGMYQ 281
D + H ++K FG G+ +PL G++ E + M+Q
Sbjct: 238 DKNHHDFGHIVHKFRFGGDMTKAEELTVLPKEQRWRDKLGLKDPLQGIKVHTEVSNYMFQ 297
Query: 282 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR--------------LQTLPGVFFF 327
YF+KVV T + ++G I S+Q+SVT++ R G + +PGVFF
Sbjct: 298 YFLKVVSTNFISLNGEEIPSHQYSVTQYERDLRTGNAPGKDAHGHMTSHGMMGVPGVFFN 357
Query: 328 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
Y++SP+KV TEE SF HFLT+ CAIVGGV TV+ ++D+FI++ + +KK E+
Sbjct: 358 YEISPMKVIHTEERQSFAHFLTSTCAIVGGVLTVASLLDSFIFNSSKRLKKTSEV 412
>gi|313231322|emb|CBY08437.1| unnamed protein product [Oikopleura dioica]
Length = 386
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 162/390 (41%), Positives = 235/390 (60%), Gaps = 19/390 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+++IR DAY K EDF RT +G VIT+ S++ +LLFFSEL YL ++L VD +
Sbjct: 4 LSQIRRFDAYTKPVEDFRERTVTGAVITICCSLLCMLLFFSELNYYLTTEVVSELRVDNT 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL-DSQGNVIESRQDGI 123
RG L +N D+T LPC+ S+DAMD++G++ D +H +FK R+ D Q + + + I
Sbjct: 64 RGGKLVMNLDLTVAGLPCNYFSIDAMDLTGDR-ADAEHQLFKVRMKDGQEVALSEKVEEI 122
Query: 124 GAPKIDKPLQRHGGRLEHNET------YCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
A K+ H + E ET C SCYGAE+ ++ CCN+CEEV++AYR KGWA
Sbjct: 123 NAEKL------HDEKQEEEETGLAVKDECQSCYGAETEEQPCCNSCEEVQQAYRNKGWAF 176
Query: 178 S-NPDLIDQCKREGF--LQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
+ QC E F + +++ EGE C ++G LEVN+V+G+ +PGK+ G VH
Sbjct: 177 DHSAQQFSQCVNEHFDLNEELQKTEGESCRVHGHLEVNRVSGSLQISPGKTLVLDGSVVH 236
Query: 235 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV 294
DI + SF+ SH I+ L+FGE FPG NPLD E+ + + Y KV+PT + +
Sbjct: 237 DIRGMKHMSFDTSHTIHHLSFGEVFPGQENPLDNTEHEAESMNMAWHYNFKVIPTEFRKL 296
Query: 295 SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
G +NQFSVT H ++ Q + LPG+ F ++++PI V E S +HF T+VCAI
Sbjct: 297 DGSRTATNQFSVTRHEKALSQMSSR-LPGINFHFEIAPIAVIKMETRRSAVHFATSVCAI 355
Query: 355 VGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+GGV+T+S I+D+FI H + K E+GK
Sbjct: 356 IGGVWTISSILDSFI-HKTNKLLIKTELGK 384
>gi|301106576|ref|XP_002902371.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
gi|262098991|gb|EEY57043.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
Length = 393
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 150/377 (39%), Positives = 224/377 (59%), Gaps = 18/377 (4%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ +D YPK++ +F +T G +++V+ I M +LF SEL Y T ++VD++ GE
Sbjct: 30 LKKVDVYPKMHREFKVQTEFGATVSIVAGIFMAILFLSELSTYWTVNTHEHMVVDSTLGE 89
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L++N DV+F A+ C ++AMD++GE +++ + K RLD+ G I + D + K
Sbjct: 90 KLQVNLDVSFLAVNCRDAHINAMDVAGELQVNMHQTVVKTRLDANGRSISTTADELA--K 147
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAE-SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
D P YCGSCYG + ++CCN CEEV+EA+ +L + +QC
Sbjct: 148 TDLP-----------AGYCGSCYGTRHPAGKECCNTCEEVKEAFIHSDLSLEEAEQKEQC 196
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
RE ++GEGC G + VN+VAGNFH A G++FH+ G VH Q +FN
Sbjct: 197 VRESIDTEKLAQDGEGCRFTGKMFVNRVAGNFHVALGRTFHRQGRLVHQFRPGQEHTFNS 256
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I+ L+FGE PG +PLDGV E G++QY+IK+VPT+Y+D+ I S QFSV
Sbjct: 257 SHIIHSLSFGEPIPGATSPLDGVSKIAEQSGGVFQYYIKIVPTIYSDIDESAIHSYQFSV 316
Query: 307 TEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
T+ + +G++ +LPG FF +DLSP V + V F HFLT +CAIVGGV +++G +
Sbjct: 317 TQQSNYLNPRGQMTSLPGTFFVFDLSPFMVKVENDRVPFTHFLTKICAIVGGVISIAGFV 376
Query: 366 DAFIY---HGQRAIKKK 379
D+F+Y H +R + K
Sbjct: 377 DSFMYNSLHVRRRVSSK 393
>gi|443732120|gb|ELU16969.1| hypothetical protein CAPTEDRAFT_192533 [Capitella teleta]
Length = 304
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 149/306 (48%), Positives = 198/306 (64%), Gaps = 6/306 (1%)
Query: 39 MLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
M +LF SE YL +L VDT+RG+ L+IN D+TFP + CS L++DAMD+SGEQ +
Sbjct: 1 MFVLFVSEFNYYLTTEVHPELFVDTARGQKLKINVDMTFPTVGCSFLTLDAMDVSGEQQI 60
Query: 99 DVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDED 158
DV HDIFK+RLD G +++ G L + SCYGAES
Sbjct: 61 DVLHDIFKQRLDLDGIEVKAEPSKEGQSSESCALNHALSSFLFSRF---SCYGAESEAHK 117
Query: 159 CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 218
CCN C EVREAYR+KGWA + I+QC REG++ +++E + EGC IYGFLEVNKVAGNF
Sbjct: 118 CCNTCNEVREAYRQKGWAFVDAQNIEQCMREGYVSQLEEGKNEGCRIYGFLEVNKVAGNF 177
Query: 219 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPS 277
H APG+SF Q H+HD+ A Q FN+SH+I L+FG+ +PG VNPLD + T++
Sbjct: 178 HVAPGRSFSQHHAHIHDMQALQGMKFNMSHRIQHLSFGDDYPGQVNPLDASEQVTEQADF 237
Query: 278 GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKV 335
M+ Y++KVVPT Y +G + SNQ+SVT+H + G L Q LPGVF Y+LSP+ V
Sbjct: 238 VMFSYYVKVVPTSYLRANGEFVSSNQYSVTKHHKKVGGGILGEQGLPGVFVTYELSPMMV 297
Query: 336 TFTEEH 341
+TE++
Sbjct: 298 KYTEKN 303
>gi|328858670|gb|EGG07782.1| hypothetical protein MELLADRAFT_105603 [Melampsora larici-populina
98AG31]
Length = 422
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 164/408 (40%), Positives = 237/408 (58%), Gaps = 38/408 (9%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
+ LD + K ED RT GG +TL S+I+++ L E Y ++VD SRGE
Sbjct: 12 KGLDGFGKTMEDVKIRTGFGGFLTLASAILIVTLVLVEFVDYRTLHLNPSIVVDKSRGEK 71
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE-SRQDGIGAPK 127
L ++ ++TFP +PC +LSVD MDISGE DV HD+ K RL+ G ++ S G+
Sbjct: 72 LIVDMNITFPRVPCYLLSVDLMDISGEHQNDVNHDMTKTRLNPDGTLVSASVSKGLKGEL 131
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
R G YCGSCYG + CCN CEEVRE+Y ++GW+ SNPD I+QC
Sbjct: 132 DTIAATRAPG-------YCGSCYGGTPPESGCCNTCEEVRESYVRRGWSFSNPDGIEQCV 184
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFN 245
+E + +IKE+E EGCN+ G ++VNKV GNFH +PG+SF + +HVHD++ + + +S +
Sbjct: 185 QEHWSDKIKEQEKEGCNMNGQVKVNKVIGNFHMSPGRSFQTNAMHVHDLVPYLQTGNSHD 244
Query: 246 ISHKINKLAF-GEHFP-------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 291
H I+K AF EH G+VNPLDG++ E + M+QYF+KVV T +
Sbjct: 245 FGHIIHKFAFLAEHQSPDDDETRRIKTSLGIVNPLDGIKAHTEESNYMFQYFLKVVGTEF 304
Query: 292 TDVSGHTIQSNQFSVTEHFR---SSEQGRLQTL-----------PGVFFFYDLSPIKVTF 337
+ ++++Q+SVT++ R S +G L PG+FF Y++SP++V
Sbjct: 305 HLLDQRVVKTHQYSVTQYERDLTKSSRGGTDELGHQTSHGYAGVPGLFFNYEISPMQVIH 364
Query: 338 TEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
E SF HF T+ CAI+GGV TV+G+ID+ +Y + IK + G F
Sbjct: 365 KEYRQSFAHFATSTCAIIGGVLTVAGLIDSAVYGARNRIKLQSSDGGF 412
>gi|302688477|ref|XP_003033918.1| hypothetical protein SCHCODRAFT_75438 [Schizophyllum commune H4-8]
gi|300107613|gb|EFI99015.1| hypothetical protein SCHCODRAFT_75438 [Schizophyllum commune H4-8]
Length = 415
Score = 296 bits (759), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 155/407 (38%), Positives = 229/407 (56%), Gaps = 39/407 (9%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ +DA+ K ED +T +G ++TL+++ ++L E Y + +T + VD S
Sbjct: 6 LSHLKGIDAFGKTAEDVKVKTRTGALLTLIAASIILAFTTLEFFDYRKVIIDTSVTVDQS 65
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RGE L + +VTFP +PC +LSVD DISG+ DV H++ K RLD G I
Sbjct: 66 RGERLTVRMNVTFPRVPCYLLSVDVTDISGDVQRDVSHNMLKTRLDKDGKAIRGAHTAEL 125
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+IDK ++ G YCGSCYG CCN CEEVR AY +GW+ +NPD I+
Sbjct: 126 RNEIDKQNEQRGA------DYCGSCYGGLPPASGCCNTCEEVRTAYVNRGWSFNNPDSIE 179
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QCK EG+ +++E+ EGCNI G L +NKVAGN H +PG+SF G +V++++ + RD
Sbjct: 180 QCKNEGWADKLREQANEGCNIAGRLRINKVAGNIHLSPGRSFQTGGRNVYELVPYLRDDG 239
Query: 245 N---ISHKINKLAF-----------------GEHFPGVVNPLDGVRWTQETPSGMYQYFI 284
N SH I+ L+F + NPLDG M+QYF+
Sbjct: 240 NRHDFSHTIHSLSFEGDDAYDNRKRETSKEMRQRMGLSSNPLDGTVRVTNKAQYMFQYFV 299
Query: 285 KVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT------------LPGVFFFYDLS 331
KVV T + ++G T+ S+ +SVT R ++ G+ QT LPG F +D+S
Sbjct: 300 KVVSTKFRPLNGRTVNSHSYSVTHFERDLTDGGQAQTGQNVQVQHGVTGLPGAFINFDVS 359
Query: 332 PIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
PI++ TE SF HF+T+ CAIVGGV TV+ ++D+ ++ +A+KK
Sbjct: 360 PIQLVHTEWRQSFAHFVTSTCAIVGGVLTVASLLDSVLFATSKALKK 406
>gi|384483831|gb|EIE76011.1| hypothetical protein RO3G_00715 [Rhizopus delemar RA 99-880]
Length = 408
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 154/384 (40%), Positives = 233/384 (60%), Gaps = 23/384 (5%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ + + R LDAY K +DF RT +GG +T++S + +L+L E YL + + ++LVD
Sbjct: 25 SFIKRFRKLDAYAKTLDDFRVRTATGGAVTIISGLCILILVLFETVQYLTPIMKPEILVD 84
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
E L I FD+TFP LPC +LS+D MD SGE + HD++K+RLD G VI + +
Sbjct: 85 GGNMEKLPIKFDITFPHLPCYMLSLDIMDESGEHISNYDHDVYKERLDPNGEVITAEKSN 144
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+ K + H + + YCGSCYGA+ S+E CCN CEE++ AY + GW + +PD
Sbjct: 145 DLSNSQAKNAREHSMNVP--DDYCGSCYGAKGSNE-CCNTCEEIQNAYSELGWNV-DPDN 200
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+QC REG+ ++I+ + EGC ++G L VNK+ GNFHF+ GK+F QSG H+HD+ F +
Sbjct: 201 FEQCIREGWKEKIESQSREGCRMHGTLLVNKIRGNFHFSAGKAFKQSGSHIHDMSTFLHN 260
Query: 243 --SFNISHKINKLAFGEH-----------FPGVVNPLDGVRWTQETPSGMYQYFIKVVPT 289
+ N H I L FG H +++PL+ ++ + MYQYF+K+VPT
Sbjct: 261 DKNQNFMHTIQHLQFGNHDYNSEKQKRTKSRELIHPLENIKSGNSETAIMYQYFLKIVPT 320
Query: 290 VYTDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
+ ++G I++ Q+SV+ +H S G LPGVFF D SP+++ ++E S +
Sbjct: 321 EFNFLNGKRIRTFQYSVSKQDHIVSYLGG----LPGVFFMLDHSPMRIIYSETKTSLASY 376
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYH 371
LT++CAI+GG+FTV+ +ID I H
Sbjct: 377 LTSLCAIIGGIFTVASVIDGSIQH 400
>gi|413949705|gb|AFW82354.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein,
partial [Zea mays]
Length = 202
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 127/185 (68%), Positives = 162/185 (87%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA +++++ LDAYPK+NEDFY RT SGG++TLV+++VMLLLF SE R Y + TETKL+
Sbjct: 1 MDAFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LR+NFD+TFP++PC++LSVD DISGEQH D++HDI K+RL+S GNVIE+R+
Sbjct: 61 VDTSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVIEARK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+GIG K+++PLQ+HGGRL+ E YCG+CYGAE SDE CCN+CEEVREAY+KKGWAL+NP
Sbjct: 121 EGIGGAKVERPLQKHGGRLDKGEQYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNP 180
Query: 181 DLIDQ 185
DLIDQ
Sbjct: 181 DLIDQ 185
>gi|388856238|emb|CCF50047.1| uncharacterized protein [Ustilago hordei]
Length = 435
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 156/417 (37%), Positives = 240/417 (57%), Gaps = 51/417 (12%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+ + ++R +DA+ K +D RT +G +ITL+S++++L+L E Y + L V
Sbjct: 4 NGVFGQLRGIDAFSKTMDDVRIRTNAGALITLISALLILVLTIGEYVDYRTVHLKPALEV 63
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D SRGE L +N ++TFP +PC +LS+D MDISGE D++HDI + R+ QD
Sbjct: 64 DRSRGEKLTVNMNITFPRVPCYLLSLDVMDISGEHVNDIQHDIERTRIS---------QD 114
Query: 122 GIGAPKIDKPLQRHGGRLEHNE--TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
G + + K L+ R+ + + YCG CYG + CCN C+EVREAY +KGW+ S+
Sbjct: 115 GKVSIQGTKSLKGDAARIANTKGKDYCGDCYGGQPPASGCCNTCDEVREAYVRKGWSFSD 174
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
PD ++QC EG+ ++IKE+ EGC I G L VNKV G+FH +PG++F ++ +H+HD++ +
Sbjct: 175 PDHVEQCVAEGWSEKIKEQNKEGCRISGKLHVNKVVGSFHLSPGRAFQRNSMHIHDLVPY 234
Query: 240 QRDS----FNISHKINKLAFGEHFP----------------GVVNPLDGVRWTQETPSGM 279
S + H I++ +FG GV +PL+GVR + M
Sbjct: 235 LSGSGAEHHDFGHIIHEFSFGSEQEYHGLTTAKERAVKDKLGVKDPLEGVRARTKESQYM 294
Query: 280 YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-------------SEQGR-------LQ 319
+QYF+KVV T + ++G T+++ Q+SVT + R S +G
Sbjct: 295 FQYFLKVVSTEFRPLAGETLKTQQYSVTTYERDLSPGANAAALAGLSNEGSGARISHGFA 354
Query: 320 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+PGVFF Y++SP+K +E S HFLT+ CAIVGG+ TV+GI+D+ IY+ R +
Sbjct: 355 GVPGVFFNYEISPLKTIHSEYRQSLSHFLTSTCAIVGGILTVAGILDSLIYNSGRRL 411
>gi|392591676|gb|EIW81003.1| ER-derived vesicles protein ERV46 [Coniophora puteana RWD-64-598
SS2]
Length = 419
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 156/406 (38%), Positives = 231/406 (56%), Gaps = 41/406 (10%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ +DA+ K ED +T +G +TL+S+ ++L E Y T+T ++VD SRGE
Sbjct: 9 LKGIDAFGKTTEDVKVKTRTGAFLTLLSAAIILSFTLMEFVDYRRVYTDTSIVVDRSRGE 68
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L + +VTFP +PC +LSVD MDISGE DV H++ K+RLD G I + G +
Sbjct: 69 KLSVRMNVTFPHVPCYLLSVDVMDISGETQRDVSHNVVKQRLDKTGKGIAGSRSGDLRNE 128
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGA-ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
IDK + G YCGSCYG S+D CCN+CEEVR+AY KGW+ NP+ I+QC
Sbjct: 129 IDKLAELRG------PDYCGSCYGGYTSTDNGCCNSCEEVRQAYVNKGWSFGNPEGIEQC 182
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---S 243
+EG+ ++K++ EGCNI G + VNKV GN + +PG+SF + +D + + ++
Sbjct: 183 TQEGWTDKVKDQADEGCNISGRIRVNKVVGNINISPGRSFQTGSRNFYDFVPYLKEDGGQ 242
Query: 244 FNISHKINKLAF---GEHFPGVV--------------NPLDGVRWTQETPSGMYQYFIKV 286
+ +H I++L F E+ P + NPLDG + + MYQYF+KV
Sbjct: 243 HDFTHYIDELTFLADDEYNPNKMKHGKELKQRMGLDSNPLDGFKASTTKKMFMYQYFLKV 302
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR--------------LQTLPGVFFFYDLSP 332
V T + ++G TI ++Q+S T R +G PG +F +++SP
Sbjct: 303 VSTQFRTLNGRTINTHQYSATHFERDLSRGMGGGENNQGVYVQHGAGGAPGAYFNFEISP 362
Query: 333 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
I+V E SF HFLT+ CAIVGGV TV+ ++D+F++ RA+KK
Sbjct: 363 IQVVHAETRQSFAHFLTSTCAIVGGVLTVAALLDSFLFATSRALKK 408
>gi|405123077|gb|AFR97842.1| COPII-coated vesicle component Erv46 [Cryptococcus neoformans var.
grubii H99]
Length = 422
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 152/403 (37%), Positives = 231/403 (57%), Gaps = 40/403 (9%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+ + + DA+ K ED +T +G ++T +S ++L E Y E ++V
Sbjct: 4 NGMFGSFQGFDAFGKTMEDVKIKTRTGALLTFISLSIILTSVMLEFIDYRRIHLEPSIIV 63
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D SRGE L I+FD+ FP +PC +LS+D MDISGE + +H + K R++ GNVI Q
Sbjct: 64 DRSRGEKLVIDFDIEFPRVPCYLLSLDVMDISGEHQTEFEHQVTKTRMNKDGNVISKVQ- 122
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
++ ++R L + YCGSCYGA + CCN+CEEVR+AY +KGW+ S+P+
Sbjct: 123 ---GSQLKGDVER--ANLNQDPNYCGSCYGAPPPESGCCNSCEEVRQAYGRKGWSFSDPE 177
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
I+QC EG++ ++KE+ EGC I G + VNKV GN HF+PG+SF + + + +++ + R
Sbjct: 178 GIEQCVEEGWMDKMKEQNEEGCRIDGHIRVNKVIGNLHFSPGRSFQNNMMQMLELVPYLR 237
Query: 242 DS--FNISHKINKLAFGEHFP------------------GVVNPLDGVRWTQETPSGMYQ 281
D + H ++K FG G+ +PL G++ E + M+Q
Sbjct: 238 DKNHHDFGHIVHKFRFGGDMTKAEELTVLPKEQRWRDKLGLRDPLQGMKAHTEVSNYMFQ 297
Query: 282 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR--------------LQTLPGVFFF 327
YF+KVV T + ++G I S+Q+SVT++ R G + +PGVFF
Sbjct: 298 YFLKVVSTNFISLNGEEIPSHQYSVTQYERDLRTGNAPGKDAHGHMTSHGMMGVPGVFFN 357
Query: 328 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
Y++SP+KV TEE SF HFLT+ CAIVGGV TV+ ++D+FI+
Sbjct: 358 YEISPMKVIHTEERQSFAHFLTSTCAIVGGVLTVASLVDSFIF 400
>gi|268581953|ref|XP_002645960.1| C. briggsae CBR-ERV-46 protein [Caenorhabditis briggsae]
Length = 380
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 160/388 (41%), Positives = 228/388 (58%), Gaps = 13/388 (3%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ ++ DAY K +DF +T SGG++TL+++IV+ LL E R +L+ L VD
Sbjct: 2 SLLWSLKHFDAYRKPMDDFRVKTLSGGLVTLIATIVIGLLIVLETRQFLSTAVLEHLFVD 61
Query: 63 -TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG-NVIESRQ 120
T+ E + I FD+TF LPC+ ++VD MD+S E ++ DI++ RLD+ G NV ES Q
Sbjct: 62 STTSDERVHIEFDITFNKLPCNFITVDVMDVSSEAQENINDDIYRLRLDADGRNVSESAQ 121
Query: 121 DGIGAPKIDKPLQRHGGRLEH--NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
KI+ + G E CGSCYGA +D CCN CE+V+ AY KGW +
Sbjct: 122 ------KIEINQNKTIGEPTELVQEVKCGSCYGA-VADGICCNTCEDVKNAYAVKGWQV- 173
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
N + ++QCK + +++ E + EGC +YG ++V KVAGNFH APG HVHD+
Sbjct: 174 NIEEVEQCKNDKWVKEFNEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHN 233
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
F+ SH +N ++FG+ FPG PLDG T+ MYQY++KVVPT Y + G
Sbjct: 234 LDPVKFDASHTVNHISFGKSFPGKNYPLDGKVNTENRGGIMYQYYVKVVPTRYDYLDGRV 293
Query: 299 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
QS+QFSVT H + R LPG F Y+ SP+ V + E S FL ++CAIVGGV
Sbjct: 294 DQSHQFSVTTH-KKDLGFRQAGLPGFFLQYEFSPLMVQYEEFRQSLASFLVSLCAIVGGV 352
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
F ++ ++D IYH R +K +I GK +
Sbjct: 353 FAMAQLVDITIYHTSRYMKSRIAGGKLT 380
>gi|390603136|gb|EIN12528.1| endoplasmic reticulum-derived transport vesicle ERV46 [Punctularia
strigosozonata HHB-11173 SS5]
Length = 419
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 151/409 (36%), Positives = 229/409 (55%), Gaps = 39/409 (9%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ ++ LDA+ K ED +T +G +T +S+ ++L E Y +T ++VD
Sbjct: 5 GLFGSLKGLDAFGKTMEDVKVKTRTGAFLTFLSAAIILTFTMIEFVDYRRVNMDTSIVVD 64
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
SRGE L + +VTFP +PC +LS+D MDISGEQ D+ H+I K RLDS G +I Q
Sbjct: 65 KSRGEKLTVRMNVTFPRVPCYLLSLDVMDISGEQQRDISHNILKTRLDSTGKLIPGSQ-- 122
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+++ R + + YCGSCYGAE S+ CCN+C+ VR+AY +GW+ NPD
Sbjct: 123 --RSELESEFDRQNKPMP--DGYCGSCYGAEPSEGACCNSCDAVRQAYVNRGWSFGNPDS 178
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
I+QC +E + +++K++ EGCNI G + VNKV GN H +PG+SF G +++++ + R+
Sbjct: 179 IEQCVKENWSEKLKDQASEGCNIAGRVRVNKVIGNIHLSPGRSFQSQGRSMYELVPYLRE 238
Query: 243 SFN---ISHKINKLAF---GEHFPGV--------------VNPLDGVRWTQETPSGMYQY 282
N SH I++ AF E+ P PLDG M+QY
Sbjct: 239 DGNRHDFSHTIHEFAFEGDDEYLPDKYKVSKEMRAKMGLEAGPLDGAVGRTIKAQYMFQY 298
Query: 283 FIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-------------LPGVFFFYD 329
F+KVV T + + G T+ S+Q+S T R ++G +PG FF ++
Sbjct: 299 FLKVVSTQFRTLDGQTVNSHQYSATHFERDLDKGSEDNTAEGVHISHTTYGVPGAFFNFE 358
Query: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
+SPI + +E SF HFLT+ CAIVGGV T++ I+D+ ++ +A+KK
Sbjct: 359 ISPILIVHSETRQSFAHFLTSTCAIVGGVLTIASIVDSVLFATTKALKK 407
>gi|341884797|gb|EGT40732.1| CBN-ERV-46 protein [Caenorhabditis brenneri]
Length = 379
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 159/387 (41%), Positives = 229/387 (59%), Gaps = 12/387 (3%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ ++ DAY K +DF +T SGG++TL+++IV+ LL E R +L+ L VD
Sbjct: 2 SLLWSLKHFDAYRKPMDDFRVKTLSGGLVTLIATIVIGLLIVMETRQFLSTDVLEHLFVD 61
Query: 63 -TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG-NVIESRQ 120
T+ E + I FD+TF LPC+ ++VD MD+S E ++ DI++ RLD+ G NV E+ Q
Sbjct: 62 STTSDERVHIEFDITFNKLPCNFITVDVMDVSSEAQDNINDDIYRLRLDADGKNVSETAQ 121
Query: 121 DGIGAPKIDKPLQRHGGRLEH-NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
KI+ + E E CGSCYGA ++D CCN CE+V+ AY KGW + N
Sbjct: 122 ------KIEINQNKTVDATELIQEVKCGSCYGA-AADGICCNTCEDVKNAYAIKGWQV-N 173
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+ ++QCK + +++ E + EGC +YG ++V KVAGNFH APG HVHD+
Sbjct: 174 IEEVEQCKNDKWVKEFNEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQSMRSHVHDLHNL 233
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
F+ SH +N ++FG+ FPG PLDG T+ MYQY++KVVPT Y + G
Sbjct: 234 DPVKFDASHTVNHISFGKSFPGKNYPLDGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVD 293
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
QS+QFSVT H + R LPG F Y+ SP+ V + E S FL ++CAIVGGVF
Sbjct: 294 QSHQFSVTTH-KKDLGFRQSGLPGFFLQYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVF 352
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGKFS 386
++ ++D IYH R +K +I GK +
Sbjct: 353 AMAQLVDITIYHSSRYMKNRIAGGKLT 379
>gi|409079094|gb|EKM79456.1| hypothetical protein AGABI1DRAFT_120853 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1000
Score = 290 bits (743), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 149/394 (37%), Positives = 225/394 (57%), Gaps = 42/394 (10%)
Query: 19 EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFP 78
ED +T +G +T +++ ++L E Y T+T ++VD SRGE L +N ++TFP
Sbjct: 602 EDVKVKTRTGAFLTFIAAAIILSFTTLEFLDYRRVYTDTSIVVDKSRGEKLTVNLNITFP 661
Query: 79 ALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK--PLQRHG 136
+PC +LS+D MDISGE D+ H+I K RL++ G ++ + ++DK +Q+ G
Sbjct: 662 RVPCFLLSLDVMDISGEVQRDISHNILKTRLENNGTIVPASYSAQLQNELDKMNEVQQSG 721
Query: 137 GRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIK 196
YCGSCYG CCN C+EVR+AY +GW+ S+PD I+QCKREG+ +++K
Sbjct: 722 --------YCGSCYGGVEPASGCCNTCDEVRQAYVNRGWSFSSPDAIEQCKREGWSEKMK 773
Query: 197 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLA 254
++ EGCN+ G L VNKV GN H +PG+SF + ++++++ + RD + SH+I+ A
Sbjct: 774 DQADEGCNVSGRLRVNKVIGNIHLSPGRSFQTNSRNLYELVPYLRDENKHDFSHEIHHFA 833
Query: 255 F-------------GEHFPGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
F G +NPLDG ++ M+QYF+KVV T + + G
Sbjct: 834 FEGDDEYVYWKASAGREMKNRLGLNINPLDGAKYRTSKVQYMFQYFLKVVSTQFRTLDGK 893
Query: 298 TIQSNQFSVTEHFRSSE-------------QGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 344
+ ++Q+SVT R E Q Q LPG FF Y++SPI V + SF
Sbjct: 894 IVNTHQYSVTHFERDLEEGGGGQSPGGINIQHGAQGLPGAFFNYEISPILVVHADSRQSF 953
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
HFLT+ CAIVGGV TV+ ++D+ ++ RA+KK
Sbjct: 954 AHFLTSTCAIVGGVLTVASLVDSLLFATTRALKK 987
>gi|426196003|gb|EKV45932.1| hypothetical protein AGABI2DRAFT_207344 [Agaricus bisporus var.
bisporus H97]
Length = 1000
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 149/394 (37%), Positives = 225/394 (57%), Gaps = 42/394 (10%)
Query: 19 EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFP 78
ED +T +G +T +++ ++L E Y T+T ++VD SRGE L +N ++TFP
Sbjct: 602 EDVKVKTRTGAFLTFIAAAIILSFTTLEFLDYRRVYTDTSIVVDKSRGEKLTVNLNITFP 661
Query: 79 ALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK--PLQRHG 136
+PC +LS+D MDISGE D+ H+I K RL++ G ++ + ++DK +Q+ G
Sbjct: 662 RVPCFLLSLDVMDISGEVQRDISHNILKTRLENNGTIVPASYSAQLQNELDKMNEVQQSG 721
Query: 137 GRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIK 196
YCGSCYG CCN C+EVR+AY +GW+ S+PD I+QCKREG+ +++K
Sbjct: 722 --------YCGSCYGGVEPASGCCNTCDEVRQAYVNRGWSFSSPDAIEQCKREGWSEKMK 773
Query: 197 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLA 254
++ EGCN+ G L VNKV GN H +PG+SF + ++++++ + RD + SH+I+ A
Sbjct: 774 DQADEGCNVSGRLRVNKVIGNIHLSPGRSFQTNSRNLYELVPYLRDENKHDFSHEIHHFA 833
Query: 255 F-------------GEHFPGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
F G +NPLDG ++ M+QYF+KVV T + + G
Sbjct: 834 FEGDDEYVYWKASAGREMKNRLGLNINPLDGAKYRTSKVQYMFQYFLKVVSTQFRTLDGK 893
Query: 298 TIQSNQFSVTEHFRSSE-------------QGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 344
+ ++Q+SVT R E Q Q LPG FF Y++SPI V + SF
Sbjct: 894 IVNTHQYSVTHFERDLEEGGGGQSPGGINIQHGAQGLPGAFFNYEISPILVVHADSRQSF 953
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
HFLT+ CAIVGGV TV+ ++D+ ++ RA+KK
Sbjct: 954 AHFLTSTCAIVGGVLTVASLVDSLLFATTRALKK 987
>gi|17568835|ref|NP_510575.1| Protein ERV-46 [Caenorhabditis elegans]
gi|3878494|emb|CAB01889.1| Protein ERV-46 [Caenorhabditis elegans]
Length = 380
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 157/390 (40%), Positives = 232/390 (59%), Gaps = 17/390 (4%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ ++ DAY K +DF +T SGG++TL+++I ++LL E + +L+ L VD
Sbjct: 2 SLLWSLKHFDAYRKPMDDFRVKTLSGGLVTLIATIAIVLLIVLETKQFLSTEVLEHLFVD 61
Query: 63 -TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG-NVIESRQ 120
T+ E + I FD+TF LPC+ ++VD MD+S E ++ DI++ RLD +G N+ ES Q
Sbjct: 62 STTSDERVHIEFDITFTKLPCNFITVDVMDVSSEAQENINDDIYRLRLDPEGRNISESAQ 121
Query: 121 DGIGAPKIDKPLQRHGGRLEHN----ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWA 176
KI+ + ++ +E E CGSCYGA ++D CCN C++V+ AY KGW
Sbjct: 122 ------KIE--INQNKTSVETTDVIQEVKCGSCYGA-AADGICCNTCDDVKSAYAVKGWQ 172
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+ N + ++QCK + +++ E + EGC +YG ++V KVAGNFH APG HVHD+
Sbjct: 173 V-NIEEVEQCKNDKWVKEFNEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDL 231
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
F+ SH +N ++FG+ FPG PLDG T MYQY++KVVPT Y + G
Sbjct: 232 HNLDPVKFDASHTVNHVSFGKSFPGKNYPLDGKVNTDNRGGIMYQYYVKVVPTRYDYLDG 291
Query: 297 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
QS+QFSVT H + R LPG F Y+ SP+ V + E SF FL ++CAIVG
Sbjct: 292 RVDQSHQFSVTTH-KKDLGFRQSGLPGFFLQYEFSPLMVQYEEFRQSFASFLVSLCAIVG 350
Query: 357 GVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
GVF ++ ++D IYH R +K +I GK +
Sbjct: 351 GVFAMAQLVDITIYHSSRYMKSRIAGGKLT 380
>gi|443894052|dbj|GAC71402.1| hypothetical protein PANT_3d00017 [Pseudozyma antarctica T-34]
Length = 461
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 153/405 (37%), Positives = 230/405 (56%), Gaps = 51/405 (12%)
Query: 16 KINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDV 75
K +D RT +G +IT+VS++++++L E Y + L VD SRGE L +N D+
Sbjct: 45 KTMDDVRIRTNAGALITMVSALLIVVLTIGEFVDYRTVHLKPSLEVDRSRGEKLTVNMDI 104
Query: 76 TFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRH 135
TFP +PC +LS+D MDISGE D++HDI + R+ G I + K L+
Sbjct: 105 TFPRVPCYLLSLDVMDISGEHVNDIQHDIERTRVTHDGKPITQGK---------KNLKGD 155
Query: 136 GGRLE--HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQ 193
R+ + YCG CYG + CCN C+EVREAY +KGW+ ++PD +DQC EG+
Sbjct: 156 AARIAATKGKDYCGDCYGGQPPASGCCNTCDEVREAYVRKGWSFADPDHVDQCVAEGWSD 215
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHK 249
+IKE+ EGC I G L VNKV G+FH +PGK+F ++ VH+HD++ + + + H
Sbjct: 216 KIKEQNKEGCRISGKLHVNKVVGSFHLSPGKAFQRNSVHIHDLVPYLSGTGAEHHDFGHI 275
Query: 250 INKLAFG----------------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
I+ +FG + GV +PL+GVR + M+QYF+KVV T +
Sbjct: 276 IHDFSFGSEQQYHGLTTAKEREVKQKLGVKDPLEGVRAQTQQSQFMFQYFLKVVSTEFRP 335
Query: 294 VSGHTIQSNQFSVTEHFRS-------------SEQGR-------LQTLPGVFFFYDLSPI 333
+SG T+++ Q+SVT + R S +G +PGVFF Y++SP+
Sbjct: 336 LSGDTLKTQQYSVTTYERDLSPGANAAAMAGMSNEGSGAHISHGFAGVPGVFFNYEISPL 395
Query: 334 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
K +E S HFLT+ CAIVGG+ TV+GI+D+ +Y+ +R +++
Sbjct: 396 KTIHSEHRQSLSHFLTSTCAIVGGILTVAGIVDSLVYNSRRRLRR 440
>gi|170089933|ref|XP_001876189.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
gi|164649449|gb|EDR13691.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
Length = 421
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 150/404 (37%), Positives = 225/404 (55%), Gaps = 39/404 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ +DA+ K ED +T +G ++T++S+ ++L F E Y +T ++VD SRGE
Sbjct: 9 LKGVDAFGKTTEDVKVKTRTGALLTIISAAIILAFSFVEFIDYRAVNIDTSIVVDKSRGE 68
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L +N +VTFP +PC +LS+D MDISGE D+ H++ K RLD+ G + +
Sbjct: 69 KLTVNLNVTFPRVPCYLLSLDIMDISGELQRDISHNVMKVRLDTHGKEVPNSHSAELRND 128
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+DK E YCGSC+G + CCN CE+VR AY +GW+ SNP+ I+QCK
Sbjct: 129 LDKMND------AKRENYCGSCFGGLEPEGGCCNTCEDVRLAYVNRGWSFSNPEAIEQCK 182
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN-- 245
EG+ ++KE+ EGCNI G + VNKV GN H +PG+SF + ++++++ + RD N
Sbjct: 183 NEGWADKLKEQADEGCNISGRIRVNKVIGNIHLSPGRSFQTNARNLYELVPYLRDDGNRH 242
Query: 246 -ISHKINKLAFG-----EHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVV 287
SH I+ LAF +++ NPLDG M+QYF+KVV
Sbjct: 243 DFSHTIHHLAFEGDDEYDYWKAAAGSAMRQRMGLTENPLDGAIARTAKAQYMFQYFLKVV 302
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-------------LQTLPGVFFFYDLSPIK 334
T + + G + ++Q+S T+ R +G + LPG FF +++SPI
Sbjct: 303 STQFRTLDGRKVNTHQYSTTQFERDLTEGAAGETAGGIHVQHGVSGLPGAFFNFEISPIL 362
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
V E SF HFLT+ CAI+GGV TV+ IID+ ++ R +KK
Sbjct: 363 VVHAETRQSFAHFLTSTCAIIGGVLTVASIIDSILFATNRRLKK 406
>gi|389744843|gb|EIM86025.1| ER-derived vesicles protein ERV46 [Stereum hirsutum FP-91666 SS1]
Length = 419
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 150/404 (37%), Positives = 225/404 (55%), Gaps = 40/404 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ +DA+ K ED +T +G +TL+++ ++L E Y +T + VD SRGE
Sbjct: 9 LKGVDAFGKTMEDVKVKTRTGAFLTLMAAAIILTFTTMEFFDYRRVTMDTSVEVDRSRGE 68
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L + +VTFP +PC +LS+D MDISGE D+ H+I K RL+S G + + + +
Sbjct: 69 KLTVRMNVTFPRVPCYLLSLDVMDISGETQRDISHNIVKTRLNSDGTQVPNSANMQLRNE 128
Query: 128 IDK-PLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+DK QR G YCGSCYG + CCN C++VREAY ++GW+ NPD I+QC
Sbjct: 129 LDKLNAQRQDG-------YCGSCYGGTPPEGGCCNTCDQVREAYVQRGWSFGNPDSIEQC 181
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN- 245
+E + +++ E+ EGCNI G + VNKV GN H +PGKSF S +++++ + +D N
Sbjct: 182 VQEHWSEKLHEQSSEGCNISGRVRVNKVIGNIHLSPGKSFQNSASSIYELVPYLKDDKNR 241
Query: 246 --ISHKINKLAFG-----------------EHFPGVVNPLDGVRWTQETPSGMYQYFIKV 286
SH ++ L FG + NPLDG PS M+QYF+K
Sbjct: 242 HDFSHIVHSLTFGADDEYDSRKTKIANEMKQRMGLDSNPLDGYHARTSQPSTMFQYFLKA 301
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT------------LPGVFFFYDLSPIK 334
V T + + G + ++Q+ VT + R + + +T +PG FF Y++SPIK
Sbjct: 302 VSTQFRTIDGKVVNTHQYQVTHYNRDAGNPQDKTNQGVNVMHGITGVPGAFFNYEISPIK 361
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
V E SF HFLT+ CAIVGGV TV+ I+D+ ++ + +KK
Sbjct: 362 VIHEETRQSFAHFLTSTCAIVGGVLTVTSILDSVLFAANQRLKK 405
>gi|164655211|ref|XP_001728736.1| hypothetical protein MGL_4071 [Malassezia globosa CBS 7966]
gi|159102620|gb|EDP41522.1| hypothetical protein MGL_4071 [Malassezia globosa CBS 7966]
Length = 427
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 145/410 (35%), Positives = 237/410 (57%), Gaps = 45/410 (10%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++R LDA+ ++++D RT G ++TL S++++L+L SE Y T +L VD
Sbjct: 5 AFFGQLRGLDAFGRMSDDVRIRTNVGALLTLTSALMILVLIVSEFLDYRRVQTSPRLEVD 64
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
SRGE L + F+VTFP +PC +LS+D +D+ GE +DV HD+ ++RLD G +
Sbjct: 65 LSRGERLAVQFNVTFPRIPCYLLSLDVVDVVGETQMDVHHDVERRRLDETGKPV------ 118
Query: 123 IGAPKIDKPLQRHGGRL--EHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+ ++ + L+ R+ E YCG CYGA+ + CCN+C+ VREAY W+ ++P
Sbjct: 119 --SEEVIRELESEAKRVIAERGPDYCGDCYGADPPEGGCCNSCDAVREAYMLHNWSFTSP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC +E + + ++E+ EGCNI G + VNKV GN HF PG++FH++ +H HD++ +
Sbjct: 177 DDIEQCAQEHWSEHVREQNHEGCNIAGEVRVNKVVGNLHFIPGRTFHRNDIHTHDLVPYL 236
Query: 241 R----DSFNISHKINKLAFG-------------------EHFPGVVNPLDGVRWTQETPS 277
D + HKI++ +FG ++ G+ N L+G + +
Sbjct: 237 HGTGDDVHHFGHKIHRFSFGMEDEFAIERTSRGRRQGPLKNRMGIKNALEGRSAKTLSSN 296
Query: 278 GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ------------GRLQTLPGVF 325
M+QYF+KVVP ++GH + + Q+S T + R+ E ++ +PGV+
Sbjct: 297 YMFQYFLKVVPVEVHKLNGHEMSTYQYSATSYERNLEDFDRGGQMSGHIVRMIEGIPGVY 356
Query: 326 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 375
F Y++SP++V TE H S H ++N+ A++GG+ TV+G+ID IY +R
Sbjct: 357 FNYEISPLRVIQTEWHHSIWHLVSNLFALIGGIVTVAGLIDGAIYRSRRT 406
>gi|393212588|gb|EJC98088.1| endoplasmic reticulum-derived transport vesicle ERV46 [Fomitiporia
mediterranea MF3/22]
Length = 421
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 151/404 (37%), Positives = 230/404 (56%), Gaps = 39/404 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ +DA+ K ED +T +G +T++S+ ++L E Y ET ++VD SRGE
Sbjct: 9 LKGIDAFGKTMEDVKVKTKTGAFLTILSAAIILAFTTIEFLDYRRVNLETSIVVDRSRGE 68
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L + +VTFP +PC +LS+D MDISGE D+ H+I K RLD+ G V+ + K
Sbjct: 69 RLTVRMNVTFPKVPCYLLSLDVMDISGEAQRDISHNIVKARLDANGAVVPNSHSAELRNK 128
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+D + + YCGSCYG + + CCN CEEVR+AY KGW+ SNPD I+QC
Sbjct: 129 LDVMND------QTQDNYCGSCYGGVAPEGGCCNTCEEVRQAYVNKGWSFSNPDSIEQCV 182
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN-- 245
RE + +++ E+ EGCNI G L VNKV GN H +PG+SF + +++H+++ + ++ N
Sbjct: 183 REHWSEKLHEQSTEGCNISGRLRVNKVIGNIHLSPGRSFQTNYMNIHELVPYLKEDKNRH 242
Query: 246 -ISHKINKLAF----------GEHFPGV-------VNPLDGVRWTQETPSGMYQYFIKVV 287
H +++L+F E G+ NPLDG + M+QYF+KVV
Sbjct: 243 DFGHIVHELSFEGDDEYNFRKKERSKGIKKKLGIEANPLDGAVGKAASLQYMFQYFVKVV 302
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-QT------------LPGVFFFYDLSPIK 334
T + + G T++++Q+S T R G + QT +PGVF Y++SP+
Sbjct: 303 STKFELMDGQTVKTHQYSATHFERDLTTGAIGQTKEGVHIAHTNVGMPGVFINYEISPLL 362
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
V +E SF HFLT+ CAI+GGV T++ I+D+ ++ R +KK
Sbjct: 363 VVHSETRQSFAHFLTSTCAIIGGVLTIATIVDSVVFATGRRLKK 406
>gi|345569114|gb|EGX51983.1| hypothetical protein AOL_s00043g717 [Arthrobotrys oligospora ATCC
24927]
Length = 397
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 160/400 (40%), Positives = 226/400 (56%), Gaps = 38/400 (9%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+++ LDA+ K ED RT SGG++T+ S +V+ L E Y ++L+VD +R
Sbjct: 5 SRLMRLDAFTKTVEDARIRTSSGGIVTIFSVLVIFCLVIGEWNDYRKVSVISELIVDKTR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP +PC +L++D MD+SG+ V H I K RLD G +IES
Sbjct: 65 GEQMEIHLNITFPHIPCELLTLDVMDVSGDLQPSVSHGIGKHRLDKSGGIIES------- 117
Query: 126 PKIDKPLQRHGGRLEH-NETYCGSCYGAESSDED----CCNNCEEVREAYRKKGWALSNP 180
K L+ H +H + +YCG CYGA + D CC C++VREAY KGWA +
Sbjct: 118 ----KFLELHPEHPKHLDPSYCGECYGAVAPDTSKKAGCCQTCDDVREAYAAKGWAFGDG 173
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ QC+ EG+ + +KE+ GEGC I G L VNKV GNFH APGKSF + +HVHD+ +
Sbjct: 174 TGVHQCEEEGYKEMLKEQAGEGCRIDGHLWVNKVVGNFHIAPGKSFSNAQMHVHDLANYL 233
Query: 241 RDSF--NISHKINKLAFGEHFPGVV--------NPLDGVRWTQETPSGMYQYFIKVVPTV 290
+ + +H IN L+FG P + NPLD + Y YF+K+V T
Sbjct: 234 QGDVHHDFTHTINALSFGPPLPTDLLHENHHQQNPLDATSKKTSDRNYNYLYFLKIVSTS 293
Query: 291 YTDVS-GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTE 339
Y + G+TI ++Q+SVT H RS E G+ +PG+FF YD+SP+KV E
Sbjct: 294 YEHLDHGYTIHTHQYSVTSHERSLEGGKDDVHPGTVHARGGIPGIFFSYDISPMKVVNRE 353
Query: 340 EHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
SF FLT++CAI+GG TV+ +D +Y G R I K
Sbjct: 354 IRTKSFSGFLTSICAIIGGTLTVAAALDRGLYEGARRIGK 393
>gi|443734706|gb|ELU18587.1| hypothetical protein CAPTEDRAFT_139951 [Capitella teleta]
Length = 285
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 138/277 (49%), Positives = 180/277 (64%), Gaps = 15/277 (5%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ N +R DAYPK EDF +T+ G +T+VS I+M +LF SE YL +L VDT
Sbjct: 6 VFNSLRQFDAYPKTLEDFRVKTYGGAAVTIVSGILMFVLFVSEFNYYLTTEVHPELFVDT 65
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQD 121
+RG+ L+IN D+TFP + CS L++DAMD+SGEQ +DV HDIFK+RLD G + E ++
Sbjct: 66 ARGQKLKINVDMTFPTVGCSFLTLDAMDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKE 125
Query: 122 GIGAPKID----KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
+G D PL+ + C SCYGAES CCN C EVREAYR+KGWA
Sbjct: 126 DLGDKSKDFAVKNPLK---------DDRCESCYGAESEAHKCCNTCNEVREAYRQKGWAF 176
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ I+QC REG++ +++E + EGC IYGFLEVNKVAGNFH APG+SF Q H+HD+
Sbjct: 177 VDAQNIEQCMREGYVSQLEEGKNEGCRIYGFLEVNKVAGNFHVAPGRSFSQHHAHIHDMQ 236
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE 274
A Q FN+SH+I L+FG+ +PG VNPLD E
Sbjct: 237 ALQGMKFNMSHRIQHLSFGDDYPGQVNPLDASEQVTE 273
>gi|308483051|ref|XP_003103728.1| CRE-ERV-46 protein [Caenorhabditis remanei]
gi|308259746|gb|EFP03699.1| CRE-ERV-46 protein [Caenorhabditis remanei]
Length = 380
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 160/388 (41%), Positives = 227/388 (58%), Gaps = 13/388 (3%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ ++ DAY K +DF +T SGG++TL+++IV+ LL E + +L+ L VD
Sbjct: 2 SLLWSLKHFDAYRKPMDDFRVKTLSGGLVTLIATIVIGLLIVLETKQFLSTDVLEHLFVD 61
Query: 63 -TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG-NVIESRQ 120
T+ E + I FD+TF LPC+ ++VD MD+S E ++ DI++ RLD+ G N+ ES Q
Sbjct: 62 STTSDERVHIEFDITFNKLPCNFITVDVMDVSSEAQDNINDDIYRLRLDADGRNISESAQ 121
Query: 121 D-GIGAPK-IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
I K I P + E CGSCYGA ++D CCN CE+V+ AY KGW +
Sbjct: 122 KIEINQNKTIADPTELT------QEVKCGSCYGA-AADGICCNTCEDVKSAYAIKGWQV- 173
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
N + ++QCK + +++ E + EGC +YG ++V KVAGNFH APG HVHD+
Sbjct: 174 NIEEVEQCKNDKWVKEFTEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHN 233
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
F+ SH +N L FG+ FPG PLDG T+ MYQY++KVVPT Y + G
Sbjct: 234 LDPVKFDASHTVNHLTFGKSFPGKHYPLDGKVNTENRGGIMYQYYVKVVPTRYDYLDGRV 293
Query: 299 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
QS+QFSVT H + R LPG F Y+ SP+ V + E S FL ++CAIVGGV
Sbjct: 294 DQSHQFSVTTH-KKDLGFRQSGLPGFFVQYEFSPLMVQYEEFRQSLASFLVSLCAIVGGV 352
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
F ++ +ID IY R +K +I GK +
Sbjct: 353 FAMAQLIDITIYQTHRYMKNRIAGGKLT 380
>gi|402218655|gb|EJT98731.1| ER to Golgi transport-related protein [Dacryopinax sp. DJM-731 SS1]
Length = 455
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 161/437 (36%), Positives = 240/437 (54%), Gaps = 68/437 (15%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ ++++ LDA+ K ED +T +G ++TL+S+ +++ E Y T ++VD
Sbjct: 5 GVFSQLKGLDAFGKTMEDVKVKTRTGALLTLISACIIVFFTLMEFVDYRRIHLATSVVVD 64
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
SRGE L +N ++TFP +PC +LS+D MDISGE+ DV H++ + RL QG I
Sbjct: 65 RSRGEKLLVNMNITFPRVPCYLLSLDVMDISGERQHDVTHNMQRVRLSPQGIPIPDVLPE 124
Query: 123 IG-APKIDKPLQ-RHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G + +I+K ++ R GG CGSCYG + CCN CE+VREAY ++GW+ S+P
Sbjct: 125 SGLSNEIEKVIEAREGGE-------CGSCYGGDPPASGCCNTCEDVREAYMRRGWSFSSP 177
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ I QC EG+ +++K + EGCNI G + VNKV GNFHF+PGKSF + +HVHD++ +
Sbjct: 178 EDIKQCVNEGWTEKVKSQSEEGCNISGRVRVNKVIGNFHFSPGKSFQTNAMHVHDLVPYL 237
Query: 241 RDS--FNISHKINKLAF---GEHFPGV--------------VNPLDGVRW---------T 272
+D+ + H+I+ F GE V NPLDG+R T
Sbjct: 238 KDANRHDFGHEIHYFGFESDGEQQAEVGRLSKSIKTKLGIDKNPLDGLRAHVRSLSRRET 297
Query: 273 QETP-----------------SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 315
+ P + M+QYF+KVV T Y + G + S+Q+SVT + R Q
Sbjct: 298 RRVPGMSSNRRSYRPEQTEKSNYMFQYFLKVVSTKYEMLRGTVVNSHQYSVTSYERDLSQ 357
Query: 316 GR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
G + +PG FF +++SP+ V E SF HFLT+ CAIVGGV TV
Sbjct: 358 GDKAQRDEHGTMTSHGVSGIPGAFFNFEISPMVVVHQETRQSFAHFLTSTCAIVGGVLTV 417
Query: 362 SGIIDAFIYHGQRAIKK 378
+ I D+ ++ +R +KK
Sbjct: 418 AAIFDSMLFSAERKLKK 434
>gi|443734710|gb|ELU18591.1| hypothetical protein CAPTEDRAFT_139954 [Capitella teleta]
Length = 285
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 138/277 (49%), Positives = 180/277 (64%), Gaps = 15/277 (5%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ N +R DAYPK EDF +T+ G +T+VS I+M +LF SE YL +L VDT
Sbjct: 6 VFNSLRQFDAYPKTFEDFRVKTYGGAAVTIVSGILMFVLFVSEFNYYLITEVHPELFVDT 65
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQD 121
+RG+ L+IN D+TFP + CS L++DAMD+SGEQ +DV HDIFK+RLD G + E ++
Sbjct: 66 ARGQKLKINVDMTFPTVGCSFLTLDAMDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKE 125
Query: 122 GIGAPKID----KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
+G D PL+ + C SCYGAES CCN C EVREAYR+KGWA
Sbjct: 126 DLGDKSKDFAVKNPLK---------DDRCESCYGAESEAHKCCNTCNEVREAYRQKGWAF 176
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ I+QC REG++ +++E + EGC IYGFLEVNKVAGNFH APG+SF Q H+HD+
Sbjct: 177 VDAQNIEQCMREGYVSQLEEGKNEGCRIYGFLEVNKVAGNFHVAPGRSFSQHHAHIHDMQ 236
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE 274
A Q FN+SH+I L+FG+ +PG VNPLD E
Sbjct: 237 ALQGMKFNMSHRIQHLSFGDDYPGQVNPLDASEQVTE 273
>gi|343425773|emb|CBQ69306.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 435
Score = 285 bits (728), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 153/417 (36%), Positives = 235/417 (56%), Gaps = 51/417 (12%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+ I ++R +DA+ K +D RT +G +ITLVS +++++L E Y + L V
Sbjct: 4 NGIFGQLRGIDAFSKTMDDVRIRTNAGALITLVSVLLIVVLTIGEFVDYRTVHLKPALEV 63
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D SRGE L +N ++TFP +PC +LS+D MDISGE D++HDI + R+ G V+E +
Sbjct: 64 DRSRGEKLTVNMNITFPRVPCYLLSLDVMDISGEHVNDIQHDIERTRISHDGKVVEQGK- 122
Query: 122 GIGAPKIDKPLQRHGGRLEHNE--TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
K L+ R+ + + YCG CYG + CCN C+EVREAY ++GW+ ++
Sbjct: 123 --------KHLKGDAARIANTKGKDYCGDCYGGQPPASGCCNTCDEVREAYVRRGWSFAD 174
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
PD +DQC EG+ +IK++ EGC I G L VNKV G+FH +PGK+F ++ +H+HD++ +
Sbjct: 175 PDHVDQCVAEGWSDKIKQQNKEGCRISGKLHVNKVVGSFHLSPGKAFQRNSMHIHDLVPY 234
Query: 240 ----QRDSFNISHKINKLAFGEHFP----------------GVVNPLDGVRWTQETPSGM 279
+ + H I++ +FG GV +PL GVR + M
Sbjct: 235 LSGTGAEHHDFGHIIHEFSFGSEQEYHGLTTAKERAVKAKLGVKDPLAGVRAQTQQSQFM 294
Query: 280 YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR--------------------LQ 319
+QYF+KVV T + ++G T+++ Q+SVT + R G
Sbjct: 295 FQYFVKVVATEFRPLAGETLKTQQYSVTTYERDLSPGASAAALAGMSNEGSGAHISHGFA 354
Query: 320 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+PGVFF Y++SP+K E S HFLT+ CAIVGG+ TV+GI+D+ +Y+ +R +
Sbjct: 355 GVPGVFFNYEISPLKTIHAEYRQSLAHFLTSTCAIVGGILTVAGILDSLVYNSRRRL 411
>gi|299743758|ref|XP_002910702.1| ER-derived vesicles protein ERV46 [Coprinopsis cinerea
okayama7#130]
gi|298405804|gb|EFI27208.1| ER-derived vesicles protein ERV46 [Coprinopsis cinerea
okayama7#130]
Length = 416
Score = 283 bits (725), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 148/403 (36%), Positives = 226/403 (56%), Gaps = 42/403 (10%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ +DA+ K ED +T +G +TL+S+ ++L + E Y +T ++VD SRGE
Sbjct: 9 LKGIDAFGKTTEDVKVKTRTGAFLTLLSAAIILAITTMEFFDYRKVFIDTSIVVDRSRGE 68
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L +N +VTFP +PC +LS+D MDISGE D+ H++ K RLD G + +
Sbjct: 69 KLTVNLNVTFPKVPCYLLSLDIMDISGEVQRDISHNVLKVRLDRSGKEVPGSHTADLSAD 128
Query: 128 IDKPLQRHGGRLEHN--ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
++K L H E YCGSCYG + CCN CE+VR AY +GW+ +NPD I+Q
Sbjct: 129 VEK--------LSHTKKEGYCGSCYGGLEPESGCCNTCEDVRMAYVNRGWSFTNPDAIEQ 180
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
C+ EG+ +++++ EGCNI G + VNKV GN H +PG+SF + ++++++ + RD N
Sbjct: 181 CRNEGWADKLRDQADEGCNISGRIRVNKVIGNIHMSPGRSFQSNSRNIYELVPYLRDDQN 240
Query: 246 ---ISHKINKLAF-------------GEHFPGVV----NPLDGVRWTQETPSGMYQYFIK 285
SH I+ F G+ + NPLDG+ M+QYF+K
Sbjct: 241 RHDFSHIIHHFGFEGDDEYDYWKAEAGQKMRRRMGLTENPLDGIEARTWKSQYMFQYFLK 300
Query: 286 VVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG--------RLQ----TLPGVFFFYDLSPI 333
VV T + + G T+ ++Q+S T R +G R+Q LPG FF Y++SPI
Sbjct: 301 VVSTRFRTLDGQTVNTHQYSTTSFERDLGEGMNQDDGGIRVQHGVSGLPGAFFNYEISPI 360
Query: 334 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+V E SF HFLT+ CA++GGV TV+ ++D+ ++ +AI
Sbjct: 361 QVVHAESRQSFAHFLTSTCAVIGGVLTVAALVDSALFVTAKAI 403
>gi|395324643|gb|EJF57079.1| endoplasmic reticulum-derived transport vesicle ERV46 [Dichomitus
squalens LYAD-421 SS1]
Length = 423
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 150/408 (36%), Positives = 230/408 (56%), Gaps = 41/408 (10%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ +DA+ K ED +T +G ++T++++ ++L E Y +T ++VD S
Sbjct: 6 LNALKGVDAFGKTMEDVKVKTRTGALLTIIAAAIILSFTTIEFFDYRRVFVDTSIVVDRS 65
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RGE L +N ++TFP +PC +LS+D MDISGE D+ H+I K RLD +G +
Sbjct: 66 RGEKLTVNMNITFPRVPCYLLSLDVMDISGETQSDITHNILKTRLDEKGKPVSHSLIAEL 125
Query: 125 APKIDK-PLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
+DK QR G YCGSCYG + CCN CEEVR+AY +GW+ + PD I
Sbjct: 126 QNDLDKLNEQRQSG-------YCGSCYGGIEPEGGCCNTCEEVRQAYVNRGWSFNRPDSI 178
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
+QC +EG+ ++KE+ EGCNI G + VNKV GN H +PG+SF S ++++++ + R
Sbjct: 179 EQCVKEGWSDKLKEQAHEGCNIAGRVRVNKVVGNIHLSPGRSFRTSAHNLYELVPYLRTD 238
Query: 244 FN---ISHKINKLAF---GEHFP-------------GV-VNPLDGVRWTQETPSGMYQYF 283
N +H+I+ AF E+ P G+ NPLDG + M+QYF
Sbjct: 239 GNRHDFTHQIHHFAFEGDDEYDPRNAKLGKELKNRLGIDANPLDGTQGRTIKQQYMFQYF 298
Query: 284 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-------------LPGVFFFYDL 330
+KVV T + + G + ++Q+S T R ++G + +PG FF Y++
Sbjct: 299 LKVVSTQFQTIDGKKVGTHQYSATHFERDLDKGPSEDSPAGLHVAHGNGGIPGAFFNYEI 358
Query: 331 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
SP+ + E SF HFLT+ CAIVGGV TV+ +ID+ ++ ++A KK
Sbjct: 359 SPLLIRHVETRQSFAHFLTSTCAIVGGVLTVASLIDSLLFATRKAFKK 406
>gi|358391585|gb|EHK40989.1| ER-derived vesicle Erv46-like protein [Trichoderma atroviride IMI
206040]
Length = 422
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 163/426 (38%), Positives = 230/426 (53%), Gaps = 61/426 (14%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ++ RT SGG++T+VS +V+L L + E Y V +L+VD
Sbjct: 2 APKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVLFLSWGEWSSYRRIVVHPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESR 119
RGE + I+ ++TFP +PC +L++D MD+SGEQ V H I K RL G VIES
Sbjct: 62 KGRGERMDIHLNITFPNMPCELLTLDVMDVSGEQQHGVAHGITKLRLQPPSRGGGVIESN 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKG 174
L + + EH N YCG CYGA + CCN C+EVREAY +
Sbjct: 122 S-----------LAQLHEKAEHLNPDYCGGCYGATAPANAEKPGCCNTCDEVREAYAQAS 170
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
WA + ++QC+RE + +R+ ++ EGC I G L+VNKV GNFH APG+SF +HVH
Sbjct: 171 WAFGRGEGVEQCEREHYSERLDQQREEGCRIEGLLQVNKVVGNFHLAPGRSFSNGNMHVH 230
Query: 235 DI-----LAFQRDSFNISHKINKLAFGEHFPGVV---------------NPLDGVRWTQE 274
D+ L + + +H I+ L FG P V NPLDG+
Sbjct: 231 DLKNYWDLPNGMKAHDFTHVIHSLRFGPQLPPEVIARMGRRTAWTNHHLNPLDGIHQETS 290
Query: 275 TPSGMYQYFIKVVPTVY---------TDVSGHTIQSNQFSVTEHFRS------SEQGRLQ 319
P+ Y YF+K+VPT Y S +++++Q+SVT H RS +++G +
Sbjct: 291 DPNFNYMYFVKIVPTSYLPLGWEQKSASASDGSVETHQYSVTSHKRSLMGGDDAKEGHAE 350
Query: 320 TL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 372
L PGVFF YD+SP+KV EE +FL FL+ +CAIVGG TV+ ID ++ G
Sbjct: 351 RLHSKGGIPGVFFSYDISPMKVINREERAKTFLGFLSGLCAIVGGTLTVAAAIDRGLFEG 410
Query: 373 QRAIKK 378
+KK
Sbjct: 411 ATRLKK 416
>gi|134054958|emb|CAK36967.1| unnamed protein product [Aspergillus niger]
Length = 406
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 165/412 (40%), Positives = 226/412 (54%), Gaps = 53/412 (12%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGVIT+ S +V+L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARVRTTSGGVITIASLLVILWLVWGEWADYRRVVVMPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ +VTFP LPC +L++D MD+SGEQ V H I K RL S G VI+
Sbjct: 65 GEKMEIHLNVTFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRLTSAAEGGRVID----- 119
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAES----SDEDCCNNCEEVREAYRKKGWALS 178
+ A ++ K L + YCG CYGA + S CCN C+EVREAY ++ WA
Sbjct: 120 VKALELAKHL---------DPDYCGECYGATAPAGASKPGCCNTCDEVREAYAQQQWAFG 170
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ ++QC+ EG+ +RI + EGC + G L VNKV GNFH APG+SF +HVHD+
Sbjct: 171 KGENVEQCELEGYAERIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVHDLAN 230
Query: 239 F------QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMY 280
F + ++H+I++L FG P + NPLDG + P Y
Sbjct: 231 FFDADLPDAEKHTMTHEIHQLRFGPQLPDELSDRWQWTDHHHTNPLDGTKQETNEPGYNY 290
Query: 281 QYFIKVVPTVYTDVSGHT-IQSNQFSVTEHFRS------SEQGRLQTL------PGVFFF 327
YF+KVV T Y + I+++Q+SVT H RS S++G + L PGVF
Sbjct: 291 MYFVKVVSTSYLPLGWDPLIETHQYSVTSHKRSLMGGDASDEGHKERLHAANGIPGVFVN 350
Query: 328 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
YD+SP+KV E +F FLT VCAI+GG TV+ +D +Y G +KK
Sbjct: 351 YDISPMKVINREARPKTFTGFLTGVCAIIGGTLTVAAALDRGLYEGVSRMKK 402
>gi|409042254|gb|EKM51738.1| hypothetical protein PHACADRAFT_150385 [Phanerochaete carnosa
HHB-10118-sp]
Length = 422
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 154/412 (37%), Positives = 231/412 (56%), Gaps = 55/412 (13%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ LDA+ K ED +T +G +T++S+ ++L + E Y +T + VD SRGE
Sbjct: 9 LKGLDAFGKTMEDVKVKTRTGAFLTILSAAIILAITTMEFFDYRRVNVDTSIEVDKSRGE 68
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN------VIESRQD 121
L ++F+VTFP +PC +LS+D MDISGE D+ H++ K RL+ QGN ++E R D
Sbjct: 69 KLIVSFNVTFPRVPCYLLSLDVMDISGETQTDIVHNVIKTRLNEQGNPVPANKIVELRND 128
Query: 122 GIGAPKIDK-PLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
IDK QR G YCGSCYG CCN CE+VR+AY +GW+ + P
Sbjct: 129 ------IDKLNEQRQDG-------YCGSCYGGVEPAGGCCNTCEDVRQAYVNRGWSFTAP 175
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC +EG+ +++++ EGCN G L VNKV GN H +PG+SF +++DI+ +
Sbjct: 176 DSIEQCAQEGWADKLRDQANEGCNAAGKLRVNKVVGNIHLSPGRSFRSGSHNIYDIVPYL 235
Query: 241 RDSFN---ISHKINKLAFG----------------EHFPGVVN-PLDGVRWTQETPSGMY 280
++ N SH ++ AF + G+ + PLDG + M+
Sbjct: 236 KEDGNRHDFSHTVHAFAFAGDDEFNFQKADHGNSLKRRLGIADGPLDGTTQKTSKQAYMF 295
Query: 281 QYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---------RSSEQGR-----LQTLPGVFF 326
QYF+KVV T + + G +I+++Q S T HF +S+QG + +PG FF
Sbjct: 296 QYFLKVVSTQFITLDGKSIKTHQHSAT-HFERDLSKGIAENSQQGMHVMHGMTGIPGAFF 354
Query: 327 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
Y++SPI V E SF HFLT+ CA+VGGV TV+ +ID+ ++ + +KK
Sbjct: 355 NYEISPILVVHRETRQSFAHFLTSTCAVVGGVLTVASLIDSMLFATSKKLKK 406
>gi|322708973|gb|EFZ00550.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Metarhizium anisopliae ARSEF 23]
Length = 429
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 160/428 (37%), Positives = 228/428 (53%), Gaps = 64/428 (14%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGGV+T++S V+L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFTKTVDEARIRTTSGGVVTIISLFVVLFLSWGEWAEYRRVVVRPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE ++I+ ++TFP +PC +L++D MD+SGEQ V H + RL R + G
Sbjct: 65 GERMQIHLNMTFPRMPCELLTLDVMDVSGEQQHGVSHGVKNVRL---------RPESQGG 115
Query: 126 PKID-KPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSN 179
ID K ++ H +H + +YCG CYGA + CCN C+EVREAY +GWA
Sbjct: 116 GVIDIKSMKVHDDPADHLDPSYCGECYGATAPPNARKAGCCNTCDEVREAYASQGWAFGR 175
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+ ++QC RE + +R+ E+ EGC + G LEVNKV GNFH APG+SF +HVHD+ +
Sbjct: 176 GENVEQCTREHYAERLDEQREEGCRVEGHLEVNKVVGNFHLAPGRSFSNGNMHVHDLKNY 235
Query: 240 QR----DSFNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPSGM 279
+ +H I++L FG P V NPLDG R P+
Sbjct: 236 WETPNGKQHDFTHTIHQLRFGPQLPAAVSDRLGKGSMPWTNHHLNPLDGTRQEIGDPAFN 295
Query: 280 YQYFIKVVPTVY---------TDVSGHT-------IQSNQFSVTEHFRSSEQGRLQT--- 320
Y YF+K+VPT Y + +G T ++++Q+SVT H RS E G
Sbjct: 296 YMYFVKIVPTSYLPLGWEKRFKNAAGSTYGNADGSLETHQYSVTSHKRSLEGGNDAAEGH 355
Query: 321 ---------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
+PGVFF YD+SP+KV EE +F FL +CAIVGG TV+ +D ++
Sbjct: 356 AERQHSQGGIPGVFFSYDISPMKVINREEPAKTFTGFLAGLCAIVGGTLTVAAAVDRGLF 415
Query: 371 HGQRAIKK 378
G +KK
Sbjct: 416 EGAARLKK 423
>gi|393233667|gb|EJD41236.1| endoplasmic reticulum-derived transport vesicle ERV46 [Auricularia
delicata TFB-10046 SS5]
Length = 419
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 147/410 (35%), Positives = 226/410 (55%), Gaps = 41/410 (10%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I + ++ +DA+ K +D +T +G ++TL+S ++ E Y +T ++VD
Sbjct: 4 GIFSTLKGVDAFGKTMDDVKVKTRTGALLTLISIAIIFTFTTIEFVDYRRINHDTSMVVD 63
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
SRGE L +N +VTFP +PC +LS+D MDISGE+ DV H+I K R+D+ +RQ
Sbjct: 64 KSRGEKLTVNLNVTFPKIPCYLLSLDVMDISGERQADVTHNILKTRIDA------NRQR- 116
Query: 123 IGAPKIDKPLQRHGGRL--EHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
I LQ ++ YCGSCYG + CC CE VR+AY +GWA S+P
Sbjct: 117 IADQTTTYDLQNEAEKVVAARGANYCGSCYGGLEPEGGCCQTCEAVRQAYINRGWAFSDP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QCK+EG+ ++I+ + EGCN+ G + VNKV G+ F+ G+SF + + +HD++ +
Sbjct: 177 DAIEQCKQEGWKEKIQAQMNEGCNVEGRVRVNKVVGSIQFSFGRSFQMNQMSLHDLVPYL 236
Query: 241 R-------------------DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQ 281
R D FNI + + NPLDG E+ M+Q
Sbjct: 237 RDENVHDWRHRVQHFYFSSDDEFNIYKAGISSSMKQRLGIAANPLDGNYGHTESTEYMFQ 296
Query: 282 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG-------------RLQTLPGVFFFY 328
YF+KVV T + + G I ++Q+S T R +G +Q LPGVFF +
Sbjct: 297 YFLKVVSTQFRTIGGEVINTHQYSATHFDRDLAEGVRGKTEDGVVVTHGVQGLPGVFFNF 356
Query: 329 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
++SP+++ +E SF HF+T+ CAIVGGV T++ I+D+ ++ Q+A+KK
Sbjct: 357 EISPMRIIHSETRQSFAHFITSTCAIVGGVLTIASIVDSLLFTTQQALKK 406
>gi|403417426|emb|CCM04126.1| predicted protein [Fibroporia radiculosa]
Length = 419
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 142/401 (35%), Positives = 226/401 (56%), Gaps = 36/401 (8%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R +DA+ K +D +T +G +T++S+ ++L E Y +T ++VD SRGE
Sbjct: 10 LRGVDAFGKTTDDVKVKTRTGAFLTILSAAIILAFTMMEFLDYRQVKIDTSVVVDKSRGE 69
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L + +VTFP +PC +LS+D MDISGE D+ H+I K RL+ +G ++S
Sbjct: 70 KLNVRMNVTFPRVPCYLLSLDVMDISGESQADITHNILKTRLNEKGIPLQSLAKSAELRN 129
Query: 128 -IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+DK ++ G + YCGSCYG ++ CCN C++VR+AY +GW+ + PD I+QC
Sbjct: 130 DLDKINEQRG------DNYCGSCYGGQAPPGGCCNTCDQVRQAYIDRGWSFTRPDSIEQC 183
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN- 245
EG+ +++KE+ EGCNI G + VNKV GN +PG+SF + +++D++ + ++ N
Sbjct: 184 TNEGWSEKLKEQASEGCNIAGKVRVNKVIGNIQLSPGRSFRTAAQNMYDLVPYLKEDKNR 243
Query: 246 --ISHKINKLAFG-------------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTV 290
SH I++ AF + G+ +PLD M+QYF+KVV T
Sbjct: 244 HDFSHTIHQFAFESDQEKERHRARDFQKRVGIESPLDNTERKTSKQQYMFQYFLKVVSTH 303
Query: 291 YTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-------------LPGVFFFYDLSPIKVTF 337
+ + +++Q+S T R +G+ + +PGVF YD+SP+ +
Sbjct: 304 FAMLDNKVYKTHQYSATHFERDLTKGQQEDNKEGVHIAHTATGIPGVFINYDISPMLILH 363
Query: 338 TEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
+E SF HFLT+ CAIVGGV TV+ +ID+ ++ RA+KK
Sbjct: 364 SETRQSFAHFLTSTCAIVGGVLTVASLIDSVLFATTRALKK 404
>gi|388581981|gb|EIM22287.1| endoplasmic reticulum-derived transport vesicle ERV46 [Wallemia
sebi CBS 633.66]
Length = 407
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 148/398 (37%), Positives = 231/398 (58%), Gaps = 38/398 (9%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLR 70
DA+ K E+ RT G +T++ +I++ L F+E R Y + +++VD SR E L+
Sbjct: 9 FDAFAKTLEESRIRTNFGAYLTIICAILISFLTFNEFRDYRAVDFKPRIIVDQSRSEKLQ 68
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK 130
+NF+VTFP +PC +LS+D MD+SGEQ D++H I + RL +G I DG+ +
Sbjct: 69 LNFNVTFPRVPCYLLSLDLMDVSGEQVRDLRHAIVRTRLSEKGETI----DGMKTAGMSG 124
Query: 131 PLQRHGGRLEHNETYCGSCYGA-ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKRE 189
L E CGSCYG ++E CC C++VRE+Y K+GW+ NPD + QC E
Sbjct: 125 YLNEVAKPRE-----CGSCYGGVPPNEEKCCYTCDDVRESYVKQGWSFVNPDGVKQCLDE 179
Query: 190 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---I 246
+ +R+KE+ EGCN+ G ++VNKV GNFH +PG+SF + H+HD++ + +++ N
Sbjct: 180 HWAERVKEQSSEGCNVAGLVDVNKVVGNFHISPGRSFQSNAHHIHDLVPYLKNANNHHDF 239
Query: 247 SHKINKLAFG-----------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
H ++ +F + + +PL + E + M+QYF+KVV T + ++
Sbjct: 240 GHILHHFSFKSSNEPADTDNLKEMLNINDPLSNTKAHTEVSNYMFQYFLKVVSTDFDFLN 299
Query: 296 GHTIQSNQFSVTEHFRS-------SEQGRLQTL-------PGVFFFYDLSPIKVTFTEEH 341
G + S+Q+S T + R+ ++ G QT+ PGVFF YD+SP++V +TE
Sbjct: 300 GEKLNSHQYSATAYERNLDEKGIYAQDGHGQTILHGVEGFPGVFFNYDISPLRVIYTESR 359
Query: 342 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
SF FLT+ CAIVGGV TV+ IIDA ++ ++ + K
Sbjct: 360 RSFASFLTSTCAIVGGVLTVASIIDAGVFGARQKLTGK 397
>gi|51214107|emb|CAH17876.1| hypothetical protein (22C8.0001), conserved [Pneumocystis carinii]
Length = 388
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 159/394 (40%), Positives = 220/394 (55%), Gaps = 31/394 (7%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
R DA+ K E+ +T +GG IT++S IV+ +L + E R Y V +L +D SRGE
Sbjct: 8 RRFDAFSKTIENAQIKTINGGFITILSIIVIFVLIYFEWRDYRQIVILPELTIDRSRGEK 67
Query: 69 LRINFDVTFPALPCS---ILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
L+IN ++TFP +PCS +LS+D MD+SGE DV H++ K RLDS G I S +
Sbjct: 68 LQINLNLTFPKIPCSRLLVLSLDVMDVSGELETDVSHNVVKNRLDSNGIFINST--SLNT 125
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
+P + YCGSCYGA+ E CCN C++V +AY W + + +Q
Sbjct: 126 LNFQQPAKTRP------PDYCGSCYGAK---EGCCNTCQQVIDAYASNNWPVPDTKAFEQ 176
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-- 243
CK + E EGCN G +EVNKV GNFHFAPG S H+HDI + DS
Sbjct: 177 CKEK---YNNLNEFDEGCNFVGRIEVNKVVGNFHFAPGHSSQIMRNHIHDIYDYMTDSSP 233
Query: 244 FNISHKINKLAFGEHFPG--VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
+ SH INKL+FG G + NPLD V+ + P+ Y YFIK V + +S ++ +
Sbjct: 234 HDFSHTINKLSFGPEVEGRSLQNPLDNVKKETDNPTLRYSYFIKCVAYRFEYLSKPSLDT 293
Query: 302 NQFSVTEHFRS----------SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
N++SVT H RS + +PGVFF YD+SPIK+ E +F FLT+
Sbjct: 294 NKYSVTVHERSISGDSDPNYPTHISPKDGIPGVFFSYDISPIKIIERETRGNFSTFLTST 353
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
I+ GV T++GI+D +Y +R I+KK+ GKF
Sbjct: 354 VIIISGVLTIAGIVDRILYETERQIEKKLREGKF 387
>gi|336369994|gb|EGN98335.1| hypothetical protein SERLA73DRAFT_109778 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382751|gb|EGO23901.1| hypothetical protein SERLADRAFT_450196 [Serpula lacrymans var.
lacrymans S7.9]
Length = 988
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 147/395 (37%), Positives = 222/395 (56%), Gaps = 42/395 (10%)
Query: 19 EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFP 78
ED +T +G +T++S+ ++L E Y +T ++VD SRGE L + ++TFP
Sbjct: 591 EDVKVKTRTGAFLTILSAAIILAFTAMEFFDYRTVNVDTSIIVDRSRGEKLSVRMNMTFP 650
Query: 79 ALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-PLQRHGG 137
+PC +LS+D MDISGEQ DV H+I K R+ +G + ++G +IDK QR G
Sbjct: 651 RVPCYLLSLDIMDISGEQQRDVSHNIHKTRITPEGGPVPGARNGELRNEIDKLNDQRSNG 710
Query: 138 RLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 197
YCGSCYG + CCN+CE+VR+AY +GW+ +NPD I+QC EG+ +++K+
Sbjct: 711 -------YCGSCYGGVEPEGGCCNSCEDVRQAYVNRGWSFNNPDNIEQCVAEGWSEKLKD 763
Query: 198 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLA 254
+ EGCNI G L VNKV GN + +PG+SF S + ++++ + R+ N SH I++ +
Sbjct: 764 QAEEGCNISGRLRVNKVIGNINVSPGRSFQSSSRNFYELVPYLREDNNRHDFSHVIHEFS 823
Query: 255 F-----------------GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
F + NPLDG+ M+QYF+KVV T + + G
Sbjct: 824 FMTDDEYNLHKAKLGKDMKQRMGIAENPLDGLNAKTNKAQYMFQYFLKVVSTQFRTIDGK 883
Query: 298 TIQSNQFSVTEHFRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVS 343
TI ++Q+S T R +G + +PG FF +++SPI V +E S
Sbjct: 884 TINTHQYSATHFERDLSKGSQGGDNGEGVVTQHGVSGVPGAFFNFEISPILVVHSEGRQS 943
Query: 344 FLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
F HFLT+ CAIVGGV TV+ ++D+F++ R +KK
Sbjct: 944 FAHFLTSTCAIVGGVLTVAALLDSFLFATGRRLKK 978
>gi|348667280|gb|EGZ07106.1| hypothetical protein PHYSODRAFT_319656 [Phytophthora sojae]
Length = 398
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 152/376 (40%), Positives = 210/376 (55%), Gaps = 8/376 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+++++ LDAYPK E+F RT GG+ +L++ + LL SEL YL T K+ VD
Sbjct: 9 LSRLKGLDAYPKTIEEFKVRTLQGGLFSLLAFACISLLLVSELSFYLATDTVDKMTVDGG 68
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGI 123
R + INFDV FP + CS++++++ D++G D++H+I K LD G + E D I
Sbjct: 69 RNTMVAINFDVEFPRMACSVVALESADMAGNVQHDIEHNIRKIPLDHTGQALAEGMHDVI 128
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G + + HG E ++ CGSCY A E CC+ CE V+ AY +K W + + I
Sbjct: 129 GG-ALTNNTELHG---ETDKPACGSCYSAGEPGE-CCDTCESVKAAYARKSWMMPSLHTI 183
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
QC+ + ++ E EGC I G L V+KVAG +FAP K F + D++
Sbjct: 184 AQCQEVEIEKVLRGEVNEGCRIQGSLVVSKVAGKLYFAPSKFFRSGYLSSKDLVDATFKV 243
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVR--WTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
F+ SH I L+FGE +P + NPLD + E G +QYF+KVVPT YT +S I +
Sbjct: 244 FDTSHTIRSLSFGEAYPDMKNPLDNRKKELPDEKTRGSFQYFLKVVPTEYTFLSASRIIT 303
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQFS TEHFR + LP V F Y SPI + V FL FLT+VCAIVGGVFT
Sbjct: 304 NQFSATEHFRQLTPVSDKGLPMVTFSYTFSPIMFRIEQYRVGFLQFLTSVCAIVGGVFTR 363
Query: 362 SGIIDAFIYHGQRAIK 377
+ D +Y GQ K
Sbjct: 364 TATADESVYRGQVGAK 379
>gi|358334909|dbj|GAA53334.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Clonorchis sinensis]
Length = 323
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 133/304 (43%), Positives = 188/304 (61%), Gaps = 13/304 (4%)
Query: 83 SILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN 142
S+L++D MD +GEQ +DV I+K R+DS G+ I + + G P G + +
Sbjct: 21 SVLNLDTMDSTGEQKIDVSQQIYKTRIDSTGSPISATRRDDGNPS-------KGQVVTKD 73
Query: 143 ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 202
YCGSCYGAES CCN C+E++ AY+++ W + N + +QC+ E + + EG
Sbjct: 74 PDYCGSCYGAESETRKCCNTCKEIQLAYQERHWVVKNLSVFEQCREEQWDDTLANLGSEG 133
Query: 203 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV 262
C I G L+VNKVAG+FH PG S+ VHVH++ F N+SHKI+KLAFG +PG
Sbjct: 134 CRIQGSLQVNKVAGSFHITPGNSYASDQVHVHNLQGFDGQKLNMSHKIDKLAFGNMYPGQ 193
Query: 263 VNPLDGVRWTQETPSGMYQYFIKVVPTVY-----TDVSGHTIQSNQFSVTEHFRSSEQGR 317
NPLDG P+ M Y++K+VPT+Y T S T+ +NQ+SVT H + S
Sbjct: 194 TNPLDGTTMNVVEPAQMVTYYMKLVPTMYVSYNTTTRSLSTVHTNQYSVTWHSKGSPLTS 253
Query: 318 LQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+ +PG+FF Y+LSP+ V + EH SFLHFLTN CAI+GGVFTV+ ++DAFIY +
Sbjct: 254 DSSGIPGLFFNYELSPLLVKISYEHKSFLHFLTNTCAIIGGVFTVASLLDAFIYQSTCVV 313
Query: 377 KKKI 380
+K++
Sbjct: 314 RKRL 317
>gi|340520521|gb|EGR50757.1| predicted protein [Trichoderma reesei QM6a]
Length = 430
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 161/431 (37%), Positives = 230/431 (53%), Gaps = 69/431 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG++T+VS +V++ L + E Y V +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVVFLAWGEWTDYRRIVVHPELVVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP +PC +L++D MD+SGEQ V H I K RL G IES
Sbjct: 65 GERMDIHLNMTFPNMPCELLTLDVMDVSGEQQHGVAHGITKIRLQPAALGGGEIES---- 120
Query: 123 IGAPKIDKPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWAL 177
K L + + EH + YCG CYGA + CCN C+EVREAY WA
Sbjct: 121 -------KSLSQLHEKAEHLDPNYCGGCYGAIAPSTAQKPGCCNTCDEVREAYALASWAF 173
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ ++QC+RE + +R+ ++ EGC I G L+VNKV GNFH APG+SF +HVHD+
Sbjct: 174 GRGEGVEQCEREHYAERLDQQREEGCRIEGLLQVNKVIGNFHLAPGRSFSNGNMHVHDLK 233
Query: 238 AF----QRDSFNISHKINKLAFGEHFPGVV---------------NPLDGVRWTQETPSG 278
+ + S + +H I+ L FG P V NPLD R + P+
Sbjct: 234 NYWDLPEGKSHDFTHIIHSLRFGPQLPDTVIERLGGKNTWSNHHLNPLDNTRQDTKDPNF 293
Query: 279 MYQYFIKVVPTVY------------------TDVSGHTIQSNQFSVTEHFRS------SE 314
Y YF+K+VPT Y T S +I+++Q+SVT H RS ++
Sbjct: 294 NYMYFVKIVPTSYLPLGWEKRKPSTTNGGVTTFYSDGSIETHQYSVTSHKRSLMGGDDAK 353
Query: 315 QGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDA 367
+G + L PGVFF YD+SP+KV EE +FL FL+ +CAIVGG TV+ +D
Sbjct: 354 EGHPERLHARNGIPGVFFSYDISPMKVINREERAKTFLGFLSGLCAIVGGTLTVAAAVDR 413
Query: 368 FIYHGQRAIKK 378
++ G +KK
Sbjct: 414 GLFEGATRLKK 424
>gi|119496763|ref|XP_001265155.1| COPII-coated vesicle membrane protein Erv46, putative [Neosartorya
fischeri NRRL 181]
gi|119413317|gb|EAW23258.1| COPII-coated vesicle membrane protein Erv46, putative [Neosartorya
fischeri NRRL 181]
Length = 438
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 169/439 (38%), Positives = 226/439 (51%), Gaps = 75/439 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG+ITL S +V+L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARIRTTSGGIITLASLVVILYLVWGEWLDYRRVVVLPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-D 121
GE + I+ ++TFP LPC +L++D MD+SGEQ + V H + K RL S G V++ + D
Sbjct: 65 GERMEIHMNITFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLSSPAEGGRVLDVQALD 124
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAE----SSDEDCCNNCEEVREAYRKKGWAL 177
+I K L + YCG C GA+ S E CCN C+EVREAY K WA
Sbjct: 125 LHSKEEIAKHL---------DPNYCGDCGGADPLPGSMKEGCCNTCDEVREAYAAKNWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
I+QC+REG+ RI + EGC + G L VNKV GNFH APG+SF VH HD+
Sbjct: 176 GKGSNIEQCEREGYAARIDAQRREGCRLEGILRVNKVVGNFHIAPGRSFTSGQVHAHDLQ 235
Query: 238 AF------QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGM 279
+ + ++H I++L FG P V NPLD P+
Sbjct: 236 NYLDLELPDNEKHTMTHHIHQLRFGPQLPDEVSDRWQWTDHHHTNPLDSTSQETNDPAYN 295
Query: 280 YQYFIKVVPTVYTDV---------------------------SGHTIQSNQFSVTEHFRS 312
+ YF+KVV T Y + SG +I+++Q+SVT H RS
Sbjct: 296 FVYFVKVVSTSYLPLGWDPLFSSAAHNAHDQTPLGSHGIAYGSGGSIETHQYSVTSHKRS 355
Query: 313 ------SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 359
S++G + L PGVFF YD+SP+KV E SF FLT VCAI+GG
Sbjct: 356 LRGGDASDEGHKERLHAANGIPGVFFNYDISPMKVINREARPKSFSGFLTGVCAIIGGTL 415
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ ID +Y G +KK
Sbjct: 416 TVAAAIDRGLYEGALRVKK 434
>gi|320592791|gb|EFX05200.1| copii-coated vesicle membrane protein [Grosmannia clavigera kw1407]
Length = 440
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 162/442 (36%), Positives = 226/442 (51%), Gaps = 75/442 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGGV+T+VS IV+L L + E Y + +L+VD
Sbjct: 2 AAKSRFTRLDAFTKTVEDARIRTTSGGVVTIVSLIVVLYLAWGEWLDYRRVIIRPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RGE + I+ ++TFP +PC +L++D MD+SGEQ V+H + RL+ Q
Sbjct: 62 KGRGERMEIHLNITFPRIPCELLTLDVMDVSGEQQHGVQHGVRMVRLEPQSR-------- 113
Query: 123 IGAPKID-KPLQRHGGRLEH-NETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWA 176
G +I+ K L H H + YCG CYGA CCN C+EVREAY WA
Sbjct: 114 -GGSEIEVKTLDLHADAASHLDPEYCGPCYGATPPQHAIKTGCCNTCDEVREAYASSSWA 172
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+ ++QC+RE + +RI E+ EGC I G L VNKV GNFH APG+SF +HVHD+
Sbjct: 173 FGKGENVEQCQREHYAERIDEQRHEGCRIEGGLRVNKVVGNFHIAPGRSFSNGNMHVHDL 232
Query: 237 LAF----QRDSFNISHKINKLAFGEHFPGV-------------------VNPLDGVRWTQ 273
+ + + +H ++ L FG P +NPLDGV
Sbjct: 233 KNYWDMPTPNLHSFTHTVHSLRFGPQLPESLQKTLAGGGAKGQPWTNHHINPLDGVMQQT 292
Query: 274 ETPSGMYQYFIKVVPTVY------------------TDVSGH------TIQSNQFSVTEH 309
P+ Y YFIK+VPT Y DV + +++++Q+SVT H
Sbjct: 293 SDPNFNYMYFIKIVPTSYLALGWEKTFRGFVDDHDSADVGSYGLLADGSVETHQYSVTSH 352
Query: 310 FRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 356
RS + G RL +PGVFF YD+SP+KV EE +F FL +CAI+G
Sbjct: 353 KRSLQGGDDAAEGHQERLHARGGIPGVFFSYDISPMKVVNREERAKTFAGFLAGLCAIIG 412
Query: 357 GVFTVSGIIDAFIYHGQRAIKK 378
G TV+ +D ++ G +KK
Sbjct: 413 GTLTVAAAVDRTVFEGTIRLKK 434
>gi|255941116|ref|XP_002561327.1| Pc16g10170 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211585950|emb|CAP93687.1| Pc16g10170 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 412
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 159/413 (38%), Positives = 225/413 (54%), Gaps = 49/413 (11%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGVIT+ S ++++ L + E Y V + +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARIRTNSGGVITIASLLIVMWLVWGEWADYRRIVVQPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-D 121
GE + I+ ++TFP LPC +L++D MD+SGEQ + V H + K RL G VI+ + D
Sbjct: 65 GERMEIHLNMTFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLSPHNEGGKVIDVQALD 124
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESS----DEDCCNNCEEVREAYRKKGWAL 177
+ + K L YCG C GA CC CEEVREAY +K WA
Sbjct: 125 LHSSSEAAKHLA---------PDYCGECGGATPPANVIKPGCCTTCEEVREAYAEKQWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ I+QCKREG+ +++ E+ EGC I G L+VNKV GNFH APG+SF +HVHD+
Sbjct: 176 GDGSNIEQCKREGYAEKLAEQRREGCRIEGVLKVNKVVGNFHIAPGRSFTTGNMHVHDLD 235
Query: 238 AF------QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGM 279
A+ + +SH +++L FG P + NPLD + + P+
Sbjct: 236 AYVVPNAGPAEQHTMSHLVHELRFGPQLPTELAGRWGWTDHHHTNPLDDTKQETDEPAYN 295
Query: 280 YQYFIKVVPTVYTDVSGHT-IQSNQFSVTEHFRSSEQG---------RLQT---LPGVFF 326
+ YF+KVV T Y + I+++Q+SVT H R G R+ +PGVFF
Sbjct: 296 FMYFVKVVSTSYLPLGWDPHIEAHQYSVTSHKRPLSGGNDAAEGHKERVHAGGGIPGVFF 355
Query: 327 FYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
YD+SP+KV E +F +FLT VCAI+GG TV+ +D +Y G +KK
Sbjct: 356 NYDISPMKVINREARPKTFTNFLTGVCAIIGGTLTVAAALDRGLYEGAMRVKK 408
>gi|70990824|ref|XP_750261.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus fumigatus
Af293]
gi|66847893|gb|EAL88223.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
fumigatus Af293]
gi|159130735|gb|EDP55848.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
fumigatus A1163]
Length = 438
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 169/439 (38%), Positives = 226/439 (51%), Gaps = 75/439 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG+ITL S +V+L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARIRTTSGGIITLASLVVILYLVWGEWLDYRRVVVLPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-D 121
GE + I+ ++TFP LPC +L++D MD+SGEQ + V H + K RL S G V++ + D
Sbjct: 65 GERMEIHMNITFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLSSPAEGGRVLDVQALD 124
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAE----SSDEDCCNNCEEVREAYRKKGWAL 177
+I K L + YCG C GA+ S E CCN C+EVREAY K WA
Sbjct: 125 LHSKEEIAKHL---------DPNYCGDCGGADPLPGSIKEGCCNTCDEVREAYAAKNWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
I+QC+REG+ RI + EGC + G L VNKV GNFH APG+SF VH HD+
Sbjct: 176 GKGTNIEQCEREGYAARIDAQRREGCRLEGILRVNKVVGNFHIAPGRSFTSGQVHAHDLQ 235
Query: 238 AF------QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGM 279
+ + ++H I++L FG P V NPLD P+
Sbjct: 236 NYLDSELPDNEKHTMTHHIHQLRFGPQLPDEVSDRWQWTDHHHTNPLDSTSQETNDPAYN 295
Query: 280 YQYFIKVVPTVYTDV---------------------------SGHTIQSNQFSVTEHFRS 312
+ YF+KVV T Y + SG +I+++Q+SVT H RS
Sbjct: 296 FVYFVKVVSTSYLPLGWDPLFSSAAHNAHDQTPLGSHGIAYGSGGSIETHQYSVTSHKRS 355
Query: 313 ------SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 359
S++G + L PGVFF YD+SP+KV E SF FLT VCAI+GG
Sbjct: 356 LRGGDASDEGHKERLHAANGIPGVFFNYDISPMKVINREARPKSFSGFLTGVCAIIGGTL 415
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ ID +Y G +KK
Sbjct: 416 TVAAAIDRGLYEGALRVKK 434
>gi|358378080|gb|EHK15763.1| hypothetical protein TRIVIDRAFT_86970 [Trichoderma virens Gv29-8]
Length = 420
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 155/420 (36%), Positives = 223/420 (53%), Gaps = 51/420 (12%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ++ RT SGG++T+VS +V+ L + E Y V +L+VD
Sbjct: 2 APKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVFFLSWGEWTDYRRIVVHPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RGE + I+ ++TFP +PC +L++D MD+SGEQ V H I K RL G
Sbjct: 62 KGRGERMDIHLNMTFPNMPCELLTLDVMDVSGEQQHGVAHGISKIRLRPAAQ-------G 114
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALS 178
G + + Q H YCG CYGA + CCN C+EVREAY + WA
Sbjct: 115 GGEIESNTLTQLHEKAEHLAPDYCGGCYGATAPANAEKPGCCNTCDEVREAYAQMSWAFG 174
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ ++QC+RE + +R+ ++ EGC I G L+VNKV GNFH APG+SF +HVHD+
Sbjct: 175 RGEGVEQCEREHYAERLDQQREEGCRIEGLLQVNKVVGNFHLAPGRSFSNGNMHVHDLKT 234
Query: 239 F----QRDSFNISHKINKLAFGEHFPGVV---------------NPLDGVRWTQETPSGM 279
+ + + +H I+ L FG P V NPLD + P+
Sbjct: 235 YWDFPEGKPHDFTHIIHSLRFGPQLPDTVIERMGGKNTWTNHHLNPLDATHQETKDPNFN 294
Query: 280 YQYFIKVVPTVYTDVSGH--------TIQSNQFSVTEHFRS------SEQGRLQTL---- 321
Y YF+K+VPT Y + +I+++Q+SVT H RS S++G + L
Sbjct: 295 YMYFVKIVPTSYLPLGWEKRTPGYDGSIETHQYSVTSHKRSLMGGDDSQEGHPERLHARN 354
Query: 322 --PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
PGVFF YD+SP+KV EE +FL FL+ +CAIVGG TV+ +D ++ G +KK
Sbjct: 355 GIPGVFFSYDISPMKVINREERAKTFLGFLSGLCAIVGGTLTVAAAVDRGLFEGASRLKK 414
>gi|407418919|gb|EKF38246.1| hypothetical protein MOQ_001547 [Trypanosoma cruzi marinkellei]
Length = 406
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 154/411 (37%), Positives = 227/411 (55%), Gaps = 34/411 (8%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + LD +PK + +D RT GGV +L+S ++ +L E+R + + V + ++
Sbjct: 1 MRWLGQLDVFPKFDTKFEQDARQRTAVGGVFSLLSLFIIAVLVIGEVRYFFSTVEQHEMY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG--NVIES 118
VD G T+ I ++TFP +PC +++ DA+D G V+ D K R+ + + E+
Sbjct: 61 VDPDLGGTMEITVNITFPHVPCDLITADAIDAFGTFAEGVERDTVKSRVAASTLEKISEA 120
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
R KI K L +G E+ C SCYGAE CC+ C++VR AY + W +
Sbjct: 121 RPLVDEKKKITKALDPNGAEKEN----CPSCYGAEPEPGACCHTCDDVRRAYSLRRWVFN 176
Query: 179 NPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
D+ ++QC E + EGCN++ +V +V GN HF PG+ F+ G H+HD
Sbjct: 177 EDDISVEQCAGERLRKAAILISQEGCNLFVKYKVARVTGNIHFVPGRMFNLMGQHLHDFR 236
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDG-------VRWTQETPSGMYQYFIKVVPTV 290
N+SH ++ L FGE FPG VNP+DG V T+E +G + YF+KVVPT
Sbjct: 237 GKTVRQLNLSHIVHTLCFGERFPGQVNPMDGLVNSRGAVDATEEV-NGRFSYFVKVVPTQ 295
Query: 291 YTDVS----GHTIQSNQFSVTEHFRSSEQGRLQT---------LPGVFFFYDLSPIKVTF 337
Y S G ++SNQ+SVT HF +S L T +PGVF YDLSPIKV
Sbjct: 296 YQAASILGVGSVVESNQYSVTHHFTASPSAELSTTTPESTPVIVPGVFITYDLSPIKVFV 355
Query: 338 TEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
E+H S LH + +CA+ GGVFTV+G++D+ I+HG R +++K++ GK S
Sbjct: 356 MEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRRVQRKMQQGKQS 406
>gi|193627365|ref|XP_001948436.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Acyrthosiphon pisum]
Length = 404
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 151/393 (38%), Positives = 215/393 (54%), Gaps = 23/393 (5%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
MN ++ DA+ K E+ +T GG+++LV + ++ L S L YL+ +L VDTS
Sbjct: 10 MNTLKQFDAFAKPLEEVQIKTVWGGIVSLVCFLTIVFLMVSNLVEYLDNTPTEELFVDTS 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDG 122
R + L+INFD+ P + C L +DA+D SGE HL V H+I+K+RL+ +G I + D
Sbjct: 70 RNKKLQINFDIVVPKISCDFLVLDAVDNSGETHLQVDHNIYKRRLNLEGQPISDPEKSDD 129
Query: 123 IGAPKIDKPLQRHGGRLEHNET--------YCGSCYGAESSDEDCCNNCEEVREAYRKKG 174
+G+ K P L+ NET CGSCYGAESS CCN C++V+ AY+ K
Sbjct: 130 VGSKKTLNP----PSMLKSNETDDANNTEDICGSCYGAESSTIPCCNTCDDVKRAYKMKN 185
Query: 175 WALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
W P I+QCK + + ++ EGC +YG L VN+V+G+FH APG SF + +HV
Sbjct: 186 WDF-RPSSIEQCKNQSSQNEMYDKAFKEGCQLYGTLLVNRVSGSFHIAPGMSFSFNHMHV 244
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVV-----NPLDGVRWTQETPSGMYQYFIKVVP 288
HD+ F SFN +H I L+FG+ + NPLD + M+QY+IK+VP
Sbjct: 245 HDVHPFSSSSFNTTHTIRHLSFGQKLESINTSHGGNPLDSTESIAGEGATMFQYYIKIVP 304
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
T+Y +NQFSVT+H + PG+FF Y+ SPI + TE+ H
Sbjct: 305 TLYQRRDLSIFSTNQFSVTKHKVQAFDKGPSGAPGIFFSYEFSPIMIKLTEKPRLLGHLF 364
Query: 349 TNVCAIVGGVFTVSGIIDAFIYHGQRA--IKKK 379
T + GVF IID F+Y + I+KK
Sbjct: 365 TQFLCNISGVFICFWIIDIFMYKVSKVYNIRKK 397
>gi|340923948|gb|EGS18851.1| hypothetical protein CTHT_0054620 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 436
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 165/438 (37%), Positives = 229/438 (52%), Gaps = 71/438 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGG++T+VS IV+L L +SE R Y V +L+VD
Sbjct: 2 AGKSRFTRLDAFTKTVEDARIRTTSGGIVTIVSLIVVLFLSWSEWREYRRIVVHPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL---DSQGNVIESR 119
RGE + I+ ++TFP +PC +L++D MD+SGEQ V+H + K RL + G I+ +
Sbjct: 62 KGRGERMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVTKTRLRPWEEGGGDIDKK 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAE----SSDEDCCNNCEEVREAYRKKGW 175
+ + + ++ L+ N YCGSCYGA + CC C+EVREAY + W
Sbjct: 122 ELALHS------IEESATHLDPN--YCGSCYGANPPPNAVKPGCCQTCDEVREAYAQAAW 173
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
A + I+QC+RE + +R+ ++ EGC I G L VNKV GNFH APGKSF +HVHD
Sbjct: 174 AFGRGENIEQCQREHYAERLDQQRREGCRIEGGLRVNKVVGNFHIAPGKSFSNGNMHVHD 233
Query: 236 ILAFQRDSF--NISHKINKLAFGEHFPGV----------------VNPLDGVRWTQETPS 277
+ + +H I+ L FG P VNPLD + +
Sbjct: 234 LKNYWESPVRHTFTHIIHHLRFGPQLPESLHQKLGNKALPWSNHHVNPLDNTHQETDEVN 293
Query: 278 GMYQYFIKVVPTVY------------------------TDVSGHTIQSNQFSVTEHFRSS 313
Y YFIK+VPT Y T G +++++Q+SVT H RS
Sbjct: 294 FSYMYFIKIVPTSYLPLGWEKTWDQFREQHHAELGSFGTSADG-SVETHQYSVTSHRRSL 352
Query: 314 EQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFT 360
G RL + +PGVFF YD+SP+KV EE SFL FL +CAIVGG T
Sbjct: 353 SGGDDAAEGHSERLHSKGGIPGVFFSYDISPMKVINREERAKSFLGFLAGLCAIVGGTLT 412
Query: 361 VSGIIDAFIYHGQRAIKK 378
V+ ID ++ G +KK
Sbjct: 413 VAAAIDRALFEGTVRLKK 430
>gi|212540034|ref|XP_002150172.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
marneffei ATCC 18224]
gi|210067471|gb|EEA21563.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
marneffei ATCC 18224]
Length = 440
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 164/439 (37%), Positives = 224/439 (51%), Gaps = 75/439 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++TLVS +V+L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARVRTTSGGIVTLVSLVVILWLVWGEWADYRRVVVLPELIVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP LPC +L++D MD+SGEQ + V H + K RL S + G
Sbjct: 65 GERMEIHLNMTFPRLPCELLTLDVMDVSGEQQMGVVHGLNKVRLSSVAD---------GG 115
Query: 126 PKID-KPLQRHGGR---LEHNETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWAL 177
ID L+ H + + YCG C GA + CCN CEEVREAY K WA
Sbjct: 116 RVIDVSKLELHSQNEVAIHLDPEYCGECGGASPPENAKKPGCCNTCEEVREAYALKSWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ I+QC+REG+ RI + EGC I G + VNKV GNFH APG+SF +HVHD+
Sbjct: 176 GKGENIEQCQREGYADRIDAQRREGCRIEGDIRVNKVIGNFHIAPGRSFSSGNMHVHDLD 235
Query: 238 AF------QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGM 279
+ + +SH I++L FG V NPLD + P+
Sbjct: 236 TYLDRELADYEKHTMSHIIHQLRFGPQLSDEVSQRWQWTDHHHTNPLDSTQQLTNEPAYN 295
Query: 280 YQYFIKVVPTVYTDV---------------------------SGHTIQSNQFSVTEHFRS 312
Y Y+IKVV T Y + + +I+++Q+SVT H RS
Sbjct: 296 YNYYIKVVSTSYLPLGWDSARSDQLHGDDQFTPLGLHGAAHGTAGSIETHQYSVTSHKRS 355
Query: 313 ---------SEQGRLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 359
Q R+ +PGVFF YD+SP+KV E +F FLT VCA++GG
Sbjct: 356 LHGGNDAAEGHQERIHAEGGIPGVFFNYDISPMKVVNREARAKTFTGFLTGVCAVIGGTL 415
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ +D F+Y G R I+K
Sbjct: 416 TVAAAVDRFLYEGSRRIRK 434
>gi|317025332|ref|XP_001388859.2| COPII-coated vesicle membrane protein Erv46 [Aspergillus niger CBS
513.88]
gi|350638031|gb|EHA26387.1| hypothetical protein ASPNIDRAFT_196625 [Aspergillus niger ATCC
1015]
Length = 438
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 167/439 (38%), Positives = 227/439 (51%), Gaps = 75/439 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGVIT+ S +V+L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARVRTTSGGVITIASLLVILWLVWGEWADYRRVVVMPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ +VTFP LPC +L++D MD+SGEQ V H I K RL S G
Sbjct: 65 GEKMEIHLNVTFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRLTSAAE---------GG 115
Query: 126 PKID-KPLQRHGG--RLEH-NETYCGSCYGAES----SDEDCCNNCEEVREAYRKKGWAL 177
ID K L+ H +H + YCG CYGA + S CCN C+EVREAY ++ WA
Sbjct: 116 RVIDVKALELHSKDESAKHLDPDYCGECYGATAPAGASKPGCCNTCDEVREAYAQQQWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ ++QC+ EG+ +RI + EGC + G L VNKV GNFH APG+SF +HVHD+
Sbjct: 176 GKGENVEQCELEGYAERIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVHDLA 235
Query: 238 AF------QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGM 279
F + ++H+I++L FG P + NPLDG + P
Sbjct: 236 NFFDADLPDAEKHTMTHEIHQLRFGPQLPDELSDRWQWTDHHHTNPLDGTKQETNEPGYN 295
Query: 280 YQYFIKVVPTVY-------------------TDVSGH--------TIQSNQFSVTEHFRS 312
Y YF+KVV T Y + H +I+++Q+SVT H RS
Sbjct: 296 YMYFVKVVSTSYLPLGWDPLFSSSIHSAYDQAPLGSHGIAYGAEGSIETHQYSVTSHKRS 355
Query: 313 ------SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 359
S++G + L PGVF YD+SP+KV E +F FLT VCAI+GG
Sbjct: 356 LMGGDASDEGHKERLHAANGIPGVFVNYDISPMKVINREARPKTFTGFLTGVCAIIGGTL 415
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ +D +Y G +KK
Sbjct: 416 TVAAALDRGLYEGVSRMKK 434
>gi|358372047|dbj|GAA88652.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus kawachii
IFO 4308]
Length = 438
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 165/439 (37%), Positives = 227/439 (51%), Gaps = 75/439 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGVIT+ S +V+L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARVRTTSGGVITIASLLVILWLVWGEWVDYRRVVVMPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP LPC +L++D MD+SGEQ V H I K RL S G
Sbjct: 65 GEKMEIHLNITFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRLTSAAE---------GG 115
Query: 126 PKID-KPLQRHGG--RLEH-NETYCGSCYGAES----SDEDCCNNCEEVREAYRKKGWAL 177
ID K L+ H +H + YCG CYGA + S CCN C+EVREAY ++ WA
Sbjct: 116 RVIDVKALELHSKDESAKHLDPDYCGECYGATAPAGASKPGCCNTCDEVREAYAQQQWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ ++QC+ EG+ +RI + EGC + G L VNKV GNFH APG+SF +HVHD+
Sbjct: 176 GKGENVEQCELEGYAERIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVHDLA 235
Query: 238 AF------QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGM 279
F + + ++H+I++L FG P + NPLD + P
Sbjct: 236 TFFDAELPESERHTMTHEIHQLRFGPQLPDELSDRWQWTDHHHTNPLDNTKQETNEPGYN 295
Query: 280 YQYFIKVVPTVY-------------------TDVSGH--------TIQSNQFSVTEHFRS 312
Y YF+KVV T Y + H +I+++Q+SVT H RS
Sbjct: 296 YMYFVKVVSTSYLPLGWDPLFSSSIHSAYDQAPLGSHGIAYGAEGSIETHQYSVTSHKRS 355
Query: 313 ------SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 359
S++G + L PGVF YD+SP+KV E +F FLT VCAI+GG
Sbjct: 356 LMGGDASDEGHKERLHAANGIPGVFVNYDISPMKVINREARPKTFTGFLTGVCAIIGGTL 415
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ +D +Y G +KK
Sbjct: 416 TVAAALDRGLYEGVSRMKK 434
>gi|258565913|ref|XP_002583701.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907402|gb|EEP81803.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 435
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 160/438 (36%), Positives = 227/438 (51%), Gaps = 70/438 (15%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGGV+T+V+ I ++LL + E + Y V ++L+VD
Sbjct: 2 AAKSRFTRLDAFAKTVEDARIRTRSGGVVTIVALIAVILLVWGEWKDYRRVVVLSELIVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESR 119
RGE + I+ ++TFP LPC +L++D MD+SGEQ + H I K RL G+V++++
Sbjct: 62 KGRGERMEIHLNITFPHLPCELLTLDVMDVSGEQQSGLIHGIKKVRLGPASEGGHVLDAQ 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGW 175
+ K D+ + + YCGSCY + + CCN C+EVREAY +GW
Sbjct: 122 T--LDLHKKDEVA------VHLDPEYCGSCYDGVPPPNAQKQGCCNTCDEVREAYASRGW 173
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
A + + QC+REG+ RI + EGC + G L VNKV GNFH APG+SF +H HD
Sbjct: 174 AFGRGEGVAQCEREGYGARIDAQRHEGCRLEGILRVNKVIGNFHIAPGRSFTNGYMHAHD 233
Query: 236 ILAFQRDSF--NISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMYQ 281
+ + ++H I++L FG P + NPLD T E P +
Sbjct: 234 LKIYHETPVKHTMAHIIHQLRFGPQLPDELSQKWKWTDHHHTNPLDSTSQTTEDPKYNFM 293
Query: 282 YFIKVVPTVYTDVSGH----------------------------TIQSNQFSVTEHFRSS 313
YF+KVV T Y + +I+++Q+SVT H RS
Sbjct: 294 YFVKVVSTSYLPLGWDASLSSEVHSRLASDAPLGKQGIQLGRHGSIETHQYSVTSHKRSV 353
Query: 314 EQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFT 360
E G R+ T +PGVFF YD+SP+KV E SF FLT VCA++GG T
Sbjct: 354 EGGDDSAEGHKERIHTAGGIPGVFFNYDISPMKVINREARTKSFSGFLTGVCAVIGGTLT 413
Query: 361 VSGIIDAFIYHGQRAIKK 378
V+ ID +Y G +KK
Sbjct: 414 VAAAIDRMLYEGAVRVKK 431
>gi|194224360|ref|XP_001916465.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Equus caballus]
Length = 342
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 163/388 (42%), Positives = 221/388 (56%), Gaps = 59/388 (15%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S +
Sbjct: 64 RGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNC--EEVREAYRKKGWALS 178
G K+ P R C SCYGAE+ D C + + + KG
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAETEDIKPPYFCLQDHLHSSLAGKGLPWG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
R +EE + H V +HD+ +
Sbjct: 177 ---------------RDQEE--------------------------ALH--AVEIHDLQS 193
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G
Sbjct: 194 FGLDNINMTHYIRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEV 253
Query: 299 IQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+++NQFSVT H + + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+G
Sbjct: 254 LRTNQFSVTRHEKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIG 312
Query: 357 GVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
G+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 313 GMFTVAGLIDSLIYHSARAIQKKIDLGK 340
>gi|336465550|gb|EGO53790.1| hypothetical protein NEUTE1DRAFT_151014 [Neurospora tetrasperma
FGSC 2508]
gi|350295150|gb|EGZ76127.1| DUF1692-domain-containing protein [Neurospora tetrasperma FGSC
2509]
Length = 444
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 162/446 (36%), Positives = 231/446 (51%), Gaps = 79/446 (17%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGG++T+VS +V+L L + E R Y V +L+VD
Sbjct: 2 AGKSRFTKLDAFTKTVEDARIRTTSGGIVTIVSLLVVLFLSWGEWRDYRKVVIHPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RGE + I+ ++TFP +PC +L++D MD+SGEQ V+H + K RL Q
Sbjct: 62 KGRGERMEIHLNITFPKVPCELLTLDVMDVSGEQQHGVQHGVKKIRLRPQSE-------- 113
Query: 123 IGAPKID-KPLQRHGG---RLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKG 174
G +ID K L H + +YCG CYGA + CC+ CEEVREAY +
Sbjct: 114 -GGGEIDAKILSLHAADESATHLDPSYCGPCYGAPAPYNAKKPGCCSTCEEVREAYAQAS 172
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
WA + ++QC+RE + +R+ E+ EGC I G L VNKV GNFH APG+SF +HVH
Sbjct: 173 WAFGDGATMEQCQREHYTERLAEQRHEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVH 232
Query: 235 DILAFQR----DSFNISHKINKLAFGEHFPGV------------------VNPLDGVRWT 272
D+ + + SH I+ L FG P +NPLD +
Sbjct: 233 DLAQWWSTPVPGGHSFSHIIHSLRFGPQLPDDLVRKLGGNGKNTLWTNHHLNPLDNTKQE 292
Query: 273 QETPSGMYQYFIKVVPTVYTDV---------------------------SGHTIQSNQFS 305
+ P+ + YF+K+VPT Y + S +++++Q+S
Sbjct: 293 TDDPNYNFMYFVKIVPTSYLPLGWEKQAAQNKATWEQDHSVGLGAYGYGSDGSMETHQYS 352
Query: 306 VTEHFRS------SEQG---RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVC 352
VT H RS S++G RL + +PGVFF YD+SP+KV EE SFL FL +C
Sbjct: 353 VTSHKRSLTGGDDSKEGHGERLHSRGGIPGVFFSYDISPMKVVNREERAKSFLGFLAGLC 412
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKK 378
A+VGG TV+ +D ++ G +KK
Sbjct: 413 AVVGGTLTVAAAVDRGLFEGTVRLKK 438
>gi|407852879|gb|EKG06122.1| hypothetical protein TCSYLVIO_002790, partial [Trypanosoma cruzi]
Length = 472
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 151/410 (36%), Positives = 225/410 (54%), Gaps = 32/410 (7%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + LD +PK + +D RT GG+ +L+S +++ +L E+R + + V + ++
Sbjct: 67 MRWLGQLDVFPKFDTKFEQDARQRTAVGGIFSLISLLIIAVLVIGEVRYFFSTVEQHEMY 126
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG--NVIES 118
VD G T+ I ++TFP +PC +++ DA+D G V+ D K R+ + + E+
Sbjct: 127 VDPDLGGTMEITVNITFPRVPCDLITADAIDAFGTFAEGVERDTVKSRVAASTLEKISEA 186
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
R KI K L G E+ C SCYGAE CC+ CE+VR AY + W +
Sbjct: 187 RPLVDEKKKITKALDPSGAEKEN----CPSCYGAEPEPGACCHTCEDVRRAYSLRRWVFN 242
Query: 179 NPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
D+ ++QC E + EGCN++ +V +V GN HF PG+ F+ G H+HD
Sbjct: 243 EDDISVEQCAEERLRKAATLSSQEGCNLFVNYKVARVTGNIHFVPGRMFNLMGQHLHDFR 302
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQ------ETPSGMYQYFIKVVPTVY 291
N+SH ++ L FGE FPG VNP+DG+ ++ E +G + YF+KVVPT Y
Sbjct: 303 GKTVRQLNLSHIVHTLGFGERFPGQVNPMDGLVNSRGAVDATEEVNGRFSYFVKVVPTQY 362
Query: 292 TDVS----GHTIQSNQFSVTEHFRSSEQGRLQ---------TLPGVFFFYDLSPIKVTFT 338
S G ++SNQ+SVT HF S L +PGVF YDLSPIKV
Sbjct: 363 QSASVLGVGSVVESNQYSVTRHFTPSPSAELSAAAAESSPVVVPGVFITYDLSPIKVFVI 422
Query: 339 EEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
E+H S LH + +CA+ GGVFTV+G++D+ I+HG R +++K++ GK S
Sbjct: 423 EKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRRVQRKMQQGKQS 472
>gi|440299607|gb|ELP92159.1| endoplasmic reticulum-golgi intermediate compartment protein,
putative [Entamoeba invadens IP1]
Length = 361
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 135/370 (36%), Positives = 217/370 (58%), Gaps = 25/370 (6%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M+ I+ DAYPK+N D R + GG+++++ + M +F+SE++ Y L VD S
Sbjct: 1 MDTIKRFDAYPKLNYDVRVRYWLGGLLSILCLLTMGWMFYSEVQDYYTVQMRPTLRVDES 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+ E L INFD+TFP + CS++++D +D +GE +D++ ++ KKRL+
Sbjct: 61 KSEKLPINFDITFPRISCSLMTIDVLDTTGEVSIDIESNVNKKRLNPHS----------- 109
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESS--DEDCCNNCEEVREAYRKKGWALSNPDL 182
+ + ++ Y C E S CC C+E++E+Y+K G + P+
Sbjct: 110 -------MTESSNKATAHKVYGIECPACEESVDKNKCCFTCDELKESYKKAGKEVP-PNA 161
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+ QC+ + + +GEGC++YG + VN+V+GNFH APG S Q H H A
Sbjct: 162 V-QCQLKNIQKMALALDGEGCHMYGSVFVNRVSGNFHIAPGMSEQQGEGHRHS--AEWIG 218
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
S N++H N L+FG++FPG++ P+D ++ T + MYQYF++VVP Y + +++N
Sbjct: 219 SLNLTHTWNSLSFGDNFPGMIKPMDSIQKVDVTNNSMYQYFVQVVPMTYFGLDKKVVKTN 278
Query: 303 QFSVTEHFRSSEQGRL-QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
+SVTEH+RS + Q +PGVF Y++S ++V +TEE SF H LT +C IVGG+FT+
Sbjct: 279 GYSVTEHYRSGNLKTMEQGVPGVFVLYEISSMEVLYTEETGSFGHLLTGICGIVGGIFTI 338
Query: 362 SGIIDAFIYH 371
++DAFI+H
Sbjct: 339 FSLLDAFIFH 348
>gi|85115136|ref|XP_964815.1| hypothetical protein NCU08607 [Neurospora crassa OR74A]
gi|28926610|gb|EAA35579.1| hypothetical protein NCU08607 [Neurospora crassa OR74A]
Length = 444
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 161/442 (36%), Positives = 228/442 (51%), Gaps = 79/442 (17%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+ LDA+ K ED RT SGG++T+VS +V+L L + E R Y V +L+VD RG
Sbjct: 6 RFTKLDAFTKTVEDARIRTTSGGIVTIVSLLVVLFLSWGEWRDYRKVVIHPELVVDKGRG 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
E + I+ ++TFP +PC +L++D MD+SGEQ V+H + K RL Q G
Sbjct: 66 ERMEIHLNITFPKVPCELLTLDVMDVSGEQQHGVQHGVKKIRLRPQSE---------GGG 116
Query: 127 KID-KPLQRHGG---RLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALS 178
+ID K L H + +YCG CYGA + CC+ CEEVREAY + WA
Sbjct: 117 EIDAKVLSLHAADESATHLDPSYCGPCYGAPAPYNAKKPGCCSTCEEVREAYAQASWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ ++QC+RE + +R+ E+ EGC I G L VNKV GNFH APG+SF +HVHD+
Sbjct: 177 DGATMEQCQREHYTERLAEQRHEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVHDLAQ 236
Query: 239 FQR----DSFNISHKINKLAFGEHFPGV------------------VNPLDGVRWTQETP 276
+ + SH I+ L FG P +NPLD + P
Sbjct: 237 WWSTPVPGGHSFSHIIHSLRFGPQLPDDLVRKLGGNGKNTLWTNHHLNPLDNTKQETNDP 296
Query: 277 SGMYQYFIKVVPTVYTDV---------------------------SGHTIQSNQFSVTEH 309
+ + YF+K+VPT Y + S +++++Q+SVT H
Sbjct: 297 NYNFMYFVKIVPTSYLPLGWEKQAAQNKAAWEQDHSVGLGAYGYGSDGSMETHQYSVTSH 356
Query: 310 FRS------SEQG---RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 356
RS S++G RL + +PGVFF YD+SP+KV EE SFL FL +CA+VG
Sbjct: 357 KRSLTGGDDSKEGHGERLHSRGGIPGVFFSYDISPMKVVNREERAKSFLGFLAGLCAVVG 416
Query: 357 GVFTVSGIIDAFIYHGQRAIKK 378
G TV+ +D ++ G +KK
Sbjct: 417 GTLTVAAAVDRGLFEGTVRLKK 438
>gi|346979363|gb|EGY22815.1| ER-derived vesicles protein ERV46 [Verticillium dahliae VdLs.17]
Length = 435
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 161/437 (36%), Positives = 225/437 (51%), Gaps = 70/437 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ++ RT SGG+IT+VS ++ L + E Y +L+VD
Sbjct: 2 AGKSRFTRLDAFTKTVDEARIRTSSGGIITIVSLFIVFWLAWGEWADYRRITLHPELIVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RGE + I+ ++TFP +PC +L++D MD+SGEQ + I K RL SQ +DG
Sbjct: 62 KGRGEKMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGIVSGISKVRLRSQ-------KDG 114
Query: 123 IGAPKID-KPLQRHGGRLEHNE---TYCGSCYGAESS----DEDCCNNCEEVREAYRKKG 174
G ID K L H YCG CYGA++ + CCN CEEVREAY +
Sbjct: 115 GGV--IDTKALSLHAADEAATHLAPDYCGDCYGAKAPANAVKQGCCNTCEEVREAYAQAS 172
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
WA + ++QC RE + +R+ E+ EGC I G L VNKV GNFH APG+SF +HVH
Sbjct: 173 WAFGKGENVEQCTREHYAERLDEQRAEGCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVH 232
Query: 235 DILAFQRD--SFNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETP 276
D+ + + + +H+I+ L FG P + NPLDG P
Sbjct: 233 DLKNYWDGDITHDFTHQIHALRFGPQLPESITKNLGNKATPWTNHHLNPLDGTSQITTDP 292
Query: 277 SGMYQYFIKVVPTVYTDV----------------------SGHTIQSNQFSVTEHFRSSE 314
S + YF+K+VPT Y + S +I+++Q+SVT H RS
Sbjct: 293 SFNFMYFVKIVPTSYLPLGWDSKRSPQDHDGGLLGSFGQGSDGSIETHQYSVTSHKRSLS 352
Query: 315 QG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTV 361
G RL T +PGVFF YD+SP+KV EE SF FLT +CA++GG TV
Sbjct: 353 GGDDSAEGHAERLHTRGGIPGVFFSYDISPMKVINREERSKSFTGFLTGLCAVIGGTLTV 412
Query: 362 SGIIDAFIYHGQRAIKK 378
+ +D ++ G +KK
Sbjct: 413 AAAVDRGMFEGSLRLKK 429
>gi|389632999|ref|XP_003714152.1| hypothetical protein MGG_01245 [Magnaporthe oryzae 70-15]
gi|351646485|gb|EHA54345.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Magnaporthe oryzae 70-15]
Length = 439
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 163/440 (37%), Positives = 226/440 (51%), Gaps = 72/440 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGG+IT+VS IV+L L + E Y +L+VD
Sbjct: 2 APKSRFTRLDAFTKTVEDARIRTTSGGIITIVSLIVVLYLAWGEWADYRRIDIHPELIVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESR 119
SRG+ + I+ ++TFP +PC +L++D MD+SGEQ V+H + K RL Q G VI+++
Sbjct: 62 KSRGDRMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVIKVRLRPQSEGGGVIDAK 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGW 175
+ A L+ N YCG CYGA + CCN C+EVREAY + W
Sbjct: 122 TLALHAE------DEAATHLDPN--YCGGCYGAPAPANAKKAGCCNTCDEVREAYAQASW 173
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
A + ++QC RE + +R+ E+ EGC I G L VNKV GNFH APG+SF +HVHD
Sbjct: 174 AFGRGENVEQCTREHYAERLDEQRHEGCQIAGSLRVNKVVGNFHLAPGRSFSNGNMHVHD 233
Query: 236 ILAFQ----RDSFNISHKINKLAFGEHFPGV------------------VNPLDGVRWTQ 273
+ + + SH I+ L FG P +NPLDGV T
Sbjct: 234 LKNYWDTPVEGGHSFSHTIHSLRFGPQLPPSALEKLGNKDKNMPWTNHHINPLDGVIQTT 293
Query: 274 ETPSGMYQYFIKVVPTVYTDVSGH----------------------TIQSNQFSVTEHFR 311
P+ Y YF+K+VPT Y + +++++Q+SVT H R
Sbjct: 294 VDPNFNYMYFVKIVPTSYLPLGWEKRTHLATMHDHGVGTYGYSGDGSVETHQYSVTSHKR 353
Query: 312 SSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGV 358
S G R+ + +PGVFF YD+SP+KV E +F FLT +CAI+GG
Sbjct: 354 SLAGGDDGEDGHKERMHSRGGIPGVFFSYDISPMKVINREVRTKTFAGFLTGLCAILGGT 413
Query: 359 FTVSGIIDAFIYHGQRAIKK 378
TV+ ID + G IKK
Sbjct: 414 LTVAAAIDRMTFEGVTRIKK 433
>gi|242803029|ref|XP_002484091.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
stipitatus ATCC 10500]
gi|218717436|gb|EED16857.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
stipitatus ATCC 10500]
Length = 440
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 161/437 (36%), Positives = 226/437 (51%), Gaps = 71/437 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++TLVS +V+L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARVRTTSGGIVTLVSLVVILWLVWGEWADYRRVVVLPELIVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD--SQGNVIESRQDGI 123
GE + I+ ++TFP LPC +L++D MD+SGEQ + V H + K RL ++G + I
Sbjct: 65 GERMEIHLNMTFPRLPCELLTLDVMDVSGEQQMGVVHGLNKVRLSPVAEGGKV------I 118
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSN 179
K++ Q + N YCG C GA ++ CCN CEEVREAY K WA
Sbjct: 119 DVAKLELHAQNEVA-VHLNPEYCGQCGGAPPPPNTNKPGCCNTCEEVREAYALKSWAFGK 177
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+ I+QC+REG+ ++I + EGC I G + VNKV GNFH APG+SF +HVHD+ +
Sbjct: 178 GENIEQCQREGYAEKINAQRREGCRIEGDIRVNKVIGNFHIAPGRSFSTGNMHVHDLDTY 237
Query: 240 Q------RDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMYQ 281
+ +SH I++L FG + NPLD + + P+ Y
Sbjct: 238 MDRELSDNEKHTMSHIIHQLRFGPQLSDELSRRWQWTDHHHTNPLDDTQQFTDEPAYNYN 297
Query: 282 YFIKVVPTVYTDVSGHTIQSN---------------------------QFSVTEHFRSSE 314
Y+IKVV T Y + + QS+ Q+SVT H RS
Sbjct: 298 YYIKVVSTSYLPLGWDSSQSDQLHGDDQSTPLGLHGAVHGAAGSLETHQYSVTSHKRSLH 357
Query: 315 QG---------RLQT---LPGVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTV 361
G R+ +PGVFF YD+SP+KV E +F FLT VCA++GG TV
Sbjct: 358 GGNDAAEGHKERVHAEGGIPGVFFNYDISPMKVVNREVRPKTFTGFLTGVCAVIGGTLTV 417
Query: 362 SGIIDAFIYHGQRAIKK 378
+ +D F+Y G R ++K
Sbjct: 418 AAAVDRFLYEGSRRMRK 434
>gi|169770949|ref|XP_001819944.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus oryzae
RIB40]
gi|238486566|ref|XP_002374521.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
flavus NRRL3357]
gi|83767803|dbj|BAE57942.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699400|gb|EED55739.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
flavus NRRL3357]
gi|391874294|gb|EIT83200.1| COPII vesicle protein [Aspergillus oryzae 3.042]
Length = 436
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 162/436 (37%), Positives = 227/436 (52%), Gaps = 71/436 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG+IT+ S + +L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARIRTTSGGIITIASLLAILWLVWGEWVDYRRVVVLPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP LPC +L++D MD+SGEQ V H I K RL S G+VI+ +
Sbjct: 65 GEKMEIHLNMTFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRLSSPAEGGHVIDVKALE 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAES--SDEDCCNNCEEVREAYRKKGWALSNP 180
+ + Q L+ N YCG C G ++ CCN CEEVREAY ++ WA
Sbjct: 125 LHSE------QEAAKHLDPN--YCGDCGGVPQPGGEKRCCNTCEEVREAYAQQQWAFGKG 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF- 239
+ I+QC+REG+ QR+ + EGC + G L VNKV GNFH APG+SF VHVHD+ +
Sbjct: 177 ENIEQCEREGYAQRLDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNVHVHDLENYF 236
Query: 240 -----QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMYQY 282
+ ++H I++L FG P + NPLD + P+ + Y
Sbjct: 237 EGDLPDAEKHTMTHIIHQLRFGPQLPDELSDRWQWTDHHHTNPLDSTQQETSDPAYNFMY 296
Query: 283 FIKVVPTVYTDV---------------------------SGHTIQSNQFSVTEHFRS--- 312
F+KVV T Y + S +I+++Q+SVT H RS
Sbjct: 297 FVKVVSTSYLPLGWDPLFSSAVHSAYEDSPLGSHGIAYGSQSSIETHQYSVTSHKRSLRG 356
Query: 313 ---SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 362
S++G + L PGVFF YD+SP+KV E +F FLT VCAI+GG TV+
Sbjct: 357 GDASDEGHKERLHAANGIPGVFFNYDISPMKVINKEARPKTFTGFLTGVCAIIGGTLTVA 416
Query: 363 GIIDAFIYHGQRAIKK 378
+D +Y G +KK
Sbjct: 417 AALDRGLYEGALRVKK 432
>gi|296417040|ref|XP_002838173.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634087|emb|CAZ82364.1| unnamed protein product [Tuber melanosporum]
Length = 399
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 162/400 (40%), Positives = 217/400 (54%), Gaps = 36/400 (9%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+++ LDA+ K ED RT SGG++TLVS V+ +L E R Y +L+VD +R
Sbjct: 5 SRLTRLDAFTKTVEDARVRTTSGGIVTLVSLFVVFVLVVGEFREYRRIQVLPELVVDKTR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE L I+ ++TFP +PC +L++D MD+SGEQ + H I RL ES+
Sbjct: 65 GEQLPISLNITFPHIPCELLTLDVMDVSGEQQSSITHGIHLTRLTP---FPESK------ 115
Query: 126 PKIDKPLQRHGGRLEH-NETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDL 182
P L H H + YCG CYGA ++D CC CE+VREAY GWA +
Sbjct: 116 PVSTTSLNVHEDTASHLDPAYCGKCYGAPGPEKDKGCCQTCEDVREAYASIGWAFGKGEG 175
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--Q 240
++QC+RE + +R+ E EGCNI G L VNKV GNFH APGKSF + +HVHD+ +
Sbjct: 176 VEQCEREHYAERLDEMREEGCNIAGHLSVNKVIGNFHIAPGKSFSSAQMHVHDLNQYFAS 235
Query: 241 RDSFNISHKINKLAFGEHFPGVV----NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
+H I+ L+FG P V NPLD R + S + YFIKVV T Y +
Sbjct: 236 TKEHTFTHTIHHLSFGPDLPANVKVQRNPLDDSRQVTQERSFNFMYFIKVVSTSYLPLGT 295
Query: 297 H-------TIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTE 339
I+++Q+SVT H RS G + +PGVFF YD+SP+KV E
Sbjct: 296 SENSYIPGAIETHQYSVTSHKRSLMGGADKEHASTIHARGGIPGVFFSYDISPMKVINRE 355
Query: 340 EHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
SF FLT VCA++GG TV+ ID +Y G +KK
Sbjct: 356 VRAKSFAGFLTGVCAVIGGTLTVAAAIDRGLYEGGMRVKK 395
>gi|71407913|ref|XP_806393.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870127|gb|EAN84542.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 406
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 153/411 (37%), Positives = 225/411 (54%), Gaps = 34/411 (8%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + LD +PK + +D RT GG+ +L+S +++ +L E+R + + V + ++
Sbjct: 1 MRWLGQLDVFPKFDTKFEQDARQRTAIGGIFSLLSLLIIAVLVIGEVRYFFSTVEQHEMY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG--NVIES 118
VD G T+ I ++TFP +PC +++ DA+D G V+ D K R+ + + E+
Sbjct: 61 VDPDIGGTMEITVNITFPRVPCDLITADAIDAFGTFAEGVERDTVKSRVAASTLEKISEA 120
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
R KI K L G E+ C SCYGAE CC+ CE+VR AY + W +
Sbjct: 121 RPLVDEKKKITKALDPSGAEKEN----CPSCYGAEPEPGACCHTCEDVRRAYSLRRWVFN 176
Query: 179 NPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
D+ ++QC E + EGCN++ +V +V GN HF PG+ F+ G H+HD
Sbjct: 177 EDDVSVEQCAEERLRKAAILSSQEGCNLFVNYKVARVTGNIHFVPGRMFNLMGQHLHDFR 236
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDG-------VRWTQETPSGMYQYFIKVVPTV 290
N+SH ++ L FGE FPG VNP+DG V T+E +G + YF+KVVPT
Sbjct: 237 GKTVRQLNLSHIVHTLGFGERFPGQVNPMDGLVNLRGAVDATEEV-NGRFSYFVKVVPTQ 295
Query: 291 YTDVS----GHTIQSNQFSVTEHFRSSEQGRLQ---------TLPGVFFFYDLSPIKVTF 337
Y S G ++SNQ+SVT HF S L +PGVF YDLSPIKV
Sbjct: 296 YQSASILGVGSVVESNQYSVTHHFTPSPSAELSAAAAESSPVMVPGVFITYDLSPIKVFV 355
Query: 338 TEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
E+H S LH + +CA+ GGVFTV+G++D+ I+HG R +++K++ GK S
Sbjct: 356 FEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRRVQRKMQQGKQS 406
>gi|303322923|ref|XP_003071453.1| hypothetical protein CPC735_069900 [Coccidioides posadasii C735
delta SOWgp]
gi|240111155|gb|EER29308.1| hypothetical protein CPC735_069900 [Coccidioides posadasii C735
delta SOWgp]
gi|320033474|gb|EFW15422.1| COPII-coated vesicle membrane protein Erv46 [Coccidioides posadasii
str. Silveira]
Length = 435
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 159/437 (36%), Positives = 224/437 (51%), Gaps = 70/437 (16%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ ++ LDA+ K ED RT SGGV+T+VS IV++LL + E R Y V +L+VD
Sbjct: 3 VKSRFTRLDAFAKTVEDARIRTRSGGVVTIVSLIVVILLVWGEWRDYRRVVVLPELIVDK 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ 120
RGE + I+ ++TFP LPC +L++D MD+SGEQ V H + K RL + G+ ++
Sbjct: 63 GRGERMEIHLNITFPHLPCQLLTLDVMDVSGEQQSGVIHGVNKVRLSAASEGGHALD--- 119
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWA 176
+ +DK Q L + YCGSCY + CCN C+EVREAY + WA
Sbjct: 120 --VETVDLDKKDQ---APLHLDPGYCGSCYDGIPPPNAKKPGCCNTCDEVREAYALRNWA 174
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+ ++QC++EG+ +I + EGC + G L VNKV GNFH APG+SF +H HD+
Sbjct: 175 FGRGEGVEQCEQEGYGSKIDSQRNEGCRLEGILRVNKVVGNFHVAPGRSFTNGYMHAHDL 234
Query: 237 LAFQRDSF--NISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMYQY 282
+ +SH I++L FG P + NPLD T E P + Y
Sbjct: 235 KTYYETPVKHTMSHIIHQLRFGPQLPDELSQKWKWTDHHHTNPLDSTSQTTEDPKFNFMY 294
Query: 283 FIKVVPTVYTDVSGH----------------------------TIQSNQFSVTEHFRSSE 314
F+KVV T Y + +I+++Q+SVT H RS E
Sbjct: 295 FVKVVSTSYLPLGWDASLSSEVHSRLSSDAPLGKQGIQLGQYGSIETHQYSVTSHKRSIE 354
Query: 315 QG---------RLQT---LPGVFFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTV 361
G R+ T +PGVFF YD+SP+KV E L FLT VCA++GG TV
Sbjct: 355 GGDDSAEGHKERVHTAGGIPGVFFNYDISPMKVINREARTKSLSGFLTGVCAVIGGTLTV 414
Query: 362 SGIIDAFIYHGQRAIKK 378
+ +D +Y G +KK
Sbjct: 415 AAAVDRALYEGSVRVKK 431
>gi|119189667|ref|XP_001245440.1| hypothetical protein CIMG_04881 [Coccidioides immitis RS]
gi|392868334|gb|EAS34105.2| COPII-coated vesicle membrane protein Erv46 [Coccidioides immitis
RS]
Length = 435
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 158/437 (36%), Positives = 224/437 (51%), Gaps = 70/437 (16%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ ++ LDA+ K ED RT SGGV+T+VS IV++LL + E + Y V +L+VD
Sbjct: 3 VKSRFTRLDAFAKTVEDARIRTRSGGVVTIVSLIVVILLVWGEWKDYRRVVVLPELIVDK 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ 120
RGE + I+ ++TFP LPC +L++D MD+SGEQ V H + K RL + G+ ++
Sbjct: 63 GRGERMEIHLNITFPHLPCQLLTLDVMDVSGEQQSGVIHGVNKVRLSAASEGGHALD--- 119
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWA 176
+ +DK R L + YCGSCY + CCN C+EVREAY + WA
Sbjct: 120 --VETLDLDK---RDQAPLHLDPAYCGSCYDGIPPPNAKKPGCCNTCDEVREAYALRNWA 174
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+ ++QC++EG+ +I + EGC + G L VNKV GNFH APG+SF +H HD+
Sbjct: 175 FGRGEGVEQCEQEGYGSKIDSQRNEGCRLEGILRVNKVVGNFHVAPGRSFTNGYMHAHDL 234
Query: 237 LAFQRDSF--NISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMYQY 282
+ +SH I++L FG P + NPLD T E P + Y
Sbjct: 235 KTYYETPVKHTMSHIIHQLRFGPQLPDELSQKWKWTDHHHTNPLDSTSQTTEDPKFNFMY 294
Query: 283 FIKVVPTVYTDVSGH----------------------------TIQSNQFSVTEHFRSSE 314
F+KVV T Y + +I+++Q+SVT H RS E
Sbjct: 295 FVKVVSTSYLPLGWDASLSSEVHSRLSSDAPLGKQGIQLGQYGSIETHQYSVTSHKRSIE 354
Query: 315 QG---------RLQT---LPGVFFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTV 361
G R+ T +PGVFF YD+SP+KV E L FLT VCA++GG TV
Sbjct: 355 GGDDSAEGHKERVHTAGGIPGVFFNYDISPMKVINREARTKSLSGFLTGVCAVIGGTLTV 414
Query: 362 SGIIDAFIYHGQRAIKK 378
+ +D +Y G +KK
Sbjct: 415 AAAVDRALYEGSVRVKK 431
>gi|402083890|gb|EJT78908.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 444
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 160/445 (35%), Positives = 225/445 (50%), Gaps = 77/445 (17%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGG+IT+VS IV+L L E Y +L+VD
Sbjct: 2 APKSRFTRLDAFTKTVEDARIRTTSGGIITIVSLIVVLYLALGEWSDYRRIAIHPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESR 119
SRG+ + I+ ++TFP +PC +L++D MD+SGEQ V+H + K RL Q G VI+ +
Sbjct: 62 KSRGDRMEIHLNITFPRMPCELLTLDVMDVSGEQQHGVQHGVVKVRLQPQSEGGGVIDVK 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGW 175
+ A D+ H + YCG CYGA ++ CC+ C+EVREAY + W
Sbjct: 122 ALSLHA---DEDSATH-----LDPKYCGPCYGAPAPSNAAKAGCCSTCDEVREAYAQASW 173
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
A + ++QC RE + +R+ E+ EGC I G L VNKV GNFH APG+SF +HVHD
Sbjct: 174 AFGRGENVEQCLREHYAERLDEQRQEGCQIAGSLRVNKVIGNFHLAPGRSFSNGNMHVHD 233
Query: 236 ILAFQRDSFN----ISHKINKLAFGEHFPGVV-------------------NPLDGVRWT 272
+ + + SH ++ L+FG P V NPLDG
Sbjct: 234 LKNYWDTPVDGGHSFSHVVHSLSFGPQLPLEVQKRLDRGRSLPWADHSHQLNPLDGTSQE 293
Query: 273 QETPSGMYQYFIKVVPTVYTDVSGH--------------------------TIQSNQFSV 306
P+ + YF+K+VPT Y + ++++Q+SV
Sbjct: 294 TADPNFSFMYFLKIVPTSYLPLGWEGRRAKIATGNHDKDSWVGTYGYSPDGAVETHQYSV 353
Query: 307 TEHFRS---------SEQGRLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCA 353
T H RS Q RL + +PGVFF YD+SP+KV EE +F FLT +CA
Sbjct: 354 TSHKRSLAGGDDAAEGHQERLHSKGGIPGVFFSYDISPMKVINREERPKTFAGFLTGLCA 413
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKK 378
I+GG TV+ +D Y G +KK
Sbjct: 414 ILGGTLTVAAAVDRTFYEGATRLKK 438
>gi|429853391|gb|ELA28466.1| copii-coated vesicle membrane protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 437
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 157/435 (36%), Positives = 226/435 (51%), Gaps = 70/435 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG++T+VS IV+L L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARVRTTSGGIVTIVSLIVVLWLAWGEWVDYRRIEIHPELIVDQGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP +PC +L++D MD+SGEQ V H + K RL Q ++G G
Sbjct: 65 GERMEIHLNITFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRLRPQ-------KEGGGV 117
Query: 126 PKIDKPLQRHGG--RLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALS 178
+ K L H EH + YCG CYGA + CCN CEEVREAY + WA
Sbjct: 118 IDV-KALSLHSSDEAAEHLDPNYCGPCYGAPAPPNAQKAGCCNTCEEVREAYAQASWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ ++QC RE + ++++E+ EGC I G L VNKV GNFH APG+SF +HVHD+
Sbjct: 177 KGENVEQCTREHYAEKLEEQRREGCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVHDLKN 236
Query: 239 FQRD----SFNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPSG 278
+ + +H I+ L FG P + NPLD P+
Sbjct: 237 YWETPDDAQHDFTHVIHTLRFGPQLPDTITKKMTKRAYAWTNHHGNPLDSTHQETNDPNY 296
Query: 279 MYQYFIKVVPTVYTDVS------------------GH----TIQSNQFSVTEHFRS---- 312
+ YF+K+VPT Y ++ GH +++++Q+SVT H RS
Sbjct: 297 NFMYFVKIVPTSYLALNWQKSASIQDEESSGLGLLGHLSDGSVETHQYSVTSHKRSLAGG 356
Query: 313 -----SEQGRLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSG 363
Q RL + +PGVFF YD+SP+KV EE +F FLT +CAI+GG TV+
Sbjct: 357 DDSAEGHQERLHSRGGIPGVFFSYDISPMKVINREERAKTFTGFLTGLCAIIGGTLTVAA 416
Query: 364 IIDAFIYHGQRAIKK 378
+D ++ G +KK
Sbjct: 417 AVDRGVFEGGLRLKK 431
>gi|336265645|ref|XP_003347593.1| hypothetical protein SMAC_04901 [Sordaria macrospora k-hell]
gi|380096460|emb|CCC06508.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 428
Score = 264 bits (674), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 157/434 (36%), Positives = 223/434 (51%), Gaps = 71/434 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGG++T+VS +V+L L + E R Y V +L+VD
Sbjct: 2 AGKSRFTKLDAFTKTVEDARIRTTSGGIVTIVSLLVVLFLSWGEWRDYRKVVIHPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RGE + I+ ++TFP +PC +L++D MD+SGEQ V+H + K RL Q
Sbjct: 62 KGRGERMEIHLNITFPKVPCELLTLDVMDVSGEQQHGVQHGVKKIRLRPQSE-------- 113
Query: 123 IGAPKID-KPLQRHGG---RLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKG 174
G +ID K L H + +YCG CYGA + CC+ CEE+REAY +
Sbjct: 114 -GGGEIDAKVLALHAADESATHLDPSYCGPCYGAPAPYNAKKAGCCSTCEEIREAYAQAS 172
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
WA + ++QC+RE + +R+ E+ EGC I G L VNKV GNFH APG+SF +HVH
Sbjct: 173 WAFGDGSTMEQCQREHYTERLAEQRHEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVH 232
Query: 235 DILAFQRDSF----------NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFI 284
D+ + K N L H +NPLD R + P+ + YF+
Sbjct: 233 DLAQWWNSPLPDDLVRKLGGGKDGKRNTLWTNHH----LNPLDNTRQETDDPNYNFMYFV 288
Query: 285 KVVPTVYTDV---------------------------SGHTIQSNQFSVTEHFRSSEQG- 316
K+VPT Y + S +++++Q+SVT H RS G
Sbjct: 289 KIVPTSYLPLGWEKQAAQNKASWDQDHSVGLGVFGQGSDGSMETHQYSVTSHKRSLAGGD 348
Query: 317 --------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGI 364
RL + +PGVFF YD+SP+KV EE SF+ FL +CA+VGG TV+
Sbjct: 349 DAKEGHGERLHSRGGIPGVFFSYDISPMKVVNREERAKSFIGFLAGLCAVVGGTLTVAAA 408
Query: 365 IDAFIYHGQRAIKK 378
+D ++ G +KK
Sbjct: 409 VDRGLFEGTVRLKK 422
>gi|50548631|ref|XP_501785.1| YALI0C13112p [Yarrowia lipolytica]
gi|49647652|emb|CAG82095.1| YALI0C13112p [Yarrowia lipolytica CLIB122]
Length = 401
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 149/400 (37%), Positives = 228/400 (57%), Gaps = 26/400 (6%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+++K+ DA+ K D +T SGG++TL++ +++++L SE Y V +++ VD
Sbjct: 1 MLSKLFRYDAFAKPTADATIKTASGGIVTLLAILLIVVLTISEYWAYTTPVMRSQMTVDR 60
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
RG+ L I+ ++TFP LPCS++++D +D SGE V HD+ K LD +GN++ S +
Sbjct: 61 YRGDRLDIHLNITFPQLPCSLVTLDIIDSSGEVQQSVDHDMTKVTLDERGNILSSEALTL 120
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G K + + + YCGSCYGAES + CCN CE+VR AY KGWA ++ +
Sbjct: 121 GENPDSKAVAKR--TFLDDPNYCGSCYGAESEPDQCCNTCEQVRAAYATKGWAFTDGSGV 178
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ--R 241
+QC+ GF +++K + +GCNI G V KVAGNFHFAPG S H+ H+HD+ F+
Sbjct: 179 EQCEVIGFKEQLKAQYNQGCNIAGKFTVQKVAGNFHFAPGVSSHRDEQHLHDLSHFKDPE 238
Query: 242 DSFNISHKINKLAFGEHF--------PGVV---NPLDGVRWTQETPSGMYQYFIKVVPTV 290
F SH I+ L+FGE GV +PL+ + + YF KVV T
Sbjct: 239 APFTFSHIIHDLSFGEQVDVSGLDWDKGVAMETSPLENTPHHTDNKWFRFNYFTKVVSTR 298
Query: 291 YTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEE 340
+ + G I++NQ++ T H R + GR + LPGVFF YD+SP+++ +E
Sbjct: 299 FEFLDGKKIETNQYAATAHERPLQGGRDEDHQNTRHMRGGLPGVFFSYDISPMRIVNKQE 358
Query: 341 HVS-FLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
+ S F F+ V A +GGV TV+ ++D IY + +K+K
Sbjct: 359 YRSHFGAFVMQVVATIGGVLTVAAVLDRGIYEVDQVLKRK 398
>gi|380489161|emb|CCF36889.1| hypothetical protein CH063_08353 [Colletotrichum higginsianum]
Length = 437
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 156/436 (35%), Positives = 226/436 (51%), Gaps = 72/436 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG++T+VS IV+ L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARVRTTSGGIVTIVSLIVVFWLAWGEWVDYRKIEIHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP +PC +L++D MD+SGEQ V H + K RL SQ G VI+ +
Sbjct: 65 GERMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGVIHGVNKVRLRSQKEGGGVIDMK--- 121
Query: 123 IGAPKIDKPLQRHGGRLEH-NETYCGSCYGAES----SDEDCCNNCEEVREAYRKKGWAL 177
+D L EH + YCG+CYGA++ CCN CEEVREAY + WA
Sbjct: 122 ----ALD--LHSREATAEHLDPNYCGACYGAQAPANAQKAGCCNTCEEVREAYAQASWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ ++QC RE + +R++E+ EGC + G L VNKV GNFH APG+SF +HVHD+
Sbjct: 176 GKGENVEQCTREHYAERLEEQRQEGCRLEGNLRVNKVVGNFHLAPGRSFSNGNMHVHDLK 235
Query: 238 AF----QRDSFNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPS 277
+ + +H I+ L FG P V NPLD P+
Sbjct: 236 NYWDTPDDAQHDFTHTIHSLRFGPQLPDQVTKKMGKRAYAWTNHHGNPLDNTHQETTDPN 295
Query: 278 GMYQYFIKVVPTVYTDVSGH----------------------TIQSNQFSVTEHFRSSEQ 315
+ YF+K+VPT Y ++ +++++Q+SVT H RS
Sbjct: 296 YNFMYFVKIVPTSYLALNWQKSSSYQDEENSGLGLLGQGNDGSVETHQYSVTSHKRSLAG 355
Query: 316 G---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 362
G RL + +PGVFF YD+SP+KV EE +F FLT +CAI+GG TV+
Sbjct: 356 GDDAAEGHKERLHSRGGIPGVFFSYDISPMKVINREERAKTFTGFLTGLCAIIGGTLTVA 415
Query: 363 GIIDAFIYHGQRAIKK 378
+D ++ G +KK
Sbjct: 416 AAVDRGVFEGGLRLKK 431
>gi|213409826|ref|XP_002175683.1| COPII-coated vesicle component Erv46 [Schizosaccharomyces japonicus
yFS275]
gi|212003730|gb|EEB09390.1| COPII-coated vesicle component Erv46 [Schizosaccharomyces japonicus
yFS275]
Length = 394
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/394 (36%), Positives = 225/394 (57%), Gaps = 28/394 (7%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
++R DA+ K ED +T GG+I+++S++++ ++ F E + Y V + +++VD SR
Sbjct: 6 QLRRFDAFTKTVEDAKIKTAGGGLISIISAVIVFVIVFLEWKNYQRIVVQPEIVVDPSRN 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
E + INF++TFP +PC + VD MDISG+ DV+H + K RLD GN+I IG+
Sbjct: 66 ERMEINFNITFPHVPCHYMGVDVMDISGDFQQDVQHSVTKTRLDKYGNIIAVIDSDIGSA 125
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESS----DEDCCNNCEEVREAYRKKGWALSNPDL 182
+ + + G E CG CYGA + CCNNC+ VR+AY +K WA+ + D
Sbjct: 126 TDESAMDKDG------EVTCGDCYGAGDAAPPETPGCCNNCKAVRDAYARKQWAIGDYDA 179
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--Q 240
QC+ E + ++GEGCNI G L VN+VAGNFHFAPG+SF H+HD+ + +
Sbjct: 180 FQQCRDENYKAEHASQKGEGCNIAGHLFVNRVAGNFHFAPGRSFQTQQGHLHDLRGYEEE 239
Query: 241 RDSFNISHKINKLAFGEHF-PGV--VNPLDGVRWTQETPSGMYQYFIKVVPTVYT--DVS 295
+++ +++H I++L+FG P +PLDG + Y YFIK V + D +
Sbjct: 240 QEAHDMTHMIHQLSFGPPIKPSAEHTDPLDGHFKNTDDALHNYAYFIKCVAHKFVPLDPA 299
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTE-EHVSF 344
TI +N+FSVT+H RS GR +PGVFF D+SP+ V + +F
Sbjct: 300 DPTINTNEFSVTQHERSVTGGRENDNPSHLNRRGGIPGVFFNIDISPMLVIQRQIRGNTF 359
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
F++NV + +GG T++ ++D +Y + +KK
Sbjct: 360 GGFISNVLSFLGGFITLTTLVDRGLYAAELKMKK 393
>gi|340053482|emb|CCC47775.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 404
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 151/416 (36%), Positives = 224/416 (53%), Gaps = 48/416 (11%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +R LD +PK + +D RT GG+++ + +L E+R +L+ V + ++
Sbjct: 1 MKFLRCLDVFPKFDVRFEQDARQRTVVGGLLSFACMTAIAVLVVGEVRYFLSTVDQHEMY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL---------DS 111
VD G + I +VTFP +PC +++ DA+D GE DV K R+ ++
Sbjct: 61 VDPHIGGEMHITLNVTFPRVPCDLMTADAIDSFGEYAKDVIRSTRKMRVHADTLQPISEA 120
Query: 112 QGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYR 171
+G V+E RQ A GG C SCYGAE + DCCN C++VR A++
Sbjct: 121 RGLVVEKRQSSTNADS--------GG-----AEGCPSCYGAEKNPGDCCNTCDDVRNAFK 167
Query: 172 KKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG 230
KGW+ + D+ I QC E EGCNIY ++V GN HF PG F G
Sbjct: 168 DKGWSFNEDDIGIAQCAEERLRHAESSSSREGCNIYAKFSASRVKGNIHFVPGSMFDYYG 227
Query: 231 VHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQ------ETPSGMYQYFI 284
H+H + N+SH I++L FGE FPG NPLDG+ ++ E+ +G + YF+
Sbjct: 228 QHMHVLKGEIIRKMNLSHIIHQLDFGERFPGQKNPLDGMVNSRGVVDKSESTNGRFSYFV 287
Query: 285 KVVPTVYTDVS----GHTIQSNQFSVTEHFRSS--EQGRLQT-------LPGVFFFYDLS 331
+VVPT Y VS G +++NQ+SVT +F S GR ++ +PG+F YD+S
Sbjct: 288 QVVPTQYQHVSIFGTGRLLETNQYSVTHYFTESWNATGRDKSANDAPSVVPGIFILYDIS 347
Query: 332 PIK--VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
PIK V T + S +H + +CA+ GGVF V+ +ID+F++HG R ++KKI GK+
Sbjct: 348 PIKTSVKATHPYPSVVHLVLQLCAVGGGVFNVASLIDSFLFHGTRQVQKKIRQGKY 403
>gi|310800359|gb|EFQ35252.1| hypothetical protein GLRG_10396 [Glomerella graminicola M1.001]
Length = 437
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 155/435 (35%), Positives = 225/435 (51%), Gaps = 70/435 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG++T+VS IV+ L + E Y ++L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARIRTTSGGIVTIVSLIVVFWLAWGEWADYRRIEIHSELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP +PC +L++D MD+SGEQ V H + K RL R++G G
Sbjct: 65 GERMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRL-------RPRKEGGGV 117
Query: 126 PKIDKPLQRHG--GRLEH-NETYCGSCYGAES----SDEDCCNNCEEVREAYRKKGWALS 178
I K L H EH + YCG CYGA++ CCN C+EVREAY + WA
Sbjct: 118 IDI-KALDLHSRDDSAEHLDPNYCGPCYGAQAPPNAQKPGCCNTCDEVREAYAQASWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ ++QC RE + +R++E+ EGC I G L VN+V GNFH APG+SF +HVHD+
Sbjct: 177 KGEGVEQCTREHYAERLEEQRQEGCRIEGNLRVNRVVGNFHLAPGRSFSNGNMHVHDLKN 236
Query: 239 FQRD----SFNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPSG 278
+ + +H I+ L FG P V NPLD P+
Sbjct: 237 YWDTPADAQHDFTHTIHSLRFGPQLPDQVTKKMGKRAYAWTNHHGNPLDNTHQDTNDPNY 296
Query: 279 MYQYFIKVVPTVYTDVSGH----------------------TIQSNQFSVTEHFRS---- 312
+ YF+K+VPT Y ++ +++++Q+SVT H RS
Sbjct: 297 NFMYFVKIVPTSYLALNWQKSTAYQDDDSSSLGLLGQGNDGSVETHQYSVTSHKRSLAGG 356
Query: 313 -----SEQGRLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSG 363
Q RL + +PGVFF YD+SP+KV EE +F FLT +CAI+GG TV+
Sbjct: 357 DDAAEGHQERLHSRGGIPGVFFSYDISPMKVINREERAKTFTGFLTGLCAIIGGTLTVAA 416
Query: 364 IIDAFIYHGQRAIKK 378
+D ++ G +KK
Sbjct: 417 AVDRGVFEGGMRLKK 431
>gi|302923326|ref|XP_003053651.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256734592|gb|EEU47938.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 437
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 157/436 (36%), Positives = 225/436 (51%), Gaps = 72/436 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG++T+VS IV++ L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARIRTTSGGIVTIVSLIVVIFLAWGEWSEYRRVEIHPELIVDRGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP +PC +L++D MD+SGEQ V H + K RL Q G
Sbjct: 65 GERMEIHLNITFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRLQPQSK---------GG 115
Query: 126 PKID-KPLQRHGGRLEH-NETYCGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSN 179
ID K L H H + +YCG CYGA+ + CC C+EVREAY + WA
Sbjct: 116 ADIDSKSLSLHDDAAAHLDPSYCGGCYGAQPPANARKAGCCQTCDEVREAYAQASWAFGR 175
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+ ++QC+RE + +++ + EGC I G L VNKV GNFHFAPG+SF +HVHD+ +
Sbjct: 176 GEGVEQCEREHYAEKLDAQREEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLKNY 235
Query: 240 ----QRDSFNISHKINKLAFGEHFPGVV---------------NPLDGVRWTQETPSGMY 280
+ + + +H I+ L FG P V NPLDG R + P+ +
Sbjct: 236 WDAPKGKAHDFTHIIHSLRFGPQLPDEVARKVGKGTPWTNHHQNPLDGTRQDIKDPNFNF 295
Query: 281 QYFIKVVPTVYT----DVSG---------------------HTIQSNQFSVTEHFRSSEQ 315
YF+K+VPT Y D G +++++Q+SVT H RS
Sbjct: 296 MYFVKIVPTSYLPLGWDSKGLKIAGLLQDDTSLGAYGYAEDGSVETHQYSVTSHKRSLAG 355
Query: 316 G---------RLQT---LPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVS 362
G R T +PGVFF YD+SP+KV EE +F FL +CAIVGG TV+
Sbjct: 356 GNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKGKTFSGFLAGLCAIVGGTLTVA 415
Query: 363 GIIDAFIYHGQRAIKK 378
+D ++ G +KK
Sbjct: 416 AAVDRGLFEGAARLKK 431
>gi|378732932|gb|EHY59391.1| hypothetical protein HMPREF1120_07381 [Exophiala dermatitidis
NIH/UT8656]
Length = 437
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 166/435 (38%), Positives = 225/435 (51%), Gaps = 78/435 (17%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLR 70
LDA+ K ED RT SGG++T+VS +V++ L E Y V + +L+VD RGE +
Sbjct: 10 LDAFTKTVEDARIRTTSGGIVTIVSILVVIYLILGEWADYRRIVVQPELVVDKGRGEKME 69
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID- 129
I+ ++TFP +PC +L++D MD+SGEQ V H + K RL S V E G+ ID
Sbjct: 70 IHLNITFPRIPCELLTLDVMDVSGEQQSGVVHGVNKVRLTS---VAE------GSRVIDT 120
Query: 130 KPLQRH-----GGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNP 180
+ LQ H L+ + YCGSCY A + CCN C+EVREAY WA
Sbjct: 121 QALQLHQQAEVSSHLDPD--YCGSCYSAPAPPNAKKPGCCNTCDEVREAYAANSWAFGRG 178
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF- 239
+ ++QC+REG+ R+ E+ EGC I G + VNKV GNFH APG+SF +HVHD+ F
Sbjct: 179 EGVEQCEREGYGARLDEQRHEGCRIEGVIRVNKVVGNFHIAPGRSFSNGNMHVHDLNNFF 238
Query: 240 ---QRDSFNISHKINKLAFGEH-------FPGV-----VNPLDGVRWTQETPSGMYQYFI 284
+H+I+ L FG + G NPLDG+R + P + YFI
Sbjct: 239 DTPIEGGHTFTHEIHSLRFGPQLSDQEAKWTGADHHLNANPLDGLRQETDEPGYNFMYFI 298
Query: 285 KVVPTVYTDV----------------------------SGHTIQSNQFSVTEHFRSSEQG 316
KVV T Y + S +I+++Q+SVT H RS G
Sbjct: 299 KVVSTSYLPLGWDEDKSIQQHSSLSDLIPLGMHGKGAGSQGSIETHQYSVTSHKRSLAGG 358
Query: 317 ---------RLQT---LPGVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSG 363
RL +PGVFF YD+SP+KV E SF +FLT VCA++GG TV+
Sbjct: 359 NDAAEGHKERLHAHGGIPGVFFSYDISPMKVINREVRPKSFANFLTGVCAVIGGTLTVAA 418
Query: 364 IIDAFIYHGQRAIKK 378
ID +Y G +KK
Sbjct: 419 AIDRGLYEGATRLKK 433
>gi|121702771|ref|XP_001269650.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
clavatus NRRL 1]
gi|119397793|gb|EAW08224.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
clavatus NRRL 1]
Length = 438
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 162/439 (36%), Positives = 222/439 (50%), Gaps = 75/439 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++T+ S IV+L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARVRTTSGGIVTIASLIVILYLVWGEWVDYRRVVVLPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-D 121
GE + I+ ++TFP LPC ++++D MD+SGEQ + V H + K RL S G+V++ R D
Sbjct: 65 GERMEIHMNITFPRLPCELVTLDVMDVSGEQQVGVAHGVNKVRLSSPAEGGHVLDIRSLD 124
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAE----SSDEDCCNNCEEVREAYRKKGWAL 177
++ K L + YCG C GA+ + CCN C+EVREAY K WA
Sbjct: 125 LHSKDEVAKHL---------DPNYCGDCGGADPLPGAIKPGCCNTCDEVREAYAAKNWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
I+QC+REG+ RI + EGC + G L VNKV GNFH APG+SF +HVHD
Sbjct: 176 GKGANIEQCEREGYTARIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTNGNIHVHDTQ 235
Query: 238 AF------QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGM 279
A+ + H+I++L FG P + NPLD P+
Sbjct: 236 AYFDLDLPDDAKHTMEHEIHQLRFGPQLPDELSARWQWTDHHHTNPLDNTHQETNDPAYN 295
Query: 280 YQYFIKVVPTVYTDV---------------------------SGHTIQSNQFSVTEHFRS 312
+ YF+KVV T Y + + +I+++Q+SVT H RS
Sbjct: 296 FVYFVKVVSTSYLPLGWDPLFSSALHSTYEKAPLGAHGIGYGASGSIETHQYSVTSHKRS 355
Query: 313 SEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVF 359
G RL +PGVFF YD+SP+KV E L FLT VCAI+GG
Sbjct: 356 LRGGDAEDEGHKERLHAANGIPGVFFNYDISPMKVINREARPKTLSSFLTGVCAIIGGTL 415
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ ID +Y G +KK
Sbjct: 416 TVAAAIDRGLYEGALRVKK 434
>gi|396471326|ref|XP_003838845.1| similar to endoplasmic reticulum-golgi intermediate compartment
protein 3 [Leptosphaeria maculans JN3]
gi|312215414|emb|CBX95366.1| similar to endoplasmic reticulum-golgi intermediate compartment
protein 3 [Leptosphaeria maculans JN3]
Length = 439
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 160/440 (36%), Positives = 225/440 (51%), Gaps = 76/440 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++TLVS +V+ L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVEDARVRTTSGGIVTLVSLVVIFWLTWGEWADYRRVTVRPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP +PC +L++D MD+SGE + + H I K RL + + G+
Sbjct: 65 GERMEISLNITFPRMPCELLTLDVMDVSGELQMGITHGINKVRLSPEVD---------GS 115
Query: 126 PKID-KPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSN 179
ID KPL H H + +YCG+CYGA + CCN C+EVR+AY W+
Sbjct: 116 KVIDAKPLDLHQDEASHLDPSYCGNCYGAPPPTNAIKHGCCNTCDEVRDAYASISWSFGR 175
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+ ++QC+RE + + + E+ EGC + G ++VNKV GNFH APGKSF +HVHD+ +
Sbjct: 176 GEGVEQCEREHYAEHLDEQRQEGCRLEGSIKVNKVVGNFHIAPGKSFSNGNLHVHDLENY 235
Query: 240 QRDSF--NISHKINKLAFGEHF----------------PGV-----VNPLDGVRWTQETP 276
RD + +HKI+ L FG PG VNPLD +
Sbjct: 236 FRDEYAHTFTHKIHHLRFGPQLSQAVVQDMAKKHMATGPGGWTNHHVNPLDHTEQRTDEK 295
Query: 277 SGMYQYFIKVVPTV-----------------YTDVSGHTIQS--------NQFSVTEHFR 311
+ Y YFIKVV T Y D+ G TI S +Q+SVT H R
Sbjct: 296 AFNYMYFIKVVSTAYLPLGWEKSADGSSSGGYDDLLGTTIHSVNKGSIETHQYSVTSHKR 355
Query: 312 SSEQG---------RLQT---LPGVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGV 358
S + G R+ +PGVFF YD+SP+KV E +F FL +CA++GG
Sbjct: 356 SLQGGSDEKEGHKERIHARGGIPGVFFSYDISPMKVINREMREKTFSGFLVGLCAVIGGT 415
Query: 359 FTVSGIIDAFIYHGQRAIKK 378
TV+ +D +Y G IKK
Sbjct: 416 LTVAAAVDRALYEGVNKIKK 435
>gi|389602486|ref|XP_001567299.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322505471|emb|CAM42729.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 541
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 138/403 (34%), Positives = 223/403 (55%), Gaps = 33/403 (8%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +++ LD +PK + +D RT SGG+ ++V+ +V+L L E+R +L+ ++
Sbjct: 136 MRQLKRLDVFPKFDRKFEQDARHRTVSGGIFSVVAIVVILWLLVGEVRYFLSIEEHHEMF 195
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ-GNVIESR 119
VDT G +R+ +VTF +PC ++++DA+D+ G DV+ + K+R+D+ G VI +
Sbjct: 196 VDTEVGGDMRVTVNVTFNHVPCDLITLDAVDVFGVFANDVEDNTVKQRIDAATGQVISAA 255
Query: 120 QDGIGAPK-IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
+ + K I K + G E+ C SCYGAE S DCC+ CE+VR+AY +KGW L+
Sbjct: 256 RAVVDEKKVITKAIDADGVEKEN----CPSCYGAERSPGDCCHTCEDVRQAYAQKGWRLN 311
Query: 179 NPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
D+ ++QC + EGCN+Y ++ G+ F PG+ + G +HD++
Sbjct: 312 VDDISVEQCAEDRIKMATAAFGKEGCNLYATFAASRATGSLQFIPGRMYQMLGRRMHDLM 371
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRW-------TQETPSGMYQYFIKVVPTV 290
++SH ++ L FGE FPG NPLDG ++ +G + YF+KV+PT
Sbjct: 372 GSAARKLDLSHTVHTLEFGERFPGQQNPLDGTAQGSALSGDAKDAMNGRFSYFVKVIPTT 431
Query: 291 YTDVS-----GHTIQSNQFSVTEHFRSSEQGRL--------QTLPGVFFFYDLSPIKVTF 337
Y S T++SNQ++ T HF S + + +PGVF YDLSP+++
Sbjct: 432 YQRYSLITGLQDTVESNQYTATHHFTPSAATKAASQTPTMQEIVPGVFMTYDLSPVRILA 491
Query: 338 TEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
E H S +HF+ +CA+ GGV TV G++D+ +H R ++K
Sbjct: 492 QERHPYPSVIHFVLQLCAVCGGVLTVVGLVDSMCFHSVRKVRK 534
>gi|171696240|ref|XP_001913044.1| hypothetical protein [Podospora anserina S mat+]
gi|170948362|emb|CAP60526.1| unnamed protein product [Podospora anserina S mat+]
Length = 437
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 158/438 (36%), Positives = 225/438 (51%), Gaps = 70/438 (15%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGG++T+VS IV+ L + E + Y +L+VD
Sbjct: 2 AAKSRFTKLDAFTKTVEDARIRTTSGGIVTIVSLIVVFFLAWGEWQDYRRIEIHPELIVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL---DSQGNVIESR 119
RGE + I+ +V+FP +PC +L++D MD+SGEQ V+H + K RL G VIE++
Sbjct: 62 KGRGERMEIHLNVSFPRVPCELLTLDVMDVSGEQQHGVQHGVVKTRLRPLSEGGGVIEAK 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGW 175
+ A L+ N YCG CYGA + +CC C+EV+EAY + W
Sbjct: 122 ALALHA------RDEEAAHLDPN--YCGPCYGAAPPVHAQKPNCCQTCDEVKEAYAAQAW 173
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
A + I+QC+RE + +++ E+ EGC I G + VNKV GNFH APGKSF +HVHD
Sbjct: 174 AFGRGEGIEQCEREHYAEKLDEQRNEGCRIEGNVRVNKVIGNFHIAPGKSFSNGNMHVHD 233
Query: 236 ILAFQRDSF--NISHKINKLAFGEHFP-GV----------------VNPLDGVRWTQETP 276
+ + +H+I+ L FG P G+ VNPLD +
Sbjct: 234 LKNYWDTPVKHTFTHEIHHLRFGPQLPDGLAKKLGKNKALPWTNHHVNPLDNTHQETDDV 293
Query: 277 SGMYQYFIKVVPTVYTDVSGH-----------------------TIQSNQFSVTEHFRSS 313
+ + YFIK+VPT Y + +++++Q+SVT H RS
Sbjct: 294 NYNFMYFIKIVPTSYLPLGWEKTWQGFKDQHHKELGSFGQSADGSLETHQYSVTSHRRSL 353
Query: 314 EQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFT 360
G RL +PGVFF YD+SP+KV EE SFL FL +CAIVGG T
Sbjct: 354 SGGDDGSEGHKERLHAKGGIPGVFFSYDISPMKVINREERPKSFLGFLAGLCAIVGGTLT 413
Query: 361 VSGIIDAFIYHGQRAIKK 378
V+ +D ++ G +KK
Sbjct: 414 VAAAVDRALFEGGMKLKK 431
>gi|425772976|gb|EKV11354.1| COPII-coated vesicle membrane protein Erv46, putative [Penicillium
digitatum PHI26]
gi|425782132|gb|EKV20058.1| COPII-coated vesicle membrane protein Erv46, putative [Penicillium
digitatum Pd1]
Length = 438
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 157/439 (35%), Positives = 227/439 (51%), Gaps = 75/439 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGVIT+ S ++++ L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARIRTKSGGVITIASLLIVMWLVWGEWADYRRVVVLPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP LPC +L++D MD+SGEQ + V H + K RL + G VI+ +
Sbjct: 65 GERMEIHLNMTFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLSPRNEGGKVIDVQALD 124
Query: 123 IGAP-KIDKPLQRHGGRLEHNETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWAL 177
+ +P + K L + YCG C GA CC CEEVR+AY +K WA
Sbjct: 125 LHSPSEAAKHL---------DPEYCGECGGATPPPNVIKPGCCTTCEEVRQAYAEKQWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ I+QC REG+ +R+ E+ EGC I G L+VNKV GNFH APG+SF +HVHD+
Sbjct: 176 GDGSNIEQCTREGYAERLAEQRREGCRIEGVLKVNKVIGNFHIAPGRSFTTGNMHVHDLD 235
Query: 238 AF------QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGM 279
+ + +SH +++L FG P + NPLD + + P+
Sbjct: 236 TYIDPNAGPAEQHTMSHLVHELRFGPQLPAELAGRWGWTDHHHTNPLDDTKQETDEPAYN 295
Query: 280 YQYFIKVVPTVYTDV---------------------------SGHTIQSNQFSVTEHFRS 312
+ YF+KVV T Y + + +I+++Q+SVT H R
Sbjct: 296 FLYFVKVVSTSYLPLGWDPQFSTAIHNAYDKAPLGYHGLAYGTQGSIEAHQYSVTSHKRP 355
Query: 313 SEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 359
G R+ +PGVFF YD+SP+KV E +F +FLT VCAI+GG
Sbjct: 356 LSGGNDAAEGHKERVHAGGGIPGVFFNYDISPMKVVNREARPKTFTNFLTGVCAIIGGTL 415
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ +D +Y G +KK
Sbjct: 416 TVAAALDRGVYEGAMRVKK 434
>gi|189203047|ref|XP_001937859.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187984958|gb|EDU50446.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 437
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 156/442 (35%), Positives = 225/442 (50%), Gaps = 78/442 (17%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ ++ LDA+ K ED RT SGG++T+ S +V+ L + E Y +L+VD
Sbjct: 3 VKSRFNKLDAFTKTVEDARVRTTSGGIVTIASLLVIFWLSWGEWADYRRVTVRPELMVDK 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN---VIESRQ 120
RGE + I +V+FP +PC +L++D MD+SGE + V H I K RL + + VIE+
Sbjct: 63 GRGERMEIAMNVSFPRIPCELLTLDVMDVSGELQMGVTHGINKVRLSPEADGSKVIET-- 120
Query: 121 DGIGAPKIDKPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGW 175
K L H H YCG CYGA + +CCN C+EVR+AY W
Sbjct: 121 ---------KALDLHADEASHLAPDYCGQCYGAPPPTNAKKPNCCNTCDEVRDAYASISW 171
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
+ + ++QC+RE + + + ++ EGC + G ++VNKV GNFHFAPGKSF +HVHD
Sbjct: 172 SFGRGEGVEQCEREHYAEHLDQQRQEGCRLEGSIKVNKVVGNFHFAPGKSFSNGNLHVHD 231
Query: 236 ILAFQRDSF--NISHKINKLAFGEHFPGV---------------------VNPLDGVRWT 272
+ + +D + +H+I++L FG V VNPLD
Sbjct: 232 LENYFKDDYAHTFTHRIHQLRFGPQLSDVVVRDMQKKHLDSGHNGWSNHHVNPLDNTVQH 291
Query: 273 QETPSGMYQYFIKVV---------------PTVYTDVSGHT--------IQSNQFSVTEH 309
+ + Y YFIKVV P+ Y+D+ G T I+++Q+SVT H
Sbjct: 292 TDEKAYNYMYFIKVVSTAYLPLGWEQEFPHPSKYSDILGTTIDESYKGSIETHQYSVTSH 351
Query: 310 FRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVG 356
RS + G R+ +PGVFF YD+SP+KV E SF FL +CA++G
Sbjct: 352 KRSLQGGTDEKDGHKERIHARGGIPGVFFSYDISPMKVVNREVREKSFSGFLVGLCAVIG 411
Query: 357 GVFTVSGIIDAFIYHGQRAIKK 378
G TV+ ID +Y G IKK
Sbjct: 412 GTLTVAAAIDRALYEGVNRIKK 433
>gi|146095510|ref|XP_001467598.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398020411|ref|XP_003863369.1| hypothetical protein, conserved [Leishmania donovani]
gi|134071963|emb|CAM70660.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322501601|emb|CBZ36681.1| hypothetical protein, conserved [Leishmania donovani]
Length = 467
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 136/403 (33%), Positives = 224/403 (55%), Gaps = 33/403 (8%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +++ LD +PK + +D RT SGGV+++V+ +V++ L E+R +L+ ++
Sbjct: 62 MRQLKRLDVFPKFDRKFEQDARHRTVSGGVLSVVAIVVIIWLLVGEVRYFLSVEEHQEMF 121
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ-GNVIESR 119
VDT G +++ +VTF +PC ++++DA+DI G DV+ + K+R+D+ G VI +
Sbjct: 122 VDTKVGGDMQVTVNVTFNHVPCDLITLDAVDIFGVFANDVEGNTVKQRIDAATGQVISAA 181
Query: 120 QDGIGAPKI-DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
+ + K+ K + G E+ C SCYGAE + DCC+ CE+VR+AY ++GW L
Sbjct: 182 RAMVDEKKVMTKAIDADGAEKEN----CPSCYGAERNPGDCCHTCEDVRQAYARRGWKLD 237
Query: 179 NPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
++ ++QC + EGCN+Y ++ G+ F PG+ + G +HD++
Sbjct: 238 IDEISVEQCAEDRIKMAAAASGKEGCNLYATFAASRATGSLQFIPGRIYETLGRRMHDLM 297
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRW-------TQETPSGMYQYFIKVVPTV 290
++SH ++ L FG+ FPG NPLDG ++ +G + YF+K+VPT
Sbjct: 298 GSTTRKLDLSHTVHTLEFGDPFPGQQNPLDGTAQGSALSGDAKDAMNGRFSYFVKLVPTT 357
Query: 291 YTDVSGHT-----IQSNQFSVTEHFRSSEQGRL--------QTLPGVFFFYDLSPIKVTF 337
Y S T ++SNQ+S T HF SE + + +PGVF YDLSP+++
Sbjct: 358 YQRYSLITGLQDAVESNQYSATHHFTPSEAAKAVSQTPKKQEIVPGVFMTYDLSPVRILV 417
Query: 338 TEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
E H S +HF+ +CA+ GGV TV G++D+ +H R I+K
Sbjct: 418 QERHPYPSLVHFVLQLCAVCGGVLTVVGLVDSMCFHSVRKIRK 460
>gi|440473660|gb|ELQ42442.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Magnaporthe oryzae Y34]
gi|440486294|gb|ELQ66175.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Magnaporthe oryzae P131]
Length = 444
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 163/445 (36%), Positives = 226/445 (50%), Gaps = 77/445 (17%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGG+IT+VS IV+L L + E Y +L+VD
Sbjct: 2 APKSRFTRLDAFTKTVEDARIRTTSGGIITIVSLIVVLYLAWGEWADYRRIDIHPELIVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESR 119
SRG+ + I+ ++TFP +PC +L++D MD+SGEQ V+H + K RL Q G VI+++
Sbjct: 62 KSRGDRMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVIKVRLRPQSEGGGVIDAK 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGW 175
+ A L+ N YCG CYGA + CCN C+EVREAY + W
Sbjct: 122 TLALHAE------DEAATHLDPN--YCGGCYGAPAPANAKKAGCCNTCDEVREAYAQASW 173
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
A + ++QC RE + +R+ E+ EGC I G L VNKV GNFH APG+SF +HVHD
Sbjct: 174 AFGRGENVEQCTREHYAERLDEQRHEGCQIAGSLRVNKVVGNFHLAPGRSFSNGNMHVHD 233
Query: 236 ILAFQ----RDSFNISHKINKLAFGEHFPGV------------------VNPLDGVRWTQ 273
+ + + SH I+ L FG P +NPLDGV T
Sbjct: 234 LKNYWDTPVEGGHSFSHTIHSLRFGPQLPPSALEKLGNKDKNMPWTNHHINPLDGVIQTT 293
Query: 274 ETPSGMYQYFIKVVPTVYTDVSGH----------------------TIQSNQFSVTEHFR 311
P+ Y YF+K+VPT Y + +++++Q+SVT H R
Sbjct: 294 VDPNFNYMYFVKIVPTSYLPLGWEKRTHLATMHDHGVGTYGYSGDGSVETHQYSVTSHKR 353
Query: 312 SSEQG---------RLQT---LPGVFFFY-----DLSPIKVTFTEEHV-SFLHFLTNVCA 353
S G R+ + +PGVFF Y D+SP+KV E +F FLT +CA
Sbjct: 354 SLAGGDDGEDGHKERMHSRGGIPGVFFSYPFCPQDISPMKVINREVRTKTFAGFLTGLCA 413
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKK 378
I+GG TV+ ID + G IKK
Sbjct: 414 ILGGTLTVAAAIDRMTFEGVTRIKK 438
>gi|325189930|emb|CCA24410.1| hypothetical protein BRAFLDRAFT_63528 [Albugo laibachii Nc14]
Length = 699
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 138/370 (37%), Positives = 214/370 (57%), Gaps = 8/370 (2%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K+R++D PK ++F +T +GG+++L+S ++ L SEL YL+ K+LVD S
Sbjct: 320 LGKLRNVDFNPKTLDEFKVKTINGGILSLLSIGLIGYLLVSELIFYLSVDIVDKMLVDGS 379
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
R + INFDV FP +PCSI+++++ SGE H D++H + K+ +D G ++ + G+
Sbjct: 380 RNRMVTINFDVEFPRMPCSIVTLESTGSSGEIHHDIQHSVHKQAIDLNGKILSA---GMK 436
Query: 125 APKIDKPLQRHGGRLEHNETY---CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
I K + +T CGSCYGA +S E CCN CE+V++AY + W + +
Sbjct: 437 LDSIGKAWTNQSDTVAEEKTVKVECGSCYGAGASGE-CCNTCEDVQQAYASRRWNIPSLH 495
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
I+QC++ + + EGC IYG + V KV G FAP K+ + +IL
Sbjct: 496 TIEQCQKSEIEKLLHSTVEEGCRIYGSIAVTKVHGKVLFAPAKALLSGYISTEEILDKTI 555
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
F+ SHKIN L FGE +P + +PL+G + G YQYF++VVPT Y ++G I
Sbjct: 556 KIFDTSHKINYLDFGERYPEMKSPLNGHNTILPKGTRGTYQYFLQVVPTAYYYLNGGIID 615
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+NQ+SVT+H++ Q LP + F Y SPI + +L FLT++CAI+GGVFT
Sbjct: 616 TNQYSVTQHYQELTPLGEQQLPMITFQYKFSPIMFQIEQRRRGYLQFLTSLCAILGGVFT 675
Query: 361 VSGIIDAFIY 370
+ G +D+ ++
Sbjct: 676 MVGAVDSILF 685
>gi|157873507|ref|XP_001685262.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128333|emb|CAJ08503.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 467
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 134/403 (33%), Positives = 224/403 (55%), Gaps = 33/403 (8%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +++ LD +PK + +D RT SGGV+++V+ ++++ L E+R +L+ ++
Sbjct: 62 MRQLKRLDVFPKFDRKFEQDARHRTVSGGVLSVVAIVIIIWLLVGEVRYFLSVEEHQEMF 121
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ-GNVIESR 119
VDT G +++ ++TF +PC ++++DA+DI G DV+ + K+R+D+ G VI +
Sbjct: 122 VDTKVGGDMQVTVNITFNHVPCDLITLDAVDIFGVFANDVEGNTVKQRIDAATGQVISAA 181
Query: 120 QDGIGAPKI-DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
+ + K+ K + G E+ C SCYGAE + DCC+ CE+VR+AY ++GW L
Sbjct: 182 RAMVDEKKVMTKAIDADGAEKEN----CPSCYGAERNPGDCCHTCEDVRQAYARRGWKLD 237
Query: 179 NPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
++ ++QC + EGCN+Y ++ G+ F PG+ + G +HD++
Sbjct: 238 IDEISVEQCAEDRINMAAAASGKEGCNLYATFAASRATGSLQFIPGRIYETLGRRMHDLM 297
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRW-------TQETPSGMYQYFIKVVPTV 290
++SH ++ L FG+ FPG NPLDG ++ +G + YF+K+VPT
Sbjct: 298 GSTTRKLDLSHTVHTLEFGDPFPGQQNPLDGTAQGSALSGDAKDAMNGRFSYFVKLVPTT 357
Query: 291 YTDVSGHT-----IQSNQFSVTEHFRSSEQGRL--------QTLPGVFFFYDLSPIKVTF 337
Y S T ++SNQ+S T HF SE + + +PGVF YDLSP+++
Sbjct: 358 YQRYSLITGLQDVVESNQYSATHHFTPSEAAKAASQAPKKQEIVPGVFMTYDLSPVRILV 417
Query: 338 TEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
E H S HF+ +CA+ GGV TV+G++D+ +H R I+K
Sbjct: 418 QERHPYPSLAHFVLQLCAVCGGVLTVAGLVDSLCFHSARKIRK 460
>gi|392566201|gb|EIW59377.1| endoplasmic reticulum-derived transport vesicle ERV46 [Trametes
versicolor FP-101664 SS1]
Length = 423
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 142/412 (34%), Positives = 222/412 (53%), Gaps = 49/412 (11%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ +DA+ K ED +T +G ++TL+++ ++ E Y +T ++VD S
Sbjct: 6 LSALKGVDAFGKTMEDVKVKTRTGALLTLIAAAIITSFTTIEFFDYRRVNVDTSIVVDRS 65
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG-----NVIESR 119
RGE L +N +VTFP +PC +LS+D MDISGE D+ H+I K R+D +G VI
Sbjct: 66 RGEKLTVNMNVTFPRVPCYLLSLDVMDISGETQSDITHNILKTRMDERGFPVPTTVITEL 125
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
Q+ + + G E G CCN CE+VR+AY +GW+ +
Sbjct: 126 QNDLDKINSQREGGYCGSCYGGVEPEGG-----------CCNTCEDVRQAYVNRGWSFNR 174
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
PD I+QC +EG+ +++KE+ EGCNI G + VNKV GN H +PG+SF S +++++ +
Sbjct: 175 PDSIEQCVQEGWSEKLKEQATEGCNIAGRVRVNKVVGNIHLSPGRSFRTSSHSLYELVPY 234
Query: 240 QRDSFN---ISHKINKLAF-----------------GEHFPGVVNPLDGVRWTQETPSGM 279
+ N +H I+ LAF + NPLDG M
Sbjct: 235 LKTDGNRHDFTHTIHHLAFEGDDEWDLAKAKLGKELKQRLGIAANPLDGTTGRTIKQQYM 294
Query: 280 YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-------------LPGVFF 326
+QYF+KVV T + +SG TI ++Q+S T R ++G + +PG FF
Sbjct: 295 FQYFLKVVATQFRTLSGKTINTHQYSATHFERDLDKGSQENTPTGVHVAHGNGGIPGAFF 354
Query: 327 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
Y++SP+++ E SF HFLT+ CAIVGGV TV+ +ID+ ++ ++A+KK
Sbjct: 355 NYEISPLRIVHAETRQSFAHFLTSTCAIVGGVLTVASLIDSALFATRKALKK 406
>gi|345319994|ref|XP_001507420.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Ornithorhynchus anatinus]
Length = 203
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 121/204 (59%), Positives = 150/204 (73%), Gaps = 3/204 (1%)
Query: 159 CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 218
CCN CE+VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNF
Sbjct: 1 CCNTCEDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNF 60
Query: 219 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG 278
HFAPGKSF QS VH + L N++H I L+FGE +PG+VNPLDG + S
Sbjct: 61 HFAPGKSFQQSHVHGKERLRIHPRPINMTHYIEHLSFGEDYPGIVNPLDGTDVSAPQASM 120
Query: 279 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVT 336
M+QYF+KVVPTVY G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V
Sbjct: 121 MFQYFVKVVPTVYVKADGEVVRTNQFSVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVK 179
Query: 337 FTEEHVSFLHFLTNVCAIVGGVFT 360
TE+H SF HFLT VCAI+GGVFT
Sbjct: 180 LTEKHRSFTHFLTGVCAIIGGVFT 203
>gi|115388503|ref|XP_001211757.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195841|gb|EAU37541.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 438
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 160/438 (36%), Positives = 230/438 (52%), Gaps = 73/438 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG+IT+ S +++L L + E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFAKTVEDARIRTTSGGIITIASLLIILWLVWGEWVDYRRVVVMPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP LPC +L++D MD+SGEQ + V H I K RL S G+V++ +
Sbjct: 65 GEKMEIHLNITFPRLPCELLTLDVMDVSGEQQVGVAHGINKVRLASPAEGGHVLDVQALE 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYG---AESSDEDCCNNCEEVREAYRKKGWALSN 179
+ + ++ + +H L+ N YCG C G + CCN CEEVREAY + WA
Sbjct: 125 LHS---EQEVAKH---LDPN--YCGECGGIPQQPGEPKRCCNTCEEVREAYAEHQWAFGK 176
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+ I+QC+REG+ RI + EGC + G L VNKV GNFH APG+SF +HVHD+ +
Sbjct: 177 GENIEQCEREGYAARIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFSSGNIHVHDLENY 236
Query: 240 ------QRDSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMYQ 281
+ ++H I++L FG P + NPLD + + Y
Sbjct: 237 FELDQPASEKHTMTHHIHQLRFGPQLPDELSDRWQWTDHHHTNPLDDTVQETDLAAFNYM 296
Query: 282 YFIKVVPTVYTDVS--------------------------GH--TIQSNQFSVTEHFR-- 311
YF+KVV T Y + GH +I+++Q+SVT H R
Sbjct: 297 YFVKVVSTAYLPLGWDPRVSSYIHSASSHNVPLGRHGIGYGHDGSIETHQYSVTSHKRPL 356
Query: 312 ----SSEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFT 360
++++G + L PGVFF YD+SP+KV E +F FLT VCAI+GG T
Sbjct: 357 MGGNAADEGHKERLHAAAGIPGVFFNYDISPMKVINREARPKTFTGFLTGVCAIIGGTLT 416
Query: 361 VSGIIDAFIYHGQRAIKK 378
V+ ID +Y G +KK
Sbjct: 417 VAAAIDRGLYEGAIRVKK 434
>gi|346324387|gb|EGX93984.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Cordyceps militaris CM01]
Length = 423
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 156/422 (36%), Positives = 227/422 (53%), Gaps = 58/422 (13%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGGV+T+VS +V+L L + E Y V +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARIRTTSGGVVTIVSLVVVLFLAWGEWASYRTVVIRPELVVDQGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP +PC +L++D MD+SGEQ V H + K RL R +G G
Sbjct: 65 GERMDIHLNITFPRMPCELLTLDVMDVSGEQQHGVAHGVHKVRL---------RPEGEGG 115
Query: 126 PKID-KPLQRHGGRLEH-NETYCGSCYGAES----SDEDCCNNCEEVREAYRKKGWALSN 179
ID L H EH + +YCG C GA + + CCN CEE+REAY + WA +
Sbjct: 116 GVIDVSSLNLHNDAAEHLDPSYCGDCGGAPAPTTVTKAGCCNTCEEIREAYAQVSWAFGD 175
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+QC+RE + +R++E+ EGC I G L+VNKV GNFH APG+SF +HVHD+ +
Sbjct: 176 GKAFEQCEREHYAERLEEQRHEGCRIDGLLQVNKVVGNFHLAPGRSFSNGNMHVHDLKNY 235
Query: 240 QRDS----FNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPSGM 279
+ + +H I+ L FG P V NPLD + P+
Sbjct: 236 WETTDDKKHDFTHHIHHLRFGPQLPETVVQKLGKGATPWTNHHGNPLDSTKQLTNDPNFN 295
Query: 280 YQYFIKVVPTVYTDVSGH----------TIQSNQFSVTEHFRS------SEQGRLQTL-- 321
+ YF+K+VPT + + +++++Q+SVT H RS S +G + L
Sbjct: 296 FMYFVKIVPTSFLPLGWEKMARTMNVDASVETHQYSVTSHKRSLTGGDDSAEGHAERLHS 355
Query: 322 ----PGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
PGVFF YD+SP+KV EE SFL F+ +CA+VGG TV+ +D ++ G +
Sbjct: 356 RGGIPGVFFSYDISPMKVINREEKGKSFLGFVAGLCAVVGGTLTVAAAVDRGLFEGTTRL 415
Query: 377 KK 378
KK
Sbjct: 416 KK 417
>gi|194689880|gb|ACF79024.1| unknown [Zea mays]
gi|413949702|gb|AFW82351.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 176
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 112/174 (64%), Positives = 147/174 (84%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA +++++ LDAYPK+NEDFY RT SGG++TLV+++VMLLLF SE R Y + TETKL+
Sbjct: 1 MDAFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRGE LR+NFD+TFP++PC++LSVD DISGEQH D++HDI K+RL+S GNVIE+R+
Sbjct: 61 VDTSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVIEARK 120
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 174
+GIG K+++PLQ+HGGRL+ E YCG+CYGAE SDE CCN+CEE + R+KG
Sbjct: 121 EGIGGAKVERPLQKHGGRLDKGEQYCGTCYGAEESDEQCCNSCEESGKHIRRKG 174
>gi|170586880|ref|XP_001898207.1| Serologically defined breast cancer antigen NY-BR-84 homolog,
putative [Brugia malayi]
gi|158594602|gb|EDP33186.1| Serologically defined breast cancer antigen NY-BR-84 homolog,
putative [Brugia malayi]
Length = 341
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 142/354 (40%), Positives = 209/354 (59%), Gaps = 20/354 (5%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ +++ DAY K +DF RTF+GG +TLVSS V++ +F SE +L+ +L VD
Sbjct: 2 SLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYVD 61
Query: 63 TSRGET-LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL--DSQGNVIESR 119
++ E + +NFD+TFP LPCS++++D MD+SG+ D+K D++K L +GN I R
Sbjct: 62 STPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIKDDVYKISLLNGKEGNGI--R 119
Query: 120 QD-GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
Q I + + ++ CGSCYGA+ + CCN CEEV+EAY KKGW L
Sbjct: 120 QGVNINTTTV--------SSVPASQILCGSCYGAK---DGCCNTCEEVKEAYIKKGWELV 168
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
N + ++QCK + +++++ E + EGC +YG ++V KVAGNFH APG H HD+ +
Sbjct: 169 NIETVEQCKSDLWVKKMNEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLKAHRSHFHDLHS 228
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG-MYQYFIKVVPTVYTDV-SG 296
F+ SH +N L+FG FPG V PLDG + SG MYQY +K+VPT Y + S
Sbjct: 229 LSPSKFDTSHTVNHLSFGNSFPGKVYPLDGKFFGSAKDSGIMYQYHLKLVPTSYVFLDST 288
Query: 297 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 350
I S+ FSVT + + QG LPG F Y+ SP+ V + E + + N
Sbjct: 289 RNIFSHLFSVTTYQKDISQGA-SGLPGFFIQYEFSPLMVKYEERRQYVVTIILN 341
>gi|401888400|gb|EJT52358.1| ER to golgi family transport-related protein [Trichosporon asahii
var. asahii CBS 2479]
gi|406696432|gb|EKC99721.1| ER to transport-related protein [Trichosporon asahii var. asahii
CBS 8904]
Length = 378
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/377 (37%), Positives = 203/377 (53%), Gaps = 49/377 (12%)
Query: 50 YLNAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL 109
Y E ++VD SRGE L I+ D+TFP +PC +LS+D MDISGE+ D+ HD+ K RL
Sbjct: 7 YRRVTLEPTIIVDRSRGEKLEIDLDITFPRVPCFLLSLDVMDISGERQNDITHDMAKHRL 66
Query: 110 DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREA 169
+ G +E + G + ++ Q + YCGSCYGA++ + CCN+C++VR+A
Sbjct: 67 SASGEELEVTRSGQLKGEAERAAQ------NRDPNYCGSCYGAQAPESGCCNSCDDVRKA 120
Query: 170 YRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS 229
Y + GW NP I+QC E + + + ++ EGC I G ++VNKV GN F G F +
Sbjct: 121 YSESGWQFPNPSTIEQCVEENWAENMAQQNTEGCRIVGQVKVNKVVGNLQFTHGNVFTRG 180
Query: 230 GVHVHDILAFQRDS---FNISHKINKLAFGEHFP--------------------GVVNPL 266
D+L + RD + H INK F P G+ +PL
Sbjct: 181 HT---DLLPYLRDGNVHHDFGHIINKFRFTGEMPGQLYHRSQIQKKEDETRKELGIHDPL 237
Query: 267 DGVRWTQETPSG--MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT---- 320
GVR E MYQYF+KVV T + ++G I +NQ+S TE+ R + G L T
Sbjct: 238 QGVRSHAENDGSNIMYQYFVKVVSTAFVYLNGQNINTNQYSATEYERDLKHGNLPTKDQH 297
Query: 321 ----------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
+PGVF Y++SP+KV TE SF HF+T+ CAIVGGV TV+ +IDA I+
Sbjct: 298 GHVTTHYTNAIPGVFINYEISPMKVVHTETRQSFAHFVTSTCAIVGGVLTVASLIDAAIF 357
Query: 371 HG-QRAIKKKIEIGKFS 386
+ +R + +K G S
Sbjct: 358 NSRKRLMGEKESYGALS 374
>gi|387219467|gb|AFJ69442.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Nannochloropsis gaditana CCMP526]
Length = 432
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/389 (36%), Positives = 218/389 (56%), Gaps = 32/389 (8%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSEL-RLYLNAVTETKLLVDTSRG 66
+ +D + K +++ +T G + L S +++L+L SE +L + T+ L+VDTS G
Sbjct: 34 LERMDVFTKFHDEDKIQTSRGASMALFSWVLVLVLLCSEAYEAFLTSRTKEHLVVDTSLG 93
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ L I D+TF AL C+ + VDAMD++G+ + V+H++ K+RL SQG + IG P
Sbjct: 94 DKLNITLDMTFHALTCADVHVDAMDVAGDNQMQVEHNMLKQRLSSQG-------ERIGFP 146
Query: 127 KIDKPLQRHGGRLEH-----NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
++ P + + YCGSC+ A + CCN+C+++ +AY +G +
Sbjct: 147 FLEDPTDFDSKKADALLGAAPWDYCGSCFQARTHTGACCNSCQDLEQAYLTQGLPMGKIK 206
Query: 182 LIDQCKREGFLQRIKE---EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
GF ++GEGCN+ GF+ VNKVAGNFH A G S + G H+H +
Sbjct: 207 TTAPQCLPGFQAPAPSGPMQKGEGCNLKGFMSVNKVAGNFHIAFGDSVVKDGRHIHQFIP 266
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDG-VRWTQET-PSGMYQYFIKVVPTVYTDVSG 296
+ FN+SH I ++FG+ +PG VNPLDG V++ T +G++QYFIKV+PT Y +G
Sbjct: 267 SEAPFFNVSHTIQHVSFGDEYPGRVNPLDGKVKYVSSTVGTGLFQYFIKVIPTHYKGRAG 326
Query: 297 HTIQSNQFSVTEHFRS--------------SEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 342
I++N+ SVTE F+ + + LPGVFF YDLSP V + V
Sbjct: 327 EAIRTNRISVTERFKPLHKEGEARLTGDSHAHNDQTSVLPGVFFIYDLSPFNVEVSTVSV 386
Query: 343 SFLHFLTNVCAIVGGVFTVSGIIDAFIYH 371
F HFL +CAI GGVF++S ++D Y+
Sbjct: 387 PFSHFLVKLCAIAGGVFSISRLLDNVFYY 415
>gi|400602673|gb|EJP70275.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Beauveria bassiana ARSEF 2860]
Length = 423
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 155/425 (36%), Positives = 227/425 (53%), Gaps = 58/425 (13%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ++ RT SGGV+T+VS +V+L L + E Y +L+VD
Sbjct: 2 AAKSRFTRLDAFTKTVDEARIRTTSGGVVTIVSLLVVLFLVWGEWADYRTIAIRPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RGE + I+ ++TFP +PC +L++D MD+SGEQ V H + K RL R +
Sbjct: 62 QGRGERMDIHLNITFPRMPCELLTLDVMDVSGEQQHGVAHGVHKVRL---------RPEA 112
Query: 123 IGAPKID-KPLQRHGGRLEH-NETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWA 176
G ID L H EH + +YCG C GA + CCN CEE+REAY + WA
Sbjct: 113 EGGGVIDVSSLDLHNDAAEHLDPSYCGDCGGAPAPSNVKKAGCCNTCEEIREAYAQVSWA 172
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+ +QC+RE + +R++E+ EGC I G L+VNKV GNFH APG+SF +HVHD+
Sbjct: 173 FGDGKAFEQCEREHYAERLEEQRHEGCRIDGLLQVNKVVGNFHLAPGRSFSNGNMHVHDL 232
Query: 237 LAFQRDS----FNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETP 276
+ + + +H I+ L FG P V NPLD + + P
Sbjct: 233 KNYWETTDDKKHDFTHYIHHLRFGPQLPEAVVKKMGKGATPWTNHHANPLDNTKQLTDDP 292
Query: 277 SGMYQYFIKVVPTVYTDV----------SGHTIQSNQFSVTEHFRSSEQG---------R 317
+ + YF+K+VPT + + + +++++Q+SVT H RS G R
Sbjct: 293 NYNFMYFVKIVPTSFLPLGWEKMSRAMNTDGSVETHQYSVTSHKRSLTGGDDAAEGHAER 352
Query: 318 LQT---LPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 373
L + +PGVFF YD+SP+KV EE SFL F+ +CA+VGG TV+ +D ++ G
Sbjct: 353 LHSRGGIPGVFFSYDISPMKVINREEQGKSFLGFIAGLCAVVGGTLTVAAAVDRGLFEGT 412
Query: 374 RAIKK 378
+KK
Sbjct: 413 TRLKK 417
>gi|402590490|gb|EJW84420.1| hypothetical protein WUBG_04668 [Wuchereria bancrofti]
Length = 341
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/353 (39%), Positives = 206/353 (58%), Gaps = 18/353 (5%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ +++ DAY K +DF RTF+GG +TLVSS V++ +F SE +L+ +L VD
Sbjct: 2 SLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYVD 61
Query: 63 TSRGET-LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL--DSQGNVIESR 119
++ E + +NFD+TFP LPCS++++D MD+SG+ D+K D++K L +GN I
Sbjct: 62 STPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIKDDVYKISLLNGKEGNGIRQG 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
+ P ++ CGSCYGA+ + CCN CEEV+EAY KKGW L N
Sbjct: 122 VNINTTTVSSAPA---------SQILCGSCYGAK---DGCCNTCEEVKEAYIKKGWELVN 169
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+ ++QCK + +++++ E + EGC +YG ++V KVAGNFH APG H HD+ +
Sbjct: 170 IETVEQCKSDLWVKKMNEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLKAHRSHFHDLHSL 229
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG-MYQYFIKVVPTVYTDV-SGH 297
F+ SH +N L+FG FPG V PLDG + SG MYQY +K+VPT Y + S
Sbjct: 230 SPSKFDTSHTVNHLSFGNSFPGKVYPLDGKFFGSAKDSGIMYQYHLKLVPTSYVFLDSTR 289
Query: 298 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 350
I S+ FSVT + + QG LPG F Y+ SP+ V + E + + N
Sbjct: 290 NIFSHLFSVTTYQKDISQGA-SGLPGFFIQYEFSPLMVKYEERRQYVVTIILN 341
>gi|440636941|gb|ELR06860.1| hypothetical protein GMDG_08151 [Geomyces destructans 20631-21]
Length = 441
Score = 258 bits (658), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 158/439 (35%), Positives = 227/439 (51%), Gaps = 74/439 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG++T+ S ++++ L F E Y V +L+VD SR
Sbjct: 5 SRFTRLDAFTKTVDEARIRTTSGGIVTIASLLIVIYLAFGEWADYRRIVVHPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I ++TFP +PC +L++D MD+SGE VKH + K RL+S D G
Sbjct: 65 GEKMEIWMNITFPYVPCELLTLDVMDVSGEMQTGVKHGVSKVRLNSP--------DAGGG 116
Query: 126 PKIDKPLQRHGG--RLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALS 178
K L H + H + +YCG CYGA + CCN C+EVR+AY WA
Sbjct: 117 AIDVKALDLHSTEEKAAHLDPSYCGQCYGATPPPNAQKAGCCNTCDEVRDAYASASWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ ++QC+RE + +R+ E+ EGC I G + VNKV GNFH APG+S+ +HVHD+
Sbjct: 177 RGENVEQCEREHYSERLDEQRKEGCRIEGGVRVNKVIGNFHIAPGRSYSNGNMHVHDLAN 236
Query: 239 FQ-----RDSFNISHKINKLAFGEHFP-GV---------------VNPLDGVRWTQETPS 277
+ + +H I+ + FG P G+ +NPLDG + P+
Sbjct: 237 YWDTPSLERGHSFAHTIHHVRFGPQLPEGLSKKFGGKNQPWTNHHLNPLDGTQQHTRDPA 296
Query: 278 GMYQYFIKVVPTVY------------TDVS---------GH----TIQSNQFSVTEHFRS 312
Y YF+KVV T Y T +S GH +++++Q+SVT H RS
Sbjct: 297 FNYMYFVKVVSTSYLPLGWNSKSAAKTQISEENIGLGAYGHAVDGSVETHQYSVTSHKRS 356
Query: 313 SEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVF 359
G RL + +PGVFF YD+SP+KV EE L F+T +CAIVGG
Sbjct: 357 LSGGDDGAEGHKERLHSRTGIPGVFFSYDISPMKVINREERTKTLSGFITGLCAIVGGTL 416
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ +D +Y G IKK
Sbjct: 417 TVAAAVDRGLYEGVSRIKK 435
>gi|325191973|emb|CCA26442.1| endoplasmic reticulumGolgi intermediate compartment protein
putative [Albugo laibachii Nc14]
Length = 401
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 126/366 (34%), Positives = 217/366 (59%), Gaps = 17/366 (4%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ D YPK++ +F +T +G +++++++++ L+LF +ELR Y++ ++VD++ E
Sbjct: 33 LKRFDVYPKLHTEFKVQTETGAIVSIITAVIALILFLAELREYMSVRMHEHMVVDSTISE 92
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN D+++ AL C + AMD++GE +D+ I RLD++GN I +
Sbjct: 93 KLRINIDISYLALTCKESYLTAMDVTGELQMDLHRSIGMTRLDAKGNPINT--------- 143
Query: 128 IDKPLQRHGGRLEHNETYCGSCY-GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+D + L N YCGSCY + CCN C+EV+EA+ L + D +QC
Sbjct: 144 LDSAKEE---VLPAN--YCGSCYETVHPLGKTCCNTCDEVKEAFVANDLRLFDADQKEQC 198
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
RE ++ + + GEGC + G++ VN+VAGNFH G++FH+ G +H L Q FN
Sbjct: 199 VREMTEEQRQAQAGEGCRLKGYMMVNRVAGNFHVGLGRTFHRKGKLIHQFLPGQESVFNA 258
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
S ++ L+FG + V N LDG ++ + G+ +YF+K+VPT+Y+D+S ++ S Q+S
Sbjct: 259 SFLLHSLSFGTPYANVKNGLDGTQYITKKKGGVMKYFLKIVPTIYSDISS-SVHSYQYSH 317
Query: 307 TEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
T+ + + G++ LPG +F ++ SP V E + F HF+ + AI+GG+ +++G +
Sbjct: 318 TKQEKYMNAMGQISGLPGAYFMFEFSPFMVKIDSEQIPFTHFVIRIFAILGGMISIAGFV 377
Query: 366 DAFIYH 371
D+ I+H
Sbjct: 378 DSVIFH 383
>gi|342874382|gb|EGU76396.1| hypothetical protein FOXB_13074 [Fusarium oxysporum Fo5176]
Length = 439
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 156/438 (35%), Positives = 224/438 (51%), Gaps = 74/438 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG++T+VS +V+L L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVLFLSWGEWAEYRRIEIHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP +PC +L++D MD+SGEQ V H + K RL G
Sbjct: 65 GERMEIHLNITFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRLQPANQ---------GG 115
Query: 126 PKID-KPLQRHGGRLEH-NETYCGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSN 179
ID K L H +H + +YCG CYGA+ + CC C+EVREAY + WA
Sbjct: 116 AVIDIKSLALHDESADHLDPSYCGGCYGAQPPANARKAGCCQTCDEVREAYAQSSWAFGR 175
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+ ++QC+RE + +++ + EGC I G L VNKV GNFHFAPG+SF +HVHD+ +
Sbjct: 176 GEGVEQCEREHYGEKLDAQREEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLKNY 235
Query: 240 ----QRDSFNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPSGM 279
+ S + +H I+ L FG P + NPLD R P+
Sbjct: 236 WDVPKGKSHDFTHYIHSLRFGPQLPDNIAKKVGTKSSLWTNHHQNPLDNTRQEIHDPNFN 295
Query: 280 YQYFIKVVPTVY-----------------TDVSG---------HTIQSNQFSVTEHFRSS 313
+ YF+K+VPT Y D +G +++++Q+SVT H RS
Sbjct: 296 FMYFVKIVPTSYLPLGWDSKGIKIAGLLQDDNAGLGAYGYSEDGSVETHQYSVTSHKRSL 355
Query: 314 EQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFT 360
G R T +PGVFF YD+SP+KV EE +F FL +CAIVGG T
Sbjct: 356 AGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKAKTFSGFLAGLCAIVGGTLT 415
Query: 361 VSGIIDAFIYHGQRAIKK 378
V+ +D ++ G IKK
Sbjct: 416 VAAAVDRGLFEGAARIKK 433
>gi|408400673|gb|EKJ79750.1| hypothetical protein FPSE_00030 [Fusarium pseudograminearum CS3096]
Length = 439
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 158/440 (35%), Positives = 226/440 (51%), Gaps = 78/440 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGGV+T+VS +V+L L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARIRTTSGGVVTIVSLLVVLFLSWGEWADYRRIDIHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL--DSQGNVIESRQDGI 123
GE + I+ ++TFP +PC +LS+D MD+SGEQ V H + K RL +SQG +
Sbjct: 65 GERMEIHLNITFPKMPCELLSLDVMDVSGEQQHGVMHGVNKVRLQPESQGGAV------- 117
Query: 124 GAPKID-KPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWAL 177
ID K L H H + +YCG CYGA + CC C+EVREAY + WA
Sbjct: 118 ----IDTKSLSLHDDAAHHLDPSYCGGCYGATPPANAQKAGCCQTCDEVREAYAQASWAF 173
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ ++QC+RE + +++ + EGC I G L VNKV GNFHFAPG+SF +HVHD+
Sbjct: 174 GRGEGVEQCEREHYGEKLDAQRSEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLK 233
Query: 238 AF----QRDSFNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPS 277
+ + S + +H ++ L FG P + NPLD R P+
Sbjct: 234 NYWDVPKGFSHDFTHIVHSLRFGPQLPDHIARKVGHKNTLWTNHHQNPLDDTRQETHDPN 293
Query: 278 GMYQYFIKVVPTVY-----------------TDVSG---------HTIQSNQFSVTEHFR 311
+ YF+K+VPT Y D +G +++++Q+SVT H R
Sbjct: 294 YNFMYFVKIVPTSYLPLGWDKKGIKIAGLLQEDNAGLGAYGYGEDGSVETHQYSVTSHRR 353
Query: 312 SSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGV 358
S G R T +PGVFF YD+SP+KV EE +F FL +CAIVGG
Sbjct: 354 SLAGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKAKTFSGFLAGLCAIVGGT 413
Query: 359 FTVSGIIDAFIYHGQRAIKK 378
TV+ +D ++ G +KK
Sbjct: 414 LTVAAAVDRGLFEGAARLKK 433
>gi|46105482|ref|XP_380545.1| hypothetical protein FG00369.1 [Gibberella zeae PH-1]
Length = 444
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 158/440 (35%), Positives = 226/440 (51%), Gaps = 78/440 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGGV+T+VS +V+L L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARIRTTSGGVVTIVSLLVVLFLSWGEWADYRRIDIHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL--DSQGNVIESRQDGI 123
GE + I+ ++TFP +PC +LS+D MD+SGEQ V H + K RL +SQG +
Sbjct: 65 GERMEIHLNITFPKMPCELLSLDVMDVSGEQQHGVMHGVNKVRLQPESQGGAV------- 117
Query: 124 GAPKID-KPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWAL 177
ID K L H H + +YCG CYGA + CC C+EVREAY + WA
Sbjct: 118 ----IDTKSLSLHDDAAHHLDPSYCGGCYGATPPANAQKAGCCQTCDEVREAYAQASWAF 173
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ ++QC+RE + +++ + EGC I G L VNKV GNFHFAPG+SF +HVHD+
Sbjct: 174 GRGEGVEQCEREHYGEKLDAQRSEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLK 233
Query: 238 AF----QRDSFNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPS 277
+ + S + +H ++ L FG P + NPLD R P+
Sbjct: 234 NYWDVPKGFSHDFTHIVHSLRFGPQLPDHIARKVGHKNTLWTNHHQNPLDDTRQETHDPN 293
Query: 278 GMYQYFIKVVPTVY-----------------TDVSG---------HTIQSNQFSVTEHFR 311
+ YF+K+VPT Y D +G +++++Q+SVT H R
Sbjct: 294 YNFMYFVKIVPTSYLPLGWDKKGIKIAGLLQEDNAGLGAYGYGEDGSVETHQYSVTSHRR 353
Query: 312 SSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGV 358
S G R T +PGVFF YD+SP+KV EE +F FL +CAIVGG
Sbjct: 354 SLAGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKAKTFSGFLAGLCAIVGGT 413
Query: 359 FTVSGIIDAFIYHGQRAIKK 378
TV+ +D ++ G +KK
Sbjct: 414 LTVAAAVDRGLFEGAARLKK 433
>gi|261188384|ref|XP_002620607.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis SLH14081]
gi|239593207|gb|EEQ75788.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis SLH14081]
gi|239609349|gb|EEQ86336.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis ER-3]
gi|327354450|gb|EGE83307.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis ATCC 18188]
Length = 435
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 158/433 (36%), Positives = 218/433 (50%), Gaps = 76/433 (17%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLR 70
LDA+ K ED RT SGGV+T+ + I++ L + E Y V +L+VD RGE +
Sbjct: 10 LDAFTKTVEDARIRTRSGGVVTITALIIIFFLIWGEWSEYRRVVVLPELVVDKGRGERME 69
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL---DSQGNVIESRQDGIGAPK 127
I+ +VTFP LPC +L++D MDISGE +V H + K RL + G V++ I A
Sbjct: 70 IHLNVTFPNLPCELLTLDVMDISGEYQTEVVHGVNKLRLSPAEEGGQVLD-----ITA-- 122
Query: 128 IDKPLQRHG---GRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNP 180
LQ H + + YCGSCYGA + CCN C+EVREAY K W+
Sbjct: 123 ----LQLHSKTDNAKDLDPNYCGSCYGAPAPPNAQKPGCCNTCDEVREAYAAKRWSFGRG 178
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ ++QC++EG+ + + EGC + G + VNKV GNFH APG+SF +H HD+ +
Sbjct: 179 ENVEQCEKEGYSANLDAQRKEGCRVEGVIRVNKVIGNFHIAPGRSFTNGNMHAHDLNNYY 238
Query: 241 RDSF--NISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKV 286
N+ HKI+ L FG P V NPLD P + YF+KV
Sbjct: 239 NTPIPHNVGHKIHYLRFGPQLPDEVSRRWKWTDHHHTNPLDNTEQHTTNPRLNFAYFVKV 298
Query: 287 VPTVYTDV----------------------------SGHTIQSNQFSVTEHFRSSEQG-- 316
V T Y + SG +I+++Q+SVT H RS + G
Sbjct: 299 VATSYLPLGWDDDWSSTVHSKVSNNVPLGKQGVSLGSGGSIETHQYSVTSHKRSVDGGND 358
Query: 317 -------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGII 365
RL + +PGVF YD+SP+KV E +F FLT VCA++GG TV+ I
Sbjct: 359 AEEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAI 418
Query: 366 DAFIYHGQRAIKK 378
D +Y G +KK
Sbjct: 419 DRALYEGSVRVKK 431
>gi|453082617|gb|EMF10664.1| DUF1692-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 432
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 156/434 (35%), Positives = 221/434 (50%), Gaps = 71/434 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT +GG++T+ S I++L L + E Y V +L+VD R
Sbjct: 5 SRFTKLDAFTKTVEDARIRTSTGGIVTITSLILILYLVWGEWTDYRRTVVHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ +++FP +PC +L++D MD+SGE V H + K RLD+ G I G A
Sbjct: 65 GEKMEIHMNISFPRVPCELLTLDVMDVSGEVQSGVMHGVNKVRLDANGKEI-----GKEA 119
Query: 126 PKIDKPLQRHGGRLEH-NETYCGSCYGAESSD----EDCCNNCEEVREAYRKKGWALSNP 180
++ Q + H + YCG CYGA + + CCNNC EVREAY W+
Sbjct: 120 LTVNSEEQ-----VPHLDPDYCGDCYGAPAPETATKAGCCNNCAEVREAYAGVSWSFGRG 174
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ ++QC RE + + + E+ EGC I G + VNKV GNFHFAPGKSF +HVHD+ +
Sbjct: 175 EGVEQCTREHYAEHLDEQRKEGCRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVHDLENYF 234
Query: 241 RDS---FNISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPSGMYQ 281
+ + +HKI+ L FG P V NPLD + + +
Sbjct: 235 QSGEVQHSFTHKIHHLRFGPELPDDVVKAVGKKGMAWSNHHLNPLDDTEQVTDEVAYNFM 294
Query: 282 YFIKVVPTVYT----DVSGH--------------------TIQSNQFSVTEHFRSSEQG- 316
YF+KVV T Y D SG +I+++Q+SVT H RS G
Sbjct: 295 YFVKVVSTAYLPLGWDGSGSLLDIPHELIALGGYGKGEQGSIETHQYSVTSHKRSLTGGD 354
Query: 317 --------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGI 364
RL +PGVFF YD+SP+KV E SF FL VCA++GG TV+
Sbjct: 355 AKAEGHEERLHAKGGIPGVFFSYDISPMKVINREARAKSFSGFLVGVCAVIGGTLTVAAA 414
Query: 365 IDAFIYHGQRAIKK 378
+D +Y G ++K
Sbjct: 415 VDRLLYEGGSKLRK 428
>gi|330919615|ref|XP_003298687.1| hypothetical protein PTT_09471 [Pyrenophora teres f. teres 0-1]
gi|311327999|gb|EFQ93219.1| hypothetical protein PTT_09471 [Pyrenophora teres f. teres 0-1]
Length = 437
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 154/437 (35%), Positives = 222/437 (50%), Gaps = 72/437 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++T+ S +V+ L + E Y +L+VD SR
Sbjct: 5 SRFNKLDAFTKTVEDARVRTTSGGIVTIASLLVIFWLSWGEWADYRRVTVRPELVVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I +++FP +PC +L++D MD+SGE + V H I K RL + DG A
Sbjct: 65 GERMEIAMNISFPRMPCELLTLDVMDVSGELQMGVTHGINKVRLSPEA-------DGSKA 117
Query: 126 PKIDKPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNP 180
+I K + H H YCG CYGA + CCN C+EVR+AY W+
Sbjct: 118 IEI-KAVDLHTDEASHLAPDYCGQCYGAPAPSNAKKPTCCNTCDEVRDAYASVSWSFGRG 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ ++QC+RE + + + ++ EGC + G ++VNKV GNFHFAPGKSF +HVHD+ +
Sbjct: 177 EGVEQCEREHYAEHLDQQRQEGCRLEGNIKVNKVVGNFHFAPGKSFSNGNLHVHDLENYF 236
Query: 241 RDSF--NISHKINKLAFGEHFPGVV---------------------NPLDGVRWTQETPS 277
+D + +H I++L FG VV NPLD + +
Sbjct: 237 KDEYTHTFTHHIHQLRFGPQLSDVVVQNMQKKHQESGIGGWSNHHINPLDETMQHTDEKA 296
Query: 278 GMYQYFIKVVPTVY---------------TDVSGHT--------IQSNQFSVTEHFRSSE 314
Y YFIKVV TVY +D+ G T I+++Q+SVT H RS +
Sbjct: 297 YNYMYFIKVVTTVYLPLGWEKVFPHPSKFSDILGATIDESYKGSIETHQYSVTSHKRSLQ 356
Query: 315 QGRLQT------------LPGVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTV 361
G + +PGVFF YD+SP++V E +F FL +CA++GG TV
Sbjct: 357 GGNDEKDGHKERIHARGGIPGVFFSYDISPMEVINREVREKTFSGFLVGLCAVIGGTLTV 416
Query: 362 SGIIDAFIYHGQRAIKK 378
+ ID +Y G IKK
Sbjct: 417 AAAIDRALYEGVNRIKK 433
>gi|367019108|ref|XP_003658839.1| hypothetical protein MYCTH_2295135 [Myceliophthora thermophila ATCC
42464]
gi|347006106|gb|AEO53594.1| hypothetical protein MYCTH_2295135 [Myceliophthora thermophila ATCC
42464]
Length = 436
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 155/441 (35%), Positives = 215/441 (48%), Gaps = 83/441 (18%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++T+VS IV+ L E Y V +L+VD R
Sbjct: 5 SRFTRLDAFTKTVEDARIRTTSGGIVTIVSLIVVFFLALGEWSDYRRIVVHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRL----------DSQGNV 115
GE + I+ ++TFP +PC +L++D MD+SGEQ V+H I K RL DS+ V
Sbjct: 65 GERMEIHLNITFPRIPCELLTLDVMDVSGEQQHGVQHGITKTRLRPLSEGGGDIDSKEIV 124
Query: 116 IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYR 171
+ SR + + + YCG CYGA + CCN C+EVR+AY
Sbjct: 125 LHSRDEAA---------------VHLDPNYCGECYGAPPPNNAKKPGCCNTCDEVRDAYA 169
Query: 172 KKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
+ WA + I QC+RE + +++ + EGC I G L VNKV GNFH APG+SF +
Sbjct: 170 QASWAFGRGEGIVQCEREHYSEKLDAQRNEGCRIEGGLRVNKVVGNFHIAPGRSFSNGNM 229
Query: 232 HVHDILAF--QRDSFNISHKINKLAFGEHFPGV----------------VNPLDGVRWTQ 273
HVHD+ + +H I+ L FG P VNPLD
Sbjct: 230 HVHDLKNYWDSPTKHTFTHTIHHLRFGPQLPESLTQKLGTKNLPWTNHHVNPLDDTHQQT 289
Query: 274 ETPSGMYQYFIKVVPTVYTDVSGH-----------------------TIQSNQFSVTEHF 310
+ + Y YF+K+VPT Y + +++++Q+SVT H
Sbjct: 290 DDVNYNYMYFLKIVPTSYLPLGWEKTWAGFRERHSAELGSFGTSPDGSVETHQYSVTSHK 349
Query: 311 RS------------SEQGRLQTLPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGG 357
RS Q +PGVFF YD+SP+KV EE SFL FL +CAIVGG
Sbjct: 350 RSLAGGNDAAEGHQERQHARGGIPGVFFSYDISPMKVINREERAKSFLGFLAGLCAIVGG 409
Query: 358 VFTVSGIIDAFIYHGQRAIKK 378
TV+ ID ++ G +KK
Sbjct: 410 TLTVAAAIDRALFEGTVRLKK 430
>gi|67479077|ref|XP_654920.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56472012|gb|EAL49533.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449701866|gb|EMD42605.1| endoplasmic reticulumgolgi intermediate compartment protein,
putative [Entamoeba histolytica KU27]
Length = 354
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 144/368 (39%), Positives = 210/368 (57%), Gaps = 20/368 (5%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M I+ DAYPKIN + + + GG++++V I M+ +F SEL Y + L VD S
Sbjct: 1 MQNIKRFDAYPKINSNNRVKHWIGGLLSIVCIITMIWMFSSELNDYFTIRKKPVLRVDES 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+ + L INFD+TFP CS SVD +D +GE +D+ +I K+RL N++ +D I
Sbjct: 61 KNKKLPINFDITFPHSACSFSSVDVLDTTGEVIIDISKNIKKERL----NLV--NEDEIS 114
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K K + +G T C C ES + CC CEE+ E+Y+K + P
Sbjct: 115 KKKFAKTV--YG-------TECPPC-NNESDKDKCCFTCEELTESYQKLNKEV--PKGSP 162
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+ + GEGC I G + VN+ +GNFH APG S + H+H + +
Sbjct: 163 QCEIRNIHKMTTFYNGEGCRISGTVFVNRASGNFHIAPGSSQQLTQEHIHSV-DWISGGI 221
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H N L+FG+ FPG++NP+DG+ T + MYQYF++VVP YT + I +N +
Sbjct: 222 NLTHTWNFLSFGDSFPGMINPMDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNKVIHTNGY 281
Query: 305 SVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
SVTEH+R S + Q +PGVF YD+S I+V + EE SF H LT++C I+GGVF +
Sbjct: 282 SVTEHYRPGSLKSPEQGIPGVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFS 341
Query: 364 IIDAFIYH 371
++D FI+H
Sbjct: 342 LLDYFIFH 349
>gi|300123299|emb|CBK24572.2| unnamed protein product [Blastocystis hominis]
Length = 376
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 136/361 (37%), Positives = 200/361 (55%), Gaps = 20/361 (5%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ LD YPKI +D+ +T SGG ++L S ++++LF SEL YL + +D +R
Sbjct: 28 KLEKLDIYPKIGDDYVIKTESGGFVSLFSGFIIIILFVSELTNYLKVNRTDVITIDNTRN 87
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
E L+INF+++ +PCS S+D MDISG+Q + V I + LD + +
Sbjct: 88 EKLQINFNISLYGIPCSEASLDIMDISGQQQMGVTSRIVQLDLDENHKPVNMALSSVLYE 147
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW-ALSNPDLIDQ 185
K P CGSC+GA S+ CCN C++V AY ++GW Q
Sbjct: 148 KNIDPA-------------CGSCFGASLSNV-CCNTCDDVLSAYERRGWDTWFVSKYSPQ 193
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
C++ + +GC ++G LEVNKVAGNFH A G + ++ H+H FN
Sbjct: 194 CRKNNDEVKKPRVNSQGCMMWGVLEVNKVAGNFHIAVGHAANRDSHHIHSFNPLMISKFN 253
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 305
++H I KL+FGEH PG+ NPLDG E+ + Y++KV+PTVY++ + T+ SN+ S
Sbjct: 254 VTHHIEKLSFGEHIPGIQNPLDGHDMVAESLTSQ-NYYLKVMPTVYSNRTS-TVVSNELS 311
Query: 306 VTEHFRSSEQ---GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
V E R E G++ +LPG+FF YD++P TE ++F HFL VCA++GGV V
Sbjct: 312 VNEVSRRVEMTPFGQITSLPGIFFIYDITPFMHVVTESRIAFAHFLVRVCAVIGGVAAVG 371
Query: 363 G 363
Sbjct: 372 A 372
>gi|452842116|gb|EME44052.1| hypothetical protein DOTSEDRAFT_71753 [Dothistroma septosporum
NZE10]
Length = 436
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 153/437 (35%), Positives = 219/437 (50%), Gaps = 73/437 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++T+ S +++L L + E Y +L+VD R
Sbjct: 5 SRFTKLDAFTKTVEDARIRTTSGGIVTVTSLLLILYLVWGEWADYRRITVHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ +V+FP +PC +L++D MD+SGE V H + K RL + G IE +
Sbjct: 65 GEKMEIHMNVSFPRVPCELLTLDVMDVSGEVQTGVMHGVNKVRLRPEAEGGGEIEKKALD 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALS 178
+G + + L + YCG CYGA ++ CCN C EVREAY W+
Sbjct: 125 LGVEEAAQHL---------DPDYCGECYGAPAPSNAAKPGCCNTCAEVREAYAGVSWSFG 175
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ ++QC+RE + + + + EGC I G + VNKV GNFHFAPGKSF +HVHD+
Sbjct: 176 RGENVEQCEREHYSEHLDAQRKEGCRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVHDLEN 235
Query: 239 FQRDSFNI----SHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPSG 278
F I +HKI+ L FG P V NPLDG E S
Sbjct: 236 FFNSPEGIQHTFTHKIHSLRFGPQLPDDVVNKVGKRGIAWSEHHLNPLDGTSQVTEEKSY 295
Query: 279 MYQYFIKVVPTVYTDVSGH------------------------TIQSNQFSVTEHFRSSE 314
+ YF+KVV T Y ++ +I+++Q+SVT H RS +
Sbjct: 296 NFMYFVKVVSTAYLPLAWKPSGSLLDLPHELVELGGYGKGEGGSIETHQYSVTSHKRSLQ 355
Query: 315 QG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTV 361
G RL +PGVFF YD+SP+KV E +F FLT V A++GG TV
Sbjct: 356 GGDANEEGHKERLHARGGIPGVFFSYDISPMKVVNREARTKTFTGFLTGVAAVIGGTLTV 415
Query: 362 SGIIDAFIYHGQRAIKK 378
+ +D +Y G + ++K
Sbjct: 416 AAAVDRLMYEGGQRVRK 432
>gi|407044387|gb|EKE42566.1| hypothetical protein ENU1_017250 [Entamoeba nuttalli P19]
Length = 354
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 144/368 (39%), Positives = 210/368 (57%), Gaps = 20/368 (5%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M I+ DAYPKIN + + + GG++++V I M+ +F SEL Y + L VD S
Sbjct: 1 MQNIKRFDAYPKINSNNRVKHWIGGLLSIVCIITMIWMFSSELNDYFTIRKKPVLRVDES 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+ + L INFD+TFP CS SVD +D +GE +D+ +I K+RL N++ +D I
Sbjct: 61 KNKKLPINFDITFPHSACSFTSVDVLDTTGEVIIDISKNIKKERL----NLV--NEDEIS 114
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K K + +G T C C E + CC CEE+ E+Y+K + P
Sbjct: 115 KKKFAKTV--YG-------TECPPC-NNEIDKDKCCFTCEELTESYQKLNKEV--PKGSP 162
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+ + + GEGC I G + VN+ +GNFH APG S + H+H + +
Sbjct: 163 QCEIKNIHKMTTFYNGEGCRISGTVFVNRASGNFHIAPGSSQQLTQEHIHSV-DWISGGI 221
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N++H N L+FG+ FPG++NPLDG+ T + MYQYF++VVP YT + I +N +
Sbjct: 222 NLTHTWNFLSFGDSFPGMINPLDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNKVINTNGY 281
Query: 305 SVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
SVTEH+R S + Q +PGVF YD+S I+V + EE SF H LT++C I+GGVF +
Sbjct: 282 SVTEHYRPGSLKSPEQGIPGVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFS 341
Query: 364 IIDAFIYH 371
++D FI+H
Sbjct: 342 LLDYFIFH 349
>gi|67524561|ref|XP_660342.1| hypothetical protein AN2738.2 [Aspergillus nidulans FGSC A4]
gi|40743850|gb|EAA63036.1| hypothetical protein AN2738.2 [Aspergillus nidulans FGSC A4]
gi|259486349|tpe|CBF84116.1| TPA: COPII-coated vesicle membrane protein Erv46, putative
(AFU_orthologue; AFUA_1G05120) [Aspergillus nidulans
FGSC A4]
Length = 437
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 155/441 (35%), Positives = 219/441 (49%), Gaps = 74/441 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ++ RT SGG+IT+ S ++++ L + E Y +L+VD
Sbjct: 2 AAKSRFTRLDAFAKTVDEARIRTTSGGIITIASLLIIIWLTWGEWVDYRRVAVLPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
SRGE + I+ ++TFP LPC + ++D MD+SGEQ + V H + K RL
Sbjct: 62 KSRGEKMEIHLNITFPRLPCELTTLDVMDVSGEQQVGVAHGVNKVRLAPAAE-------- 113
Query: 123 IGAPKID-KPLQRHGGRLEH-NETYCGSCYGAESS----DEDCCNNCEEVREAYRKKGWA 176
G +D + LQ H +H + YCG C GA CC+ C+EVREAY +K W
Sbjct: 114 -GGRVLDVQALQLHAEEAKHLDPDYCGECGGAPPPPNAIKPGCCSTCDEVREAYAQKQWG 172
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
I+QC+RE + +RI + EGC + G + VNKV GNFH APG+SF + VH+HDI
Sbjct: 173 FGKGTNIEQCEREHYSERIDAQRREGCRLEGVIRVNKVVGNFHIAPGRSFSSNNVHIHDI 232
Query: 237 LAFQR------DSFNISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSG 278
++ + +SH I+ L FG P + NPLD P+
Sbjct: 233 ANYEERGLSPAEQHTMSHIIHSLRFGPQLPDELSDRWQWTDHHHTNPLDSTSQEAPEPAY 292
Query: 279 MYQYFIKVVPTVYTDV----------------------------SGHTIQSNQFSVTEHF 310
+ YFIKVV T Y + S +I+++Q+SVT H
Sbjct: 293 SFMYFIKVVSTSYLPLGWDPLYSASLHAAADTNTPLGAQGLSAGSQGSIETHQYSVTSHK 352
Query: 311 RSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGG 357
RS G R+ +PGVFF YD+SP+KV E +F FLT VCAIVGG
Sbjct: 353 RSLRGGDASDEAHKERIHAAGGIPGVFFNYDISPMKVINREARPKTFTGFLTGVCAIVGG 412
Query: 358 VFTVSGIIDAFIYHGQRAIKK 378
TV+ ID +Y G ++K
Sbjct: 413 TLTVAAAIDRTLYEGVSRVRK 433
>gi|72393511|ref|XP_847556.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175086|gb|AAX69235.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70803586|gb|AAZ13490.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|261330829|emb|CBH13814.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 405
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 144/408 (35%), Positives = 220/408 (53%), Gaps = 33/408 (8%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + LD +PK + +D RT GGV+++ S +++ L E+R +L+ V + ++
Sbjct: 1 MKGLSRLDVFPKFDTRFEQDARQRTALGGVLSMASILIITFLVVGEIRYFLSTVEQHEMY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD G + + ++TFP +PC +++ DA+D GE +V D K R+DS +
Sbjct: 61 VDPHIGGIMHMKVNITFPRVPCDLMTADAIDAFGEYVENVVTDTAKVRVDSS----TLKP 116
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G +D Q G NE C +CYGAE + +CC+ C++VR A+ ++ W
Sbjct: 117 LGKARQLVDLKKQPTNGNETGNEN-CPTCYGAEKNPGECCHTCDDVRRAFAERQWEFHED 175
Query: 181 DL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
D+ I QC E EGCN++ V +V GN HF PG+ F+ G H+H
Sbjct: 176 DVSIAQCAHERLKVAADSASAEGCNLHASFSVPRVTGNIHFVPGRMFNFFGQHLHSFKGE 235
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLD------GVRWTQETPSGMYQYFIKVVPTVYTD 293
N+SH ++ L FGE FPG NP+D GV+ E G + YF+KVVPT+Y
Sbjct: 236 TIRKLNLSHIVHALEFGERFPGQNNPMDGMVNARGVKDPSEPLIGRFTYFVKVVPTLYQV 295
Query: 294 VS----GHTIQSNQFSVTEHFRSS----EQGRLQ-------TLPGVFFFYDLSPIKVTFT 338
VS G+ ++SNQ+SVT HF S ++G +PGVF YD+SPI+V+ T
Sbjct: 296 VSMANTGNLVESNQYSVTHHFTPSWAAPKEGETDNPNSDPLVVPGVFISYDISPIRVSVT 355
Query: 339 EEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
H S +H + +CA+ GGV+TV+G+ID+ +HG + +++KI GK
Sbjct: 356 RTHPYPSIVHLVLQLCAVGGGVYTVTGLIDSLFFHGIKRVQEKINRGK 403
>gi|367052857|ref|XP_003656807.1| hypothetical protein THITE_2121964 [Thielavia terrestris NRRL 8126]
gi|347004072|gb|AEO70471.1| hypothetical protein THITE_2121964 [Thielavia terrestris NRRL 8126]
Length = 436
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 164/436 (37%), Positives = 225/436 (51%), Gaps = 73/436 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG+IT+VS IV+L L + E Y V +L+VD R
Sbjct: 5 SRFTRLDAFTKTVEDARIRTTSGGIITIVSIIVVLFLAWGEWADYRRVVVHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP +PC +L++D MD+SGEQ V+H + K RL R G
Sbjct: 65 GERMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVTKTRL---------RPLSEGG 115
Query: 126 PKID-KPLQRHGG---RLEHNETYCGSCYGAE----SSDEDCCNNCEEVREAYRKKGWAL 177
ID K L H + + +YCG CYGA+ + CCN C+EV+EAY ++ WA
Sbjct: 116 GDIDSKALALHAADEAAIHLDPSYCGPCYGAKPPTTAKKPGCCNTCDEVKEAYAQQAWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
D I+QC+RE + +R+ E+ EGC I G L VNKV GNFH APG+SF VHVHD+
Sbjct: 176 GRGDGIEQCEREHYGERLDEQRREGCRIEGGLRVNKVVGNFHIAPGRSFSNGNVHVHDLK 235
Query: 238 AF--QRDSFNISHKINKLAFGEHFPGV----------------VNPLDGVRWTQETPSGM 279
+ +H I+ L FG P +NPLDG + +
Sbjct: 236 NYWDTPTKHTFTHIIHHLRFGPQLPDSLHKKLGTKHLPWTNHHLNPLDGTSQETDDVNFN 295
Query: 280 YQYFIKVVPTVY------------------------TDVSGHTIQSNQFSVTEHFRSSEQ 315
Y YFIK+VPT Y T G +++++Q+SVT H RS
Sbjct: 296 YMYFIKIVPTSYLPLGWEKTWAGFREEHQAELGSFGTSADG-SVETHQYSVTSHKRSLAG 354
Query: 316 G---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 362
G RL +PGVFF YD+SP+KV EE +FL F+ +CAIVGG TV+
Sbjct: 355 GDDAAEGHRERLHAKGGIPGVFFSYDISPMKVINREERSKTFLGFIAGLCAIVGGTLTVA 414
Query: 363 GIIDAFIYHGQRAIKK 378
+D ++ G +KK
Sbjct: 415 AAVDRALFEGTVRLKK 430
>gi|148674215|gb|EDL06162.1| ERGIC and golgi 3, isoform CRA_b [Mus musculus]
Length = 269
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 131/249 (52%), Positives = 173/249 (69%), Gaps = 3/249 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 15 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 74
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 75 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHE 134
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K++ + L+ N C SCYGAES D CCN+CE+VREAYR++GWA NPD I+
Sbjct: 135 LGKVEVTV-FDPNSLDPNR--CESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIE 191
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVH + SF
Sbjct: 192 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSF 251
Query: 245 NISHKINKL 253
+ + + L
Sbjct: 252 GLDNPSDCL 260
>gi|406606433|emb|CCH42207.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Wickerhamomyces ciferrii]
Length = 405
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/402 (36%), Positives = 219/402 (54%), Gaps = 40/402 (9%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
S DA+ K ED +T SGG+IT+ + + L +E R + + +L+VD R L
Sbjct: 9 SFDAFSKTVEDARVKTTSGGLITVTCILTLFSLIINEWRQFNEITIDPELVVDRDRNLKL 68
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKI 128
IN DVTFP LPC I+S+D MD+SG+ LDV + F K RL G I + IG
Sbjct: 69 DINLDVTFPDLPCDIMSLDIMDVSGDLQLDVTNYGFTKIRLTETGEEIGEEEMKIG---- 124
Query: 129 DKPLQRHG-GRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALS 178
HG + YCG CYGA++ D++ CCN+C+ VR+AY GWA
Sbjct: 125 ----DDHGHADADIPADYCGPCYGAKNQDKNENKPQEEKVCCNDCDSVRKAYASVGWAFF 180
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ ++QC+REG++++I + GEGC + G ++N++ GN HFAPG S+ HVHD+
Sbjct: 181 DGKNVEQCEREGYVKKINDRLGEGCRVKGTAKLNRINGNIHFAPGASYSAPNRHVHDLSL 240
Query: 239 FQRD-SFNISHKINKLAFG---------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVP 288
+ ++ FN H IN +FG E +PLDG Q + +Y YF+KVVP
Sbjct: 241 YGKNKDFNFRHVINHFSFGPDVNSKYTAETLELSSHPLDGTNAIQGSRDHLYSYFLKVVP 300
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFT 338
T Y ++G +++NQFS T H R GR + +PG+FF +++SP+K+
Sbjct: 301 TRYEYLNGTKVETNQFSSTYHDRPLTGGRDEDHPNTFHARGGIPGLFFHFEMSPLKIINK 360
Query: 339 EEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
E + S+ FL NV + +GG+ TV ++D ++ + I++K
Sbjct: 361 ETYGTSWSGFLLNVISAIGGILTVGAVVDRTVFVADKVIRRK 402
>gi|323454843|gb|EGB10712.1| hypothetical protein AURANDRAFT_2571, partial [Aureococcus
anophagefferens]
Length = 380
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/391 (36%), Positives = 206/391 (52%), Gaps = 45/391 (11%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +M K+R++D YPK ++F RT GGV +L + +V ++L SEL+ L T +L
Sbjct: 1 MADVMAKLRNMDMYPKTKDEFRVRTMQGGVSSLFAVVVAIILVRSELKHSLAVSTHDRLF 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI---- 116
V++S G+ L + F++ FP C +L++DA D SG+ V+ + K RLD+ G +
Sbjct: 61 VNSSHGDGLSVRFELEFPRANCELLAIDANDESGQPLEGVQQHVIKTRLDTNGRRVLVNR 120
Query: 117 ------------ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCE 164
+ ++ + AP KP E CG CYGA+ + CC C+
Sbjct: 121 KAANSVHKVGDTATSEEHLAAPDEAKP-----------EVACGDCYGAQDDERPCCATCD 169
Query: 165 EVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGK 224
+VR AYRK+GW + + QC E + + EGC+I G LE+ V+GNFH APG+
Sbjct: 170 DVRSAYRKRGWTF-HEHTVAQCAGELAEAALDLDSDEGCSIKGTLELPAVSGNFHVAPGR 228
Query: 225 SFHQSGV-HVHDILAFQRDSFNISHKINKLAFG---------EHFPGVVNP-------LD 267
SG+ D++ D FN+SH + +L FG VV P LD
Sbjct: 229 HLQTSGLFKGMDLVQLTFDKFNVSHTVKQLRFGPDERSLEPARASRKVVGPDVDLSSQLD 288
Query: 268 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 327
G T GM+QY++KVVPTVY ++ G T + Q+SVTEH R G + LPGVFFF
Sbjct: 289 GESRTLGDGYGMHQYYLKVVPTVYKNLGGKTRELWQYSVTEHVRHVAPGSGKGLPGVFFF 348
Query: 328 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
Y++SP+ F E +L LT + AIVGGV
Sbjct: 349 YEVSPLCAEFVERRNGWLALLTGLAAIVGGV 379
>gi|61555552|gb|AAX46728.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
Length = 283
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 131/249 (52%), Positives = 170/249 (68%), Gaps = 11/249 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN +V FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE D CCN+CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFG 236
Query: 241 RDSFNISHK 249
D+ K
Sbjct: 237 LDNVRTRWK 245
>gi|225562998|gb|EEH11277.1| COPII coated vesicle component Erv46 [Ajellomyces capsulatus
G186AR]
gi|240279818|gb|EER43323.1| COPII-coated vesicle component Erv46 [Ajellomyces capsulatus H143]
gi|325092948|gb|EGC46258.1| COPII-coated vesicle component Erv46 [Ajellomyces capsulatus H88]
Length = 435
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 154/432 (35%), Positives = 216/432 (50%), Gaps = 64/432 (14%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGV+T+ + V+ L + E Y V +L+VD R
Sbjct: 5 SRFARLDAFTKTVEDARIRTRSGGVVTISALFVIFFLIWGEWSEYRRIVVLPELVVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ +VTFP LPC +L++D MDISGE V H + K RL S +E +
Sbjct: 65 GERMEIHLNVTFPNLPCELLTLDVMDISGEYQTGVIHGVNKVRLSS----VEEGGRVLDI 120
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ Q + G + + YCG CYGA + CCN CEEVR+AY KGWA +
Sbjct: 121 TALQLHSQTNKG-TDVDPDYCGQCYGATPPSNAKKPGCCNTCEEVRDAYAAKGWAFGRGE 179
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
++QC++EG+ + + EGC + G + VNKV GNFH APG+SF +H HD+ +
Sbjct: 180 NVEQCEKEGYSANLDAQRKEGCRVEGVIRVNKVVGNFHIAPGRSFTNGNLHAHDLDNYYH 239
Query: 242 DSF--NISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVV 287
N+ H+I+ L FG P + NPLD P + YF+KVV
Sbjct: 240 TPVQHNMGHRIHYLRFGPQLPEQLSSRWKWTDNHHTNPLDNTEQHTTNPRFNFMYFVKVV 299
Query: 288 PTVYTDV--------SGH--------------------TIQSNQFSVTEHFRSSEQG--- 316
T Y + S H +I+++Q+SVT H RS + G
Sbjct: 300 STSYLPLGWDPDASSSAHSQYSKNAPLGKQGLSFGSYGSIETHQYSVTSHKRSVDGGDDS 359
Query: 317 ------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIID 366
RL + +PGVF YD+SP+KV E +F FLT VCA++GG TV+ ID
Sbjct: 360 AEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAID 419
Query: 367 AFIYHGQRAIKK 378
+Y G +KK
Sbjct: 420 RVLYEGAVRVKK 431
>gi|347842451|emb|CCD57023.1| similar to endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Botryotinia fuckeliana]
Length = 439
Score = 251 bits (640), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 157/438 (35%), Positives = 216/438 (49%), Gaps = 74/438 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG++T+ S +++L L F E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARVRTTSGGIVTIASLLIVLYLAFGEWADYRRITVHPELVVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP +PC +L++D MD+SGEQ + V H + K RL Q G
Sbjct: 65 GEKMEIHLNITFPKIPCELLTLDVMDVSGEQQVGVMHGVKKVRLGPQEE---------GG 115
Query: 126 PKID-KPLQRHGGR---LEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWAL 177
ID K L H + YCG+CYGA + CCN C+EVREAY WA
Sbjct: 116 KVIDIKALDLHNAEDSATHLDPNYCGACYGATPPPNAQKPGCCNTCDEVREAYASVSWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ ++QC+RE + +R+ + EGC I G L VNKV GNFH APG+SF +HVHD+
Sbjct: 176 GRGENVEQCEREHYGERLDSQRKEGCRIEGGLRVNKVIGNFHIAPGRSFTNGNMHVHDLN 235
Query: 238 AFQRDSFN----ISHKINKLAFGEHFPGVV-----------------NPLDGVRWTQETP 276
F SH I+ L FG P V NPLD
Sbjct: 236 NFFDTPVPGGHVFSHHIHSLRFGPELPEEVFKKLGSDSIIPWTNHHLNPLDNTEQITHEA 295
Query: 277 SGMYQYFIKVVPTVYTDVS-------------------GH----TIQSNQFSVTEHFRS- 312
+ + YF+KVV T Y + GH +I+++Q+SVT H RS
Sbjct: 296 AYNFMYFVKVVSTSYLPLGWETNYNSRPHDASVDIGTYGHSEDGSIETHQYSVTSHRRSL 355
Query: 313 -----SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFT 360
S +G + L PGVFF YD+SP+KV EE L FLT +CAIVGG T
Sbjct: 356 NGGDDSAEGHKEKLHARGGIPGVFFSYDISPMKVINKEERTKTLAGFLTGLCAIVGGTLT 415
Query: 361 VSGIIDAFIYHGQRAIKK 378
V+ +D +Y G ++K
Sbjct: 416 VAAAVDRGVYEGATRLRK 433
>gi|315044047|ref|XP_003171399.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma gypseum CBS 118893]
gi|311343742|gb|EFR02945.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma gypseum CBS 118893]
Length = 435
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 155/435 (35%), Positives = 217/435 (49%), Gaps = 70/435 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++T+ + +V+L L + E + Y V + +L+VD R
Sbjct: 5 SRFTRLDAFAKTVEDARIRTRSGGIVTITALLVVLYLVWGEWKDYRRVVVQPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP LPC +L++D MD+SGE DV H + K RL S G VI+
Sbjct: 65 GERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRLSSAAEGGKVIDVTALA 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYG----AESSDEDCCNNCEEVREAYRKKGWALS 178
+ K D P L+ N YCG CYG + + CCN CEEVR+AY +K WA
Sbjct: 125 LHK-KEDSP-----AHLDPN--YCGDCYGVPAPSNAKKPGCCNTCEEVRDAYAEKNWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ + QC EG+ QRI E+ EGC I G L VNKVAGNFH APG+S H HD+
Sbjct: 177 RGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDN 236
Query: 239 FQRDSF--NISHKINKLAFGEHFP------------GVVNPLDGVRWTQETPSGMYQYFI 284
+ +SH I+KL FG P +NPLD + + YF+
Sbjct: 237 YYHTPVPHTMSHTIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSDHKTDEARYNFMYFV 296
Query: 285 KVVPTVYTDV----------------------------SGHTIQSNQFSVTEHFRS---- 312
KVV T Y + + +I+++Q+SVT H RS
Sbjct: 297 KVVSTSYLPLGWDPTWSSEVHSQAHKDIPLGNHGVYFGTQGSIETHQYSVTSHQRSLDAE 356
Query: 313 --SEQGRLQT------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSG 363
S +G + +P V F Y++SP+KV E S F T VCA++GG TV+
Sbjct: 357 DASAEGHKERQHTRGGIPSVIFNYEISPMKVINREARPKSLSAFFTGVCAVIGGTLTVAA 416
Query: 364 IIDAFIYHGQRAIKK 378
+D +Y G +KK
Sbjct: 417 AVDRLLYEGGLRVKK 431
>gi|295672798|ref|XP_002796945.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226282317|gb|EEH37883.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 435
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 154/439 (35%), Positives = 220/439 (50%), Gaps = 72/439 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGG++T+V+ V+ L + E Y V +L+VD
Sbjct: 2 APKSRFARLDAFTKTVEDARIRTRSGGLVTIVALFVISFLIWGEWYEYRRIVVLPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESR 119
RGE + I+ ++TFP LPC +L++D MD+SGE V H I K RL + G+VI++
Sbjct: 62 KGRGERMEIHLNITFPHLPCELLTLDVMDVSGEMQSGVIHGISKVRLAPESEGGHVIDTT 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKG 174
L +H + YCG CYGA ++ CC+ CEEVREAY +
Sbjct: 122 A---------LVLHTQTDAAKHLDPDYCGPCYGAPPPPHATKPGCCSTCEEVREAYASQS 172
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
WA + ++QC+REG+ + + + EGC I G L VNKV GNFH APG+SF +H H
Sbjct: 173 WAFGRGENVEQCEREGYSKNLDAQRNEGCRIEGVLRVNKVIGNFHIAPGRSFSNGNLHAH 232
Query: 235 DILAFQRDSFN--ISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMY 280
D+ + ++HKI++L FG P + NPLD P +
Sbjct: 233 DLDTYYHTPVPHYMAHKIHQLRFGPQLPDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNF 292
Query: 281 QYFIKVVPTVYTDV----------------------------SGHTIQSNQFSVTEHFRS 312
YF+KVV T Y + S +I+++Q+SVT H RS
Sbjct: 293 MYFVKVVSTSYLPLGWSPEFSSSVHETTLRDTPLGKQGVHFGSSGSIETHQYSVTSHKRS 352
Query: 313 SEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 359
+ G RL + +PGVF YD+SP+KV E +F FLT VCA++GG
Sbjct: 353 IDGGDDAAEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 412
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ +D +Y G +KK
Sbjct: 413 TVAAAVDRALYEGAVRVKK 431
>gi|224000966|ref|XP_002290155.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973577|gb|EED91907.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 396
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 144/395 (36%), Positives = 219/395 (55%), Gaps = 53/395 (13%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
RS+D + I+ +F RT SG I+L + + L L SE + + V +
Sbjct: 10 RSIDTHSPISSEFRIRTLSGAAISLFTLLFTLYLISSEYSYNFSTTFLDHVHVMPQSPDG 69
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQ---HLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
L + FD+TFP +PC++L+ DA D +G+ H+D KH I+K RL+ G
Sbjct: 70 LEVEFDITFPHIPCALLASDANDPTGQSQSFHIDKKHRIWKHRLNKDG------------ 117
Query: 126 PKIDKPLQRH-----GGRL---EHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
KP+ R GG L +H+E CGSCYGA E CCN C++V+ AYR K W +
Sbjct: 118 ----KPIGRKSRFELGGTLTSSDHDEEECGSCYGAGGEGE-CCNTCDDVKRAYRTKQWHI 172
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG------- 230
++ I QC L R+K+E+GEGCNI+G++ ++ GN HFAP + + + G
Sbjct: 173 TDMTKITQCAH---LVRVKDEDGEGCNIHGYVALSTGGGNLHFAPDRQWEKEGDKQNGLM 229
Query: 231 -----VHVHDILAFQRDS---FNISHKINKLAFGEHFP-------GVVNPLDGVRWTQET 275
+++ I+ D+ FN++H +NKL+FG + P + + LDG T
Sbjct: 230 IMGGFINLDSIVEMFNDAYEQFNVTHTVNKLSFGPYMPKHVKNSLNLTSQLDGATRTVTD 289
Query: 276 PSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKV 335
GM+Q+++++VPTVY ++G TI++ Q+SVTEH R + G + +PGVFFFY++S + V
Sbjct: 290 GYGMFQFYLQIVPTVYRFLNGTTIETFQYSVTEHVRHVDPGSNRGMPGVFFFYEVSALHV 349
Query: 336 TFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
F E + HF T VCA VGG FTV G++D ++
Sbjct: 350 EFEEYRRGWTHFFTGVCAAVGGAFTVMGMLDRLVF 384
>gi|401839164|gb|EJT42494.1| ERV46-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 415
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 151/417 (36%), Positives = 217/417 (52%), Gaps = 59/417 (14%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
SLDA+ K ED RT +GG+ITL + L L +E R + + VT +L+VD R L
Sbjct: 8 SLDAFAKTEEDVRVRTKAGGLITLSCILTTLFLLVNEWRQFNSVVTRPQLVVDRDRHAKL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-KRLDSQGNVI------ESRQDG 122
+N DVTFP++PC ++++D MD SGE LD+ F RLD +G + + DG
Sbjct: 68 ELNIDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMTRLDKEGRPVGDAAELQVGGDG 127
Query: 123 IG-APKIDKPLQRHGGRLEHNETYCGSCYGAES---------SDEDCCNNCEEVREAYRK 172
G AP D P YCG CYGA +D+ CC +C+ VR AY
Sbjct: 128 DGVAPVNDDP------------NYCGPCYGARDQTQNENLAQADKVCCQDCDAVRSAYLD 175
Query: 173 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
GWA + I+QC+REG++ +I E EGC I G ++N++ GN HFAPG+ F + H
Sbjct: 176 AGWAFFDGKNIEQCEREGYVSKINEHLHEGCRIEGSAQINRIQGNIHFAPGRPFQNANGH 235
Query: 233 VHDILAFQRD-SFNISHKINKLAFGE--------------HFPGVV--NPLDGVRWTQE- 274
HD+ +++ N +H IN L+FG+ H V+ +PLDG + E
Sbjct: 236 FHDVSLYEKTPDLNFNHMINHLSFGKPIESRNKLLENDDRHGGAVIATSPLDGRKVFPER 295
Query: 275 -TPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPG 323
T S ++ YF K+VPT Y + I++ QFS T H R GR Q +PG
Sbjct: 296 TTHSHLFSYFAKIVPTRYEYLDDVVIETAQFSATYHSRPLRGGRDQDHPNTFHARGGIPG 355
Query: 324 VFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
+F F+++SP+KV E+H ++ F+ N +GGV V ++D Y QR+I K
Sbjct: 356 LFVFFEMSPLKVINKEQHGQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|389612123|dbj|BAM19583.1| ptx1 protein [Papilio xuthus]
Length = 285
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 129/292 (44%), Positives = 179/292 (61%), Gaps = 16/292 (5%)
Query: 100 VKHDIFKKRLDSQGNVIES-RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDED 158
+ H+I K+RLD GN IE +++ I I ++++ L CGSCYGA +D
Sbjct: 1 MDHNIHKRRLDLDGNPIEEPKKEEIA---ISSTVKQNTSELA--TVTCGSCYGAAFNDSQ 55
Query: 159 CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 218
CCN CE+V+EAYR + WAL + I QCK + L++ EGC IYG++EVN+V G+F
Sbjct: 56 CCNTCEDVKEAYRIRRWALPDLATIVQCKDDESLEKANLALKEGCQIYGYMEVNRVGGSF 115
Query: 219 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV-VNPLDGVRWTQETPS 277
H APGKSF + VHVHD+ + +FN +H I L+FG PLDGV+ + +
Sbjct: 116 HIAPGKSFTINHVHVHDVQPYSSSAFNTTHXIQHLSFGSDIKSANTAPLDGVKGIAQEGA 175
Query: 278 GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-----SEQGRLQTLPGVFFFYDLSP 332
M+QY+IK+ PT+Y + + +NQFSVT H +S SE G +PG FF Y+LSP
Sbjct: 176 VMFQYYIKIGPTMYVKLDKTVLHTNQFSVTRHQKSVSNINSESG----MPGAFFSYELSP 231
Query: 333 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+ V +TE+ S HF TN+CAI+GGVFTV+GI+D +YH A KI +GK
Sbjct: 232 LMVKYTEKERSIGHFATNICAIIGGVFTVAGILDTLLYHSLNAFHNKIVLGK 283
>gi|363752862|ref|XP_003646647.1| hypothetical protein Ecym_5030 [Eremothecium cymbalariae
DBVPG#7215]
gi|356890283|gb|AET39830.1| hypothetical protein Ecym_5030 [Eremothecium cymbalariae
DBVPG#7215]
Length = 399
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 142/397 (35%), Positives = 215/397 (54%), Gaps = 35/397 (8%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
S DA+ K ED RT +GG+I+L +V LLL F+E + + +L++D R +
Sbjct: 8 SFDAFAKTEEDVRVRTKAGGIISLGCIVVTLLLLFNEWSQFNTVIQRPQLVLDRDRRLKM 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKI 128
+N D F +PC++L++D MD SGE LD++ F K RLD G I + + +G+ K
Sbjct: 68 DLNLDFEFSNMPCAMLNLDVMDTSGEVQLDLQDAGFTKTRLDHSGTPIRTEKLEVGSNK- 126
Query: 129 DKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSN 179
L + YCGSCYG++S D + CC CEEVREAY +KGWA +
Sbjct: 127 -------AVHLPDDPNYCGSCYGSKSQDNNDALPKEQKVCCQTCEEVREAYSEKGWAFFD 179
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG-VHVHDILA 238
I+QC REG++++I + EGC + G ++N++ GN HFAPG++ + H HD+
Sbjct: 180 GQKIEQCIREGYVEKINSQLHEGCRVKGSAKLNRIQGNIHFAPGRTTNSGKRTHTHDVSL 239
Query: 239 FQRDS-FNISHKINKLAFGEHFPGVV-NPLDG---VRWTQETPSGMYQYFIKVVPTVYTD 293
+ S N +H I+KL+FG G + NPLDG + + + YF K+VPT Y
Sbjct: 240 YDTHSHLNFNHIIHKLSFGSDADGALSNPLDGHKNIIQGDDAHFSTFSYFTKIVPTRYEY 299
Query: 294 VSGHTIQSNQFSVTEHFRSSEQGRLQTLP----------GVFFFYDLSPIKVTFTEEH-V 342
+ G +++ QFSVT H R + G+ P GV F+++SP+KV +E+H +
Sbjct: 300 LDGRKLETTQFSVTTHSRPLKGGKDDDHPNTIHHRGGIAGVTIFFEMSPLKVINSEKHAI 359
Query: 343 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
++ F+ N +G V V +ID Y QR+I K
Sbjct: 360 TWSGFVLNCITSIGSVLAVGTVIDKITYRAQRSIWGK 396
>gi|401426616|ref|XP_003877792.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494038|emb|CBZ29334.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 406
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/403 (33%), Positives = 222/403 (55%), Gaps = 33/403 (8%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M +++ LD +PK + +D RT SGGV ++V+ +V++ L E+R +L+ ++
Sbjct: 1 MRQLKHLDVFPKFDRKFEQDARHRTVSGGVFSVVAVVVIIWLLVGEVRYFLSVEEHQEMF 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ-GNVIESR 119
VDT G +++ +VTF +PC ++++DA+DI G DV+ + K+R+D+ G VI +
Sbjct: 61 VDTKVGGDMQVTVNVTFNHVPCDLITLDAVDIFGVFANDVEGNTVKQRIDTATGQVISAA 120
Query: 120 QDGIGAPK-IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
+ + K + K + G E+ C SCYGAE DCC+ CE+VR+AY ++GW L
Sbjct: 121 RAIVDEKKVVTKAIDADGAEKEN----CPSCYGAERHPGDCCHTCEDVRQAYVRRGWKLD 176
Query: 179 NPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
++ ++QC + EGCN+Y ++ G+ F PG+ + G +HD++
Sbjct: 177 IDEISVEQCAEDRIKMATAAFGKEGCNLYATFAASRATGSLQFIPGRIYETLGRRMHDLM 236
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRW-------TQETPSGMYQYFIKVVPTV 290
++SH ++ L FG+ FPG NPLDG ++ +G + YF+K+VPT
Sbjct: 237 GSATRKLDLSHTVHTLEFGDPFPGQQNPLDGTAQGSALSGDAKDAMNGRFSYFVKLVPTT 296
Query: 291 YTDVS-----GHTIQSNQFSVTEHFRSSEQGRLQT--------LPGVFFFYDLSPIKVTF 337
Y S T++SNQ+S T HF SE + ++ +PGVF YDLSP+++
Sbjct: 297 YQRYSLITGLQDTVESNQYSATHHFTPSEAAKAESQAPKKQEIVPGVFMTYDLSPVRILV 356
Query: 338 TEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
E H S HF+ VCA+ GGV TV G++D+ +H R I+K
Sbjct: 357 QERHPYPSLAHFVLQVCAVCGGVLTVVGLVDSLCFHSVRKIRK 399
>gi|326476034|gb|EGE00044.1| COPII-coated vesicle protein [Trichophyton tonsurans CBS 112818]
gi|326481270|gb|EGE05280.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Trichophyton equinum CBS 127.97]
Length = 435
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 154/435 (35%), Positives = 213/435 (48%), Gaps = 70/435 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGV+T+ + ++++ L + E + Y V + +L+VD R
Sbjct: 5 SRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP LPC +L++D MD+SGE DV H + K RL S G VI+
Sbjct: 65 GERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRLSSAAEGGKVIDVTALA 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYG----AESSDEDCCNNCEEVREAYRKKGWALS 178
+ K D P L+ N YCG CYG + + CCN C+EVR+AY +K WA
Sbjct: 125 LHK-KEDSP-----AHLDPN--YCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ + QC EG+ QRI E+ EGC I G L VNKVAGNFH APG+S H HD+
Sbjct: 177 RGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDN 236
Query: 239 FQRDSF--NISHKINKLAFGEHFP------------GVVNPLDGVRWTQETPSGMYQYFI 284
+ +SH I+KL FG P +NPLD + YF+
Sbjct: 237 YYHTPVPHTMSHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEARYNFLYFV 296
Query: 285 KVVPTVYTDV----------------------------SGHTIQSNQFSVTEHFRS---- 312
KVV T Y + S +I+++Q+SVT H RS
Sbjct: 297 KVVSTSYLPLGWDPTLSSEAHSQAHRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAE 356
Query: 313 --------SEQGRLQTLPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSG 363
Q +P V F YD+SP+KV E S F T VCA++GG TV+
Sbjct: 357 DASADGHKERQHARGGIPSVMFNYDISPMKVINRESRPKSLSAFFTGVCAVIGGTLTVAA 416
Query: 364 IIDAFIYHGQRAIKK 378
+D +Y G +KK
Sbjct: 417 AVDRLLYEGSLRVKK 431
>gi|452980033|gb|EME79795.1| hypothetical protein MYCFIDRAFT_64499 [Pseudocercospora fijiensis
CIRAD86]
Length = 436
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 150/437 (34%), Positives = 223/437 (51%), Gaps = 73/437 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT +GG++T+ S +++L L + E Y + +L+VD R
Sbjct: 5 SRFTRLDAFTKTVEDARVRTSTGGIVTIASLLLILYLTWGEWADYRKIIIHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN---VIESRQDG 122
GE + I+ +V+FP +PC +L++D MD+SGE V H I K RL S + VIE ++
Sbjct: 65 GERMEIHLNVSFPRVPCELLTLDVMDVSGEVQTGVLHGINKVRLSSVADGSKVIEKQKLD 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWALS 178
+ A + L YCG CYGA + D CCN C EVR+AY W+
Sbjct: 125 LDAAENSVHLA---------PDYCGECYGAPAPDNAKKAGCCNTCAEVRDAYASVSWSFG 175
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ ++QC+RE + +++ + EGC I G L VNKV GNFHFAPGKSF +HVHD+
Sbjct: 176 RGENVEQCEREHYSEQLDAQRKEGCRIEGALRVNKVVGNFHFAPGKSFSNGNLHVHDLDN 235
Query: 239 FQRDS---FNISHKINKLAFGEHFP----------GV------VNPLDGVRWTQETPSGM 279
+ + +H I++L FG P G+ +NPLD + +
Sbjct: 236 YFNSGEVEHSFTHHIHRLRFGPPLPHDFDKRVGKKGMAWSNHHLNPLDDTHQETDDSAFN 295
Query: 280 YQYFIKVVPTVYTDVS---------------------GH----TIQSNQFSVTEHFRSSE 314
+ YF+KVV T Y + GH +I+++Q+SVT H RS +
Sbjct: 296 FMYFVKVVSTAYLPLGWEKTNSFSRSLPHELIDLGDYGHGEQGSIETHQYSVTSHKRSLQ 355
Query: 315 QGRLQT------------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTV 361
G + +PGVFF YD+SP+KV E SF FL VCA++GG TV
Sbjct: 356 GGDAKDEGHKERVHARGGIPGVFFSYDISPMKVINRETRAKSFSGFLVGVCAVIGGTLTV 415
Query: 362 SGIIDAFIYHGQRAIKK 378
+ +D +Y G++ ++K
Sbjct: 416 AAAVDRMLYEGEQRVRK 432
>gi|407929248|gb|EKG22082.1| protein of unknown function DUF1692 [Macrophomina phaseolina MS6]
Length = 442
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 150/442 (33%), Positives = 222/442 (50%), Gaps = 77/442 (17%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT +GG++T+ S I++L L + E + + +L+VD SR
Sbjct: 5 SRFMRLDAFTKTVEDARVRTSTGGIVTITSIIMILWLIWGEWAEFRQVTVKPELIVDKSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ +++FP +PC +L++D MD+SGE V H + K RL + SR + A
Sbjct: 65 GEKMEIHMNISFPRIPCELLTLDVMDVSGEIQTGVMHGVNKVRLTPENE--GSRPIEVNA 122
Query: 126 PKIDKPLQRHGGRLEH-NETYCGSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNP 180
L H H + YCG CYGA + CCN C++VR+AY W+ +
Sbjct: 123 ------LNLHADEASHMDPDYCGECYGAPAPTTAKKPGCCNTCDDVRDAYAAISWSFTRG 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D ++QC+RE + +++ + EGC + G + VNKV GNFHFAPGKSF +HVHD+ +
Sbjct: 177 DGVEQCEREHYGEKLDAQRREGCRVEGGIRVNKVIGNFHFAPGKSFSNGNMHVHDLENYF 236
Query: 241 RDS--FNISHKINKLAFGEHFPGVV--------------------NPLDGVRWTQETPSG 278
+D + +H+++ L FG P V NPLD + +
Sbjct: 237 KDGAPHSFTHQVHSLRFGPQLPDDVIAKLEASGMSASSLWTNHHINPLDNTEQRTDEKAF 296
Query: 279 MYQYFIKVVPTVY----------TDVSG-------------------HTIQSNQFSVTEH 309
+ YF+KVV T Y + +SG +I+++Q+SVT H
Sbjct: 297 NFMYFVKVVSTAYLPLGWENKGSSSLSGLLPDADRAPLGSYGLASGEGSIETHQYSVTSH 356
Query: 310 FRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 356
RS G RL +PGVFF YD+SP+KV E SF FL VCA++G
Sbjct: 357 KRSLAGGNDEKDGHKERLHARGGIPGVFFSYDISPMKVINRESRAKSFSGFLVGVCAVIG 416
Query: 357 GVFTVSGIIDAFIYHGQRAIKK 378
G TV+ ID +Y G +KK
Sbjct: 417 GTLTVAAAIDRALYEGSTKLKK 438
>gi|298708525|emb|CBJ49158.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 467
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 151/435 (34%), Positives = 231/435 (53%), Gaps = 69/435 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLY--LNAVTETKLLVDTSR 65
I+ LD Y +++ED RT +G +T+ ++M++L E++ Y + A TE +++VD+S
Sbjct: 44 IKQLDVYARVDEDLQVRTEAGAAVTIGFWVLMVVLCVGEVQAYRKVQAPTE-RVVVDSSM 102
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
G+ LRIN D+TF ++PC + VDAMD++G+ +D+ H ++K+RLD G+ I +
Sbjct: 103 GQKLRINIDMTFHSIPCLDVHVDAMDVAGDNQIDIDHGMWKQRLDPDGSAIGEAFMEVPG 162
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN-PDLID 184
D P Q E YCGSC+GA+ CCN C +V +AY KGW++ + +
Sbjct: 163 EVDDDPAQ------SLPEDYCGSCFGAKKG---CCNMCRDVVDAYTAKGWSVQDIRRTAE 213
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
QC R+ ++ GEGCN+ GF+ VNKV+GNFH A G+ + G HVH Q F
Sbjct: 214 QCIRDNHIE-TPIVNGEGCNLSGFMSVNKVSGNFHVATGEGVMREGRHVHLYTLEQAVGF 272
Query: 245 NISHKINKLAFGEHFPGV-VNPLDGVRWT--QETPSGMYQYFIKVVPTVY-----TDVSG 296
N SH IN L+F E +PG+ NPLD ++ +G +QY+IK+VPT++ ++ SG
Sbjct: 273 NTSHSINLLSFWEPYPGMKPNPLDRTSRIIDEDVGTGAFQYYIKLVPTMHSLSPQSEASG 332
Query: 297 HTIQ---------------SNQFSVTEHFRS--------------------SEQGRLQT- 320
+ ++QF+ T FRS +E+G Q
Sbjct: 333 SPLPKGKGEEAERQQQSSLTSQFTYTYKFRSLKGLTEYHTDHEEGEEQAKEAEKGLTQDG 392
Query: 321 ----------LPGVFFFYDLSPIKV-TFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
LPGVFF YD+SP V E F H L +CA+ GG F +SGI+D+ +
Sbjct: 393 GVNSIVNSALLPGVFFVYDVSPFMVEVVPAEQPPFSHLLIRLCAVAGGAFAISGIVDSAV 452
Query: 370 YHGQRAIKKKIEIGK 384
+H +++ +GK
Sbjct: 453 FHLSNRLRRHGVLGK 467
>gi|154280410|ref|XP_001541018.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150412961|gb|EDN08348.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 435
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 153/432 (35%), Positives = 215/432 (49%), Gaps = 64/432 (14%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT GGV+T+ + V+ L + E Y V +L+VD R
Sbjct: 5 SRFARLDAFTKTVEDARIRTRLGGVVTISALFVIFFLIWGEWSEYRRIVVLPELVVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ +VTFP LPC +L++D MDISGE V H + K RL S +E +
Sbjct: 65 GERMEIHLNVTFPNLPCELLTLDVMDISGEYQTGVIHGVNKVRLSS----VEEGGRVLDI 120
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ Q + G + + YCG CYGA + CCN CEEVR+AY KGWA +
Sbjct: 121 TALQLHSQTNKG-TDVDPDYCGQCYGATPPSNAKKPGCCNTCEEVRDAYAAKGWAFGRGE 179
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
++QC++EG+ + + EGC + G + VNKV GNFH APG+SF +H HD+ +
Sbjct: 180 NVEQCEKEGYSANLDAQRKEGCRVEGVIRVNKVVGNFHIAPGRSFTNGNLHAHDLDNYYH 239
Query: 242 DSF--NISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVV 287
N+ H+++ L FG P + NPLD P + YF+KVV
Sbjct: 240 TPVQHNMGHRVHYLRFGPQLPEELSSRWKWTDNHHTNPLDNTEQHTTNPRFNFIYFVKVV 299
Query: 288 PTVYTDV--------SGH--------------------TIQSNQFSVTEHFRSSEQG--- 316
T Y + S H +I+++Q+SVT H RS + G
Sbjct: 300 STSYLPLGWDPDASSSAHSKYSKNAPLGKQGLSFGSYGSIETHQYSVTSHKRSVDGGDDS 359
Query: 317 ------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIID 366
RL + +PGVF YD+SP+KV E SF FLT VCA++GG TV+ ID
Sbjct: 360 AEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKSFSGFLTGVCAVIGGTLTVAAAID 419
Query: 367 AFIYHGQRAIKK 378
+Y G +KK
Sbjct: 420 RVLYEGAVRVKK 431
>gi|398398231|ref|XP_003852573.1| hypothetical protein MYCGRDRAFT_72288 [Zymoseptoria tritici IPO323]
gi|339472454|gb|EGP87549.1| hypothetical protein MYCGRDRAFT_72288 [Zymoseptoria tritici IPO323]
Length = 435
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 151/445 (33%), Positives = 218/445 (48%), Gaps = 85/445 (19%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ ++ LDA+ K ED RT SGG +T+ S ++++ L + E Y + +++VD
Sbjct: 3 VKSRFTKLDAFSKTVEDARIRTTSGGFVTVFSMLLIIWLAWGEWSDYRRITIQPEIIVDK 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
+RGE + I+ +VTFP +PC +L++D MD+SG+ V H I K RL +
Sbjct: 63 ARGEKMEIHLNVTFPRIPCELLTLDVMDVSGDVQTGVLHGIVKTRLKPESE--------- 113
Query: 124 GAPKIDKPLQRHGGRLEHNET----------YCGSCYGA----ESSDEDCCNNCEEVREA 169
G IDK GRL+ NE YCG CYGA + CCN C EVREA
Sbjct: 114 GGGDIDK------GRLQVNEVEEAAKHLARDYCGDCYGAPPPANAIKSGCCNTCAEVREA 167
Query: 170 YRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS 229
Y W+ + ++QC RE + + + E+ EGC + G + VNKV GNFHFAPGKSF
Sbjct: 168 YASVSWSFGRGENVEQCTREHYSEHLDEQRKEGCRVDGVIRVNKVVGNFHFAPGKSFSNG 227
Query: 230 GVHVHDILAFQRDS--FNISHKINKLAFGEHFPGV-----------------VNPLDGVR 270
+HVHD+ + SH I+ L FG P ++PLDG R
Sbjct: 228 NMHVHDLENYLTGGGDHTPSHIIHHLRFGPLLPESYKHRVRDTERHWSNNHHLSPLDGFR 287
Query: 271 WTQETPSGMYQYFIKVVPTVYTDVS------------------------GHTIQSNQFSV 306
+ Y YF+KVVPT Y + G +I+++Q+SV
Sbjct: 288 QETNEKAYNYMYFVKVVPTAYLPLGYENLPSVGDYPHEHAHVGEYGISHGSSIETHQYSV 347
Query: 307 TEHFR------SSEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCA 353
T H R ++++G + L PGVFF YD+SP+KV E SF FL +C
Sbjct: 348 TSHKRHLGGGDANDEGHKERLHARGGIPGVFFSYDISPMKVIDREVRAKSFSSFLVGICG 407
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKK 378
++GG TV+ +D + G + +KK
Sbjct: 408 VLGGTLTVAAAVDRIWFEGTQRVKK 432
>gi|219111025|ref|XP_002177264.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411799|gb|EEC51727.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 404
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 141/406 (34%), Positives = 233/406 (57%), Gaps = 33/406 (8%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD + ++++ D + ++++F T G V+++V+ + + L ++ + K+
Sbjct: 1 MD-LKDRLKRFDTHSPVSKEFRVYTVQGAVLSIVTLVFVGYLVTADFFFNFQVTLQEKVH 59
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLDVKHDIFKKRLDSQGN--- 114
V+ S + + FDV+ P +PCS LS+DA D +G++ HLD H ++K R+ N
Sbjct: 60 VNASSPSGIELEFDVSLPDVPCSKLSIDANDPNGQKQSLHLDTDHHVWKHRITLLPNGHR 119
Query: 115 --VIESRQDGIGAPKI-DKPLQRHGGRLEHNE---------TYCGSCYGAESSDEDCCNN 162
+ E + +G+ + +K L+ L++ + T CG CYGA E CC +
Sbjct: 120 QLLGERSKLELGSTLLTEKDLEVKAEELQNAKDNSESRTEMTPCGDCYGAGEEGE-CCKS 178
Query: 163 CEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAP 222
CE+V+ AY+++GW+L + + QC+RE I E EGEGCN++G + ++ GN H AP
Sbjct: 179 CEDVKRAYKRRGWSLRDTSGVSQCRRE---SGIAEAEGEGCNVHGVVALSSGGGNLHIAP 235
Query: 223 GKSFHQS---GVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM 279
G+ + G+++ D L +N+SH+I+KL FG+ +P V LDG T GM
Sbjct: 236 GRDTEANFPGGMNIFDALLQSFHQWNVSHQIHKLRFGKDYPAGVYQLDGETRTITDGYGM 295
Query: 280 YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ------TLPGVFFFYDLSPI 333
YQY+ +VVPT YT ++G TIQ++Q+SVTEH R G + +PG+FFFY++SP+
Sbjct: 296 YQYYFQVVPTRYTFLNGTTIQTHQYSVTEHLRHVSPGSNRGYSLNSRMPGIFFFYEVSPL 355
Query: 334 KVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
V E + ++ FLT+VCAIVGGV T++G+ID I+ Q + ++
Sbjct: 356 HVDIMEVYQKGWIAFLTSVCAIVGGVVTIAGLIDHVIFSRQHSSRE 401
>gi|449549110|gb|EMD40076.1| hypothetical protein CERSUDRAFT_132878 [Ceriporiopsis subvermispora
B]
Length = 1001
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 140/394 (35%), Positives = 212/394 (53%), Gaps = 41/394 (10%)
Query: 19 EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFP 78
ED +T +G ++T++S+ ++L E Y +T + VD SRGE L + +VTFP
Sbjct: 598 EDVKVKTRTGALLTILSAAIILAFTTIEFFDYRRVNVDTSIQVDKSRGEKLTVKMNVTFP 657
Query: 79 ALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-PLQRHGG 137
+PC +LS+D MDISGE D+ H+I K RL +G + + IDK QR GG
Sbjct: 658 RVPCYLLSLDVMDISGETQTDISHNIIKTRLTEKGLPVPNAASSELRNDIDKLNEQRQGG 717
Query: 138 RLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 197
G CCN+CE+VR+AY +GW+ + P+ I+QC EG+ +++K+
Sbjct: 718 YCGSCYGGVEPAGG-------CCNSCEDVRQAYVNRGWSFNRPEGIEQCVDEGWSEKLKD 770
Query: 198 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLA 254
+ EGCNI G + VNKV GN H +PG+SF +++D++ + +D N SH I++ A
Sbjct: 771 QANEGCNIAGRVRVNKVVGNIHLSPGRSFRSGSQNLYDLVPYLKDDGNRHDFSHTIHEFA 830
Query: 255 F-GEHFPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
F G+ ++ NPLDG M+QYF+KVV T + + G
Sbjct: 831 FEGDDEYDILKAKSGKEMRRRMGIEGNPLDGAIGRTSKQQYMFQYFLKVVSTQFRTLDGM 890
Query: 298 TIQSNQFSVTEHFRSSEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSF 344
++ +NQ+S T R G+ + +PG FF Y++SPI ++ E SF
Sbjct: 891 SVNTNQYSATHFERDLTAGQQEKDQAGLHVAHTSVGIPGAFFNYEISPILISHAESRQSF 950
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
HFLT+ CAIVGGV TV+ +ID+ ++ R +KK
Sbjct: 951 AHFLTSTCAIVGGVLTVASLIDSVLFVAGRTLKK 984
>gi|349804919|gb|AEQ17932.1| putative ergic and golgi 3 [Hymenochirus curtipes]
Length = 228
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 120/231 (51%), Positives = 162/231 (70%), Gaps = 5/231 (2%)
Query: 95 EQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAES 154
EQ LDV+H++FK RLD + S + K ++P+ L+ + C SCYGAE+
Sbjct: 1 EQQLDVEHNLFKLRLDKDRQPVSSEAERHDLGKAEEPVIFDPKSLDPDR--CESCYGAET 58
Query: 155 SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKV 214
D CCN+C++VREAYR++GWA PD I+QCKREGF Q+++E++ EGC +YGFLEVNKV
Sbjct: 59 DDFRCCNSCDDVREAYRRRGWAFKTPDSIEQCKREGFSQKMQEQKNEGCRVYGFLEVNKV 118
Query: 215 AGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE 274
AGNFHFAPGKSF QS VHVHD+ +F D+ N++H+I L+FG +PG+VNPLDG +
Sbjct: 119 AGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHEIKHLSFGMDYPGLVNPLDGTSVSAV 178
Query: 275 TPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 323
S M+QYF+K+VPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 179 QSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKVT-NGLIGDQGLPG 228
>gi|353237029|emb|CCA69011.1| related to ERV46-component of copii vesicles [Piriformospora indica
DSM 11827]
Length = 428
Score = 247 bits (630), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 136/407 (33%), Positives = 222/407 (54%), Gaps = 35/407 (8%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ + +DA+ + +ED +T +G +TL+S+ + F E + +T ++VD
Sbjct: 4 GVFGAFKGIDAFGRTSEDVKVKTRTGAFLTLISAFFIATFTFIEFMDFRRVGVDTAIVVD 63
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
SRGE L++ F++TFP +PC +L++D DISG+ ++ H + K RLD + + DG
Sbjct: 64 RSRGEKLQVVFNITFPRVPCFLLNLDVTDISGDVVREITHHVVKTRLDPAAH--QPIPDG 121
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
I + L + ++ YCGSCYG + + CCN C++VR AY +GWA NPD
Sbjct: 122 IYRTDLKSDLSKQ--LTATSKGYCGSCYGGQPPEGGCCNTCDDVRRAYTDRGWAFGNPDQ 179
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
IDQC E + ++I + EGCNI G + VNKV GN F+PG+SF + V+ ++ + +D
Sbjct: 180 IDQCVSENWTEKIMAMQREGCNIEGRVRVNKVTGNMQFSPGRSFVVNRPEVYALVPYLKD 239
Query: 243 SFN-ISHKINKLAFGEH---------FPGVVN--------PLDGVRWTQETPSGMYQYFI 284
S + H I+ L ++ P + PL+ V E+ M+QYF+
Sbjct: 240 SNHFFGHHIHSLEIYDYEEDTWTRRNLPEQIKERLGITKPPLEDVYAHTESADYMFQYFL 299
Query: 285 KVVPTVYTDVSGHTIQSNQFSVTEHFR--------SSEQG-----RLQTLPGVFFFYDLS 331
KVV + Y + G ++Q+S + R +E G Q +PGVFF +++S
Sbjct: 300 KVVKSSYKGLDGKAYSTHQYSTSSFERDLATMSHGKNEDGIEIVHERQGVPGVFFNFEIS 359
Query: 332 PIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
P++V E+ S+ HF+T++ AI+GGV TV+ ++DA +++ Q IKK
Sbjct: 360 PMEVIHIEQRQSWAHFITSMAAIIGGVLTVATLVDALLFNTQGLIKK 406
>gi|342183042|emb|CCC92522.1| unnamed protein product [Trypanosoma congolense IL3000]
gi|343474271|emb|CCD14057.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 401
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 146/408 (35%), Positives = 217/408 (53%), Gaps = 37/408 (9%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + LD +PK + +D RT GGV+++ S + + LL E+R +L V + ++
Sbjct: 1 MKRFSRLDVFPKFDARFEQDARQRTALGGVLSIASMVTIALLIIGEVRYFLTTVEQHEMY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD G T+ + ++TFP +PC +++ DA+D GE D+ D K R+DS
Sbjct: 61 VDPRIGGTMHVVINITFPRVPCDLMTADAIDAFGEYVEDMGRDTVKMRVDS--------- 111
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D + +PL + + C SCYGAE + DCC+ C++VR A+ ++ W
Sbjct: 112 DTLAPLGEARPLVNMNKKATSDTHDCPSCYGAEKNPGDCCHTCDDVRRAFAERQWEFHED 171
Query: 181 DL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
D+ I QC +E EGCN++ V +V GN HF PG+ F+ G H+H
Sbjct: 172 DVSIMQCAKERLQMAASTASREGCNLHSSFSVPRVTGNIHFVPGRMFNFFGQHLHSFKGE 231
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQ--ETPS----GMYQYFIKVVPTVYT- 292
N+SH I+ L FGE FPG NPLDG+ T+ E PS G + YF+KVVPT+Y
Sbjct: 232 TIQRLNLSHIIHTLEFGERFPGQKNPLDGMVNTRGVENPSEDLIGRFAYFVKVVPTLYQV 291
Query: 293 ---DVSGHTIQSNQFSVTEHFRSS-----------EQGRLQTLPGVFFFYDLSPIKVTFT 338
SG ++SNQ+SVT HF +S + +PGVF YD+SPI+V+
Sbjct: 292 KTLMSSGRVVESNQYSVTHHFTASWDAADQNNQTNRDANPRVVPGVFVSYDISPIRVSVK 351
Query: 339 EEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
H S +H + +CA+ GGV+TV G+ID+ +H R +++KI GK
Sbjct: 352 RTHPYPSVVHLVLQLCAVGGGVYTVVGLIDSMFFHSIRRVQEKINRGK 399
>gi|342183032|emb|CCC92512.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 401
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 146/408 (35%), Positives = 217/408 (53%), Gaps = 37/408 (9%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + LD +PK + +D RT GGV+++ S + + LL E+R +L V + ++
Sbjct: 1 MKRFSRLDVFPKFDARFEQDARQRTALGGVLSIASMVTIALLIIGEVRYFLTTVEQHEMY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD G T+ + ++TFP +PC +++ DA+D GE D+ D K R+DS
Sbjct: 61 VDPRIGGTMHVVINITFPRVPCDLMTADAIDAFGEYVEDMGRDTVKMRVDS--------- 111
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D + +PL + + C SCYGAE + DCC+ C++VR A+ ++ W
Sbjct: 112 DTLAPLGEARPLVNMNKKATSDTHDCPSCYGAEKNPGDCCHTCDDVRRAFAERQWEFHED 171
Query: 181 DL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
D+ I QC +E EGCN++ V +V GN HF PG+ F+ G H+H
Sbjct: 172 DVSIMQCAKERLQMAASTASREGCNLHSSFSVPRVTGNIHFVPGRMFNFFGQHLHSFKGE 231
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQ--ETPS----GMYQYFIKVVPTVYT- 292
N+SH I+ L FGE FPG NPLDG+ T+ E PS G + YF+KVVPT+Y
Sbjct: 232 TIQRLNLSHIIHTLEFGERFPGQKNPLDGMVNTRGVENPSEDLIGRFAYFVKVVPTLYQV 291
Query: 293 ---DVSGHTIQSNQFSVTEHFRSS-----------EQGRLQTLPGVFFFYDLSPIKVTFT 338
SG ++SNQ+SVT HF +S + +PGVF YD+SPI+V+
Sbjct: 292 RTLMSSGRVVESNQYSVTHHFTASWDAADQNNQTNRDANPRVVPGVFVSYDISPIRVSVK 351
Query: 339 EEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
H S +H + +CA+ GGV+TV G+ID+ +H R +++KI GK
Sbjct: 352 RTHPYPSVVHLVLQLCAVGGGVYTVVGLIDSMFFHSIRRVQEKINRGK 399
>gi|30686584|ref|NP_188868.2| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|13877821|gb|AAK43988.1|AF370173_1 unknown protein [Arabidopsis thaliana]
gi|51969000|dbj|BAD43192.1| unknown protein [Arabidopsis thaliana]
gi|51970108|dbj|BAD43746.1| unknown protein [Arabidopsis thaliana]
gi|51970556|dbj|BAD43970.1| unknown protein [Arabidopsis thaliana]
gi|51970734|dbj|BAD44059.1| unknown protein [Arabidopsis thaliana]
gi|62319967|dbj|BAD94071.1| hypothetical protein [Arabidopsis thaliana]
gi|332643097|gb|AEE76618.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 354
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 142/384 (36%), Positives = 219/384 (57%), Gaps = 43/384 (11%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +RS+DA+P+ + +T SG V+++V ++M LF EL YLN +T ++ VD
Sbjct: 2 GVKQALRSIDAFPRAEDHLLQKTQSGAVVSIVGLLIMATLFLHELSYYLNTLTVHQMSVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQ 120
RGETL I+ ++TFP+LPC +LSVDA+D+SG+ +D+ +I+K RL+S G++I E
Sbjct: 62 LKRGETLPIHVNMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGHIIGTEYIS 121
Query: 121 DGI--GAPKIDKPLQRHGGRLEH-NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
D + G P +H G+ EH NET EA G+
Sbjct: 122 DLVEKGHEHGHSP-HKHDGKEEHKNETET---------------------EALNILGF-- 157
Query: 178 SNPDLIDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
DQ E ++++K+ +GEGC +YG L+V +VAGNFH S H ++V
Sbjct: 158 ------DQAA-ETMIKKVKQALADGEGCRVYGVLDVQRVAGNFHI----SVHGLNIYVAQ 206
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
++ + N+SH I+ L+FG +PG+ NPLD SG ++Y+IK+VPT Y +S
Sbjct: 207 MIFGGSKNVNVSHMIHDLSFGPKYPGIHNPLDDTNRILHDTSGTFKYYIKIVPTEYRYLS 266
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+ +NQ+SVTE+F + +T P V+F YDLSPI VT EE SFLH +T +CA++
Sbjct: 267 KDVLSTNQYSVTEYFTPMTEFD-RTWPAVYFLYDLSPITVTIKEERRSFLHLITRLCAVL 325
Query: 356 GGVFTVSGIIDAFIYHGQRAIKKK 379
GG F ++G++D +++ + KK
Sbjct: 326 GGTFALTGMLDRWMFRFIESFNKK 349
>gi|296811622|ref|XP_002846149.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma otae CBS 113480]
gi|238843537|gb|EEQ33199.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma otae CBS 113480]
Length = 435
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 153/435 (35%), Positives = 217/435 (49%), Gaps = 70/435 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGV+T+ + ++++ L + E + Y V + +L+VD R
Sbjct: 5 SRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVIQPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP LPC +L++D MD+SGE DV H + K RL S G VI+
Sbjct: 65 GERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRLSSAAEGGKVIDVTALD 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYG----AESSDEDCCNNCEEVREAYRKKGWALS 178
+ K D P L+ N YCG+CYG + + CCN C EVR+AY +K WA
Sbjct: 125 L-HKKDDSP-----AHLDPN--YCGNCYGVPAPSTAKKPGCCNTCAEVRDAYAEKNWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ + QC EG+ QRI E+ EGC I G L VNKVAGNFH APG+S H HD+
Sbjct: 177 RGEGVTQCMDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDN 236
Query: 239 FQRDSF--NISHKINKLAFGEHFP------------GVVNPLDGVRWTQETPSGMYQYFI 284
+ ++H I+KL FG P +NPLD + + YF+
Sbjct: 237 YYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHRTDEVRYNFLYFV 296
Query: 285 KVVPTVYTDV----------------------------SGHTIQSNQFSVTEHFRS---- 312
KVV T Y + S +I+++Q+SVT H RS
Sbjct: 297 KVVSTSYLPLGWDATWSSEVHSQAHKDIPLGNHGVYFGSQGSIETHQYSVTSHKRSLDGG 356
Query: 313 --SEQGRLQT------LPGVFFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSG 363
S +G + +P V F Y++SP+KV E L F T VCA++GG TV+
Sbjct: 357 DDSAEGHKERQYARGGIPSVMFNYEISPMKVINRETRPKSLSTFFTGVCAVIGGTLTVAA 416
Query: 364 IIDAFIYHGQRAIKK 378
+D +Y G +KK
Sbjct: 417 AVDRLLYEGSLRVKK 431
>gi|302511557|ref|XP_003017730.1| hypothetical protein ARB_04613 [Arthroderma benhamiae CBS 112371]
gi|291181301|gb|EFE37085.1| hypothetical protein ARB_04613 [Arthroderma benhamiae CBS 112371]
Length = 435
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 152/435 (34%), Positives = 213/435 (48%), Gaps = 70/435 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGV+T+ + ++++ L + E + Y V + +L+VD R
Sbjct: 5 SRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP LPC +L++D MD+SGE DV H + K RL S G VI+
Sbjct: 65 GERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRLSSAAEGGRVIDVTALA 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYG----AESSDEDCCNNCEEVREAYRKKGWALS 178
+ K D P L+ N YCG CYG + + CCN C+EVR+AY +K WA
Sbjct: 125 LHK-KEDSP-----AHLDPN--YCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ + QC EG+ QRI E+ EGC I G L VNKVAGNFH APG+S H HD+
Sbjct: 177 RGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDN 236
Query: 239 FQRDSF--NISHKINKLAFGEHFP------------GVVNPLDGVRWTQETPSGMYQYFI 284
+ ++H I+KL FG P +NPLD + YF+
Sbjct: 237 YYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEVRYNFLYFV 296
Query: 285 KVVPTVYTDV----------------------------SGHTIQSNQFSVTEHFRS---- 312
KVV T Y + S +I+++Q+SVT H RS
Sbjct: 297 KVVSTSYLPLGWDPTLSSEAHSQAHRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAE 356
Query: 313 --------SEQGRLQTLPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSG 363
Q +P V F Y++SP+KV E S F T VCA++GG TV+
Sbjct: 357 DASADGHKERQHARGGIPSVMFNYEISPMKVINRETRPKSLSAFFTGVCAVIGGTLTVAA 416
Query: 364 IIDAFIYHGQRAIKK 378
+D +Y G +KK
Sbjct: 417 AVDRLLYEGSLRVKK 431
>gi|302666755|ref|XP_003024974.1| hypothetical protein TRV_00895 [Trichophyton verrucosum HKI 0517]
gi|291189052|gb|EFE44363.1| hypothetical protein TRV_00895 [Trichophyton verrucosum HKI 0517]
Length = 435
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 152/435 (34%), Positives = 213/435 (48%), Gaps = 70/435 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGV+T+ + ++++ L + E + Y V + +L+VD R
Sbjct: 5 SRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP LPC +L++D MD+SGE DV H + K RL S G VI+
Sbjct: 65 GERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRLSSAAEGGRVIDVTALA 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYG----AESSDEDCCNNCEEVREAYRKKGWALS 178
+ K D P L+ N YCG CYG + + CCN C+EVR+AY +K WA
Sbjct: 125 LHK-KEDSP-----AHLDPN--YCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ + QC EG+ QRI E+ EGC I G L VNKVAGNFH APG+S H HD+
Sbjct: 177 RGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDN 236
Query: 239 FQRDSF--NISHKINKLAFGEHFP------------GVVNPLDGVRWTQETPSGMYQYFI 284
+ ++H I+KL FG P +NPLD + YF+
Sbjct: 237 YYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEVRYNFLYFV 296
Query: 285 KVVPTVYTDV----------------------------SGHTIQSNQFSVTEHFRS---- 312
KVV T Y + S +I+++Q+SVT H RS
Sbjct: 297 KVVSTSYLPLGWDPTLSSEAHSQAHRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAE 356
Query: 313 --------SEQGRLQTLPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSG 363
Q +P V F Y++SP+KV E S F T VCA++GG TV+
Sbjct: 357 DASADGHKERQHSRGGIPSVMFNYEISPMKVINRETRPKSLSAFFTGVCAVIGGTLTVAA 416
Query: 364 IIDAFIYHGQRAIKK 378
+D +Y G +KK
Sbjct: 417 AVDRLLYEGSLRVKK 431
>gi|297830940|ref|XP_002883352.1| hypothetical protein ARALYDRAFT_479742 [Arabidopsis lyrata subsp.
lyrata]
gi|297329192|gb|EFH59611.1| hypothetical protein ARALYDRAFT_479742 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 141/384 (36%), Positives = 219/384 (57%), Gaps = 43/384 (11%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +RS+DA+P+ + +T SG V+++V ++M LF EL YLN +T ++ VD
Sbjct: 2 GVKQALRSIDAFPRAEDHLLQKTQSGAVVSIVGLLIMATLFLHELSYYLNTLTVHQMSVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQ 120
RGETL I+ ++TFP+LPC +LSVDA+D+SG+ +D+ +I+K RL+S G++I E
Sbjct: 62 LKRGETLPIHVNMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGHIIGTEYIS 121
Query: 121 DGI--GAPKIDKPLQRHGGRLEH-NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
D + G P +H G+ EH NET EA G+
Sbjct: 122 DLVEKGHEHGHSP-HKHDGKEEHKNETET---------------------EALNILGF-- 157
Query: 178 SNPDLIDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
DQ E ++++K+ +GEGC +YG L+V +VAGNFH S H ++V
Sbjct: 158 ------DQAA-ETMIKKVKQALADGEGCRVYGVLDVQRVAGNFHI----SVHGLNIYVAQ 206
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
++ + N+SH I+ L+FG +PG+ NPLD SG ++Y+IK+VPT Y +S
Sbjct: 207 MIFGGSKNVNVSHMIHDLSFGPKYPGIHNPLDDTNRILHDTSGTFKYYIKIVPTEYRYLS 266
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+ +NQ+SVTE++ + +T P V+F YDLSPI VT EE SFLH +T +CA++
Sbjct: 267 KDVLSTNQYSVTEYYTPMTEFD-RTWPAVYFLYDLSPITVTIKEERRSFLHLITRLCAVL 325
Query: 356 GGVFTVSGIIDAFIYHGQRAIKKK 379
GG F ++G++D +++ + KK
Sbjct: 326 GGTFALTGMLDRWMFRLIESFNKK 349
>gi|451849936|gb|EMD63239.1| hypothetical protein COCSADRAFT_38106 [Cochliobolus sativus ND90Pr]
Length = 437
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 146/437 (33%), Positives = 217/437 (49%), Gaps = 72/437 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++T+VS +V+ L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVEDARVRTTSGGIVTIVSLLVIFWLTWGEWADYRRVTVRPELVVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I +++FP +PC ++++D MD+SGE + V H I K RL + ++G
Sbjct: 65 GERMEIALNISFPRVPCELITLDVMDVSGELQMGVTHGINKVRLSPE-------REGSKT 117
Query: 126 PKIDKPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNP 180
+I K L H H YCG C+GA + CCN C+EVR+AY W+
Sbjct: 118 IEI-KALDLHADEASHLAPDYCGECFGAPPPANAKKPGCCNTCDEVRDAYASISWSFGRG 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ ++QC+RE + + + E+ EGC + G + VNKV GNFH APGKSF +HVHD+ +
Sbjct: 177 EGVEQCEREHYAEHLDEQRQEGCRLEGSIRVNKVVGNFHIAPGKSFSNGNMHVHDLENYF 236
Query: 241 RDSF--NISHKINKLAFGEHFPGVV---------------------NPLDGVRWTQETPS 277
+D + +HKI++L FG VV NPLD + +
Sbjct: 237 KDEYAHTFTHKIHQLRFGPQLSDVVIQGIQDKHRGSGPGSWSNHHINPLDNTEQHTDEKA 296
Query: 278 GMYQYFIKVVPTVYTDVSGH-----------------------TIQSNQFSVTEHFRSSE 314
+ YFIKVV T Y + +I+++Q+SVT H R+ +
Sbjct: 297 FNFMYFIKVVSTAYLPLGWEDAAPRLTKHDELLGSTIDATHKGSIETHQYSVTSHKRNLK 356
Query: 315 QGRLQT------------LPGVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTV 361
G + +PGVFF YD+SP+KV E +F FL +CA++GG TV
Sbjct: 357 GGNDEKDGHKERVHARGGIPGVFFSYDISPMKVINREVREKTFSGFLVGLCAVIGGTLTV 416
Query: 362 SGIIDAFIYHGQRAIKK 378
+ +D +Y G IKK
Sbjct: 417 AAAVDRALYEGVNRIKK 433
>gi|327296796|ref|XP_003233092.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
gi|326464398|gb|EGD89851.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
Length = 435
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 152/435 (34%), Positives = 213/435 (48%), Gaps = 70/435 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGGV+T+ + ++++ L + E + Y V + +L+VD R
Sbjct: 5 SRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP LPC +L++D MD+SGE DV H + K RL S G VI+
Sbjct: 65 GERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRLSSAAEGGRVIDVTALS 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYG----AESSDEDCCNNCEEVREAYRKKGWALS 178
+ K D P L+ N YCG CYG + + CCN C+EVR+AY +K WA
Sbjct: 125 LHK-KEDSP-----AHLDPN--YCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ + QC EG+ QRI E+ EGC I G L VNKVAGNFH APG+S H HD+
Sbjct: 177 RGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDN 236
Query: 239 FQRDSF--NISHKINKLAFGEHFP------------GVVNPLDGVRWTQETPSGMYQYFI 284
+ ++H I+KL FG P +NPLD + YF+
Sbjct: 237 YYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEVRYNFLYFV 296
Query: 285 KVVPTVYTDV----------------------------SGHTIQSNQFSVTEHFRS---- 312
KVV T Y + S +I+++Q+SVT H RS
Sbjct: 297 KVVSTSYLPLGWDPTLSSEAHSQAHRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAE 356
Query: 313 --------SEQGRLQTLPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSG 363
Q +P V F Y++SP+KV E S F T VCA++GG TV+
Sbjct: 357 DASADGHKERQHARGGIPSVMFNYEISPMKVINRETRPKSLSAFFTGVCAVIGGTLTVAA 416
Query: 364 IIDAFIYHGQRAIKK 378
+D +Y G +KK
Sbjct: 417 AVDRLLYEGSLRVKK 431
>gi|116181584|ref|XP_001220641.1| hypothetical protein CHGG_01420 [Chaetomium globosum CBS 148.51]
gi|88185717|gb|EAQ93185.1| hypothetical protein CHGG_01420 [Chaetomium globosum CBS 148.51]
Length = 438
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 152/437 (34%), Positives = 219/437 (50%), Gaps = 73/437 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++T+VS +V+ L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVEDARIRTTSGGIVTIVSLVVVFFLAWGEWSDYRRVEVHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDG 122
GE + I+ ++TFP +PC +L++D MDISGEQ V+H + K RL Q G I+++
Sbjct: 65 GERMEIHLNITFPRIPCELLTLDVMDISGEQQHGVQHGVTKTRLRPQSEGGGDIDTKAVA 124
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALS 178
+ A R + +YCG CYGA+ + CCN CEEV++AY + WA
Sbjct: 125 LHA--------RDEVATHLDPSYCGPCYGAQPPPNAKKPGCCNTCEEVKDAYAQAAWAFG 176
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ I+QC+RE + +++ E+ EGC I G L VNKV GNFH APG+SF +HVHD+
Sbjct: 177 RGEGIEQCEREHYSEKLDEQRNEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVHDLKN 236
Query: 239 F--QRDSFNISHKINKLAFGEHFPG-----------------VVNPLDGV---------- 269
+ SH+I+ L FG P NPLD
Sbjct: 237 YWDTPTKHTFSHQIHHLRFGPQLPDNLHKKLDARKNMRGRSTTFNPLDDTPPGDGTTSTT 296
Query: 270 --------------RWT-QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-- 312
RW ++T +G + + + G +++++Q+SVT H RS
Sbjct: 297 TTCTSSRSCPHRTCRWAGRKTWAGFREEHHAELGSFGASADG-SVETHQYSVTSHKRSLA 355
Query: 313 -------SEQGRLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTV 361
Q RL +PGVFF YD+SP+KV EE SFL F+ +CAIVGG TV
Sbjct: 356 GGDDSAEGHQERLHARGGIPGVFFSYDISPMKVINREEKAKSFLGFIAGLCAIVGGTLTV 415
Query: 362 SGIIDAFIYHGQRAIKK 378
+ ID ++ G +KK
Sbjct: 416 AAAIDRALFEGGVRLKK 432
>gi|356547537|ref|XP_003542168.1| PREDICTED: probable endoplasmic reticulum-Golgi intermediate
compartment protein 3-like [Glycine max]
Length = 351
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 137/373 (36%), Positives = 212/373 (56%), Gaps = 35/373 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I++LDA+P+ + +T SG +++++ I+M LF EL YL T K+ VD RGE
Sbjct: 7 IKNLDAFPRAEDHLLQKTQSGALVSVIGLIIMATLFVHELGYYLTTYTVHKMSVDLKRGE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
TL I+ ++TFP+LPC +LSVDA+D+SG+ +D+ +I+K RL+S G++ IG
Sbjct: 67 TLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSYGHI-------IGTEY 119
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
I +++ EH++ + S + N +E
Sbjct: 120 ISDLVEKEHTNQEHDDNKDHDHHHEHSEQKIHLQNLDE---------------------S 158
Query: 188 REGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
E ++++KE + GEGC +YG L+V +VAGNFH S H ++V ++ + N
Sbjct: 159 TENIIKKVKEALKNGEGCRVYGVLDVQRVAGNFHI----SVHGLNIYVAQMIFDGAKNVN 214
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 305
+SH I+ L+FG +PG+ NPLD SG ++Y+IKVVPT Y +S + +NQFS
Sbjct: 215 VSHFIHDLSFGPKYPGLHNPLDDTTRILHDTSGTFKYYIKVVPTEYRYISKEVLPTNQFS 274
Query: 306 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
V+E++ Q +T P V+F YDLSPI VT EE SFLHF+T +CA++GG F V+G++
Sbjct: 275 VSEYYSPINQFD-RTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGML 333
Query: 366 DAFIYHGQRAIKK 378
D ++Y A+ K
Sbjct: 334 DRWMYRLLEALTK 346
>gi|261327856|emb|CBH10834.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 405
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 146/410 (35%), Positives = 216/410 (52%), Gaps = 37/410 (9%)
Query: 5 MNKIRSLDAYPKINEDF----YSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + LD +PK +E F RT GGV+++ S +++ L E+R + ++V + ++
Sbjct: 1 MKGLSRLDVFPKFDERFERDARQRTALGGVLSMASILIITFLVVGEVRYFFSSVEQHEMY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD G + + ++TFP +PC +++ DA+D GE +V D + R++ V
Sbjct: 61 VDPHIGGIMHMKVNITFPRVPCDLMTADAIDAFGEHVENVLTDTARVRVNPDTLV----P 116
Query: 121 DGIGAPKIDKPLQRHGGR-LEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
G P +D Q G EH + C SCYGAES+ DCC+ C++VR A+ ++ W
Sbjct: 117 LGEARPLMDMKKQPADGNGAEHGK--CPSCYGAESNPGDCCHTCDDVRRAFAERQWEFHE 174
Query: 180 PDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
D I QC E EGCN++ V +V GN HF PG+ F+ G H+H
Sbjct: 175 DDASIVQCVHERLKMAAASASTEGCNLHASFSVPRVTGNIHFIPGRMFNFFGQHLHSFKG 234
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDG---VRWTQETPS----GMYQYFIKVVPTVY 291
N+SH ++ L FGE FPG NP+DG VR + PS G + YF+KVVPTVY
Sbjct: 235 ETIQKLNLSHIVHSLEFGERFPGQSNPMDGMANVRGATD-PSEPLIGRFSYFVKVVPTVY 293
Query: 292 TDVS----GHTIQSNQFSVTEHFRSS-----------EQGRLQTLPGVFFFYDLSPIKVT 336
S G ++SNQ+SVT HF S + +PGVF YDLSPI+V+
Sbjct: 294 RIESLVGGGRVVESNQYSVTHHFTPSWETPKGGENNNAKHDPSVVPGVFISYDLSPIRVS 353
Query: 337 FTEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
H S +H + +CA+ GGV+TV+G+ID+ +H R ++ K+ GK
Sbjct: 354 VKRTHPYPSIVHLVLQLCAVGGGVYTVTGLIDSLFFHSIRRMQIKMNRGK 403
>gi|452001785|gb|EMD94244.1| hypothetical protein COCHEDRAFT_1202021 [Cochliobolus
heterostrophus C5]
Length = 437
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 146/438 (33%), Positives = 216/438 (49%), Gaps = 74/438 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++T+VS +V+ L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVEDARIRTTSGGIVTIVSLLVIFWLTWGEWADYRRVTVRPELVVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I +++FP +PC ++++D MD+SGE + V H I K RL + G+
Sbjct: 65 GERMEIALNISFPRVPCELITLDVMDVSGELQMGVTHGINKVRLGPEKE---------GS 115
Query: 126 PKID-KPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSN 179
I+ K L H H YCG C+GA + CCN C+EVR+AY W+
Sbjct: 116 KTIEIKALDLHADEASHLAPDYCGECFGAPPPANAKKPGCCNTCDEVRDAYASISWSFGR 175
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+ ++QC+RE + + + E+ EGC + G + VNKV GNFH APGKSF +HVHD+ +
Sbjct: 176 GEGVEQCEREHYAEHLDEQRQEGCRLEGSIRVNKVVGNFHIAPGKSFSNGNMHVHDLENY 235
Query: 240 QRDSF--NISHKINKLAFGEHFPGVV---------------------NPLDGVRWTQETP 276
+D + +HKI++L FG VV NPLD +
Sbjct: 236 FKDEYAHTFTHKIHQLRFGPQLSDVVIQGIQDKHKGSGPGSWSNHHINPLDNTEQHTDEK 295
Query: 277 SGMYQYFIKVVPTVYTDVSGH-----------------------TIQSNQFSVTEHFRSS 313
+ + YFIKVV T Y + +I+++Q+SVT H R+
Sbjct: 296 AFNFMYFIKVVSTAYLPLGWEDAAPRLTKHDELLGSTIDASHKGSIETHQYSVTSHKRNL 355
Query: 314 EQGRLQT------------LPGVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFT 360
+ G + +PGVFF YD+SP+KV E +F FL +CA++GG T
Sbjct: 356 KGGNDEKDGHKERIHARGGIPGVFFSYDISPMKVINREVREKTFSGFLVGLCAVIGGTLT 415
Query: 361 VSGIIDAFIYHGQRAIKK 378
V+ +D +Y G IKK
Sbjct: 416 VAAAVDRALYEGVNRIKK 433
>gi|72388468|ref|XP_844658.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62360135|gb|AAX80555.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70801191|gb|AAZ11099.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 405
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 146/410 (35%), Positives = 215/410 (52%), Gaps = 37/410 (9%)
Query: 5 MNKIRSLDAYPKINEDFY----SRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + LD +PK +E F RT GGV+++ S ++ L E+R + ++V + ++
Sbjct: 1 MKGLSRLDVFPKFDERFLRDARQRTALGGVLSMASIFIITFLVVGEVRYFFSSVEQHEMY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD G + + ++TFP +PC +++ DA+D GE +V D + R++ V
Sbjct: 61 VDPHIGGIMHMKVNITFPRVPCDLMTADAIDAFGEHVENVLTDTARVRVNPDTLV----P 116
Query: 121 DGIGAPKIDKPLQRHGGR-LEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
G P +D Q G EH + C SCYGAES+ DCC+ C++VR A+ ++ W
Sbjct: 117 LGEARPLMDMKKQPADGNGAEHGK--CPSCYGAESNPGDCCHTCDDVRRAFAERQWEFHE 174
Query: 180 PDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
D I QC E EGCN++ V +V GN HF PG+ F+ G H+H
Sbjct: 175 DDASIVQCVHERLKMAAASASTEGCNLHASFSVPRVTGNIHFIPGRMFNFFGQHLHSFKG 234
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDG---VRWTQETPS----GMYQYFIKVVPTVY 291
N+SH ++ L FGE FPG NP+DG VR + PS G + YF+KVVPTVY
Sbjct: 235 ETIQKLNLSHIVHSLEFGERFPGQSNPMDGMANVRGATD-PSEPLIGRFSYFVKVVPTVY 293
Query: 292 TDVS----GHTIQSNQFSVTEHFRSS-----------EQGRLQTLPGVFFFYDLSPIKVT 336
S G ++SNQ+SVT HF S + +PGVF YDLSPI+V+
Sbjct: 294 RIESLVGGGRVVESNQYSVTHHFTPSWETPKGGENNNAKHDPSVVPGVFISYDLSPIRVS 353
Query: 337 FTEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
H S +H + +CA+ GGV+TV+G+ID+ +H R ++ K+ GK
Sbjct: 354 VKRTHPYPSIVHLVLQLCAVGGGVYTVTGLIDSLFFHSIRRMQIKMNRGK 403
>gi|255712984|ref|XP_002552774.1| KLTH0D01144p [Lachancea thermotolerans]
gi|238934154|emb|CAR22336.1| KLTH0D01144p [Lachancea thermotolerans CBS 6340]
Length = 402
Score = 244 bits (622), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 145/404 (35%), Positives = 218/404 (53%), Gaps = 38/404 (9%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+K+ S+DA+ K ED RT +GG+ITL +V LL SE VT +L+VD R
Sbjct: 4 SKLLSIDAFAKTEEDVRIRTRTGGLITLSCVVVTFLLLLSEWFHLKEVVTRPQLVVDRDR 63
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIG 124
L +N D+TFP +PC +L++D MD +GE L+V + + K RLD G V++++Q G
Sbjct: 64 HLKLDLNMDITFPHIPCYLLNMDIMDSAGEMQLEVLNKGWSKTRLDPSGQVLDTKQFKPG 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGW 175
+D + +E YCG CYGA ++ CC C++VREAY +K W
Sbjct: 124 KDVVDYAPE--------DENYCGPCYGARDQSKNDEVNVDERVCCQTCDDVREAYAEKQW 175
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
A + I+QC+REG+++++ E EGC I G ++N++ GN HFAPGK FH H HD
Sbjct: 176 AFFDGKNIEQCEREGYVEQVNEHIEEGCRIKGMAKLNRIGGNLHFAPGKGFHNIRGHFHD 235
Query: 236 ILAFQRD-SFNISHKINKLAFGEHFPGVVN------PLDGVRWTQE--TPSGMYQYFIKV 286
+Q S N +H I+ L+FG+ + PLDG + E T + YF K+
Sbjct: 236 ASLYQNSPSLNFNHIIHHLSFGKEVEDITGQGASTAPLDGTNVSPEFDTHKHQFSYFAKI 295
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVT 336
VPT Y +SG T+++ QF+ T H R + GR P V+F++++SP+KV
Sbjct: 296 VPTRYEYLSGETVETTQFTTTYHSRPLKGGRDSDHPTTLHSQGGFPSVYFYFEMSPLKVI 355
Query: 337 FTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
+++ S+ F N +GGV V ++D Y QR++ K
Sbjct: 356 NKQQYAQSWSGFWLNCITSIGGVLAVGTVLDKITYKAQRSMWGK 399
>gi|254569250|ref|XP_002491735.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv41p [Komagataella pastoris GS115]
gi|238031532|emb|CAY69455.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv41p [Komagataella pastoris GS115]
gi|328351763|emb|CCA38162.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Komagataella pastoris CBS 7435]
Length = 401
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 140/403 (34%), Positives = 215/403 (53%), Gaps = 33/403 (8%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ SLDA+ K +D +T SGGVITL+ IV L+L +E Y V +L+VD
Sbjct: 5 KLLSLDAFAKTADDVKVKTTSGGVITLICLIVTLILVTNEYFDYQTVVIRPELVVDRDHA 64
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ L I+ +VTF +PC +L++D MDI+G+ +D+ F+K G E+ + +
Sbjct: 65 KKLDISLNVTFHHIPCELLAMDIMDITGDLQIDLLMSGFQKTRVVDGLAKETTELRVNEY 124
Query: 127 KIDKPLQRHGGRL--EHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGW 175
K + +L +N YCGSCYGA + ++ CCN CE V++AY K GW
Sbjct: 125 K------QENNKLTNSNNPYYCGSCYGALNQKDNENKPFDEKLCCNTCESVKKAYAKAGW 178
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
A + I+QC+ EG++Q + EGC + G ++N+V+GN HFAPG S H+HD
Sbjct: 179 AFYDGRNIEQCENEGYVQLVTSMVDEGCQVSGTAQINRVSGNLHFAPGSSLTSGSRHIHD 238
Query: 236 ILAFQR--DSFNISHKINKLAFGEHFPG---VVNPLDGVRWTQETPSGMYQYFIKVVPTV 290
+ F++ D FN H +N L+FG+ +PLDG + +Y YF+KVV T
Sbjct: 239 LSLFEKYPDKFNFDHTVNHLSFGKTIDNQEMSTHPLDGYEAATGNKNHLYSYFLKVVATR 298
Query: 291 YTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEE 340
Y +SG +NQFS T H R E GR +PG FF +++SP+K+ E+
Sbjct: 299 YESMSGLKWDTNQFSATYHDRPLEGGRDSDHPNTLHASGGIPGAFFHFEISPLKIINREQ 358
Query: 341 HV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
+ + F V A V GV T+ ++D I+ + +++K ++
Sbjct: 359 YSKTRSAFALGVSASVAGVLTLGSVLDKTIWTADQILRQKKDL 401
>gi|449299159|gb|EMC95173.1| hypothetical protein BAUCODRAFT_529716 [Baudoinia compniacensis
UAMH 10762]
Length = 435
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 155/435 (35%), Positives = 222/435 (51%), Gaps = 70/435 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ED RT SGG++TL S +++L L + E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVEDARIRTTSGGIVTLASLLLILYLVWGEWADYRRVTVAPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ +++FP +PC +L++D MD+SGE V H + K RL G R+ G A
Sbjct: 65 GEKMEIHMNISFPRVPCELLTLDVMDVSGEVQTGVMHGVNKVRLGEDG-----REVGREA 119
Query: 126 PKIDKPLQRHGGRLEH-NETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNP 180
++ K ++ ++H + YCG CYGA + CCN C EVREAY W+
Sbjct: 120 LELGKEVEE---SMKHMDPEYCGECYGAPAPGNAIRAGCCNTCAEVREAYASVSWSFGRG 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ ++QC+RE + + + E+ EGC I G + VNKV GNFHFAPGKSF +HVHD+ +
Sbjct: 177 ENVEQCEREHYSEHLDEQRREGCRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVHDLENYF 236
Query: 241 RDSFNI----SHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPSGMY 280
I SH I+ L FG P V NPLD + + Y
Sbjct: 237 AGGEGIDHTFSHTIHHLRFGPQLPEDVVRRIGRRGMAWSNHHLNPLDETEQKTDEKAYNY 296
Query: 281 QYFIKVVPTVY------------------TDVSGH------TIQSNQFSVTEHFRS---- 312
YF+KVV T Y ++ G+ +++++Q+SVT H RS
Sbjct: 297 MYFVKVVSTAYLPLGWERTGSILDIPHELVELGGYGKGEAGSVETHQYSVTSHKRSLAGG 356
Query: 313 --SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSG 363
E+G + L PGVFF YD+SP+KV E SF FL VCA++GG TV+
Sbjct: 357 DGGEEGHKERLHARGGIPGVFFSYDISPMKVINREARSKSFSGFLVGVCAVIGGTLTVAA 416
Query: 364 IIDAFIYHGQRAIKK 378
ID +Y G + +KK
Sbjct: 417 AIDRALYEGGQRVKK 431
>gi|344230638|gb|EGV62523.1| DUF1692-domain-containing protein [Candida tenuis ATCC 10573]
Length = 409
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 142/412 (34%), Positives = 227/412 (55%), Gaps = 37/412 (8%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ K+ SLDA+ K ED +T SGG+ITLVS ++L L +E Y + +T +L+VD
Sbjct: 2 VQPKLLSLDAFAKTVEDARVKTASGGIITLVSITIVLFLIRNEYLDYTSIITRPELVVDR 61
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDG 122
+ L I D++FP++PCS++++D +D+SG LD+ + F+K R+ S G + +
Sbjct: 62 DINQKLDITLDISFPSIPCSMINLDILDVSGNVELDILQNGFQKYRILSSGEEVLMKN-- 119
Query: 123 IGAPKIDK-PLQRHGGRLEHNE----TYCGSCYGAESSDED--CCNNCEEVREAYRKKGW 175
AP ID PL+ L+ E T CG CYG+ D CCNNCE +R AY K W
Sbjct: 120 --APLIDSTPLEVMAKGLDKPEDAEHTPCGDCYGSLPQDRKQYCCNNCETIRRAYAAKVW 177
Query: 176 ALSNPDLIDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
A + + I C+ EG+++ I+ E EGC + G ++N+++GN HFAPG SF + HV
Sbjct: 178 AFYDGENIKPCEDEGYVKAIQSEIFNNEGCRVKGTTQINRISGNLHFAPGASFTEPSRHV 237
Query: 234 HDILAFQR--DSFNISHKINKLAFGEHFPGVVN-------PLDGVRWTQETPSGMYQYFI 284
HD+ + + D FN H IN L+FG+ N PLDG + +Y YF+
Sbjct: 238 HDLSLYNKFPDRFNFDHTINHLSFGKDPETNANTDKKTLHPLDGETRNLKEKYHLYSYFL 297
Query: 285 KVVPTVYTDVS---GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLS 331
KVV T Y + +++NQFS H R + G+ + LPG++F++D+S
Sbjct: 298 KVVSTRYEYLQEKLKAPLETNQFSAIYHDRPIKGGKDEDHQHTLHARGGLPGLYFYFDIS 357
Query: 332 PIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
P+K+ E++ ++ F+ V + + GV + ++D ++ ++AI+ K +I
Sbjct: 358 PLKIINKEQYSKTWSGFVLGVISSIAGVLMIGSLLDRSVWAAEKAIRAKKDI 409
>gi|12060847|gb|AAG48265.1|AF308298_1 serologically defined breast cancer antigen NY-BR-84, partial [Homo
sapiens]
Length = 239
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 125/237 (52%), Positives = 163/237 (68%), Gaps = 21/237 (8%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 13 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 72
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S
Sbjct: 73 RGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSE----- 127
Query: 125 APKIDKPLQRHG-GRLE--------HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW 175
+RH G++E + C SCYGAE+ D CCN CE+VREAYR++GW
Sbjct: 128 -------AERHELGKVEVTVFDPDSLDPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGW 180
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
A NPD I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VH
Sbjct: 181 AFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVH 237
>gi|356575088|ref|XP_003555674.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 347
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 134/371 (36%), Positives = 208/371 (56%), Gaps = 35/371 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I++LDA+P+ + +T SG +++++ I+M LF EL YL T ++ VD RGE
Sbjct: 7 IKNLDAFPRAEDHLLQKTQSGALVSVIGLIIMATLFVHELGYYLTTYTVHQMSVDLKRGE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
TL I+ ++TFP+LPC +LSVDA+D+SG+ +D+ +I+K RL+S G++I + +
Sbjct: 67 TLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSYGHIIGTEY---ISDL 123
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
++K H N + ++ DE N ++V+EA +
Sbjct: 124 VEKEHTHHKHDDNKNHEHSEQKIHLQNLDESTENIIKKVKEALKN--------------- 168
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
GEGC +YG L+V +VAGNFH S H ++V ++ + N+S
Sbjct: 169 ------------GEGCRVYGVLDVQRVAGNFHI----SVHGLNIYVAQMIFDGAKNVNVS 212
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
H I+ L+FG +PG+ NPLD SG ++Y+IKVVPT Y +S + +NQFSV+
Sbjct: 213 HFIHDLSFGPKYPGLHNPLDDTTRILHDTSGTFKYYIKVVPTEYRYISKEVLPTNQFSVS 272
Query: 308 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
E++ Q +T P V+F YDLSPI VT EE SFLHF+T +CA++GG F V+G++D
Sbjct: 273 EYYSPINQFD-RTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLDR 331
Query: 368 FIYHGQRAIKK 378
++Y + K
Sbjct: 332 WMYRLLETLTK 342
>gi|344230637|gb|EGV62522.1| hypothetical protein CANTEDRAFT_131007 [Candida tenuis ATCC 10573]
Length = 410
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 142/409 (34%), Positives = 226/409 (55%), Gaps = 37/409 (9%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ SLDA+ K ED +T SGG+ITLVS ++L L +E Y + +T +L+VD
Sbjct: 6 KLLSLDAFAKTVEDARVKTASGGIITLVSITIVLFLIRNEYLDYTSIITRPELVVDRDIN 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA 125
+ L I D++FP++PCS++++D +D+SG LD+ + F+K R+ S G + + A
Sbjct: 66 QKLDITLDISFPSIPCSMINLDILDVSGNVELDILQNGFQKYRILSSGEEVLMKN----A 121
Query: 126 PKIDK-PLQRHGGRLEHNE----TYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALS 178
P ID PL+ L+ E T CG CYG+ D CCNNCE +R AY K WA
Sbjct: 122 PLIDSTPLEVMAKGLDKPEDAEHTPCGDCYGSLPQDRKQYCCNNCETIRRAYAAKVWAFY 181
Query: 179 NPDLIDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+ + I C+ EG+++ I+ E EGC + G ++N+++GN HFAPG SF + HVHD+
Sbjct: 182 DGENIKPCEDEGYVKAIQSEIFNNEGCRVKGTTQINRISGNLHFAPGASFTEPSRHVHDL 241
Query: 237 LAFQR--DSFNISHKINKLAFGEHFPGVVN-------PLDGVRWTQETPSGMYQYFIKVV 287
+ + D FN H IN L+FG+ N PLDG + +Y YF+KVV
Sbjct: 242 SLYNKFPDRFNFDHTINHLSFGKDPETNANTDKKTLHPLDGETRNLKEKYHLYSYFLKVV 301
Query: 288 PTVYTDVSGH---TIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIK 334
T Y + +++NQFS H R + G+ + LPG++F++D+SP+K
Sbjct: 302 STRYEYLQEKLKAPLETNQFSAIYHDRPIKGGKDEDHQHTLHARGGLPGLYFYFDISPLK 361
Query: 335 VTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
+ E++ ++ F+ V + + GV + ++D ++ ++AI+ K +I
Sbjct: 362 IINKEQYSKTWSGFVLGVISSIAGVLMIGSLLDRSVWAAEKAIRAKKDI 410
>gi|401626934|gb|EJS44847.1| erv46p [Saccharomyces arboricola H-6]
Length = 415
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 150/417 (35%), Positives = 218/417 (52%), Gaps = 59/417 (14%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
SLDA+ K ED RT +GG+ITL + L L +E R + + VT +L+VD R L
Sbjct: 8 SLDAFAKTEEDVRVRTKAGGLITLSCILTTLFLLVNEWRQFNSVVTRPQLVVDRDRHAKL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-KRLDSQGNVIESRQD------G 122
+N DVTFP++PC ++++D MD SGE LD+ F R+D G+ + + G
Sbjct: 68 ELNMDVTFPSMPCELVNLDIMDDSGELQLDILDAGFTMTRVDKDGHPVGDATELHVGGNG 127
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGA--ESSDED-------CCNNCEEVREAYRKK 173
GA D P YCG CYGA +S++E+ CC NC+ VR AY K
Sbjct: 128 EGATPNDDP------------NYCGQCYGARDQSNNENLAQEDKVCCQNCDSVRSAYLDK 175
Query: 174 GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS-GVH 232
GWA + I+QC++EG++ +I + EGC I G ++N++ GN HFAPGK F + G H
Sbjct: 176 GWAFFDGKDIEQCEKEGYVNKINDHLHEGCRIEGSAQINRIQGNIHFAPGKPFQDTRGNH 235
Query: 233 VHDILAFQRD-SFNISHKINKLAFGE--------------HFPGVV--NPLDGVRWTQET 275
HD + + N +H IN+L+FG+ H VV +PLDG + +
Sbjct: 236 RHDTSLYDKTPDLNFNHIINRLSFGKPIQSHHKRLGNDKLHGGAVVSTSPLDGRQVFPDR 295
Query: 276 PSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP----------G 323
P+ +Q YF K+VPT Y + I++ QFS T H R GR Q P G
Sbjct: 296 PTHFHQFSYFAKIVPTRYEYLDSTVIETAQFSATYHSRPLGGGRDQDHPNTFHARGGISG 355
Query: 324 VFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
++ F+++SP+KV E+H ++ F+ N +GGV V ++D Y QR+I K
Sbjct: 356 LYVFFEMSPLKVINKEQHGQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|19113757|ref|NP_592845.1| COPII-coated vesicle component Erv46 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|1351651|sp|Q09895.1|YAI8_SCHPO RecName: Full=Uncharacterized protein C24B11.08c
gi|1061296|emb|CAA91773.1| COPII-coated vesicle component Erv46 (predicted)
[Schizosaccharomyces pombe]
Length = 390
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/394 (36%), Positives = 214/394 (54%), Gaps = 35/394 (8%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R DA+ K ED +T SGG+ITLVS ++++ + E Y + +++V+ S G+
Sbjct: 7 LRRFDAFQKTVEDARIKTASGGLITLVSGLIVIFIVLMEWINYRRVIAVHEIIVNPSHGD 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+ INF++TFP +PC IL+VD +D+SGE D+ H + K RL G +I IG
Sbjct: 67 RMEINFNITFPRIPCQILTVDVLDVSGEFQRDIHHTVSKTRLSPSGEIISVDDLDIGN-- 124
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAES-SDED---CCNNCEEVREAYRKKGWALSNPDLI 183
+ + G CG CYGA + ED CCN C+ VR+AY K W + + D
Sbjct: 125 -QQSISDDGA------AECGDCYGAADFAPEDTPGCCNTCDAVRDAYGKAHWRIGDVDAF 177
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QR 241
QCK E F + + ++ EGCN+ G L VN++AGNFH APG+S HVHD + +
Sbjct: 178 KQCKDENFKELYEAQKVEGCNLAGQLSVNRMAGNFHIAPGRSTQNGNQHVHDTRDYINEL 237
Query: 242 DSFNISHKINKLAFGEHFPGVV---NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
D ++SH I+ L+FG V NPLDG T Y+YFIK V + +S T
Sbjct: 238 DLHDMSHSIHHLSFGPPLDASVHYSNPLDGTVKKVSTADYRYEYFIKCVSYQFMPLSKST 297
Query: 299 --IQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV---S 343
I +N+++VT+H RS GR + +PGV+F +D+SP++V E V +
Sbjct: 298 LPIDTNKYAVTQHERSIRGGREEKVPTHVNFHGGIPGVWFQFDISPMRV--IERQVRGNT 355
Query: 344 FLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIK 377
F FL+NV A++GG T++ +D Y Q+ K
Sbjct: 356 FGGFLSNVLALLGGCVTLASFVDRGYYEVQKLKK 389
>gi|74267709|gb|AAI02327.1| ERGIC and golgi 3 [Bos taurus]
Length = 231
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 125/232 (53%), Positives = 161/232 (69%), Gaps = 11/232 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD--- 121
RG+ L+IN +V FP +PC+ LS+DAMD++GEQ LDV+H++FKKRLD G + S +
Sbjct: 64 RGDKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHE 123
Query: 122 -GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G K+ P R C SCYGAE D CCN+CE+VREAYR++GWA NP
Sbjct: 124 LGKVEVKVFDPDSLDPDR-------CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNP 176
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
D I+QC+REGF Q+++E++ EGC +YGFLEVNKVAGNFHFAPGKSF QS VH
Sbjct: 177 DTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVH 228
>gi|224011116|ref|XP_002294515.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970010|gb|EED88349.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 454
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 140/381 (36%), Positives = 206/381 (54%), Gaps = 26/381 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLY--LNAVTETKLLVDTSR 65
++ LD +PK+ D+ RT GG TLV ++ML+L +E + LN + ++VDTS
Sbjct: 74 VKKLDFFPKLERDYEVRTERGGQATLVGYVIMLVLILAEFWTWRGLNGESLEHIVVDTSL 133
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
G+ +R+N ++TFP L C L +D +D++G+ LD+ +FK RL+ G + + A
Sbjct: 134 GKRMRVNLNITFPNLHCDDLHLDVIDVAGDSQLDLSDTLFKHRLNLDGTLRSKAKIATEA 193
Query: 126 P-KIDKPLQRHGGRLEH-NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN-PDL 182
K D+ ++ + YCG CYGA+ + DCCN C++V E Y+KK W + L
Sbjct: 194 NIKADEDKKKQEALSKDIPADYCGPCYGADEKEGDCCNTCDDVMERYKKKRWNENAVQPL 253
Query: 183 IDQCKREGFLQR--IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+QC REG + + GEGCN+ G VN+VAGNFH A G+ + G H+H L
Sbjct: 254 AEQCIREGKGKNEPKRMSNGEGCNLSGHFTVNRVAGNFHIAMGEGVDRDGRHIHQFLPED 313
Query: 241 RDSFNISHKINKLAFGEH---------FPG--VVNPLDGVRWTQETPSGMYQYFIKVVPT 289
R +FN SH +++L F + PG +N + V +G++QYFIKVVPT
Sbjct: 314 RMNFNASHVVHELIFMDEEYGDMVIAGVPGETSMNSVSKVVTEDTGTTGLFQYFIKVVPT 373
Query: 290 VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
Y SG T+ EH + LPGVFF Y++ P V T+ V F+H L
Sbjct: 374 KYKGKSGGTLHEK----VEHHDTQN----AVLPGVFFVYEIYPFAVEVTKNKVPFMHLLI 425
Query: 350 NVCAIVGGVFTVSGIIDAFIY 370
+ A VGGVFT+ G ID+ +Y
Sbjct: 426 RIMATVGGVFTIMGWIDSALY 446
>gi|224137484|ref|XP_002322569.1| predicted protein [Populus trichocarpa]
gi|222867199|gb|EEF04330.1| predicted protein [Populus trichocarpa]
Length = 351
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 137/377 (36%), Positives = 211/377 (55%), Gaps = 36/377 (9%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ I+SLDA+P+ E +T SG +++++ ++M LF+ EL YL T ++ VD
Sbjct: 2 GVKQAIKSLDAFPRAEEHLLQKTQSGALVSVIGLVIMATLFYHELAYYLTTYTVHQMSVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RGE L I+ ++TFP+LPC +LSVDA+D+SG+ +D+ +I+K RL+S G++
Sbjct: 62 LQRGEILPIHVNITFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGHI------- 114
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK-GWALSNPD 181
G + +++ E + + +S +E + ++ E KK AL+N
Sbjct: 115 TGTEYLSDLVEKEH---EAHNHDHDKDHHKDSHEEQHTHGFDDAAETMIKKVKQALAN-- 169
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
GEGC +YG L+V +VAGNFH S H + V ++
Sbjct: 170 ------------------GEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGA 207
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N+SH I+ L+FG +PG+ NPLDG SG+++Y+IK+VPT Y +S + +
Sbjct: 208 KHVNVSHIIHDLSFGPKYPGIHNPLDGTARILRETSGIFKYYIKIVPTEYRYISKDVLPT 267
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQFSVTE+F S +T P V+F YDLSPI VT EE SFLHF+T +CAI+GG F +
Sbjct: 268 NQFSVTEYF-SPITDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAILGGTFAL 326
Query: 362 SGIIDAFIYHGQRAIKK 378
+G++D ++Y A+ K
Sbjct: 327 TGMLDRWMYRLLEALTK 343
>gi|255637400|gb|ACU19028.1| unknown [Glycine max]
Length = 347
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 133/371 (35%), Positives = 207/371 (55%), Gaps = 35/371 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I++LDA+P+ + +T SG +++++ I+M LF EL YL T ++ VD RGE
Sbjct: 7 IKNLDAFPRAEDHLLQKTQSGALVSVIGLIIMATLFVHELGYYLTTYTVHQMSVDLKRGE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
TL I+ ++TFP+LPC +LSVDA+D+SG+ +D+ +I+K RL+S G++I + +
Sbjct: 67 TLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSYGHIIGTEY---VSDL 123
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
++K H N + ++ DE N ++V+EA +
Sbjct: 124 VEKEHTHHKHDDNKNHEHSEQKIHLQNLDESTENIIKKVKEALKN--------------- 168
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
GEGC +YG L+V +VAGNFH S H ++V ++ + N+S
Sbjct: 169 ------------GEGCRVYGVLDVQRVAGNFHI----SVHGLNIYVAQMIFDGAKNVNVS 212
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
H I+ L+FG +PG+ NPLD SG ++Y+IKVVPT Y +S + +NQFSV+
Sbjct: 213 HFIHDLSFGPKYPGLHNPLDDTTRILHDTSGTFKYYIKVVPTEYRYISKEVLPTNQFSVS 272
Query: 308 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
E++ Q +T P V+F YDLSPI VT EE SF HF+T +CA++GG F V+G++D
Sbjct: 273 EYYSPINQFD-RTWPAVYFLYDLSPITVTIKEERRSFFHFITRLCAVLGGTFAVTGMLDR 331
Query: 368 FIYHGQRAIKK 378
++Y + K
Sbjct: 332 WMYRLLETLTK 342
>gi|156844136|ref|XP_001645132.1| hypothetical protein Kpol_538p34 [Vanderwaltozyma polyspora DSM
70294]
gi|156115789|gb|EDO17274.1| hypothetical protein Kpol_538p34 [Vanderwaltozyma polyspora DSM
70294]
Length = 405
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 138/408 (33%), Positives = 214/408 (52%), Gaps = 44/408 (10%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+K+ SLDA+ + +E+ RT GG+IT+ + L L E + +++ +L+VD
Sbjct: 5 SKLSSLDAFARPDEEVRIRTKMGGIITISCILTTLYLLSWEWSKFREVISKPQLVVDRDH 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDV-KHDIFKKRLDSQGNVIESRQDGIG 124
L +N D++FP +PC +++D MD SG+ LDV ++ K RLD G V+E+
Sbjct: 65 SSKLELNLDISFPNVPCDFINLDIMDDSGDLQLDVLEYGFTKTRLDPDGKVLETD----- 119
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGA---------ESSDEDCCNNCEEVREAYRKKGW 175
D + + G + YCG CYG+ E+S+ CC CE+VR+AY K GW
Sbjct: 120 ----DFDMYKQDGAPSTDPNYCGPCYGSIDQSKNDEVEASERVCCQTCEDVRKAYVKAGW 175
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
A + I+QC++EG++++I EGC + G +N++ GN HFAPGKSF H HD
Sbjct: 176 AFYDGKGIEQCEQEGYVKKINSHLNEGCRVAGSASLNRIQGNIHFAPGKSFQTVRGHFHD 235
Query: 236 ILAFQRD-SFNISHKINKLAFGEHFP---------GVVNPLDGVRWTQETPSGMYQ--YF 283
++R+ N +H I+ +FG+ P +VNPLDG E + ++Q Y+
Sbjct: 236 QSLYERNPQLNFNHIIHHFSFGKEIPTKLASRHSKNIVNPLDGRSVAPERDTHLHQFSYY 295
Query: 284 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR----------LQTLPGVFFFYDLSPI 333
K+VPT + ++ + + QFS T H R G +PGVFFF+D SPI
Sbjct: 296 TKIVPTRFEYLNKAVVDTAQFSATYHDRPLRGGADDDHPNTFHFRSGIPGVFFFFDASPI 355
Query: 334 KVTFTEEHV--SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
KV +E++ S+ F N +GGV V ++D +Y QR+ K
Sbjct: 356 KV-INKEYISGSWSSFFLNCITSIGGVLAVGSMLDRLMYKAQRSFLGK 402
>gi|349576209|dbj|GAA21381.1| K7_Erv46p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 415
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 148/417 (35%), Positives = 216/417 (51%), Gaps = 59/417 (14%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
SLDA+ K ED RT +GG+ITL + L L +E R + + VT +L+VD R L
Sbjct: 8 SLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWRQFNSVVTRPQLVVDRDRHAKL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-KRLDSQG----NVIESRQDGIG 124
+N DVTFP++PC ++++D MD SGE LD+ F RL+S+G + E G G
Sbjct: 68 ELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGGNG 127
Query: 125 ---APKIDKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRK 172
AP + P YCG CYGA+ ++ CC +C+ VR AY +
Sbjct: 128 DGTAPVNNDP------------NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLE 175
Query: 173 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
GWA + I+QC+REG++ +I E EGC I G ++N++ GN HFAPGK + + H
Sbjct: 176 AGWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGH 235
Query: 233 VHDILAFQRDS-FNISHKINKLAFGE--------------HFPGVV--NPLDGVRWTQET 275
HD + + S N +H IN L+FG+ H VV +PLDG + +
Sbjct: 236 FHDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDR 295
Query: 276 PSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPG 323
+ +Q YF K+VPT Y + I++ QFS T H R GR + +PG
Sbjct: 296 NTHFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPG 355
Query: 324 VFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
+F F+++SP+KV E+H ++ F+ N +GGV V ++D Y QR+I K
Sbjct: 356 MFVFFEMSPLKVINKEQHGQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|406866287|gb|EKD19327.1| copii-coated vesicle membrane protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 453
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 153/452 (33%), Positives = 219/452 (48%), Gaps = 88/452 (19%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG++T+ S +++L L F E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARVRTTSGGIVTIASLLIVLYLAFGEWTDYRRIAVHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ +++FP +PC +L++D MD+SGEQ V H + K RL + G
Sbjct: 65 GEKMEIHLNISFPRIPCELLTLDVMDVSGEQQTGVMHGVKKVRLGPEAE---------GG 115
Query: 126 PKID-KPLQRHGG-RLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALS 178
+I + L HG + H + YCG CYGA + CCN CEEVREAY WA
Sbjct: 116 KEISIESLDLHGDDQATHLDPDYCGGCYGATAPPNAKKAGCCNTCEEVREAYASVSWAFG 175
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ ++QC+RE + +++ + EGC I G + VNKV GNFH APG+SF +HVHD+
Sbjct: 176 RGENVEQCEREHYGEKLDAQRKEGCRIEGGIRVNKVVGNFHIAPGRSFSNGNMHVHDLNN 235
Query: 239 FQRDSFN----ISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPSG 278
+ +H I+ L FG P V NPLD R +
Sbjct: 236 YFDTPVPGGHVFTHHIHSLRFGPQLPESVTKKLGNKALPWTNHHINPLDDTRQVAPETAY 295
Query: 279 MYQYFIKVVPTVYTDVS-------------------GH----TIQSNQFSVTEHFRSSEQ 315
+ YF+KVVPT Y + GH +++++QFSVT H RS
Sbjct: 296 NFMYFVKVVPTSYLPLGWDNSVTSEQRIDHVDIGSYGHLDDGSVETHQFSVTSHKRSLSG 355
Query: 316 G---------RLQT---LPGVFFFY----------------DLSPIKVTFTEEHV-SFLH 346
G +L + +PGVFF Y D+SP+KV EE S
Sbjct: 356 GDDGAEGHKEKLHSRGGIPGVFFSYVSSHFYPQKISTNKTQDISPMKVINREERAKSLAG 415
Query: 347 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
FLT +CAI+GG TV+ +D +Y G +KK
Sbjct: 416 FLTGLCAIIGGTLTVAAAVDRGVYEGTTRLKK 447
>gi|190347075|gb|EDK39286.2| hypothetical protein PGUG_03384 [Meyerozyma guilliermondii ATCC
6260]
Length = 404
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 140/407 (34%), Positives = 220/407 (54%), Gaps = 44/407 (10%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ S DA+ K ED RT +GG+ITL+ IV+L L +E Y + + +L+VD
Sbjct: 5 KLLSFDAFAKTVEDARVRTPAGGIITLICVIVVLYLIRNEYSEYTSIINRPELVVDRDIN 64
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA 125
+ L IN D++FP +PC +L++D +D+SG+ +D+ F+K RL G+ I
Sbjct: 65 KKLEINLDISFPDIPCDVLTMDILDVSGDLQVDLLSSGFEKFRLLKDGSEIRD------- 117
Query: 126 PKIDKPLQRHGGRLEHN------ETYCGSCYGAESSDED---CCNNCEEVREAYRKKGWA 176
+ P+ G LE + CGSCYGA DE+ CCN+CE VR AY +K W
Sbjct: 118 ---ESPVMSSAGELEERARGRAPDGSCGSCYGALPQDENSDYCCNDCETVRLAYAQKAWG 174
Query: 177 LSNPDLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
+ + I+QC+REG++ R+ E+ EGC I G ++N+++GN HFAPG SF G H H
Sbjct: 175 FFDGENIEQCEREGYVARLNEKINNFEGCRIKGTGKINRISGNLHFAPGASFTAPGSHFH 234
Query: 235 DILAFQR--DSFNISHKINKLAFGEHFPGV-------VNPLDGVRWTQETPSGMYQYFIK 285
D+ F + D F H IN L+FG + +PLD ++ +Y Y++K
Sbjct: 235 DLSLFNKYDDKFTFDHVINHLSFGSDPHNIQFFEKQSTHPLDKSSMILKSKDRLYSYYLK 294
Query: 286 VVPTVYTDVSGHT--IQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPI 333
VV T + ++ +T +++NQFSV H R G+ LPGVFF +++SP+
Sbjct: 295 VVATRFEFLTPNTPALETNQFSVISHHRPLAGGKDDDHQHTLHARGGLPGVFFHFEISPM 354
Query: 334 KVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
K+ E++ ++ F+ V + + GV V ++D ++ +R I+ K
Sbjct: 355 KIINKEQYAKTWSGFVLGVISSIAGVLMVGALLDRSVWAAERVIRAK 401
>gi|365982867|ref|XP_003668267.1| hypothetical protein NDAI_0A08710 [Naumovozyma dairenensis CBS 421]
gi|343767033|emb|CCD23024.1| hypothetical protein NDAI_0A08710 [Naumovozyma dairenensis CBS 421]
Length = 410
Score = 240 bits (613), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 144/414 (34%), Positives = 221/414 (53%), Gaps = 53/414 (12%)
Query: 4 IMNK--IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
++NK + SLDA+ K ED RT +G +I++ +V +LL +E Y VT L+V
Sbjct: 2 LVNKSTLLSLDAFSKTQEDVRIRTKTGAIISISCILVTVLLLLNEWIQYSQIVTRPTLVV 61
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDV--KHDIFKKRLDSQGNVIESR 119
D R L +N D++FP++PC IL++D +D +G+ LD+ + K RLD GNVIE
Sbjct: 62 DRERNLKLDLNLDISFPSMPCDILNLDILDDAGDLQLDILNQGQFTKTRLDRMGNVIE-- 119
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA----------ESSDEDCCNNCEEVREA 169
+ KID + ++E YCG CYG+ D+ CC CE+VREA
Sbjct: 120 ---VSKFKIDDDVAEFP---PNDENYCGPCYGSIDQSGNDKIESVKDKICCQTCEQVREA 173
Query: 170 YRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS 229
Y K GWA + I+QC+REG++ +I + EGC + G + +N++ GN HFAPGK+F
Sbjct: 174 YLKAGWAFFDGKNIEQCEREGYVTKINKHLNEGCRVKGNVLLNRIQGNIHFAPGKAFQNV 233
Query: 230 GVHVHDILAFQRD-SFNISHKINKLAFGEHFPGVV---------NPLDGVRWTQETPSGM 279
H HD ++ N +H I+ L+FG+ + +PLDG + + S +
Sbjct: 234 KGHFHDSSLYETSPDLNFNHIIHHLSFGKTIEQLAQLRGATVATSPLDGQQISPSFDSHL 293
Query: 280 YQ--YFIKVVPTVYTDVSGHTIQSNQFSVTEHF------RSSEQGRLQT----LPGVFFF 327
Y+ YF+K+VPT Y + ++ QFS T H R E ++ LPG+F +
Sbjct: 294 YRYSYFVKIVPTRYEYLDKMISETAQFSATFHQSLVTGERDPENPNIKYSRTGLPGLFIY 353
Query: 328 YDLSPIKVTFTEEHVS-----FLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+++SP+K+ TE+H FLH +T+ +GG+ V I+D F Y QR +
Sbjct: 354 FEMSPLKIINTEQHFKSWSGVFLHCITS----IGGILAVGTILDKFFYKAQRTV 403
>gi|11036454|dbj|BAB17274.1| unnamed protein product [Arabidopsis thaliana]
Length = 333
Score = 240 bits (613), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 139/368 (37%), Positives = 210/368 (57%), Gaps = 43/368 (11%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +RS+DA+P+ + +T SG V+++V ++M LF EL YLN +T ++ VD
Sbjct: 2 GVKQALRSIDAFPRAEDHLLQKTQSGAVVSIVGLLIMATLFLHELSYYLNTLTVHQMSVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQ 120
RGETL I+ ++TFP+LPC +LSVDA+D+SG+ +D+ +I+K RL+S G++I E
Sbjct: 62 LKRGETLPIHVNMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGHIIGTEYIS 121
Query: 121 DGI--GAPKIDKPLQRHGGRLEH-NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
D + G P +H G+ EH NET EA G+
Sbjct: 122 DLVEKGHEHGHSP-HKHDGKEEHKNETET---------------------EALNILGF-- 157
Query: 178 SNPDLIDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
DQ E ++++K+ +GEGC +YG L+V +VAGNFH S H ++V
Sbjct: 158 ------DQAA-ETMIKKVKQALADGEGCRVYGVLDVQRVAGNFHI----SVHGLNIYVAQ 206
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
++ + N+SH I+ L+FG +PG+ NPLD SG ++Y+IK+VPT Y +S
Sbjct: 207 MIFGGSKNVNVSHMIHDLSFGPKYPGIHNPLDDTNRILHDTSGTFKYYIKIVPTEYRYLS 266
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+ +NQ+SVTE+F + +T P V+F YDLSPI VT EE SFLH +T +CA++
Sbjct: 267 KDVLSTNQYSVTEYFTPMTEFD-RTWPAVYFLYDLSPITVTIKEERRSFLHLITRLCAVL 325
Query: 356 GGVFTVSG 363
GG F ++G
Sbjct: 326 GGTFALTG 333
>gi|256078219|ref|XP_002575394.1| serologically defined breast cancer antigen ny-br-84-related
[Schistosoma mansoni]
gi|353230384|emb|CCD76555.1| serologically defined breast cancer antigen ny-br-84-related
[Schistosoma mansoni]
Length = 338
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 132/344 (38%), Positives = 201/344 (58%), Gaps = 12/344 (3%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M M +++ DA+ K +DF +T SG +++++SS+++ +LF SEL + + + +++
Sbjct: 1 MAYAMKSLQNFDAFAKPLKDFRIKTLSGALVSIISSLIIGILFTSELLSFTHTQNKQEII 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD +RGE + I D+T +PC LS+D MD +G Q L+V H+++K + G +
Sbjct: 61 VDVNRGEKMSIYMDITLNFIPCRFLSLDTMDTTGAQQLNVMHEVYKTSVSVDGTPVS--- 117
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D + D + YCGSCYGAES CCN CEEV+ AY + W N
Sbjct: 118 DSVRHAVNDAS----ALTTTRDPNYCGSCYGAESPSRKCCNTCEEVQMAYNEMRWIFVNI 173
Query: 181 DLIDQCKREGFLQRIKEEEG-EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+QC++E + IK++ G EGC I+G L VN+V G FH APG S+ ++ H H +
Sbjct: 174 SAFEQCRKENW-NEIKQKIGNEGCRIHGNLTVNRVGGAFHIAPGHSYTENHAHFHSFQSL 232
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-- 297
FN+SH I +L FGE +PG VNPLDG + +T S M Y++K+VPT+Y + +
Sbjct: 233 GPVQFNVSHSIGELRFGESYPGQVNPLDGTKLAVQTHSQMVIYYLKLVPTMYISLRRNES 292
Query: 298 TIQSNQFSVTEHFRSSE-QGRLQTLPGVFFFYDLSPIKVTFTEE 340
T+ +NQ+S T H + + G Q LPGVFF Y+++P+ V TEE
Sbjct: 293 TVITNQYSATWHSKGTPLTGDGQGLPGVFFNYEIAPLLVKITEE 336
>gi|440301578|gb|ELP93964.1| endoplasmic reticulum-golgi intermediate compartment protein,
putative [Entamoeba invadens IP1]
Length = 363
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 138/382 (36%), Positives = 206/382 (53%), Gaps = 26/382 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ DAY K+ ED + GG++T+V I++ +L +E R YL +L+VD R E
Sbjct: 1 MKRFDAYGKVPEDLQVKHGFGGIMTIVCGILIGILVLTEFRYYLQREVTPQLIVDRERDE 60
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
++++FD+TFP C I SVD + SGE +D++ +I K RL+ G P
Sbjct: 61 KIKVHFDITFPFSSCPITSVDVLTKSGESMIDIEKNITKTRLNKN-----------GVPL 109
Query: 128 IDKPLQRHGGRLEHN-----ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+ L+ +L N + C SCYGAE+ CC C++V EAY+++GW L N
Sbjct: 110 TESELKATQQKLNANIKTVDQKTCRSCYGAETPSRKCCYTCDDVIEAYKERGWNL-NIRT 168
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
I QC L+ K EGC + G L +NK+ GNFH APG S + H H+I R
Sbjct: 169 IAQCDNSEKLEMAKLTLEEGCRVEGNLLLNKIGGNFHIAPGTSDNTWTGHHHNIEWTGRT 228
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
+++H N L+FGE + +GM+QYF+ ++P ++G +
Sbjct: 229 KIDLTHTWNDLSFGEGSKTYSGSKKDAKM-----NGMFQYFLTLIPKKNNFINGTKFVYD 283
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
F + E RS G+ + PGVF +YD+SP+ + E + FLHFL VCAI+GGVFTV
Sbjct: 284 -FVINEQTRS---GQGEGEPGVFVYYDVSPMLLEVNEFNHGFLHFLIGVCAIIGGVFTVF 339
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
+IDAF++ ++KKIE+GK
Sbjct: 340 QLIDAFVFDSIHTLQKKIELGK 361
>gi|242059085|ref|XP_002458688.1| hypothetical protein SORBIDRAFT_03g038260 [Sorghum bicolor]
gi|241930663|gb|EES03808.1| hypothetical protein SORBIDRAFT_03g038260 [Sorghum bicolor]
Length = 350
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/371 (36%), Positives = 206/371 (55%), Gaps = 41/371 (11%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A + ++SL+A+P E +T+SG V+T++ +VM+ LF EL+ YL T ++ VD
Sbjct: 2 ARIPSLKSLNAFPHAEEHLLKKTYSGAVVTILGLLVMITLFVHELQFYLTTYTVHQMSVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RGETL I+ +++FP+LPC +LSVDA+D+SG+ +D+ +I+K RLD G++I +
Sbjct: 62 LKRGETLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIGTEYLS 121
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
K H EH++ + E N EE + + AL N
Sbjct: 122 DLVEKGHGAHHDHDHGQEHHD--------EQKKPEQTFN--EEAEKMIKSVKQALGN--- 168
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
GEGC +YG L+V +VAGNFH S H + V + +
Sbjct: 169 -----------------GEGCRVYGMLDVQRVAGNFHI----SVHGLNIFVAEKIFEGSS 207
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
N+SH I++L+FG +PG+ NPLD SG ++Y+IKVVPT Y +S + +N
Sbjct: 208 HVNVSHVIHELSFGPKYPGIHNPLDETSRILHDTSGTFKYYIKVVPTEYKYLSKKVLPTN 267
Query: 303 QFSVTEHF---RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
QFSVTE+F R S++ P V+F YDLSPI VT EE +FLHF+T +CA++GG F
Sbjct: 268 QFSVTEYFLPIRPSDRA----WPAVYFLYDLSPITVTIKEERRNFLHFITRLCAVLGGTF 323
Query: 360 TVSGIIDAFIY 370
++G++D ++Y
Sbjct: 324 AMTGMLDRWMY 334
>gi|323449476|gb|EGB05364.1| hypothetical protein AURANDRAFT_30967 [Aureococcus anophagefferens]
Length = 368
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 140/370 (37%), Positives = 206/370 (55%), Gaps = 21/370 (5%)
Query: 16 KINEDFYSRTFSGGVITL-VSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFD 74
KI +F + ++L V VM LLF EL ++L ++VD S G+ L+I +
Sbjct: 6 KIAAEFTTAPSPAAKVSLTVGHWVMALLFLCELLVFLRVEERDHVVVDRSMGQRLKIGLN 65
Query: 75 VTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQR 134
+TFPAL C+ + +DAMD++G+ H ++ + K+RLD +G+ I R A + +
Sbjct: 66 ITFPALTCAEVHLDAMDVAGDYHPYMEQHMTKQRLDGRGSPIPHRAIPERANEYE----- 120
Query: 135 HGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN-----PDLIDQCKRE 189
HG E C SC+GAE++++ CCN C+E+ AY KGW+ P +D R+
Sbjct: 121 HGP--EDTGAGCQSCFGAETAEQPCCNTCDELLRAYGNKGWSAQEIKKEAPQCVDD-TRD 177
Query: 190 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 249
++ IK+ GEGCN+ G+LEVNKVAGN H A G+S Q+G VH + FN+SH
Sbjct: 178 DSIRAIKK--GEGCNLAGWLEVNKVAGNVHVAMGESAIQNGRFVHQFDPTRAPEFNVSHV 235
Query: 250 INKLAFGEHFPGVVNPLDGVRWTQE--TPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSV 306
I+ LAFGE + G+ PL G + T +G++QYFIK+VPT+Y +++ ++S
Sbjct: 236 IHDLAFGETYDGMALPLSGTSRIVDAATGTGLFQYFIKLVPTIYRAAPDAAPVRTVRYSY 295
Query: 307 TEHFRS--SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
T+ FR ++ LPG+F YD S V T S HFL VCAIVGGV TV
Sbjct: 296 TQRFRPLHNQPPPTAMLPGIFLVYDFSAFMVEVTRHRSSLAHFLVRVCAIVGGVSTVVAF 355
Query: 365 IDAFIYHGQR 374
+D + +R
Sbjct: 356 VDWAVVRAKR 365
>gi|115455745|ref|NP_001051473.1| Os03g0784400 [Oryza sativa Japonica Group]
gi|14718311|gb|AAK72889.1|AC091123_8 unknown protein [Oryza sativa Japonica Group]
gi|108711422|gb|ABF99217.1| Serologically defined breast cancer antigen NY-BR-84, putative,
expressed [Oryza sativa Japonica Group]
gi|113549944|dbj|BAF13387.1| Os03g0784400 [Oryza sativa Japonica Group]
gi|215737170|dbj|BAG96099.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625918|gb|EEE60050.1| hypothetical protein OsJ_12848 [Oryza sativa Japonica Group]
Length = 350
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/371 (34%), Positives = 203/371 (54%), Gaps = 35/371 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ +A+P + +T+SG ++T+ I+M+ LF EL+ YL T ++ VD RGE
Sbjct: 7 LKNFNAFPHAEDHLLKKTYSGAIVTIFGLIIMVTLFAHELKFYLTTYTVHQMSVDLKRGE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
TL I+ +++FP+LPC +LSVDA+D+SG+ +D+ +I+K RLD G++I G
Sbjct: 67 TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHII-------GTEY 119
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
++ +++ G HN + + E N E+ + + A+ N
Sbjct: 120 LNDLVEKEHG--THNHDHDHEHEDEQKKQEHTFN--EDAEKMVKSVKQAMEN-------- 167
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
GEGC +YG L+V +VAGNFH S H + V + + N+S
Sbjct: 168 ------------GEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAEKIFDGSSHVNVS 211
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
H I+ L+FG +PG+ NPLD SG ++Y+IK+VPT Y +S + +NQFSVT
Sbjct: 212 HIIHDLSFGPKYPGIHNPLDETTRILHDTSGTFKYYIKIVPTEYRYLSKQVLPTNQFSVT 271
Query: 308 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
E+F P V+F YDLSPI VT EE +FLHFLT +CA++GG F ++G++D
Sbjct: 272 EYFVPKRATDRSAWPAVYFLYDLSPITVTIKEERRNFLHFLTRLCAVLGGTFAMTGMLDR 331
Query: 368 FIYHGQRAIKK 378
++Y ++ K
Sbjct: 332 WMYRLIESVTK 342
>gi|298706631|emb|CBJ29569.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 453
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 145/418 (34%), Positives = 212/418 (50%), Gaps = 56/418 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ LD Y + +F T G ++T+V +L+L + EL + T L V+++
Sbjct: 26 LRKLKRLDIYSRPKREFQRATVHGAMVTIVLVGAVLVLTWRELVFSMKRETVENLFVNST 85
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-------- 116
T+ + FDV F +PC LS+DA D G D++HD+ + RLDS G +
Sbjct: 86 INPTVNVTFDVVFARIPCGFLSLDAEDALGIPQEDLRHDVTRTRLDSIGRALDDGEKHEM 145
Query: 117 -----------ESRQDGIGAPKIDKPL---QRHG----GRLEH----------NETYCGS 148
E +Q A D+ L R G G +E E C +
Sbjct: 146 GNTLKAVIAKEEEKQAEADASPGDEDLDSKSRAGDGGDGDVEQRALEDTATTGQEDEC-N 204
Query: 149 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE------EGEG 202
CYGA + E CC CE+VR+AYR+KGW L NP I C E E EG
Sbjct: 205 CYGAGAEGE-CCRTCEDVRKAYRRKGWRL-NPAEIPACAGEALSANSANTMESPPVENEG 262
Query: 203 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH--DILAFQRDSFNISHKINKLAFGEHFP 260
C + G LEV++ GNFHFAPG H+ + D + +SFN +H IN L FG+ P
Sbjct: 263 CRLAGHLEVSRTEGNFHFAPGHRLHRHANELSFVDRIQVALESFNTTHTINTLTFGDQPP 322
Query: 261 -GVVNP--------LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 311
G +P L+G + T + M+QYF+++VPTVY +G T+ SNQ+S TEH +
Sbjct: 323 PGHASPKHAVASTVLEGHQKTVQDTHAMHQYFLQLVPTVYRLDNGETVHSNQYSATEHLK 382
Query: 312 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
G + LPGV+F+Y++SP++ E+ FL FLT C +VGGV+T+ G+++ I
Sbjct: 383 HVHDGTSRGLPGVYFYYEVSPVQALVEEKRKGFLAFLTGACGVVGGVYTILGLVNTGI 440
>gi|448081831|ref|XP_004194985.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
gi|359376407|emb|CCE86989.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
Length = 405
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 147/405 (36%), Positives = 230/405 (56%), Gaps = 35/405 (8%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ SLDA+ K ED +T SGG+ITLV +V+LLL +E Y + V +L+VD
Sbjct: 7 KLLSLDAFAKTVEDAKVKTASGGIITLVCVLVVLLLIRNEYSEYTSVVNRPELVVDRDVN 66
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGN--VIESR---Q 120
L IN D+TFP LPC ++++D +D+SG+ DV F+K RL N V+++ +
Sbjct: 67 RKLDINIDITFPNLPCDLVTLDILDVSGDTQADVLKSGFEKYRLIPSSNEEVLDNAPVLR 126
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA--ESSDEDCCNNCEEVREAYRKKGWALS 178
+ + I + + GG +CGSCYGA + +E CCN+CE VR AY ++ WA
Sbjct: 127 NDLSLEDIARNPNKEGG------GFCGSCYGALPQGDNEYCCNDCETVRLAYAERMWAFY 180
Query: 179 NPDLIDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+ I+QC+ EG++ R+ + E+ EGC I G ++N+V+GN HFAPG + G H+HD+
Sbjct: 181 DGANIEQCENEGYVTRLNQRIEQKEGCRIKGTAQINRVSGNMHFAPGYAKTSPGRHIHDL 240
Query: 237 LAFQR--DSFNISHKINKLAFG----EHFPG--VVNPLDGVRWTQETPSGMYQYFIKVVP 288
+++ D FN H IN L+FG + P +PLDG R S + Y++KVV
Sbjct: 241 SLYEKHFDKFNFDHVINHLSFGLDPVKEDPNHQSTHPLDGYRLILNDKSRVISYYLKVVA 300
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFT 338
T + +SG +++NQFS H R G+ + +PGVFF +D+SP+K+
Sbjct: 301 TRFEFLSGLAMETNQFSAIPHHRPYRGGKDEDHRHTMHAKGGIPGVFFHFDISPMKIINK 360
Query: 339 EEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
E++ ++ F+ V + + GV TV ++D ++ ++AIK K +I
Sbjct: 361 EQYAKTWSGFVLGVVSSIAGVLTVGAVLDRSVWAAEKAIKSKKDI 405
>gi|218193856|gb|EEC76283.1| hypothetical protein OsI_13786 [Oryza sativa Indica Group]
Length = 350
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/371 (34%), Positives = 203/371 (54%), Gaps = 35/371 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ +A+P + +T+SG ++T+ I+M+ LF EL+ YL T ++ VD RGE
Sbjct: 7 LKNFNAFPHAEDHLLPKTYSGAIVTIFGLIIMVTLFAHELKFYLTTYTVHQMSVDLKRGE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
TL I+ +++FP+LPC +LSVDA+D+SG+ +D+ +I+K RLD G++I G
Sbjct: 67 TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHII-------GTEY 119
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
++ +++ G HN + + E N E+ + + A+ N
Sbjct: 120 LNDLVEKEHG--THNHDHDHEHEDEQKKQEHTFN--EDAEKMVKSVKQAMEN-------- 167
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
GEGC +YG L+V +VAGNFH S H + V + + N+S
Sbjct: 168 ------------GEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAEKIFDGSSHVNVS 211
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
H I+ L+FG +PG+ NPLD SG ++Y+IK+VPT Y +S + +NQFSVT
Sbjct: 212 HIIHDLSFGPKYPGIHNPLDETTRILHDTSGTFKYYIKIVPTEYRYLSKQVLPTNQFSVT 271
Query: 308 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
E+F P V+F YDLSPI VT EE +FLHFLT +CA++GG F ++G++D
Sbjct: 272 EYFVPKRATDRSAWPAVYFLYDLSPITVTIKEERRNFLHFLTRLCAVLGGTFAMTGMLDR 331
Query: 368 FIYHGQRAIKK 378
++Y ++ K
Sbjct: 332 WMYRLIESVTK 342
>gi|6319274|ref|NP_009358.1| Erv46p [Saccharomyces cerevisiae S288c]
gi|1723191|sp|P39727.2|ERV46_YEAST RecName: Full=ER-derived vesicles protein ERV46
gi|1326054|gb|AAC04989.1| Yal042wp [Saccharomyces cerevisiae]
gi|285810158|tpg|DAA06944.1| TPA: Erv46p [Saccharomyces cerevisiae S288c]
gi|392301230|gb|EIW12318.1| Erv46p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 415
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 147/417 (35%), Positives = 215/417 (51%), Gaps = 59/417 (14%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
SLDA+ K ED RT +GG+ITL + L L +E + + VT +L+VD R L
Sbjct: 8 SLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWGQFNSVVTRPQLVVDRDRHAKL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-KRLDSQG----NVIESRQDGIG 124
+N DVTFP++PC ++++D MD SGE LD+ F RL+S+G + E G G
Sbjct: 68 ELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGGNG 127
Query: 125 ---APKIDKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRK 172
AP + P YCG CYGA+ ++ CC +C+ VR AY +
Sbjct: 128 DGTAPVNNDP------------NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLE 175
Query: 173 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
GWA + I+QC+REG++ +I E EGC I G ++N++ GN HFAPGK + + H
Sbjct: 176 AGWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGH 235
Query: 233 VHDILAFQRDS-FNISHKINKLAFGE--------------HFPGVV--NPLDGVRWTQET 275
HD + + S N +H IN L+FG+ H VV +PLDG + +
Sbjct: 236 FHDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDR 295
Query: 276 PSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPG 323
+ +Q YF K+VPT Y + I++ QFS T H R GR + +PG
Sbjct: 296 NTHFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHVRGGIPG 355
Query: 324 VFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
+F F+++SP+KV E+H ++ F+ N +GGV V ++D Y QR+I K
Sbjct: 356 MFVFFEMSPLKVINKEQHGQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|151941348|gb|EDN59719.1| ER vesicle protein [Saccharomyces cerevisiae YJM789]
gi|190406692|gb|EDV09959.1| ER-Golgi transport vesicle protein [Saccharomyces cerevisiae
RM11-1a]
gi|207348028|gb|EDZ74008.1| YAL042Wp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|256272276|gb|EEU07261.1| Erv46p [Saccharomyces cerevisiae JAY291]
gi|259144662|emb|CAY77603.1| Erv46p [Saccharomyces cerevisiae EC1118]
gi|323334778|gb|EGA76150.1| Erv46p [Saccharomyces cerevisiae AWRI796]
gi|323338873|gb|EGA80087.1| Erv46p [Saccharomyces cerevisiae Vin13]
gi|323349926|gb|EGA84136.1| Erv46p [Saccharomyces cerevisiae Lalvin QA23]
gi|365767200|gb|EHN08685.1| Erv46p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 415
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 147/417 (35%), Positives = 215/417 (51%), Gaps = 59/417 (14%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
SLDA+ K ED RT +GG+ITL + L L +E + + VT +L+VD R L
Sbjct: 8 SLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWGQFNSVVTRPQLVVDRDRHAKL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-KRLDSQG----NVIESRQDGIG 124
+N DVTFP++PC ++++D MD SGE LD+ F RL+S+G + E G G
Sbjct: 68 ELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGGNG 127
Query: 125 ---APKIDKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRK 172
AP + P YCG CYGA+ ++ CC +C+ VR AY +
Sbjct: 128 DGTAPVNNDP------------NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLE 175
Query: 173 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
GWA + I+QC+REG++ +I E EGC I G ++N++ GN HFAPGK + + H
Sbjct: 176 AGWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGH 235
Query: 233 VHDILAFQRDS-FNISHKINKLAFGE--------------HFPGVV--NPLDGVRWTQET 275
HD + + S N +H IN L+FG+ H VV +PLDG + +
Sbjct: 236 FHDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDR 295
Query: 276 PSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPG 323
+ +Q YF K+VPT Y + I++ QFS T H R GR + +PG
Sbjct: 296 NTHFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPG 355
Query: 324 VFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
+F F+++SP+KV E+H ++ F+ N +GGV V ++D Y QR+I K
Sbjct: 356 MFVFFEMSPLKVINKEQHGQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|167376738|ref|XP_001734125.1| endoplasmic reticulum-golgi intermediate compartment protein
[Entamoeba dispar SAW760]
gi|165904489|gb|EDR29705.1| endoplasmic reticulum-golgi intermediate compartment protein,
putative [Entamoeba dispar SAW760]
Length = 361
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 126/377 (33%), Positives = 211/377 (55%), Gaps = 18/377 (4%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ D Y K+ ED +R GG +T++ +++++L +E YL +LLVD R
Sbjct: 1 MKRFDTYGKLPEDLRTRHCFGGFLTIICVVIIIILSIAEFTFYLQREVVPQLLVDRDRSS 60
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+ ++FD+TFP C I SVD + SGE +D++ ++ K R+ G+++ +
Sbjct: 61 KIPVHFDITFPYSSCPITSVDILTKSGESMIDIEQNVTKIRIHHDGSLVTESE------- 113
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
K +Q H+ C SCYGAE+ ++ CC C++V+EAY+KKGW L + +++ QC+
Sbjct: 114 -MKAIQSKLSTETHDPKECRSCYGAETPEKKCCFTCDDVKEAYKKKGWRL-DLNIVSQCQ 171
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
+Q + + EGC + G +NK+ GNFH APG S G H H++ + ++S
Sbjct: 172 NHEKIQMARLTKDEGCRVIGDFLLNKIGGNFHIAPGSSEQSWGRHSHNLEWTGKTQIDLS 231
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
HK N+L+FGEH + + M+QY++ ++P ++G T +S+
Sbjct: 232 HKWNELSFGEHSKKFTTEKKDTQM-----NSMFQYYLTIIPIKNNFING-TSTFYDYSIQ 285
Query: 308 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
E+ RS G + PGVF +YD+SP+ + TE + FLHFL +C+IVGG+FT + DA
Sbjct: 286 ENIRS---GEGEGSPGVFVYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDA 342
Query: 368 FIYHGQRAIKKKIEIGK 384
++ +++KK+E+GK
Sbjct: 343 IVFESIHSLEKKVELGK 359
>gi|323356370|gb|EGA88170.1| Erv46p [Saccharomyces cerevisiae VL3]
Length = 415
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 145/415 (34%), Positives = 214/415 (51%), Gaps = 55/415 (13%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
SLDA+ K ED RT +GG+ITL + L L +E + + VT +L+VD R L
Sbjct: 8 SLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWGQFNSVVTRPQLVVDRDRHAKL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID 129
+N DVTFP++PC ++++D MD SGE LD+ LD+ SR + G P D
Sbjct: 68 ELNMDVTFPSMPCDLVNLDIMDDSGEMQLDI--------LDA--GFTMSRLNSEGRPVGD 117
Query: 130 KPLQRHGGR------LEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKG 174
GG + ++ YCG CYGA+ ++ CC +C+ VR AY + G
Sbjct: 118 ATELHVGGNGDGTXPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAG 177
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
WA + I+QC+REG++ +I E EGC I G ++N++ GN HFAPGK + + H H
Sbjct: 178 WAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFH 237
Query: 235 DILAFQRDS-FNISHKINKLAFGE--------------HFPGVV--NPLDGVRWTQETPS 277
D + + S N +H IN L+FG+ H VV +PLDG + + +
Sbjct: 238 DTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNT 297
Query: 278 GMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVF 325
+Q YF K+VPT Y + I++ QFS T H R GR + +PG+F
Sbjct: 298 HFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGMF 357
Query: 326 FFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
F+++SP+KV E+H ++ F+ N +GGV V ++D Y QR+I K
Sbjct: 358 VFFEMSPLKVINKEQHGQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|3860008|gb|AAC72954.1| unknown [Homo sapiens]
Length = 198
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 111/185 (60%), Positives = 138/185 (74%), Gaps = 3/185 (1%)
Query: 149 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 208
CYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGF
Sbjct: 8 CYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGF 67
Query: 209 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 268
LEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 68 LEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDH 127
Query: 269 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFF 326
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 128 TNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFA 186
Query: 327 FYDLS 331
LS
Sbjct: 187 HLPLS 191
>gi|50294900|ref|XP_449861.1| hypothetical protein [Candida glabrata CBS 138]
gi|49529175|emb|CAG62841.1| unnamed protein product [Candida glabrata]
Length = 415
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 148/412 (35%), Positives = 221/412 (53%), Gaps = 43/412 (10%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
S DA+ K ED RT SGG ITL +V L+L SE R + + VT +L++D R L
Sbjct: 8 SFDAFAKTEEDVRIRTRSGGFITLGCLVVTLMLLLSEWRDFNSVVTRPELVIDRDRSLRL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIG-APK 127
+N D+TFP++PC +L++D MD SGE LD+ + F+K RL +G V+ + IG A K
Sbjct: 68 DLNLDITFPSMPCELLTLDIMDDSGEVQLDIMNAGFEKTRLSKEGKVLGTADMKIGEAAK 127
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDED----------CCNNCEEVREAYRKKGWAL 177
DK Q +L N YCG+CYGA ++ CC C++VR+AY +K WA
Sbjct: 128 KDKEAQL--AKLGAN--YCGNCYGARDQGKNNDDTPRDQWVCCQTCDDVRQAYFEKNWAF 183
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH-DI 236
+ I+QC+REG++Q+I ++ EGC + G ++N++ GN HFA G F H H D
Sbjct: 184 FDGKDIEQCEREGYVQKIADQLQEGCRVSGSAQLNRIDGNLHFAAGPGFQNIRGHFHDDS 243
Query: 237 LAFQRDSFNISHKINKLAFGEHFPG------------VVNPLDGVRW--TQETPSGMYQY 282
L Q + N +H IN L+FG+ VNPLDG ++ Y Y
Sbjct: 244 LYIQHPNLNFNHIINHLSFGKAVEPTKKGKVMGIEKVTVNPLDGHSMFPPRDAHFLQYSY 303
Query: 283 FIKVVPTVYTDVS-GHTIQSNQFSVTEHFR----SSEQGRLQTL------PGVFFFYDLS 331
+ K+VPT Y ++ + +++ QFS T H R S+ T+ P ++ +++S
Sbjct: 304 YAKIVPTRYEGLNKKNMVETAQFSSTFHIRPVGGGSDDDHPNTVHQRGGSPSMWINFEMS 363
Query: 332 PIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
P+KV EEH S+ F+ N +GGV V ++D +Y QR I +K ++
Sbjct: 364 PLKVINREEHGQSWSGFVLNCITSIGGVLAVGTVLDKALYKAQRTIFQKKDV 415
>gi|354544621|emb|CCE41346.1| hypothetical protein CPAR2_303350 [Candida parapsilosis]
Length = 412
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 143/412 (34%), Positives = 220/412 (53%), Gaps = 30/412 (7%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + K+ SLDA+ K ED +T SGG+ITL+ V L L +E Y + +L+
Sbjct: 1 MSSQRPKLISLDAFAKTVEDARIKTASGGIITLLCIFVALFLIRNEYIDYTTVIARPELV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESR 119
VD + L IN D++F LPC ++S+D D SG+ LD+ + +K R+ QG+ +
Sbjct: 61 VDRDINKQLDINLDISFLNLPCDLVSIDLFDESGDLKLDIINSQLEKFRIIKQGHSSKPV 120
Query: 120 QDGIGAPKIDK--PLQRHGGRLEHNET--YCGSCYGAESSDED--CCNNCEEVREAYRKK 173
+ P + + PL++ L +T CGSCYGA D+ CCN C VR AY +
Sbjct: 121 EIKDEQPALQREVPLEQIAPGLPEGQTEGECGSCYGAVPQDKKQYCCNTCAAVRRAYAEA 180
Query: 174 GWALSNPDLIDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
W + + I QC++EG++QR+K+ E EGC + G ++N+++G FAPG S + G
Sbjct: 181 NWQFFDGENIAQCEQEGYVQRLKQRIGENEGCRVKGTAKINRISGTMDFAPGASMTKDGR 240
Query: 232 HVHDILAFQ--RDSFNISHKINKLAFGEHFP-------GVVNPLDGVRWTQETPSGMYQY 282
HVHD+ +Q +D FN H IN L+FG + P G + PLDG ++ Q Y
Sbjct: 241 HVHDLSLYQKYKDKFNFDHVINHLSFGNNPPASKLVDTGSITPLDGHKFLQHKKYHSINY 300
Query: 283 FIKVVPTVYTDVSG-HTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLS 331
F+K+V T + + G H +NQFSV H R G+ + +PGV F +D+S
Sbjct: 301 FLKIVATRFESLDGKHKFDTNQFSVITHDRPLAGGKDEDHQHTLHARGGVPGVAFNFDIS 360
Query: 332 PIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
P+K+ EE+ F+ V + + GV V ++D ++ Q+AIK K ++
Sbjct: 361 PLKIINREEYAKTRSGFILGVVSSIAGVLMVGSLMDRSVFAAQQAIKGKKDL 412
>gi|407034208|gb|EKE37117.1| hypothetical protein ENU1_208770 [Entamoeba nuttalli P19]
Length = 361
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 127/377 (33%), Positives = 211/377 (55%), Gaps = 18/377 (4%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ D Y K+ ED +R GG +T++ +++++L +E YL +LLVD R
Sbjct: 1 MKRFDTYGKVPEDLRTRHCFGGFLTIICVVIIIVLSIAEFAFYLQREVVPQLLVDRERSS 60
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+ ++FD+TFP C I SVD + SGE +D++ ++ K R+ G+++ +
Sbjct: 61 KIPVHFDITFPYSSCPITSVDILTKSGESMIDIEQNVTKIRIHHDGSLVTENE------- 113
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
K +Q H+ C SCYGAE+ ++ CC C++V+EAY+KKGW L + +++ QC+
Sbjct: 114 -MKAIQSKLSTETHDPKECRSCYGAETPEKKCCFTCDDVKEAYKKKGWRL-DLNIVSQCQ 171
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
+Q K + EGC + G +NK+ GNFH APG S G H H++ + ++S
Sbjct: 172 NHEKIQMAKLTKDEGCRLIGDFLLNKIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQIDLS 231
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
HK N+L+FGE+ + + M+QY++ ++P ++G T +S+
Sbjct: 232 HKWNELSFGENSKKFTTEKKDTQM-----NSMFQYYLTIIPIKNNFING-TSTFYDYSIQ 285
Query: 308 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
E+ RS G+ + PGVF +YD+SP+ + TE + FLHFL +C+IVGG+FT + DA
Sbjct: 286 ENTRS---GKGEGQPGVFVYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDA 342
Query: 368 FIYHGQRAIKKKIEIGK 384
++ +KKK+E+GK
Sbjct: 343 IVFESIHTLKKKVELGK 359
>gi|366987855|ref|XP_003673694.1| hypothetical protein NCAS_0A07550 [Naumovozyma castellii CBS 4309]
gi|342299557|emb|CCC67313.1| hypothetical protein NCAS_0A07550 [Naumovozyma castellii CBS 4309]
Length = 425
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 142/417 (34%), Positives = 217/417 (52%), Gaps = 48/417 (11%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ S DA+ K E+ RT +GG+ITL IV L L +E + + +T +L+VD R
Sbjct: 10 KLLSFDAFAKTEEEVRVRTNTGGIITLSCIIVTLYLLLNEWSQFNSVITSPQLVVDRDRN 69
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA 125
L +NFDVTFP++ C ++++D MD SGE LD+ F K R+D+ GN + S +G
Sbjct: 70 LKLELNFDVTFPSISCDLINLDIMDDSGELQLDLLDSAFTKIRVDADGNELGSSTLEVGT 129
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWA 176
+ +Q+ ++ YCGSCYG++ DE+ CC C +VREAY GW
Sbjct: 130 DDLASEVQQRN----NDPDYCGSCYGSKVQDENDKLPRESRVCCQTCNDVREAYLNIGWG 185
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSF-----HQSGV 231
+ I+QC++EG++ +I E EGC + G ++++ GN HFAPGKS+ S
Sbjct: 186 FFDGKGIEQCEKEGYVAKINEHLKEGCRVKGQTLLSRIQGNIHFAPGKSYTSYKRSTSAS 245
Query: 232 HVHDILAFQRDS-FNISHKINKLAFGEHFPGV------------VNPLDG---VRWTQET 275
H HD + + S N +HKIN L+FG+ + ++PLDG + +T
Sbjct: 246 HYHDTSLYDKTSNLNFNHKINHLSFGKPIDKLDEKVQDHSTEFSISPLDGREVIPTDIDT 305
Query: 276 PSGMYQYFIKVVPTVYT--DVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPG 323
+Y Y+ K+VPT Y + +I++ QFS T H R GR +PG
Sbjct: 306 HYHVYSYYAKIVPTRYEFLNKKEKSIETAQFSTTFHSRPLRGGRDADHPTTMHSQGGIPG 365
Query: 324 VFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
+F ++++S +KV E H S+ FL N VG V V + D Y Q++++ K
Sbjct: 366 LFIYFEMSAVKVINKEHHFRSWSSFLLNCITTVGSVLAVGTVSDKIFYRAQKSLQGK 422
>gi|225446891|ref|XP_002284045.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Vitis vinifera]
gi|296086333|emb|CBI31774.3| unnamed protein product [Vitis vinifera]
Length = 351
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 134/368 (36%), Positives = 205/368 (55%), Gaps = 44/368 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I+SL A+P+ E +T SG V++++ ++M LF ELR YL T ++ VD RGE
Sbjct: 7 IKSLHAFPRAEEHLLQKTQSGAVVSIIGLVIMATLFLHELRYYLTTYTVHQMSVDLKRGE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
TL I+ ++TFP+LPC +LSVDA+D+SG+ +D+ +I+K RL+ G + IG
Sbjct: 67 TLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNRDGFI-------IGTEY 119
Query: 128 IDKPLQRHGGRLEHNETY-----CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+ +++ +H+ A S D+D N ++V++ AL+N
Sbjct: 120 LSDLVEKEHADHKHDHNKDHHGDSDQKLHAHSFDQDAENMVKKVKQ-------ALAN--- 169
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
GEGC +YG L+V +VAGNFH S H + V ++
Sbjct: 170 -----------------GEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAI 208
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
N+SH I+ L+FG +PG+ NPLDG SG ++Y+IK+VPT Y +S + +N
Sbjct: 209 HVNVSHIIHDLSFGPKYPGLHNPLDGTVRILRGASGTFKYYIKIVPTEYRYISKEVLPTN 268
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
QFSV E+F + +T P V+F YDLSP+ VT EE SFLHF+T +CA++GG F ++
Sbjct: 269 QFSVMEYFSPMNEFD-RTWPAVYFLYDLSPVTVTIKEERRSFLHFITRLCAVLGGTFALT 327
Query: 363 GIIDAFIY 370
G++D ++Y
Sbjct: 328 GMLDRWMY 335
>gi|194708090|gb|ACF88129.1| unknown [Zea mays]
gi|195607866|gb|ACG25763.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|195619788|gb|ACG31724.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|413952088|gb|AFW84737.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 350
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 134/372 (36%), Positives = 209/372 (56%), Gaps = 43/372 (11%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A + ++SL+A+P E +T+SG V+T+ ++M+ LF EL+ YL T ++ VD
Sbjct: 2 ARIPSLKSLNAFPHAEEHLLKKTYSGAVVTIFGLLIMITLFVHELQFYLTTYTVHQMSVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RGETL I+ +++FP+LPC +LSVDA+D+SG+ +D+ +I+K RLD G++
Sbjct: 62 LKRGETLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHI------- 114
Query: 123 IGAPKIDKPLQR-HGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
IG + +++ HG + + + + E N EE + + AL N
Sbjct: 115 IGTEYLSDLVEKGHGAHH--DHDHDHDHHDEQKKHEQTFN--EEAEKMIKSVKQALGN-- 168
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
GEGC +YG L+V +VAGNFH S H + V + +
Sbjct: 169 ------------------GEGCRVYGMLDVQRVAGNFHI----SVHGLNIFVAEKIFEGS 206
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
+ N+SH I++L+FG +PG+ NPLD SG ++Y+IKVVPT Y +S + +
Sbjct: 207 NHVNVSHVIHELSFGPKYPGIHNPLDETSRILHDTSGTFKYYIKVVPTEYKYLSKKVLPT 266
Query: 302 NQFSVTEHF---RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
NQFSVTE+F R +++ P V+F YDLSPI VT EE +FLHF+T +CA++GG
Sbjct: 267 NQFSVTEYFLPIRPTDRA----WPAVYFLYDLSPITVTIKEERRNFLHFVTRLCAVLGGT 322
Query: 359 FTVSGIIDAFIY 370
F ++G++D ++Y
Sbjct: 323 FAMTGMLDRWMY 334
>gi|226292523|gb|EEH47943.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides brasiliensis Pb18]
Length = 435
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 149/439 (33%), Positives = 216/439 (49%), Gaps = 72/439 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGG++T+V+ V+ L + E Y V +L+VD
Sbjct: 2 APKSRFARLDAFTKTVEDARIRTRSGGLVTIVALFVISFLIWGEWYEYRRIVVLPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESR 119
RGE + I+ ++TFP LPC +L++D MD+SGE + H I K RL + G+VI++
Sbjct: 62 KGRGERMEIHLNITFPHLPCELLTLDVMDVSGEMQSGIIHGISKVRLAPESEGGHVIDTT 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKG 174
L +H + YCG CYGA ++ +EVREAY +
Sbjct: 122 A---------LVLHTQTDAAKHLDPDYCGPCYGAPPPSHATKPGVALPAKEVREAYASQS 172
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
WA + ++QC+REG+ + + + EGC I G L VNKV GNFH APG+SF +H H
Sbjct: 173 WAFGRGENVEQCEREGYSKNLDAQRNEGCRIEGVLRVNKVVGNFHIAPGRSFSSGNIHAH 232
Query: 235 DILAFQRDSF--NISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMY 280
D+ + ++SHKI++L FG + NPLD P +
Sbjct: 233 DLDTYYHTPVPHHMSHKIHQLRFGPQLSDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNF 292
Query: 281 QYFIKVVPTVYTDV----------------------------SGHTIQSNQFSVTEHFRS 312
YF+KVV T Y + S +I+++Q+SVT H RS
Sbjct: 293 MYFVKVVSTSYLPLGWSPEFSSSVHETTLGNTPLGKQGVHFGSSGSIETHQYSVTSHKRS 352
Query: 313 SEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 359
+ G RL + +PGVF YD+SP+KV E +F FLT VCA++GG
Sbjct: 353 IDGGDDAAEGHKERLHSHGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 412
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ +D +Y G +KK
Sbjct: 413 TVAAAVDRALYEGVARVKK 431
>gi|118482697|gb|ABK93267.1| unknown [Populus trichocarpa]
Length = 366
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 138/385 (35%), Positives = 214/385 (55%), Gaps = 37/385 (9%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ I+SLDA+P+ E +T SG +++++ ++M LF+ EL YL T ++ VD
Sbjct: 2 GVKQAIKSLDAFPRAEEHLLQKTQSGALVSVIGLVIMATLFYHELAYYLTTYTVHQMSVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RGE L I+ ++TFP+LPC +LSVDA+D+SG+ +D+ +I+KK L G ++
Sbjct: 62 LQRGEILPIHVNITFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKKLL--FGMLLT----- 114
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
R+E + S +G + E + E+ EA+ + D
Sbjct: 115 ---------------RIEFLQLRLNS-HGHITGTEYLSDLVEKEHEAHNHDHDKDHHKDS 158
Query: 183 IDQCKREGF-------LQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
++ GF ++++K+ GEGC +YG L+V +VAGNFH S H + V
Sbjct: 159 HEEQHTHGFDDAAETMIKKVKQALANGEGCRVYGVLDVQRVAGNFHI----SVHGLNIFV 214
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
++ N+SH I+ L+FG +PG+ NPLDG SG+++Y+IK+VPT Y
Sbjct: 215 AQMIFDGAKHVNVSHIIHDLSFGPKYPGIHNPLDGTARILRETSGIFKYYIKIVPTEYRY 274
Query: 294 VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
+S + +NQFSVTE+F S +T P V+F YDLSPI VT EE SFLHF+T +CA
Sbjct: 275 ISKDVLPTNQFSVTEYF-SPITDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCA 333
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKK 378
I+GG F ++G++D ++Y A+ K
Sbjct: 334 ILGGTFALTGMLDRWMYRLLEALTK 358
>gi|224086657|ref|XP_002307923.1| predicted protein [Populus trichocarpa]
gi|222853899|gb|EEE91446.1| predicted protein [Populus trichocarpa]
Length = 351
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/373 (35%), Positives = 208/373 (55%), Gaps = 38/373 (10%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I+ LDA+P+ E +T SG +++++ + M LF+ EL YL T ++ VD +RGE
Sbjct: 7 IKKLDAFPRAEEHLLQKTQSGALVSIIGLVTMATLFYHELAYYLTTYTVHQMSVDLTRGE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
TL I+ ++TFP+LPC +LSVDA+D+SG+ +D+ I+K RL+S G++
Sbjct: 67 TLPIHINITFPSLPCDVLSVDAIDMSGKHEVDLDTSIWKLRLNSYGHI------------ 114
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
G+ Y ++ +++ + + + + + A + D
Sbjct: 115 ------------------TGTEYLSDLVEKEHEAHNHDHNKDHHEDSHAKQHTHGFDDAA 156
Query: 188 REGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
E ++++K+ GEGC +YG L+V +VAGNFH S H + V ++ N
Sbjct: 157 -ETMVKKVKQALANGEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAKHVN 211
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 305
+SH I+ L+FG +PG+ NPLDG SG ++Y+IK+VPT Y +S + +NQFS
Sbjct: 212 VSHIIHDLSFGPKYPGIHNPLDGTTRILHETSGTFKYYIKIVPTEYRYISKEVLPTNQFS 271
Query: 306 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
VTE+F S +T P V+F YDLSPI VT EE SFLHF+T +CA++GG F ++G++
Sbjct: 272 VTEYF-SPMTDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFALTGML 330
Query: 366 DAFIYHGQRAIKK 378
D ++ A+ K
Sbjct: 331 DRWMCRLLEALTK 343
>gi|255732259|ref|XP_002551053.1| hypothetical protein CTRG_05351 [Candida tropicalis MYA-3404]
gi|240131339|gb|EER30899.1| hypothetical protein CTRG_05351 [Candida tropicalis MYA-3404]
Length = 414
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 140/409 (34%), Positives = 221/409 (54%), Gaps = 33/409 (8%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ S DA+ K ED +T SGG+ITL+ ++ L+L +E Y +T +L+VD
Sbjct: 6 KLLSFDAFAKTVEDARIKTASGGIITLICVLITLILIRNEYIDYTTIITRPELVVDRDIN 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RL--DSQGNV----IESR 119
+ L IN D++F LPC ++SVD +D++G+Q LD+ KK RL + QG+V IE
Sbjct: 66 KQLDINLDISFINLPCDLISVDLLDVTGDQQLDIIDSGLKKVRLLKNKQGDVIINEIEDD 125
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWAL 177
+ + + K L + YCG CYGA D+ CCN+C VR AY +K W
Sbjct: 126 KPALNSDVSLKELAKGLPEGSDQNAYCGPCYGALPQDKKQFCCNDCNTVRRAYAEKQWQF 185
Query: 178 SNPDLIDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
+ + I+QC++EG+++R++E EGC I G ++N+V+G FAPG SF+ G H HD
Sbjct: 186 FDGENIEQCEKEGYVKRLRERINNNEGCRIKGSTKINRVSGTMDFAPGSSFNHDGRHFHD 245
Query: 236 ILAFQR--DSFNISHKINKLAFG--------EHFPGVVNPLDGVRWTQETPSGMYQYFIK 285
+ +++ D FN H IN L+FG E ++PLD ++ + YF+K
Sbjct: 246 LSLYKKYNDKFNFDHVINHLSFGEVPTNNGAEEMFDSIHPLDDYQFMLHKKDHVVSYFLK 305
Query: 286 VVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIK 334
VV T Y + + +NQFSV H R G+ + +PGV F +D+SP+K
Sbjct: 306 VVATRYESLDYSKRVDTNQFSVITHDRPLIGGKDEDHQHTLHARGGIPGVNFNFDISPLK 365
Query: 335 VTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
+ +++ ++ F+ V + + GV V ++D ++ Q+AIK K +I
Sbjct: 366 IINRQQYAKTWSGFILGVVSSIAGVLMVGTLLDRSVFAAQQAIKGKKDI 414
>gi|357112836|ref|XP_003558212.1| PREDICTED: probable endoplasmic reticulum-Golgi intermediate
compartment protein 3-like [Brachypodium distachyon]
Length = 349
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 132/363 (36%), Positives = 200/363 (55%), Gaps = 36/363 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ +A+P + +T+SG ++T+ I+M LF EL+ YL T ++ VD RGE
Sbjct: 7 LKNFNAFPHAEDHLLKKTYSGAIVTIFGLIIMFTLFVHELKFYLTTYTMHQMSVDLKRGE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
TL I+ +++FP+LPC +LSVDA+D+SG+ +D+ +I+K RLD G +I +
Sbjct: 67 TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGTIIGT--------- 117
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
E+ +GA D ++ EE KK N D K
Sbjct: 118 ------------EYLSDLVEKEHGAHHHDNGHEHHDEE------KKPEHTFNEDADKMVK 159
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
R E GEGC +YG L+V +VAGNFH S H ++V + + N+S
Sbjct: 160 S----VRQALENGEGCRVYGMLDVQRVAGNFHI----SVHGLNIYVAEKIFEGSSHVNVS 211
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
H I++L+FG +PG+ NPLD SG ++Y+IKVVPT Y +S + +NQFSVT
Sbjct: 212 HVIHELSFGPKYPGIHNPLDDTTRILHDASGTFKYYIKVVPTEYRYLSKQVLPTNQFSVT 271
Query: 308 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
E+F ++ P V+F YDLSPI VT EE +FLHF+T +CA++GG F ++G++D
Sbjct: 272 EYFVPIRPAD-RSWPAVYFLYDLSPITVTIKEERRNFLHFITRLCAVLGGTFAMTGMLDR 330
Query: 368 FIY 370
++Y
Sbjct: 331 WMY 333
>gi|254581328|ref|XP_002496649.1| ZYRO0D04972p [Zygosaccharomyces rouxii]
gi|238939541|emb|CAR27716.1| ZYRO0D04972p [Zygosaccharomyces rouxii]
Length = 404
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 134/408 (32%), Positives = 208/408 (50%), Gaps = 43/408 (10%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R+ DA+ K ED RT +GG+I L+ +V + L SE + V +L+VD R
Sbjct: 7 LRTFDAFSKTEEDVRIRTRTGGIIALLCCLVTIFLLISEWLNFNQVVNRPELVVDKDRQL 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAP 126
L + D+TFP++PC +LS+D MD +GE LD+ F K RLD G + S +
Sbjct: 67 KLELEADITFPSMPCDMLSLDIMDSAGEIQLDLLESGFTKTRLDQNGQSLGSSSLKVSDE 126
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWAL 177
D +E YCG+CYGA+ + CC C +VR AY + WA
Sbjct: 127 SYDP----------KDENYCGACYGAKDQSRNNEVPKEERVCCQTCNDVRRAYLEANWAF 176
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ I+QC+REG++ R+ E+ EGC + G +N++ G HFAPG +F H HD+
Sbjct: 177 FDGKNIEQCEREGYVDRVNEQLNEGCRVQGSALLNRIQGTLHFAPGVAFQNPKGHFHDLS 236
Query: 238 AFQRD-SFNISHKINKLAFGEHFPG---------VVNPLDGVRWTQETPSGMYQ--YFIK 285
+++ + N +H IN L+FG+ PLDG + + + M+Q YF K
Sbjct: 237 LYEKTHNLNFNHIINHLSFGKPVTSNARGRGASVATAPLDGRQAFPDRDTHMHQFSYFTK 296
Query: 286 VVPTVYTDVSGHTIQSNQFSVTEHFR----SSEQGRLQTL------PGVFFFYDLSPIKV 335
+VPT Y + +++ QFS T H R ++Q TL PG+F ++++SP+KV
Sbjct: 297 IVPTRYEYMDKMVVETAQFSATLHDRPLHGGADQDHPTTLHTKGGFPGLFVYFEMSPLKV 356
Query: 336 TFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
E+H ++ F+ N +GGV V ++D Y Q++I K +
Sbjct: 357 INREQHAQTWSGFILNCITSIGGVLAVGTVLDKITYKAQKSIWGKKSV 404
>gi|444314203|ref|XP_004177759.1| hypothetical protein TBLA_0A04460 [Tetrapisispora blattae CBS 6284]
gi|387510798|emb|CCH58240.1| hypothetical protein TBLA_0A04460 [Tetrapisispora blattae CBS 6284]
Length = 406
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 213/402 (52%), Gaps = 42/402 (10%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
SLDA+ + ED RT +G +ITL + LL +E + T +L++D R L
Sbjct: 8 SLDAFSRTEEDVRVRTKTGALITLGCMGITFLLLLNEWLRFGIIETRPELVIDRERHLKL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKI 128
++ DVTFP +PC ++++D MD +GE LD+ F K RLDS+GN + + +
Sbjct: 68 DLDLDVTFPNMPCDLINLDLMDDAGEIQLDILSSGFTKTRLDSRGNELGTFDFDLSKDIS 127
Query: 129 DKPLQRHGGRLEHNETYCGSCYGA--ESSDED--------CCNNCEEVREAYRKKGWALS 178
+ P ++ YCG CYGA +S+++D CC C +VR+AY GWA
Sbjct: 128 EYP--------PDDDKYCGPCYGALDQSNNKDDMPMDEKVCCQTCADVRQAYLNAGWAFF 179
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ I+QC+REG++QRI + EGC I G +N++ GN HFAPG +F H HD
Sbjct: 180 DGKDIEQCEREGYVQRINDHLNEGCRIQGNARLNRIHGNVHFAPGLAFQNRRGHYHDTSL 239
Query: 239 FQRDS-FNISHKINKLAFGEHF-PGV--------VNPLDGVRWT-QETPSGM-YQYFIKV 286
+ + + +H IN L+FG+H PG+ V+PLDG + + P + + YF K+
Sbjct: 240 YDKKTELTFNHIINHLSFGKHVKPGIGSKFSAASVSPLDGHQMILNDDPHNVQFIYFAKI 299
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFR----------SSEQGRLQTLPGVFFFYDLSPIKVT 336
VPT Y + I++ QFS T H + + + R PG++ Y++SP+KV
Sbjct: 300 VPTRYEYLDKDVIETAQFSTTTHSKALNNLADDKTTPKPSRRSGTPGLYINYEMSPLKVI 359
Query: 337 FTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIK 377
E+HV +++ F+ N +GGV V +ID Y QR I+
Sbjct: 360 NREQHVQTWVSFILNCLTSIGGVLAVGTVIDKIFYRAQRTIQ 401
>gi|344301666|gb|EGW31971.1| hypothetical protein SPAPADRAFT_50577 [Spathaspora passalidarum
NRRL Y-27907]
Length = 410
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 136/407 (33%), Positives = 224/407 (55%), Gaps = 35/407 (8%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
++ SLDA+ K +D RT SGG+ITL+ ++ L+L +E Y +T +L+VD
Sbjct: 8 RLLSLDAFAKTVDDARIRTTSGGIITLLCVLITLVLIRNEYIDYTTVITRPELVVDRDIN 67
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK--RLDSQGNVIESRQDGIG 124
+ L IN D++F LPC + S+D +D +G+ L++ + F+K + +GN++ D
Sbjct: 68 KQLVINLDISFINLPCDMASIDLLDETGDMQLNIINAGFQKLRLIKDKGNIVREISDDTP 127
Query: 125 APKIDKPLQR------HGGRLEHNETYCGSCYGA--ESSDEDCCNNCEEVREAYRKKGWA 176
A +D+PL GG + CGSCYGA + + CCN+C V+ AY ++ W+
Sbjct: 128 ALNLDRPLSEVVKGLPEGG----DPKTCGSCYGALPQEKHQYCCNDCYSVKRAYAERRWS 183
Query: 177 LSNPDLIDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
+ + I+QC++EG+++R+++ + EGC I G ++N+V+G FAPG SF G HVH
Sbjct: 184 FFDGENIEQCEKEGYVKRLRQRINDNEGCRIKGSAKINRVSGTMDFAPGASFTSDGRHVH 243
Query: 235 DILAFQR--DSFNISHKINKLAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVV 287
D+ + + D FN H IN L+FG E V+PLDG ++ + Y++KVV
Sbjct: 244 DVSLYGKYQDKFNFDHIINHLSFGSNDAREEILNSVHPLDGYQFMLHKKHHVASYYLKVV 303
Query: 288 PTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVT 336
T + + + +NQFSV H R G+ + +PGV F +D+SP+K+
Sbjct: 304 ATRFESLDQSKRLDTNQFSVITHDRPLTGGKDEDHEHTLHARGGIPGVEFHFDISPLKII 363
Query: 337 FTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
E++ ++ F+ V + + GV V +ID +Y Q+AI+ K +I
Sbjct: 364 NKEQYAKTWSGFVLGVISSIAGVLMVGTLIDRSVYATQQAIRGKKDI 410
>gi|322693278|gb|EFY85144.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Metarhizium acridum CQMa 102]
Length = 356
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 135/359 (37%), Positives = 186/359 (51%), Gaps = 64/359 (17%)
Query: 75 VTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQ 133
+TFP +PC +L++D MD+SGEQ V H + RL R + G ID K ++
Sbjct: 1 MTFPRMPCELLTLDVMDVSGEQQHGVSHGVKNVRL---------RPESQGGGVIDIKSMK 51
Query: 134 RHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKR 188
H EH + +YCG CYGA + CCN C+EVREAY +GWA + ++QC R
Sbjct: 52 VHDDPAEHLDPSYCGECYGATAPPNARKAGCCNTCDEVREAYASQGWAFGRGENVEQCTR 111
Query: 189 EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR----DSF 244
E + +R+ E+ EGC + G LEVNKV GNFH APG+SF +HVHD+ +
Sbjct: 112 EHYAERLDEQREEGCRVEGHLEVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETPNGKQH 171
Query: 245 NISHKINKLAFGEHFPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVP 288
+ +H I++L FG P V NPLDG R P+ Y YF+K+VP
Sbjct: 172 DFTHTIHQLRFGPQLPAAVSDRLGKGSMPWTNHHINPLDGTRQETGDPAFNYMYFVKIVP 231
Query: 289 TVY---------TDVSGHT-------IQSNQFSVTEHFRSSEQGRLQT------------ 320
T Y + +G T ++++Q+SVT H RS E G
Sbjct: 232 TSYLPLGWEKRFKNAAGSTYGNADGSLETHQYSVTSHKRSLEGGNDAAEGHAERQHSQGG 291
Query: 321 LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
+PGVFF YD+SP+KV EE +F FL +CAIVGG TV+ +D ++ G +KK
Sbjct: 292 IPGVFFSYDISPMKVINREEPAKTFTGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 350
>gi|146416067|ref|XP_001484003.1| hypothetical protein PGUG_03384 [Meyerozyma guilliermondii ATCC
6260]
Length = 404
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 140/407 (34%), Positives = 220/407 (54%), Gaps = 44/407 (10%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ S DA+ K ED RT +GG+ITL+ IV+L L +E Y + + +L+VD
Sbjct: 5 KLLSFDAFAKTVEDARVRTPAGGIITLICVIVVLYLIRNEYLEYTSIINRPELVVDRDIN 64
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA 125
+ L IN D++FP +PC +L++D +D+SG+ +D+ F+K RL G +E R +
Sbjct: 65 KKLEINLDISFPDIPCDVLTMDILDVSGDLQVDLLLSGFEKFRLLKDG--LEIRDES--- 119
Query: 126 PKIDKPLQRHGGRLEHN------ETYCGSCYGAESSDED---CCNNCEEVREAYRKKGWA 176
P+ G LE + CGSCYGA DE+ CCN+CE VR AY +K W
Sbjct: 120 -----PVMSSAGELEERARGRAPDGLCGSCYGALPQDENLDYCCNDCETVRLAYAQKAWG 174
Query: 177 LSNPDLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
+ + I+QC+REG++ R+ E+ EGC I G ++N+++GN HFAPG SF G H H
Sbjct: 175 FFDGENIEQCEREGYVARLNEKINNFEGCRIKGTGKINRISGNLHFAPGASFTAPGSHFH 234
Query: 235 DILAFQR--DSFNISHKINKLAFG------EHFPG-VVNPLDGVRWTQETPSGMYQYFIK 285
D+ F + D F H IN L FG + F + +PLD ++ +Y Y++K
Sbjct: 235 DLSLFNKYDDKFTFDHVINHLLFGLDPHNIQFFEKQLTHPLDKSSMILKSKDRLYSYYLK 294
Query: 286 VVPTVYTDVSGHT--IQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPI 333
VV T + ++ +T +++NQF V H R G+ LPGVFF +++ P+
Sbjct: 295 VVATRFEFLTPNTPALETNQFLVISHHRPLAGGKDDDHQHTLHARGGLPGVFFHFEILPM 354
Query: 334 KVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
K+ E++ ++ F+ V + + GV V ++D ++ +R I+ K
Sbjct: 355 KIINKEQYAKTWSGFVLGVISSIAGVLMVGALLDRSVWAAERVIRAK 401
>gi|149237735|ref|XP_001524744.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146451341|gb|EDK45597.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 411
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 145/412 (35%), Positives = 226/412 (54%), Gaps = 31/412 (7%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + K+ SLDA+ K ED +T SGG+ITL+ +V L+L +E Y VT +L+
Sbjct: 1 MSSPRPKLISLDAFAKTVEDARIKTASGGIITLLCCLVALILIRNEYIDYTTIVTLPELV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGN--VIE 117
VD + L IN D++FP LPC ++++D D +G+ LDV + +K R+ +GN V+E
Sbjct: 61 VDRDINKQLEINMDMSFPNLPCDMINMDLFDETGDMKLDVINSGLEKYRIIKRGNNKVVE 120
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNET-YCGSCYGAESSD--EDCCNNCEEVREAYRKKG 174
D A + ++PL L NE CGSCYGA D E CCN+C VR AY K
Sbjct: 121 ELDDQ-PALRREQPLHEICKGLGENEQGECGSCYGALPQDKKEYCCNSCAAVRRAYAHKK 179
Query: 175 WALSNPDLIDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
W + + I+QC++EG++Q++K+ + EGC + G ++N+VAG FAPG S +G H
Sbjct: 180 WQFFDGENIEQCEKEGYVQKLKDRINQNEGCRVKGSAKINRVAGTMDFAPGISTTSNGQH 239
Query: 233 VHDILAFQR--DSFNISHKINKLAFGEHFPGVVN--------PLDGVRWTQETPSGMYQY 282
VHD+ + + D FN H I+ L+FG+ + N PLDG + Q M Y
Sbjct: 240 VHDLSLYTKYPDKFNFDHVIHHLSFGKIPTAITNLQETDSLSPLDGHSFLQHKRYHMNNY 299
Query: 283 FIKVVPTVYTDVSG-HTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLS 331
++K+V T + ++ G + +NQFSV H R G+ + +P V F +D+S
Sbjct: 300 YLKIVSTRFENLDGTKKVDTNQFSVITHDRPLVGGKDEDHQHTLHARGGVPSVAFHFDIS 359
Query: 332 PIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
P+K+ E + ++ F+ V + V GV V ++D ++ Q+A+K K ++
Sbjct: 360 PLKIINRERYAKTWSGFVLGVVSSVAGVLMVGALLDRSVFAAQQAMKGKKDL 411
>gi|320583549|gb|EFW97762.1| COPII-coated vesicle membrane protein Erv46, putative [Ogataea
parapolymorpha DL-1]
Length = 400
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 142/400 (35%), Positives = 208/400 (52%), Gaps = 37/400 (9%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I DA+ K +D +T SGG++TLV + LLL +E Y VT +L+VD R +
Sbjct: 7 ILRFDAFSKTVDDARIKTTSGGILTLVCILTTLLLLINEYTDYSRIVTRPELVVDRDRHK 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAP 126
L IN D++F +PC +L++D MD SG+ LD+ F K RLD QGN I G
Sbjct: 67 KLEINLDISFQNMPCDLLTMDIMDQSGDMQLDLLSSGFSKIRLDRQGNEI-----GQENM 121
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWAL 177
++++ + TYCGSCYGA + CCN+CE V++AY + W
Sbjct: 122 RVNQEF----ALTSSDPTYCGSCYGAADQSRNDELPQDQKVCCNSCESVKQAYARNAWKF 177
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ I+QC++EG++ RI EGC + G E+ ++ GN HFAPG S + + HVHD+
Sbjct: 178 YDGKDIEQCEKEGYVDRINARLDEGCRVRGTAEIARIGGNLHFAPGSSMNFNEKHVHDLS 237
Query: 238 AFQRDS--FNISHKINKLAFGEHFPGVVN-----PLDGVRWTQETPSGMYQYFIKVVPTV 290
+ S FN H IN +FG V + PLD +Y YF+KVV T
Sbjct: 238 LYDMHSNKFNFDHTINHFSFGLDDHSVADYKTTHPLDATTHRDGRKYHVYSYFLKVVNTR 297
Query: 291 YTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEE 340
Y + G +++NQFS T+H R GR + LPGVFF +++SP+K+ E+
Sbjct: 298 YEFLDGRKVETNQFSATQHDRPFRGGRDEDHPNTIHAQGGLPGVFFHFEISPLKIINREQ 357
Query: 341 H-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
+ ++ F CA + GV TV ++D I+ R +K K
Sbjct: 358 YNKTWSAFALGACAAISGVLTVFTLLDRTIWAANRMLKDK 397
>gi|443925078|gb|ELU44001.1| ER-derived vesicles protein ERV46 [Rhizoctonia solani AG-1 IA]
Length = 383
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 134/380 (35%), Positives = 202/380 (53%), Gaps = 45/380 (11%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
+ LD + K ED +T +G +T++S+ ++L E Y V ++ +LVD SRGE
Sbjct: 11 KGLDGFGKTMEDVKVKTRTGAFLTMLSAAIILTFTIIEFIDYRRVVVDSSILVDRSRGEK 70
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI 128
L + ++TFP +P +LS+D DISGE D+ H++ K RLDS G +I QDG ++
Sbjct: 71 LTVKMNITFPRVP--LLSLDVTDISGEIQQDLTHNMVKTRLDSNGQII---QDGFHNNEL 125
Query: 129 DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKR 188
D +++ + YCGSCYG E + CC CE VR+AY +GW+ +PD I+QC
Sbjct: 126 DNDVEK--TMKARPQGYCGSCYGGEPPEGGCCQTCESVRQAYMNRGWSFGDPDAIEQCVA 183
Query: 189 EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNI 246
E + +I E+ EGC+I G + VNKV GNFHF+PG+SF + H D++ + +D +
Sbjct: 184 EHWTAKIHEQNSEGCHISGRVRVNKVTGNFHFSPGRSFVLNRGHFQDLVPYLKDGNHHDF 243
Query: 247 SHKINKLAF-GE-----HFPGV-------------VNPLDGVRW---TQETPSGMYQYFI 284
H +++ F GE + G NPLD V + M+QYF+
Sbjct: 244 GHYVHEFRFEGESEAEDEWRGTDRGTRWRKKVGISANPLDQVSAHVVDDRASNYMFQYFM 303
Query: 285 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR--------------LQTLPGVFFFYDL 330
KVV T + + G I+S+Q+SVT + R G +Q LPG FF +++
Sbjct: 304 KVVSTEFKYLDGDIIRSHQYSVTSYERDLTHGDGAERDSHGTLTAHGVQGLPGAFFNFEI 363
Query: 331 SPIKVTFTEEHVSFLHFLTN 350
SP+ V E +F HF T+
Sbjct: 364 SPMMVVHRETRQTFAHFATS 383
>gi|67479189|ref|XP_654976.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56472072|gb|EAL49587.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 361
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 130/383 (33%), Positives = 215/383 (56%), Gaps = 30/383 (7%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ D Y K+ ED +R GG +T++ +++++L +E YL +LLVD R
Sbjct: 1 MKRFDTYGKVPEDLRTRHCFGGFLTIICVVIIIVLSIAEFAFYLQREVVPQLLVDRERSS 60
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAP 126
+ ++FD+TFP C I SVD + SGE + ++ ++ K R+ G+++ E+ I +
Sbjct: 61 KIPVHFDITFPYSSCPITSVDILTKSGESMIGIEQNVTKIRIHHDGSLVTENEMKAIQSK 120
Query: 127 -KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
I+ P + C SCYGAE+ ++ CC C++V+EAY+K+GW L + +++ Q
Sbjct: 121 LSIETPDPKE----------CRSCYGAETPEKKCCFTCDDVKEAYKKRGWRL-DLNIVSQ 169
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
C+ +Q K + EGC + G +NK+ GNFH APG S G H H++ + +
Sbjct: 170 CQNHEKIQMAKLTKDEGCRLIGDFLLNKIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQID 229
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETP----SGMYQYFIKVVPTVYTDVSGHTIQS 301
+SHK N+L+FGE + ++T E + M+QY++ ++P ++G T
Sbjct: 230 LSHKWNELSFGE---------NSKKFTTEKKDTQMNSMFQYYLTIIPIKNNFING-TSTF 279
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
+S+ E+ RS E G Q PGVF +YD+SP+ + TE + FLHFL +C+IVGG+FT
Sbjct: 280 YDYSIQENIRSGE-GEGQ--PGVFIYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTT 336
Query: 362 SGIIDAFIYHGQRAIKKKIEIGK 384
+ DA ++ +KKK+E+GK
Sbjct: 337 FQLFDAIVFESIHTLKKKVELGK 359
>gi|366996541|ref|XP_003678033.1| hypothetical protein NCAS_0I00190 [Naumovozyma castellii CBS 4309]
gi|342303904|emb|CCC71687.1| hypothetical protein NCAS_0I00190 [Naumovozyma castellii CBS 4309]
Length = 409
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 144/412 (34%), Positives = 221/412 (53%), Gaps = 58/412 (14%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
S+DA+ K ED RT SG +IT+ ++ L+L +E Y + V+ L++D R L
Sbjct: 10 SIDAFSKTQEDVRIRTKSGAIITICCIVITLILLLNEYIQYTHIVSRPTLVIDRERNLKL 69
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHD--IFKKRLDSQGNVIESRQDGIGAPK 127
+N D+TFP++PC +L++D +D SGE LD+ + K R+DS GN ++S + +
Sbjct: 70 ELNLDITFPSIPCDLLNLDILDDSGELQLDLLQEGSFTKTRVDSNGNALDSMKFKLDDEV 129
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGA-ESSDED--------CCNNCEEVREAYRKKGWALS 178
+ P Q ++ YCGSCYGA + S+ D CC +CE+VR AY GWA
Sbjct: 130 GEYPPQ--------DDNYCGSCYGALDQSNNDNLPKDEKVCCQDCEQVRNAYLTAGWAFF 181
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ I+QC+REG++ RI EGC + G + +N++ GN HFAPG++F + H HD
Sbjct: 182 DGKKIEQCEREGYVARINSHLNEGCRVKGDVLLNRIHGNIHFAPGRAFQNTKGHFHDTSL 241
Query: 239 FQRD-SFNISHKINKLAFGEHFPGVV---------NPLDGVRWTQETPSGMYQ--YFIKV 286
+++ S N +H IN L+FG+ + +PLDG + + S +Y+ YF K+
Sbjct: 242 YEQTLSLNFNHIINHLSFGKSVEQLAEVRGASVSTSPLDGQQVSPSFDSHLYRYSYFTKI 301
Query: 287 VPTVYTDVSGHTIQSNQFSVT--------------EHFRSSEQGRLQTLPGVFFFYDLSP 332
VPT Y + G ++ QFS T H R S G LPGVF ++++SP
Sbjct: 302 VPTRYEWLDGVVAETAQFSATFHESPVNGAMDPEHPHIRHSRTG----LPGVFIYFEMSP 357
Query: 333 IKVTFTEEHVS-----FLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
+KV E+H FLH +T+ +GG+ V ++D Y QR I+K+
Sbjct: 358 LKVINQEQHFKSWSGVFLHGITS----MGGILAVGTVLDKIFYRAQRTIQKR 405
>gi|367007030|ref|XP_003688245.1| hypothetical protein TPHA_0N00300 [Tetrapisispora phaffii CBS 4417]
gi|357526553|emb|CCE65811.1| hypothetical protein TPHA_0N00300 [Tetrapisispora phaffii CBS 4417]
Length = 407
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 140/409 (34%), Positives = 208/409 (50%), Gaps = 43/409 (10%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M K+ DA+ K +ED RT GG+ITL + + L E + + +L+
Sbjct: 1 MSEKKTKLAKFDAFSKTDEDVRIRTRLGGIITLGCILTAIYLLGGEWAAFNEVTSVPRLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESR 119
VD R L +N D++FP +PC I+++D MD +G LD+ FKK RLD G +E R
Sbjct: 61 VDKDRSIDLNMNLDISFPFIPCDIINLDIMDDAGGLQLDILDSGFKKTRLDPNGKQLEFR 120
Query: 120 QDGIGAPKIDKPLQRHGGRL--EHNETYCGSCYGA--------ESSDEDCCNNCEEVREA 169
+ L+ + R+ E YCGSCYGA E + + CCN CE+VR A
Sbjct: 121 E---------FDLKDNSKRIVSEKGPNYCGSCYGAIDQSHNDEEGAKKVCCNTCEDVRLA 171
Query: 170 YRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS 229
Y WA + I+QC+ EG+++RI E EGC + G ++N+V GN HFAPGK S
Sbjct: 172 YVTANWAFFDGKNIEQCEDEGYVKRINEHLNEGCRVTGKAKINRVKGNIHFAPGKPMQNS 231
Query: 230 GVHVHDILAFQRD-SFNISHKINKLAFGEHFPG---------VVNPLD--GVRWTQETPS 277
H+HD +++ + N H I+ +FGE + NPLD V+ +T
Sbjct: 232 KGHLHDTSLYEKSPNMNFKHIIHHFSFGEPIDRKAKSKGADVLTNPLDDYDVQPNIDTHY 291
Query: 278 GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFF 327
+ Y++KVVPT Y ++ +++ QFSVT H R G+ + +PGVFFF
Sbjct: 292 HQFSYYMKVVPTRYEYLNRMVVETAQFSVTFHDRPLRGGKDEDHPNTIHARNGIPGVFFF 351
Query: 328 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 375
+D+S IKV E+ ++ F+ N +GGV V ++D Y Q+
Sbjct: 352 FDISSIKVINNEQITQTWSGFILNCIITIGGVLAVGSMVDRLSYKAQKT 400
>gi|50305633|ref|XP_452777.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49641910|emb|CAH01628.1| KLLA0C12947p [Kluyveromyces lactis]
Length = 405
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 136/403 (33%), Positives = 205/403 (50%), Gaps = 41/403 (10%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
S+DA+ K ED RT +GG+IT+ I+ +LL SE + + VT L+VD R L
Sbjct: 8 SIDAFGKTEEDVRVRTRTGGLITVSCIIITMLLLVSEWKQFSTIVTRPDLVVDRDRHLKL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKI 128
+N DVTFP++PC++L++D +D SGE +++ F K R+ +G + + +G
Sbjct: 68 DLNLDVTFPSMPCNVLNLDILDDSGEFQINLLDSGFTKIRISPEGKELSKEKFQVGDKSS 127
Query: 129 DKPLQRHGGRLEHNETYCGSCYGA-ESSDED--------CCNNCEEVREAYRKKGWALSN 179
+ G YCG CYGA + S D CC C++VR AY +KGWA +
Sbjct: 128 KQSFNEEG--------YCGPCYGALDQSKNDELPQDQKVCCQTCDDVRAAYGQKGWAFKD 179
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
++QC+REG+++ I EGC + G ++N++ G HF PG S H HD +
Sbjct: 180 GKGVEQCEREGYVESINARIHEGCRVQGRAQLNRIQGTIHFGPGSSMRNIRGHFHDTSLY 239
Query: 240 QR-DSFNISHKINKLAFGEH---------FPGVVNPLDG--VRWTQETPSGMYQYFIKVV 287
N +H IN L FGE ++PLD V ++T + YF K++
Sbjct: 240 DAYPHLNFNHIINTLTFGEKPKDGDSELIGSASISPLDSRQVFPDRDTHFHEFSYFCKII 299
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTF 337
PT + + G +++ QFS T H R GR + +PGVFF +++SP+KV
Sbjct: 300 PTRFEFLDGKKVETTQFSATYHDRPLRGGRDEDHPNTVHSKGGVPGVFFNFEMSPLKVIN 359
Query: 338 TEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
E+H S+ FL N +GGV V +ID Y Q++I K
Sbjct: 360 KEQHATSWSGFLLNCITSIGGVLAVGTVIDKITYRAQKSIWGK 402
>gi|403215799|emb|CCK70297.1| hypothetical protein KNAG_0E00290 [Kazachstania naganishii CBS
8797]
Length = 408
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 136/404 (33%), Positives = 204/404 (50%), Gaps = 40/404 (9%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
S+DA+ + +D RT +G +TL + + L SE R + V+ + L++D G L
Sbjct: 9 SMDAFSRAEDDVRVRTRAGAYVTLACLVTTVFLLLSEYRQWNTIVSRSSLVIDREHGLKL 68
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHD---IFKKRLDSQGNVIESRQDGIGAP 126
+ DVTFP LPC ++S D +D SG LDV + K R+D +G +++ A
Sbjct: 69 DLRLDVTFPHLPCDLVSFDVLDDSGVLLLDVDDENNHFTKTRIDQRGEPLDA------AA 122
Query: 127 KIDKPLQRHGGRLEHNET-YCGSCYGA---------ESSDEDCCNNCEEVREAYRKKGWA 176
L +L + YCGSCYG+ + +++ CCN C VREAY GWA
Sbjct: 123 AASFKLDAEAAQLPPTDPDYCGSCYGSRDQTRNDELDPANKVCCNTCSSVREAYLDAGWA 182
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+ I+QC+REG++ +I + EGC I G + +N+V GN HFAPG +F + H HD
Sbjct: 183 FFDGKNIEQCEREGYVDKISQRITEGCRIKGGVRLNRVQGNIHFAPGDAFRSARGHFHDT 242
Query: 237 LAF-QRDSFNISHKINKLAFGEHFPGV----------VNPLDGVRWTQETPSGMYQ--YF 283
+ Q S N H I+ L+FG + + PLDG + S YQ YF
Sbjct: 243 SMYDQTGSLNFDHIIHHLSFGPSVDNMQSLEKASNVAIAPLDGKQVLPRYDSHAYQYTYF 302
Query: 284 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-------LPGVFFFYDLSPIKVT 336
K+VPT + SG I++ QFS T R G +T PG++F ++SP+KV
Sbjct: 303 TKIVPTRFEYFSGSVIETTQFSSTFSARPIGGGTTETATYTSGGTPGLYFNIEMSPLKVI 362
Query: 337 FTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
E++ +S+ FL N +GGV V ++D +Y +R + K
Sbjct: 363 HKEQNKISWSGFLLNCITSIGGVLAVGTVVDKILYRAERTLLNK 406
>gi|148674214|gb|EDL06161.1| ERGIC and golgi 3, isoform CRA_a [Mus musculus]
Length = 238
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 124/249 (49%), Positives = 166/249 (66%), Gaps = 14/249 (5%)
Query: 30 VITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDA 89
+T+VS ++MLLLF SEL+ YL +L VD SRG+ L+IN DV FP +PC+ LS+DA
Sbjct: 1 AVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGDKLKINIDVLFPHMPCAYLSIDA 60
Query: 90 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 149
MD++GEQ LDV+H++FKKRLD G + S + K++ + L+ N C SC
Sbjct: 61 MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTV-FDPNSLDPNR--CESC 117
Query: 150 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 209
YGAES D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 118 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 177
Query: 210 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 269
EVNKV G G Q VHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 178 EVNKVPG------GSKARQL---VHDLQSFGLDNINMTHYIKHLSFGEDYPGIVNPLDHT 228
Query: 270 RWTQETPSG 278
T P G
Sbjct: 229 NVT--APQG 235
>gi|47214843|emb|CAF95749.1| unnamed protein product [Tetraodon nigroviridis]
Length = 299
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 108/230 (46%), Positives = 153/230 (66%), Gaps = 13/230 (5%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+K++ DAYPK EDF +T+ G +T++S ++ML+LF SEL+ YL +L VDTSR
Sbjct: 5 SKLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYYLTKEVHPELYVDTSR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGI 123
G+ L+IN D+ FP +PC LS+DAMD++GEQ LDV+H++FK+RLD + E+ + +
Sbjct: 65 GDKLKINIDIVFPHMPCVYLSIDAMDVAGEQQLDVEHNLFKQRLDKNLKPVSTEAEKHEL 124
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G + + L+ N C SCYGAE+ D CCN+C++VREAYR++GWA N D I
Sbjct: 125 GGAEDVEVFD--PSTLDPNR--CESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADTI 180
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVA-------GNFHFAPGKSF 226
+QCKREGF Q+++E++ EGC +YG LEVNKV+ G F GK F
Sbjct: 181 EQCKREGFTQKMQEQKNEGCQVYGVLEVNKVSLIAQEGGGKFSLCSGKKF 230
>gi|168004249|ref|XP_001754824.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693928|gb|EDQ80278.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 130/364 (35%), Positives = 204/364 (56%), Gaps = 43/364 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I++LDA+P+ E +T SG ++ + +M +LFF ELR YL VT ++ VD RGE
Sbjct: 9 IKNLDAFPRAEEHLLQKTSSGAAVSAIGLFIMGVLFFHELRFYLETVTVHEMSVDVKRGE 68
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L I+ ++TFPALPC +LS+DA+D+SG+ +D+ +I+K R+ G V+ S
Sbjct: 69 KLPIHINMTFPALPCEVLSLDAIDMSGKHEVDLDTNIWKLRIHRDGYVLGS--------- 119
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREA-YRKKGWALSNPD-LIDQ 185
E G +E + +E ++ +RKK +P +I++
Sbjct: 120 ---------------EFVNDLVEGEHRKEEPKADKKDEHKDGDHRKK-----DPQKVINE 159
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K+ ++GEGC I+G L+V +VAGNFH S H ++V + N
Sbjct: 160 VKK-------AIDDGEGCQIFGVLDVERVAGNFHI----SMHGLSLYVASKIFEAGYEVN 208
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 305
+SH I+ L+FG +PG NPLDG SG ++YF+K+VPT Y + G + +NQFS
Sbjct: 209 VSHVIHDLSFGPTYPGHHNPLDGSERILHDTSGTFKYFLKIVPTEYHYLHGEVMPTNQFS 268
Query: 306 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
VTE+++ ++ ++ P V+F YDLSPI VT E +F HF+T +CA++GG F V+G++
Sbjct: 269 VTEYYQRTKPSD-RSYPAVYFVYDLSPIVVTIREHRRNFGHFITRLCAVLGGTFAVTGML 327
Query: 366 DAFI 369
D ++
Sbjct: 328 DRWM 331
>gi|326490247|dbj|BAJ84787.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326493774|dbj|BAJ85349.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 348
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 129/366 (35%), Positives = 204/366 (55%), Gaps = 43/366 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ +A+P + +T+SG ++T++ IVM+ LF EL YL T ++ VD RGE
Sbjct: 7 LKNFNAFPHAEDHLLKKTYSGAIVTILGLIVMVTLFAHELTFYLTTYTMHQMSVDLKRGE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
TL I+ +V+FP+LPC +LSVDA+D+SG+ +D+ +I+K RLD G + IG
Sbjct: 67 TLPIHINVSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGQI-------IGTEY 119
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ +++ ++ G + + E N E+ + + A+ N
Sbjct: 120 LSDLVEK---EHGTHDHDHGHGHDVQKQPEHTFN--EDADKMVKSVKLAMEN-------- 166
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
GEGC +YG L+V +VAGNFH S H + V + + N+S
Sbjct: 167 ------------GEGCRVYGALDVQRVAGNFHI----SVHGLNIFVANQIFDGSSHVNVS 210
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
H I++L+FG +PG+ NPLD SG ++Y+IKVVPT Y +S + +NQFSVT
Sbjct: 211 HVIHRLSFGPEYPGIHNPLDDTSRILHDTSGTFKYYIKVVPTEYRYLSKGVLPTNQFSVT 270
Query: 308 EHF---RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
E+F R ++ ++ P V+F YDLSPI VT EE +FLHF+T +CA++GG F ++G+
Sbjct: 271 EYFVPIRPTD----RSWPAVYFLYDLSPITVTIREERRNFLHFITRLCAVLGGTFAMTGM 326
Query: 365 IDAFIY 370
+D ++Y
Sbjct: 327 LDRWMY 332
>gi|302414546|ref|XP_003005105.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
gi|261356174|gb|EEY18602.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
Length = 349
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 138/387 (35%), Positives = 193/387 (49%), Gaps = 62/387 (16%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG+IT+VS ++ L + E Y +L+VD R
Sbjct: 5 SRFTKLDAFTKTVDEARIRTSSGGIITIVSLFIVFWLAWGEWADYRRITLHPELIVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
GE + I+ ++TFP +PC +L++D MD+SGEQ + I K RL SQ +DG G
Sbjct: 65 GEKMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGIVSGISKVRLRSQ-------KDGGGV 117
Query: 126 PKID-KPLQRHGGRLEHNE---TYCGSCYGAESS----DEDCCNNCEEVREAYRKKGWAL 177
ID K L H YCG CYGA++ + CCN CEEVREAY + WA
Sbjct: 118 --IDTKALSLHAADEAATHLAPDYCGDCYGAKAPANAVKQGCCNTCEEVREAYAQASWAF 175
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ ++QC RE + +R+ E+ EGC I G L VNKV GNFH APG+SF +HVHD+
Sbjct: 176 GKGENVEQCTREHYAERLDEQRAEGCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVHDLK 235
Query: 238 AFQRDSF--NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ + +H+I+ L F V +D
Sbjct: 236 NYWDAEIIHDFTHQIHALRF----------------------------------VLSDEP 261
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNV 351
+ S H RL T +PGVFF YD+SP+KV EE SF FLT +
Sbjct: 262 QAQLSGGDDSAEGHAE-----RLHTRGGIPGVFFSYDISPMKVINREERSKSFTGFLTGL 316
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKK 378
CA++GG TV+ +D ++ G +KK
Sbjct: 317 CAVIGGTLTVAAAVDRGMFEGSLRLKK 343
>gi|294657513|ref|XP_459821.2| DEHA2E11792p [Debaryomyces hansenii CBS767]
gi|199432751|emb|CAG88060.2| DEHA2E11792p [Debaryomyces hansenii CBS767]
Length = 402
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 134/400 (33%), Positives = 217/400 (54%), Gaps = 27/400 (6%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ S+D + K ED +T SGG+ITLV +++ L +E + Y + +T +L+VD
Sbjct: 6 KLISIDVFAKTVEDAKIKTASGGIITLVCIFIVMFLIRNEYKDYTSIITRPELVVDRDIN 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA 125
L IN DV+FP +PC +L++D +DISG+ LD+ F+K R+ + N D
Sbjct: 66 TKLDINLDVSFPNMPCDVLTLDILDISGDLQLDILKSGFQKYRILKESN--HEILDEAPV 123
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGA--ESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
D L+ + N CG CYGA + ++E CCN+CE V+ AY +K WA + I
Sbjct: 124 LSNDLSLEEMAKGVGANGK-CGPCYGALPQDNNEYCCNSCETVKLAYAEKMWAFYDGKDI 182
Query: 184 DQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
+QC+ EG++ R+ E EGC + G ++N+++GN HFAPG S G H+HD+ F++
Sbjct: 183 EQCENEGYVSRLTERINNNEGCRVKGTAQINRISGNLHFAPGSSSTAPGRHIHDLSLFEK 242
Query: 242 --DSFNISHKINKLAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV 294
D FN H IN +FG + +PLD + + + Y++KVV T + +
Sbjct: 243 YEDKFNFDHVINHFSFGSDPHDNNLQQSTHPLDNHQLVFDEKYHVASYYLKVVATRFEFI 302
Query: 295 -SGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV- 342
+ + +NQFSV H R G+ + LPGVFF +++SP+K+ E++
Sbjct: 303 DTSLPLDTNQFSVISHHRPLRGGKDEDHKHTLHARGGLPGVFFHFEISPMKIINKEQYAK 362
Query: 343 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
++ F+ V + V GV V ++D ++ ++AIK K ++
Sbjct: 363 TWSGFILGVISSVAGVLMVGTVLDRSVWAAEKAIKGKKDM 402
>gi|156838396|ref|XP_001642904.1| hypothetical protein Kpol_367p1 [Vanderwaltozyma polyspora DSM
70294]
gi|156113483|gb|EDO15046.1| hypothetical protein Kpol_367p1 [Vanderwaltozyma polyspora DSM
70294]
Length = 404
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 138/407 (33%), Positives = 210/407 (51%), Gaps = 48/407 (11%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
N I + D + K ED RT GG+ITL +L FSE + + +T+ L++D
Sbjct: 4 NSILAYDVFTKTEEDVRIRTRVGGIITLCCLSFTAILLFSEWINFNHVITKPNLVIDREH 63
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIG 124
L +N D+TFP +PC +L++D MD SG LD+ F K R+ S G +Q G
Sbjct: 64 HLKLELNIDITFPFIPCQLLNLDIMDDSGNVQLDITESGFTKTRIGSDG-----QQLGTT 118
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGA---------ESSDED-CCNNCEEVREAYRKKG 174
K+ + L + + ++ YCGSCYGA ES D+ CC CE+V+ AY G
Sbjct: 119 NFKVSEDLLEYSPK---DKNYCGSCYGARDQSKNDEAESVDKKVCCQTCEDVKNAYSDAG 175
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
WA + I+QC+REG+++++ ++ EGC I G +N++ GN HFAPGK+F G H H
Sbjct: 176 WAFFDGKNIEQCEREGYVEKMNDQLNEGCRISGEALLNRIHGNIHFAPGKAFQNRGGHFH 235
Query: 235 DILAFQRD--SFNISHKINKLAFG---------EHFPGVVNPLDGVRWTQETPS-----G 278
D +F D + N H I L+FG + + +PLDG QE PS
Sbjct: 236 DT-SFYNDHKNLNFKHMIEHLSFGRPVAQFKSNKDLVAMTSPLDG---HQELPSIDAHNH 291
Query: 279 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--------SSEQGRLQTLPGVFFFYDL 330
+ YF K+VPT + ++ +++Q VT H + S+ Q +PG+F Y++
Sbjct: 292 QFIYFAKIVPTRFEYLNKQAQETSQLVVTSHMKPIGDATDYSTTMNSRQGIPGLFIDYEI 351
Query: 331 SPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
SP+KV E+H ++ FL N +GG+ V + D ++ QR +
Sbjct: 352 SPLKVINREQHATTWSGFLLNCITSIGGILAVGTVADKIVHATQRVV 398
>gi|367017984|ref|XP_003683490.1| hypothetical protein TDEL_0H04200 [Torulaspora delbrueckii]
gi|359751154|emb|CCE94279.1| hypothetical protein TDEL_0H04200 [Torulaspora delbrueckii]
Length = 406
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 133/403 (33%), Positives = 209/403 (51%), Gaps = 41/403 (10%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
S DA+ K ED RT SGG+I+L ++ + L SE + VT +L+VD R L
Sbjct: 9 SFDAFSKTEEDVRIRTRSGGLISLSCVVLTIFLLISEWLNFNQVVTRPQLVVDRDRQLKL 68
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKI 128
D+TFP++PC+++S+D MD +GE LD+ F K R+DS G I +
Sbjct: 69 DFVVDITFPSMPCAMISLDIMDNAGELQLDIMEAGFTKTRIDSNGKEISTSSFDASDSSS 128
Query: 129 DKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSN 179
+ +E YCGSCYGA+ D++ CC C++VR+AY + WA +
Sbjct: 129 --------DYVPDDENYCGSCYGAKDQDKNDELPKEERVCCQTCDDVRKAYLEAEWAFYD 180
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
I+QC+REG+++RI ++ EGC + G ++++ G HFAPG+ F + H HD+ +
Sbjct: 181 GKNIEQCEREGYVERINQQLNEGCRVQGNALLSRIQGTIHFAPGRGFQNNRGHFHDMSLY 240
Query: 240 QRD-SFNISHKINKLAFGEHF---------PGVVNPLDGVRWTQETPSGMYQ--YFIKVV 287
N +H I+ L+FG+ +PLDG + + + ++Q YF K+V
Sbjct: 241 DNTPQLNFNHIIHHLSFGKPINSGAEDRGAATSTHPLDGRQVFPDRDTHLHQFSYFAKIV 300
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQG----RLQTL------PGVFFFYDLSPIKVTF 337
PT Y + +++ QFS T H R G TL PG+F ++++SP+KV
Sbjct: 301 PTRYEYLDDVVVETAQFSTTYHDRPLRGGVDDDHPNTLHSRGGSPGMFVYFEMSPLKVIN 360
Query: 338 TEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
E+H ++ FL N +GGV V ++D +Y Q++I K
Sbjct: 361 KEQHAQTWSGFLLNCITSIGGVLAVGTVLDKVLYKAQKSIWGK 403
>gi|448531492|ref|XP_003870264.1| Erv46 protein [Candida orthopsilosis Co 90-125]
gi|380354618|emb|CCG24134.1| Erv46 protein [Candida orthopsilosis]
Length = 411
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 138/412 (33%), Positives = 216/412 (52%), Gaps = 31/412 (7%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + K+ SLDA+ K ED +T SGG+ITL+ +V L L +E Y + +L+
Sbjct: 1 MSSQRPKLISLDAFAKTVEDARIKTASGGIITLICILVALFLIRNEYIDYTTVIARPELV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESR 119
VD + L IN D++F LPC ++S+D D SG+ LD+ + +K R+ G+ +
Sbjct: 61 VDRDINKQLDINLDISFLNLPCDLVSIDLFDESGDLKLDIINSQLEKFRIIKSGHSSKPT 120
Query: 120 QDGIGAPKIDK--PLQRHGGRLEHNET--YCGSCYGAESSDED--CCNNCEEVREAYRKK 173
+ P + + PL++ L +T CGSCYGA D+ CCN+C VR AY +
Sbjct: 121 EIKDDQPPLQREMPLEQIAPGLPDGQTEGECGSCYGAVPQDKKQYCCNSCAAVRRAYAEA 180
Query: 174 GWALSNPDLIDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
W + + I QC+ EG++QR+++ + EGC + G ++N+VAG FAPG S +
Sbjct: 181 NWQFYDGENIAQCEEEGYVQRLRQRINDNEGCRVKGTTKINRVAGTMDFAPGASMTKER- 239
Query: 232 HVHDILAFQ--RDSFNISHKINKLAFGEHFP-------GVVNPLDGVRWTQETPSGMYQY 282
HVHD+ + +D FN H IN L+FG + P G ++PLDG ++ Q Y
Sbjct: 240 HVHDLSLYMKYKDKFNFDHVINHLSFGNNPPDSQLVDTGSISPLDGHKFLQHKKLHSINY 299
Query: 283 FIKVVPTVYTDVSGH-TIQSNQFSVTEHFRSSEQGR----------LQTLPGVFFFYDLS 331
F+K+V T + + G +NQFS H R G+ +PGV F +D+S
Sbjct: 300 FLKIVATRFESLEGKDKFDTNQFSAITHDRPLAGGKDDDHQHTLHARAGVPGVAFNFDIS 359
Query: 332 PIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
P+K+ EE+ F+ V + + GV V ++D ++ Q+AIK K ++
Sbjct: 360 PLKIINREEYAKTRSGFILGVVSSIAGVLMVGSLMDRSVFAAQQAIKGKKDL 411
>gi|45188262|ref|NP_984485.1| ADR389Cp [Ashbya gossypii ATCC 10895]
gi|44983106|gb|AAS52309.1| ADR389Cp [Ashbya gossypii ATCC 10895]
Length = 392
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 138/403 (34%), Positives = 205/403 (50%), Gaps = 42/403 (10%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ +K+ SLDA+ K ED RT +GG+ITL +V LLL SE R ++++D
Sbjct: 2 VKSKLLSLDAFAKTEEDVRVRTRAGGLITLGCVVVTLLLLVSEWRRLWEVEKRPQVVLDR 61
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDG 122
R + L + D+TF +PC +L++D +D +GE L++ + F K RLD G + +
Sbjct: 62 DRQQKLELRLDITFSQMPCELLNLDIIDDTGEAQLNLLEEGFTKTRLDKHGRTLGKEEFR 121
Query: 123 IGA--PKIDKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYR 171
+G P D ++ YCG CYGA D++ CC C EVR AY
Sbjct: 122 VGETLPSTD------------DQDYCGPCYGARDQDQNENLPRSERVCCQTCGEVRAAYA 169
Query: 172 KKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
+ WA + +QCKREG+ +R++E+ EGC + G ++N+V GN HFAPG S H
Sbjct: 170 EMNWATFDGKGFEQCKREGYTERLQEQINEGCRVAGTAQLNRVHGNIHFAPG-SAHVGKG 228
Query: 232 HVHDILAFQRDS-FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG---MYQYFIKVV 287
H HD ++ + +H I+ L+FG G PL+G E P+G + YF KVV
Sbjct: 229 HAHDDSFYKEHPHLSFNHVIHSLSFGPEIAGNPGPLNGR--AMEVPNGHSHFFSYFAKVV 286
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF----------YDLSPIKVTF 337
P Y ++G +S +FSVT H R GR P F +++SP+KV
Sbjct: 287 PIRYETLAGTITESAEFSVTAHDRPVHGGRDADHPNTVHFRGGMAGMTINFEMSPLKVIQ 346
Query: 338 TEEHVS-FLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
E++ S + F+ N +GGV V ++D YH QR + K
Sbjct: 347 REQYASTWTAFVLNAITSIGGVLAVGTVLDRVTYHTQRTLMGK 389
>gi|365989554|ref|XP_003671607.1| hypothetical protein NDAI_0H01900 [Naumovozyma dairenensis CBS 421]
gi|343770380|emb|CCD26364.1| hypothetical protein NDAI_0H01900 [Naumovozyma dairenensis CBS 421]
Length = 438
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 138/429 (32%), Positives = 216/429 (50%), Gaps = 57/429 (13%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ S DA+ K E+ RT +GG+IT+ +V L L +E + + +T +L+VD R
Sbjct: 8 KLLSFDAFAKTEEEVRIRTNTGGIITISCILVTLYLLLNEWSQFNSVITSPQLVVDRDRN 67
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIF-KKRLDSQGNVIESRQDGIGA 125
L +N D++FP + C ++++D MD SGE LD+ F K RLD QGN +++ + +
Sbjct: 68 LKLELNLDISFPNISCDLINLDIMDESGELQLDLLDSTFIKTRLDPQGNPLDN-DNNVAD 126
Query: 126 PKID-----KPLQRHGGR-----LEHNETYCGSCYGAESSDED---------CCNNCEEV 166
D L ++G + L + YCGSCYG++ E+ CC C +V
Sbjct: 127 TDADLVIGVDDLTKNGEKRLKEILAKDPDYCGSCYGSQDQTENESKSKDQKICCQTCNDV 186
Query: 167 REAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSF 226
R++Y GWA + I+QC+ EG++ +I + EGC I G +N++ GN HFAPGKS+
Sbjct: 187 RDSYLNAGWAFFDGAQIEQCENEGYVAKINKHLEEGCRIKGQALLNRIQGNIHFAPGKSY 246
Query: 227 H----QSGVHVHDILAFQR-DSFNISHKINKLAFGEHFPGV---------------VNPL 266
+ H HD + + N +H I+ L+FG+ V +NPL
Sbjct: 247 SNYKAKGSTHRHDTSLYDKVKKMNFNHIIHHLSFGKSIDKVGKNDLKDYSDRKKFSINPL 306
Query: 267 DGVRWTQE--TPS-GMYQYFIKVVPTVYT--DVSGHTIQSNQFSVTEHFRSSEQGRLQT- 320
D + + P+ + Y+ K+VPT Y D +I++ QFS T H R + G +
Sbjct: 307 DDRKVIVKDFNPAFHQFSYYTKIVPTRYEFLDEKISSIETAQFSATYHSRPIQGGTDEDH 366
Query: 321 ---------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
+PG+FFF+++SPIKV E H ++ FL N +G V V + D Y
Sbjct: 367 PTTFHSRGGIPGLFFFFEMSPIKVINKEHHFRTWSSFLLNCITSIGSVLAVGTVFDKIFY 426
Query: 371 HGQRAIKKK 379
Q+ +K K
Sbjct: 427 RAQKTLKAK 435
>gi|444321132|ref|XP_004181222.1| hypothetical protein TBLA_0F01610 [Tetrapisispora blattae CBS 6284]
gi|387514266|emb|CCH61703.1| hypothetical protein TBLA_0F01610 [Tetrapisispora blattae CBS 6284]
Length = 414
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 139/414 (33%), Positives = 214/414 (51%), Gaps = 47/414 (11%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ S DA+ K +E+ RT +GG+ITL + L L E Y + +++VD R
Sbjct: 4 KLLSFDAFNKTDEEVRIRTRTGGIITLFCILTTLYLLQKEWIEYYKITNKPQVVVDRDRH 63
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA 125
L +N D+TFP+L C ++ +D +D SGE LDV F K R+D+ GN ++ DG
Sbjct: 64 LKLELNLDITFPSLSCDLIGLDIVDDSGETSLDVLESGFTKIRVDTNGNELD---DG--- 117
Query: 126 PKIDKPLQRHG-GRLEHNET-YCGSCYGA----------ESSDEDCCNNCEEVREAYRKK 173
++D R L+ ++ YCG CYGA +S++ CC C +VR+AY
Sbjct: 118 SQLDVGTDRESLSSLDMDKAKYCGPCYGALDQSGNDNIDVASEKVCCQTCYDVRKAYTDV 177
Query: 174 GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
GWA + I+QC+REG++ RI + EGC I G +N++ GN HFAPG +F + H
Sbjct: 178 GWAFFDGKDIEQCEREGYVDRINDHLHEGCRIVGSALLNRIQGNVHFAPGAAFETAKGHF 237
Query: 234 HDILAFQR-DSFNISHKINKLAFGEHFPGVVN-------------PLDG---VRWTQETP 276
HD + + + N +H IN L+FG+ ++ PLDG + ++ T
Sbjct: 238 HDTSLYDKTEQLNFNHIINHLSFGKTGHELLTPKSSKSFSVSRRQPLDGRVMIPESRNTH 297
Query: 277 SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFF 326
+ YF K+VPT + +SG ++ Q+SVT H R + GR + +PG+F
Sbjct: 298 FFQFSYFAKIVPTRFESLSGKVEEAAQYSVTFHSRPLQGGRDEDHPNTFHGRSGIPGLFI 357
Query: 327 FYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
++ ++P+KV E H +F L N +GGV V ++D Y QR+I K
Sbjct: 358 YFQMAPLKVIDIEAHSQTFSGLLLNCITTIGGVLAVGTMMDKVFYKAQRSIWGK 411
>gi|260950825|ref|XP_002619709.1| hypothetical protein CLUG_00868 [Clavispora lusitaniae ATCC 42720]
gi|238847281|gb|EEQ36745.1| hypothetical protein CLUG_00868 [Clavispora lusitaniae ATCC 42720]
Length = 415
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 133/414 (32%), Positives = 220/414 (53%), Gaps = 42/414 (10%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
++ SLDA+ K ED +T SGGVITLV +++L L +E Y++ V +L+V+
Sbjct: 6 RLLSLDAFAKTVEDARVKTASGGVITLVCVLIVLFLIRNEYSDYMSVVVRPELVVNRDVN 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA 125
L IN D+TFP +PC ++S+D +D++G+ HLD+ F+ R+ G E D +
Sbjct: 66 RQLDINLDITFPDVPCGVMSLDILDMTGDLHLDIVESGFEMFRVLPSG---EEISDDLPL 122
Query: 126 PKIDKPLQRHGGRLEHNETY----CGSCYGA--ESSDEDCCNNCEEVREAYRKKGWALSN 179
K + G L +E CG CYGA ++ ++ CCN CE VR AY + W +
Sbjct: 123 LSGAKKFEDVCGPLTEDEISRGVPCGPCYGAVDQTDNKRCCNTCEAVRMAYAVQEWGFFD 182
Query: 180 PDLIDQCKREGFLQRI--KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
I+QC+REG+++++ + EGC I G ++N+++GN HFAPG ++G H HD+
Sbjct: 183 GSNIEQCEREGYVEKMVSRINNNEGCRIKGSAKINRISGNLHFAPGVPLSRNGRHSHDLS 242
Query: 238 AFQR--DSFNISHKINKLAFGEHFPGV--------------VNPLDGVRWTQETPSGMYQ 281
+ + + F+I HKIN +FGE P ++PLDG + + + +
Sbjct: 243 LWTKYSNKFSIDHKINHFSFGED-PSASRRLASTDDSQEPSIHPLDGFHFDLKKKNHVAS 301
Query: 282 YFIKVVPTVYTDVSG--HTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYD 329
Y++ VV T + + G + +NQFSV H R GR +PG FF +D
Sbjct: 302 YYLSVVSTRFEFLDGKKEAVDTNQFSVITHDRPIVGGRDDDHQNTMHAQGGVPGAFFHFD 361
Query: 330 LSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
+SP+K+ EE+ ++ F+ V + + GV TV +D ++ ++ ++ K ++
Sbjct: 362 ISPMKIISREEYAKTWSGFILGVVSSIAGVLTVGAALDRSVWTAEQVLRGKKDM 415
>gi|374107698|gb|AEY96606.1| FADR389Cp [Ashbya gossypii FDAG1]
Length = 392
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 137/403 (33%), Positives = 204/403 (50%), Gaps = 42/403 (10%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ +K+ SLDA+ K ED RT +GG+ITL +V LLL SE R ++++D
Sbjct: 2 VKSKLLSLDAFAKTEEDVRVRTRAGGLITLGCVVVTLLLLVSEWRRLWEVEKRPQVVLDR 61
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDG 122
R + L + D+TF +PC +L++D +D +GE L++ + F K RLD G + +
Sbjct: 62 DRQQKLELRLDITFSQMPCELLNLDIIDDTGEAQLNLLEEGFTKTRLDKHGRTLGKEEFR 121
Query: 123 IGA--PKIDKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYR 171
+G P D ++ YCG CYGA D++ CC C EVR AY
Sbjct: 122 VGETLPSTD------------DQDYCGPCYGARDQDQNENLPRSERVCCQTCGEVRAAYA 169
Query: 172 KKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
+ WA + +QCKREG+ +R++E+ EGC + G ++N+V GN HFAPG S H
Sbjct: 170 EMNWATFDGKGFEQCKREGYTERLQEQINEGCRVAGTAQLNRVHGNIHFAPG-SAHVGKG 228
Query: 232 HVHDILAFQRDS-FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG---MYQYFIKVV 287
H HD ++ + +H I+ L+FG G PL+G E P+G + YF KVV
Sbjct: 229 HAHDDSFYKEHPHLSFNHVIHSLSFGPEIAGNPGPLNGR--AMEVPNGHSHFFSYFAKVV 286
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF----------YDLSPIKVTF 337
P Y ++G +S +FS T H R GR P F +++SP+KV
Sbjct: 287 PIRYETLAGTITESAEFSATAHDRPVHGGRDADHPNTVHFRGGMAGMTINFEMSPLKVIQ 346
Query: 338 TEEHVS-FLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
E++ S + F+ N +GGV V ++D YH QR + K
Sbjct: 347 REQYASTWTAFVLNAITSIGGVLAVGTVLDRVTYHTQRTLMGK 389
>gi|150866674|ref|XP_001386342.2| hypothetical protein PICST_85013 [Scheffersomyces stipitis CBS
6054]
gi|149387930|gb|ABN68313.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 407
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 140/400 (35%), Positives = 214/400 (53%), Gaps = 29/400 (7%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ + DA+ K ED RT SGG+ITL V++ L +E Y + +T +L+VD
Sbjct: 7 KLLTFDAFAKTVEDARIRTTSGGIITLFCIFVVMFLIRNEYSDYTSVITRPELVVDRDIN 66
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RL--DSQGNVIESRQDGI 123
+ L I DV+F LPC +LS+D MD +G+ LD+ F+K R+ DS+ +I+ I
Sbjct: 67 KPLDIYLDVSFHNLPCDLLSLDIMDEAGDLQLDILKSGFEKFRIVKDSEEEIIDRESTPI 126
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPD 181
A + + + G E + CGSCYGA D+ CCN+CE V+ AY +K W + +
Sbjct: 127 NADLSIEEMAK--GLKEGEDGECGSCYGALPQDKKQYCCNDCETVKLAYAEKLWGFYDGE 184
Query: 182 LIDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
I+QC+ EG++QR++ EGC I G +N+++G FAPG SF SG HVHD+ +
Sbjct: 185 NIEQCENEGYVQRVQSRINGKEGCRIKGNARINRISGTMDFAPGASFTSSGHHVHDLSLY 244
Query: 240 QRDS-FNISHKINKLAFG----EHFPGV--VNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
+ N H +NKL FG E P +PLD + ++ Y++KVV T +
Sbjct: 245 DKHPHLNFDHIVNKLTFGPIPDESVPTAESTHPLDNYGVALNDKNHVFTYYLKVVATRFE 304
Query: 293 DVSGHT--IQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEE 340
++G + + +NQFSV H R G+ +PGV F +D+SP+K+ E+
Sbjct: 305 FLNGASKALDANQFSVITHDRPISGGKDNDHQHTLHAKGGIPGVVFHFDISPLKIINREQ 364
Query: 341 HV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
+ S+ F+ V + V GV V ++D +Y + AIK K
Sbjct: 365 YAKSWSGFVLGVVSSVAGVLIVGSLLDRSVYAAESAIKGK 404
>gi|403215743|emb|CCK70242.1| hypothetical protein KNAG_0D05030 [Kazachstania naganishii CBS
8797]
Length = 422
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 135/418 (32%), Positives = 214/418 (51%), Gaps = 50/418 (11%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
+LDA+ K E+ RT GG+I+L+ + ++L + E + T+ L++D L
Sbjct: 10 ALDAFSKTEEEARVRTSGGGLISLLCVVSAVVLLWREWAQFRAVTTDPMLVIDRDHELPL 69
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKI 128
++ D+TFPA+PC++L +D MD SG LDV D F K R+D GN++ G A +
Sbjct: 70 KLTLDITFPAMPCALLGLDIMDESGNVQLDVLFDQFTKTRVDVNGNMV-----GGSASEP 124
Query: 129 DKPLQRHGGR-----LEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKG 174
KP G R L+ + YCGSCYG+++ + + CC C++V +AY + G
Sbjct: 125 YKPNSLSGKRAGAKDLQMDADYCGSCYGSKNQENNAELPPEQRICCQTCDDVHDAYLEAG 184
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV--- 231
WA + I+QC+ EG+++RI+E+ EGCN+ G +N++ GN HFAPGK + Q
Sbjct: 185 WAFFDGANIEQCESEGYVKRIQEQLHEGCNVKGTALLNRIQGNLHFAPGKPYQQLAAGMP 244
Query: 232 -----HVHDILAFQRDS-FNISHKINKLAFGEHFPGVV--------NPLDGVRWTQETPS 277
H HD+ ++R+ N++H IN+ FGE + PL+ + E P
Sbjct: 245 GQGLGHYHDVSLYERNRHMNLNHVINEFRFGEDPQSEIVAQKIQRSAPLEDTVASLENPH 304
Query: 278 -GMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGR----LQTL------PGVF 325
++ Y+ VVPT Y + + + + Q+S T H R GR TL PGV+
Sbjct: 305 YYIFNYYTNVVPTRYEFLGASKPLDTAQYSATYHDRPIMGGRDADHPTTLHGRGGTPGVY 364
Query: 326 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
F + SP+K+ E + L N +GG+ V + D +Y QR+I K ++
Sbjct: 365 FNLEFSPLKIINRERRPQQWSTLLLNWITTIGGILAVGTVTDKVVYKAQRSIGAKKQL 422
>gi|219110527|ref|XP_002177015.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411550|gb|EEC51478.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 500
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 136/427 (31%), Positives = 216/427 (50%), Gaps = 73/427 (17%)
Query: 8 IRSLD-AYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYL--NAVTETKLLVDTS 64
++ LD +PK++ ++ +T GG+ +LV+ +++ +L +E +L N T + VDTS
Sbjct: 76 VKKLDFLFPKVDTEYTVQTDRGGLASLVAYLLIAVLALAETASWLSHNRDTVDHVRVDTS 135
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD-----SQGNVIESR 119
G+ +R+N ++TFP+L C L VD MD++G+ L+++ + K+++D Q +++S
Sbjct: 136 LGQRMRVNLNITFPSLACDDLHVDVMDVAGDSQLNIEDTLTKRKMDRTGRYGQAEILQSN 195
Query: 120 QDGIGAPKIDKPLQRHGGRLEHN---ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWA 176
Q + Q +L + +TYCG CYGA+ + CCNNC+ + +AY+ KGW
Sbjct: 196 QH--------EQEQSRKAKLRQDPLPDTYCGPCYGAQPDVDACCNNCDALLDAYKLKGW- 246
Query: 177 LSNPDLI----DQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG 230
DL+ +QC REG Q+ +GEGCN+ GF+ +N+VAGNFH A G+ + G
Sbjct: 247 --RTDLVLYTAEQCIREGRDQKKLRPLIQGEGCNLSGFMSLNRVAGNFHIAMGEGLQRDG 304
Query: 231 VHVHDILAFQRDSFNISHKINKLAFGEHFPGVV-------NPLDGVR---WTQETPSGMY 280
H+H + +N SH I+ L+FG G + L+GV + +G++
Sbjct: 305 RHIHVFDPEDSEHYNASHVIHHLSFGPEIQGKTKSGNLDSSSLNGVTKMVTPEHGTTGLF 364
Query: 281 QYFIKVVPTVYTDVSGH-----TIQSNQFSVTEHFRS------SEQG------------- 316
QYFIKVVPT Y G T ++N++ TE FR E+
Sbjct: 365 QYFIKVVPTTYLGPGGRRDESGTFETNRYFYTERFRPLMKEYLPEEAVAEDPKQAAVHAG 424
Query: 317 -----------RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
R LPGVFF Y++ P V V H L + A +GGVFT+ +
Sbjct: 425 GGHRTHDHHHVRNSVLPGVFFLYEIYPFAVEIHPVSVPLTHLLIRLMATIGGVFTIVRWV 484
Query: 366 DAFIYHG 372
D + G
Sbjct: 485 DTAVLEG 491
>gi|448086324|ref|XP_004196073.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
gi|359377495|emb|CCE85878.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
Length = 405
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 138/401 (34%), Positives = 219/401 (54%), Gaps = 27/401 (6%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ SLDA+ K ED +T SGG+ITLV +V+LLL +E Y + V +L+VD
Sbjct: 7 KLLSLDAFAKTVEDAKVKTASGGIITLVCVLVVLLLIRNEYSEYTSVVNRPELVVDRDVN 66
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA 125
L IN D+TFP LPC ++++D +D+SG+ DV F+K RL N E D
Sbjct: 67 RKLDINIDITFPYLPCDLVTLDILDVSGDTQADVLKSGFEKYRLIPSSN--EEVLDNAPV 124
Query: 126 PKIDKPLQ---RHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+ D L+ R+ + + +E CCN+CE VR AY ++ WA +
Sbjct: 125 LRNDLSLEDIARNPNKEGGGYCGSCYGALPQGDNEFCCNDCETVRVAYAERMWAFYDGAN 184
Query: 183 IDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
I+QC+ EG++ R+ + E+ EGC I G ++N+V+GN HFAPG + G H+HD+ ++
Sbjct: 185 IEQCENEGYVTRLNQRIEQKEGCRIKGTAQINRVSGNMHFAPGYAKTSPGRHIHDLSLYE 244
Query: 241 R--DSFNISHKINKLAFG----EHFPG--VVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
+ D F+ H IN L+FG + P +PLDG R S + Y++KVV T +
Sbjct: 245 KHFDKFSFDHVINHLSFGLDPAKEDPNHQSTHPLDGYRLILNDKSRVISYYLKVVATRFE 304
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV 342
++G ++++NQFS H R G+ + +PGVFF +D+SP+K+ E++
Sbjct: 305 FLNGSSMETNQFSAIPHHRPYRGGKDEDHRHTMHAKGGIPGVFFHFDISPMKIINKEQYA 364
Query: 343 -SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
++ F+ V + + GV TV ++D ++ ++ IK K +I
Sbjct: 365 KTWSGFVLGVISSIAGVLTVGAVLDRSVWAAEKVIKSKKDI 405
>gi|302823246|ref|XP_002993277.1| hypothetical protein SELMODRAFT_431378 [Selaginella moellendorffii]
gi|302825185|ref|XP_002994225.1| hypothetical protein SELMODRAFT_236933 [Selaginella moellendorffii]
gi|300137936|gb|EFJ04730.1| hypothetical protein SELMODRAFT_236933 [Selaginella moellendorffii]
gi|300138947|gb|EFJ05698.1| hypothetical protein SELMODRAFT_431378 [Selaginella moellendorffii]
Length = 333
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 128/374 (34%), Positives = 215/374 (57%), Gaps = 52/374 (13%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+++++A+ +E +T SG ++T+V ++L+LF E + YL+ ++ VDT+RG
Sbjct: 4 KMKNINAFAHADEHLTQKTVSGAILTIVGVSIILVLFAYEFKFYLSTNVVHQMSVDTTRG 63
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ L I+ ++TFP+LPC ILSVDA+D+SG+ +D+ +I+K RL G+++ G+
Sbjct: 64 QNLPIHINITFPSLPCQILSVDAIDMSGKHEVDLDTNIWKLRLHKDGHIL-------GSE 116
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ +++ EH A + ++ EE+R A + ++++
Sbjct: 117 YLSDLVEK-----EH----------AHDNLTGIFHSHEELRSAVK----------VVNEI 151
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-HDILAFQRDSFN 245
+ ++GEGC ++G L+V +VAGNFH S H + + H + N
Sbjct: 152 NK-------ALQDGEGCRVFGVLDVERVAGNFHI----SMHGMSLQIFHSV-----KEVN 195
Query: 246 ISHKINKLAFGEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
+SH IN L+FG +PG+ NPLD VR ++T +G ++YFIK+VPT Y ++G + +NQF
Sbjct: 196 VSHIINDLSFGPKYPGIHNPLDRTVRILRDT-AGTFKYFIKIVPTEYRYLNGGKLPTNQF 254
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
SV E++ ++ + + P V+F YDLSPI V EE SF H LT CAIVGG F+++G+
Sbjct: 255 SVGEYYLAARDDDI-SWPAVYFLYDLSPITVLIKEERRSFGHLLTRFCAIVGGTFSLTGM 313
Query: 365 IDAFIYHGQRAIKK 378
+D +IY +I +
Sbjct: 314 LDRWIYRLVESITR 327
>gi|195162746|ref|XP_002022215.1| GL25735 [Drosophila persimilis]
gi|194104176|gb|EDW26219.1| GL25735 [Drosophila persimilis]
Length = 313
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 123/283 (43%), Positives = 166/283 (58%), Gaps = 21/283 (7%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDAYP+ +DF RT G +T++S+ ++ LL F E Y+ +L VDT+RG
Sbjct: 7 LRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLSYMQPALNEELFVDTTRGH 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LRIN DVT L C+ +S+DAMD SG+ HL V HDIFK RLD +G P
Sbjct: 67 KLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDIFKHRLDLKGE-----------PL 115
Query: 128 IDKPLQRHGGRLEHNETY-CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ P++ N+ CGSCYGAE + CCN CE+V +AYR W + D I+QC
Sbjct: 116 KETPIKEIVAVSPPNKNVTCGSCYGAEHNATHCCNTCEDVLDAYRLHKWNVQ-VDKIEQC 174
Query: 187 KREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K G +R E+ EGC I G LEVN++AG+FHFAPGKSF H+HD FQ +
Sbjct: 175 K--GKYKRTDEDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVK 229
Query: 246 ISHKINKLAFGEHFP-GVVNPLDGVRW-TQETPSGMYQYFIKV 286
+SH IN L+FGE +PLDG+R ET + M+ +++K+
Sbjct: 230 LSHTINHLSFGEKIEFAKTHPLDGLRVDVAETKTEMFNHYLKI 272
>gi|291001965|ref|XP_002683549.1| predicted protein [Naegleria gruberi]
gi|284097178|gb|EFC50805.1| predicted protein [Naegleria gruberi]
Length = 391
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 133/388 (34%), Positives = 201/388 (51%), Gaps = 34/388 (8%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
IM IRS D Y K + +T GGV+++++ I+++ L S L YL+ L VD
Sbjct: 5 IMKSIRSFDLYSKTDSIATKKTSLGGVVSILALIIIIFLVGSALIRYLSINRRDTLSVDI 64
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
+ + I F+++FP L C L VD++D SG+ +DV H I K +DS G + +
Sbjct: 65 QVEDRVVIFFNISFPDLKCYDLHVDSVDASGDAAIDVAHHIHKVPVDSSGRITH-----L 119
Query: 124 GAPK------IDKPLQRHG-GRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWA 176
+PK + P ++ + H+ YCG+CY E +CCN C++V E Y++ G
Sbjct: 120 ESPKHKTKLGTEMPQDKYDPTKDPHSIMYCGTCY-VEQRRGECCNTCQDVMEVYKRNGLP 178
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV----H 232
+ ++QC + + GCNIYG L+V KV GNFHF PG+SF Q H
Sbjct: 179 APRVEDVEQCLFDA------SKNHPGCNIYGTLDVQKVNGNFHFLPGRSFSQEYETRVHH 232
Query: 233 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV---------RWTQETPSGMYQYF 283
+H+ D +N +H I+ L+FG P V PLD Q + +++YF
Sbjct: 233 IHEFNPILVDRYNSTHIIHSLSFGLRIPHVTYPLDETVGIIPKIEESDAQAPKTALFKYF 292
Query: 284 IKVVPTVYTDVS--GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEH 341
IK VPT Y S TI + QFS T+H + ++ LPGVFF Y+ PI++T+ E
Sbjct: 293 IKAVPTTYIGSSYFSSTINTYQFSFTKHVMPFDSSKMMMLPGVFFVYNFEPIRITYEENG 352
Query: 342 VSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
+ F HF+ ++ A+ G+F V IDA +
Sbjct: 353 MPFTHFIVDLMAVCAGIFVVLNYIDALL 380
>gi|353242343|emb|CCA73995.1| related to ERV46-component of copii vesicles [Piriformospora indica
DSM 11827]
Length = 420
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/404 (32%), Positives = 207/404 (51%), Gaps = 46/404 (11%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
+++DA+ K ED RT +G +T +S ++ LL E Y +T + + +R E
Sbjct: 12 KAIDAFGKTLEDVKIRTRTGAFLTFLSIGIICLLTLIEFIDYRTVYLDTNIEIMKARDER 71
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI 128
L +N ++TFP +PC +LS+DA D+SGE +V H+I K RLDS+G + QD I +
Sbjct: 72 LTVNMNITFPRVPCFLLSLDATDVSGEHMREVSHNIVKVRLDSEGKPYPN-QDHISDLRN 130
Query: 129 DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKR 188
+ + G+ YCGSCYG + CCN CE+VR++Y +GWA S P+ I+QC R
Sbjct: 131 EISRVKDIGK----PGYCGSCYGGLEPEGGCCNTCEDVRKSYLDRGWAFSAPEHIEQCVR 186
Query: 189 EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NI 246
EG+ ++IK + +GC I G + + KVA + F+ G+SF + H +++ + +D +
Sbjct: 187 EGWTEKIKVQANDGCQISGRVRIKKVASSLIFSFGRSFQANSFHAQELVPYLKDGLIHDF 246
Query: 247 SHKINKLAF---GEHFPGVVN--------------PLDGV---------RWTQETPSGMY 280
H I L F E+ P N PL+G R + + M+
Sbjct: 247 GHHIETLQFQSDDEYDPRRANEAARLKKHLGVPKDPLNGFNSHYAKYSGRRGPDITTYMF 306
Query: 281 QYFIKVVPTVYTDVSGHTI---------QSNQFSVTEHFRSSE----QGRLQTLPGVFFF 327
QYFIKVV + + + + H +++E PG+F
Sbjct: 307 QYFIKVVSADFETLDHEHVSSHLYSYSSHTRNVGEAYHLKNTEGIETTHGYDAAPGLFIN 366
Query: 328 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 371
D+SP++V TE+ F HFLT CAI+GGV TV+ ++D+ +++
Sbjct: 367 IDVSPMQVIHTEKRKPFAHFLTTFCAIIGGVLTVASLVDSALFN 410
>gi|340055752|emb|CCC50073.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 404
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 132/408 (32%), Positives = 206/408 (50%), Gaps = 32/408 (7%)
Query: 5 MNKIRSLDAY----PKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M++IR D + P + E RT GG+++ + +++ L EL YL+ V ++
Sbjct: 1 MHRIRRFDMFSRFDPALEEAGRERTTCGGLLSFLFILLVALFIKIELYRYLSVVELREMY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD G + I ++TFP + C +++VD + GE I K R+ +Q S
Sbjct: 61 VDPHVGGDMHITINITFPHIHCDLMAVDVIGPFGEYMTGAVRSITKVRVPTQDPAPVSE- 119
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
P+ D+ + + + C SCYGAE S DCCN+C++V A+R+ GW +
Sbjct: 120 ---ALPQSDRSVSTAALPVSNKMGGCVSCYGAEESPGDCCNSCDDVHAAFRRNGWEIDEN 176
Query: 181 DL-IDQCKREGFLQRIKE-EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
D+ + QC EG L + EGCNI+ V K+ GN HF PG+ + G ++ +
Sbjct: 177 DIKLSQCT-EGQLHNVGPVSPSEGCNIHSKFSVRKIKGNIHFVPGRRLNHRGQPMYVVRR 235
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLD------GVRWTQETPSGMYQYFIKVVPTVYT 292
N+SH + L FGE FPG VNPL+ GVR E SG + Y+++V+PT Y
Sbjct: 236 EAIKKMNLSHVFHSLEFGERFPGQVNPLNGIANARGVRNASEVVSGRFSYYVQVLPTEYQ 295
Query: 293 DV----SGHTIQSNQFSVTEHFRSSEQGRLQTLP---------GVFFFYDLSPIK--VTF 337
V S +++NQ+SV +HF S + P GVF YD+SP+K V
Sbjct: 296 FVPALGSRVRLETNQYSVKQHFTESWYTTDRRYPGWSDPTLVAGVFIVYDVSPVKTLVMR 355
Query: 338 TEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
T + S +H L +CA+ GG FTV+ +ID+ + + ++K+ K+
Sbjct: 356 TSPYPSLIHLLLRMCAVGGGAFTVASMIDSLLLNILGHFRRKMRETKY 403
>gi|241955457|ref|XP_002420449.1| COPII-coated vesicle complex subunit, putative; ER-derived vesicle
protein, putative [Candida dubliniensis CD36]
gi|223643791|emb|CAX41527.1| COPII-coated vesicle complex subunit, putative [Candida
dubliniensis CD36]
Length = 414
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 139/409 (33%), Positives = 222/409 (54%), Gaps = 33/409 (8%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ S DA+ K ED +T SGG+ITL+ ++ L+L +E Y +T +L+VD
Sbjct: 6 KLLSFDAFAKTVEDARIKTTSGGIITLICILITLVLIRNEYVDYTTIITRPELVVDRDIN 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RL--DSQGNVIESR-QDG 122
+ L IN D++F LPC ++S+D +D++G+ L++ KK RL + QG+VI + +D
Sbjct: 66 KQLDINLDISFINLPCDLISIDLLDVTGDLSLNIIDSGLKKIRLLKNKQGDVIVNEIEDD 125
Query: 123 IGAPKIDKPLQRHGGRL---EHNETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWAL 177
A D L L YCGSCYGA D+ CCN+C VR AY +K W+
Sbjct: 126 EPAFNNDIELTDLAKGLPEGSDENAYCGSCYGALPQDKKQFCCNDCNTVRRAYAEKHWSF 185
Query: 178 SNPDLIDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
+ + I+QC++EG++ R++E EGC I G ++N+V+G FAPG SF + G H HD
Sbjct: 186 YDGENIEQCEKEGYVARLRERINNNEGCRIKGTTKINRVSGTMDFAPGASFTREGRHFHD 245
Query: 236 ILAFQR--DSFNISHKINKLAFGE--------HFPGVVNPLDGVRWTQETPSGMYQYFIK 285
+ + + D FN H IN L+FGE ++PLD ++ + + Y++K
Sbjct: 246 LSLYTKYEDKFNFDHIINHLSFGEMPVDGQADQLFDSIHPLDDHQFMLHKKAHLVSYYLK 305
Query: 286 VVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIK 334
VV T + + + I +NQFSV H R G+ + +PGV F +D+SP+K
Sbjct: 306 VVATRFESLDYKNRIDTNQFSVITHDRPLRGGKDEDHQHTLHARGGIPGVNFNFDISPLK 365
Query: 335 VTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
+ +++ ++ F+ V + + GV V ++D ++ Q+AIK K +I
Sbjct: 366 IINRQQYAKTWSGFVLGVISSIAGVLMVGTLLDRSVFAAQQAIKGKKDI 414
>gi|229594330|ref|XP_001024169.3| hypothetical protein TTHERM_00455630 [Tetrahymena thermophila]
gi|225566928|gb|EAS03924.3| hypothetical protein TTHERM_00455630 [Tetrahymena thermophila
SB210]
Length = 348
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 133/400 (33%), Positives = 200/400 (50%), Gaps = 72/400 (18%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +K++S D Y K+ D T SG ++++VS+++ML+LF SE YL+ +++ VD
Sbjct: 5 GVQSKLKSFDMYRKLPSDLTEPTLSGAIVSIVSTLIMLILFISEFNGYLSVEENSEMFVD 64
Query: 63 TSRG-ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
++G + +R+N D+ FP PC I S+D DI G ++V+ D+ K RL S G +E
Sbjct: 65 VAQGGQKIRVNLDIDFPQFPCDIFSLDVQDIMGSHSVNVEGDLVKTRLSSTGTYLE---- 120
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+++ N G D + E V++A+ +
Sbjct: 121 ----------------KIKQNTGGDHGHGGHGHGHGDVSLDLERVKKAFNDR-------- 156
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
EGC I GF+ VNKV GNFH S H G ++ I R
Sbjct: 157 -------------------EGCKISGFMLVNKVPGNFHI----SSHAYGNYLQRIFQDAR 193
Query: 242 -DSFNISHKINKLAFGEHF----------PGVVNPLDGVRWTQ----ETPSGMYQYFIKV 286
++ ++SH IN L+FGE G++ PLD + + T +QY+I V
Sbjct: 194 INTLDLSHVINHLSFGEENDLNRIKKTFQQGILQPLDHTKKIKPENLRTVGVTHQYYINV 253
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
VPT Y D+S + ++ V + +S + Q LP VFF YDLSP+ V F++ SFLH
Sbjct: 254 VPTTYKDLS-----NRKYHVYQFVANSNEMTTQHLPAVFFRYDLSPVTVQFSQTRESFLH 308
Query: 347 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
FL VCAI+GGVFTV+GIID+ ++ I KK E+GK S
Sbjct: 309 FLVQVCAIIGGVFTVAGIIDSIVHRSVVHILKKAEMGKLS 348
>gi|225680824|gb|EEH19108.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides brasiliensis Pb03]
Length = 413
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 144/439 (32%), Positives = 203/439 (46%), Gaps = 94/439 (21%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A ++ LDA+ K ED RT SGG++T+V+ V+ L + E Y V +L+VD
Sbjct: 2 APKSRFARLDAFTKTVEDARIRTRSGGLVTIVALFVISFLIWGEWYEYRRIVVLPELVVD 61
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESR 119
R D MD+SGE V H I K RL + G+VI++
Sbjct: 62 KGR----------------------DVMDVSGEMQSGVIHGISKVRLAPESEGGHVIDT- 98
Query: 120 QDGIGAPKIDKPLQRHGGRLEH-NETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKG 174
L +H + YCG CYGA ++ CC+ CEEVREAY +
Sbjct: 99 --------TALVLHTQTDAAKHLDPDYCGPCYGAPPPSHATKPGCCSTCEEVREAYASQS 150
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
WA + ++QC+REG+ + + + EGC I G L VNKV GNFH APG+SF +H H
Sbjct: 151 WAFGRGENVEQCEREGYSKNLDAQRNEGCRIEGVLRVNKVVGNFHIAPGRSFSSGNIHAH 210
Query: 235 DILAFQRDSF--NISHKINKLAFGEHFPGVV------------NPLDGVRWTQETPSGMY 280
D+ + ++SHKI++L FG + NPLD P +
Sbjct: 211 DLDTYYHTPVPHHMSHKIHQLRFGPQLSDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNF 270
Query: 281 QYFIKVVPTVYTDV----------------------------SGHTIQSNQFSVTEHFRS 312
YF+KVV T Y + S +I+++Q+SVT H RS
Sbjct: 271 MYFVKVVSTSYLPLGWSPEFSSSVHETTLGNTPLGKQGVHFGSSGSIETHQYSVTSHKRS 330
Query: 313 SEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 359
+ G RL + +PGVF YD+SP+KV E +F FLT VCA++GG
Sbjct: 331 IDGGDDAAEGHKERLHSHGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 390
Query: 360 TVSGIIDAFIYHGQRAIKK 378
TV+ +D +Y G +KK
Sbjct: 391 TVAAAVDRALYEGAARVKK 409
>gi|68483709|ref|XP_714213.1| hypothetical protein CaO19.586 [Candida albicans SC5314]
gi|68483794|ref|XP_714172.1| hypothetical protein CaO19.8218 [Candida albicans SC5314]
gi|46435713|gb|EAK95089.1| hypothetical protein CaO19.8218 [Candida albicans SC5314]
gi|46435761|gb|EAK95136.1| hypothetical protein CaO19.586 [Candida albicans SC5314]
gi|238882494|gb|EEQ46132.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 414
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 139/409 (33%), Positives = 222/409 (54%), Gaps = 33/409 (8%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ S DA+ K ED +T SGG+ITL+ ++ L+L +E Y +T +L+VD
Sbjct: 6 KLLSFDAFAKTVEDARIKTTSGGIITLICILITLVLIRNEYVDYTTIITRPELVVDRDIN 65
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK-RL--DSQGNVIESR-QDG 122
+ L IN D++F LPC ++S+D +D++G+ L++ KK RL + QG+VI + +D
Sbjct: 66 KQLDINLDISFINLPCDLISIDLLDVTGDLSLNIIDSGLKKIRLLKNKQGDVIVNEIEDD 125
Query: 123 IGAPKIDKPLQRHGGRL---EHNETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWAL 177
A D L L YCGSCYGA D+ CCN+C VR AY +K W+
Sbjct: 126 EPAFNNDIELSDLAKGLPEGSDENAYCGSCYGALPQDKKQFCCNDCNTVRRAYAEKHWSF 185
Query: 178 SNPDLIDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
+ + I+QC++EG++ R++E EGC I G ++N+V+G FAPG SF + G H HD
Sbjct: 186 YDGENIEQCEKEGYVGRLRERINNNEGCRIKGTTKINRVSGTMDFAPGASFTREGRHFHD 245
Query: 236 ILAFQR--DSFNISHKINKLAFGE--------HFPGVVNPLDGVRWTQETPSGMYQYFIK 285
+ + + D FN H IN L+FGE ++PLD ++ + + Y++K
Sbjct: 246 LSLYTKYPDKFNFDHIINHLSFGEMPVDGQADELFDSIHPLDDHQFMLHKKAHLVSYYLK 305
Query: 286 VVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIK 334
VV T + + + I +NQFSV H R G+ + +PGV F +D+SP+K
Sbjct: 306 VVATRFESLDYKNRIDTNQFSVITHDRPLVGGKDEDHQHTLHARGGIPGVNFNFDISPLK 365
Query: 335 VTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
+ +++ ++ F+ V + + GV V ++D ++ Q+AIK K +I
Sbjct: 366 IINRQQYAKTWSGFVLGVISSIAGVLMVGTLLDRSVFAAQQAIKGKKDI 414
>gi|412992535|emb|CCO18515.1| predicted protein [Bathycoccus prasinos]
Length = 428
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 145/407 (35%), Positives = 210/407 (51%), Gaps = 37/407 (9%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
++ I SLDAYPK+ ED+ + G ITL+ + L LFFSE R +L + E++L VDT
Sbjct: 28 VVKAIASLDAYPKVKEDYARGSTLGAAITLICFLACLCLFFSEYRTHLVSKIESELDVDT 87
Query: 64 -------SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHD--IFKKRLDSQGN 114
S E L + DVTF +L C ++++D++D +GE H DV HD I K+RLD G
Sbjct: 88 MGVNKFESNAERLHVYVDVTFHSLACELITLDSLDAAGEVHHDV-HDGHITKRRLDRDGK 146
Query: 115 VIESR----QDGIGAPKIDKPLQRHGGRL-----EHNETYCGSCYGAESSDEDCCNNCEE 165
I R +D + + +H +L + E + E +E
Sbjct: 147 PIPRRDSSAKDDVAVTREKPNKHKHIEKLVREKEKEEEGKKNEGEQEQEQQEQNHEQHDE 206
Query: 166 VREAYRKKGWA------LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFH 219
R + A LI + G + K + EGC + G+LEVN+V G+F
Sbjct: 207 KRRKLQNTALAGFGGGFFDINALIHEQFPNGLEEAFKNKNKEGCEVMGYLEVNRVPGSFS 266
Query: 220 FAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD-GVRWTQETPSG 278
+PGKS H+ + N+SH IN+LAFGE FPG +N LD R+ P+
Sbjct: 267 ISPGKSLQIGMSHIQLNVV---SHLNMSHTINRLAFGEAFPGALNLLDKNTRYL--PPNA 321
Query: 279 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ-----GRLQTLPGVFFFYDLSPI 333
++QYF+KVVPT + + T+ +NQ+SVTE S++Q G G++F Y+LSPI
Sbjct: 322 VHQYFLKVVPTSFARLKDTTLATNQYSVTESSSSAKQSFFGMGSSGKPSGIYFHYELSPI 381
Query: 334 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ-RAIKKK 379
++ F E SF F+ +VC+I+GGV T SGI+ I Q RA KK
Sbjct: 382 RIDFKERRNSFGEFMLSVCSIIGGVATSSGILHKLIVFIQTRARSKK 428
>gi|367004394|ref|XP_003686930.1| hypothetical protein TPHA_0H02930 [Tetrapisispora phaffii CBS 4417]
gi|357525232|emb|CCE64496.1| hypothetical protein TPHA_0H02930 [Tetrapisispora phaffii CBS 4417]
Length = 439
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 132/430 (30%), Positives = 218/430 (50%), Gaps = 63/430 (14%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+ + + D + K+ ED RT +GG+ITL+ V LL SE + +++ +L++D
Sbjct: 8 DNLLAYDVFTKVEEDIRIRTRTGGLITLICIGVTFLLLISEWFQFKKVISKPELVIDRDY 67
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDV-----KHDIFKKRLDSQGNVIESRQ 120
L +N DVTFP +PC +L++D +D SG LD+ + K RL+++G VI +
Sbjct: 68 QSKLELNIDVTFPYIPCDLLNLDILDDSGNVQLDIDLEEASSNFVKTRLNNRGEVIGKAK 127
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAES----------SDEDCCNNCEEVREAY 170
KI L + E E YCGSCYG++ +D+ CCN+CE+VR+AY
Sbjct: 128 ----KFKITDDLGEYAP--EDKENYCGSCYGSKDQTKNEDIEKITDKVCCNSCEDVRQAY 181
Query: 171 RKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG 230
+ GWA + I+QC+REG+++ I E EGC + G +NK+ GN HFAPGK+F
Sbjct: 182 SEAGWAFFDGKNIEQCEREGYVKTINERLSEGCRVKGEALLNKIHGNLHFAPGKAFQNRR 241
Query: 231 VHVHDILAF-QRDSFNISHKINKLAFGEHFPGVVN----------------PLDGVRWTQ 273
H HD F Q + N H IN L+FG+ +V P+DG +
Sbjct: 242 GHFHDTSLFNQHKNLNFQHVINHLSFGKPIRQLVTSNFQDTMSDSLRAQTAPIDGHQAFI 301
Query: 274 ETPSG--------------MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR----SSEQ 315
+ +G + Y+ +++ T + + G +++Q +VT H++ + Q
Sbjct: 302 QDNTGDSDSASTTIAAHDYQFIYYAEIISTRFEYLKGDLEETSQLTVTSHYKKIGYQNGQ 361
Query: 316 GRLQTL------PGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAF 368
+Q + PG++ +++SP+KV E++ S+ +L +GG+ V +ID
Sbjct: 362 DYMQGMQSRSGIPGLYIDFEVSPLKVINKEQYSTSWSGYLLKTITSIGGILAVGTVIDKV 421
Query: 369 IYHGQRAIKK 378
+Y Q A+K+
Sbjct: 422 VYATQTALKQ 431
>gi|449016424|dbj|BAM79826.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 499
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 144/457 (31%), Positives = 222/457 (48%), Gaps = 79/457 (17%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR-- 65
+R LD YPK ED R+ +GG+I L S I + +L SE +L + +LVD
Sbjct: 42 LRKLDVYPKTVEDVRLRSVTGGIIALFSYICIGILVVSEFLRWLQPQLHSNVLVDARSIL 101
Query: 66 -GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-------- 116
E + ++ + A+ C S+DA+ +G Q + ++ K+ LD+ G +
Sbjct: 102 DTEPITVDLGIDLLAVGCDEFSLDALTANGAQLPNSVVELRKRPLDASGQPVIFPRGAFG 161
Query: 117 ----ESRQDGIG-APKI---DKP-LQRHGGRLEH-----------------------NET 144
+ + G+ AP+ D P Q+ GR+ N+T
Sbjct: 162 RSRLRNERGGVAPAPQALTEDPPNTQQLEGRVSQEVRAQLKQYREEAIAFRDRLAALNKT 221
Query: 145 ---YCGSCYGAESSDED-----------CCNNCEEVREAYRKKGWALSNP-DLIDQCKRE 189
YCGSCYGA + CCN C+E+R Y ++ WA +QC +
Sbjct: 222 GVAYCGSCYGAVPQTDQVGEANQITSGVCCNTCDEIRVLYEERNWAFDQVLRTAEQCAEK 281
Query: 190 GFLQRIKEE---EGEGCNIYGFLEVNKVAGNFHFAPGKSF-HQSGVHVHDIL-AFQRDSF 244
+L + E + GC + L++ +VAGNFHFAPGK H+ G HVH + ++
Sbjct: 282 RYLTLLHEAGRVQSGGCRVSARLQLPRVAGNFHFAPGKGHTHRMGHHVHSVDDQLLHRTY 341
Query: 245 NISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSG-----MYQYFIKVVPTVYTD--VSG 296
N SH+I L FG FP NPLDG +R ++ P G M Y+ K++PT Y G
Sbjct: 342 NFSHRIRHLRFGPLFPHQQNPLDGAMRILEQPPPGSPFGNMVLYYCKLIPTTYRRDRQRG 401
Query: 297 HTIQSNQFSVTEHFRSSEQGRLQ------TLPGVFFFYDLSPIKVTFTEEHV-SFLHFLT 349
++S +++ + +SSEQ R+ LPG+FFFY+ P+++ + E + LHF+
Sbjct: 402 DALRSMEYAAADLTQSSEQDRVGITHSTGALPGIFFFYEPQPLQIAYFEGRMYGLLHFIV 461
Query: 350 NVCAIVGGVFTVSGIIDAFIYHGQRAIK-KKIEIGKF 385
+CAIVGGVFTVS +ID F++ I+ +K +GK
Sbjct: 462 QLCAIVGGVFTVSSMIDRFVFGAGTFIRAQKRRLGKL 498
>gi|303290895|ref|XP_003064734.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453760|gb|EEH51068.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 363
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 139/384 (36%), Positives = 207/384 (53%), Gaps = 43/384 (11%)
Query: 4 IMNKIRSLDAYP--KINEDF-YSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
+ +R +D Y K+ EDF S + SGG+IT +++ +LF +E + V ++ L
Sbjct: 1 VAKTLRRMDVYSSSKVIEDFRQSSSMSGGIITCACALLCFVLFVNEYFYHRTPVVKSSLT 60
Query: 61 VD--------TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKR-LDS 111
VD ++ L + D+TF LPC I+++D MD +GE DV KKR LDS
Sbjct: 61 VDATGLDAKTSANSNRLHVEIDITFHQLPCDIINMDTMDQAGEAFHDVHSGHLKKRRLDS 120
Query: 112 QGNVIES--RQDGIGAPK-IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVRE 168
G +E + + A K I + ++ H L +E Y ++S+ED
Sbjct: 121 DGKPLEGVFKHEKANAHKEIREDIESHALALSGDEEY-------KTSEEDL--------- 164
Query: 169 AYRKKGWALSN-PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH 227
++G + N L+D+ G + K E EGC + G+LEVN+V G+F +PGKS
Sbjct: 165 -MPEEGLTMFNLKQLLDKQFPGGIEKAFKNEAREGCEVIGYLEVNRVPGSFSVSPGKSIR 223
Query: 228 QSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVV 287
HV L Q N+SH IN+ AFG+ FPG V+PLDG P+ ++QYF+K+V
Sbjct: 224 LGMEHVQ--LNVQ-SRLNMSHTINRFAFGKSFPGFVSPLDG-NARDLDPNYVHQYFLKIV 279
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTL----PGVFFFYDLSPIKVTFTEEHVS 343
PT +T + G +QSNQ+SVTE S+ L + GV+F YDLSP++V + E S
Sbjct: 280 PTSFTPLRGEYLQSNQYSVTE--ASAPAKALNVVGSKPSGVYFNYDLSPLRVDYVESRNS 337
Query: 344 FLHFLTNVCAIVGGVFTVSGIIDA 367
F+T+VCAIVGGV ++SG++ A
Sbjct: 338 MTEFITSVCAIVGGVASMSGLVQA 361
>gi|145476255|ref|XP_001424150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391213|emb|CAK56752.1| unnamed protein product [Paramecium tetraurelia]
Length = 339
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 132/401 (32%), Positives = 202/401 (50%), Gaps = 82/401 (20%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +++R LD Y K+ D T +G +I+++S+IV+++LF +EL+ Y+ +++ VD
Sbjct: 4 GVQSRLRKLDIYRKLPADLTEPTTAGALISVISTIVIVILFITELQAYIEVDNSSEMFVD 63
Query: 63 TSRG-ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
+RG E +R+N D+ F PC ILS+D DI G ++V+ + KKR+ + G VI
Sbjct: 64 INRGGEQIRVNLDIEFHKFPCDILSLDVQDIMGSHVVNVEGRLIKKRIKN-GKVIS---- 118
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
++ H G HN+ + + +A+++K
Sbjct: 119 -------EEVHSNHEGHEHHNQPSI---------------DFARIEQAFKEK-------- 148
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
EGC I G++ VNKV GNFH S H G +H + FQR
Sbjct: 149 -------------------EGCQIAGYIIVNKVPGNFHV----SAHAFGGILHQV--FQR 183
Query: 242 ---DSFNISHKINKLAFGEH----------FPGVVNPLDGVRWTQETPSG---MYQYFIK 285
+ ++SH IN ++FGE GV+NPLD + + G M+QY+I
Sbjct: 184 SQIQTLDLSHTINHISFGEEDDLMKIKKQFQKGVLNPLDNTKKVAQPQGGTGMMFQYYIS 243
Query: 286 VVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFL 345
VVPT Y DVSG N++ V + +S + LP +F YDLSP+ V F + SFL
Sbjct: 244 VVPTTYVDVSG-----NEYYVHQFTANSNEVLTDHLPAAYFRYDLSPVTVKFLQYRESFL 298
Query: 346 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
HFL +CAI+GGVFT++ I+D I+ A+ KK E+GK S
Sbjct: 299 HFLVQICAILGGVFTIASIVDGMIHKSVVALLKKYEMGKLS 339
>gi|384253563|gb|EIE27037.1| DUF1692-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 327
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 128/384 (33%), Positives = 195/384 (50%), Gaps = 77/384 (20%)
Query: 7 KIRSLD---AYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
K++S + AY + RT+ G ++T++ I+ ++LF +ELR Y + + VDT
Sbjct: 2 KLKSFNRFSAYARAESHLVQRTYFGAIVTVLGVILAIVLFANELREYTTPFSIQTMSVDT 61
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKH----DIFKKRLDSQGNVIESR 119
SR +R+NF+ T+P++PC +LS+DA D+SGE+ D H +I K RL+ G
Sbjct: 62 SRAHYIRMNFNFTYPSMPCQVLSLDATDMSGEKSGDSGHAANGEIHKVRLNEAG------ 115
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
+ IG + P R+ G+ +
Sbjct: 116 -EKIGLGEYIPP---------------------------------------RRWGFMMGK 135
Query: 180 PDLIDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
P R+ + + + + EGCNI+G+L++ +VAGNF + VHV D
Sbjct: 136 P-------RQQEVMEVNQAMDAHEGCNIFGWLDLQRVAGNFRVS---------VHVEDFF 179
Query: 238 AFQR-----DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
A R N SH I++++FG FPG VNPLDG + SG ++YF+KVVPT Y
Sbjct: 180 ALTRLQADTTGINSSHIIHRVSFGPTFPGQVNPLDGAERILDKESGTFKYFLKVVPTEYQ 239
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+G +NQ+SVTE+ +G +Q +P V+F YD+SPI VT +E SF H L C
Sbjct: 240 WSAGTRTTTNQYSVTEYDTVVHKGEMQ-MPSVWFSYDISPISVTISEIRKSFAHLLVRFC 298
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAI 376
A+VGGVF V+G+ D +++ AI
Sbjct: 299 AVVGGVFAVTGMFDRWVHRIVTAI 322
>gi|384501765|gb|EIE92256.1| hypothetical protein RO3G_17063 [Rhizopus delemar RA 99-880]
Length = 291
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 109/260 (41%), Positives = 153/260 (58%), Gaps = 23/260 (8%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
++ +R D Y K ++F +T SG + SEL Y +V + L+VD
Sbjct: 6 SLFRNLRQFDGYAKTLDEFRIKTTSGASV------------LSELMTYNTSVWKPSLVVD 53
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
SR E + I+F++TFP +PC +LS+D MD SGEQ D+ K RLD+ GN+IES
Sbjct: 54 KSRKEKMPIDFNITFPNMPCHMLSIDIMDESGEQSSGYSQDVTKIRLDTLGNIIESGH-- 111
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDED-CCNNCEEVREAYRKKGWALSNPD 181
K+ LE CGSCYGA+ ED CC++C++VREAY K+GW L N
Sbjct: 112 --TVKLGDHTNDAKKALEE-APECGSCYGAKPLREDGCCHSCQDVREAYVKQGWGLVNTK 168
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
I+QC REG+L +++ + EGCN++G L VNKV GNFHFAPG +F +HVHD+ + +
Sbjct: 169 EIEQCIREGWLAKLENQSNEGCNVHGHLLVNKVRGNFHFAPGGAFQAGSMHVHDLQEYTQ 228
Query: 242 D-----SFNISHKINKLAFG 256
SF++SH+I+KL FG
Sbjct: 229 GAPNGHSFDMSHRIHKLKFG 248
>gi|397564627|gb|EJK44287.1| hypothetical protein THAOC_37187 [Thalassiosira oceanica]
Length = 506
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 136/440 (30%), Positives = 212/440 (48%), Gaps = 77/440 (17%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLY--LNAVTETKLLVDTSR 65
+R LD + KI D RT GG +T ++ML+L +E + +N + ++VDTS
Sbjct: 58 VRKLDFFNKIEVDHIVRTERGGQLTAAGYVIMLILILAEYLTWSGMNGESIEHVVVDTSL 117
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGI 123
G+ +++N ++TFP+L C L ++ +D++G+ L+V +FK+RLD G +
Sbjct: 118 GKRMKVNLNITFPSLHCEDLHLNIIDVAGDSQLEVSDKMFKQRLDLDGTPRPLAKISAEA 177
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN-PDL 182
A ++ +R YCG CYGA+ + +DCCN C++V E Y+KK W + L
Sbjct: 178 NAKALEDKKRREVVEKSVGPDYCGPCYGAQENAQDCCNTCDDVIERYKKKRWNDNAVQPL 237
Query: 183 IDQCKREG---FLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
+QC REG + + GEGCN+ G VN+VAGNFH A G+ + G H+H L
Sbjct: 238 AEQCIREGRAGVSEPKRMAGGEGCNLSGHFTVNRVAGNFHIAMGEGVERDGRHIHQFLPE 297
Query: 240 QRDSFNISHKINKLAF---------GEHFPGVVNP--LDGVRWTQET---------PSGM 279
R +F +H I++L+F GE F +++ ++G R + +G+
Sbjct: 298 DRVNFIANHVIHELSFLDDEYGDIEGEGFLNLMSKAGVNGERSMNGSVKTVTEETGTTGL 357
Query: 280 YQYFIKVVPTVY-------------TDVSGHTIQSNQFSVTEHFRS-------------- 312
+QYFIKVVPT Y +D +++N++ TE FR
Sbjct: 358 FQYFIKVVPTKYKGDIIDDMGVSTLSDGQEKQLETNRYFYTERFRPLIGDIDEEALLAGD 417
Query: 313 ----------SEQGRLQ------------TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 350
S+ G Q LPGVFF Y++ P V + V F+H
Sbjct: 418 VEKGTAGAHVSKAGGTQHQQAEHHAATNAVLPGVFFVYEIYPFMVEVSRNRVPFMHLWIR 477
Query: 351 VCAIVGGVFTVSGIIDAFIY 370
+ A VGGVFT+ ID ++
Sbjct: 478 IMATVGGVFTMMSWIDGALH 497
>gi|209876426|ref|XP_002139655.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555261|gb|EEA05306.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 395
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 119/379 (31%), Positives = 207/379 (54%), Gaps = 39/379 (10%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D+I+ ++ +D Y K+++D+ +++ SG +++++ I++++L E Y+ T + V
Sbjct: 28 DSILKSVKYIDIYGKVHDDYCAKSTSGSIMSILVYILVIILTIGEFLKYIGGETVEHIGV 87
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D + + L I D++FP+L CS +SVD +D GE ++ ++ K +D GN + Q+
Sbjct: 88 DDNMNQKLDIRLDISFPSLRCSEISVDTVDNVGENQVNAHGNLLKIPIDIHGNEV---QE 144
Query: 122 GIGAPKIDKPLQRHGGRLEHNETY---CGSCYGAESSDEDCCNNCEEVREAYRKKGWA-L 177
I A ++NE+ C SC+GAES CCN CE ++ A+R KGW+ L
Sbjct: 145 EIMA--------------QYNESTSMKCLSCFGAESIHYKCCNTCESLKSAFRYKGWSYL 190
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI- 236
QC GC ++G L+VNKV+GN H A G++ + G HVH+
Sbjct: 191 DIASKAPQCINT-----------VGCRLHGSLQVNKVSGNIHVALGQATVRDGKHVHEFN 239
Query: 237 LAFQRDSFNISHKINKLAFG-EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ FN SH I++L FG ++ + +PL+ + T + M+ Y++K+VPT + S
Sbjct: 240 MNDISRGFNTSHTIHELRFGKDNIEFIGSPLENTKKIVTTGTSMFHYYLKLVPTQFIK-S 298
Query: 296 GHT--IQSNQFSVTEHFRS--SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
G++ + SNQ++ TE + + G L LPGVF YD P + + HFLT+
Sbjct: 299 GYSKVLFSNQYTYTERQKDVLVKDGELSGLPGVFIVYDFQPFVIRKIHNSIPTTHFLTSF 358
Query: 352 CAIVGGVFTVSGIIDAFIY 370
CAI+GG++++ ++D+ ++
Sbjct: 359 CAIIGGIYSLMSLVDSILF 377
>gi|66363024|ref|XP_628478.1| ER vesicle protein; Erv41p, transmembrane region near C terminus
and possible N region transmembrane [Cryptosporidium
parvum Iowa II]
gi|46229502|gb|EAK90320.1| ER vesicle protein; Erv41p, transmembrane region near C terminus
and possible N region transmembrane [Cryptosporidium
parvum Iowa II]
Length = 397
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 201/373 (53%), Gaps = 30/373 (8%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+A+ K++ +D Y KI+ED+ ++ S +I+L+ I++ L +E+ Y + V
Sbjct: 28 EALQTKVKKIDIYGKIHEDYCVKSTSRSIISLLVYIIVFFLTLNEIFKYFKGEMIDNIGV 87
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D + L I D+TFP L C +SVD++D GE +D K + K +D G + +
Sbjct: 88 DNTINNKLDIMLDITFPRLRCEEISVDSVDYVGENQVDSKEYMVKIPIDLNGQEVRNI-- 145
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
K Q++ ++E C SCYGAE+++ CCN+C+ ++ AYR KGW S D
Sbjct: 146 --------KYNQQNDLKIE-----CMSCYGAETNEFLCCNDCDSLKTAYRSKGW--SYLD 190
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI-LAFQ 240
++ + Q I E GC I G ++VNKV+GN H A G + ++G HVH+ +
Sbjct: 191 IVSKAP-----QCI---EKVGCRINGRMQVNKVSGNIHVALGTATIKNGKHVHEFNMNDV 242
Query: 241 RDSFNISHKINKLAFG-EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT- 298
FN SH I++L FG + P + +PL+ ++ + M+ Y++K++PT Y +G
Sbjct: 243 SRGFNTSHIIHELRFGSDKIPFLFSPLENIQKFVHKGTKMFHYYVKLIPTQYFSGNGEVN 302
Query: 299 IQSNQFSVTEHFRS--SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+ NQ++ TE R + G L LPG+F YD P + + V H +T+ CAIVG
Sbjct: 303 LYGNQYAFTERERDVHVQNGELSGLPGIFIVYDFQPFLLQKIYKRVPISHLITSFCAIVG 362
Query: 357 GVFTVSGIIDAFI 369
G++++ ++D F+
Sbjct: 363 GIYSIMSLLDTFV 375
>gi|67623967|ref|XP_668266.1| serologically defined breast cancer antigen 84 [Cryptosporidium
hominis TU502]
gi|54659454|gb|EAL38030.1| serologically defined breast cancer antigen 84 [Cryptosporidium
hominis]
Length = 397
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 122/373 (32%), Positives = 201/373 (53%), Gaps = 30/373 (8%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+A+ K++ +D Y KI+ED+ ++ S +I+L+ I++ L +E+ Y + V
Sbjct: 28 EALQTKVKKIDIYGKIHEDYCVKSTSRSIISLLVYIIVFFLTLNEIFKYFKGEMIDNIGV 87
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D + L I D+TFP L C +SVD++D GE +D K + K +D G + +
Sbjct: 88 DNTINNKLDIMLDITFPRLRCEEISVDSVDYVGENQVDSKEYMAKIPIDLNGQEVRNI-- 145
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
K Q++ ++E C SCYGAE+++ CCN+C+ ++ AYR KGW S D
Sbjct: 146 --------KYNQQNDLKIE-----CMSCYGAETNEFLCCNDCDSLKTAYRSKGW--SYLD 190
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI-LAFQ 240
++ + Q I E GC I G ++VNKV+GN H A G + ++G HVH+ +
Sbjct: 191 IVSKAP-----QCI---EKVGCRINGRMQVNKVSGNIHVALGTATIKNGKHVHEFNMNDV 242
Query: 241 RDSFNISHKINKLAFG-EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT- 298
FN SH I++L FG + P + +PL+ ++ + M+ Y++K++PT Y +G
Sbjct: 243 SRGFNTSHIIHELRFGSDRIPFLFSPLENIQKFVHKGTKMFHYYVKLIPTQYFSGNGEVN 302
Query: 299 IQSNQFSVTEHFRS--SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+ NQ++ TE R + G L LPGVF YD P + + V H +T+ CAIVG
Sbjct: 303 LYGNQYAFTERERDVHVQNGELSGLPGVFIVYDFQPFLLQKIYKRVPISHLITSFCAIVG 362
Query: 357 GVFTVSGIIDAFI 369
G++++ ++D F+
Sbjct: 363 GIYSIMSLLDTFV 375
>gi|145351005|ref|XP_001419879.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580112|gb|ABO98172.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 373
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 127/375 (33%), Positives = 204/375 (54%), Gaps = 34/375 (9%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD- 62
+ N +++LDA PK+ ED+ S + SG + TLV + + L+LFF E Y ++L V+
Sbjct: 1 MTNILKALDANPKLKEDYVSESTSGVITTLVCAALCLILFFGEFFSYKTTKIVSELRVNP 60
Query: 63 ------TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHD--IFKKRLDSQGN 114
E L+I+ D+TF +L C+++++D D +GE+H DV HD I K+R+D G
Sbjct: 61 LGVHQTVPNAERLKIDVDITFHSLACNLITLDTSDKAGEEHYDV-HDGHIEKRRIDKHGK 119
Query: 115 VIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 174
VI++ + K +K + + NET S + A+S E + G
Sbjct: 120 VIDA---AFTSEKPNKHKEIEQALQKMNET--DSAHAADS----------HAMEHVQPFG 164
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
L+ + EG + E EGC + G+LEVN+V G F +PG+S +
Sbjct: 165 GMFGLQSLLQEVFPEGVEHAFRNENQEGCEVKGYLEVNRVPGRFSISPGRSLMMG---MQ 221
Query: 235 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV 294
+ + + N++H I++L+FGE FPG+V+PLDG + P+ + QYF+ VV T + +
Sbjct: 222 MVKLNVQTALNLTHTIHRLSFGESFPGLVSPLDGTHRSLP-PNAVQQYFLNVVSTTFEPL 280
Query: 295 -SGHTIQSNQFSVTEHFRSSEQGRLQTL----PGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
I ++Q+SVTE F SS++ + T PGV F Y++SPI+V F E SF F+
Sbjct: 281 GENKIISTHQYSVTETFTSSQRSIMGTSNGRDPGVIFTYEISPIRVDFKETRTSFGAFVL 340
Query: 350 NVCAIVGGVFTVSGI 364
+C+++GGV T++GI
Sbjct: 341 GICSVIGGVVTMAGI 355
>gi|428171090|gb|EKX40010.1| hypothetical protein GUITHDRAFT_154283 [Guillardia theta CCMP2712]
Length = 331
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 129/361 (35%), Positives = 192/361 (53%), Gaps = 65/361 (18%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ D +PK +D + SGG +++V M LL F+E ++L T+ ++ VDT RG
Sbjct: 15 LKNFDVFPKTVDDAKEASVSGGTVSVVVLFFMFLLLFTETSIFLKTNTKFEMEVDTMRGG 74
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L+INFD++FP LPCS+LS+D+MD+SGE LD+ HD++K+ ++S+ + +G P
Sbjct: 75 MLQINFDISFPGLPCSVLSLDSMDVSGEHELDIVHDVYKR-------AMDSKGNALG-PV 126
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
I E+V+ A ALS + +Q +
Sbjct: 127 I----------------------------------SEKVKLARD----ALSISHIKEQLE 148
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
R EGCNIYG L KV+GNFH S H HV + R + N S
Sbjct: 149 RH-----------EGCNIYGTLNAQKVSGNFHL----SLHAQDFHVLAQVFPDRATVNTS 193
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
H +N L+FG +PG+ NPLDG + SG ++Y+IK+VPT + + G I +NQ+SVT
Sbjct: 194 HIVNHLSFGRDYPGLKNPLDGEMKVLDQGSGTFEYYIKIVPTKFHHLDGTIIDTNQYSVT 253
Query: 308 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
+HFR + G P V+F YD+SPI V + SF H+ T +CAI GG++ V+G + A
Sbjct: 254 DHFRKLQDG----FPAVYFIYDISPIMVRVKQWKQSFSHYATQLCAITGGMYVVTGQLHA 309
Query: 368 F 368
Sbjct: 310 L 310
>gi|410083920|ref|XP_003959537.1| hypothetical protein KAFR_0K00470 [Kazachstania africana CBS 2517]
gi|372466129|emb|CCF60402.1| hypothetical protein KAFR_0K00470 [Kazachstania africana CBS 2517]
Length = 417
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 132/419 (31%), Positives = 209/419 (49%), Gaps = 53/419 (12%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+K+ DA+ K ED RT +GG+I++ ++ L E + +T KL+VD
Sbjct: 4 SKLLVFDAFNKTEEDVRVRTNTGGLISIGCVVLTCFLLLREWYQFNEIITRPKLVVDRDH 63
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDV-KHDIFKKRLDSQGNVIESRQDGIG 124
L +NFD+TFP++ C +L++D +D +G+ LD+ + + K R+DS G + + IG
Sbjct: 64 DLELDLNFDITFPSISCDLLTLDILDDAGDLQLDLLESGLTKTRVDSNGVSLTTESFNIG 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGA---------ESSDEDCCNNCEEVREAYRKKGW 175
+ K + + YCGSCYGA ++++ CC CE+V +AY GW
Sbjct: 124 NEALIKR--------DFPQDYCGSCYGALDQGKNDELNANEKVCCQTCEDVHDAYLNIGW 175
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS------ 229
A + I+QC+ EG++ RI E EGC + G +N+V GN HFAPGKS+
Sbjct: 176 AFYDGKNIEQCETEGYVDRINEHLNEGCRVQGSARLNRVQGNIHFAPGKSYQDYSRRNSF 235
Query: 230 GVHVHDILAFQRD-SFNISHKINKLAFGE---------HFPGV----VNPLDGVRWTQET 275
H HD + + S + +H I+ +FG+ H G+ NPLDG + +
Sbjct: 236 ATHFHDTSLYDKTHSLSFNHIIHHFSFGKPIENSYVNNHNEGLSKISTNPLDGRKVFPDR 295
Query: 276 PSGM--YQYFIKVVPTVYTDVSGHT--IQSNQFSVTEHFRSSEQGRLQT----------L 321
S Y YF ++VPT Y ++ + +++ QFS T H R GR + +
Sbjct: 296 DSHFIQYSYFAEIVPTRYEYLNNKSDPVETTQFSATFHSRPLRGGRDEDHPTTLHQRGGI 355
Query: 322 PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
PG+F +++ SP+KV E++ ++ FL N +GG+ V D Y QR I K
Sbjct: 356 PGLFIYFETSPLKVINKEQYSQAWSTFLLNCITTIGGILAVGTSFDKITYKAQRTIWGK 414
>gi|323306137|gb|EGA59869.1| Erv46p [Saccharomyces cerevisiae FostersB]
Length = 349
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 121/334 (36%), Positives = 174/334 (52%), Gaps = 48/334 (14%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
SLDA+ K ED RT +GG+ITL + L L +E + + VT +L+VD R L
Sbjct: 8 SLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWXQFNSVVTRPQLVVDRDRHAKL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-KRLDSQG----NVIESRQDGIG 124
+N DVTFP++PC ++++D MD SGE LD+ F RL+S+G + E G G
Sbjct: 68 ELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGGNG 127
Query: 125 ---APKIDKPLQRHGGRLEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRK 172
AP + P YCG CYGA+ ++ CC +C+ VR AY +
Sbjct: 128 DGTAPVNNDP------------NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLE 175
Query: 173 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
GWA + I+QC+REG++ +I E EGC I G ++N++ GN HFAPGK + + H
Sbjct: 176 AGWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGH 235
Query: 233 VHDILAFQRDS-FNISHKINKLAFGE--------------HFPGVV--NPLDGVRWTQET 275
HD + + S N +H IN L+FG+ H VV +PLDG + +
Sbjct: 236 FHDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDR 295
Query: 276 PSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVT 307
+ +Q YF K+VPT Y + I++ QFS T
Sbjct: 296 NTHFHQFSYFAKIVPTRYEYLDNVVIETAQFSAT 329
>gi|410078101|ref|XP_003956632.1| hypothetical protein KAFR_0C05060 [Kazachstania africana CBS 2517]
gi|372463216|emb|CCF57497.1| hypothetical protein KAFR_0C05060 [Kazachstania africana CBS 2517]
Length = 414
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 127/406 (31%), Positives = 202/406 (49%), Gaps = 48/406 (11%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSE-LRLYLNAVTETKLLVDTSRGET 68
S+DA+ + +D RT SG +IT+ V ++L ++ L+ + T T L+VD R
Sbjct: 8 SIDAFSRAQDDIRIRTKSGAIITISCIAVTVILLINQWLQFQYSISTITNLVVDRERNLK 67
Query: 69 LRINFDVTFPALPCSILSVDAMDISG--EQHLDVKHDIFKK-RLD-SQGNVIESRQDGIG 124
L ++FD+TF LPC+++++D +D + + +D F K R+D S G I S + +
Sbjct: 68 LNLDFDITFTNLPCNLINIDILDDASFLQSIIDPDSSSFTKIRIDRSSGKPISSSEFNLN 127
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESS-----------DEDCCNNCEEVREAYRKK 173
+ P +E YCG CYGA+ D CC C +V+ +Y
Sbjct: 128 EKTYEYP--------PDDENYCGPCYGAKDQSINDKEGIKKEDRVCCQTCSDVKNSYLDA 179
Query: 174 GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF-LEVNKVAGNFHFAPGKSFHQSGVH 232
GWA + I+QC+REG++++I + EGC I G + +N+V GN HFAPG+++H H
Sbjct: 180 GWAFFDGKNIEQCEREGYIEKINSQLNEGCQIKGSNVLINRVNGNLHFAPGEAYHNPNGH 239
Query: 233 VHDILAFQ-RDSFNISHKINKLAFGE--------HFPGVVN-PLDGVRWTQETPSGMYQ- 281
HD + + N +H IN +FG H ++N PLDG + E S Y
Sbjct: 240 YHDTSFYDLKPQLNFNHIINHFSFGNGAVDRDATHDTTLMNSPLDGTQVLPEYDSHAYAF 299
Query: 282 -YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR----------LQTLPGVFFFYDL 330
YF K+V T Y + +++ QF+ H R G +PG+F ++D+
Sbjct: 300 TYFNKIVSTRYEYLERDPLETVQFTSMFHDRQINGGNDIHDEKIKHARGGIPGLFIYFDI 359
Query: 331 SPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 375
SP+K+ E+H V++ F+ N +GG+ V +ID Y QR
Sbjct: 360 SPMKIINKEQHTVNWSTFVLNCITSIGGILAVGTVIDKIFYKTQRT 405
>gi|212275606|ref|NP_001131002.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
gi|194690678|gb|ACF79423.1| unknown [Zea mays]
gi|413952089|gb|AFW84738.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 293
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 111/313 (35%), Positives = 176/313 (56%), Gaps = 41/313 (13%)
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD RGETL I+ +++FP+LPC +LSVDA+D+SG+ +D+ +I+K RLD G++I
Sbjct: 3 VDLKRGETLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHII---- 58
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G + +++ G ++ + ++ E++ ++ ++ AL N
Sbjct: 59 ---GTEYLSDLVEKGHGAHHDHDHDHDHHDEQKKHEQTFNEEAEKMIKSVKQ---ALGN- 111
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
GEGC +YG L+V +VAGNFH S H + V + +
Sbjct: 112 -------------------GEGCRVYGMLDVQRVAGNFHI----SVHGLNIFVAEKIFEG 148
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ N+SH I++L+FG +PG+ NPLD SG ++Y+IKVVPT Y +S +
Sbjct: 149 SNHVNVSHVIHELSFGPKYPGIHNPLDETSRILHDTSGTFKYYIKVVPTEYKYLSKKVLP 208
Query: 301 SNQFSVTEHF---RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+NQFSVTE+F R +++ P V+F YDLSPI VT EE +FLHF+T +CA++GG
Sbjct: 209 TNQFSVTEYFLPIRPTDRA----WPAVYFLYDLSPITVTIKEERRNFLHFVTRLCAVLGG 264
Query: 358 VFTVSGIIDAFIY 370
F ++G++D ++Y
Sbjct: 265 TFAMTGMLDRWMY 277
>gi|449445069|ref|XP_004140296.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 388
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 118/323 (36%), Positives = 182/323 (56%), Gaps = 47/323 (14%)
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD RGETL I+ ++TFP+LPC +LSVDA+D+SG+ +D+ +I+K R
Sbjct: 102 VDLKRGETLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLR------------ 149
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
L HG G+ Y ++ +++ ++ + +P
Sbjct: 150 -----------LNSHG-------QIIGTEYLSDLVEKEHVDHKHDHDHDK-----EKDHP 186
Query: 181 DL--IDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+ DQ E ++++K+ EE +GC +YG L+V +VAGNFH S H + V +
Sbjct: 187 HIHGFDQAA-ENLVKKVKQALEEAQGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQM 241
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ N+SH I+ L+FG +PG+ NPLDG VR ++T SG ++Y+IK+VPT Y +S
Sbjct: 242 IFGGSKHVNVSHMIHDLSFGPKYPGIHNPLDGTVRILRDT-SGTFKYYIKIVPTEYKYIS 300
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+ +NQFSVTE+F S ++ P V+F YDLSPI VT EE SFLHF+T +CA++
Sbjct: 301 KAVLPTNQFSVTEYF-SPMTDSDRSWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVL 359
Query: 356 GGVFTVSGIIDAFIYHGQRAIKK 378
GG F V+G++D +++ A+ K
Sbjct: 360 GGTFAVTGMLDRWMFRFLEALTK 382
>gi|328875761|gb|EGG24125.1| DUF1692 family protein [Dictyostelium fasciculatum]
Length = 1172
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 126/395 (31%), Positives = 198/395 (50%), Gaps = 42/395 (10%)
Query: 4 IMNKIRSLDAYPKINEDFY-SRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I+ K++ D YPK++E + +++ GG+ T++ IV + L SEL Y + + L VD
Sbjct: 806 ILEKLKLFDFYPKLDESVHQTKSIYGGIATVICIIVTVFLLTSELYYYTFPIRDHSLRVD 865
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMD-ISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
SRG + INFDV FP+L CS + V+++D + G+ D H I K+RL+ +G+
Sbjct: 866 VSRGNRMNINFDVHFPSLICSDIIVESVDGVDGKPIKDAAHQIVKERLNRRGS------- 918
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAES----SDEDCCNNCEEVREAYRKKGWAL 177
PL+R R C C CCN+CE++R YR
Sbjct: 919 ---------PLERLHARA--GLFSCTKCELPPKYQLLEKRKCCNSCEDLRTFYRTNKVPQ 967
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS----GVHV 233
D QC + E EGC ++G L V K+ G+ H G+ +S HV
Sbjct: 968 HLADESPQCTIGKPVT-----EDEGCRVFGILSVQKMKGDIHIIAGRPHEESHDGHSHHV 1022
Query: 234 HDI---LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTV 290
H + +A + FNISH I+K +FG+ G++NPL+G G+ Y+++VVPT+
Sbjct: 1023 HKLTPEIAQRIHKFNISHHIHKFSFGQDVEGLINPLEGFGIVVPMGLGLQTYYLQVVPTI 1082
Query: 291 YTDVSGHTIQSNQFSVTEHFRSSEQGRLQTL-PGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
Y + + +++NQ+S T ++S L L PG++F YDLSP+ + + F +T
Sbjct: 1083 YKQ-NNYILETNQYSYTREYKSINYNNLGYLFPGIYFKYDLSPLMIEVDQSSKPFSELIT 1141
Query: 350 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
++CAI GG++ G+ YH I KI+ K
Sbjct: 1142 SICAIGGGMYVAFGL----FYHVTARIVGKIKKQK 1172
>gi|340502903|gb|EGR29544.1| hypothetical protein IMG5_153610 [Ichthyophthirius multifiliis]
Length = 342
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 125/403 (31%), Positives = 193/403 (47%), Gaps = 86/403 (21%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ +K++S+D Y K+ D T SG +I++ SS++ML+LF SE YL+ +++ +D
Sbjct: 6 VKSKLKSIDMYRKLPTDLTESTVSGAMISIASSLIMLILFISEFNGYLSITETSEMYIDE 65
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
R + +RIN D+ +P LPC ++S+D D+ G + +GN+
Sbjct: 66 KRYDKIRINIDIDYPRLPCDVISLDVEDLKGTHSYQL-----------EGNI-------- 106
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
R+ + Y + + D+ N +E EA
Sbjct: 107 -----------QITRISNTNQY----FDTQKYDDSHSENNQEFSEAR------------- 138
Query: 184 DQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAP---GKSFHQSGVHVHDILA 238
L R+K + EGC I G + VNK GNFH + + HQ HV+
Sbjct: 139 --------LNRLKSAFLDQEGCKIQGHIFVNKAPGNFHVSAHSFDRILHQIASHVN---- 186
Query: 239 FQRDSFNISHKINKLAFGEHF-----------PGVVNPLDGVRWT----QETPSGMYQYF 283
+ ++SH IN ++FG+ G+++PLD R Q+ S YQY+
Sbjct: 187 --ISTIDVSHIINHISFGDETDIIRIKRQFKSQGILDPLDRTRKIKTEDQKNISISYQYY 244
Query: 284 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 343
I VV T Y + IQ ++SV + ++ + LP FF YDLSP+ V F++ +S
Sbjct: 245 INVVHTTYVN-----IQKKEYSVYQFTANNNELLSDRLPACFFRYDLSPVIVRFSQSRMS 299
Query: 344 FLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
FLHF+ VCAI+GGVFTV+GIID+ I+ I KK E+GK S
Sbjct: 300 FLHFIVQVCAIIGGVFTVAGIIDSIIHKSVVHILKKAEMGKLS 342
>gi|330803630|ref|XP_003289807.1| hypothetical protein DICPUDRAFT_80570 [Dictyostelium purpureum]
gi|325080118|gb|EGC33688.1| hypothetical protein DICPUDRAFT_80570 [Dictyostelium purpureum]
Length = 388
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 119/376 (31%), Positives = 198/376 (52%), Gaps = 39/376 (10%)
Query: 5 MNKIRSLDAYPKINEDF-YSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ K++ D YPK+++D ++ GGV+T+V ++ L SE+ + V E L VD
Sbjct: 33 LEKVKLFDFYPKVDDDVPRQKSTFGGVVTVVCLLITAYLLISEIYFFTFPVREHSLKVDV 92
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMD-ISGEQHLDVKHDIFKKRLDSQG-----NVIE 117
+RG L IN D+ FP L C+ +++D +D I G+ D + I K+RLDS+G V
Sbjct: 93 TRGNRLPINIDIHFPRLVCTDITIDVVDGIDGKPIKDAAYQIVKERLDSKGVPFAKGVAL 152
Query: 118 SRQDGIGAPKIDK---PLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 174
+ + GI + + + P Q+ G + + CCN+C+++RE YR
Sbjct: 153 AGKKGIFSSRCTECEFPKQKKGSSVFFRQK--------------CCNSCDDLREYYRLNR 198
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS----G 230
+ D QC E +Q + EGC IYG L+V K+ G+FH G S +S
Sbjct: 199 IPQNFADDAPQCLIERPIQ-----DDEGCRIYGSLQVQKMKGDFHILAGLSADESHDGHA 253
Query: 231 VHVHDILA---FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVV 287
HVH I + FNI+H I+K +FG+ G++NPL+G ++ + Y+I+VV
Sbjct: 254 HHVHRITKENIGRVTQFNITHHIHKFSFGDDIDGLINPLEGFGIVAQS-LAVQNYYIQVV 312
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-QTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
P +Y + + +++NQ+S T +R+ L + PG++F YD+SP+ + + +
Sbjct: 313 PAIYKK-NDYVLETNQYSYTYDYRNVNVFNLGRIFPGIYFKYDMSPLMIEVDQTSKPIVE 371
Query: 347 FLTNVCAIVGGVFTVS 362
+T++CAI GG+F +S
Sbjct: 372 LITSICAIGGGIFYIS 387
>gi|340505495|gb|EGR31815.1| hypothetical protein IMG5_101180 [Ichthyophthirius multifiliis]
Length = 327
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 120/387 (31%), Positives = 188/387 (48%), Gaps = 84/387 (21%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG- 66
I+S D Y K+ D T SG V++++ I++L+LF SELR +L +++ +D RG
Sbjct: 3 IKSFDMYRKLPSDLTQSTTSGAVVSIICGIIVLILFISELRSFLAIEETSEMFIDIVRGG 62
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ +++N D+ FP PC ILS+D DI G ++++ I K+R+ S GN + + G
Sbjct: 63 QKIKVNLDIDFPKFPCDILSLDMQDIMGSHTVNIEGTINKRRISSDGNYFDLLKAGA--- 119
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+D N + +AY K
Sbjct: 120 ------------------------------DDSEFNLQRATQAYMDK------------- 136
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ-RDSFN 245
EGCNI G + VNKV GNFH S H G + +L+ +++ +
Sbjct: 137 --------------EGCNISGTMLVNKVPGNFHI----SSHAYGHVLGQVLSNAGKNTID 178
Query: 246 ISHKINKLAFGEHF----------PGVVNPLDGVRW--TQETPSGM-YQYFIKVVPTVYT 292
+SHK+ L+FG+ F G+++P+D + Q +G+ YQY+I +VPT Y
Sbjct: 179 LSHKVKHLSFGDEFDLKNIKRQFSQGLLHPMDNKQKDKPQNILNGITYQYYINIVPTTYV 238
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
D QF+ + S+EQ LP V++ YDLSP+ V F+ + SFLHFL +C
Sbjct: 239 DTGNKNYHVYQFT----YNSNEQIN-NHLPTVYYRYDLSPVTVKFSMQKESFLHFLVQIC 293
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKK 379
AI+GG+FTV+ I+D+ +Y I K+
Sbjct: 294 AIIGGIFTVASIVDSIVYRAVLNILKR 320
>gi|430811512|emb|CCJ31046.1| unnamed protein product [Pneumocystis jirovecii]
Length = 264
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 114/266 (42%), Positives = 150/266 (56%), Gaps = 19/266 (7%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
R DA+ K ED +T +GG+IT++S I++ +L E Y V +L +D +R E
Sbjct: 8 RRFDAFSKTIEDAQIKTTNGGLITIISIIIIFILVSFEWHDYRRVVVLPELTIDRTRSEK 67
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI 128
L+IN ++TFP +PCSILS+D MD+SGE DV H++ K RLD G I S I
Sbjct: 68 LQINLNLTFPKIPCSILSLDIMDVSGELQTDVSHNVVKNRLDKNGIFINSTS--INTLNF 125
Query: 129 DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKR 188
+P++ YCGSCYGA+ E CCN CE+V AY W + N +QCK
Sbjct: 126 QQPIKVLPS------DYCGSCYGAK---EGCCNTCEDVINAYIANNWPIPNKRTFEQCKD 176
Query: 189 EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQ-SGVHVHDILAFQRDSF--N 245
+ + EGCN G +EVNKV GNFHFAPG S +G HVHDI + DS +
Sbjct: 177 SNNM----DGPDEGCNFVGRIEVNKVIGNFHFAPGHSSQTITGGHVHDIYDYLTDSLPHD 232
Query: 246 ISHKINKLAFGEHFPGVV-NPLDGVR 270
SH INKL+FG G + NPLD V+
Sbjct: 233 FSHMINKLSFGPEIEGSLQNPLDNVK 258
>gi|145551751|ref|XP_001461552.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124429387|emb|CAK94179.1| unnamed protein product [Paramecium tetraurelia]
Length = 317
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 115/379 (30%), Positives = 191/379 (50%), Gaps = 87/379 (22%)
Query: 12 DAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRI 71
D Y K+ +D + SG +I+ S I+M +LF +E + YL +T++ +D ++ +TL +
Sbjct: 5 DLYRKLPQDLIEPSKSGALISFTSLILMFILFITEFQEYLTQQVQTEMYIDQNKDDTLLV 64
Query: 72 NFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKP 131
N D++FP +PC +S+D D+ G +VK ++ KKR+ G VI++
Sbjct: 65 NMDISFPNMPCDFISIDQQDVIGTHQQNVKGELLKKRI-LNGRVIDTY------------ 111
Query: 132 LQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGF 191
L +NET N E ++AY +K
Sbjct: 112 -------LSNNETL----------------NLERAQKAYDQK------------------ 130
Query: 192 LQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF-QRDSFNISHKI 250
EGC + G++ +++V GNFH S H G V+ +L F + + ++SH I
Sbjct: 131 ---------EGCEMTGYIIISRVPGNFHI----SAHSYGGQVNIVLPFVEMSTIDLSHTI 177
Query: 251 NKLAFG---------EHFP-GVVNPLDGVRW--TQETPSG--MYQYFIKVVPTVYTDVSG 296
L+FG E F G++NPLDG+ TQE + +QY+I +VPT+Y D+
Sbjct: 178 KHLSFGNQNDIQKIREKFQQGLLNPLDGISRIKTQELKNVGVTHQYYISIVPTIYVDIDN 237
Query: 297 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
NQF+ ++ + + ++P ++F YD+SP+ V FT+ + +F HF+ +CAI+G
Sbjct: 238 REYFVNQFTA-----NTNEAQTNSMPAIYFRYDISPVTVQFTKYYETFNHFIVQLCAILG 292
Query: 357 GVFTVSGIIDAFIYHGQRA 375
GVFT++GIID+ Y Q+
Sbjct: 293 GVFTIAGIIDSVFYALQKT 311
>gi|123451578|ref|XP_001313964.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121895945|gb|EAY01112.1| hypothetical protein TVAG_442240 [Trichomonas vaginalis G3]
Length = 375
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 122/388 (31%), Positives = 195/388 (50%), Gaps = 23/388 (5%)
Query: 5 MNKIRSLDAYPKINE-DFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
MN ++ D +PK + D +T G +++L++ +M +LF EL ++ + VD+
Sbjct: 1 MNSLKKFDIFPKYTDPDVKVKTNGGAILSLIAMTLMSILFLHELYRFIFPRIYEDIAVDS 60
Query: 64 SR---GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
SR T+ INF+++ +PC L + A D G +DI ++R+D G I
Sbjct: 61 SRVSLARTMNINFNISI-QVPCGKLFISAYDAEGNAQSTDVNDIKQQRIDENGFAI---- 115
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D + ++ + + + E + YCG CYGA + CCN+CE+V A++ KGW +
Sbjct: 116 DSVNWIRLKRAAKSKKQKKEQPQQYCGKCYGALPQGK-CCNSCEDVINAFKAKGWGIDGI 174
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D QC EG+ KE CN+YG + V ++G +FA + + H DI
Sbjct: 175 DRWQQCIDEGYADLGKES----CNVYGDINVAHISGFLYFAL-EDYKVGDKHPKDISRLS 229
Query: 241 RDSFNISHKINKLAFG---EHFPGVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSG 296
+N++H IN L FG H PG PLDG+ QE P M Y Y ++VVPT + G
Sbjct: 230 H-KYNLTHTINYLEFGPRVSHEPG---PLDGLTVLQEEPGLMQYNYDLEVVPTKWFSSRG 285
Query: 297 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+ + +F ++ + + +PG+F Y+L+PI + E S +T+VCAIVG
Sbjct: 286 FPVSTYKFHPMITQKNFTEKVNRGVPGIFLNYNLAPISLVQYEVISSPWKLITSVCAIVG 345
Query: 357 GVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
G FT + D + +I+ K +IGK
Sbjct: 346 GCFTCVSLADQIFFRTLSSIEGKRQIGK 373
>gi|302853436|ref|XP_002958233.1| hypothetical protein VOLCADRAFT_69193 [Volvox carteri f.
nagariensis]
gi|300256421|gb|EFJ40687.1| hypothetical protein VOLCADRAFT_69193 [Volvox carteri f.
nagariensis]
Length = 337
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 183/371 (49%), Gaps = 56/371 (15%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ SL AY K +T G ++TL ++ +LF EL + T++ VD +R
Sbjct: 4 KLSSLSAYVKPEAHLVQQTVHGALVTLCGILLAAMLFVHELGSFYRQHRVTQMSVDLARR 63
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKH----DIFKKRLDSQGNVIESRQDG 122
L IN D+TFPA+PC++LS+D +DI+G D + I K RLD G
Sbjct: 64 NALTINIDLTFPAIPCAVLSIDVLDIAGTAENDASYAHHMHIHKLRLDGAG--------- 114
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
KP+ G+ E++ +++ N +E + L
Sbjct: 115 -------KPI----GKAEYHTPQSQQIMDT-GAEQLVSVNIQEAMQ------------HL 150
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV--HVHDILAFQ 240
+D + + E EGC++YG ++V +VAG HF S HQ+ V + +L
Sbjct: 151 VDMEE--------EAEHHEGCHVYGTMDVKRVAGRLHF----SVHQNMVFQMLPQLLGAH 198
Query: 241 R--DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
R NISH I L FG H+PG +NPLDG + P ++YF+KVVPT Y + G
Sbjct: 199 RIPKVANISHTIKHLGFGPHYPGQLNPLDGYVRMVKGPPQSFKYFLKVVPTEYYNRLGRV 258
Query: 299 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+++Q+SVTE+ + E G + TL YDLSPI +T E S LHF+ +CA+VGG
Sbjct: 259 TETHQYSVTEYTQPLEPGYVPTLD---VHYDLSPIVMTINERPPSLLHFVVRLCAVVGGA 315
Query: 359 FTVSGIIDAFI 369
F ++ + D ++
Sbjct: 316 FAITRMTDRWV 326
>gi|308808274|ref|XP_003081447.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
gi|116059910|emb|CAL55969.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
Length = 406
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 126/385 (32%), Positives = 205/385 (53%), Gaps = 46/385 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD----- 62
++SLDA PK+ ED+ ++ SG +ITLV + LLLF E Y ++L V+
Sbjct: 34 LKSLDANPKLKEDYARQSTSGVIITLVCGALCLLLFLGEFFAYRTTKVVSELRVNPMGVH 93
Query: 63 --TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHD--IFKKRLDSQGNVIES 118
T E L+I+ D+TF ++ C+++++D D +GEQH DV HD I K+R+D G I++
Sbjct: 94 SVTPNAERLKIDIDITFHSMACNLITLDTSDKAGEQHYDV-HDGHIEKRRVDKDGKPIDA 152
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
P K + + ++ ++ G+ + D A+R G
Sbjct: 153 TFTS-EKPNKHKEMVQALEKMNQTDSVVGNETALQKQDR-----------AHRFAG-VFG 199
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGK----SFHQSGVHVH 234
++ + EG + E EGC + G+LEVN+V G +PG+ Q ++VH
Sbjct: 200 FESMLKEAFPEGIENAFRNEAREGCEVKGYLEVNRVPGRISISPGRVVMMGMQQFKLNVH 259
Query: 235 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV 294
L N++H I++L+FGE FPG+V+PLDG + P+ + QYF+ VV T + +
Sbjct: 260 TDL-------NLTHTIHRLSFGERFPGLVSPLDGTHRSLP-PNAVQQYFLNVVATTFQPL 311
Query: 295 SGHT-IQSNQFSVTEHFRSSEQ-------GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
G I ++Q+SVTE F +S++ GR PGVFF Y++ PI+V F E +F
Sbjct: 312 RGDARISTHQYSVTETFTTSQRSLGGSSNGRD---PGVFFTYEIEPIRVDFKETRTTFGA 368
Query: 347 FLTNVCAIVGGVFTVSGIIDAFIYH 371
F+ +C+I+GGV T++G++ + + H
Sbjct: 369 FIIGICSIIGGVVTMAGVVQSAVEH 393
>gi|449476586|ref|XP_004154778.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 140
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 96/130 (73%), Positives = 114/130 (87%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDAI NK+R+LDAYPKINEDFY RTFSGG+ITL SS ML LFFSELR+YL+A TET+L+
Sbjct: 1 MDAIFNKLRNLDAYPKINEDFYRRTFSGGLITLASSFFMLFLFFSELRMYLHAKTETQLV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VDTSRG L INFD++FPA+PCSILS+DA+DISGEQHLD++H+I KKR+D G VIE+R
Sbjct: 61 VDTSRGGELHINFDLSFPAIPCSILSLDAIDISGEQHLDIRHNIIKKRIDHLGTVIEARP 120
Query: 121 DGIGAPKIDK 130
DGIGAPK+ K
Sbjct: 121 DGIGAPKVSK 130
>gi|66813156|ref|XP_640757.1| DUF1692 family protein [Dictyostelium discoideum AX4]
gi|60468793|gb|EAL66793.1| DUF1692 family protein [Dictyostelium discoideum AX4]
Length = 421
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 124/379 (32%), Positives = 192/379 (50%), Gaps = 33/379 (8%)
Query: 2 DAIMNKIRSLDAYPKINEDF--YSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKL 59
D+ + K++ D YPK+N+D + TF GGV T++ ++ L SE+ Y + E L
Sbjct: 48 DSWVEKVKLFDFYPKVNDDVPRHKSTF-GGVATMICILITTYLLVSEIYFYTFPIREHSL 106
Query: 60 LVDTSRGETLRINFDVTFPALPCSILSVDAMD-ISGEQHLDVKHDIFKKRLDSQGNVIES 118
VD +RG L IN D+ FP L C+ +++D +D I G D + I K+RLDS G E
Sbjct: 107 KVDITRGNRLPINIDIHFPRLVCTDITIDVVDGIDGNPIKDAAYQIVKQRLDSYG---EP 163
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSD----EDCCNNCEEVREAYRKKG 174
G+ L G + T C S + CCN+CE++R+ YR
Sbjct: 164 FAQGVA-------LAGKKGIFSRSCTECEFPKSKRVSSVFYKQKCCNSCEDLRQYYRLNR 216
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS-GVHV 233
+ D QC E +Q + EGC IYG L V K+ G+FH G QS HV
Sbjct: 217 IPQNLADDSPQCLIERPVQ-----DDEGCRIYGSLSVQKMKGDFHILAGTGIDQSHDGHV 271
Query: 234 HDILAFQRDS------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVV 287
H R++ FNI+H I+K +FGE G++NPL+ ++ + Y+++VV
Sbjct: 272 HHAHHIPRENIGRIKHFNITHHIHKFSFGEDIEGLINPLEDFGIVAQS-LAVQTYYLQVV 330
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-QTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
P +Y + +++NQ+S T +R L Q PG++F YDLSP+ + + +
Sbjct: 331 PAIYKK-NDFVLETNQYSYTYDYRIVNMFNLGQLFPGIYFKYDLSPLMIEVDQTSKPLVE 389
Query: 347 FLTNVCAIVGGVFTVSGII 365
+T++CAI GG++ V G++
Sbjct: 390 LITSICAIGGGMYVVLGLV 408
>gi|412991249|emb|CCO16094.1| predicted protein [Bathycoccus prasinos]
Length = 409
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 125/425 (29%), Positives = 194/425 (45%), Gaps = 103/425 (24%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ SLDAY KI + RT SG +++L+ +M +L SE+ Y+ ++ VD ++ E
Sbjct: 17 LSSLDAYKKIEDHLMVRTTSGAIVSLLGIALMCILGASEILNYITPPVVKQMAVDGTQNE 76
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+ + D+TFP +PCS+LSVDA D SG+ DV+ ++ K+RL+ G + S D G
Sbjct: 77 LMTVRMDITFPRVPCSVLSVDAYDQSGKNDQDVRGELHKERLNKDGKSLGS-YDKAGGGV 135
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
D+ ++ + + G G + + + EV+ A KK
Sbjct: 136 TDE----EDALIQDLQQFFGG--GMKVVFQKRAEHSREVKHAVEKK-------------- 175
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
EGC +YG + V +V GNFH + +++ H + + NIS
Sbjct: 176 -------------EGCRLYGRMHVQRVGGNFHISAHAEEYETLQHAFGAV----NKINIS 218
Query: 248 HKINKLAFGEHFPGVVNPLDGV-------------------------------------- 269
H I L+FG +PG+VNPLDGV
Sbjct: 219 HTITHLSFGAGYPGLVNPLDGVARSGSDDEFHYDESSKDSRSSDRKNIEKEKEEEEKRKK 278
Query: 270 ------------RWTQETPSGMYQYFIKVVPTVYTDVSG---------HTIQSNQFSVTE 308
W E SG+Y+YF+K+VPT Y ++ +NQ+SVTE
Sbjct: 279 KEQVRRSRLMDLTW-DENGSGVYKYFLKLVPTFYRTHRSVFLGLFSWTKSVSTNQYSVTE 337
Query: 309 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT----VSGI 364
+FR ++ +LP V+F YD SPI VT + F++FLT +CA+ GGVF +S +
Sbjct: 338 YFRKTDAWS-GSLPAVYFLYDFSPIAVTIDTKRPHFVYFLTRLCAVCGGVFAFAHMISNL 396
Query: 365 IDAFI 369
+DA +
Sbjct: 397 VDALL 401
>gi|71409973|ref|XP_807304.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871276|gb|EAN85453.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 393
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 130/389 (33%), Positives = 203/389 (52%), Gaps = 45/389 (11%)
Query: 4 IMNKIRSLDAYPKINEDF-YSRTFSGGVITLVSSIVMLLLFFSELRLYL---NAVTETKL 59
++ K+ ++D +PK ED+ S+T+ G +++LV+ +V+ LL F E+ Y+ +A T T+L
Sbjct: 20 LLKKVAAVDLFPKPKEDYSRSQTYRGALVSLVTVVVIGLLVFWEVYSYIFGRDAYT-TEL 78
Query: 60 LVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN--VIE 117
VDTS + + N D+TFP +PC +S+D +D++G +L+V +IFK +D+QGN I
Sbjct: 79 SVDTSLSKEVEFNLDITFPRVPCHEVSLDVLDVTGTVNLNVTRNIFKTPVDAQGNFAFIG 138
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAE------SSDEDCCNNCEEVREAYR 171
+RQ G+G + ++ +CG C+ +E + CCN C +V AY
Sbjct: 139 TRQ-GVGE---YGSFREQSKDDPNSPQFCGRCFISEHQLSMSENKNRCCNTCNDVLNAYD 194
Query: 172 KKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
++G + ++QC + L RI GCN G L V K G FAP + G
Sbjct: 195 QQGLPRPQKNEVEQCIYD--LSRIN----PGCNYKGTLIVKKFGGRLVFAPKRV--PGGF 246
Query: 232 HVHDILAFQRDSFNISHKINKLAFGEH------FPGVVNPLDGVRWTQETPSGMYQYFIK 285
+ D++ F DS SH INKL+ G+ GV +PL+G + + +YF+K
Sbjct: 247 LIRDVMQF--DS---SHIINKLSIGDERVTRFSRRGVQHPLNGHEFDTQRRFTEIRYFLK 301
Query: 286 VVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTL-----PGVFFFYDLSPIKVTFTEE 340
VVPT+Y +SG S F+ T + RL + P V +D P++V
Sbjct: 302 VVPTMY--LSGK--NSASFNATYEYSVQWSHRLTPIGFGHFPSVSLGFDFHPMQVNNYFR 357
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
SF HFL +C IVGG+F V G+ID +
Sbjct: 358 RSSFPHFLVQLCGIVGGLFVVLGLIDGLV 386
>gi|385302035|gb|EIF46185.1| erv46p [Dekkera bruxellensis AWRI1499]
Length = 266
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/262 (35%), Positives = 146/262 (55%), Gaps = 17/262 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I DA+ K ++ +T SGG++TL+ S + +L +E R Y + +L+VD +
Sbjct: 7 IFRFDAFAKTLDEAKVKTTSGGILTLICSFTIFILLINEYRDYRTLIMRPELVVDRDHDK 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDV-KHDIFKKRLDSQGNVIESRQDGIGAP 126
TL +N D+TFP +PC +LS+D MD++G+ D+ + + + RLD G I + +
Sbjct: 67 TLGLNLDITFPNMPCDLLSMDIMDLTGDVQADILEGNFLRTRLDRDGKEIATDE----PF 122
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGA--ESSDED--------CCNNCEEVREAYRKKGWA 176
K++K + YCGSCYGA +S +E CCN+CE V+ AY K W
Sbjct: 123 KVNKEDXVKSELSTEDSQYCGSCYGAIDQSGNEKESDPTKWVCCNSCEAVKLAYSKAAWK 182
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+ + I+QC++EG++ RI + EGC + G ++N++ GN HFAPG S + HVHD+
Sbjct: 183 FYDGEGIEQCEKEGYVDRINKRLDEGCRVKGTAQLNRIGGNLHFAPGSSITMNDRHVHDL 242
Query: 237 LAFQR--DSFNISHKINKLAFG 256
F + D FN H IN +FG
Sbjct: 243 SLFDKHQDKFNFDHVINHFSFG 264
>gi|221114903|ref|XP_002155889.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Hydra magnipapillata]
Length = 399
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 178/364 (48%), Gaps = 40/364 (10%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
+ LDA+PKI E + + SGG ++++ + + +L SE Y ++ K VD
Sbjct: 19 KDLDAFPKIPESYQETSASGGTVSILVFLFISMLVISEFIYYSGSILTYKYEVDKEADNK 78
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI 128
RIN D+T A+ C + D +D+SG GNV ++ ++ P
Sbjct: 79 FRINIDITV-AMECDDIGADVLDLSG------------------GNV-DTGENLHLTPA- 117
Query: 129 DKPLQRHGGRLEHNETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQC 186
H + + + + A SDE N ++ + D++
Sbjct: 118 ------HFSMSSNQKQWWDAFRSARKSDEGYRSINKVTQIDMIF---------GDVMPTY 162
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
+ + +E +GC IYG +EVNKVAGNFH GKS H H ++N
Sbjct: 163 MPDEIESEFEGKEFDGCRIYGNIEVNKVAGNFHITAGKSIPHPRGHAHLSALVSELNYNF 222
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH+I+ L+FGE PG++NPLDG TP MYQY+I +VPT + +TI++NQ+SV
Sbjct: 223 SHRIDMLSFGEPHPGIINPLDGDLMITTTPYHMYQYYIAIVPTTIQTLK-NTIKTNQYSV 281
Query: 307 TEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
T+ R + Q +PG+FF YD + I V+ EE SF FL +C I+GGVF SG++
Sbjct: 282 TQRSRQLNLNSGSQGVPGIFFKYDFNAISVSVNEERRSFNEFLIRLCGIIGGVFATSGML 341
Query: 366 DAFI 369
+ I
Sbjct: 342 HSAI 345
>gi|119596606|gb|EAW76200.1| ERGIC and golgi 3, isoform CRA_b [Homo sapiens]
Length = 239
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 85/154 (55%), Positives = 112/154 (72%), Gaps = 3/154 (1%)
Query: 233 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
+HD+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY
Sbjct: 85 IHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYM 144
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 350
V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT
Sbjct: 145 KVDGEVLRTNQFSVTRHEKVAN-GLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTG 203
Query: 351 VCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 204 VCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 237
>gi|145546125|ref|XP_001458746.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426567|emb|CAK91349.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 116/375 (30%), Positives = 187/375 (49%), Gaps = 87/375 (23%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLR 70
D Y K+ +D + SG +I+ S I+M +LF +E + YL +T++ +D ++ + L
Sbjct: 4 FDLYRKLPQDLIEPSKSGALISFTSLILMFILFITEFQEYLTQQVQTEMYIDQNKDDKLL 63
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK 130
+N D++FP +PC +S+D D+ G +V+ +++K R + IDK
Sbjct: 64 VNMDISFPNMPCDFISIDQQDVIGTHQQNVEGELYKSR-------------TLNGKVIDK 110
Query: 131 PLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREG 190
L S D N E ++AY++K
Sbjct: 111 YL----------------------STNDSLN-LERAQQAYQQK----------------- 130
Query: 191 FLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHK 249
EGC++ G++ +++V GNFH S H G V+ +L F S ++SH
Sbjct: 131 ----------EGCDLAGYIIISRVPGNFHI----SAHPYGGQVNMVLPFVGLSVIDLSHS 176
Query: 250 INKLAFG---------EHFP-GVVNPLDGVRW--TQE-TPSGM-YQYFIKVVPTVYTDVS 295
I L+FG E F G++NPLDG+R TQE T G+ +QY+I +VPT+Y D+
Sbjct: 177 IKHLSFGKQNDIQKIREKFKQGLLNPLDGIRRIKTQELTNVGVTHQYYISIVPTLYVDID 236
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
NQF+ ++ + + +P V+F YD+SP+ V FT+ + SF HF+ +CAI+
Sbjct: 237 NKEYFVNQFAA-----NTNEAQTTQMPAVYFRYDISPVTVQFTKYYESFNHFIVQLCAIL 291
Query: 356 GGVFTVSGIIDAFIY 370
GGVFT++GIID+ Y
Sbjct: 292 GGVFTIAGIIDSIFY 306
>gi|169731514|gb|ACA64886.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
(predicted) [Callicebus moloch]
Length = 237
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 85/153 (55%), Positives = 111/153 (72%), Gaps = 3/153 (1%)
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
HD+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY
Sbjct: 84 HDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMK 143
Query: 294 VSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT V
Sbjct: 144 VDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGV 202
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
CAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 203 CAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 235
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/79 (49%), Positives = 53/79 (67%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ K++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD S
Sbjct: 4 LGKLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKS 63
Query: 65 RGETLRINFDVTFPALPCS 83
RG+ L+IN DV FP +PC+
Sbjct: 64 RGDKLKINIDVLFPHMPCA 82
>gi|449705731|gb|EMD45722.1| endoplasmic reticulumgolgi intermediate compartment protein,
putative [Entamoeba histolytica KU27]
Length = 272
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 90/239 (37%), Positives = 142/239 (59%), Gaps = 10/239 (4%)
Query: 146 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 205
C SCYGAE+ ++ CC C++V+EAY+K+GW L + +++ QC+ +Q K + EGC +
Sbjct: 42 CRSCYGAETPEKKCCFTCDDVKEAYKKRGWRL-DLNIVSQCQNHEKIQMAKLTKDEGCRL 100
Query: 206 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 265
G +NK+ GNFH APG S G H H++ + ++SHK N+L+FGE+
Sbjct: 101 IGDFLLNKIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQIDLSHKWNELSFGENSKKFTTE 160
Query: 266 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 325
+ + M+QY++ ++P ++G T +S+ E+ RS E G Q PGVF
Sbjct: 161 KKDTQM-----NSMFQYYLTIIPIKNNFING-TSTFYDYSIQENIRSGE-GEGQ--PGVF 211
Query: 326 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+YD+SP+ + TE + FLHFL +C+IVGG+FT + DA ++ +KKK+E+GK
Sbjct: 212 IYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHTLKKKVELGK 270
>gi|407859749|gb|EKG07137.1| hypothetical protein TCSYLVIO_001725 [Trypanosoma cruzi]
Length = 393
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 129/389 (33%), Positives = 200/389 (51%), Gaps = 45/389 (11%)
Query: 4 IMNKIRSLDAYPKINEDF-YSRTFSGGVITLVSSIVMLLLFFSELRLYL---NAVTETKL 59
++ K+ ++D +PK ED+ S+T+ G +++LV+ +V+ LL F E+ Y+ +A T T+L
Sbjct: 20 LLKKVAAVDLFPKPKEDYSRSQTYHGALVSLVTVVVIGLLVFWEVCSYIFGRDAYT-TEL 78
Query: 60 LVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN--VIE 117
VDTS + N D+TFP +PC +S+D +D++G +L+V +IFK +D+QGN I
Sbjct: 79 SVDTSLSTEVEFNLDITFPRVPCHEVSLDVLDVTGTVNLNVTRNIFKTPVDAQGNFAFIG 138
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAE------SSDEDCCNNCEEVREAYR 171
+RQ G+G + ++ +CG C+ +E + CCN C +V AY
Sbjct: 139 TRQ-GVGE---YGSFREQSKDDPNSPQFCGRCFISEHQLSMMDNKNRCCNTCNDVLNAYD 194
Query: 172 KKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
++G + ++QC E L I GCN G L V K G FAP + G
Sbjct: 195 QQGLPRPQKNEVEQCIYE--LSLIN----PGCNYKGTLIVKKFGGRLVFAPKRV--PGGF 246
Query: 232 HVHDILAFQRDSFNISHKINKLAFGEH------FPGVVNPLDGVRWTQETPSGMYQYFIK 285
+ D++ F DS SH INKL+ G+ GV +PL+G + + +YF+K
Sbjct: 247 LIKDVMQF--DS---SHIINKLSIGDERVTRFSRRGVQHPLNGHEFVAQRRFTEIRYFLK 301
Query: 286 VVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTL-----PGVFFFYDLSPIKVTFTEE 340
VVPT+Y SG S F+ T + RL + P V +D P++V
Sbjct: 302 VVPTMY--FSGK--NSASFNATYEYSVQWSHRLTPIGFGHFPSVSLGFDFHPMQVNNYFR 357
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
SF HF+ +C IVGG+F V G+ID +
Sbjct: 358 RSSFPHFIVQLCGIVGGLFVVLGLIDGLV 386
>gi|145511431|ref|XP_001441642.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408894|emb|CAK74245.1| unnamed protein product [Paramecium tetraurelia]
Length = 329
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 131/401 (32%), Positives = 192/401 (47%), Gaps = 92/401 (22%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +++R LD Y K+ D T +G +I+ V+++LF +EL+ Y+ +++ VD
Sbjct: 4 GVQSRLRKLDIYRKLPADLTEPTTAGALIS-----VIIILFITELQAYIEVDNSSEMFVD 58
Query: 63 TSRG-ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
+RG E +R+N D+ F PC ILS LDV+ R + +G E R +
Sbjct: 59 INRGGEQIRVNLDIEFHKFPCDILS-----------LDVQDYYGVSRCECRG---EQRME 104
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
K + ++ H EH+ N
Sbjct: 105 RQFLKKFIQIMKEH----EHH------------------------------------NQP 124
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
ID + E Q KE+EG C I G++ VNKV GNFH S H G +H + FQR
Sbjct: 125 SIDFARIE---QAFKEKEG--CQIAGYIIVNKVPGNFHV----SAHAFGGILHQV--FQR 173
Query: 242 ---DSFNISHKINKLAFGEH----------FPGVVNPLDGVRWTQETPSG---MYQYFIK 285
+ ++SH IN ++FGE GV+NPLD + + G M+QY+I
Sbjct: 174 SQIQTLDLSHTINHISFGEEDDLMKIKKQFQKGVLNPLDNTKKVAQPQGGTGMMFQYYIS 233
Query: 286 VVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFL 345
VVPT Y DVSG+ +QF+ +S + LP +F YDLSP+ V F + SFL
Sbjct: 234 VVPTTYVDVSGNEYYVHQFTA-----NSNEVLTDHLPAAYFRYDLSPVTVKFLQYRESFL 288
Query: 346 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
HFL +CAI+GGVFT++ I+D I+ A+ KK E+GK S
Sbjct: 289 HFLVQICAILGGVFTIASIVDGMIHKSVVALLKKYEMGKLS 329
>gi|407424942|gb|EKF39210.1| hypothetical protein MOQ_000571 [Trypanosoma cruzi marinkellei]
Length = 393
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 126/389 (32%), Positives = 201/389 (51%), Gaps = 45/389 (11%)
Query: 4 IMNKIRSLDAYPKINEDF-YSRTFSGGVITLVSSIVMLLLFFSELRLYL---NAVTETKL 59
++ K+ ++D +PK ED+ S+T+ G +++LV+ +V+ LL F E+ Y+ +A T T+L
Sbjct: 20 LLKKVAAVDFFPKPKEDYSRSQTYRGALVSLVTVVVIGLLVFWEVYSYIVGRDAYT-TEL 78
Query: 60 LVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN--VIE 117
VDTS + N D+TFP + C +S+D +D++G +L+V +IFK +D+QGN I
Sbjct: 79 SVDTSLSTEVEFNLDITFPRIRCHDVSLDILDVTGTVNLNVTRNIFKTPVDAQGNFAFIG 138
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY------GAESSDEDCCNNCEEVREAYR 171
+RQ G+G + ++ +CG C+ + + CCN C++V AY
Sbjct: 139 TRQ-GVGE---YGSFREQSKDDPNSPQFCGRCFINEHQVSVKENKNRCCNTCDDVLNAYD 194
Query: 172 KKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
++G ++QC + L RI GCN G L V K G FAP + G
Sbjct: 195 QQGLPRPRKSEVEQCIYD--LSRIN----PGCNYKGTLIVKKFGGRLVFAPKRV--SGGF 246
Query: 232 HVHDILAFQRDSFNISHKINKLAFGEH------FPGVVNPLDGVRWTQETPSGMYQYFIK 285
+ D++ F DS SH INKL+ G+ GV +PL+G ++ + +YF+K
Sbjct: 247 LIKDVMQF--DS---SHVINKLSIGDERVTRFSRRGVQHPLNGHKFDTQRRITEIRYFLK 301
Query: 286 VVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTL-----PGVFFFYDLSPIKVTFTEE 340
+VPT+Y +SG S F+ T + RL + P V +D P++V
Sbjct: 302 IVPTMY--LSGK--NSAPFNATYEYSVQWSQRLTPIGFGHFPSVSLGFDFHPMQVNNYFR 357
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
SF HF+ +C IVGG+F V G+ID +
Sbjct: 358 RSSFPHFIVQLCGIVGGLFVVLGLIDGLV 386
>gi|123389547|ref|XP_001299739.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121880652|gb|EAX86809.1| hypothetical protein TVAG_100310 [Trichomonas vaginalis G3]
Length = 351
Score = 174 bits (441), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 123/381 (32%), Positives = 184/381 (48%), Gaps = 39/381 (10%)
Query: 11 LDAYPK-INEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL-VDTSRG-- 66
LD +PK I+ +T G +++ L L SE+ Y +L+ V RG
Sbjct: 3 LDFFPKFIDSAMTHKTACGAFNSILMIACALALCISEIYAYAKPALHEQLVSVSDLRGAL 62
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ L I+F+ T ++PC +L +D D+ G + + ++K R+D GN I Q
Sbjct: 63 DQLSISFNFTV-SVPCVLLHLDVFDMMGSGNRPDQKTLYKVRVDQNGNPIPQTQIA---- 117
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
CG CYGAESS CC CE+V AY++KGW + N QC
Sbjct: 118 -----------------EDCGPCYGAESSQRKCCQTCEDVVAAYQEKGWGIGNLSSWAQC 160
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
+ EG + KE C YG L VN + G FH APG + HVHD D+ N+
Sbjct: 161 RAEGVMFDGKER----CQAYGNLHVNAIEGGFHLAPGINVFSRFGHVHDFSPLV-DTLNL 215
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFS 305
+H+I ++FG P +PLD R Q+ P + Y+Y +K VPTV +V+G + +F+
Sbjct: 216 THEIEHISFGA--PIDKSPLDNTRVVQKKPGQIHYRYNLKAVPTV-KEVNGKVHRFFRFT 272
Query: 306 VT-EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
V + +GR PG+FF Y +P+ +T T + + L + +I GG F ++ +
Sbjct: 273 VNYAEIPVTARGRYG--PGIFFVYSFAPVAITSTYDRPNITVLLARLISIFGGSFMLARL 330
Query: 365 IDAFIYHGQRAIKKKIEIGKF 385
ID+F Y I+ K I KF
Sbjct: 331 IDSFTYR-LNTIEGKDRINKF 350
>gi|327273481|ref|XP_003221509.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Anolis carolinensis]
Length = 377
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 116/367 (31%), Positives = 176/367 (47%), Gaps = 50/367 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LNLMKELDAFPKVPESYIETSASGGTVSLIAFTTMALLTIMEFTVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FTSKLRINIDITV-AMKCQYIGADVLDLA--------------------ETMVASADGLS 108
Query: 125 APKID---KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ PLQR R+ S E S +D + A++ AL
Sbjct: 109 YEPVIFELSPLQREWQRMLQ---IIQSRLQEEHSLQDVI-----FKTAFKSASTALPP-- 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
+ + LQ + C I+G L VNKVAGNFH GK+ H H
Sbjct: 159 -----REDNTLQ-----PPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSH 208
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI-- 299
+S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVP T + H I
Sbjct: 209 ESYNFSHRIDHLSFGELIPGIINPLDGTEKVASDHNQMFQYFITVVP---TKLHTHKISA 265
Query: 300 QSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+++QFSVTE R + + G+F YD+S + VT TEEH+ F FL +C I+GG+
Sbjct: 266 ETHQFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGI 325
Query: 359 FTVSGII 365
F+ +GI+
Sbjct: 326 FSTTGIL 332
>gi|345567560|gb|EGX50490.1| hypothetical protein AOL_s00075g219 [Arthrobotrys oligospora ATCC
24927]
Length = 354
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 180/359 (50%), Gaps = 52/359 (14%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++S DA+PK + +R+ GGVIT+V + + L + EL LYL+ +E V G
Sbjct: 10 LKSFDAFPKTRVSYTTRSSKGGVITMVFVAICVWLVWGELSLYLDGKSEEHFSVQGGEGH 69
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
++IN DV A+PC L V+ D +G++ L D+ K + + I + + K
Sbjct: 70 FMQINLDVIV-AMPCDSLHVNVQDAAGDRIL--AGDLLHK---ASTDFIYADTHSL-PQK 122
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ R GG +Y GS EEV + K
Sbjct: 123 LKNKDSREGG-----PSYDGS---------------EEVIKKA--------------GKK 148
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
++ L K +G+ C I+G ++VN+V G+FH A G + G HV D+FN
Sbjct: 149 KKFKLNLPKRPKGKSCRIWGSMDVNRVMGDFHITAKGHGYWDPGQHV------DHDTFNF 202
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH +N+L+FGE +P +VNPLDGV E YQYF+ VVPT Y G T+Q+NQ+SV
Sbjct: 203 SHVVNELSFGEFYPKLVNPLDGVASVTEDKFYRYQYFMSVVPTTYK-AHGRTLQTNQYSV 261
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
TE RS Q++PG+FF +D+ PI +T T+ H +++ + + ++GGV G +
Sbjct: 262 TEQGRSMNP---QSVPGIFFKFDIEPIMLTITDTHTPWIYLIVRLANVIGGVMVAGGWL 317
>gi|387015774|gb|AFJ50006.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2-like
[Crotalus adamanteus]
Length = 377
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 118/370 (31%), Positives = 179/370 (48%), Gaps = 50/370 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ LDA+PKI + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LNLVKELDAFPKIPDSYIETSTSGGTVSLIAFTTMALLTIMEFMVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG-I 123
LRIN D+T A+ C + D +D++ + + DG +
Sbjct: 70 YTSKLRINVDITV-AMKCQHIGADVLDLA--------------------ETMVATADGLV 108
Query: 124 GAPKIDK--PLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
P I + PLQR R+ N S E S +D + A++ AL P
Sbjct: 109 YEPVIFELSPLQREWQRILQN---IQSRLQEEHSLQDII-----FKSAFKSASTAL--PP 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
D + + C I+G L VNKVAGNFH GK+ H H
Sbjct: 159 REDN----------PVQSADACRIHGHLYVNKVAGNFHVTVGKAIPHPRGHAHLAALVSH 208
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI-- 299
+S+N SH+I+ L+FGE PG++NPLDG + M+QYF+ VVP T + H I
Sbjct: 209 ESYNFSHRIDHLSFGELIPGIINPLDGTEKIASDHNQMFQYFVTVVP---TKLQTHKISA 265
Query: 300 QSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+++QF+VTE R + + G+F YD+S + VT TEEH+ F FL +C IVGG+
Sbjct: 266 ETHQFAVTERERIINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIVGGI 325
Query: 359 FTVSGIIDAF 368
F+ +GI+ +
Sbjct: 326 FSTTGILHSI 335
>gi|148674216|gb|EDL06163.1| ERGIC and golgi 3, isoform CRA_c [Mus musculus]
Length = 261
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 85/160 (53%), Positives = 112/160 (70%), Gaps = 9/160 (5%)
Query: 233 VHDILAFQRDS------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKV 286
+HD+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KV
Sbjct: 101 IHDLQSFGLDNPSDCLQINMTHYIKHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKV 160
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSF 344
VPTVY V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF
Sbjct: 161 VPTVYMKVDGEVLRTNQFSVTRHEKVAN-GLLGDQGLPGVFVLYELSPMMVKLTEKHRSF 219
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 220 THFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 259
>gi|123449396|ref|XP_001313417.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121895300|gb|EAY00488.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 361
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 126/395 (31%), Positives = 197/395 (49%), Gaps = 57/395 (14%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR-- 65
IR D +PK+ ++ T SGG+++L+S ++L F E+ YLNA T L VDT R
Sbjct: 3 IRKFDVFPKLANEYRIGTISGGILSLISVFAAIVLCFYEVAAYLNAPTRQFLFVDTRRPT 62
Query: 66 ---GET--------LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-KRLDSQG 113
G T L + VTFP PC ++ +D +D + + +++ K RLDSQG
Sbjct: 63 GPDGVTIDQNSQPRLDVKVSVTFPKAPCFLIHLDVIDSVTQLAMPLENINSKFMRLDSQG 122
Query: 114 NVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK 173
IE+ + ++ +Q CGSCY A+ CC +C+EV +AYR
Sbjct: 123 KPIEALD---LSTLVNTTVQEK----------CGSCYNAKDPKRICCRSCQEVFDAYRDA 169
Query: 174 GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
+ I+QCK +++ + EGEGC + + +VA H APG S++ G HV
Sbjct: 170 AFKPPVLTEIEQCKPVA--EKVAKMEGEGCKVDASFKALRVASEMHIAPGYSWNSEGWHV 227
Query: 234 HDILAFQRD--SFNISHKINKLAFGEH---FPGVVNPLDGVRWTQETPSGMYQYFIKVVP 288
HD+ F ++ S N++H I+ L+F E +P +N L+ V +T +G + +VV
Sbjct: 228 HDLSLFTKEFASLNLTHTIHYLSFSEKEGDYP--LNNLNNV----QTENGAW----RVVY 277
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIK-VTFTEEHVSFLHF 347
T ++ Q + F S G+FF YD+SPI VT+T+ F H
Sbjct: 278 TADILEGNYSASKYQMYNPKSFAS----------GLFFKYDVSPISAVTYTDSEPVF-HL 326
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 382
LT + ++GGV + +IDA +H +R +K+ EI
Sbjct: 327 LTRILTVLGGVLGLCRLIDAITFHTRR-MKRTEEI 360
>gi|156030895|ref|XP_001584773.1| hypothetical protein SS1G_14228 [Sclerotinia sclerotiorum 1980]
gi|154700619|gb|EDO00358.1| hypothetical protein SS1G_14228 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 381
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 128/401 (31%), Positives = 178/401 (44%), Gaps = 101/401 (25%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
++ LDA+ K ++ RT SGG++T+ S +++L L F E Y +L+VD R
Sbjct: 5 SRFTRLDAFTKTVDEARVRTTSGGIVTIASLLIVLYLAFGEWADYRRITVHPELVVDKGR 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-D 121
D MD+SGEQ + V H + K RL +Q G VI++ D
Sbjct: 65 ----------------------DVMDVSGEQQVGVMHGVKKVRLSAQEEGGKVIDTTALD 102
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWAL 177
A + L + YCG CYGA + + CCN C+EVREAY WA
Sbjct: 103 LHNADEAATHL---------DPNYCGPCYGATPPPNAKKQGCCNTCDEVREAYASVSWAF 153
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
+ ++QC+RE + +R+ + EGC I G L VNKV GNFH APG+SF +HVHD+
Sbjct: 154 GRGENVEQCEREHYGERLDSQRKEGCRIEGGLRVNKVIGNFHIAPGRSFTNGNMHVHDLN 213
Query: 238 AFQRDSFN----ISHKINKLAFGEHFPGVV-----------------NPLDGVRWTQETP 276
+ SH I+ L FG P V NPLD
Sbjct: 214 NYFDTPVPGGHVFSHHIHSLRFGPELPEEVTKKLGSDSIIPWTNHHLNPLDNTEQITHEA 273
Query: 277 SGMYQYFIKVVPTVYTDVS-------------------GH----TIQSNQFSVTEHFRS- 312
+ + YF+KVV T Y + GH +I+++Q+SVT H RS
Sbjct: 274 AYNFMYFVKVVSTSYLPLGWETTYNSPPHDASVDIGTYGHSEDGSIETHQYSVTSHRRSL 333
Query: 313 -----SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV 342
S +G + L PGVFF Y V+F E H+
Sbjct: 334 NGGDDSAEGHKEKLHARGGIPGVFFSY------VSFLEIHM 368
>gi|449479952|ref|XP_004155757.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 266
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 104/295 (35%), Positives = 162/295 (54%), Gaps = 39/295 (13%)
Query: 85 LSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET 144
LSVDA+D+SG+ +D+ +I+K RL+S G +I +
Sbjct: 4 LSVDAIDMSGKHEVDLDTNIWKLRLNSHGQIIGTE------------------------- 38
Query: 145 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 204
Y E D ++ ++ ++ G+ + +L+ + K+ EE +GC
Sbjct: 39 YLSDLVEKEHVDHKHDHDHDKEKDHPHIHGFDQAAENLVKKVKQA-------LEEAQGCR 91
Query: 205 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 264
+YG L+V +VAGNFH S H + V ++ N+SH I+ L+FG +PG+ N
Sbjct: 92 VYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFGGSKHVNVSHMIHDLSFGPKYPGIHN 147
Query: 265 PLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 323
PLDG VR ++T SG ++Y+IK+VPT Y +S + +NQFSVTE+F S ++ P
Sbjct: 148 PLDGTVRILRDT-SGTFKYYIKIVPTEYKYISKAVLPTNQFSVTEYF-SPMTDSDRSWPA 205
Query: 324 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
V+F YDLSPI VT EE SFLHF+T +CA++GG F V+G++D +++ A+ K
Sbjct: 206 VYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLDRWMFRFLEALTK 260
>gi|169603005|ref|XP_001794924.1| hypothetical protein SNOG_04508 [Phaeosphaeria nodorum SN15]
gi|111067148|gb|EAT88268.1| hypothetical protein SNOG_04508 [Phaeosphaeria nodorum SN15]
Length = 351
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 102/297 (34%), Positives = 142/297 (47%), Gaps = 63/297 (21%)
Query: 145 YCGSCYGAESS----DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 200
YCG CYGA S CCN C+EVR+AY W+ + ++QC+RE + + + ++
Sbjct: 51 YCGECYGAPSPTNAIKAGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDQQRQ 110
Query: 201 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN--ISHKINKLAFGEH 258
EGC + G + VNKV GNFH APGKSF +HVHD+ + +D ++ +HKI+ L FG
Sbjct: 111 EGCRLEGSIRVNKVVGNFHIAPGKSFSTGNMHVHDLENYFKDEYSHTFTHKIHHLRFGPQ 170
Query: 259 FPGVV---------------------NPLDGVRWTQETPSGMYQYFIKVVPTVY------ 291
V NPLD + + YF+KVV T Y
Sbjct: 171 LSNAVIADMQKKHQNTGPGGWTSHHINPLDNTEQQTSEKAYNFMYFVKVVSTAYLPLGWE 230
Query: 292 ---------TDVSGHTIQSN--------QFSVTEHFRSSEQGRLQT------------LP 322
++ G TI+ N Q+SVT H RS G + +P
Sbjct: 231 KEAPRLTKHDELLGSTIEGNYKGSIETHQYSVTSHKRSLAGGNDEKEGHKERIHAKGGIP 290
Query: 323 GVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
GVFF YD+SP+KV E +F FL +CA++GG TV+ +D +Y G IKK
Sbjct: 291 GVFFSYDISPMKVINREVRDKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNKIKK 347
>gi|145349688|ref|XP_001419260.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579491|gb|ABO97553.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 310
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 115/368 (31%), Positives = 167/368 (45%), Gaps = 67/368 (18%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLR 70
+DA+ + RT +G +++V ++ L E+ +L VD +R TLR
Sbjct: 1 VDAFARAAPHLTKRTRAGACVSVVGVVLACALALVEITDFLTPTRAKTHGVDDARNATLR 60
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK 130
I DVTFP +PC +L VDA D SG+ +D + + K RLD+ G I + G
Sbjct: 61 IEIDVTFPRMPCQLLYVDAYDESGKHEVDARGLLLKTRLDASGRAIGEYESAGGVDLGGL 120
Query: 131 PL-QRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKRE 189
L QR R EH EVREA
Sbjct: 121 VLFQR---RPEH---------------------AHEVREA-------------------- 136
Query: 190 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 249
+ + EGC ++G LE +VAG + G ++ ++D + ++ H
Sbjct: 137 -------KADVEGCRLHGELEARRVAGTLRASTGPESYEFLKEIYD----EPWEIDMRHA 185
Query: 250 INKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--------TIQS 301
+ FG FPG VNP++GVR ET SG+Y+YF+KVVPT Y+ ++
Sbjct: 186 VKTFTFGAEFPGAVNPMNGVR-RMETKSGIYKYFMKVVPTTYSSTRALFGFIPWTVRTRT 244
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQ+SVTEHF E LP +FF YDLS I V T S ++FLT A +GG+F +
Sbjct: 245 NQYSVTEHF--IETPHWGALPQLFFIYDLSAIAVNITVTSKSIVYFLTKTLATMGGIFAL 302
Query: 362 SGIIDAFI 369
+ +D +I
Sbjct: 303 TRTVDRYI 310
>gi|224093106|ref|XP_002193654.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Taeniopygia guttata]
Length = 377
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 113/367 (30%), Positives = 174/367 (47%), Gaps = 44/367 (11%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ LDA+PK+ E + + SGG ++L++ + L E +Y + + + VD
Sbjct: 10 LNLMKELDAFPKVPESYVETSASGGTVSLIAFTTIAFLTIMEFMVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FTSKLRINIDITV-AMRCQYVGADVLDLA--------------------ETMVASADGLI 108
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ L L+ S E S +D + A++ AL
Sbjct: 109 YEPVPFELTPQQKELQRMLQLIQSRLQEEHSLQDVI-----FKSAFKSASTALPP----- 158
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
+ + LQ + C I+G L VNKVAGNFH GK+ H H +S+
Sbjct: 159 --REDNSLQ-----SPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESY 211
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSN 302
N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T +
Sbjct: 212 NFSHRIDHLSFGELIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET---H 268
Query: 303 QFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
QFSVTE R + + G+F YD+S + VT TEEH+ F FL +C I+GG+F+
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFST 328
Query: 362 SGIIDAF 368
+GI+ F
Sbjct: 329 TGILHGF 335
>gi|123438593|ref|XP_001310077.1| MGC83277 protein [Trichomonas vaginalis G3]
gi|121891831|gb|EAX97147.1| MGC83277 protein, putative [Trichomonas vaginalis G3]
Length = 355
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 175/364 (48%), Gaps = 39/364 (10%)
Query: 23 SRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV-------------DTSRGETL 69
SRT SGG+I ++S I M++LF + + N+ K +V DT +
Sbjct: 3 SRTNSGGIIAVLSVISMVILFILRFQAWTNSPLTQKFVVNTPQLPFINNRIIDTEHLPKM 62
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID 129
INFD+ +PCS L VD +D E + + +R D +GN I + PK
Sbjct: 63 DINFDIMMKHIPCSYLHVDVIDNIKESDESYEGHVRMERFDEKGNPILKKS----YPK-- 116
Query: 130 KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKRE 189
+ + YCG+CYG +S CCN C+EVR+A++ I QC E
Sbjct: 117 ------NSSVTKDPGYCGNCYGQKSG---CCNTCKEVRKAFKANNRPPPPIIHIQQCVDE 167
Query: 190 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH--DILAFQRDSFNIS 247
G+ + + +GE C ++G L V++ G FH APG+S++ +G H H + L D N S
Sbjct: 168 GYKEELIAMKGEACRVHGTLTVHRAPGTFHVAPGESYNINGEHDHYYEDLGINIDEMNFS 227
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQ-YFIKVVPTVYTDVSGHTIQSNQFSV 306
H IN + G PLDG Q+ M YF++ VP ++ G S S
Sbjct: 228 HTINHFSIGMPTANSYYPLDGHTEIQQKTGRMKMIYFLRAVP---INLDGRVF-SFGASS 283
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
+++R S + PGVFF YD+S I + + ++ S + +T + +I+GGVF ++ +D
Sbjct: 284 YQNYRGSNSTK---YPGVFFSYDVSLIGIV-SSQNSSLMDLVTELMSILGGVFAIATFLD 339
Query: 367 AFIY 370
Y
Sbjct: 340 MLSY 343
>gi|126339088|ref|XP_001363644.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Monodelphis domestica]
Length = 378
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 173/365 (47%), Gaps = 46/365 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ LDA+PK+ + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LNLVKELDAFPKVPVSYVETSASGGTVSLIAFTTMALLTIMEFSVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN ++T A+ C + D +D++ + + DG+
Sbjct: 70 FSSKLRININITV-AMKCQYVGADVLDLA--------------------ETMVAAADGLV 108
Query: 125 APKID---KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ P QR R+ S E S +D + A++ AL
Sbjct: 109 YEPVIFDLSPQQREWQRMLQT---IQSRLQEEHSLQDVI-----FKSAFKSASTALPP-- 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
+ + LQ + C I+G L VNKVAGNFH GK+ H H
Sbjct: 159 -----REDNSLQ-----PPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSH 208
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT + + +
Sbjct: 209 DSYNFSHRIDHLSFGELVPGIINPLDGTEKIANDHNQMFQYFITVVPT-KLNTYKISADT 267
Query: 302 NQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+QFSVTE R+ + + G+F YDLS + VT TEEH+ F FL +C I+GG+F+
Sbjct: 268 HQFSVTERERAINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFLVRLCGIIGGIFS 327
Query: 361 VSGII 365
+G++
Sbjct: 328 TTGML 332
>gi|348562091|ref|XP_003466844.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Cavia porcellus]
Length = 377
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/366 (31%), Positives = 174/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LNLVKELDAFPKVPQSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDVAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P Q+ R+ S E S +D + A++ AL
Sbjct: 110 EPAIFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTALP---- 157
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
RE + + C I+G L VNKVAGNFH GK+ H H D
Sbjct: 158 ----PREAN----SSQSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|355686514|gb|AER98081.1| ERGIC and golgi 2 [Mustela putorius furo]
Length = 365
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 117/369 (31%), Positives = 174/369 (47%), Gaps = 48/369 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPAIFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I G L VNKVAGNFH GK+ H H D
Sbjct: 160 EDDSS----------QPPDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGIIDAF 368
+ +G++ F
Sbjct: 327 STTGMLHGF 335
>gi|313661438|ref|NP_001186332.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Gallus gallus]
Length = 377
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 116/369 (31%), Positives = 175/369 (47%), Gaps = 48/369 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ LDA+PK+ E + + SGG ++L++ + L E +Y + + + VD
Sbjct: 10 LNLMKELDAFPKVPESYVETSASGGTVSLIAFTTIAFLTIMEFTVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S I
Sbjct: 70 FTSKLRINIDITV-AMRCQYVGADVLDLAE-------------------TMVASADGLIY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPVVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D E + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 160 EDNSL----------ESPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHE 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET-- 267
Query: 301 SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YD+S + VT TEEH+ F FL +C I+GG+F
Sbjct: 268 -HQFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIF 326
Query: 360 TVSGIIDAF 368
+ +GI+ F
Sbjct: 327 STTGILHGF 335
>gi|326911226|ref|XP_003201962.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Meleagris gallopavo]
Length = 377
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 116/369 (31%), Positives = 175/369 (47%), Gaps = 48/369 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ LDA+PK+ E + + SGG ++L++ + L E +Y + + + VD
Sbjct: 10 LNLMKELDAFPKVPESYVETSASGGTVSLIAFTTIAFLTIMEFTVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S I
Sbjct: 70 FTSKLRINIDITV-AMRCQYVGADVLDLAE-------------------TMVASADGLIY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPVVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D E + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 160 EDNSL----------ESPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHE 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET-- 267
Query: 301 SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YD+S + VT TEEH+ F FL +C I+GG+F
Sbjct: 268 -HQFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIF 326
Query: 360 TVSGIIDAF 368
+ +GI+ F
Sbjct: 327 STTGILHGF 335
>gi|431908425|gb|ELK12022.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Pteropus alecto]
Length = 377
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 171/364 (46%), Gaps = 44/364 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLA--------------------ETMVASADGLV 108
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ L + S E S +D + A++ AL
Sbjct: 109 YEPVIFDLSPQQKEWQRMLQLIQSRLQEEHSLQDVI-----FKSAFKSSSTALP------ 157
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
RE + + C I G L VNKVAGNFH GK+ H H DS+
Sbjct: 158 --PRE----EDSSQPPDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSN 302
N SH+I+ L+FGE PG++NPLDG E + M+QYFI VVPT ++T +S T +
Sbjct: 212 NFSHRIDHLSFGELVPGIINPLDGTEKIAEDHNQMFQYFITVVPTKLHTYKISADT---H 268
Query: 303 QFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F+
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 362 SGII 365
+G++
Sbjct: 329 TGML 332
>gi|345441780|ref|NP_001230861.1| ERGIC and golgi 2 [Sus scrofa]
Length = 377
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 114/364 (31%), Positives = 175/364 (48%), Gaps = 44/364 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ +++ + + Q
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAETMVASTDGLVYEPAIFDLSPQQKEWQ---- 124
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ LQR RL+ E S +D + A++ AL P D
Sbjct: 125 -----RMLQRIQSRLQE-----------EHSLQDVI-----FKSAFKSASTAL--PPRED 161
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
+ + C I+G L VNKVAGNFH GK+ H H DS+
Sbjct: 162 DSS----------QPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSN 302
N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T +
Sbjct: 212 NFSHRIDHLSFGELVPGIINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADT---H 268
Query: 303 QFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F+
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 362 SGII 365
+G++
Sbjct: 329 TGML 332
>gi|426225295|ref|XP_004006802.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Ovis aries]
Length = 377
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 116/366 (31%), Positives = 173/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P QR R+ S E S +D + A++ AL P
Sbjct: 110 EPAIFDLSPQQREWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I G L VNKVAGNFH GK+ H H D
Sbjct: 160 EDDSS----------QPPDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPGIINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QF+VTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFAVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|395537817|ref|XP_003770886.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Sarcophilus harrisii]
Length = 378
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 172/365 (47%), Gaps = 46/365 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ LDA+PK+ + + GG ++L++ M LL E +Y + + + VD
Sbjct: 10 LNLVKELDAFPKVPVSYVETSAIGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FSSKLRINIDITV-AMKCHYVGADVLDLA--------------------ETMVAPADGLV 108
Query: 125 APKID---KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ P QR R+ S E S +D + A++ AL
Sbjct: 109 YEPVIFDLSPQQREWQRMLQT---IQSRLQEEHSLQDVI-----FKSAFKSASTALPP-- 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
+ + LQ + C I+G L VNKVAGNFH GK+ H H
Sbjct: 159 -----REDNSLQ-----PPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSH 208
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT + + +
Sbjct: 209 DSYNFSHRIDHLSFGELVPGIINPLDGTEKIAIDHNQMFQYFITVVPT-KLNTYKISADT 267
Query: 302 NQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+QFSVTE R+ + + G+F YDLS + VT TEEH+ F FL +C I+GG+F+
Sbjct: 268 HQFSVTERERAINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFLVRLCGIIGGIFS 327
Query: 361 VSGII 365
+G++
Sbjct: 328 TTGML 332
>gi|344267803|ref|XP_003405755.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Loxodonta africana]
Length = 377
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 117/366 (31%), Positives = 172/366 (46%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LNLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P Q+ R+ S E S +D + A + AL P
Sbjct: 110 EPAIFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAIKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I G L VNKVAGNFH GK+ H H D
Sbjct: 160 EDD----------SSQPPDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|332020071|gb|EGI60517.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Acromyrmex echinatior]
Length = 390
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 177/361 (49%), Gaps = 43/361 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ LDA+PK+ E + +T GG ++ + ++++ L +E YL++ + DT
Sbjct: 12 VKELDAFPKVPEVYVDKTAVGGTFSIFTVLIIMYLVIAETSYYLDSRLQFTFEPDTDIDA 71
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L+IN DVT A+PC + D +D + + +D L + E Q+
Sbjct: 72 KLQINIDVTV-AMPCGRIGADVLDSTNQHMIDFD------SLTEEDTWWELTQEQ----- 119
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ H L+H +Y Y A + W + L +
Sbjct: 120 -----RTHFEALKHMNSYLREEY-----------------HAIHELLWKSNQVTLYSEMP 157
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNI 246
+ + + + C ++G L +NKVAGNFH GKS H+H I AF D +N
Sbjct: 158 KRSY---VPDYAPNACRVHGSLNINKVAGNFHITAGKSLSVPHGHIH-ISAFMTDRDYNF 213
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFS 305
+H+INK +FG PG+V+PL+G + +YQYF++VVPT + T ++ T ++ Q+S
Sbjct: 214 THRINKFSFGGPSPGIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRTLLT--TSKTYQYS 271
Query: 306 VTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
V +H R + + +PG+FF YD+S +K+ T+E + FL +CA VGG+F SG+
Sbjct: 272 VKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQFLVKLCATVGGIFVTSGL 331
Query: 365 I 365
+
Sbjct: 332 V 332
>gi|291392459|ref|XP_002712727.1| PREDICTED: PTX1 protein [Oryctolagus cuniculus]
gi|291416214|ref|XP_002724342.1| PREDICTED: PTX1 protein-like [Oryctolagus cuniculus]
Length = 377
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 175/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P QR R+ S E S +D + A++ AL
Sbjct: 110 EPAIFDLSPHQREWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSPSTALPP--- 158
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+ + LQ + C I+G L VNKVAGNFH GK+ H H D
Sbjct: 159 ----REDDSLQ-----SPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI +VPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPGIINPLDGTEKIAIDHNQMFQYFITIVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|149713890|ref|XP_001502984.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Equus caballus]
Length = 377
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 168/366 (45%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LNLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYDVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLA--------------------ETMVASADGLV 108
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ L + S E S +D + A++ AL P D
Sbjct: 109 YEPVIFDLSPQQKEWQRMLQVIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPRED 161
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
+ + C I G L VNKVAGNFH GK+ H H DS+
Sbjct: 162 DS----------SQPPDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ---- 300
N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT HT +
Sbjct: 212 NFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPT-----KLHTYKISAD 266
Query: 301 SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
++QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 267 THQFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|328862174|gb|EGG11276.1| hypothetical protein MELLADRAFT_33547 [Melampsora larici-populina
98AG31]
Length = 361
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 169/362 (46%), Gaps = 61/362 (16%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ IR DA+PK + R+ GG++T+V ++++L + ELR YL VD +
Sbjct: 11 LPAIREFDAFPKTIPTYKERSSRGGILTIVVGFLIMILIWHELREYLFGAATYSFSVDNT 70
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQ-HLDVKHDIFKKRLDSQGNVIESRQDGI 123
G L +NFDVT +PC LS+D D G++ H+ D FKK
Sbjct: 71 VGHDLGLNFDVTI-NMPCHYLSIDVRDAVGDRMHIS---DEFKK---------------- 110
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
E E G E++++ + + VR+A + GW
Sbjct: 111 ----------------EGTEFSIGQAARLETNNDAGISASKMVRDA--QGGWT------- 145
Query: 184 DQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
R F ++ K EG C I+G V KV GN H + S H L
Sbjct: 146 ----RPTF-KKTKPLIPEGPACRIFGSTHVKKVTGNLHITTLGHGYLSWEHTDHQL---- 196
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N++H I++ +FGE FP +V PLD + P ++QYFI VVPT Y + G + +
Sbjct: 197 --MNLTHVISEFSFGEFFPNMVQPLDNSVEITDKPFHIFQYFISVVPTTYINSGGRQVFT 254
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQ+SVT+ RS+E GR +PG+FF YD+ P+ +T E + + FL + IVGG+
Sbjct: 255 NQYSVTDMSRSTEHGR--GVPGIFFKYDIEPMYLTIRERTTTLVQFLVRLAGIVGGIVVC 312
Query: 362 SG 363
+G
Sbjct: 313 TG 314
>gi|410964074|ref|XP_003988581.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Felis catus]
Length = 377
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 171/364 (46%), Gaps = 44/364 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLA--------------------ETMVASADGLV 108
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ L + S E S +D + A++ AL P D
Sbjct: 109 YEPVIFDLSPQQKEWQRMLQLIQSRLQEEHSLQDVI-----FKSAFKSDSTAL--PPRED 161
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
+ + C I+G L VNKVAGNFH GK+ H H DS+
Sbjct: 162 DSS----------QPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSN 302
N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T +
Sbjct: 212 NFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---H 268
Query: 303 QFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F+
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 362 SGII 365
+G++
Sbjct: 329 TGML 332
>gi|308806572|ref|XP_003080597.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
gi|116059058|emb|CAL54765.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
Length = 327
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 116/381 (30%), Positives = 171/381 (44%), Gaps = 79/381 (20%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
++ RSLDA +T +G V++L + V ++L S+ + + VD
Sbjct: 1 MLRAFRSLDALTSAPAHLRRKTSTGAVVSLCGTFVAVILTLSQTIDFFTPLRTKTTRVDE 60
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
R + ++ DVTF +PC IL VDA D SG+ +DV+ + K RLD+ G + +
Sbjct: 61 QRAGEMTMDIDVTFTRMPCQILYVDAYDASGKHEVDVRGRLMKTRLDAAGRELGEYESAG 120
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G L R R EH EVR+A
Sbjct: 121 GVDLGGLVLFRR--RPEHGS---------------------EVRKA-------------- 143
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPG-KSFHQSGVHVHDILAFQRD 242
+ + EGC ++G +E +VAG+ + G +SF F R+
Sbjct: 144 -------------KADMEGCRLHGRVEARRVAGSLRISTGPESFE-----------FLRE 179
Query: 243 SFN------ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
FN H I AFG FPG VNPL+GV+ +E SG+Y+YF+KVVPT Y +
Sbjct: 180 MFNEPWEIDARHAIKTFAFGPEFPGSVNPLNGVK-RKEKKSGIYKYFMKVVPTTYANSRN 238
Query: 297 --------HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
+++NQ+SVTEHF +E LP + F YD+S I V + S ++FL
Sbjct: 239 LFGMIPWTMRVRTNQYSVTEHF--TESAHWGMLPQILFSYDISAISVNVESQSKSGVYFL 296
Query: 349 TNVCAIVGGVFTVSGIIDAFI 369
T A VGGVF ++ ID ++
Sbjct: 297 TKTIATVGGVFALTRTIDRYV 317
>gi|301783747|ref|XP_002927289.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Ailuropoda melanoleuca]
Length = 377
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/366 (31%), Positives = 174/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M +L E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMAILTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPAIFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I+G L VNKVAGNFH GK+ H H D
Sbjct: 160 EDDSS----------QPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|89272944|emb|CAJ82943.1| ptx1 [Xenopus (Silurana) tropicalis]
Length = 377
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 176/368 (47%), Gaps = 52/368 (14%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + S G ++L++ +M +L E +Y N + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASRGTVSLMAFSIMGILTIMEFLVYRNTRMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMD-----ISGEQHLDVKHDIFKKRLDSQGNVIESR 119
+R+N D+T A+ C + D +D ++ Q L + IF+ L Q + +
Sbjct: 70 FTSKIRLNIDITV-AMKCQYVGADVLDLAETMVTSAQGLVYEPVIFE--LSPQQRLWQ-- 124
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
+ LQ+ GRL+ E S +D + A R +S
Sbjct: 125 ----------RMLQQIQGRLQ-----------EEHSLQDLL-----FKSAMRTS--VMSL 156
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
P D E C I+G LE+NKVAGNFH GK+ H H
Sbjct: 157 PPREDS----------PTEPPNACRIHGHLEINKVAGNFHITVGKAIPHPRGHAHLAALV 206
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHT 298
DS+N SH+I+ +FGE PG+VNPLDG E + MYQYFI +VPT ++T+
Sbjct: 207 SHDSYNFSHRIDHFSFGEPLPGIVNPLDGTEKIAEDSNQMYQYFITIVPTKLHTNKVD-- 264
Query: 299 IQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++QFSVTE R + G+F YD+S + V TE+H+ FL +C IVGG
Sbjct: 265 CDTHQFSVTERERVINHASGSHGVSGIFMKYDISSLMVMVTEDHMPLWKFLVRLCGIVGG 324
Query: 358 VFTVSGII 365
+FT +G+I
Sbjct: 325 IFTTTGMI 332
>gi|82074366|sp|Q5EHU7.1|ERGI2_GECJA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
Length = 377
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 174/364 (47%), Gaps = 44/364 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ +++ + + Q
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAETMVASTDGLVYEPAIFDLSPQQKEWQ---- 124
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ LQR RL+ E S +D + ++ AL P D
Sbjct: 125 -----RMLQRIQSRLQ-----------EEHSLQDVI-----FKSTFKSASTAL--PPRED 161
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
+ + C I+G L VNKVAGNFH GK+ H H DS+
Sbjct: 162 DSS----------QPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSN 302
N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T +
Sbjct: 212 NFSHRIDHLSFGELVPGIINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADT---H 268
Query: 303 QFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F+
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328
Query: 362 SGII 365
+G++
Sbjct: 329 TGML 332
>gi|388493200|gb|AFK34666.1| unknown [Medicago truncatula]
Length = 106
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 75/107 (70%), Positives = 92/107 (85%), Gaps = 2/107 (1%)
Query: 279 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFT 338
M QYFIKVVPTVYTD+ G I SNQ+SVTEHF+SSE G +PGVFFFYD+SPIKV F
Sbjct: 1 MCQYFIKVVPTVYTDIRGRVIHSNQYSVTEHFKSSELG--AAVPGVFFFYDISPIKVNFK 58
Query: 339 EEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
EEH+ FLHFLTN+CAI+GG+FT++GI+D+ IY+GQ+ IKKK+EIGK+
Sbjct: 59 EEHIPFLHFLTNICAIIGGIFTIAGIVDSSIYYGQKTIKKKMEIGKY 105
>gi|57106442|ref|XP_534852.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 isoform 1 [Canis lupus familiaris]
Length = 377
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 173/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPAIFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I G L VNKVAGNFH GK+ H H D
Sbjct: 160 EDDSS----------QPPDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGEVVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|21312962|ref|NP_080444.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
isoform 1 [Mus musculus]
gi|81903633|sp|Q9CR89.1|ERGI2_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|12835992|dbj|BAB23451.1| unnamed protein product [Mus musculus]
gi|12843481|dbj|BAB25998.1| unnamed protein product [Mus musculus]
gi|12844310|dbj|BAB26318.1| unnamed protein product [Mus musculus]
gi|13905198|gb|AAH06895.1| ERGIC and golgi 2 [Mus musculus]
gi|17390417|gb|AAH18188.1| ERGIC and golgi 2 [Mus musculus]
gi|20072972|gb|AAH26558.1| ERGIC and golgi 2 [Mus musculus]
gi|26326029|dbj|BAC26758.1| unnamed protein product [Mus musculus]
gi|40353061|gb|AAH64749.1| ERGIC and golgi 2 [Mus musculus]
gi|74191314|dbj|BAE39481.1| unnamed protein product [Mus musculus]
gi|148678796|gb|EDL10743.1| ERGIC and golgi 2, isoform CRA_c [Mus musculus]
Length = 377
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 174/367 (47%), Gaps = 50/367 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FSSKLRINIDITV-AMKCHYVGADVLDLA--------------------ETMVASADGLA 108
Query: 125 -APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
P + P QR R+ S E S +D + A++ AL P
Sbjct: 109 YEPALFDLSPQQREWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PP 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
D + C I+G L VNKVAGNFH GK+ H H
Sbjct: 159 REDDSSLTP----------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNH 208
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTI 299
DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 209 DSYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT- 267
Query: 300 QSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C I+GG+
Sbjct: 268 --HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGI 325
Query: 359 FTVSGII 365
F+ +G++
Sbjct: 326 FSTTGML 332
>gi|12846043|dbj|BAB27008.1| unnamed protein product [Mus musculus]
Length = 377
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 174/367 (47%), Gaps = 50/367 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FSSKLRINIDITV-AMKCHYVGADVLDLA--------------------ETMVASADGLA 108
Query: 125 -APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
P + P QR R+ S E S +D + A++ AL P
Sbjct: 109 YEPALFDLSPQQREWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PP 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
D + C I+G L VNKVAGNFH GK+ H H
Sbjct: 159 REDDSSLTP----------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNH 208
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTI 299
DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 209 DSYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT- 267
Query: 300 QSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C I+GG+
Sbjct: 268 --HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGI 325
Query: 359 FTVSGII 365
F+ +G++
Sbjct: 326 FSTTGML 332
>gi|12841082|dbj|BAB25070.1| unnamed protein product [Mus musculus]
Length = 377
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 174/367 (47%), Gaps = 50/367 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FSSKLRINIDITV-AMKCHYVGADVLDLA--------------------ETMVASADGLA 108
Query: 125 -APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
P + P QR R+ S E S +D + A++ AL P
Sbjct: 109 YEPALFDLSPQQREWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PP 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
D + C I+G L VNKVAGNFH GK+ H H
Sbjct: 159 REDDSSLTP----------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNH 208
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTI 299
DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 209 DSYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT- 267
Query: 300 QSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C I+GG+
Sbjct: 268 --HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGI 325
Query: 359 FTVSGII 365
F+ +G++
Sbjct: 326 FSTTGML 332
>gi|149048933|gb|EDM01387.1| rCG29652, isoform CRA_c [Rattus norvegicus]
Length = 377
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 174/367 (47%), Gaps = 50/367 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FSSKLRINIDITV-AMKCHYVGADVLDLA--------------------ETMVASADGLA 108
Query: 125 -APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
P + P QR R+ S E S +D + A++ AL P
Sbjct: 109 YEPALFDLSPQQREWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSSSTAL--PP 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
D + C I+G L VNKVAGNFH GK+ H H
Sbjct: 159 REDDSSLTP----------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNH 208
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTI 299
DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 209 DSYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT- 267
Query: 300 QSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C I+GG+
Sbjct: 268 --HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGI 325
Query: 359 FTVSGII 365
F+ +G++
Sbjct: 326 FSTTGML 332
>gi|395839293|ref|XP_003792530.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Otolemur garnettii]
Length = 377
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 173/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPAIFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKTASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I G L VNKVAGNFH GK+ H H D
Sbjct: 160 EDN----------PSQSPDACRISGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPGIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|148224086|ref|NP_001087666.1| ERGIC and golgi 2 [Xenopus laevis]
gi|51950053|gb|AAH82468.1| MGC81917 protein [Xenopus laevis]
Length = 377
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 117/373 (31%), Positives = 175/373 (46%), Gaps = 62/373 (16%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + S G ++L++ +M +L E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASRGTVSLMAFSIMGILTIMEFLVYRDTRMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMD-----ISGEQHLDVKHDIFKKRLDSQGNVIESR 119
+R+N D+T A+ C + D +D ++ Q L + IF L Q R
Sbjct: 70 FTSKIRLNIDITV-AMKCQYVGADVLDLAETMVTSAQGLAYQPVIFD--LSPQ-----QR 121
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
Q + LQ+ GRL+ E S +D + A R LS
Sbjct: 122 Q-------WQRMLQQIQGRLQE-----------EHSLQDLL-----FKSAMRTS--VLSL 156
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
P D E+ C I+G L++NKVAGNFH GK+ H H
Sbjct: 157 PPREDS----------PMEQPNACRIHGHLDINKVAGNFHITVGKAIPHPRGHAHLAALV 206
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT------VYTD 293
DS+N SH+I+ +FGE P ++NPLDG E + MYQYFI +VPT VY D
Sbjct: 207 SHDSYNFSHRIDHFSFGEPLPAIINPLDGTEKIAEDSNQMYQYFITIVPTKLNTNKVYCD 266
Query: 294 VSGHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
++QFSVTE R + G+F YD+S + VT TE+H+ FL +C
Sbjct: 267 -------THQFSVTERERVINHATGSHGVSGIFMKYDISSLMVTVTEDHMPLWKFLVRLC 319
Query: 353 AIVGGVFTVSGII 365
I+GG+FT +G+I
Sbjct: 320 GIIGGIFTTTGMI 332
>gi|449278843|gb|EMC86582.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Columba livia]
Length = 377
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 176/369 (47%), Gaps = 48/369 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LDA+PK+ E + + +GG ++L++ + L E +Y + + + VD
Sbjct: 10 LTLMKELDAFPKVPESYVETSATGGTVSLIAFTTIAFLTIMEFTVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S I
Sbjct: 70 FTSKLRINIDITV-AMRCQYVGADVLDLAE-------------------TMVASADALIY 109
Query: 125 APKIDK--PLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + + P Q+ R+ S E S +D + A++ AL
Sbjct: 110 EPVVFELSPQQKEWQRMLQ---VIQSRLQEEHSLQDVI-----FKSAFKSASTALPP--- 158
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+ + LQ + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 159 ----REDNSLQ-----SPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHE 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET-- 267
Query: 301 SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YD+S + VT TEEH+ F FL +C I+GG+F
Sbjct: 268 -HQFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIF 326
Query: 360 TVSGIIDAF 368
+ +GI+ F
Sbjct: 327 STTGILHGF 335
>gi|115497448|ref|NP_001069031.1| endoplasmic reticulum-Golgi intermediate compartment protein 2 [Bos
taurus]
gi|113912114|gb|AAI22616.1| ERGIC and golgi 2 [Bos taurus]
gi|296487341|tpg|DAA29454.1| TPA: PTX1 protein [Bos taurus]
Length = 377
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 170/366 (46%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P QR R+ S E S +D + ++ AL P
Sbjct: 110 EPAIFDLSPQQREWQRMLQ---LFQSRLQEEHSLQDVV-----FKSVFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I G L VNKVAGNFH GK+ H H D
Sbjct: 160 EDDSS----------QPPDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI +VP T + + I ++
Sbjct: 210 SYNFSHRIDHLSFGELVPGIINPLDGTEKIALDHNQMFQYFITIVP---TKLQTYKISAD 266
Query: 303 --QFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
QF+VTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 267 THQFAVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|291232448|ref|XP_002736170.1| PREDICTED: MGC81917 protein-like [Saccoglossus kowalevskii]
Length = 395
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 109/379 (28%), Positives = 176/379 (46%), Gaps = 39/379 (10%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ LDA+PKI E++ T +GG +++++ ++ +L SE++ Y + + VDT
Sbjct: 13 VKELDAFPKIPENYQETTATGGTVSILTFSLIAILVISEIQYYSETTMKYEYEVDTDLTS 72
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
LR+N D+T A+ C + D +D++ G+ + + +
Sbjct: 73 KLRLNIDITV-AMKCDYIGADVLDMT-------------------GDTVSASFGSLKEQA 112
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ L R + + S E + +D G S P+ D K
Sbjct: 113 VHFELSRRQKQWQKKLQAVRSALANEHAIQDLLFKVGF-------DGSPTSMPERED--K 163
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
G C I+G + +NKVAGNFH GKS H H + +N S
Sbjct: 164 PAG--------APNSCRIHGSMSLNKVAGNFHITLGKSIPHPRGHAHLAAFISQSQYNFS 215
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
H+I+ +FG PG+VNPLDG + + + MYQYFI++VPT + + ++Q++VT
Sbjct: 216 HRIDHFSFGVPTPGIVNPLDGDQRVTQENARMYQYFIQIVPT-RVNTRRASADTHQYAVT 274
Query: 308 EHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E R S + G+FF YDLS + V TEE+ + FL +C I+GGVF SG++
Sbjct: 275 ERDRVISHSSGSHGVAGIFFKYDLSSVSVKVTEEYQPYWQFLVRLCGIIGGVFATSGMLH 334
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I I K + GK+
Sbjct: 335 SLIGCLYDLICCKYQFGKY 353
>gi|75075986|sp|Q4R5C3.1|ERGI2_MACFA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|67970720|dbj|BAE01702.1| unnamed protein product [Macaca fascicularis]
Length = 377
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 173/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPAVFDLSPQQKEWQRMLQ---LTQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 160 EDDSS----------QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHE 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|380787459|gb|AFE65605.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Macaca mulatta]
gi|383418929|gb|AFH32678.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Macaca mulatta]
gi|384941148|gb|AFI34179.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Macaca mulatta]
Length = 377
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 173/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPAVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 160 EDDSS----------QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHE 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|332233018|ref|XP_003265701.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 isoform 1 [Nomascus leucogenys]
Length = 377
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 173/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPTVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 160 EDDSS----------QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHE 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|255563175|ref|XP_002522591.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
gi|223538182|gb|EEF39792.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
Length = 191
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 84/190 (44%), Positives = 119/190 (62%), Gaps = 9/190 (4%)
Query: 192 LQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 249
++++K+ GEGC +YG L+V +VAGNFH S H + V ++ N+SH
Sbjct: 2 IKKVKQALANGEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAIHVNVSHI 57
Query: 250 INKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 309
I+ L+FG FPG+ NPLDG SG ++Y+IK+VPT Y +S + +NQFSVTE+
Sbjct: 58 IHDLSFGPKFPGLHNPLDGTARILHDASGTFKYYIKIVPTEYRYISKEVLPTNQFSVTEY 117
Query: 310 FRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 368
F SE R T P V+F YDLSPI VT EE SFLHF+T +CA++GG F ++G++D +
Sbjct: 118 FSPMSEYDR--TWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFALTGMLDRW 175
Query: 369 IYHGQRAIKK 378
+Y A+ K
Sbjct: 176 MYRLLEAVTK 185
>gi|397517363|ref|XP_003828883.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Pan paniscus]
gi|410259224|gb|JAA17578.1| ERGIC and golgi 2 [Pan troglodytes]
gi|410298004|gb|JAA27602.1| ERGIC and golgi 2 [Pan troglodytes]
gi|410334949|gb|JAA36421.1| ERGIC and golgi 2 [Pan troglodytes]
gi|410334951|gb|JAA36422.1| ERGIC and golgi 2 [Pan troglodytes]
Length = 377
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 173/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPTVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 160 EDDSS----------QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHE 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|326672443|ref|XP_003199668.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Danio rerio]
Length = 365
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 177/370 (47%), Gaps = 55/370 (14%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I++LDA+PK+ E + + + GG +TL I+M LL SE +Y + + + VD
Sbjct: 15 IKNLDAFPKVPESYVATSAFGGTVTLTVFILMALLTISEFFVYQDTWMKYEYEVDRDFTS 74
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L+I D+T A+ C L D +DI+ G V+ S++ +
Sbjct: 75 KLKIKIDITV-AMKCERLGADVLDIA-------------------GAVVASKEIKYDSVS 114
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
D Q+ + ++++ R++ S D++ +
Sbjct: 115 FDPSAQK----------------------KQWYQILQQIQNRLREEH---SLQDVLFKSA 149
Query: 188 REGFLQ----RI--KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
+G+ R+ E C I+G + VNKVAGNFH GK H H +
Sbjct: 150 LKGYFSDPAPRVDPTPESQNACRIHGKIYVNKVAGNFHITLGKPIETHKGHAHYASFIKD 209
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
+ +N SH+I+ L+FG PG +NPLDG+ T + ++QYFI VVPT S ++
Sbjct: 210 EVYNFSHRIDHLSFGNDVPGHINPLDGMEKTTLEQNTLFQYFITVVPT-KLHTSNVSVDM 268
Query: 302 NQFSVTEHFR--SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R S+E+G Q + G+FF Y LSP+ V +EEH+ FL +C IVGG+F
Sbjct: 269 HQFSVTERERVVSNEKGN-QGVSGIFFKYKLSPLMVRVSEEHMPLAAFLVRLCGIVGGIF 327
Query: 360 TVSGIIDAFI 369
+ S ++ I
Sbjct: 328 STSDLLHRLI 337
>gi|50959176|ref|NP_057654.2| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Homo sapiens]
gi|108935982|sp|Q96RQ1.2|ERGI2_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|22760017|dbj|BAC11037.1| unnamed protein product [Homo sapiens]
gi|38173702|gb|AAH00887.2| ERGIC and golgi 2 [Homo sapiens]
gi|78070782|gb|AAI07795.1| ERGIC and golgi 2 [Homo sapiens]
gi|119616998|gb|EAW96592.1| ERGIC and golgi 2, isoform CRA_a [Homo sapiens]
gi|119617000|gb|EAW96594.1| ERGIC and golgi 2, isoform CRA_a [Homo sapiens]
gi|167773797|gb|ABZ92333.1| ERGIC and golgi 2 [synthetic construct]
Length = 377
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 172/366 (46%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPTVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSTSTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 160 EDDSS----------QSPNACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHE 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|403269250|ref|XP_003926667.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Saimiri boliviensis boliviensis]
Length = 377
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 172/366 (46%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LD +PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDVFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPTVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I+G L VNKVAGNFH GK+ H H D
Sbjct: 160 EDD----------SSQSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSYGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|432943284|ref|XP_004083140.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oryzias latipes]
Length = 372
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 111/366 (30%), Positives = 168/366 (45%), Gaps = 49/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+++ + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVSDSYVETSTSGGTVSLIAFSTMALLSVLEFFVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN DVT A+ C + D +D++ I L + V E
Sbjct: 70 FSSKLRINVDVTV-AMRCQHVGADILDLAETM-------ITSGGLQYEPVVFEL------ 115
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
P QR RL +EV KG + P
Sbjct: 116 -----TPKQREWQRLREEHA------------------LQEVLYKSLLKGAPTALPP--- 149
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
+ F+Q + C I+G + VNKVAGN H GK H H H +S+
Sbjct: 150 --RDAVFMQ-----SPDACRIHGDIYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHESY 202
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N SH+I++L FGE PG++NPLDG + MYQYFI VVPT T ++QF
Sbjct: 203 NFSHRIDRLCFGEEIPGIINPLDGTEKITYDNNQMYQYFITVVPTKLKTYKI-TADTHQF 261
Query: 305 SVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
SVTE R + + G+FF YD S + VT +E+H+ FL +C I+GG+++ +G
Sbjct: 262 SVTERERVINHTAGSHGVSGIFFKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIYSTTG 321
Query: 364 IIDAFI 369
++ + I
Sbjct: 322 MLHSLI 327
>gi|62897157|dbj|BAD96519.1| CDA14 variant [Homo sapiens]
Length = 377
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 172/366 (46%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPTVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSTSTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 160 EDD----------SSQSPNACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHE 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|443716796|gb|ELU08142.1| hypothetical protein CAPTEDRAFT_19918 [Capitella teleta]
Length = 403
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 114/364 (31%), Positives = 168/364 (46%), Gaps = 48/364 (13%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LDA+PK+ E + + SGG I+++ ++ +L SE+R Y + VD
Sbjct: 12 VRELDAFPKVPEGYQECSASGGSISILVLVLSAILIISEIRYYTATEFKYDYEVDKHFEG 71
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L IN D+T A+ C + D +DI+G+ + S G + E +P
Sbjct: 72 KLSINIDITV-AMKCHQVGADVLDITGQN------------VASFGKLTEEEVHFELSPN 118
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
K L+ + +E N + + + G+ L
Sbjct: 119 QRKHLK-----------------SMSAINEYIRNEYHSIHKFLWRSGFG---GYLAQMPP 158
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS-GVHVHDILAFQRDSFNI 246
RE Q K GC YG L+VNKVAGNFH GKS + G H H + + +N
Sbjct: 159 REDHPQTPKN----GCRFYGTLDVNKVAGNFHITAGKSVPLNIGGHAHMAMMVKESDYNF 214
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVP----TVYTDVSGHTIQSN 302
+H+I +FG+ G +NPLDG MYQYFI+VVP T++TD I +
Sbjct: 215 THRIEHFSFGDKVSGRINPLDGEEKNTNDNYHMYQYFIQVVPTHVKTLFTD-----INTY 269
Query: 303 QFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
QFSVTE R+ G+ +PG+F YDL+P+ V E H F L +C I+GG+F
Sbjct: 270 QFSVTEQNRTISHGKGSHGIPGIFVKYDLAPMMVKVIESHKPFSQLLIRLCGIIGGLFAT 329
Query: 362 SGII 365
SG++
Sbjct: 330 SGML 333
>gi|15010925|gb|AAK77355.1|AF302767_1 PTX1 protein [Homo sapiens]
Length = 377
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 173/366 (47%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL + +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMKFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPTVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSTSTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 160 EDD----------SSQSPNACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHE 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S +T
Sbjct: 210 SYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISAYT-- 267
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 268 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 326
Query: 360 TVSGII 365
+ +G++
Sbjct: 327 STTGML 332
>gi|66500700|ref|XP_395190.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 1 [Apis mellifera]
Length = 389
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/367 (29%), Positives = 174/367 (47%), Gaps = 41/367 (11%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LDA+PK+ E + +T GG ++ + + L +E YL++ + K DT
Sbjct: 9 IKTVKELDAFPKVPEPYVDKTAVGGTFSIFTICTIAYLIIAETSYYLDSRLQFKFETDTD 68
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
L+IN D+T A+PC + D +D S Q++ V H+ L+ + E Q+
Sbjct: 69 IDAKLKINIDITV-AMPCGRIGADVLD-STNQNM-VGHE----SLEQEDTWWELTQEQ-- 119
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ H L+H +Y Y A N E ++ + P+
Sbjct: 120 --------RSHFEALKHTNSYLREEYHAIHELLWKSNQVTLYSEMPKRTHQPIYAPN--- 168
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
C I+G L VNKVAGNFH GKS H+H +
Sbjct: 169 -----------------ACRIHGSLNVNKVAGNFHITAGKSLSIPKGHIHISAFMTEKDY 211
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQ 303
N +H+INK +FG PG+V+PL+G + +YQYF++VVPT + T +S T ++ Q
Sbjct: 212 NFTHRINKFSFGGPSPGIVHPLEGDEKIADNNMLLYQYFVEVVPTDIQTLLS--TSKTYQ 269
Query: 304 FSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
+SV +H R + Q PG+FF YD+S +K+ T++ + FL +CA VGG+F S
Sbjct: 270 YSVKDHQRPINHQKGSHGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTS 329
Query: 363 GIIDAFI 369
G++ +
Sbjct: 330 GLVKNIV 336
>gi|417399168|gb|JAA46612.1| Putative endoplasmic reticulum-golgi intermediate compartment
protein 2 isoform 1 [Desmodus rotundus]
Length = 337
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 116/373 (31%), Positives = 172/373 (46%), Gaps = 54/373 (14%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D LD ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADV-------------------LDLAETMVASANGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P Q+ R+ + E S +D + A++ + + P
Sbjct: 110 EPVIFDLSPQQKEWQRMLQ---LIQTRLQEEHSLQDVL-----FKSAFKS---STALPPR 158
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I+G L VNKVAGNFH GK+ H H D
Sbjct: 159 EDDS----------SQPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 208
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ-- 300
S+N SH+I+ L+FGE PG+VNPLDG + M+QYFI VVPT HT +
Sbjct: 209 SYNFSHRIDHLSFGELVPGIVNPLDGTEKIAVDHNRMFQYFITVVPT-----KLHTYKIS 263
Query: 301 --SNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++QFSVTE R + G+F YDLS + VT TEEH+ F F +C IVGG
Sbjct: 264 ADTHQFSVTERERVVNHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGG 323
Query: 358 VFTVSGIIDAFIY 370
+F+ +G D+F++
Sbjct: 324 IFSTTG-KDSFLF 335
>gi|380016475|ref|XP_003692209.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Apis florea]
Length = 392
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 173/364 (47%), Gaps = 41/364 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ LDA+PK+ E + +T GG ++ + + L +E YL++ + K DT
Sbjct: 12 VKELDAFPKVPEPYVDKTAVGGTFSIFTICTIAYLIIAETSYYLDSRLQFKFETDTDIDA 71
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L+IN D+T A+PC + D +D S Q++ V H+ L+ + E Q+
Sbjct: 72 KLKINIDITV-AMPCGRIGADVLD-STNQNM-VGHE----SLEQEDTWWELTQEQ----- 119
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ H L+H +Y Y A N E ++ + P+
Sbjct: 120 -----RSHFEALKHTNSYLREEYHAIHELLWKSNQVTLYSEMPKRTHQPIYAPN------ 168
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
C I+G L VNKVAGNFH GKS H+H +N +
Sbjct: 169 --------------ACRIHGSLNVNKVAGNFHITAGKSLSIPKGHIHISAFMTEKDYNFT 214
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSV 306
H+INK +FG PG+V+PL+G + +YQYF++VVPT + T +S T ++ Q+SV
Sbjct: 215 HRINKFSFGGPSPGIVHPLEGDEKIADNNMLLYQYFVEVVPTDIQTLLS--TSKTYQYSV 272
Query: 307 TEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+H R + Q PG+FF YD+S +K+ T++ + FL +CA VGG+F SG++
Sbjct: 273 KDHQRPINHQKGSHGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGLV 332
Query: 366 DAFI 369
+
Sbjct: 333 KNIV 336
>gi|417399911|gb|JAA46936.1| Putative endoplasmic reticulum-golgi intermediate compartment
protein 2 isoform 1 [Desmodus rotundus]
Length = 376
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 171/366 (46%), Gaps = 49/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D LD ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADV-------------------LDLAETMVASANGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P Q+ R+ + E S +D + A+ K AL P
Sbjct: 110 EPVIFDLSPQQKEWQRMLQ---LIQTRLQEEHSLQDVL-----FKSAF-KSSTAL--PPR 158
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I+G L VNKVAGNFH GK+ H H D
Sbjct: 159 EDDSS----------QPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHD 208
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG+VNPLDG + M+QYFI VVPT ++T +S T
Sbjct: 209 SYNFSHRIDHLSFGELVPGIVNPLDGTEKIAVDHNRMFQYFITVVPTKLHTYKISADT-- 266
Query: 301 SNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 267 -HQFSVTERERVVNHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 325
Query: 360 TVSGII 365
+ +G++
Sbjct: 326 STTGML 331
>gi|395744111|ref|XP_003780425.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Pongo abelii]
Length = 387
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 113/367 (30%), Positives = 174/367 (47%), Gaps = 49/367 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 19 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 78
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 79 FSSKLRINIDITV-AMKCQCIGADVLDLAE-------------------TMVASADGLVY 118
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 119 EPTVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 168
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 169 EDDSS----------QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHE 218
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPT-VYT-DVSGHTI 299
S+N SH+I+ L+FGE P ++NPLDG + + M+QYFI VVPT ++T +S T
Sbjct: 219 SYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDRKHQMFQYFITVVPTKLHTYKISADT- 277
Query: 300 QSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+
Sbjct: 278 --HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGI 335
Query: 359 FTVSGII 365
F+ +G++
Sbjct: 336 FSTTGML 342
>gi|307188057|gb|EFN72889.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Camponotus floridanus]
Length = 386
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 109/365 (29%), Positives = 179/365 (49%), Gaps = 43/365 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ LDA+PK+ E + +T GG ++ + +++ L +E +L++ + K DT
Sbjct: 12 VKELDAFPKVPELYVDKTAVGGTFSIFTMLIIAYLIIAETSYFLDSRLQFKFEPDTEIDA 71
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L+IN D+T A+PC + D +D S Q++ + +D L+ + E Q+
Sbjct: 72 KLQINIDITV-AMPCGRIGADVLD-STNQNM-ISYDT----LEEEDTWWELTQEQ----- 119
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ H L+H +Y +RE Y L + I
Sbjct: 120 -----RAHFEALKHMNSY--------------------LREEYHAIHELLWKSNQITLYS 154
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNI 246
+ C I+G L VNKVAGNFH GKS H+H I A+ D +N
Sbjct: 155 EMPMRSHKPDYATNACRIHGSLVVNKVAGNFHITAGKSLSLPRGHIH-ISAYMTDQDYNF 213
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFS 305
+H+IN+ +FG PG+V+PL+G + +YQYF++VVPT + T +S T ++ Q+S
Sbjct: 214 THRINRFSFGGPSPGIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRTLLS--TSKTYQYS 271
Query: 306 VTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
V +H R + + +PG+FF YD+S +K+ T+E + FL +CA VGG+F SG+
Sbjct: 272 VKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQFLVKLCATVGGIFVTSGL 331
Query: 365 IDAFI 369
+ +
Sbjct: 332 VKNIV 336
>gi|41055383|ref|NP_956701.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Danio rerio]
gi|82188148|sp|Q7T2D4.1|ERGI2_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|32451749|gb|AAH54593.1| ERGIC and golgi 2 [Danio rerio]
gi|182890474|gb|AAI64472.1| Ergic2 protein [Danio rerio]
Length = 376
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 113/372 (30%), Positives = 171/372 (45%), Gaps = 53/372 (14%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N +R LDA+PK+ E + T SGG ++L++ M LL F E +Y + + + VD
Sbjct: 10 LNFVRELDAFPKVPESYVETTASGGTVSLLAFTAMALLAFFEFFVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D + D+ + + S G V E +
Sbjct: 70 FTSKLRINIDITV-AMRCQFVGADVL------------DLAETMVASDGLVYEPVVFDLS 116
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
P QR L H E ++ ++V KG + P D
Sbjct: 117 ------PQQR----LWHRTLLLIQGRLREE------HSLQDVLFKNVMKGSPTALPPRED 160
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
+ C I+G L VNKVAGNFH GK+ H H +++
Sbjct: 161 D----------PNQPLNACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHETY 210
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT------VYTDVSGHT 298
N SH+I+ L+FGE PG++NPLDG + M+QYFI +VPT VY D
Sbjct: 211 NFSHRIDHLSFGEEIPGILNPLDGTEKVSADHNQMFQYFITIVPTKLQTYKVYAD----- 265
Query: 299 IQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
++Q+SVTE R + + G+F YD+S + V TE+H+ F FL +C I+GG
Sbjct: 266 --THQYSVTERERVINHAAGSHGVSGIFMKYDISSLMVKVTEQHMPFWQFLVRLCGIIGG 323
Query: 358 VFTVSGIIDAFI 369
+F+ +G++ +
Sbjct: 324 IFSTTGMLHNLV 335
>gi|225717192|gb|ACO14442.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Esox lucius]
Length = 379
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 170/367 (46%), Gaps = 43/367 (11%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + T SGG ++L++ M LL F E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETTASGGTVSLIAFTAMALLAFFEFFVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D + D+ + + S G E G+
Sbjct: 70 FSSKLRINIDITV-AMKCQHVGADIL------------DLAETMITSNGIQYEPVVFGL- 115
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
P Q+ L H E ++ +EV KG + P
Sbjct: 116 -----TPEQK----LWHRTLLLIQNRLREE------HSLQEVLYKSVLKGAPTALPP--- 157
Query: 185 QCKREGFLQRIKEEEGEG-CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
+ + E G C I+G + VNKVAGNFH GK H H H D+
Sbjct: 158 --------REVATSEPLGACRIHGHVYVNKVAGNFHITVGKPIHHPRGHAHIAAFVSHDT 209
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
+N SH+I+ +FGE PG++NPLDG + M+ YFI VVPT S + ++Q
Sbjct: 210 YNFSHRIDHFSFGEEIPGIINPLDGTEKVTTNNNHMFLYFITVVPT-KLHTSKVSADTHQ 268
Query: 304 FSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
FSVTE R + + G+F YD S + VT +E+H+ FL +C I+GG+F+ +
Sbjct: 269 FSVTERERVINHAAGSHGVSGIFMKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIFSTT 328
Query: 363 GIIDAFI 369
G+I F+
Sbjct: 329 GMIHGFV 335
>gi|388858415|emb|CCF48009.1| uncharacterized protein [Ustilago hordei]
Length = 415
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 170/359 (47%), Gaps = 42/359 (11%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ KIR DA+PK + R+ GG++T++S++ +L L ++EL YL VD+
Sbjct: 11 LPKIRQFDAFPKTQSIYTQRSSKGGLLTIISTVTLLFLLWTELSSYLYGERAYSFAVDSQ 70
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
T++IN D+T A+ C L++D D G++ L V F K G + IG
Sbjct: 71 LSSTMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDSEFTK----DGTTFD-----IG 119
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
H RL+ + E S + N + + YRKK N
Sbjct: 120 ----------HADRLD-------AMPREELSVQKTINQARK-KPLYRKKP---KNKKFSR 158
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
Q + +G C IYG +EV +V GN H + S H L
Sbjct: 159 QVAFHKTAHIV--PDGPACRIYGSMEVKRVTGNLHITTLGHGYLSLEHTDHKL------M 210
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N+SH I++ +FG +FP + PLD T + ++QYFI VPT++ D G + ++Q+
Sbjct: 211 NLSHVIHEFSFGPYFPEISQPLDSSVETTDKHFTVFQYFISAVPTLFVDARGRKLHTHQY 270
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
SVT++ R E G+ +PG+F YD+ PI++T E +F+ FL + ++GGV+ G
Sbjct: 271 SVTDYTRQIEHGK--GVPGIFIKYDIEPIQMTIRERSSTFVQFLVRLAGVLGGVWVCVG 327
>gi|390337315|ref|XP_792272.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 2 [Strongylocentrotus purpuratus]
gi|390337317|ref|XP_003724529.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 1 [Strongylocentrotus purpuratus]
Length = 388
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 109/367 (29%), Positives = 176/367 (47%), Gaps = 47/367 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ LDA+PKI ED+ T +GG +++V+ IV+ L SE YL++ + VDT
Sbjct: 13 VKELDAFPKIPEDYVKTTSTGGTVSIVTFIVIAGLVISEFMYYLDSRMKYGYDVDTDFNT 72
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L+IN D+T A+ C + D +D +G+ + F +L + E
Sbjct: 73 KLQINIDITV-AMKCDYIGADVLDSAGDSAMFK----FSGKLKEEPTSFEM--------- 118
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWA---LSNPDLID 184
P QR S + + + +++ + G++ + P +D
Sbjct: 119 --TPQQR-------------SWHKTLQTVRKALSEEHAIQDLLFQTGFSSKPTNQPQRVD 163
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
K+ + C ++G L NKVAGNFH GKS H H L +++
Sbjct: 164 SGKKL-----------DACRLHGSLTTNKVAGNFHVTIGKSIPHPRGHAHLALMIDPNNY 212
Query: 245 NISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
N SH+I+ ++G PG+VNPLDG ++ T E+ +YQYFI++VPT ++Q
Sbjct: 213 NFSHRIDHFSYGTPVPGIVNPLDGDLKVTNESLQ-IYQYFIQIVPT-KVKTRAAKAHTHQ 270
Query: 304 FSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
++VTE R G + G+FF Y+LS + ++ E + F L +C IVGGVF S
Sbjct: 271 YAVTERERVINHGAGSHGVTGIFFKYELSSLVISVEEVYDPFWKLLVRLCGIVGGVFATS 330
Query: 363 GIIDAFI 369
GII++ +
Sbjct: 331 GIINSLM 337
>gi|410918691|ref|XP_003972818.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Takifugu rubripes]
Length = 378
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 171/370 (46%), Gaps = 49/370 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK++E + + +GG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSSLKELDAFPKVSESYVETSATGGTVSLIAFSSMALLAVLEFFVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+RIN D+T A+ C + D +D++ + + S G E
Sbjct: 70 FSSKMRINIDITV-AMKCQHVGADILDLA------------ETMITSNGLQYE------- 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P QR R S +S ++ + +EV KG + P
Sbjct: 110 -PTIFDLTPQQRLWQR---------SLLLVQSRIKEE-HALQEVLYKTLLKGGPTALPPR 158
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D E C IYG + VNKVAGN H GK H H H +
Sbjct: 159 KDAAM----------EPHNACRIYGHIYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHE 208
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQ 300
++N SH+I+ L+FGE G++NPLDG + MYQYFI VVPT V VS T
Sbjct: 209 TYNFSHRIDHLSFGEEITGIINPLDGTEKITSKHTQMYQYFITVVPTRLVTHKVSADT-- 266
Query: 301 SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YD S + VT TE+H+ FL +C IVGG+F
Sbjct: 267 -HQFSVTERERVINHAAGSHGVSGIFVKYDTSSLTVTVTEQHMPLWQFLVRLCGIVGGIF 325
Query: 360 TVSGIIDAFI 369
+ +G++ +
Sbjct: 326 STTGMLHGLV 335
>gi|198422133|ref|XP_002131157.1| PREDICTED: similar to ptx1 [Ciona intestinalis]
Length = 391
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 110/371 (29%), Positives = 176/371 (47%), Gaps = 47/371 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++SLDA+PK+ E + GG ITL+++ V+ L SE+ Y N VD
Sbjct: 12 LSNVKSLDAFPKVPELCIETSTRGGTITLITTAVITFLVLSEIIYYFNVTFRYDYQVDVD 71
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+ +NFD+T A PC+++ D +D++G+ +F+ + + RQ
Sbjct: 72 FDSKVWLNFDITV-ATPCTLIGADVLDVTGQA------TVFENEVYEELTFF--RQSNTA 122
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
A + K L R L E N +++ E + + NP+L+
Sbjct: 123 AAQ-RKALLRMKEELLTPE------------------NGKKMSEITLQSNF---NPNLM- 159
Query: 185 QCKREGFLQRIKEEEG---EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
F R + G + C YG L +NKVAGNFH GK G H H + F
Sbjct: 160 ------FKNRKLDNVGIKMDACRFYGNLPLNKVAGNFHIVAGKPIQMFGGHAHLSMMFSP 213
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
+N SH+I+ +FG G +N LDG + S ++QY++ VV T ++ I +
Sbjct: 214 IPYNFSHRIDHFSFGNMKTGFINALDGDERVTSSESYIFQYYLDVVS---TKINSRRITT 270
Query: 302 N--QFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+ QFSV+E R+ + PGVFF Y+ SP+ V TE+ + F L +C+IVGG+
Sbjct: 271 DTFQFSVSEQSRALDHASGSHGQPGVFFKYNFSPLSVMITEQKMPFYRLLVRLCSIVGGI 330
Query: 359 FTVSGIIDAFI 369
F S +++A +
Sbjct: 331 FATSHVLNALL 341
>gi|343427702|emb|CBQ71229.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 412
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 168/359 (46%), Gaps = 42/359 (11%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ KIR DA+PK + R+ GG++T+VS++ +L L ++EL YL VD
Sbjct: 11 LPKIRQFDAFPKTQSIYTQRSSKGGILTIVSTVTLLALLWTELSSYLYGERGYSFAVDQQ 70
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
T++IN D+T A+ C L++D D G++ L V F K G E IG
Sbjct: 71 LQSTMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDSEFTK----DGTTFE-----IG 119
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
H RL+ + E S + N + + YRKK N
Sbjct: 120 ----------HADRLD-------AMPREEVSVQKTINQARK-KPLYRKKP---KNKKFSR 158
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
Q + +G C IYG +EV +V GN H + S H L
Sbjct: 159 QVAFHKTAHVV--PDGPACRIYGSMEVKRVTGNLHITTLGHGYLSMEHTDHKL------M 210
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N+SH I++ +FG +FP + PLD T + ++QYF+ VPT++ D G + ++Q+
Sbjct: 211 NLSHVIHEFSFGPYFPEISQPLDSSVETTDKHFTVFQYFVSAVPTLFVDARGRKLHTHQY 270
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
SVT++ R E G+ +PG+F YD+ P+++T E + L FL + ++GGV+ G
Sbjct: 271 SVTDYTRQIEHGK--GVPGIFIKYDIEPLQMTIRERSTTLLQFLVRLAGVLGGVWVCVG 327
>gi|12857352|dbj|BAB30984.1| unnamed protein product [Mus musculus]
Length = 377
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 114/369 (30%), Positives = 173/369 (46%), Gaps = 54/369 (14%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 70 FSSKLRINIDITV-AMKCHYVGADVLDLA--------------------ETMVASADGLA 108
Query: 125 -APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
P + P QR R+ S E S +D + A++ AL P
Sbjct: 109 YEPALFDLSPQQREWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PP 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
D + C I+G L VNKVAGNFH GK+ H H
Sbjct: 159 REDDSSLTP----------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNH 208
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTI 299
DS+N SH+I+ +FGE PG++NPLDG + M+QYFI V+PT ++T +S T
Sbjct: 209 DSYNFSHRIDHCSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVMPTKLHTYKISADT- 267
Query: 300 QSNQFSVTEHFRSS---EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+QFSVTE R S + G+F YDLS + VT TEEH+ F F +C I+G
Sbjct: 268 --HQFSVTE--RESIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIG 323
Query: 357 GVFTVSGII 365
G+F+ +G++
Sbjct: 324 GIFSTTGML 332
>gi|383865060|ref|XP_003707993.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Megachile rotundata]
Length = 392
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/368 (29%), Positives = 181/368 (49%), Gaps = 43/368 (11%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LD +PK+ E + +T GG ++ + ++ L +E YL++ + K +DT
Sbjct: 9 IKTVKELDGFPKVPEPYVDKTAVGGTFSIFTICIIAYLIIAETSYYLDSRLQFKFELDTD 68
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
L+IN D+T A+PC + D +D S Q++ V H+ L+ + E Q+
Sbjct: 69 IDAKLKINIDITV-AMPCGRIGADVLD-STNQNM-VGHE----SLEEEDTWWELTQEQ-- 119
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ H L+H +Y Y A + E K + ++
Sbjct: 120 --------RSHFEALKHMNSYLREEYHA-------------IHELLWKSNQVTLHSEMPK 158
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-S 243
+ + + C I+G L VNKV+GNFH GKS H+H I AF D
Sbjct: 159 RSHQPSY-------PPNACRIHGSLNVNKVSGNFHITAGKSLSIPRGHIH-ISAFMIDRD 210
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSN 302
+N +H+INK +FG PGVV+PL+G + +YQYF++VVPT + T +S T ++
Sbjct: 211 YNFTHRINKFSFGGPSPGVVHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLS--TSKTY 268
Query: 303 QFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
Q+SV ++ R Q +PG+FF YD+S +K+ T++ + FL +CA VGG+F
Sbjct: 269 QYSVKDYQRPIDHQKGSHGVPGIFFKYDMSALKIKVTQQRDTVSQFLVKLCATVGGIFVT 328
Query: 362 SGIIDAFI 369
SG++ +
Sbjct: 329 SGLVKNIV 336
>gi|331239265|ref|XP_003332286.1| hypothetical protein PGTG_14582 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309311276|gb|EFP87867.1| hypothetical protein PGTG_14582 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 366
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 170/371 (45%), Gaps = 65/371 (17%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ IR DA+PK ++ R+ GGV+T+ + ++L+L + EL+ YL + LVD S
Sbjct: 12 LPAIREFDAFPKTLPNYKQRSSRGGVLTVFVACLILVLIWHELKEYLFGEPKYSFLVDPS 71
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+L IN D+T A+PC LSVD D G++ + FKK E IG
Sbjct: 72 IAHSLGINIDLTV-AMPCHYLSVDIKDAVGDRMY--MNQEFKK---------EGTHFDIG 119
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K R++HN N+ E LS ++
Sbjct: 120 DAK----------RIDHN------------------NSTSE-----------LSATQILH 140
Query: 185 QCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
K+ + + +G C IYG +V KV GN H + S H L
Sbjct: 141 ASKKGQTFGKTRPLVPDGPACRIYGNTQVKKVTGNLHITTLGHGYLSWEHTDHKL----- 195
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
N+SH I + +FG+ FP +V PLD + P ++QYFI VVPT Y D G + +N
Sbjct: 196 -MNLSHVITEFSFGQFFPKIVQPLDNSVELTDKPFHIFQYFISVVPTTYIDRLGRQLHTN 254
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
Q+SVT+ R E G Q +PG+FF YD+ P+ + E S + FL + ++GG+ +
Sbjct: 255 QYSVTDMSRPVEHG--QGIPGLFFKYDMEPMSLILHERTTSLIQFLVRLAGMIGGIVVCT 312
Query: 363 G----IIDAFI 369
G ++D F+
Sbjct: 313 GWTFRLVDRFV 323
>gi|58261152|ref|XP_567986.1| ER to Golgi transport-related protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|134115843|ref|XP_773404.1| hypothetical protein CNBI2490 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256029|gb|EAL18757.1| hypothetical protein CNBI2490 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57230068|gb|AAW46469.1| ER to Golgi transport-related protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 431
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 107/390 (27%), Positives = 177/390 (45%), Gaps = 60/390 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I+ DA+PK+ + ++ GGV+T + +++ LL ++L YL + VD+ +
Sbjct: 32 IKRFDAFPKVESTYTIKSRRGGVLTALVGLIIFLLVLNDLGEYLYGAPDYAFQVDSEVQK 91
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ------------HLDVKHDIFKKRLDSQGNV 115
L++N D+T A+PC L++D D G++ H +V F K ++ +
Sbjct: 92 DLQLNVDLTV-AMPCRYLTIDLRDAVGDRLHLSNSFAKDGTHFNVGTATFIK--NNPSST 148
Query: 116 IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG--SCYGAESSDEDCCNNCEEVREAYRKK 173
S + I + + P Q+ ++ G +G +SS + AYR
Sbjct: 149 TPSASEIISSSRRRTPNQQ--------SSFSGIKRLFGLDSS-ASSNRRTSQGHTAYRPT 199
Query: 174 GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
K ++G C IYG +EV KV N H
Sbjct: 200 --------------------YDKVQDGPACRIYGSVEVKKVTANLHIT---------TLG 230
Query: 234 HDILAFQRDS---FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTV 290
H ++FQ N+SH +++ +FG FP + PLD E P ++QYF++VVPT
Sbjct: 231 HGYMSFQHTDHHLMNLSHVVHEFSFGPFFPAIAQPLDQSYEITEQPFTIFQYFLRVVPTT 290
Query: 291 YTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 350
Y D S + ++Q++VT++ RS E G+ +PG+FF YDL P+ V E S FL
Sbjct: 291 YIDASRRKLITSQYAVTDYSRSFEHGK--GVPGLFFKYDLEPMSVIIRERTTSLYQFLIR 348
Query: 351 VCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 380
+ +VGGV+TV+ Q+ + K +
Sbjct: 349 LAGVVGGVWTVAAFALRVFNRAQKHVSKAV 378
>gi|350419069|ref|XP_003492060.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Bombus impatiens]
Length = 392
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 110/368 (29%), Positives = 168/368 (45%), Gaps = 43/368 (11%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LD +PK+ E + +T GG ++ + + L +E YL++ + K DT
Sbjct: 9 IKTVKELDGFPKVPEPYVDKTAVGGTFSIFTICTIAYLIIAETSYYLDSRLQFKFETDTD 68
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
L+IN D+T A+ CS +S D +D + Q+ IG
Sbjct: 69 IDAKLKINIDITV-AMTCSRISADVLD-------------------------STNQNMIG 102
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
LE +T+ S E N +RE Y L + +
Sbjct: 103 HES-----------LEQEDTWWELTQEQRSHFEALKNVNSYLREEYHAIHELLWKSNQVT 151
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS- 243
C I+G L VNKVAGNFH GKS H+H IL F D
Sbjct: 152 LYSEMPKRTHQPSYPPNSCRIHGSLNVNKVAGNFHITAGKSLSFPMGHIH-ILTFMTDKD 210
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSN 302
+N +H+INK +FG PG+++PL+G + +YQYF++VVPT + T +S T ++
Sbjct: 211 YNFTHRINKFSFGGPSPGIIHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLS--TSKTY 268
Query: 303 QFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
Q+SV +H R Q PG+FF YD+S +K+ T++ + FL +CA VGG+F
Sbjct: 269 QYSVKDHQRPIDHQKGSHGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVT 328
Query: 362 SGIIDAFI 369
SG+I +
Sbjct: 329 SGMIKNIV 336
>gi|9963759|gb|AAG09679.1|AF183410_1 cd002 protein [Homo sapiens]
Length = 387
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 173/370 (46%), Gaps = 49/370 (13%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+ ++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + V
Sbjct: 16 EKTLSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEV 75
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D LRIN D+T A+ C + D +D++ ++ S
Sbjct: 76 DKDFSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADG 115
Query: 122 GIGAPKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
+ P + P Q+ R+ S E S +D + A++ AL
Sbjct: 116 LVYEPTVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSTSTAL-- 165
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH-DILA 238
P D + C I+G L VNKVAGNFH GK+ H H
Sbjct: 166 PPREDDSS----------QSPNACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLGSTC 215
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSG 296
+S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S
Sbjct: 216 STMESYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISA 275
Query: 297 HTIQSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
T +QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IV
Sbjct: 276 DT---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIV 332
Query: 356 GGVFTVSGII 365
GG+F+ +G++
Sbjct: 333 GGIFSTTGML 342
>gi|71013590|ref|XP_758634.1| hypothetical protein UM02487.1 [Ustilago maydis 521]
gi|46098292|gb|EAK83525.1| hypothetical protein UM02487.1 [Ustilago maydis 521]
Length = 415
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 171/363 (47%), Gaps = 50/363 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ KIR DA+PK + R+ GG++T+++++ +L L ++EL YL VD+
Sbjct: 11 LPKIRQFDAFPKTQSIYTQRSSKGGLLTIIATVTLLALLWTELSSYLYGERGYSFSVDSR 70
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
T++IN D+T A+ C L++D D G++ L V F K G E IG
Sbjct: 71 LQSTMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDSEFTK----DGTTFE-----IG 119
Query: 125 -APKIDK-PLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
A ++D P+Q E S + N + YRKK
Sbjct: 120 HADRLDALPMQ-------------------EVSVQKTINQARR-KPVYRKKPRN------ 153
Query: 183 IDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ R+ Q+ +G C IYG +EV +V GN H + S H L
Sbjct: 154 -KKFSRQVAFQKTAHIVPDGPACRIYGSMEVKRVTGNLHITTLGHGYLSVEHTDHKL--- 209
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
N+SH I++ +FG +FP + PLD T E ++QYF+ VPT++ D G +
Sbjct: 210 ---MNLSHVIHEFSFGPYFPEISQPLDSSVETTEKHFTVFQYFVSAVPTLFIDARGRKLH 266
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
++Q+SVT++ R E G+ +PG+F YD+ P+++T + S FL + ++GGV+
Sbjct: 267 THQYSVTDYTRQIEHGK--GVPGIFIKYDIEPLQMTIRQRSTSLFQFLVRLAGVLGGVWV 324
Query: 361 VSG 363
G
Sbjct: 325 CVG 327
>gi|322791472|gb|EFZ15869.1| hypothetical protein SINV_02690 [Solenopsis invicta]
Length = 403
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 175/379 (46%), Gaps = 57/379 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLV--------------SSIVMLLLFFSELRLYLNA 53
++ LDA+PK+ E + +T GG L + ++ L +E YL++
Sbjct: 12 VKELDAFPKVPELYVDKTAVGGTCELTVINKIFSIIHISIFTIFIIAYLIIAETSYYLDS 71
Query: 54 VTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG 113
+ K DT L+IN DVT A+PC + D +D S QH+ + D K+
Sbjct: 72 RLQFKFEPDTEIDAKLQINIDVTV-AMPCGRIGADVLD-STNQHM-IDFDSLKEEDTWWE 128
Query: 114 NVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK 173
E R H L+H +Y Y A + E K
Sbjct: 129 LTAEQRA--------------HFEALKHMNSYLREEYHA-------------IHELLWKS 161
Query: 174 GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
+ ++ + + C ++G L VNKVAGNFH GKS H+
Sbjct: 162 NQVILYSEMPKRTSEPDY-------APNACRVHGSLNVNKVAGNFHITAGKSLSVPHGHI 214
Query: 234 HDILAFQRD-SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VY 291
H I AF D +N +H+IN+ +FG PG+V+PL+G + +YQYF++VVPT +
Sbjct: 215 H-ISAFMTDRDYNFTHRINRFSFGGPSPGIVHPLEGDEKIADNNMMLYQYFVEVVPTDIR 273
Query: 292 TDVSGHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 350
T +S T ++ Q+SV +H R + + +PG+FF YD+S +K+ T+E + FL
Sbjct: 274 TLLS--TSKTYQYSVKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQFLVK 331
Query: 351 VCAIVGGVFTVSGIIDAFI 369
+CA VGG+F SG+I +
Sbjct: 332 LCATVGGIFVTSGLIKNIV 350
>gi|340709072|ref|XP_003393139.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Bombus terrestris]
Length = 392
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 177/368 (48%), Gaps = 43/368 (11%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LD +PK+ E + +T GG ++ + + L +E YL++ + K DT
Sbjct: 9 IKTVKELDGFPKVPELYVDKTAVGGTFSIFTICTIAYLIIAETSYYLDSRLQFKFETDTD 68
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
L+IN D+T A+ CS +S D +D S Q++ I + L+ + E Q+
Sbjct: 69 IDAKLKINIDITV-AMTCSRISADVLD-STNQNM-----IGHESLEQEDTWWELTQEQ-- 119
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ H L+ +Y Y A + E K ++
Sbjct: 120 --------RSHFEALKDVNSYLREEYHA-------------IHELLWKSNQVTLYSEMPK 158
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS- 243
+ + + C I+G L VNKVAGNFH GKS H+H IL F D
Sbjct: 159 RTHQPSY-------PPNSCRIHGSLNVNKVAGNFHITAGKSLSFPMGHIH-ILTFMTDKD 210
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSN 302
+N +H+INK +FG PG+++PL+G + +YQYF++VVPT + T +S T ++
Sbjct: 211 YNFTHRINKFSFGGPSPGIIHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLS--TSKTY 268
Query: 303 QFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
Q+SV +H R Q PG+FF YD+S +K+ T++ + FL +CA VGG+F
Sbjct: 269 QYSVKDHQRPIDHQKGSHGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVT 328
Query: 362 SGIIDAFI 369
SG++ + +
Sbjct: 329 SGMVKSIV 336
>gi|158292439|ref|XP_313915.3| AGAP005044-PA [Anopheles gambiae str. PEST]
gi|157016993|gb|EAA09437.3| AGAP005044-PA [Anopheles gambiae str. PEST]
Length = 371
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 182/366 (49%), Gaps = 48/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ + LDA+PK+ E+F T GG ++L+S +V++ L + E+ YL++ + DT
Sbjct: 11 LDAVSRLDAFPKVKEEFVQPTRVGGTLSLISRLVIVFLIYHEVTYYLDSRLVFTFVPDTD 70
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-KRLDSQGNVIESRQDGI 123
L+++ D+T A+PC + D +D + + ++F L + E
Sbjct: 71 LQSKLKVHIDLTV-AMPCKSIGADILDSTNQ-------NVFSFGILQEEDTWFEL----- 117
Query: 124 GAPKIDKPLQR-HGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL--SNP 180
P QR H ++H+ +Y + Y + E K A+ S P
Sbjct: 118 ------CPSQRVHFDYMQHHNSYLRNEY-------------HSIAEILYKSDHAVVYSMP 158
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ + I E+ + C I+G L +NKVAGNFH GK+ H S H+H F
Sbjct: 159 ERV----------IIPEKPHDACRIHGVLTLNKVAGNFHITVGKTIHFSRGHIHLNSIFA 208
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
N SH+IN+ +FG+H G+++PL+G + M QYFI+VVPT H+ +
Sbjct: 209 NTQTNFSHRINRFSFGDHTAGIIHPLEGDEKLFDNGQVMMQYFIEVVPTDVQKFYSHS-K 267
Query: 301 SNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+ Q++V E+ + + + +Q + G++F YD+S ++V ++ S HF+ + +I+ G+
Sbjct: 268 TYQYTVRENLQLIDIDKGMQGVAGIYFKYDMSALRVLVRQDRDSIAHFIVRLSSIIAGIV 327
Query: 360 TVSGII 365
+SG++
Sbjct: 328 VISGML 333
>gi|261334705|emb|CBH17699.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 391
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 192/389 (49%), Gaps = 34/389 (8%)
Query: 4 IMNKIRSLDAYPKINEDF-YSRTFSGGVITLVSSIVMLLLFFSELRLY---LNAVTETKL 59
++ K+ ++D + K ED+ S+T +G +I++++ + LL E+ Y NA +T+L
Sbjct: 21 LLRKVAAVDLFTKPKEDYCRSQTRAGAIISIITVFAVGLLASWEVMSYTLGWNAY-KTEL 79
Query: 60 LVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNV--IE 117
VDTS + + N D+TF PC L +D D+SG ++V ++ K +D GN+ +
Sbjct: 80 SVDTSPEKNITFNIDITFMQEPCHDLFLDVSDVSGTFSINVTENLLKTPVDVGGNLAYLG 139
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY---GAESSDEDCCNNCEEVREAYRKKG 174
+R+ P+ +R+ ++ +CG C+ A + ++CCN CEEV + +KG
Sbjct: 140 TRR-FFTDPRSPLYTRRND---PNSPDFCGRCFTGNKAIAGGKNCCNTCEEVMAEHDRKG 195
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
N ++++QC E L E GCN G L V KV+G F P ++ + +
Sbjct: 196 LPRPNKNVVEQCIGELSL------ENPGCNYRGALNVRKVSGVIFFTP--KVIKNTIKME 247
Query: 235 DILAFQRDSFNISHKINKLAFGEHF------PGVVNPLDGVRWTQETPSGMYQYFIKVVP 288
D+L F+ SH INK + G+ GV+NPL+ R+ +Y++ +VP
Sbjct: 248 DLL-----KFDASHVINKFSIGDESVRRHSRRGVLNPLEKQRFNGSGRFMKVRYYLNIVP 302
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQG-RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
T Y + + + + ++ S E P V F +D P++V + HF
Sbjct: 303 TTYGSGASSGLHPPTYEYSANWNSREVAIGYGGFPSVEFSFDFFPMQVNNNFKREPIYHF 362
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
L +C IVGG+F V G++D+ + R +
Sbjct: 363 LVQLCGIVGGLFVVLGLVDSVVARLTRLV 391
>gi|340372649|ref|XP_003384856.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Amphimedon queenslandica]
Length = 347
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/372 (28%), Positives = 177/372 (47%), Gaps = 53/372 (14%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ DA+PK++ED+ T GG+ ++VS ++L L SEL + ++ + +VDT
Sbjct: 7 LKVVKEFDAFPKVSEDYIKPTTRGGLFSIVSITIILFLIVSELSYFKDSEILYEYMVDTD 66
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISG-----EQHLDVKHDIFKKRLDSQGNVIESR 119
TL++ FD+T A+PC L D +D +G +Q + + IF+ Q + ++
Sbjct: 67 MTSTLKLRFDITV-AMPCEFLGADVVDAAGSSKSLQQEVHKEPTIFELN-KEQKAWLAAK 124
Query: 120 QDGIGAPKIDKPLQRHGG-RLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
Q+ I +RH G RL + + +S + E + S
Sbjct: 125 QEVI---------RRHEGLRLLRDVMF-------DSHPQQYIPFPEHPQH---------S 159
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
P + C+ ++G ++VNKV+GNFH G++ H H
Sbjct: 160 AP--LTSCR-----------------VHGHIQVNKVSGNFHITAGQAVPHPQGHAHLSAF 200
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
+ N SH+I+ FG PG+V+PL+G + ++QY+I++VPT G
Sbjct: 201 VPTNMINFSHRIDSFGFGVSTPGMVDPLEGTYVIARESNRLFQYYIQIVPTTLQMRGGSD 260
Query: 299 IQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+ +NQ+SVTE R+ S + LPG+FF Y++ + V E FL +CAIVGG
Sbjct: 261 LHTNQYSVTERNRAISHKAGSHGLPGLFFKYEIYSLMVLMKEVDRPLSLFLVRLCAIVGG 320
Query: 358 VFTVSGIIDAFI 369
VF G+I F+
Sbjct: 321 VFATLGMISQFL 332
>gi|242006215|ref|XP_002423949.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
gi|212507219|gb|EEB11211.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
Length = 349
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 179/385 (46%), Gaps = 59/385 (15%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LDA+PK++ + GG ++++S I+ML + +SE+ Y N+ K L D
Sbjct: 13 LKSVKVLDAFPKVDNSCRESSPVGGTLSIISYILMLWILYSEITYYTNSKITYKFLPDVD 72
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+ ++I D+T A+PCS +S D +D + + +
Sbjct: 73 FDQKVKIYLDMTV-AMPCSAVSADILDSTQQSVFNF------------------------ 107
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG-------WAL 177
G L T+ + E S + + + V R+ W
Sbjct: 108 ------------GELHEENTW----FDLEPSQKINFDQIKNVNALLRQDYHEVHEYLWKS 151
Query: 178 SNPDLID-QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
++P I+ R+ R + C IYG L +NKVAGNFH + GKS H+H I
Sbjct: 152 ASPSFINVYVPRKNLPNR----PYDACRIYGELVLNKVAGNFHISAGKSLQLPRGHIH-I 206
Query: 237 LAFQRDS-FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDV 294
F D FN SH++N +FG++ PG+V+PL+G YQYFI+VVPT V T +
Sbjct: 207 ATFMSDKEFNFSHRLNYFSFGDYSPGIVHPLEGDEKIATDAMMSYQYFIEVVPTEVKTFL 266
Query: 295 SGHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
+ + Q+SV ++ R +PG+FF YD+S +KV +E S ++F +CA
Sbjct: 267 TNQL--TYQYSVKDYQRPINHNTGSHGIPGIFFKYDMSALKVIVMQERDSPINFAVKLCA 324
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKK 378
+GG+ SG+++ I + KK
Sbjct: 325 SIGGIHITSGLVNNIILYLINFYKK 349
>gi|71755761|ref|XP_828795.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70834181|gb|EAN79683.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 391
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 192/389 (49%), Gaps = 34/389 (8%)
Query: 4 IMNKIRSLDAYPKINEDF-YSRTFSGGVITLVSSIVMLLLFFSELRLY---LNAVTETKL 59
++ K+ ++D + K ED+ S+T +G +I++++ + LL E+ Y NA +T+L
Sbjct: 21 LLRKVAAVDLFTKPKEDYCRSQTRAGAIISIITVFAVGLLASWEVMSYTLGWNAY-KTEL 79
Query: 60 LVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNV--IE 117
VDTS + + N D+TF PC L +D D+SG ++V ++ K +D GN+ +
Sbjct: 80 SVDTSPEKNITFNIDITFMQEPCHDLFLDVSDVSGTFSINVTENLLKTPVDVGGNLAYLG 139
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY---GAESSDEDCCNNCEEVREAYRKKG 174
+R+ P+ +R+ ++ +CG C+ A + ++CCN CEEV + +KG
Sbjct: 140 TRR-FFTDPRSPLYTRRND---PNSPDFCGRCFTGNKAIAGGKNCCNTCEEVMAEHDRKG 195
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
N ++++QC E L E GCN G L V KV+G F P ++ + +
Sbjct: 196 LPRPNKNVVEQCIGELSL------ENPGCNYRGALNVRKVSGVIFFTP--KVIKNTIKME 247
Query: 235 DILAFQRDSFNISHKINKLAFGEHF------PGVVNPLDGVRWTQETPSGMYQYFIKVVP 288
D+L F+ SH INK + G+ GV+NPL+ R+ +Y++ +VP
Sbjct: 248 DLL-----KFDASHVINKFSIGDESVRRHSRRGVLNPLEKQRFNGSGRFMKVRYYLNIVP 302
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQG-RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
T Y + + + + ++ S E P V F +D P++V + HF
Sbjct: 303 TTYGSGASSGLHPPTYEYSANWNSREVAIGYGGFPSVEFSFDFFPMQVNNNFKREPIYHF 362
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
L +C I+GG+F V G++D+ + R +
Sbjct: 363 LVQLCGIIGGLFVVLGLVDSVVARLTRLV 391
>gi|307105802|gb|EFN54050.1| hypothetical protein CHLNCDRAFT_136126 [Chlorella variabilis]
Length = 319
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 112/383 (29%), Positives = 165/383 (43%), Gaps = 84/383 (21%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
K+ L A+ E +T G ++T++ V L+LF SE++ + + VDTSR
Sbjct: 7 KLSHLTAFSHAQEHLRVQTIHGAIVTIIGVCVALVLFISEVQQCMVVKRVQDMRVDTSRR 66
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKH------DIFKKRLDSQGNVIESRQ 120
E L ++F+VTFPALPC L +DA D+SG+ + + ++ K +D G + R
Sbjct: 67 EELHVSFNVTFPALPCEALLMDAGDVSGKWQTESRMKVAKNGEVHKHSVDISGRWL--RL 124
Query: 121 DGIGAP---KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
AP + D P E NE GA + CN
Sbjct: 125 AEYTAPSEGEWDNP-------FEMNEI------GAALKRHEGCN---------------- 155
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
I+G+LEV +VAGN HFA ++ I+
Sbjct: 156 ---------------------------IHGWLEVQRVAGNVHFAVRPEALFLSMNAEAIM 188
Query: 238 AFQRDS--FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
D+ NISH NPL+GV T +G+ +YF+KVVPT + +
Sbjct: 189 QLHPDASKLNISH--------------ANPLEGVAQIDRTATGIDKYFVKVVPTDFYTLW 234
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
G + Q+SVTE++ G Q P V+ YD SPI V E L L VCA+V
Sbjct: 235 GRKTHTYQYSVTEYYHQFRGGEEQP-PAVYLLYDASPIMVDIREMRPGLLRLLVRVCAVV 293
Query: 356 GGVFTVSGIIDAFIYHGQRAIKK 378
GG F ++G+ D ++ A+K+
Sbjct: 294 GGAFALTGLFDKMVHRAVVAVKR 316
>gi|402885549|ref|XP_003906216.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Papio anubis]
Length = 364
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 116/371 (31%), Positives = 174/371 (46%), Gaps = 71/371 (19%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPC------SILSVDAM-DISGEQHLDVKHDIFKKRLDSQGNVIE 117
LRIN D+T A+ C ++L+ A+ D+S +Q K +I+
Sbjct: 70 FSSKLRINIDITV-AMKCQCKYTFNLLNPHAVFDLSPQQ----------KEWQRMLQLIQ 118
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
SR LQ E S +D + A++ AL
Sbjct: 119 SR------------LQE------------------EHSLQDVI-----FKSAFKSASTAL 143
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
P D + + C I+G L VNKVAGNFH GK+ H H
Sbjct: 144 --PPREDD----------SSQSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAA 191
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVS 295
+S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S
Sbjct: 192 LVNHESYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKIS 251
Query: 296 GHTIQSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
T +QFSVTE R + + G+F YDLS + VT TEEH+ F F +C I
Sbjct: 252 ADT---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGI 308
Query: 355 VGGVFTVSGII 365
VGG+F+ +G++
Sbjct: 309 VGGIFSTTGML 319
>gi|270003406|gb|EEZ99853.1| hypothetical protein TcasGA2_TC002635 [Tribolium castaneum]
Length = 380
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 176/369 (47%), Gaps = 45/369 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++KI+ +D +PKI E F ++ GG ++ S I++ L F E+ YL++ K DT
Sbjct: 16 LSKIKKIDIFPKIEETFKEKSSVGGTFSVFSFILITWLVFLEINYYLDSKFIFKFSPDTD 75
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
L+IN D+T A+PCS L D +D + + LD + E
Sbjct: 76 FDAKLKINVDITV-AMPCSNLGADILDSTNQNAYKFG------SLDEEDTWFEM------ 122
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
AP Q H + +Y VRE Y L
Sbjct: 123 APN----QQIHFHNKKQFNSY--------------------VREEYHALKDVLWKSRFST 158
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRD 242
+ + C I+G L +NKV+GNFH GKS + H+H I AF +RD
Sbjct: 159 MFRHRPERSTYPNRPHDACRIHGSLILNKVSGNFHITAGKSLNLPRGHIH-ISAFMSERD 217
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQS 301
+N SH+I+ +FG+ PG+++PL+G ++ YFI+VVPT V T ++ + +
Sbjct: 218 -YNFSHRIDTFSFGDSSPGIIHPLEGDELITHNGMTLFNYFIEVVPTNVKTFLAN--VNT 274
Query: 302 NQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Q+SV E R + + +PG+FF YD+S +KVT ++E FL +C+I+GG+F
Sbjct: 275 YQYSVKELNRPIDHDKGSHGMPGIFFKYDMSALKVTVSQERDHLGMFLARLCSIIGGIFV 334
Query: 361 VSGIIDAFI 369
SG +++F+
Sbjct: 335 CSGFVNSFV 343
>gi|189235693|ref|XP_966630.2| PREDICTED: similar to AGAP005044-PA [Tribolium castaneum]
Length = 373
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 176/369 (47%), Gaps = 45/369 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++KI+ +D +PKI E F ++ GG ++ S I++ L F E+ YL++ K DT
Sbjct: 9 LSKIKKIDIFPKIEETFKEKSSVGGTFSVFSFILITWLVFLEINYYLDSKFIFKFSPDTD 68
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
L+IN D+T A+PCS L D +D + + LD + E
Sbjct: 69 FDAKLKINVDITV-AMPCSNLGADILDSTNQNAYKFG------SLDEEDTWFEM------ 115
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
AP Q H + +Y VRE Y L
Sbjct: 116 APN----QQIHFHNKKQFNSY--------------------VREEYHALKDVLWKSRFST 151
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRD 242
+ + C I+G L +NKV+GNFH GKS + H+H I AF +RD
Sbjct: 152 MFRHRPERSTYPNRPHDACRIHGSLILNKVSGNFHITAGKSLNLPRGHIH-ISAFMSERD 210
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQS 301
+N SH+I+ +FG+ PG+++PL+G ++ YFI+VVPT V T ++ + +
Sbjct: 211 -YNFSHRIDTFSFGDSSPGIIHPLEGDELITHNGMTLFNYFIEVVPTNVKTFLAN--VNT 267
Query: 302 NQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Q+SV E R + + +PG+FF YD+S +KVT ++E FL +C+I+GG+F
Sbjct: 268 YQYSVKELNRPIDHDKGSHGMPGIFFKYDMSALKVTVSQERDHLGMFLARLCSIIGGIFV 327
Query: 361 VSGIIDAFI 369
SG +++F+
Sbjct: 328 CSGFVNSFV 336
>gi|66360024|ref|XP_627190.1| ERV41 like membrane associated protein involved in vesicular
transport with a transmembrane region near the
C-terminus [Cryptosporidium parvum Iowa II]
gi|46228832|gb|EAK89702.1| ERV41 like membrane associated protein involved in vesicular
transport with a transmembrane region near the
C-terminus [Cryptosporidium parvum Iowa II]
Length = 403
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 112/405 (27%), Positives = 177/405 (43%), Gaps = 62/405 (15%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TSR 65
K++ DA+ K +F +T GG +T++S I M++LF+SEL+ YLN + ++ VD S
Sbjct: 15 KMKQFDAFSKPISEFRIKTAFGGYLTILSMIAMIILFYSELKYYLNITRKDEVTVDHLSS 74
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
+ + + FP LPC IL V +++ + + + GI
Sbjct: 75 NRNINLRMQLEFPKLPCDILGVRIINLQENKEIYLP------------------DGGIEF 116
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPD 181
KI CG CY A ++ +CCN C+++ Y KKG L +
Sbjct: 117 VKIGSNESNANSSSG-----CGPCYDASIINDLGAVNCCNTCKDIFNEYDKKGIKLPHVI 171
Query: 182 LIDQCKREGFLQRIKEE-----EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV---HV 233
QC + +RI EGC I + KV G + H+ V +
Sbjct: 172 SFKQCDYDK-SKRISNALSSNLNSEGCKIKVNGYIPKVKGKIEIS-----HKRWVKYKEM 225
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQET-------------PSGMY 280
D+ + FN S+K+N L FGE PG+ N + Q +
Sbjct: 226 TDLEIAESHLFNFSYKMNYLDFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFEDAYI 285
Query: 281 QYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR----SSEQGRL---QTLPGVFFFYDLSPI 333
+ + +PT Y ++ +I S+QFSV ++ S G+ ++PG+ YD +P
Sbjct: 286 DFDMHCIPTQYNTINNKSINSHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPF 345
Query: 334 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
V TE SFL F+T CAI+GG+F SG+ID F + ++ K
Sbjct: 346 LVKITESRRSFLSFITECCAIIGGIFAFSGMIDIFFFKFLSSVNK 390
>gi|7341109|gb|AAF61208.1|AF216751_1 CDA14 [Homo sapiens]
Length = 378
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/367 (30%), Positives = 171/367 (46%), Gaps = 49/367 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FSSKLRINIDITV-AMKCQYVGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 110 EPTVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSTSTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + C I+G L VNKVAGNFH GK+ H H Q
Sbjct: 160 EDD----------SSQSPNACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLGSTCQPW 209
Query: 243 SFNI-SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTI 299
+ I SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 NLTIFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT- 268
Query: 300 QSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+
Sbjct: 269 --HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGI 326
Query: 359 FTVSGII 365
F+ +G++
Sbjct: 327 FSTTGML 333
>gi|67623433|ref|XP_667999.1| serologically defined breast cancer antigen 84 like (42.9 kD)
(XQ234) [Cryptosporidium hominis TU502]
gi|54659178|gb|EAL37768.1| serologically defined breast cancer antigen 84 like (42.9 kD)
(XQ234) [Cryptosporidium hominis]
Length = 388
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 112/404 (27%), Positives = 177/404 (43%), Gaps = 62/404 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TSRG 66
++ DA+ K +F +T GG +T++S I M++LF+SEL+ YLN + ++ VD S
Sbjct: 1 MKQFDAFSKPISEFRIKTAFGGYLTILSIIAMIILFYSELKYYLNITRKDEVTVDHLSSN 60
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ + + FP LPC IL V +++ + + + GI
Sbjct: 61 RNINLRMQLEFPKLPCDILGVRIINLQENKEIYLP------------------DGGIEFV 102
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDL 182
KI CG CY A +++ +CCN C++V Y KKG L +
Sbjct: 103 KIGSNESNANSSSG-----CGPCYDASINNDLGVVNCCNTCKDVFNEYDKKGIKLPHVIS 157
Query: 183 IDQCKREGFLQRIKEE-----EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV---HVH 234
QC + +RI EGC I + KV G + H+ V +
Sbjct: 158 FKQCDYDK-SKRISNALSSNLNSEGCKIKVNGYIPKVKGKIEIS-----HKRWVKYKEMT 211
Query: 235 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQET-------------PSGMYQ 281
D+ + FN S+K+N L FGE PG+ N + Q +
Sbjct: 212 DLEIAESHLFNFSYKMNYLDFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFDDAYID 271
Query: 282 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFR----SSEQGRL---QTLPGVFFFYDLSPIK 334
+ + +PT Y ++ +I S+QFSV ++ S G+ ++PG+ YD +P
Sbjct: 272 FDMHCIPTQYNTINNKSINSHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPFL 331
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
V TE SFL F+T CAI+GG+F SG+ID F + ++ K
Sbjct: 332 VKMTESRRSFLSFITECCAIIGGIFAFSGMIDIFFFKFLSSVNK 375
>gi|392577310|gb|EIW70439.1| hypothetical protein TREMEDRAFT_43159 [Tremella mesenterica DSM
1558]
Length = 435
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 105/359 (29%), Positives = 161/359 (44%), Gaps = 40/359 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I+S DA+PK+ + S++ G V+T + ++ LL ++L YL + VD +
Sbjct: 33 IKSFDAFPKVQSTYTSQSRRGAVLTALVGFIIFLLVLNDLGEYLYGAPDYTFDVDQQLQK 92
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ-HLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
L++N D+T A+PC LS+D D G++ HL S G E +G
Sbjct: 93 DLQLNVDLTV-AMPCHFLSIDLRDAVGDRLHL------------SDGFTKEGTTFAVGKA 139
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLID 184
K H + ++ S + R R + A+ P
Sbjct: 140 VTSK---THPTPISASQVISSSRRRTPTQQRSFSGIRRLLSSRPKRRTRKHAMFRP---- 192
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
K + G C IYG +EV KV N H + S H L
Sbjct: 193 --------TPNKADNGPACRIYGSVEVKKVTANLHITTLGHGYMSFEHTDHAL------M 238
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N+SH +++ +FG FP + PLD + P QYF++VVPT Y D +G + ++Q+
Sbjct: 239 NLSHVVHEFSFGPFFPAIAQPLDMTMQVSDNPFTAIQYFLRVVPTTYIDANGRKLVTSQY 298
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA-IVGGVFTVS 362
+VT++ RS + G Q +PG+FF YDL + VT E S HF+ + IVGGV+TV+
Sbjct: 299 AVTDYLRSFQHG--QGVPGIFFKYDLEAMAVTVRERTTSLYHFVIRLIGVIVGGVWTVA 355
>gi|388583623|gb|EIM23924.1| DUF1692-domain-containing protein [Wallemia sebi CBS 633.66]
Length = 396
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 166/360 (46%), Gaps = 51/360 (14%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +R DA+PK + R+ GG+ T++ ++LL F E+ +L E + VDT+
Sbjct: 7 MPPLREFDAFPKTQASYKIRSKQGGIATVIVIFALVLLVFHEIGDWLYGHNEYQFSVDTT 66
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+++N D+T A+PC L+VD D G+ RL
Sbjct: 67 TETEMQLNVDLTV-AMPCHYLNVDIRDAVGD------------RL--------------- 98
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K+ +Q+ G E E Y +S+ + ++ R+ +R
Sbjct: 99 --KLSDSIQKDGTTFE-PEKYRQIGSAKQSTLSRIVKDSKKGRKWFRP------------ 143
Query: 185 QCKREGFLQRIKE-EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
R F + K ++G C IYG +E KV GN H + S H L
Sbjct: 144 TSTRNRFPKTKKLIKDGPACRIYGSVETKKVNGNMHITTLGHGYSSLEHTDHKL------ 197
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
N+SH I++ +FG+HFP + PLD + +YQYF+ VVPT Y D SGH++ +NQ
Sbjct: 198 MNLSHTIDEFSFGQHFPYISQPLDKSVEITDNHFPVYQYFMHVVPTTYVDASGHSLSTNQ 257
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
+S E + + + +PG+FF Y+L PI ++ + +SF L + A++GGV+ SG
Sbjct: 258 YSAREDIKFIHNHQ-RGIPGLFFRYELEPIHLSLSATTMSFTKLLIRLTALIGGVWCCSG 316
>gi|323509323|dbj|BAJ77554.1| cgd8_2900 [Cryptosporidium parvum]
gi|323510503|dbj|BAJ78145.1| cgd8_2900 [Cryptosporidium parvum]
Length = 388
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 111/404 (27%), Positives = 176/404 (43%), Gaps = 62/404 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TSRG 66
++ DA+ K +F +T GG +T++S I M++LF+SEL+ YLN + ++ VD S
Sbjct: 1 MKQFDAFSKPISEFRIKTAFGGYLTILSMIAMIILFYSELKYYLNITRKDEVTVDHLSSN 60
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ + + FP LPC IL V +++ + + + GI
Sbjct: 61 RNINLRMQLEFPKLPCDILGVRIINLQENKEIYLP------------------DGGIEFV 102
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDL 182
KI CG CY A ++ +CCN C+++ Y KKG L +
Sbjct: 103 KIGSNESNANSSSG-----CGPCYDASIINDLGAVNCCNTCKDIFNEYDKKGIKLPHVIS 157
Query: 183 IDQCKREGFLQRIKEE-----EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV---HVH 234
QC + +RI EGC I + KV G + H+ V +
Sbjct: 158 FKQCDYDK-SKRISNALSSNLNSEGCKIKVNGYIPKVKGKIEIS-----HKRWVKYKEMT 211
Query: 235 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQET-------------PSGMYQ 281
D+ + FN S+K+N L FGE PG+ N + Q +
Sbjct: 212 DLEIAESHLFNFSYKMNYLDFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFEDAYID 271
Query: 282 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFR----SSEQGRL---QTLPGVFFFYDLSPIK 334
+ + +PT Y ++ +I S+QFSV ++ S G+ ++PG+ YD +P
Sbjct: 272 FDMHCIPTQYNTINNKSINSHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPFL 331
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
V TE SFL F+T CAI+GG+F SG+ID F + ++ K
Sbjct: 332 VKITESRRSFLSFITECCAIIGGIFAFSGMIDIFFFKFLSSVNK 375
>gi|190346055|gb|EDK38054.2| hypothetical protein PGUG_02151 [Meyerozyma guilliermondii ATCC
6260]
Length = 407
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 178/368 (48%), Gaps = 58/368 (15%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA ++++ DA+PK+N R+ GG+ T+++ + +L + + ++ +L + + +
Sbjct: 62 MDAFSTRVKTFDAFPKLNSQHAVRSQRGGLSTIMTVVFILFVMWVQIGGFLGGYVDHQFV 121
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD LRIN D+ A+PC L + MDI+ ++ L + L+ QG+
Sbjct: 122 VDDQVRSDLRINLDMKV-AMPCEFLHTNVMDITDDRFLA------SEVLNFQGSYF---- 170
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
P + + N+ ++D + E + EA R
Sbjct: 171 ---FVPDL----------IRMNDA---------TTDYETPELEEIMLEAGRY-------- 200
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ REG+ + E C+I+G + VN+V+G+FH ++ HV
Sbjct: 201 ----EFDREGYHE---AESAPACHIFGSIPVNQVSGDFHITAKGMGYRDRAHV------D 247
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ N SH I + +FGE +P + NPLD T + Y+Y+ KVVPT+Y + G +
Sbjct: 248 PQALNFSHIIAEFSFGEFYPLIKNPLDFTGKTTDDHFQAYKYYAKVVPTLYERM-GLQVD 306
Query: 301 SNQFSVTEHFRSSE---QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+NQ+S+TE R E GR+Q +PG+FF Y+ IK+ +++ + F F+ + I+GG
Sbjct: 307 TNQYSITESHRKYELNTNGRIQGVPGIFFKYEFEAIKLIVSDKRIPFTSFVARLATIIGG 366
Query: 358 VFTVSGII 365
VF V+G +
Sbjct: 367 VFIVAGYL 374
>gi|340507573|gb|EGR33515.1| hypothetical protein IMG5_050820 [Ichthyophthirius multifiliis]
Length = 290
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 103/327 (31%), Positives = 158/327 (48%), Gaps = 77/327 (23%)
Query: 57 TKLLVDTSRG-ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNV 115
+++ VD+ RG + +R+N D+ FP PC ILS+D DI G ++V+ D+ K R+ G
Sbjct: 4 SEMFVDSLRGGQKIRVNLDIDFPKFPCDILSLDFQDIMGSHSVNVEGDLHKTRITKTGEY 63
Query: 116 IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW 175
+ RH E ++ + G
Sbjct: 64 FD----------------RH-----------------------------EQQQNKQHSGH 78
Query: 176 ALSNPDLIDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
A + +D LQRI++ + EGC + GF+ VN+V GNFH + +F Q +V
Sbjct: 79 AHDQSNQVD-------LQRIQQAIQNKEGCKLSGFMYVNRVPGNFHIS-CHAFGQILGYV 130
Query: 234 HDILAFQRDSFNISHKINKLAFGEH----------FPGVVNPLDGVRWTQ----ETPSGM 279
I ++ ++SHKIN L+FG+ GV+NP+D + T+ E
Sbjct: 131 FRITGI--NTIDLSHKINHLSFGDEDEIKIVKKQFTLGVLNPMDKLVKTKQKHFENYGIS 188
Query: 280 YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTE 339
Y Y++ VVPT Y D G+T NQF TE+ Q + +P ++F YDLSP+ V F +
Sbjct: 189 YNYYLNVVPTTYIDEWGYTYYVNQFVFTEN-----QIQTDYIPAIYFRYDLSPVTVMFKK 243
Query: 340 EHVSFLHFLTNVCAIVGGVFTVSGIID 366
+ + FLHFL V AIVGG+FT++ +D
Sbjct: 244 DRMPFLHFLVQVSAIVGGIFTIAAFMD 270
>gi|146421059|ref|XP_001486481.1| hypothetical protein PGUG_02151 [Meyerozyma guilliermondii ATCC
6260]
Length = 407
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 178/368 (48%), Gaps = 58/368 (15%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA ++++ DA+PK+N R+ GG+ T+++ + +L + + ++ +L + + +
Sbjct: 62 MDAFSTRVKTFDAFPKLNSQHAVRSQRGGLSTIMTVVFILFVMWVQIGGFLGGYVDHQFV 121
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD LRIN D+ A+PC L + MDI+ ++ L + L+ QG+
Sbjct: 122 VDDQVRSDLRINLDMKV-AMPCEFLHTNVMDITDDRFLA------SEVLNFQGSYF---- 170
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
P + + N+ ++D + E + EA R
Sbjct: 171 ---FVPDL----------IRMNDA---------TTDYETPELEEIMLEAGRY-------- 200
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ REG+ + E C+I+G + VN+V+G+FH ++ HV
Sbjct: 201 ----EFDREGYHE---AESAPACHIFGSIPVNQVSGDFHITAKGMGYRDRAHV------D 247
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ N SH I + +FGE +P + NPLD T + Y+Y+ KVVPT+Y + G +
Sbjct: 248 PQALNFSHIIAEFSFGEFYPLIKNPLDFTGKTTDDHFQAYKYYAKVVPTLYERM-GLQVD 306
Query: 301 SNQFSVTEHFRSSE---QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+NQ+S+TE R E GR+Q +PG+FF Y+ IK+ +++ + F F+ + I+GG
Sbjct: 307 TNQYSITELHRKYELNTNGRIQGVPGIFFKYEFEAIKLIVSDKRIPFTLFVARLATIIGG 366
Query: 358 VFTVSGII 365
VF V+G +
Sbjct: 367 VFIVAGYL 374
>gi|307206941|gb|EFN84785.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Harpegnathos saltator]
Length = 396
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 174/367 (47%), Gaps = 47/367 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ LDA+PK+ E + +T GG ++ + + L +E +L++ + K DT
Sbjct: 12 VKELDAFPKVPELYVDKTAVGGTFSIFTVCFIAYLIIAETSYFLDSRLQFKFETDTDIDA 71
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIF-KKRLDSQGNVIESRQDGIGAP 126
L+IN D+T A+PC + D +D ++ ++F L+ + E
Sbjct: 72 KLQINIDITV-AMPCGRIGADVLD-------SMEENVFGYDSLEQEDTWWEL-------- 115
Query: 127 KIDKPLQR-HGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
P QR H L+H +Y Y A + W + L +
Sbjct: 116 ---TPEQRAHFEALKHMNSYLREEY-----------------HAIHELLWKSNQITLYSE 155
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SF 244
+ + + C I+G L VNKVAGNFH GKS H+H I AF D +
Sbjct: 156 MPKRSYE---PDYPPNACRIHGSLNVNKVAGNFHITTGKSLSVPRGHIH-ISAFMTDRDY 211
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQ 303
N +H+IN+ +FG PG+V+PL+G + +YQYF++VVPT + T +S T ++ Q
Sbjct: 212 NFTHRINRFSFGGPSPGIVHPLEGDEKIADYNMMLYQYFVEVVPTDIRTLLS--TSKTYQ 269
Query: 304 FSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
+SV ++ R +PG+F Y++S +K+ T++ + FL +CA VGG+F S
Sbjct: 270 YSVKDYQRPINHNEGSHGVPGIFIKYNMSALKIKVTQQRDTIFQFLVKLCATVGGIFVTS 329
Query: 363 GIIDAFI 369
G+I +
Sbjct: 330 GLIKNIV 336
>gi|156402826|ref|XP_001639791.1| predicted protein [Nematostella vectensis]
gi|156226921|gb|EDO47728.1| predicted protein [Nematostella vectensis]
Length = 413
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 105/184 (57%), Gaps = 1/184 (0%)
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
KRE +E + C +YG +VNKVAGNFH GKS H H H +S N
Sbjct: 156 KREESKDAANTKEHDACRVYGSFKVNKVAGNFHITSGKSIHHPRGHAHLSSMVPVESLNF 215
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH+I+ L+FG+ PG+V+PLDG E MYQY+I+VVPT ++ I++NQ+S+
Sbjct: 216 SHRIDMLSFGKRVPGIVHPLDGEMQITEKRRMMYQYYIQVVPTSIKSLNSEEIKTNQYSM 275
Query: 307 TEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
T+ R S + G+FF YD+S I V +H S + FL +C IVGG+F SG++
Sbjct: 276 TQRIREISHDSGSHGIAGLFFKYDMSSIMVRVKHQHHSMVGFLVRLCGIVGGIFATSGML 335
Query: 366 DAFI 369
FI
Sbjct: 336 HDFI 339
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 1/87 (1%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I+ DA+PKI E++ T SGG ++LVS + + +L SE Y T+ VDT
Sbjct: 13 IKEFDAFPKIPENYQQTTASGGSVSLVSFLFIFVLVISEFWYYRATETKFSYEVDTDADS 72
Query: 68 TLRINFDVTFPALPCSILSVDAMDISG 94
L+IN D+T A+ C + D +D+SG
Sbjct: 73 KLQINVDLTI-AMKCEDIDADVLDLSG 98
>gi|154418008|ref|XP_001582023.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121916255|gb|EAY21037.1| hypothetical protein TVAG_172950 [Trichomonas vaginalis G3]
Length = 371
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 179/386 (46%), Gaps = 27/386 (6%)
Query: 8 IRSLDAYPKI-NEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+ D +PK ++ +TF+GG+I+ ++++ + L ++ + ++ +++D
Sbjct: 2 LSKFDVFPKFADKSVNIQTFTGGLISFLTTLWVCFLLVGKIHGLIYPEIKSSVVLDKEHV 61
Query: 67 ETLR---INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
+ R INFD+T + PC++L +D + G Q ++ +I R G I
Sbjct: 62 DGQRKTFINFDITIGS-PCTMLHIDLFEHDGYQKTNIIENISLTRYAQSGEDINDL---- 116
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
++K + + + YCG+CY S+D+ CCN C EV + ++ KG
Sbjct: 117 ----LEKRVPSKSKKQDFPPDYCGNCY--LSTDKKCCNTCREVMDVFKAKGLTYYASFRW 170
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS-GVHVHDILAFQRD 242
+QC REG L + E C I G L+V K +GNFH A G + + + H HD+ +
Sbjct: 171 EQCIREGVL----DFGNETCRIKGKLKVKKQSGNFHIALGANTNDNYKGHSHDLSSVDA- 225
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG----MYQYFIKVVPTVYTDVSGHT 298
S ++H I+ L FGE L V +G M Y++ P + +
Sbjct: 226 SHKLNHVIHSLTFGEPVDYYKPQLTDVEMQLPELNGSNYWMVTYYLHAAPERIS--TTDK 283
Query: 299 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
I S ++S R + PG+ F+YD +P+ V + H S + ++C IVGG
Sbjct: 284 IDSYRYSAFPSRRKVTNKTKKGFPGIVFYYDFAPMIVVYQPTHGSIRSIIVDICGIVGGA 343
Query: 359 FTVSGIIDAFIYHGQRAIKKKIEIGK 384
F+ + IIDA + I+ K IGK
Sbjct: 344 FSFAAIIDALAFGALSGIRGKTMIGK 369
>gi|443897407|dbj|GAC74748.1| CDK9 kinase-activating protein cyclin T [Pseudozyma antarctica
T-34]
Length = 414
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 162/348 (46%), Gaps = 42/348 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ KIR DA+PK + R+ GGV+T++S++ ++ L ++EL YL VD+
Sbjct: 11 LPKIRQFDAFPKTQSIYTQRSSKGGVLTIISALALVFLLWTELSTYLYGERGYSFAVDSQ 70
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
T++IN D+T A+ C L++D D G++ L V FKK G + IG
Sbjct: 71 LQSTMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDTEFKK----DGTTFD-----IG 119
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
H RL+ E+ D + + YR+K N
Sbjct: 120 ----------HADRLD--------ALPQEALDVGKTISKARKKPLYRRKP---RNKKFSR 158
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
Q + +G C IYG +EV +V GN H + S H L
Sbjct: 159 QVAFHKTAHLV--PDGPACRIYGSMEVKRVTGNLHITTLGHGYLSMEHTDHKL------M 210
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N+SH I++ +FG +FP + PLD T + ++QYF+ +PT++ D G + ++Q+
Sbjct: 211 NLSHVIHEFSFGPYFPEISQPLDSSVETTDKHFTVFQYFVSAIPTLFIDARGRRLHTHQY 270
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
SVT++ R E G+ +PG+F YD+ P+++T E VS + FL +
Sbjct: 271 SVTDYARPIEHGK--GVPGIFIKYDIEPLQMTIRERSVSLVQFLVRLA 316
>gi|148678794|gb|EDL10741.1| ERGIC and golgi 2, isoform CRA_a [Mus musculus]
Length = 375
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 165/367 (44%), Gaps = 60/367 (16%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 18 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 77
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ + + DG+
Sbjct: 78 FSSKLRINIDITV-AMKCHYVGADVLDLA--------------------ETMVASADGLA 116
Query: 125 -APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
P + P QR R+ S E S +D + A++ AL P
Sbjct: 117 YEPALFDLSPQQREWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PP 166
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
D + C I+G L VNKVAGNFH GK+ H H
Sbjct: 167 REDDSSLTP----------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNH 216
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVR--WTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
DS+N SH+I+ L+FGE PG++NPLDG P+ ++ Y I +
Sbjct: 217 DSYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDLVPTKLHTYKI-------------SA 263
Query: 300 QSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
++QFSVTE R + + G+F YDLS + VT TEEH+ F F +C I+GG+
Sbjct: 264 DTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGI 323
Query: 359 FTVSGII 365
F+ +G++
Sbjct: 324 FSTTGML 330
>gi|321258600|ref|XP_003194021.1| ER to Golgi transport-related protein [Cryptococcus gattii WM276]
gi|317460491|gb|ADV22234.1| ER to Golgi transport-related protein, putative [Cryptococcus
gattii WM276]
Length = 444
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 168/366 (45%), Gaps = 45/366 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I+S DA+PK+ + ++ GGV+T V +++ LL ++L YL + VD+ +
Sbjct: 33 IKSFDAFPKVESTYMIKSKRGGVLTAVVGLIIFLLVLNDLGEYLYGAPDYAFQVDSDVQK 92
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ-HL------DVKHDIFKKRLDSQGNVIESRQ 120
L++N D+T A+PC L++D D G++ HL D H K + N +
Sbjct: 93 DLQLNVDLTV-AMPCRYLTIDLRDAVGDRLHLSNSFVKDGTHFDIGKATSIKNNPSSTTP 151
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
A +I +R + + + + + S + AYR
Sbjct: 152 ---SASEIISSSRRRTPNQQSSFSGIKRLFSSSPSSSSSNRRTAQDHTAYRPT------- 201
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
D+ ++G C IYG ++V KV N H H ++FQ
Sbjct: 202 --YDKV-----------QDGPACRIYGSVQVKKVTANLHIT---------TLGHGYMSFQ 239
Query: 241 RDS---FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
N+SH +++ +FG FP + PLD P ++QYF++VVPT Y D S
Sbjct: 240 HTDHHLMNLSHVVHEFSFGPFFPAIAQPLDQSYEITLQPFTIFQYFLRVVPTTYIDASRR 299
Query: 298 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+ ++Q++VT++ RS E G+ +PG+FF YDL P+ V E S FL + +VGG
Sbjct: 300 KLITSQYAVTDYSRSFEHGK--GVPGLFFKYDLEPMSVVIRERTTSLFQFLIRLAGVVGG 357
Query: 358 VFTVSG 363
V+TV+
Sbjct: 358 VWTVAA 363
>gi|156553212|ref|XP_001600226.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Nasonia vitripennis]
Length = 391
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 108/368 (29%), Positives = 176/368 (47%), Gaps = 41/368 (11%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
I+ ++ LDA+ KI ED+ ++ GG +L S +++ L ++E +L++ + K D
Sbjct: 7 IIKVVKELDAFTKIPEDYRKQSAVGGTFSLASFCIIVYLIYAETSYFLDSRLQFKFEPDV 66
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
L++N D+T A PC + D +D S Q+L + S+ +E +
Sbjct: 67 EYDSQLQMNIDITV-ATPCDRIGADILD-STNQNL----------MTSENFHLEDTWWDL 114
Query: 124 GAPKIDKPLQR-HGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P QR H L+H Y Y A E+ ++ SN
Sbjct: 115 ------TPDQRAHFEALKHMNYYFREEYHA----------LHEL--LWKSNQLTFSN--- 153
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+ KR+ I C IYG L+VNKVAGNFH GKS H H
Sbjct: 154 -EMPKRD----YIPSYPSNACRIYGSLDVNKVAGNFHVTSGKSVILPRGHFHFTSFHSST 208
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
++N +H+IN+ +FG+ PG+++PL+G ++QYFI+VV T ++ H ++
Sbjct: 209 AYNFTHRINRFSFGKPSPGIIHPLEGDEKITTDNMMLFQYFIEVVSTD-INMLMHKSKTY 267
Query: 303 QFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
Q+SV +H R + +PG+FF YD S +K+ ++E S FL +CA VG +F
Sbjct: 268 QYSVKDHQRPINHAKGSHGIPGIFFKYDTSALKIKVSQERDSIGQFLVKLCATVGCIFVT 327
Query: 362 SGIIDAFI 369
+GI+++ +
Sbjct: 328 NGILNSIV 335
>gi|367025937|ref|XP_003662253.1| hypothetical protein MYCTH_2302675 [Myceliophthora thermophila ATCC
42464]
gi|347009521|gb|AEO57008.1| hypothetical protein MYCTH_2302675 [Myceliophthora thermophila ATCC
42464]
Length = 380
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 167/360 (46%), Gaps = 46/360 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ DA+PK + T +GG T+ + + L+LF+SEL + E V+
Sbjct: 22 VKAFDAFPKAKPQYVQHTSAGGKWTVAMAFISLILFWSELARWWRGTEEHTFAVEKGVSH 81
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L IN DV + C+ L V+ D +G++ L L + DG G +
Sbjct: 82 VLPINLDVVV-RMRCADLHVNVQDAAGDRILAAS------ALRRDPTLWAHWVDGKGVHR 134
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ + Q GR+ E Y G+ + +E + ++ RK+ P
Sbjct: 135 LGRDAQ---GRVITGEGYTGADHDEGFGEE----HVHDIVALGRKRAKWSRTP------- 180
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
R+ E + C IYG LE+NKV G+FH A G + + G H+ ++FN
Sbjct: 181 ------RLWGAEADSCRIYGSLELNKVQGDFHITARGHGYMEFGEHL------DHNAFNF 228
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMY--QYFIKVVPTVYT-----DVSGHTI 299
SH I++L+FG P +VNPLD R P+ Y QYF+ VVPT Y+ + ++
Sbjct: 229 SHIISELSFGPFLPSLVNPLD--RTVNTAPAHFYKFQYFLSVVPTTYSVGHPEERGSRSV 286
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+NQ++VTE ++ + T+PG+F YD+ PI + E SF FL V +V GV
Sbjct: 287 LTNQYAVTEQSKAVPE---NTVPGIFVKYDIEPILLNIVETRDSFFVFLIKVINVVSGVL 343
>gi|340058906|emb|CCC53277.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 394
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 109/392 (27%), Positives = 190/392 (48%), Gaps = 35/392 (8%)
Query: 3 AIMNKIRSLDAYPKINEDFY-SRTFSGGVITLVSSIVMLLLFFSE--LRLYLNAVTETKL 59
+ + K + D +PK ED+ S+T G ++++V+ ++LLL E +Y T+L
Sbjct: 20 SFLKKFEAFDFFPKPKEDYRRSQTTVGALVSVVTLALILLLVLWEGVAYIYGRDAYRTEL 79
Query: 60 LVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESR 119
VDTS + + N D++FP C+ L +D D +G +V ++ K LD+ G +
Sbjct: 80 AVDTSLTKEVVFNIDISFPQERCNELFLDVFDATGSTRFNVTMNVHKTPLDASGKSVFVG 139
Query: 120 QDGIGAPKIDKPLQRHGGRLE-HNETYCGSCYGA------ESSDEDCCNNCEEVREAYRK 172
+ D + ++ + + + +CG C+ + + C N CE+V E + +
Sbjct: 140 ERHF---HTDYTVPQYNAKFDPTSPKFCGKCFVGRKYSYLQQPETPCRNTCEQVMEEFER 196
Query: 173 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
+ A + ++QC E EE GCN G L++ K +G FAP ++
Sbjct: 197 RKLAKPSKSTVEQCIGE------LSEENPGCNYRGSLKLKKASGTLIFAP--KMFENVFR 248
Query: 233 VHDILAFQRDSFNISHKINKLAFGEHF------PGVVNPLDGVRWTQETPSGMYQYFIKV 286
++D++ FN SH INKL+ G+ GV PL+ R+ +YF+K+
Sbjct: 249 INDLM-----QFNASHVINKLSIGDDLVRRFSKRGVYFPLNNQRFVTTKQFAQVRYFMKI 303
Query: 287 VPTVY-TDVSGHTIQSN-QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 344
VPT Y +D + + + S ++SV R G + +P V F +D S ++V + SF
Sbjct: 304 VPTTYISDNTANPVASTYEYSVQWDHRQVPLGSGE-IPSVVFSFDFSSMQVNNYFQRPSF 362
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
HF+ ++C IVGG+F V G++D + R +
Sbjct: 363 CHFIVSLCGIVGGLFVVLGMVDGLVARVLRLL 394
>gi|402085784|gb|EJT80682.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 379
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 112/386 (29%), Positives = 173/386 (44%), Gaps = 49/386 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + +RT GG T+ ++ +L +SEL + V V+ G
Sbjct: 21 VSAFDAFPKSKPQYVTRTAGGGKWTVAMLVISAVLTWSELARWWRGVETHTFAVEKGVGH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+++IN DV + C L V+ D +G++ L RL DG G K
Sbjct: 81 SMQINLDVVV-HMKCDDLHVNVQDAAGDRILAAS------RLKMDPTAWAQWVDGNGVHK 133
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ GR +HN + + DE E V + L +
Sbjct: 134 L--------GRDKHNRLITNEGFEHDGHDEGFGE--EHVHDIVA----------LGKKRA 173
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
R G R+ + C ++G L++NKV G+FH A G + + G H+ D+FN
Sbjct: 174 RWGKTPRLWGSTADSCRLFGSLDLNKVQGDFHITARGHGYMEFGEHL------DHDAFNF 227
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-----GHTIQS 301
+H IN+ +FGE +P +VNPLD T +QYF+ VVPTVY+ S G TI +
Sbjct: 228 THIINEFSFGEFYPSLVNPLDRTINGANTHFHKFQYFLSVVPTVYSVKSSAGGFGSTIFT 287
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV--- 358
NQ++VTE + + +PG+FF YD+ P+ + E +FL FL V I+ G
Sbjct: 288 NQYAVTEQNAEISE---RAIPGIFFKYDIEPVLLNIEESRDTFLLFLVKVVNILSGAMVA 344
Query: 359 ----FTVSGIIDAFIYHGQRAIKKKI 380
FT++ I + +RA I
Sbjct: 345 GHWGFTMTEWIKEIMGKRRRATSGMI 370
>gi|342878666|gb|EGU79974.1| hypothetical protein FOXB_09504 [Fusarium oxysporum Fo5176]
Length = 376
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 109/376 (28%), Positives = 172/376 (45%), Gaps = 45/376 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + RT GG T+ SI+ L+L + EL + V+
Sbjct: 23 VSAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLVLIWGELGRWWRGAESHNFEVEAGVSR 82
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L+IN D+ + C + V+ D SG+ H + KRL + + D G K
Sbjct: 83 ELQINMDIVV-KMNCDDIHVNVQDASGD------HILAAKRLKADRTLWSQWVDNKGMHK 135
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ + Q GR+ Y Y E E+ ++ V ++ WA P
Sbjct: 136 LGRDSQ---GRVNTGSGYNELGYEDEGFGEEHVHDI--VALGKKRAKWA-KTPKF----- 184
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
+ C IYG L++NKV G+FH A G + +G H+ FN
Sbjct: 185 ---------RGNADSCRIYGSLDLNKVQGDFHITARGHGYRGNGEHL------DHSKFNF 229
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I++L++G +P +VNPLDG T +QY++ VVPTVY+ V+ +I +NQ++V
Sbjct: 230 SHIISELSYGPFYPSLVNPLDGTVNTAPDNFHKFQYYLSVVPTVYS-VNSKSILTNQYAV 288
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------F 359
TE ++ ++ + +PG+FF YD+ PI +T E + L V I+ GV F
Sbjct: 289 TEQSKAVDE---RYIPGIFFKYDIEPILLTVHESRDGIISLLVKVINIMSGVLVAGHWGF 345
Query: 360 TVSGIIDAFIYHGQRA 375
T+S I I +R+
Sbjct: 346 TISDWIHDVIGRRRRS 361
>gi|119497911|ref|XP_001265713.1| COPII-coated vesicle protein (Erv41), putative [Neosartorya
fischeri NRRL 181]
gi|119413877|gb|EAW23816.1| COPII-coated vesicle protein (Erv41), putative [Neosartorya
fischeri NRRL 181]
Length = 397
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 169/385 (43%), Gaps = 60/385 (15%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
++ +++ DA+PK + + + GG T++ +V SE R +L + V+
Sbjct: 19 SVKGSLKTFDAFPKTKPSYTAPSPRGGQWTVLILLVCTFFSISEFRTWLKGTEKQHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
L++N D+ +PC L V+ D SG++ L ++ K+ S ++ R
Sbjct: 79 KGISHDLQLNLDIVV-HMPCDTLDVNIQDASGDRVL--AGELLKREPTSWQLWMDKRNFE 135
Query: 123 I--GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWAL 177
I GA + Q H RL E +D + EVR RKK G L
Sbjct: 136 IYGGAHEYQTLSQEHADRLSEQE-----------ADAHVHHVLGEVRRNPRKKFAKGPKL 184
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDI 236
D +D C+ IYG LE NKV G+FH A G +H S H+
Sbjct: 185 RRGDAVDSCR-----------------IYGSLEGNKVQGDFHITARGHGYHNSAPHL--- 224
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD--- 293
+ +FN SH I +L+FG H+P ++NPLD T E YQYF+ +VPT+Y+
Sbjct: 225 ---EHKTFNFSHMITELSFGPHYPTLLNPLDKTIATTEDHYYKYQYFLSIVPTIYSKGNL 281
Query: 294 -------------VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEE 340
S + I +NQ++ T + + +PG+FF Y++ PI + +EE
Sbjct: 282 ALDTYANAPPTSRYSKNLIFTNQYAATSQSSAIPENPY-FIPGIFFKYNIEPILLMISEE 340
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGII 365
SFL L + + GV G +
Sbjct: 341 RTSFLSLLVRLVNTISGVMVTGGWL 365
>gi|297262047|ref|XP_001105686.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 2 [Macaca mulatta]
Length = 374
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 168/366 (45%), Gaps = 51/366 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
SV+ + + DV D+ LD ++ S +
Sbjct: 70 FS-------------------SVECKTSNSFPYADVGADV----LDLAETMVASADGLVY 106
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P + P Q+ R+ S E S +D + A++ AL P
Sbjct: 107 EPAVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 156
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + C I+G L VNKVAGNFH GK+ H H +
Sbjct: 157 EDDSS----------QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHE 206
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE P ++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 207 SYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT-- 264
Query: 301 SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVGG+F
Sbjct: 265 -HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIF 323
Query: 360 TVSGII 365
+ +G++
Sbjct: 324 STTGML 329
>gi|195997845|ref|XP_002108791.1| hypothetical protein TRIADDRAFT_49706 [Trichoplax adhaerens]
gi|190589567|gb|EDV29589.1| hypothetical protein TRIADDRAFT_49706 [Trichoplax adhaerens]
Length = 324
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/170 (45%), Positives = 99/170 (58%), Gaps = 4/170 (2%)
Query: 201 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 260
+ C I+G + +NKVAGNFH G S + H H R+S N SH+I+ LAFG P
Sbjct: 137 DACRIHGNIPLNKVAGNFHVTAGMSINHPMGHAHVSDLVPRESVNFSHRIDLLAFGVAAP 196
Query: 261 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ--GRL 318
V+NPLDGV + + MYQYFIK+VPT S I + Q+SVTEHF + G+
Sbjct: 197 NVINPLDGVEFITKITDKMYQYFIKIVPTKVKTFSV-AIDTYQYSVTEHFSKVDHMNGK- 254
Query: 319 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 368
+ G+FF YDLSPI V TE V F L +C IVGG+F SG+I F
Sbjct: 255 HGVSGLFFKYDLSPISVQVTEARVPFGQLLIRLCGIVGGIFATSGMIHIF 304
Score = 42.0 bits (97), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 23/89 (25%), Positives = 46/89 (51%), Gaps = 1/89 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ +++ LDA+PKI ED + SGG ++ + ++ ++ EL Y + + VD
Sbjct: 12 LQEVKKLDAFPKIAEDCKESSTSGGTASVTAFFLITIMVIMELVDYSFSGVKYNYSVDKD 71
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDIS 93
+ ++ D+T A+ C L D +D++
Sbjct: 72 IQSKMMLHLDLTI-AMKCRDLGADVLDLA 99
>gi|336472105|gb|EGO60265.1| hypothetical protein NEUTE1DRAFT_56465 [Neurospora tetrasperma FGSC
2508]
gi|350294686|gb|EGZ75771.1| DUF1692-domain-containing protein [Neurospora tetrasperma FGSC
2509]
Length = 379
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 103/355 (29%), Positives = 163/355 (45%), Gaps = 37/355 (10%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + +RT +GG T+ ++V +LF+SE + V+
Sbjct: 22 VSAFDAFPKSKPQYVTRTTAGGKWTVFVALVSFILFWSEASRWWRGSESHTFAVEKGVSH 81
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L IN D+ + C + ++ D +G++ L RL V + D G K
Sbjct: 82 ALDINLDIVV-KMKCQDIHINVQDAAGDRILAA------SRLHRDPTVWQHWVDNKGIHK 134
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ + Q G++ E Y E E+ ++ V RK WA +
Sbjct: 135 LGRDAQ---GKVVTGEGYMQGQGHDEGFGEEHVHDI--VSLGRRKAKWART--------- 180
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
R+ + C ++G LE+NKV G+FH A G + + G H+ +FN
Sbjct: 181 -----PRLWGATPDSCRVFGSLELNKVQGDFHITAKGHGYMEFGQHL------DHSAFNF 229
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I++L+FG P +VNPLD +QYFI VVPTVY+ SG +I +NQ++V
Sbjct: 230 SHIISELSFGPFLPSLVNPLDQTVNIASANFHKFQYFISVVPTVYSS-SGKSIVTNQYAV 288
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
TE S++ + +PG+F YD+ PI + EE SFL F+ V ++ G
Sbjct: 289 TEQ---SQEVTERIIPGIFVKYDIEPILLNIEEERDSFLVFIIKVVNVISGALVA 340
>gi|123425245|ref|XP_001306773.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121888365|gb|EAX93843.1| hypothetical protein TVAG_177510 [Trichomonas vaginalis G3]
Length = 353
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 117/390 (30%), Positives = 190/390 (48%), Gaps = 53/390 (13%)
Query: 8 IRSLDAYPKINED-FYSRTFSGGVITLVSSIVMLLLFFSE------LRLYLNAVTETKLL 60
+R D YPK+ +D F RT SGGV+T+++ + M+++ E + + +AV +++ +
Sbjct: 1 MRKFDIYPKVQDDSFNIRTVSGGVVTIITFLFMIIVAIKEGSSFHRVEIKQHAVVQSQYI 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
+++ E I D+T A PC +L ++ +D SG + + DI ++RLD
Sbjct: 61 KESNEIE---IFMDITV-AYPCHMLQLNVIDASGNPQPNARQDISRQRLDVHF------- 109
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETY--CGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
KPL++ + + CG+C GA S CC C ++ ++R+ +
Sbjct: 110 ---------KPLEQLISDSDPKSVFQTCGNCLGANVSK--CCLTCTDIANSFRQMEEFIP 158
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
N ++QC R+ + E+ E C I L N HF GK +G V +
Sbjct: 159 NLQNVEQCNRD----KKAIEDKETCRIVAKL-------NTHFTKGKLTIMAGGIVPTPVN 207
Query: 239 FQ------RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG-MYQYFIKVVPTVY 291
++ D+ N++H I+ L FG F G+ NPLD Q S MY Y I +VPT+
Sbjct: 208 YKFDLSHFGDNVNLTHTIHTLRFGRDFEGLKNPLDNYTNNQLKKSQFMYNYKIDLVPTIT 267
Query: 292 TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
DV I ++Q+S + + + + PG+ F +D +P+ F E S FLT +
Sbjct: 268 NDVENQ-IPAHQYSASSSSKEITKMITKKHPGITFDFDTAPVAARFIVEKQSLSSFLTQL 326
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 381
CAI+GG FT+ G ID+FI+ R KK E
Sbjct: 327 CAILGGGFTLGGFIDSFIF---RVRAKKFE 353
>gi|448105220|ref|XP_004200441.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
gi|448108351|ref|XP_004201072.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
gi|359381863|emb|CCE80700.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
gi|359382628|emb|CCE79935.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
Length = 344
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 170/365 (46%), Gaps = 58/365 (15%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD+ K+R+ DA+PKI+ R+ SGG TLV+++ +LL+ + E+ +L + + +
Sbjct: 1 MDSFSTKVRTFDAFPKIDPHKTQRSSSGGFSTLVTALFILLVTWVEIGGFLGGYVDHQFI 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD L IN D+ +PC L + MD++ ++ L G ++ +
Sbjct: 61 VDDKLTSDLFINLDM-LVGMPCEYLHTNVMDVTHDRLL-------------AGELLNFQG 106
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
P I +Q + +HN + D D E VR + G ++
Sbjct: 107 MNFFVPDI---VQMNSENNDHN-----------TPDLDEVMR-ETVRAEFNVAGTRMN-- 149
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
E+ C+IYG + VNKVAG+FH GK F + H + F+
Sbjct: 150 -----------------EDASACHIYGSIPVNKVAGDFHIT-GKGFGYADRHR---VPFE 188
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ N SH I + +FGE +P + NPLD Y+YF+ VPT+Y + G +
Sbjct: 189 K--LNFSHVIMEFSFGEFYPMIKNPLDFTGKIASQKLQSYKYFMTAVPTLYEKL-GIEVD 245
Query: 301 SNQFSVTEHFR---SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+ Q+S+TE R + E G +PG++F YD IK+ E+ + FL F+ + IV G
Sbjct: 246 TYQYSLTEQHRAITTDETGLPSDIPGLYFKYDFDTIKLLIAEKRIPFLQFVARLATIVSG 305
Query: 358 VFTVS 362
+F V+
Sbjct: 306 LFIVA 310
>gi|340914937|gb|EGS18278.1| hypothetical protein CTHT_0063020 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 388
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 105/360 (29%), Positives = 164/360 (45%), Gaps = 43/360 (11%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
++ DA+PK + +RT GG T+ S++ L+LF++EL + E V+ T
Sbjct: 30 QAFDAFPKTKSQYTTRTSGGGKWTVAMSLIALILFWAELSRWWRGTEEHTFAVEKGVART 89
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI 128
L IN D+ + C+ L V+ D +G++ L +RL + DG G ++
Sbjct: 90 LDINLDIVV-RMRCADLHVNVQDAAGDRILAA------ERLTRDPTMWVQWVDGKGVHRL 142
Query: 129 DKPLQRHGGRLEHNETYC-GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ +Q GR+ E + +G E + ++ RKK P L
Sbjct: 143 GRDVQ---GRVVTGEGWVEDEGFGEE--------HVHDIVALGRKKAKWAKTPKL---PP 188
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
R G + + C IYG LE+NKV G+FH A G + + G H +FN
Sbjct: 189 RGG--------QADSCRIYGSLELNKVQGDFHITARGHGYLEGGNAQH----LDHSAFNF 236
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-----DVSGHTIQS 301
SH I++L+FG P + NPLD +QYF+ +VPT Y+ ++ +I +
Sbjct: 237 SHIISELSFGPFLPSLSNPLDRTVNLASHHFHRFQYFLSIVPTTYSVGRPGEMGSQSIFT 296
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQ++VTE + + +PG+FF YD+ PI + E S FL V IV GV
Sbjct: 297 NQYAVTEQSHPVSE---RNIPGIFFKYDIEPILLNIVETRDSVFKFLVKVVNIVSGVLVA 353
>gi|367038975|ref|XP_003649868.1| hypothetical protein THITE_2126029 [Thielavia terrestris NRRL 8126]
gi|346997129|gb|AEO63532.1| hypothetical protein THITE_2126029 [Thielavia terrestris NRRL 8126]
Length = 380
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 157/362 (43%), Gaps = 43/362 (11%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
N + + DA+PK + +RT GG T+ +V L+LF+SEL + E V+
Sbjct: 21 NIVSAFDAFPKSKPQYVTRTSGGGKWTVAMGLVSLVLFWSELGRWWRGTEEHTFAVEKGV 80
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
L IN DV + C+ L V+ D +G++ L RL DG G
Sbjct: 81 SHVLNINLDVVV-RMRCADLHVNVQDAAGDRILAA------DRLSRDPTAWAHWVDGKGM 133
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
K+ GR G Y AE + + ++ R++ P
Sbjct: 134 HKL--------GRDAQGRVITGEGYTAEHDEGFGEEHVHDIVALGRRRAKWSRTP----- 180
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 244
R+ E + C IYG LE+NKV G+FH A G + G H+ ++F
Sbjct: 181 --------RLWGAEPDSCRIYGSLELNKVQGDFHITARGHGYMAFGDHL------DHNAF 226
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-----DVSGHTI 299
N SH I++L+FG P + NPLD +QYF+ VVPT Y+ + +I
Sbjct: 227 NFSHIISELSFGPFLPSLANPLDRTVNIATAHFHKFQYFLSVVPTTYSVGRPGALGARSI 286
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+NQ++VTE S++ T+PG+F YD+ PI + E F FL V +V GV
Sbjct: 287 FTNQYAVTEQ---SQEVPDTTIPGIFVKYDIEPILLNIVETRDGFFVFLLRVINVVSGVL 343
Query: 360 TV 361
Sbjct: 344 VA 345
>gi|343476464|emb|CCD12449.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 224
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 73/228 (32%), Positives = 116/228 (50%), Gaps = 14/228 (6%)
Query: 5 MNKIRSLDAYPKIN----EDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + LD +PK + +D RT GGV+++ S + + LL E+R +L V + ++
Sbjct: 1 MKRFSRLDVFPKFDARFEQDARQRTALGGVLSIASMVAIALLIIGEVRYFLTTVEQHEMY 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD G T+ + ++TFP +PC +++ DA+D GE D+ D K R+DS
Sbjct: 61 VDPRIGGTMHVVINITFPRVPCDLMTADAIDAFGEYVEDMGRDTVKMRVDS--------- 111
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
D + +PL + + C SCYGAE + DCC+ C++VR A+ ++ W
Sbjct: 112 DTLAPLGEARPLVNMNKKATSDTHDCPSCYGAEKNPGDCCHTCDDVRRAFAERQWEFHED 171
Query: 181 DL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH 227
D+ I QC +E EGCN++ V +V N HF PG+ F+
Sbjct: 172 DVSIMQCAKERLQMAASTASREGCNLHSSFRVPRVTENIHFVPGRMFY 219
>gi|85101064|ref|XP_961083.1| hypothetical protein NCU04293 [Neurospora crassa OR74A]
gi|11611445|emb|CAC18610.1| conserved hypothetical protein [Neurospora crassa]
gi|28922621|gb|EAA31847.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 379
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 162/355 (45%), Gaps = 37/355 (10%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + +RT +GG T+ ++ +LF+SE + V+
Sbjct: 22 VSAFDAFPKSKPQYVTRTTAGGKWTVFVGLISFILFWSEASRWWRGSESHTFAVEKGVSH 81
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L IN D+ + C + ++ D +G++ L RL V + D G K
Sbjct: 82 ALDINLDIVV-KMKCQDIHINVQDAAGDRILAA------SRLHRDPTVWQHWVDNKGIHK 134
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ + Q G++ E Y E E+ ++ V RK WA +
Sbjct: 135 LGRDAQ---GKVVTGEGYMQGQGHDEGFGEEHVHDI--VSLGRRKAKWART--------- 180
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
R+ + C ++G LE+NKV G+FH A G + + G H+ +FN
Sbjct: 181 -----PRLWGATPDSCRVFGSLELNKVQGDFHITAKGHGYMEFGQHL------DHSAFNF 229
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I++L+FG P +VNPLD +QYFI VVPTVY+ SG +I +NQ++V
Sbjct: 230 SHIISELSFGPFLPSLVNPLDQTVNIASANFHKFQYFISVVPTVYSS-SGKSIVTNQYAV 288
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
TE S++ + +PG+F YD+ PI + EE SFL F+ V ++ G
Sbjct: 289 TEQ---SQEVTERIIPGIFVKYDIEPILLHIDEERDSFLVFIIKVVNVISGALVA 340
>gi|405968654|gb|EKC33703.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Crassostrea gigas]
Length = 345
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 98/295 (33%), Positives = 144/295 (48%), Gaps = 18/295 (6%)
Query: 103 DIFKKRLDSQGNVIESRQDGIGAPKIDKPLQ-RHG-GRLEHNETYCGSCYGAESSDEDCC 160
D F K LD S IGA +D Q HG G L++ ET+ + +
Sbjct: 20 DAFPKVLDDCQEKTASGGGTIGADVLDVTGQDTHGFGELKYEETH----FELSPNQRHYH 75
Query: 161 NNCEEVREAYRKKGWALSNPDLIDQ----CKREGFLQRIKEEEGE--GCNIYGFLEVNKV 214
+E+ E R + AL + + + + G +R EGE C +YG LEVNKV
Sbjct: 76 ETVQEISEFLRSEYHALQDVMWMSRGLIATYKTGMPKREIPAEGEPDACRVYGSLEVNKV 135
Query: 215 AGNFHFAPGKS---FHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRW 271
AGNFH GKS F + H H + +N SH+I+ +FGE G++NPLDG
Sbjct: 136 AGNFHITAGKSVPVFPRG--HAHISMMVHEKEYNFSHRIDHFSFGESVKGIINPLDGEEQ 193
Query: 272 TQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDL 330
++ YFIK+VPT + I + QFSVT+ R+ + +PG+F YDL
Sbjct: 194 VSSDNFHVFNYFIKIVPTEVRTYAAGNIDTYQFSVTQRNRTINHSKGSHGVPGIFVKYDL 253
Query: 331 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
+ +K+ E+H F FL +C IVGG+F VSG++ + + K ++GK+
Sbjct: 254 NALKIRVVEKHRPFSQFLIRLCGIVGGIFAVSGMLHNWTEFFMEVVCCKFKLGKY 308
>gi|336269097|ref|XP_003349310.1| hypothetical protein SMAC_05593 [Sordaria macrospora k-hell]
gi|380089883|emb|CCC12416.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 379
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/358 (27%), Positives = 162/358 (45%), Gaps = 44/358 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + +RT +GG T+ +++ +LF+SE + V+
Sbjct: 23 VSAFDAFPKSKPQYVTRTTAGGKWTVFVTLISFILFWSEASRWWRGTESHTFAVEKGVSH 82
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+L IN D+ + C + ++ D +G++ L +L V + D G K
Sbjct: 83 SLDINLDIVV-KMKCQDIHINVQDAAGDRILAA------SKLHRDPTVWQHWVDNKGIHK 135
Query: 128 IDKPLQRHGGRLEHNETYC---GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ + Q G++ E Y +G E + + + A + W + PD
Sbjct: 136 LGRDAQ---GKVVTGEDYLQGHDEGFGEEHVHDIVALGRKRAKWARTPRLWG-ATPD--- 188
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDS 243
C ++G LE+NKV G+FH A G + + G H+ +
Sbjct: 189 -----------------SCRVFGSLELNKVQGDFHITAKGHGYMEFGQHL------DHSA 225
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
FN SH I++L++G P +VNPLD + +QYFI VVPTVY+ G +I +NQ
Sbjct: 226 FNFSHIISELSYGPFLPSLVNPLDQTVNLATSNFHKFQYFISVVPTVYSVSGGRSIVTNQ 285
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
++VTE S++ + +PG+F YD+ PI + EE SFL FL V ++ G
Sbjct: 286 YAVTEQ---SQEVTERIIPGIFVKYDIEPILLNIVEERDSFLLFLIKVVNVISGALVA 340
>gi|407927953|gb|EKG20833.1| protein of unknown function DUF1692 [Macrophomina phaseolina MS6]
Length = 366
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 165/366 (45%), Gaps = 58/366 (15%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A +++ DA+PK + + +T G TL+ + + L +E R + T V+
Sbjct: 15 APKGALQAFDAFPKTKKTYLQQTTQGANWTLLLIVTCVWLSITETRRWWTGETSHTFSVE 74
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
G ++IN D+ A+ C L V+ D SG++ L V ++ D
Sbjct: 75 KGVGHEMQINLDIVV-AMRCRDLHVNIQDASGDRIL--------------AGVALAKDDT 119
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
++K H E S E + E+V D
Sbjct: 120 RWLQWVEKSKNVHK---------------LERSQEQKRYDEEDVH-------------DY 151
Query: 183 IDQCKREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQ 240
+ K + F + + + C IYG L+ N+V G+FH A G + + G H+
Sbjct: 152 LGASKSKKFPKTPRYRGVPDSCRIYGSLDANRVQGDFHITARGHGYMEFGEHL------D 205
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGH 297
FN SH+IN+L+FG ++P + NPLD R TP +QY++ VVPTVYTD S H
Sbjct: 206 HSQFNFSHQINELSFGPYYPSLTNPLDYTRAVTPTPDDHFYKFQYYLSVVPTVYTDNS-H 264
Query: 298 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
TI +NQ++VTE S + ++PGVF +D+ PIK+T +E + FL L + +V G
Sbjct: 265 TIVTNQYAVTEQSHSVPE---MSVPGVFVKFDIEPIKLTISEYNGGFLALLIRLVNVVSG 321
Query: 358 VFTVSG 363
V G
Sbjct: 322 VMVAGG 327
>gi|408393109|gb|EKJ72376.1| hypothetical protein FPSE_07400 [Fusarium pseudograminearum CS3096]
Length = 376
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 170/376 (45%), Gaps = 45/376 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + RT GG T+ SI+ L+L + EL + V+
Sbjct: 23 VAAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLILIWGELGRWWRGAESHNFEVEAGVSR 82
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
++IN D+ + C + V+ D SG++ + K RL + + D G K
Sbjct: 83 EMQINLDIVV-KMSCDDIHVNVQDASGDRIMAAK------RLHTDKTLWGQWADNKGVHK 135
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ + Q GR+ + Y Y E E+ ++ V ++ WA P
Sbjct: 136 LGRDDQ---GRVNTGQGYNDPKYEDEGFGEEHVHDI--VALGKKRAKWA-KTPRF----- 184
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
+ C IYG L++NKV G+FH A G + G H+ FN
Sbjct: 185 ---------RGNADSCRIYGSLDLNKVQGDFHITARGHGYMGHGEHL------DHSKFNF 229
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I++L++G +P + NPLDG T + +QY++ VVPTVY+ V+ +I +NQ++V
Sbjct: 230 SHIISELSYGPFYPSLENPLDGTVNTADGNFHKFQYYLSVVPTVYS-VNSRSILTNQYAV 288
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------F 359
TE ++ + + +PG+FF YD+ PI +T E + + I+ GV F
Sbjct: 289 TEQSKAVDD---RYIPGIFFKYDIEPILLTVHESRDGIISLFVKIINIISGVLVAGHWGF 345
Query: 360 TVSGIIDAFIYHGQRA 375
T+S I I +R+
Sbjct: 346 TISDWIHDVIGRRRRS 361
>gi|123472317|ref|XP_001319353.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121902134|gb|EAY07130.1| hypothetical protein TVAG_342940 [Trichomonas vaginalis G3]
Length = 358
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 176/375 (46%), Gaps = 50/375 (13%)
Query: 8 IRSLDAYPKINEDFYSRT-FSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
++ D +PK+ ED +T FSG V + +I+ LL F L ++ + + KL+VD ++
Sbjct: 3 LKDFDFFPKVFEDHSRKTDFSGTVTVVCLAIMSYLLVFQTLG-FIASPPKQKLVVDQAKL 61
Query: 67 ET-------------LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG 113
L+I D+ FP+LPC ++ +D E D + KR+ G
Sbjct: 62 PVNEDNVLDWPFVPKLQIYIDIEFPSLPCPVIDFQVLDRFEEIQSDSFSKVKLKRIGPDG 121
Query: 114 NVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK 173
+I+++ K +KP CGSCYGA S CCN C++V+ A++KK
Sbjct: 122 KIIKNK-------KTEKP------------EVCGSCYGAASG---CCNTCKDVKNAFKKK 159
Query: 174 GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 233
G + I QC R+ + E C++YG + V G G S+
Sbjct: 160 GRVPPSLSTIRQC-RDAVID-YNHIRNESCHVYGTVIVPPTHGTIVMNSGDSYGAQMNTT 217
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQ--YFIKVVPTVY 291
L D FN +HKIN + GE+ G +PL G++ Q+ G Y+ YFI+ +
Sbjct: 218 TSSLGISIDDFNFTHKINDIYIGENDLG-DHPLKGIKKVQKE-VGRYKGLYFIRTLREQK 275
Query: 292 TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
+ + S+ H+ +G PG++F YD+SPI V + + + L+F+ +
Sbjct: 276 GSLQVYRATSS------HYDRYREGTTGKFPGLYFNYDVSPIIVMYKRD-TTVLNFVIEL 328
Query: 352 CAIVGGVFTVSGIID 366
AI+GG++++ ++D
Sbjct: 329 MAILGGIYSLGSLLD 343
>gi|320591987|gb|EFX04426.1| copii-coated vesicle protein [Grosmannia clavigera kw1407]
Length = 385
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 166/362 (45%), Gaps = 46/362 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ DA+PK + RT GG T+ +V LLLF++ELR + E V G
Sbjct: 26 VQAFDAFPKAKPQYVQRTAGGGKWTVAMIVVSLLLFWTELRRWWAGSQEHTFAVAKGVGH 85
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+++IN D+ + C L ++ D +G++ + +L D G +
Sbjct: 86 SMQINMDIVVK-MRCDDLHINVQDAAGDRIMAAA------KLQRDATTWAQWVDHGGNHR 138
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ + Q GR+ E + + E E+ ++ V RK W
Sbjct: 139 LGRDTQ---GRMITGEGWT-TLPHEEGFGEEHVHDI--VALGRRKARW------------ 180
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
G R++ + C I+G L++N+V G++H A G + + G H+ SFN
Sbjct: 181 --GKTPRLRGAAPDSCRIFGSLDLNRVQGDYHITARGHGYMEMGDHL------DHTSFNF 232
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMY--QYFIKVVPTVYT-----DVSGHTI 299
SH +N+L+FG +P +VNPLD E + Y QYF+ +VPTVY+ S +I
Sbjct: 233 SHVVNELSFGPFYPSLVNPLDQT--VNEATANFYRFQYFMSIVPTVYSVGHAGSRSARSI 290
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+NQ++VTE +Q + +PG+FF YD+ PI + E FL F+ + ++ G
Sbjct: 291 VTNQYAVTEQSAEIDQ---RAIPGIFFKYDIEPILLYIEESRDGFLVFVLKIVNVLSGAL 347
Query: 360 TV 361
Sbjct: 348 VA 349
>gi|46137745|ref|XP_390564.1| hypothetical protein FG10388.1 [Gibberella zeae PH-1]
Length = 376
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 170/376 (45%), Gaps = 45/376 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + RT GG T+ SI+ L+L + EL + V+
Sbjct: 23 VAAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLILIWGELGRWWRGAESHNFEVEAGVSR 82
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
++IN D+ + C + V+ D SG++ + K RL + + D G K
Sbjct: 83 EMQINLDIVV-KMNCDDIHVNVQDASGDRIMAAK------RLHTDKTLWGQWADNKGVHK 135
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ + Q GR+ + Y Y E E+ ++ V ++ WA P
Sbjct: 136 LGRDDQ---GRVNTGQGYNDPKYEDEGFGEEHVHDI--VALGKKRAKWA-KTPRF----- 184
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
+ C IYG L++NKV G+FH A G + G H+ FN
Sbjct: 185 ---------RGNADSCRIYGSLDLNKVQGDFHITARGHGYMGHGEHL------DHSKFNF 229
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I++L++G +P + NPLDG T + +QY++ VVPTVY+ V+ +I +NQ++V
Sbjct: 230 SHIISELSYGPFYPSLENPLDGTVNTADGNFHKFQYYLSVVPTVYS-VNSRSILTNQYAV 288
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------F 359
TE ++ + + +PG+FF YD+ PI +T E + + I+ GV F
Sbjct: 289 TEQSKAVDD---RYIPGIFFKYDIEPILLTVHESRDGIISLFVKIINIISGVLVAGHWGF 345
Query: 360 TVSGIIDAFIYHGQRA 375
T+S I I +R+
Sbjct: 346 TISDWIHDVIGRRRRS 361
>gi|241953329|ref|XP_002419386.1| COPii-coated vesicle-associated protein, putative [Candida
dubliniensis CD36]
gi|223642726|emb|CAX42980.1| COPii-coated vesicle-associated protein, putative [Candida
dubliniensis CD36]
Length = 345
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 109/395 (27%), Positives = 176/395 (44%), Gaps = 73/395 (18%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD+ K+++ DA+PK++ R+ GG+ TLV+ LL+ + E+ Y+ + +
Sbjct: 1 MDSFAQKVKTFDAFPKVDPHHQVRSQRGGLSTLVTYFCGLLILWIEIGGYIGGYVDRQFT 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD L IN D+ A+PC + + DI+ + +L G +
Sbjct: 61 VDDQIRSDLTINIDMIV-AMPCQFIHTNVEDITHDTYL-------------AGETLNFEG 106
Query: 121 DGIGAP---KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
P KI+ P H E+ D D E +R +R +G +
Sbjct: 107 IHFFVPDSFKINNPNDFH-----------------ETPDLDEVMQ-ESLRAEFRSEGARV 148
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSF-HQSGVHVHDI 236
+ E C+I+G + VN+V G+F GK F ++ HV
Sbjct: 149 N-------------------EGAPACHIFGSIPVNQVRGDFRIT-GKGFGYRDRSHV--- 185
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
+S N SH I + +FGE +P + NPLD E Y Y+ KVVPT+Y + G
Sbjct: 186 ---PFESLNFSHVIQEFSFGEFYPYLNNPLDATGKITEERLQTYMYYAKVVPTLYEQL-G 241
Query: 297 HTIQSNQFSVTE--HFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
I +NQ+S+TE H +Q R +PG++F YD PIK+ E+ + F F+ +
Sbjct: 242 LEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQFIAKLA 301
Query: 353 AIVGGVFTVSGII------DAFIYHGQRAIKKKIE 381
I GG+ +G + FI++GQ+A+++ E
Sbjct: 302 TIGGGLLIAAGYLFRLYEKLLFIFYGQKAVQQNRE 336
>gi|332373256|gb|AEE61769.1| unknown [Dendroctonus ponderosae]
Length = 382
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 171/370 (46%), Gaps = 47/370 (12%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+N+++ +D +PK+ + + + GG +++S +++ L +SE+ YLN+ K D
Sbjct: 16 LNRVKKMDIFPKVEDPYKMTSSVGGTFSIISFLIIGWLVYSEISYYLNSKFVFKFSPDVQ 75
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDV----KHDIFKKRLDSQGNVIESRQ 120
+ L +N D+T A+PCS L D +D + + + D + + D+Q E ++
Sbjct: 76 LEDKLDMNIDITV-AMPCSKLGTDVLDSTNQNTYKFGTLKQDDTWFELSDNQKVHFEHKK 134
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
H +Y Y A +++ K ++
Sbjct: 135 --------------------HFNSYLREEYHA-------------IKDLLWKNSFSTQFG 161
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
DL + + C IYG L +NKVAGNF + GK + +
Sbjct: 162 DLPPR-------DHTPSRPHDACRIYGTLGLNKVAGNFLISGGKRYMFGLGYQQFRTLIS 214
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+N +H+IN+ +FG PG+V+PL+G P + YFI++VPT + +TI
Sbjct: 215 EGEYNFTHRINRFSFGHSSPGIVHPLEGDELILPDPMTVVNYFIEIVPTT-VNTFMYTIS 273
Query: 301 SNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+ Q+SV E R + + P ++F YD+S ++VT ++E FL +C+IVGGV+
Sbjct: 274 TYQYSVKELTRPIDHNKGSHGTPAIYFKYDMSALRVTVSQERDHLGMFLARLCSIVGGVY 333
Query: 360 TVSGIIDAFI 369
SGI+++ +
Sbjct: 334 VCSGILNSIV 343
>gi|68465583|ref|XP_723153.1| likely COPII secretory vesicle component [Candida albicans SC5314]
gi|68465876|ref|XP_723006.1| likely COPII secretory vesicle component [Candida albicans SC5314]
gi|46445018|gb|EAL04289.1| likely COPII secretory vesicle component [Candida albicans SC5314]
gi|46445174|gb|EAL04444.1| likely COPII secretory vesicle component [Candida albicans SC5314]
Length = 345
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 108/395 (27%), Positives = 176/395 (44%), Gaps = 73/395 (18%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD+ K+++ DA+PK++ R+ GG+ TL++ LL+ + E+ Y+ + +
Sbjct: 1 MDSFAQKVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFCGLLILWIEIGGYIGGYVDRQFT 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD L IN D+ A+PC + + DI+ + +L G +
Sbjct: 61 VDDQIRSALTINVDMIV-AMPCQFIHTNVEDITHDTYL-------------AGETLNFEG 106
Query: 121 DGIGAP---KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
P KI+ P H E+ D D E +R +R +G +
Sbjct: 107 IHFFVPDSFKINNPNDFH-----------------ETPDLDEVMQ-ESLRAEFRSEGARV 148
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSF-HQSGVHVHDI 236
+ E C+I+G + VN+V G+F GK F ++ HV
Sbjct: 149 N-------------------EGAPACHIFGSIPVNQVRGDFRIT-GKGFGYRDRSHV--- 185
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
+S N SH I + +FGE +P + NPLD E Y Y+ KVVPT+Y + G
Sbjct: 186 ---PFESLNFSHVIQEFSFGEFYPYLNNPLDATGKVTEERLQTYMYYAKVVPTLYEQL-G 241
Query: 297 HTIQSNQFSVTE--HFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
I +NQ+S+TE H +Q R +PG++F YD PIK+ E+ + F F+ +
Sbjct: 242 LEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQFIAKLA 301
Query: 353 AIVGGVFTVSGII------DAFIYHGQRAIKKKIE 381
I GG+ +G + FI++GQ+A+++ E
Sbjct: 302 TIGGGLLIAAGYLFRLYEKLLFIFYGQKAVQQNRE 336
>gi|328771759|gb|EGF81798.1| hypothetical protein BATDEDRAFT_86854 [Batrachochytrium
dendrobatidis JAM81]
Length = 333
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 169/383 (44%), Gaps = 66/383 (17%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ ++ SLDA+PKI + T SGG+++L+ V++ L +E+ + + +VD
Sbjct: 10 LSKRLASLDAFPKIEKQLQQTTKSGGLVSLMMLAVLVYLACTEIYRWRSIDQRYDFIVDQ 69
Query: 64 SRGE--TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
+R +L+IN D+T A+ C +L D DIS + K + + V ++
Sbjct: 70 TRSHEHSLQINVDLTI-AMDCKVLRADIQDISRTSL------VLKDAIHATPTVFRTQ-- 120
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
GA K + EHN+ Y + KG S+ D
Sbjct: 121 --GAVKYTR---------EHNQ-YIAQIH----------------------KGLRDSSRD 146
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQ 240
L D G + C G + NKV G HF A G + GVH
Sbjct: 147 LEDHASESG--------TPDACRFRGSFQANKVEGMLHFTALGHGYF--GVHT------P 190
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS----G 296
D+ N +H+I++L+FG +P + NPLD T + YF+ VVPT+Y D + G
Sbjct: 191 HDAINFTHRIDELSFGARYPDLHNPLDHTLEIGTTNFDSFMYFLGVVPTIYVDKARSLFG 250
Query: 297 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
T+ +NQ++VTE + + LPG+F Y + PI V TE + + F T +C I+G
Sbjct: 251 ATLLTNQYAVTEFSHAVDPQNPDALPGIFIKYHIEPISVRITESRLGLVQFTTRMCGIIG 310
Query: 357 GVFTVSGIIDAFIYHGQRAIKKK 379
G F G I F + + + K
Sbjct: 311 GAFVTIGAILGFFRNVRTMLSAK 333
>gi|398412138|ref|XP_003857398.1| hypothetical protein MYCGRDRAFT_66006 [Zymoseptoria tritici IPO323]
gi|339477283|gb|EGP92374.1| hypothetical protein MYCGRDRAFT_66006 [Zymoseptoria tritici IPO323]
Length = 407
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 108/389 (27%), Positives = 172/389 (44%), Gaps = 75/389 (19%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I++ DA+PK + +T SGGV TLV + +L ++E+ + + T V+ G
Sbjct: 23 IKAFDAFPKTKPSYTQQTSSGGVWTLVLIALSTVLAYTEVTRWWSGTTTHSFSVEQGVGH 82
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHL---DVKHDIFKKRLDSQGNVIESRQDGIG 124
L+IN D+ A+ C + ++ D +G++ L VK D RL + + + +G
Sbjct: 83 DLQINVDLVV-AMKCEDIHINVQDAAGDRVLVDKAVKEDPTLFRLWGENHGAHT----LG 137
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
A D+ L+ G R+ AE +ED + R
Sbjct: 138 ASLKDR-LEVDGNRIVQ----------AEYEEEDVHDYLSLARGG--------------- 171
Query: 185 QCKREGFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRD 242
KR + R + EE + C IYG + NKV G+FH A G + H+
Sbjct: 172 --KRYQYTPRTPRNEEADSCRIYGSMHSNKVQGDFHITARGHGYMAYSQHL------DHS 223
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT---------- 292
+FN SH IN+L+FG ++P +VNPLD E +QY++ VVPT+YT
Sbjct: 224 AFNFSHHINELSFGPYYPKLVNPLDSTYARTEAHFHKFQYYLSVVPTIYTVDVNALKRMD 283
Query: 293 ------------------DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIK 334
V+ H++ +NQ++VTE S + +PG+FF YD+ P++
Sbjct: 284 SKYETPSSGDDGLNQHPRRVTQHSVFTNQYAVTEQSHSVPENH---VPGIFFKYDIEPLQ 340
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
+T EE S L + +V G+ G
Sbjct: 341 LTIAEEWTSVPALLLRIVNVVSGLLVAGG 369
>gi|238880883|gb|EEQ44521.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 345
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 108/395 (27%), Positives = 176/395 (44%), Gaps = 73/395 (18%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD+ K+++ DA+PK++ R+ GG+ TL++ LL+ + E+ Y+ + +
Sbjct: 1 MDSFSQKVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFCGLLILWIEIGGYIGGYVDRQFT 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD L IN D+ A+PC + + DI+ + +L G +
Sbjct: 61 VDDQIRSALTINVDMIV-AMPCQFIHTNVEDITHDTYL-------------AGETLNFEG 106
Query: 121 DGIGAP---KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
P KI+ P H E+ D D E +R +R +G +
Sbjct: 107 IHFFVPDSFKINNPNDFH-----------------ETPDLDEVMQ-ESLRAEFRSEGARV 148
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSF-HQSGVHVHDI 236
+ E C+I+G + VN+V G+F GK F ++ HV
Sbjct: 149 N-------------------EGAPACHIFGSIPVNQVRGDFRIT-GKGFGYRDRSHV--- 185
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
+S N SH I + +FGE +P + NPLD E Y Y+ KVVPT+Y + G
Sbjct: 186 ---PFESLNFSHVIQEFSFGEFYPYLNNPLDATGKVTEERLQTYMYYAKVVPTLYEQL-G 241
Query: 297 HTIQSNQFSVTE--HFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
I +NQ+S+TE H +Q R +PG++F YD PIK+ E+ + F F+ +
Sbjct: 242 LEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQFIAKLA 301
Query: 353 AIVGGVFTVSGII------DAFIYHGQRAIKKKIE 381
I GG+ +G + FI++GQ+A+++ E
Sbjct: 302 TIGGGLLIAAGYLFRLYEKLLFIFYGQKAVQQNRE 336
>gi|302882273|ref|XP_003040047.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256720914|gb|EEU34334.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 376
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 107/376 (28%), Positives = 167/376 (44%), Gaps = 45/376 (11%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + RT GG T+ SI+ L+L + E + V+ G
Sbjct: 23 VSAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLILIWGEAARWWRGAESHNFEVEAGVGR 82
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L+IN D+ + C + V+ D SG++ + K K L SQ D G K
Sbjct: 83 ELQINLDIVV-RMQCDDIHVNVQDASGDRIMAAKRLRHDKTLWSQ------WVDSKGMHK 135
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ + Q GR+ T G + + ++ RKK P +
Sbjct: 136 LGRDSQ---GRV---VTQSGWNDLGYEEEGFGEEHVHDIVALGRKKAKWAKTPKV----- 184
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
+ + C +YG L +NKV G+FH A G + +G H +FN
Sbjct: 185 ---------KGRADSCRVYGSLHLNKVQGDFHITARGHGYMGNGEH------LDHKNFNF 229
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I++L++G +P +VNPLDG +QY++ +VPTVY+ V +I +NQ++V
Sbjct: 230 SHIISELSYGPFYPSLVNPLDGTVNAASDNFHKFQYYLSIVPTVYS-VGSRSILTNQYAV 288
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------F 359
TE +S + +PG+FF YD+ PI +T E L FL + IV GV F
Sbjct: 289 TEQSKSVNE---HYIPGIFFKYDIEPILLTVHESRDGILTFLVKIINIVSGVLVAGHWGF 345
Query: 360 TVSGIIDAFIYHGQRA 375
T+S + I +R+
Sbjct: 346 TISDWVKDVIGRRRRS 361
>gi|294655234|ref|XP_457337.2| DEHA2B08778p [Debaryomyces hansenii CBS767]
gi|199429792|emb|CAG85341.2| DEHA2B08778p [Debaryomyces hansenii CBS767]
Length = 354
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 175/371 (47%), Gaps = 63/371 (16%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M++ K+R+ DA+PK++ + R+ GG TLV+ + LL+ + E+ +L + +
Sbjct: 1 MESFTTKVRTFDAFPKVDAEHTVRSSRGGFSTLVTIVCGLLILWVEIGGFLGGYVDHQFT 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKR--LDSQGNVIES 118
+D L +N D+ A+PC L + MDI+ ++ L + F+ Q I S
Sbjct: 61 IDDKVKSDLSLNIDM-LVAMPCEFLHTNVMDITDDRFLAGELLNFEGTNFFLPQHFEINS 119
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
+ P +D +Q E +R +R G ++
Sbjct: 120 KNTDHDTPDLDHVMQ------------------------------ETLRAEFRVAGARVN 149
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSF-HQSGVHVHDIL 237
E C+I+G + VN+V G+FH GK F + G ++
Sbjct: 150 -------------------EGAPACHIFGSIPVNQVKGDFHIT-GKGFGYNDG---RSVV 186
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
F+ + N +H I++ ++G+ +P + NPLD E Y+Y+ KVVPT+Y + G
Sbjct: 187 PFE--ALNFTHVISEFSYGDFYPFINNPLDFTGKVTEQKLQAYKYYSKVVPTIYEKL-GM 243
Query: 298 TIQSNQFSVTEH---FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
I +NQ+S+TE ++ + ++ +PG+FF Y+ PIK+ +E+ + F+ F++ + I
Sbjct: 244 IIDTNQYSLTEQHNVYKVNRFNNVEGIPGIFFKYEFEPIKLIISEKRIPFIQFVSRLATI 303
Query: 355 VGGVFTVSGII 365
+GG+ V+G +
Sbjct: 304 IGGLLIVAGYL 314
>gi|358374656|dbj|GAA91246.1| COPII-coated vesicle protein [Aspergillus kawachii IFO 4308]
Length = 399
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/388 (27%), Positives = 172/388 (44%), Gaps = 64/388 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +++ DA+PK + + + GG T++ ++ + FSE R +LN V+
Sbjct: 19 GLQGGLKTFDAFPKTKPSYTAPSRRGGQWTVLILVICTVFTFSEFRTWLNGSENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE--SRQ 120
G L++N D+ +PC L V+ D SG++ L D+ ++ S ++ +R+
Sbjct: 79 KGVGHDLQLNLDLVV-RMPCDTLDVNIQDASGDRIL--AGDLLQRERTSWKLWMDKRNRE 135
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRK---KGWAL 177
G + Q R+ A +D + EVR+ R+ KG L
Sbjct: 136 TSGGVHEYQTLSQEDSDRIS-----------AREADAHVHHVLGEVRKNPRRKFAKGPRL 184
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHV-HD 235
D +D C+ IYG LE NKV G+FH A G + G H+ H
Sbjct: 185 RRGDTVDSCR-----------------IYGSLEGNKVQGDFHITARGHGYRNFGEHLDHG 227
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY---- 291
+ FN SH + +L+FG H+P ++NPLD T ET YQYF+ VVPT+Y
Sbjct: 228 V-------FNFSHMVTELSFGPHYPTLLNPLDKTIATTETHYYKYQYFLSVVPTLYSKGA 280
Query: 292 --------------TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTF 337
T+ + + + +NQ++ T + + +PG+FF Y++ PI +
Sbjct: 281 SALDTYTNHPDLIATNRNRNLVFTNQYAATTQAQELPENPY-FIPGIFFKYNIEPILLMI 339
Query: 338 TEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+EE SFL L + V GV G I
Sbjct: 340 SEERTSFLSLLIRLVNTVSGVMVTGGWI 367
>gi|452988546|gb|EME88301.1| hypothetical protein MYCFIDRAFT_25415 [Pseudocercospora fijiensis
CIRAD86]
Length = 380
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 160/371 (43%), Gaps = 59/371 (15%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ +++ DA+PK + RT +GG+ T+ + L L +SEL + T V+
Sbjct: 20 LSAVKAFDAFPKTKPSYQERTSTGGIWTVTLILASLFLTWSELARWWKGSTTHTFSVEQG 79
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
G L+IN D+ + C L V+ D +G++ L G+V + +D
Sbjct: 80 IGHDLQINLDMVV-MMNCEDLHVNVQDAAGDRIL-------------AGSVFQ--KDPTI 123
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYR-------KKGWAL 177
+ DK L+ H L H++ G + +ED N + R +GW
Sbjct: 124 WTRWDKKLKAHA--LGHDKQERLGEAGKDYKEEDVHNYLSVAHHSKRFPKTPKIPRGWT- 180
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDI 236
+ C IYG + NKV G+FH A G + + H+
Sbjct: 181 ----------------------ADSCRIYGTMHGNKVQGDFHITARGHGYLEFAEHL--- 215
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY-TDVS 295
FN SH+IN+L+FG +P + NPLD T + +QYF+ VVPTVY TD
Sbjct: 216 ---DHSKFNFSHRINELSFGPFYPSLENPLDNTFATTDINYYKFQYFLSVVPTVYTTDAR 272
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQT---LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+ N F T + +EQ R + +PG+F +D+ PI +T EE SF +
Sbjct: 273 ALRLLDNNFVFTNQYAVTEQSRKVSENFVPGIFIKFDMEPIGLTIAEEWSSFPALFIRIV 332
Query: 353 AIVGGVFTVSG 363
+V G+ G
Sbjct: 333 NVVSGLLVAGG 343
>gi|121710902|ref|XP_001273067.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
clavatus NRRL 1]
gi|119401217|gb|EAW11641.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
clavatus NRRL 1]
Length = 401
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 167/385 (43%), Gaps = 65/385 (16%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ DA+PK + + + GG T++ ++ SE R +L + V+
Sbjct: 24 LKIFDAFPKTKPSYTAPSHRGGQWTVLILLICTFFSLSEFRAWLRGTEKHHFSVEKGISH 83
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI--GA 125
L++N D+ +PC L V+ D SG++ L ++ ++ S +E R I GA
Sbjct: 84 DLQLNLDIVV-DMPCESLDVNIQDASGDRIL--AGELLQRERTSWNLWMEKRNYEIHGGA 140
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWALSNPDL 182
+ Q HG RL E D + EVR RKK G L D+
Sbjct: 141 HEYQTLNQEHGDRLAEQE-----------QDAHVHHVLGEVRRNPRKKFPRGPRLRRGDV 189
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQR 241
+D C+ IYG LE NKV G+FH A G +H + H+ +
Sbjct: 190 VDSCR-----------------IYGSLEGNKVQGDFHITARGHGYHAAAPHL------EH 226
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT--------- 292
+FN SH + +L+FG H+P ++NPLD T E YQYF+ VVPT+Y+
Sbjct: 227 STFNFSHMVTELSFGPHYPTILNPLDKTIATTEEHYYKYQYFLSVVPTIYSKGNLALDAY 286
Query: 293 ------------DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEE 340
+ + + I +NQ++ T + + +PG+FF Y + PI + +EE
Sbjct: 287 SGSAPTLHDPNRNRNRNLIFTNQYAATSQSTALPESPY-FVPGIFFKYSIEPILLIISEE 345
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGII 365
SFL L + V GV G +
Sbjct: 346 RGSFLTLLVRLVNTVSGVIVTGGWL 370
>gi|429862433|gb|ELA37083.1| copii-coated vesicle protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 375
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 97/362 (26%), Positives = 162/362 (44%), Gaps = 47/362 (12%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
N + + DA+PK + +RT GG T+ +++ L LF++E+ + V+
Sbjct: 21 NIVSAFDAFPKAKPQYVTRTSGGGKWTVAMAVISLFLFWTEVGRWWRGSETHTFAVEKGV 80
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
G ++IN D+ + C L ++ D +G++ L +L D G
Sbjct: 81 GHEMQINLDIVV-RMHCDDLHINVQDAAGDRILAAS------KLKRDKTNWSQWVDNKGI 133
Query: 126 PKIDKPLQRHGGRLEHNETYCGS-CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
++ + + GR+ E + +G E + + + A K W
Sbjct: 134 HRLGRDTK---GRIVTGEGWQEEEGFGEEHVHDIVAIGKKRAKWAKTPKLWG-------- 182
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDS 243
EG+ C IYG L+VN+V G+FH A G + + G H+ +
Sbjct: 183 --------------EGDSCRIYGNLDVNRVQGDFHITARGHGYMEFGEHL------DHAA 222
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGHTI 299
FN SH I++++FG +P +VNPLD +QY++ VVPTVYT + +TI
Sbjct: 223 FNFSHIISEMSFGPFYPSLVNPLDRTVNAARINFHKFQYYLSVVPTVYTVGKSASTSNTI 282
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+NQ++VTE + + +PG+FF YD+ PI ++ E FL FL + +V GV
Sbjct: 283 FTNQYAVTEQSKEVDD---HNVPGIFFKYDIEPILLSVEESRDGFLQFLMKIVNVVSGVL 339
Query: 360 TV 361
Sbjct: 340 VA 341
>gi|57208596|emb|CAI42845.1| ERGIC and golgi 3 [Homo sapiens]
Length = 129
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 70/128 (54%), Positives = 88/128 (68%), Gaps = 9/128 (7%)
Query: 265 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG------HTIQSNQFSVTEHFRSSEQGRL 318
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L
Sbjct: 1 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEAPLPPQVLRTNQFSVTRHEKVAN-GLL 59
Query: 319 --QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI
Sbjct: 60 GDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAI 119
Query: 377 KKKIEIGK 384
+KKI++GK
Sbjct: 120 QKKIDLGK 127
>gi|303278158|ref|XP_003058372.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459532|gb|EEH56827.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 399
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 139/300 (46%), Gaps = 59/300 (19%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ +++ DAY K +T +GG +TL S ++M +F ELR YL T VD
Sbjct: 3 LAARVKLFDAYHKPERHLTKKTAAGGAVTLSSLLLMAFVFVFELRSYLATERVTTTGVDV 62
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
+R E L IN DVTF +LPC LS+DA+D SG+ DV ++ K R+D G I + +
Sbjct: 63 TRDEMLAINVDVTFTSLPCQTLSLDALDASGKHDQDVGGELHKTRVDRFGRAIATYES-- 120
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
H E +D+ N E+ + +G + +
Sbjct: 121 -----------------HRE-----------NDDGVVNLITELFYGFETEG----HKAHV 148
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
D+ K GEGC ++G L+V +VAGNFH + VH D R +
Sbjct: 149 DEIK-------TALSAGEGCRVHGRLKVQRVAGNFHVS---------VHGEDARTL-RAT 191
Query: 244 F------NISHKINKLAFGEHFPGVVNPLDGVRWT--QETPSGMYQYFIKVVPTVYTDVS 295
F N+SH +++L+FG+ FP +PL G T +G Y+YF+KVVP YT S
Sbjct: 192 FEHPRNVNMSHAVHRLSFGKSFPRKEDPLSGFTRTTRHANETGTYKYFLKVVPVTYTGKS 251
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 46/72 (63%)
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
++N +SVTE + ++ +LP V+F YDLSPI VT ++ SF HFL A VGG +
Sbjct: 318 RTNLYSVTETYIPTKNWNGGSLPAVYFIYDLSPIAVTISDARKSFGHFLARTVAGVGGAY 377
Query: 360 TVSGIIDAFIYH 371
++G+ID I+H
Sbjct: 378 AIAGLIDRMIHH 389
>gi|346970151|gb|EGY13603.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium dahliae VdLs.17]
Length = 373
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 104/364 (28%), Positives = 162/364 (44%), Gaps = 55/364 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + RT GG T+ +++ ++LF+SEL + V+ G
Sbjct: 20 VSAFDAFPKSKPQYVQRTSGGGKWTVAMAVISVMLFWSELGRWWRGSESHTFAVEKGVGH 79
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L++N D+ + C L V+ D SG+ L +L + D G K
Sbjct: 80 DLQVNLDIVV-KMRCEDLHVNVQDASGDLILAA------TKLREEITSWHQWADMTGNHK 132
Query: 128 IDKPLQRHGGRLEHNETY-CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ + GR+E N Y +G E D++ Q
Sbjct: 133 LGRSPS---GRIETNSGYHLDEGFGEEHVH------------------------DIVAQS 165
Query: 187 KREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDS 243
K+ R G + C I+G L++NKV G+FH A G + +G H+ S
Sbjct: 166 KKRQKWARTPRLRGPPDSCRIFGSLDLNKVQGDFHITARGHGYQGAGQHL------DHTS 219
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYT----DVSGH 297
FN SH +N+L+FG +P + NPLD R P+ +QY++ +VPTVYT +
Sbjct: 220 FNFSHIVNELSFGAFYPNLENPLD--RTVNLAPANFHKFQYYLSIVPTVYTVGRSASKAN 277
Query: 298 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
T+ +NQF+VTE +S E G ++PGVF YD+ PI + E F+ F V ++ G
Sbjct: 278 TVYTNQFAVTE--QSKEVGD-HSVPGVFVKYDIEPILLLVEETRPGFVQFWLKVINVLSG 334
Query: 358 VFTV 361
V
Sbjct: 335 VLVA 338
>gi|380492334|emb|CCF34678.1| hypothetical protein CH063_01185 [Colletotrichum higginsianum]
Length = 377
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 169/362 (46%), Gaps = 47/362 (12%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
N + + DA+PK + +RT GG T+ +++ + LF++E+ + V+
Sbjct: 21 NIVSAFDAFPKAKPQYVTRTSGGGKWTVAMTVISVFLFWTEVGRWWRGSETHTFAVEKGI 80
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
G ++IN D+ + C L ++ D +G++ L + K+ + ++S+ G
Sbjct: 81 GHEMQINLDIVV-RMHCDDLHINVQDAAGDRIL--AGSMLKRDKTNWSQWVDSK----GI 133
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESS-DEDCCNNCEEVREAYRKKGWALSNPDLID 184
++ GR + G+ + E E+ ++ V +K W P L
Sbjct: 134 HRL--------GRDSKGKIVTGAGWQEEEGFGEEHVHDI--VSLGKKKAKWG-KTPRLWG 182
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDS 243
+G+ C +YG L+VN+V G+FH A G + + G H+ +
Sbjct: 183 --------------DGDSCRVYGNLDVNRVQGDFHITARGHGYMEFGEHL------DHAA 222
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGHTI 299
FN SH +++L+FG +P +VNPLD +QY++ +VPTVYT S +TI
Sbjct: 223 FNFSHIVSELSFGPFYPSLVNPLDRTVNLARINFHKFQYYLSIVPTVYTVGKSASSSNTI 282
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+NQ++VTE + ++ +PG+FF YD+ PI ++ E FL FL + +V GV
Sbjct: 283 FTNQYAVTEQSKETDD---HNIPGIFFKYDIEPILLSVEESRDGFLQFLMKIVNVVSGVL 339
Query: 360 TV 361
Sbjct: 340 VA 341
>gi|403371798|gb|EJY85783.1| hypothetical protein OXYTRI_16231 [Oxytricha trifallax]
Length = 333
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 177/400 (44%), Gaps = 100/400 (25%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE-TL 69
LD + ++ +D TF G ++T + V++ L SE+ YLN T+T +LVD S + L
Sbjct: 10 LDIFKRVPKDLTEPTFCGALLTSICFFVLVGLSLSEVARYLNVETKTDMLVDISHSDDKL 69
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID 129
IN D+TFP PC ILS+D D+ G H+++ +G +++ R G ++
Sbjct: 70 EINIDITFPRFPCEILSLDVQDVMGTHHVNI-----------EGGLVKQRITANGEVILE 118
Query: 130 KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKRE 189
Y A + +D + + R+ + +
Sbjct: 119 --------------------YSAHTK-QDRSHVASQTRDEVKAQ---------------- 141
Query: 190 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGK------SFHQSGVHVHDILAFQRDS 243
EGC+IYG + +N+V GNFH + Q G H
Sbjct: 142 -----------EGCHIYGNILINRVPGNFHISTHAFNDILMGLMQEGHH----------- 179
Query: 244 FNISHKINKLAFGE--HFPGV---------VNPLDG-----VRWTQETPSGMY-QYFIKV 286
F+ S+KI+ ++FG+ +F + ++PLDG R + P + +++
Sbjct: 180 FDFSYKIDHISFGKRNNFDMIRRKFRDHQLISPLDGKSETAPRDNKNFPKSLEGNFYLIA 239
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
VP+ + DVSG Q Q + +H + + F Y+LSPI V F+++ S
Sbjct: 240 VPSYFKDVSGGVYQVYQLTANDHTNFGTGNNI-----LKFNYELSPITVGFSQDRESIAL 294
Query: 347 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
FL ++CAI+GGVFT IIDA I+ + KK IGK S
Sbjct: 295 FLVHICAIIGGVFTAVSIIDAIIHKSFSLLFKK-RIGKLS 333
>gi|440801547|gb|ELR22565.1| serologically defined breast cancer antigen 84 isoform 1, putative
[Acanthamoeba castellanii str. Neff]
Length = 355
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 157/371 (42%), Gaps = 68/371 (18%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
++ ++RS D +PK ED + +G +T+V +VML LF SE Y VTE
Sbjct: 8 SMAKRLRSFDIFPKSVEDVREQASAGAAVTIVGVLVMLFLFVSEFSSYTQVVTEAW---- 63
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
RG + D F +D + E+ + + ++ +L
Sbjct: 64 --RGGAIWAEADTIF------------VDTTREKTMWINFELVFLQL------------- 96
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+C E D + + R +K+ A+
Sbjct: 97 -------------------------ACKEVEVDIVDNFGDPQRGRRDIQKQ--AVDPEQY 129
Query: 183 IDQCKREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD----I 236
+ Q F EE +G GC ++G EV KV GN H A G + QS I
Sbjct: 130 LQQTFSSWFTSAHTEEFPKGSGCRVFGKAEVQKVKGNLHIAAGSNAPQSHDGHQHHVHHI 189
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVS 295
Q SFN+SH I L+FG FP +PL R + P+ M + I++VPT+Y D
Sbjct: 190 TPEQVASFNVSHFIPHLSFGPAFPRRTDPLSWTRVIE--PNAMQVNHMIQLVPTIYEDWG 247
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
G+ I+ Q+S +++ G LPGVF +D+SP + + E SF HFLT +CAI
Sbjct: 248 GNVIEGYQYSAQTNYKHIVPGASSFPLPGVFIKWDMSPFVIQYRETGRSFAHFLTRLCAI 307
Query: 355 VGGVFTVSGII 365
GG F V G+I
Sbjct: 308 TGGTFVVLGLI 318
>gi|321479391|gb|EFX90347.1| hypothetical protein DAPPUDRAFT_309719 [Daphnia pulex]
Length = 369
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 178/375 (47%), Gaps = 49/375 (13%)
Query: 6 NKIRS---LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+KI++ LDA+PK+ + + +T SGG I+L+ ++L L FSE+ ++++ + + D
Sbjct: 7 DKIKAVIELDAFPKVPDTYKEKTTSGGTISLICIFIILYLVFSEVNDFIHSGVKFHFVPD 66
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
+ +N D+T A+PC + D +D +G+ + H L + E
Sbjct: 67 DDLDTRMDLNVDMTV-AMPCRYIGADVLDSTGQSVVSFGH------LTEENTWFEL---- 115
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P QR+ H E A+ + + +++ K G+ +L
Sbjct: 116 -------SPRQRN-----HFE-------AAQRLNSILRDKPHGIQQLLWKSGYQ----NL 152
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH-QSGVHVHDILAFQR 241
+ F + + + C ++G L++ KVAGNFH GK H H
Sbjct: 153 FGEMPSREF---VPSQPSDACRLHGTLQLTKVAGNFHITAGKVLPLPMRAHAHLSPMMDD 209
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHT-- 298
+ FN SH+I+K +FG H ++ PL+G + + ++QYF+ VPT + + VS +
Sbjct: 210 ERFNYSHRIDKFSFG-HSSTLIQPLEGDEVITDKGAMLFQYFVTAVPTEIESLVSASSGI 268
Query: 299 ---IQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+++ Q+SV R Q +PG++F YD++P++V + L F+ +CAI
Sbjct: 269 HGSMKTWQYSVRNQSRIIGHQKGSHGIPGIYFKYDVAPLRVRVVPDAPPLLRFVLRLCAI 328
Query: 355 VGGVFTVSGIIDAFI 369
VGGV+T +GI+ I
Sbjct: 329 VGGVYTSAGIVHKVI 343
>gi|432862155|ref|XP_004069750.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oryzias latipes]
Length = 373
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/178 (39%), Positives = 100/178 (56%), Gaps = 2/178 (1%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
QR C I+G L VNKVAGNFH GKS H H DS+N SH+I+
Sbjct: 157 QRDSSSPPNACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVSHDSYNFSHRIDH 216
Query: 253 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 312
L+FGE PG+++PLDG + M+QYFI +VPT + + +++Q+SVTE R
Sbjct: 217 LSFGEAIPGLISPLDGTEKIAADYNHMFQYFITIVPT-KLNTYKVSAETHQYSVTERERV 275
Query: 313 -SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
+ + G+F YD+S + V TE+H+ F FL +C IVGG+F+ +G+I +
Sbjct: 276 INHAAGSHGVSGIFMKYDISSLMVKVTEQHMPFWKFLVRLCGIVGGIFSTTGMIHGLV 333
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 50/89 (56%), Gaps = 1/89 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LDA+PK+ E + T SGG ++L++ +M +L F E +Y N + + VD
Sbjct: 10 LTLVKELDAFPKVPESYVESTASGGTVSLIAFTLMAVLAFLEFFVYTNTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDIS 93
LRIN D+T A+ C + D +D++
Sbjct: 70 FSSKLRINVDITV-AMRCQYIGADVLDLA 97
>gi|443700340|gb|ELT99344.1| hypothetical protein CAPTEDRAFT_162161 [Capitella teleta]
Length = 110
Score = 134 bits (336), Expect = 1e-28, Method: Composition-based stats.
Identities = 62/108 (57%), Positives = 83/108 (76%), Gaps = 2/108 (1%)
Query: 279 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVT 336
M+ Y++KVVPT Y +G + SNQ+SVT+H + G L Q LPGVF Y+LSP+ V
Sbjct: 1 MFSYYVKVVPTSYLRANGEFVSSNQYSVTKHHKKVGGGILGEQGLPGVFVTYELSPMMVK 60
Query: 337 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+TE++ SF+HFLT VCAI+GGVFTV+G++DAFIYH RAI+KKI++GK
Sbjct: 61 YTEKNRSFMHFLTGVCAIIGGVFTVAGLVDAFIYHSARAIQKKIDLGK 108
>gi|320170541|gb|EFW47440.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 408
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 107/205 (52%), Gaps = 8/205 (3%)
Query: 165 EVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHFAP 222
E R+ ++ +LS + + + + +EG + C ++G + +K+AGNFH
Sbjct: 177 ENRKPLTREHLSLSGTTRKAKKNFQAMPRELSSQEGTPDACRLHGSVSADKIAGNFHIIA 236
Query: 223 GKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQY 282
G + G H H + + N +H+IN L+FGE PG+ PLDG W + + YQY
Sbjct: 237 GAAVEVPGGHAHMGQMIPQHALNFTHRINHLSFGEEMPGMEFPLDGDEWITTSHTMAYQY 296
Query: 283 FIKVVPTVYTDVSG--HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEE 340
FI+VVPTVYT + ++S QFSVT H E LPG+FF YD PI VT
Sbjct: 297 FIQVVPTVYTRHANDPEQLRSGQFSVTRH----ESPNSNRLPGLFFKYDTFPILVTVQYS 352
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGII 365
SF H L + I+GGVF SG I
Sbjct: 353 PYSFWHLLIRLSGIIGGVFATSGFI 377
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 46/87 (52%), Gaps = 1/87 (1%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ LD +PK+ + + SGG +TLV ++++ L +EL Y N VD
Sbjct: 19 VKQLDIFPKVASTYKETSSSGGTVTLVCLVLIVFLVGAELGEYFNQQAAFSYGVDPVVDG 78
Query: 68 TLRINFDVTFPALPCSILSVDAMDISG 94
+L++ +D+ A+PC +L D + +G
Sbjct: 79 SLKLTYDIVV-AMPCDLLGADVLQATG 104
>gi|74189495|dbj|BAE22750.1| unnamed protein product [Mus musculus]
Length = 303
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 201 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 260
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 94 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 153
Query: 261 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 317
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 154 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 210
Query: 318 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ G+F YDLS + VT TEEH+ F F +C I+GG+F+ +G++
Sbjct: 211 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 258
>gi|357627966|gb|EHJ77470.1| putative PTX1 protein isoform 1 [Danaus plexippus]
Length = 353
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 166/383 (43%), Gaps = 69/383 (18%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+++K++ LDA+ K+ +++ N+ + + DT
Sbjct: 10 VIDKVKELDAFSKVPDEYVD----------------------------NSNLAFRFMPDT 41
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
E LRIN D+T A+PCS + D +D + Q
Sbjct: 42 DMDEKLRINIDITI-AMPCSNIGADILD-------------------------STSQSVF 75
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G G L+ +T+ +++ E +RE Y W L
Sbjct: 76 GF-----------GELQEEDTWWELTPEQKNAFEAVKYMNSYLREEYHSV-WQLLWKKGH 123
Query: 184 DQCKREGFLQRIK-EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+ ++ K + C ++G L +NKVAGNFH GKS H H+H + F
Sbjct: 124 GSVRATVPPRKTKPNRRPDACRLHGVLTLNKVAGNFHITAGKSLHLPRGHIHLNMLFDDT 183
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
N SH+IN+L+FG G++ PL+G S +YQYF++VVPT D + +I++
Sbjct: 184 PQNFSHRINRLSFGSPANGIIYPLEGDEKITSDESMLYQYFLEVVPTD-VDTTFESIKTF 242
Query: 303 QFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
Q+SV E R + +PGVFF YD++ +KV +E + L F+ + +I+GG++ +
Sbjct: 243 QYSVKELARPISHSKGSHGVPGVFFKYDMAALKVQVYQERENLLQFMLRLFSIIGGIYVI 302
Query: 362 SGIIDAFIYHGQRAIKKKIEIGK 384
I+ + + + KK E+ K
Sbjct: 303 ISFINTIVLTAKTLLVKKPEVKK 325
>gi|449303002|gb|EMC99010.1| hypothetical protein BAUCODRAFT_120300 [Baudoinia compniacensis
UAMH 10762]
Length = 387
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 162/369 (43%), Gaps = 48/369 (13%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ +R+ DA+PK + +T +GG+ T+V L L ++E+ + T + V+
Sbjct: 20 IKAVRAFDAFPKTKPSYTQKTNNGGIWTVVLVCASLWLAWTEVMRWWWGHTTHEFSVEQG 79
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
G L+IN DV + C L V+ D SG++ L G ++
Sbjct: 80 VGHDLQINLDVVV-KMRCDDLHVNVQDASGDRIL-------------AGETLQRDATLWS 125
Query: 125 APKIDKPLQRHGG-RLEHNETYCGSCYG-AESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
++ L G R E E S YG A ED ++ + +K P
Sbjct: 126 QWGANRKLHTLGATRDERLEMTGYSSYGDAREYAEDDVHDYLGAASSTKKFKKTPRVP-- 183
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQR 241
K +E + C IYG + NKV G+FH A G + + G H+ +
Sbjct: 184 -------------KSKEADSCRIYGSMHGNKVQGDFHITARGHGYMEFGQHL------EH 224
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-------DV 294
SFN SH IN+L+FG +P + NPLD E +QY++ VVPT+YT +
Sbjct: 225 SSFNFSHHINELSFGPFYPSLTNPLDNTLAATEFNFFKFQYYLSVVPTIYTTNAKALRKI 284
Query: 295 SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+ T+ +NQ++VTE R + + +PGVF YD+ PI + EE SF + +
Sbjct: 285 TKSTVFTNQYAVTEQSRPVPENQ---VPGVFVKYDIEPILLMIAEERNSFPALFIRLVNV 341
Query: 355 VGGVFTVSG 363
+ GV G
Sbjct: 342 ISGVLVAGG 350
>gi|452847826|gb|EME49758.1| hypothetical protein DOTSEDRAFT_58941 [Dothistroma septosporum
NZE10]
Length = 402
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/394 (28%), Positives = 167/394 (42%), Gaps = 88/394 (22%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++S DA+PK + RT SGGV T+V + LLL +SE+ + T V+ G
Sbjct: 23 VKSFDAFPKTKPSYTQRTESGGVWTVVLIVASLLLGWSEISGWWTGKTTHTFAVEQGVGH 82
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L+IN DV A+ C L V+ D SG++ L G+ + K
Sbjct: 83 DLQINLDVVV-AMQCGDLHVNVQDSSGDRIL-------------AGSAL----------K 118
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL---ID 184
D R G H A +S+++ E +R Y KG D+ +
Sbjct: 119 KDPTTWRQWGGRSH----------ALASEKE-----ERIRSGYDGKGAEYEEEDVHNYLG 163
Query: 185 QCKREGFLQRIKE----EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAF 239
KR+ ++ + + C IYG + NKV G+FH A G + + G H+
Sbjct: 164 AAKRQKKFKKTPGLPWGAQADSCRIYGSMHGNKVQGDFHITARGHGYMEFGAHL------ 217
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMY--QYFIKVVPTVYTD---- 293
+FN SH +N+L+FG +P + NPLD TP Y QY++ VVPT+YT
Sbjct: 218 DHSTFNFSHTVNELSFGPFYPSLTNPLDNT--VATTPDHFYKFQYYLSVVPTIYTTDAKT 275
Query: 294 ------------------------VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 329
S +T+ +NQ++VTE S + +PGVF +D
Sbjct: 276 LRKIDKHHESPSSGEDGLSQYPHRYSRNTVFTNQYAVTEQ---SHRVPENAVPGVFIKFD 332
Query: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
+ PI +T EE S L + +V G+ G
Sbjct: 333 IEPIGLTIAEEWSSIPALLIRLVNVVSGLLVAGG 366
>gi|432879813|ref|XP_004073560.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Oryzias latipes]
Length = 271
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 82/201 (40%), Positives = 110/201 (54%), Gaps = 22/201 (10%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
+I +GEGC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 86 MKIPINQGEGCRFEGKFTINKVPGNFH-----------VSTHSATA-QPQNPDMTHSIHK 133
Query: 253 LAFGE-----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
LAFG+ + G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 134 LAFGDTLQVHNVKGAFNALGGADKLSSNPLASHDYILKIVPTVYEDLSGRQRFSYQYTVA 193
Query: 308 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
E+ S GR+ +P ++F YDLSPI V +TE F F+T +CAIVGG FTV+GII
Sbjct: 194 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGII 251
Query: 366 DAFIYHGQRAIKKKIEIGKFS 386
D+ I+ A KKI+IGK S
Sbjct: 252 DSCIFTASEA-WKKIQIGKMS 271
>gi|145235453|ref|XP_001390375.1| COPII-coated vesicle protein (Erv41) [Aspergillus niger CBS 513.88]
gi|134058058|emb|CAK38286.1| unnamed protein product [Aspergillus niger]
gi|350632895|gb|EHA21262.1| hypothetical protein ASPNIDRAFT_191708 [Aspergillus niger ATCC
1015]
Length = 399
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/388 (27%), Positives = 171/388 (44%), Gaps = 64/388 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +++ DA+PK + + + GG T++ ++ + FSE R +L+ V+
Sbjct: 19 GLQGGLKTFDAFPKTKPSYTAPSRRGGQWTVLILVICTVFTFSEFRTWLHGSENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE--SRQ 120
G L++N D+ +PC L V+ D SG++ L D+ ++ S ++ +R+
Sbjct: 79 KGVGHDLQLNLDLVV-RMPCDTLDVNIQDASGDRIL--AGDLLQRERTSWKLWMDKRNRE 135
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRK---KGWAL 177
G + Q R+ A +D + EVR+ R+ KG L
Sbjct: 136 TSGGVHEYQTLSQEDTDRIS-----------AREADAHVHHVLGEVRKNPRRKFAKGPRL 184
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHV-HD 235
D +D C+ IYG LE NKV G+FH A G + G H+ H
Sbjct: 185 RRGDTVDSCR-----------------IYGSLEGNKVQGDFHITARGHGYRNFGEHLDHG 227
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY---- 291
+ FN SH + +L+FG H+P ++NPLD T ET YQYF+ VVPT+Y
Sbjct: 228 V-------FNFSHMVTELSFGPHYPTLLNPLDKTIATTETHYYKYQYFLSVVPTLYSKGA 280
Query: 292 --------------TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTF 337
T+ + + + +NQ++ T + +PG+FF Y++ PI +
Sbjct: 281 SALDTYTNHPDLIATNRNRNLVFTNQYAATTQATELPENPY-FIPGIFFKYNIEPILLMI 339
Query: 338 TEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+EE SFL L + V GV G +
Sbjct: 340 SEERTSFLSLLIRLVNTVSGVMVTGGWV 367
>gi|289741661|gb|ADD19578.1| cOPII vesicle protein [Glossina morsitans morsitans]
Length = 418
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 100/377 (26%), Positives = 183/377 (48%), Gaps = 47/377 (12%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
++LDA+ K+ E + T GG ++L+S ++++ L + E++ Y +A + D + E
Sbjct: 19 KNLDAFKKVPEKYTEATEIGGTLSLISRLLIIYLIYREVKYYQDAGLVYQFEPDIDK-EK 77
Query: 69 LRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
++++ D+T A+PC+ LS VD MD + + D+F GA
Sbjct: 78 VQMHVDITV-AMPCNSLSGVDLMD-------ETQQDVF----------------AYGA-- 111
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG-----WALSNPDL 182
L+R G + + E + +RE Y + + +P+
Sbjct: 112 ----LRRQG-------VWWHLTPHERTEFERVQHENHFLREEYHSVADLLFKYIIQSPE- 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+D+ E + + EE+ + C ++G L +NKVAG H G + H ++ F+
Sbjct: 160 VDETATEEDEKPLSEEQYDACRLHGTLGINKVAGVLHLVGGTQPVVDLLGEHLMIGFRHI 219
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
+ N +H+IN+L+FG++ +V PL+G + QYF+ +VPT + TI +
Sbjct: 220 AANFTHRINRLSFGQYARRIVQPLEGDETFVSEEGTIVQYFLNIVPT-EIHKTFTTISTY 278
Query: 303 QFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
Q+SVTE+ R + R PG++F YD S +K+ + + L F+ +C+I+ G+ +
Sbjct: 279 QYSVTENVRVLDSDRNSYGSPGIYFKYDWSALKIIVRTDRDNMLQFIIRLCSIISGIVVL 338
Query: 362 SGIIDAFIYHGQRAIKK 378
SGI++ F+ +R I K
Sbjct: 339 SGILNVFLLTLRRNIIK 355
>gi|453088947|gb|EMF16987.1| DUF1692-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 404
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 101/389 (25%), Positives = 174/389 (44%), Gaps = 70/389 (17%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ +++ DA+PK + RT +GGV T++ + + L +SEL + T V+
Sbjct: 20 LSAVKAFDAFPKTKPSYQQRTSTGGVWTVILIVASVALTWSELARWWKGETTHTFAVEQG 79
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ-DGI 123
G L++N D T + C+ L V+ D +G++ L +F K + +R+ +
Sbjct: 80 VGHDLQMNLD-TVVRMKCADLHVNVQDAAGDRIL--AGSVFHKDGTTWDQWAGNRKAHAL 136
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G+ K ++ Q+ GS AE +ED + LS+ +
Sbjct: 137 GSTKEERLSQK------------GSAASAEYREEDVHH--------------YLSSARMK 170
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRD 242
+ R + R + E + C IYG + NKV G+FH A G + + G H+
Sbjct: 171 HKFGRTPHIPRGR--EADSCRIYGSMHGNKVKGDFHITARGHGYMEFGQHL------DHS 222
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT---------- 292
+FN SH+I +L+FG ++P + NPLD T E+ +QY++ VVPT+YT
Sbjct: 223 TFNFSHRITELSFGPYYPSLTNPLDNTFATTESNFYKFQYYLSVVPTIYTADAKALRKID 282
Query: 293 ------------------DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIK 334
S +T+ +NQ++VTE + ++PG+F +D+ PI+
Sbjct: 283 KYHESPTSGDDGLSQQPKRYSKNTVFTNQYAVTEQSHPVSE---SSVPGIFVKFDIEPIQ 339
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
+T E S L + +V G+ G
Sbjct: 340 LTIAENWSSVPALLIRIVNVVSGLLVAGG 368
>gi|229366152|gb|ACQ58056.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Anoplopoma fimbria]
Length = 290
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 82/200 (41%), Positives = 108/200 (54%), Gaps = 22/200 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I +G+GC G +NKV GNFH V H A Q S +++H I+KL
Sbjct: 106 KIPLNQGDGCRFEGEFTINKVPGNFH-----------VSTHSATA-QPQSPDMTHNIHKL 153
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
AFGE G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 154 AFGEKIQVQRVQGAFNALGGADRLSSNPLASHDYILKIVPTVYEDLSGKQRFSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAIVGG FTV+GIID
Sbjct: 214 KEYVAYSHAGRI--IPAIWFRYDLSPITVKYTERRQPVYRFITTICAIVGGTFTVAGIID 271
Query: 367 AFIYHGQRAIKKKIEIGKFS 386
+ I+ A KKI+IGK S
Sbjct: 272 SCIFTASEAW-KKIQIGKMS 290
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 6/98 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+R D Y K+ +D T++G I+++ + +L LF SEL ++ +L V D
Sbjct: 5 VRRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATELVNELYVDDPDKD 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 65 SGGKIEVSLNISLPNLHCDLVGLDIQDEMGRHEVGHID 102
>gi|310800159|gb|EFQ35052.1| hypothetical protein GLRG_10196 [Glomerella graminicola M1.001]
Length = 377
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 165/360 (45%), Gaps = 47/360 (13%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + +RT GG T+ +++ LF +E+ + V+ G
Sbjct: 23 VSAFDAFPKAKPQYVTRTEGGGKWTVAMAVISFFLFCTEVGRWWRGSETHTFAVEKGVGH 82
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
++IN D+ + C L ++ D +G++ L + K+ + ++S+ G +
Sbjct: 83 EMQINLDIVV-RMHCDDLHINVQDAAGDRIL--AGSMLKRDKTNWSQWVDSK----GIHR 135
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESS-DEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ G+ + G+ + E E+ ++ V +K W P L
Sbjct: 136 L--------GKDSKGKVVTGAGWQEEEGFGEEHVHDI--VSLGKKKAKWG-KTPRLWG-- 182
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFN 245
EG+ C IYG L+VN+V G+FH A G + + G H+ +FN
Sbjct: 183 ------------EGDSCRIYGNLDVNRVQGDFHITARGHGYMEFGAHL------DHAAFN 224
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGHTIQS 301
SH I++L+FG +P +VNPLD +QY++ VVPTVYT S +TI +
Sbjct: 225 FSHIISELSFGPFYPSLVNPLDRTVNLARINFHKFQYYLSVVPTVYTVGKSASSSNTIFT 284
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQ++VTE + ++ +PG+FF YD+ PI ++ E FL L + IV GV
Sbjct: 285 NQYAVTEQSKETDD---HNIPGIFFKYDIEPILLSVEESRDGFLQLLMKIVNIVSGVLVA 341
>gi|296821254|ref|XP_002850059.1| ER-derived vesicles protein ERV41 [Arthroderma otae CBS 113480]
gi|238837613|gb|EEQ27275.1| ER-derived vesicles protein ERV41 [Arthroderma otae CBS 113480]
Length = 399
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 108/403 (26%), Positives = 175/403 (43%), Gaps = 62/403 (15%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D I K+++ DA+PK + S + SGG+ T+ +I+ +L SEL + V
Sbjct: 18 DGIAAKLKTFDAFPKTKPSYTSTSRSGGLWTVFIAILCAILSCSELVTWYRGHENHHFSV 77
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
+ + +++N DV A+PC + ++ D G+ L + + + N +RQ
Sbjct: 78 ERGVSQEMQLNLDVVV-AMPCDDVRINVQDAVGDHILAGELLTQQPTSWAAWNREFNRQR 136
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWALS 178
G G+P+ + RLE E D + EVR +KK L
Sbjct: 137 GGGSPEYQTLSKEDPFRLEEQE-----------EDLHVEHVLGEVRRGRKKKFPKAPKLK 185
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 237
D +D C+ ++G LE NKV GN H A G + + G +
Sbjct: 186 KSDAVDSCR-----------------VFGSLEGNKVQGNLHITARGFGYLEWGQPTNP-- 226
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
S N +H I +L+FG H+ ++NPLD T YQY + VVPT+YT SGH
Sbjct: 227 ----HSLNFTHLITELSFGPHYARLLNPLDKTVSTTSVNFYKYQYHLSVVPTIYTK-SGH 281
Query: 298 ---------------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 336
T+ +NQ++VT + Q R++++PG+FF Y++ PI +
Sbjct: 282 IDPNHRSLPDPSSITAKDSKTTVSTNQYAVTS-YSQPVQPRIESIPGIFFKYNIEPILLI 340
Query: 337 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
++E S L L + +V GV G + A++K+
Sbjct: 341 VSQERDSLLALLVRLVNVVSGVLVTGGWLFQIGSWAVEAMRKR 383
>gi|123430864|ref|XP_001307985.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121889642|gb|EAX95055.1| hypothetical protein TVAG_428580 [Trichomonas vaginalis G3]
Length = 358
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 167/364 (45%), Gaps = 39/364 (10%)
Query: 28 GGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD---TSRGETLRINFDVTFPALPCSI 84
G V++++ ++V +L + + LY+N L V TS ET+ I+ + A+PC
Sbjct: 26 GSVVSILLTVVSSILIITNVALYINPRIYRDLSVKPSVTSASETINISLTIKI-AMPCYF 84
Query: 85 LSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET 144
L +D MD G Q +K+ + +RL++ G VI D +
Sbjct: 85 LHIDYMDSLGFQRSYIKNTVTFRRLNNLGRVIGYTNDTLSD------------------- 125
Query: 145 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKRE---GFLQRIKEEEGE 201
C CY ++ ++CCN+C +V+ +L +D K + ++ E
Sbjct: 126 VCEPCYNLSTNPDECCNSCLKVQL------LSLMQNKPVDFSKYRVCNNYEKKPNVSLSE 179
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
C + G L VN++ G+FH APG + QS ++HD+ + Q +++H I +L FG H P
Sbjct: 180 KCLVKGKLTVNRIPGSFHIAPGTNVPQSA-YLHDLSSMQM-FHDMTHSIQRLRFGPHIPR 237
Query: 262 VVNPLDGVRWTQETPSGMYQYF--IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 319
NPLD + Q+ P+ YF + + P ++ ++ +++ + Q
Sbjct: 238 TSNPLDNFKSFQQIPTHDRTYFYNLLITPVIFYRDGVEYLKGYEYTAFSEAIDTFQ-LFG 296
Query: 320 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
PG+FF Y +P + + +FL F++N ++ G++ I+D I G+
Sbjct: 297 ISPGLFFQYQFTPYTIVVSANRQNFLQFISNTFGVISGIYACLSILDKLI--GEDIGSNV 354
Query: 380 IEIG 383
+EIG
Sbjct: 355 VEIG 358
>gi|240275142|gb|EER38657.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Ajellomyces capsulatus H143]
gi|325094499|gb|EGC47809.1| COPII-coated vesicle protein [Ajellomyces capsulatus H88]
Length = 401
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 171/389 (43%), Gaps = 64/389 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I + +R+ DA+PK + + T GG T++ + L +ELR + V V+
Sbjct: 19 GIGSGLRTFDAFPKTKPTYTTSTRRGGQWTIIVFALCAFLSLNELRTWYRGVENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
L++N D+ A+PC L V+ D +G++ L D+ K+ S +G
Sbjct: 79 KGVSRELQMNLDIV-AAMPCDALRVNVQDAAGDRIL--ASDLLDKQPTSWA-AWNRELNG 134
Query: 123 IGAPKIDKPLQRHGGRLEH---NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
+ + GG E+ NE E+ D + E + +Y++K
Sbjct: 135 VTS----------GGGREYQTLNEEDSSRLMEQEA-DAHVGHALGEAKRSYKRK--FPKG 181
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILA 238
P L + E+ + C IYG LE NKV G+FH A G + + G H+
Sbjct: 182 PKLK------------RGEKADSCRIYGSLEGNKVQGDFHITARGHGYPEYGEHL----- 224
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVS- 295
D+FN SH + +L+FG H+P ++NPLD + TP+ +QY++ VVPT+YT
Sbjct: 225 -SHDAFNFSHMVTELSFGPHYPSLLNPLD--KTISVTPARFFKFQYYLSVVPTIYTRAGI 281
Query: 296 -------------------GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 336
G TI +NQ++ T + +PG+FF Y++ PI +
Sbjct: 282 VDPYNHVLPDPTTIRPSERGSTIFTNQYAATSQSHEVPDPQYH-IPGIFFKYNIEPILLV 340
Query: 337 FTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+EE S L L + ++ GV G +
Sbjct: 341 VSEERGSLLALLVRLVNVLAGVVVAGGWL 369
>gi|255726548|ref|XP_002548200.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240134124|gb|EER33679.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 355
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 167/371 (45%), Gaps = 63/371 (16%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD+ NK+R+ DA+PK++ + R+ GG TLV+ + LL+ + E+ Y+ + +
Sbjct: 1 MDSFTNKVRTFDAFPKVDPNQQVRSQRGGFSTLVTYMFGLLILWIEIGGYIGGYVDRQFT 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD L IN D+ +PC L + DI+ +++L + L+ +G
Sbjct: 61 VDNQIRSDLTINLDMIV-GMPCEFLHTNVEDITRDRYLA------GETLNFEGIHF---- 109
Query: 121 DGIGAP--KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
I P +I+ P H E+ D D E +R +R +G ++
Sbjct: 110 --IVPPSFRINNPNDFH-----------------ETPDLDEIMQ-ESLRAEFRSQGARVN 149
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
E C+I+G + V +V G+F ++ HV
Sbjct: 150 -------------------EGAPACHIFGSIPVTQVRGDFRITAKGFGYRDRSHV----- 185
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
++FN SH I + +FGE +P + NPLD E Y Y+ KVVPT+Y + G
Sbjct: 186 -PIEAFNFSHVIQEFSFGEFYPFINNPLDATGKITEEKLQTYLYYAKVVPTMYEQL-GLE 243
Query: 299 IQSNQFSVTEH---FRSSEQ-GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
I +NQ+S+TE + EQ R +PG++F YD PIK+ E+ + F F+ + I
Sbjct: 244 IDTNQYSLTESQHVIQVDEQTKRPNGIPGIYFRYDFEPIKLVIREKRIPFFQFIAKLGTI 303
Query: 355 VGGVFTVSGII 365
GG+ +G +
Sbjct: 304 GGGIMIAAGYL 314
>gi|401427507|ref|XP_003878237.1| hypothetical protein, unknown function [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494484|emb|CBZ29786.1| hypothetical protein, unknown function [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 309
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 171/392 (43%), Gaps = 93/392 (23%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M A+ N R+ D + I D T +G +I++ ++M LLF E+ Y+ ++ ++
Sbjct: 1 MRAVRNWQRA-DFFRHIPRDLTESTTAGSIISIACVVLMALLFAGEVISYVFPRIQSDMI 59
Query: 61 V--DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES 118
+ D T++++ D+TFP +PC++L++D +D+ + I + RLD+ G I
Sbjct: 60 IMPDLDDQNTIKVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPI-- 117
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
SD ++ V E
Sbjct: 118 ------------------------------------SDGRSSDDFVSVAEG--------- 132
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
C+ EG+++ V KV GNFH + H H
Sbjct: 133 -------CRLEGYIK-----------------VGKVPGNFHISSHGRQHLLAQHF----- 163
Query: 239 FQRDSFNISHKINKLAFGE------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
+ N+ H I+ L+FG ++PLDG E P +YQYF+ +VPT+Y
Sbjct: 164 --PNGINVEHSIHHLSFGTTDVKKLAKKAALHPLDGKEHRSEVPM-VYQYFLDIVPTIY- 219
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+ S T+ + QF+ T SS + + V F Y LSPI V ++ VS HFLT VC
Sbjct: 220 ESSFSTVHTYQFTGTS---SSTPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVC 276
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
AI+GGV+TV+G++ F++ ++++ +GK
Sbjct: 277 AIIGGVYTVAGLLSRFVHSSAAQFQRRV-LGK 307
>gi|348516790|ref|XP_003445920.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Oreochromis niloticus]
Length = 290
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 108/200 (54%), Gaps = 22/200 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I +G+GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNQGDGCRFEGEFTINKVPGNFH-----------VSTHSATA-QPQNPDMTHTIHKL 153
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
AFGE G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 154 AFGEKLQVQKVQGAFNALGGADKMSSNPLASHDYILKIVPTVYEDLSGRQRFSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GIID
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGAFTVAGIID 271
Query: 367 AFIYHGQRAIKKKIEIGKFS 386
+ I+ A KKI+IGK S
Sbjct: 272 SCIFTASEAW-KKIQIGKMS 290
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 47/94 (50%), Gaps = 3/94 (3%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+R D Y K+ +D T++G I+++ + +L LF SEL ++ +L V D
Sbjct: 5 VRRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATEIVNELYVDDPDKD 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
G + ++ +++ P L C ++ +D D G +
Sbjct: 65 SGGKIEVSLNISLPNLHCDLVGLDIQDEMGRHEV 98
>gi|432954843|ref|XP_004085560.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Oryzias latipes]
Length = 122
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 56/107 (52%), Positives = 81/107 (75%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+NK++ DAYPK EDF +T+ G +T++S ++ML+LF SEL+ +L +L VDTS
Sbjct: 4 LNKLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYFLTKEVHPELYVDTS 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDS 111
RG+ L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD
Sbjct: 64 RGDKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKRRLDK 110
>gi|348505737|ref|XP_003440417.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oreochromis niloticus]
Length = 374
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 97/169 (57%), Gaps = 2/169 (1%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
C I+G L VNKVAGNFH GKS H H DS+N SH+I+ L+FGE PG
Sbjct: 168 ACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVAHDSYNFSHRIDHLSFGEPLPG 227
Query: 262 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT 320
+++PLDG + M+QYFI +VPT + + +++Q+SVTE R +
Sbjct: 228 IISPLDGTEKIATDSNHMFQYFITIVPT-KLNTYKVSAETHQYSVTERERVINHAAGSHG 286
Query: 321 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
+ G+F YD+S + V TE+H+ FL +C I+GG+F+ +G+I +
Sbjct: 287 VSGIFMKYDISSLMVKVTEQHMPLWQFLVRLCGIIGGIFSTTGMIHGLV 335
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 49/89 (55%), Gaps = 1/89 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LDA+PK+ E + T SGG ++L++ M +L F E +Y + + + VD
Sbjct: 10 LTLVKELDAFPKVPESYVESTASGGTVSLIAFTFMAVLAFLEFFVYRHTWMKYEYEVDRD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDIS 93
LRIN D+T A+ C + D +D++
Sbjct: 70 FSSKLRINVDITV-AMRCQYIGADVLDLA 97
>gi|71409118|ref|XP_806922.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870803|gb|EAN85071.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 310
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 99/306 (32%), Positives = 158/306 (51%), Gaps = 48/306 (15%)
Query: 4 IMNKIRSLDAYPKINEDF-YSRTFSGGVITLVSSIVMLLLFFSELRLYL---NAVTETKL 59
++ K+ ++D +PK ED+ S+T+ G +++LV+ +V+ LL F E+ Y+ +A T T+L
Sbjct: 20 LLKKVAAVDLFPKPKEDYSRSQTYRGALVSLVTVVVIGLLVFWEVCSYIFGRDAYT-TEL 78
Query: 60 LVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN--VIE 117
VDTS + N D+TFP +PC +S+D +D++G +L+V ++FK +D+QGN I
Sbjct: 79 SVDTSLSTEVDFNLDITFPRVPCHEVSLDVLDVTGTVNLNVTRNLFKTPVDAQGNFAFIG 138
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNE------TYCGSCYGAE------SSDEDCCNNCEE 165
+RQ G+G +G E ++ +CG C+ E + CCN C +
Sbjct: 139 TRQ-GVGE---------YGSFREQSKDDPSSPQFCGRCFINEHQVSMMENKNRCCNTCND 188
Query: 166 VREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKS 225
V AY ++G + ++QC E L RI GCN G L V K G FAP +
Sbjct: 189 VLNAYDQQGLPRPQKNEVEQCIYE--LSRI----NPGCNYKGTLIVKKFGGRLVFAPKRV 242
Query: 226 FHQSGVHVHDILAFQRDSFNISHKINKLAFG-EHFP-----GVVNPLDGVRWTQETPSGM 279
G + D++ F+ SH INKL+ G EH GV +PL+G + +
Sbjct: 243 --PGGFLIRDVM-----RFDSSHIINKLSIGDEHVTRFSRRGVQHPLNGHEFDAQRRFTE 295
Query: 280 YQYFIK 285
+YF +
Sbjct: 296 IRYFFE 301
>gi|398021306|ref|XP_003863816.1| hypothetical protein, unknown function [Leishmania donovani]
gi|322502049|emb|CBZ37133.1| hypothetical protein, unknown function [Leishmania donovani]
Length = 309
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 173/392 (44%), Gaps = 93/392 (23%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M A N R+ D + I D T +G +I++ +VM+LLF E+ Y+ ++ ++
Sbjct: 1 MRAARNWQRA-DFFRHIPRDLTEPTTAGSIISVACVVVMVLLFAGEVISYVFPRIQSDMI 59
Query: 61 V--DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES 118
+ D T++++ D+TFP +PC++L++D +D+ + I + RLD+ G I
Sbjct: 60 IMPDLDDRNTIKVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPIS- 118
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
DG SSD+
Sbjct: 119 --DG------------------------------RSSDDFV------------------- 127
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
+ + C+ EG+++ V KV GNFH + H H
Sbjct: 128 --SVAEGCRLEGYIK-----------------VAKVPGNFHISSHGRQHLLAQHF----- 163
Query: 239 FQRDSFNISHKINKLAFGE------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
+ N+ H I+ L+FG ++PLDG E P +YQYF+ +VPT+Y
Sbjct: 164 --PNGINVEHSIHHLSFGTIDVKKLAKKAALHPLDGKEHRSEVPM-VYQYFLDIVPTIY- 219
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+ S T+ + QF+ T SS + + V F Y LSPI V ++ VS HFLT VC
Sbjct: 220 ESSFSTVHTYQFTGTS---SSTPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVC 276
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
AI+GGV+TV+G++ F++ ++++ +GK
Sbjct: 277 AIIGGVYTVAGLLSRFVHSSAAQFQRRV-LGK 307
>gi|312376736|gb|EFR23738.1| hypothetical protein AND_12338 [Anopheles darlingi]
Length = 265
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 107/188 (56%), Gaps = 22/188 (11%)
Query: 85 LSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET 144
+S+DA D +GEQHL ++H I+K+RLD +GN IE PK + +Q R+ ET
Sbjct: 31 VSLDAQDSTGEQHLHIEHSIYKRRLDLEGNQIEE-------PKKED-IQVSTKRVSSTET 82
Query: 145 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID--QCKREGFLQRIKEEEGEG 202
S S+ + C N V +AYR++ W NP++ D QCK + EG
Sbjct: 83 PVTS-----STIKPACGN---VIDAYRERKW---NPNVEDFEQCKNSNHGAIEGKAFNEG 131
Query: 203 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-G 261
C+IYG +EVN+V G FH APGKSF +HVHD+ + FN SH+IN L+FGE F G
Sbjct: 132 CHIYGTMEVNRVEGRFHIAPGKSFSIQNIHVHDVQPYSSSRFNTSHRINTLSFGEQFDFG 191
Query: 262 VVNPLDGV 269
PLDG+
Sbjct: 192 TTQPLDGL 199
>gi|388501278|gb|AFK38705.1| unknown [Medicago truncatula]
Length = 148
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 62/134 (46%), Positives = 86/134 (64%)
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N+SH I+ L+FG +PG+ NPLD SG ++Y+IK+VPT Y +S + +NQF
Sbjct: 10 NVSHVIHDLSFGPKYPGIHNPLDETSRILHDASGTFKYYIKIVPTEYRYISKEVLPTNQF 69
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
SVTE+F +T P V+F YDLSPI VT EE SFLHF+T +CA++GG F V+G+
Sbjct: 70 SVTEYFSPITSQFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGM 129
Query: 365 IDAFIYHGQRAIKK 378
+D ++Y A K
Sbjct: 130 LDRWMYRLVEAATK 143
>gi|209877186|ref|XP_002140035.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555641|gb|EEA05686.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 384
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 169/389 (43%), Gaps = 53/389 (13%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TSRG 66
++ DA+ K +F +T GG +T++S + ML LF+SELR YL ++ VD T G
Sbjct: 1 MQRFDAFSKPIAEFRIKTAFGGYLTILSILTMLFLFYSELRYYLKVNRNDEITVDKTLAG 60
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESR-QDGIGA 125
+ I V FP LPC ++ + + L++Q N S +D I
Sbjct: 61 GNVNIKMLVEFPKLPCEVVGL-------------------RILNTQDNTEFSHPKDSIIY 101
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
I+ PL + CGSCY S CCN C EV +Y++ L +Q
Sbjct: 102 IPIN-PLNEESNI----GSSCGSCYNP-SKKNHCCNTCSEVIRSYQEDNIKLPQKINFEQ 155
Query: 186 CK---REGFLQRIKEEEG-EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
CK RE + I GC I + + KV G + + + + + DI +
Sbjct: 156 CKFDPRERLEKAISAPLNISGCKIKVDINIPKVKGRIEISHKRWMNYNEMTNLDIS--EA 213
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETP-------------SGMYQYFIKVVP 288
+N S+ + L +G+ PG+ N + + Q + +P
Sbjct: 214 HLYNFSYIVKYLHYGDDLPGINNIWNNQEYIQTAKFTHNKESDNLFLEDAHLDIDMHCIP 273
Query: 289 TVYTDV-SGHTIQSNQFSVTEHFRSSE---QGRL---QTLPGVFFFYDLSPIKVTFTEEH 341
T + + S T +QFSV + + GR +LPG++ YD +P V TE
Sbjct: 274 TQFNSINSKKTKIGHQFSVRKQSKQVNVLNNGRFVPETSLPGIYINYDFTPFIVKITESR 333
Query: 342 VSFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
SFL FLT CAI+GG+F S +ID F++
Sbjct: 334 RSFLSFLTECCAIIGGIFAFSSMIDIFMF 362
>gi|410907774|ref|XP_003967366.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Takifugu rubripes]
Length = 388
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 97/169 (57%), Gaps = 2/169 (1%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
C I+G L VNKVAGNFH GKS H H DS+N SH+I+ L+FGE PG
Sbjct: 167 ACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVSHDSYNFSHRIDHLSFGEDLPG 226
Query: 262 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT 320
+++PLDG + ++QYFI +VPT + +++Q+SVTE R+ +
Sbjct: 227 IISPLDGTEKVSADSNHIFQYFITIVPTKLNTYRV-SAETHQYSVTEQDRAINHAAGSHG 285
Query: 321 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
+ G+F YD++ + V TE+H+ FL +C I+GG+F+ +G+I +
Sbjct: 286 VSGIFMKYDINSLMVKVTEQHMPLWQFLVRLCGIIGGIFSTTGMIHGIV 334
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 51/89 (57%), Gaps = 1/89 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LDA+PK+ E + T SGG ++L++ +M +L F E +Y + + + VD
Sbjct: 10 LTLVKELDAFPKVPESYVESTASGGTVSLIAFSLMAILAFLEFFVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDIS 93
G LRIN D+T A+ C + D +D++
Sbjct: 70 FGSKLRINVDITV-AMRCQYIGADVLDLA 97
>gi|213512030|ref|NP_001133523.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
gi|209154344|gb|ACI33404.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
Length = 381
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 95/169 (56%), Gaps = 2/169 (1%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
C I+G L VNKVAGNFH GK+ H H D++N SH+I+ L+FGE PG
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDTYNFSHRIDHLSFGEEIPG 228
Query: 262 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQT 320
++NPLDG + M+QYFI +VPT + + +NQ+SVTE R
Sbjct: 229 IINPLDGTEKVCTDHNQMFQYFITIVPT-KLNTYQISADTNQYSVTERERVINHAVGSHG 287
Query: 321 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
+ G+F YD+S + V TE+H+ FL +C I+GG+F+ +G+I +
Sbjct: 288 VSGIFMKYDISSLMVKVTEQHMPLWRFLVRLCGIIGGIFSTTGMIHGMV 336
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 49/89 (55%), Gaps = 1/89 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LDA+PK+ E + T +GG ++L++ M LL F E +Y + + + VD
Sbjct: 11 LTLVKELDAFPKVPESYVETTATGGTVSLIAFTAMALLAFLEFFVYRDTWMQYEYEVDKD 70
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDIS 93
LRIN D+T A+ C + D +D++
Sbjct: 71 FSSKLRINIDITV-AMRCQFVGADVLDLA 98
>gi|146097219|ref|XP_001468078.1| hypothetical protein, unknown function [Leishmania infantum JPCM5]
gi|134072444|emb|CAM71154.1| hypothetical protein, unknown function [Leishmania infantum JPCM5]
Length = 309
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 168/381 (44%), Gaps = 92/381 (24%)
Query: 12 DAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV--DTSRGETL 69
D + I D T +G +I++ +VM+LLF E+ Y+ ++ +++ D T+
Sbjct: 11 DFFRHIPRDLTEPTTAGSIISVACVVVMVLLFAGEVISYVFPRIQSDMIIMPDLDDRNTI 70
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID 129
+++ D+TFP +PC++L++D +D+ + I + RLD+ G I DG
Sbjct: 71 KVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPIS---DG------- 120
Query: 130 KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKRE 189
SSD+ + + C+ E
Sbjct: 121 -----------------------RSSDDFV---------------------SVAEGCRLE 136
Query: 190 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 249
G+++ V KV GNFH + H H + N+ H
Sbjct: 137 GYIK-----------------VAKVPGNFHISSHGRQHLLAQHF-------PNGINVEHS 172
Query: 250 INKLAFGE------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
I+ L+FG ++PLDG E P +YQYF+ +VPT+Y + S T+ + Q
Sbjct: 173 IHHLSFGTIDVKKLAKKAALHPLDGKEHRSEVPM-VYQYFLDIVPTIY-ESSFSTVHTYQ 230
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
F+ T SS + + V F Y LSPI V ++ VS HFLT VCAI+GGV+TV+G
Sbjct: 231 FTGTS---SSTPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAG 287
Query: 364 IIDAFIYHGQRAIKKKIEIGK 384
++ F++ ++++ +GK
Sbjct: 288 LLSRFVHSSAAQFQRRV-LGK 307
>gi|402224967|gb|EJU05029.1| DUF1692-domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 517
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 103/351 (29%), Positives = 170/351 (48%), Gaps = 51/351 (14%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
+D+I ++S DA+PK+ + +R+ GG ITL +++ LLL ++ Y+ T + +
Sbjct: 13 LDSIGAPLKSFDAFPKVPSTYRTRSSGGGFITLGIALLCLLLVLNDWAEYVWGTTTWRFV 72
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQ-HLDVKHDIFKKRLDSQGNVIESR 119
VD + + +N D+T A+PC +SVD D G++ HL D FK+ G + ++R
Sbjct: 73 VDDKIEKEMMLNVDITV-AMPCHYISVDLRDAVGDRLHLS---DQFKR----DGTLFDAR 124
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
Q A I E Y + Y A+ + VREA ++G +
Sbjct: 125 Q----ATHI-------------REQY--TDYSAQ----------QMVREAKTRRG-RIGI 154
Query: 180 PDLIDQCKREGFLQRIKE-EEGEGCNIYGFLEVNKVAGNFHFAP-GKSFHQSGVHVHDIL 237
D + + + F ++G C +YG +EV KV N H G +H + H ++
Sbjct: 155 FDWLRRRQPSAFQPTFNHVKDGSACRVYGSMEVKKVQANLHITTLGHGYHSNEHTDHSLM 214
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
N+SH I + +FG +FP +V PLD + + P +QYF+ VVPT Y G
Sbjct: 215 -------NLSHIITEFSFGPYFPDIVQPLDYTIESSDDPFTAFQYFLTVVPTEYRTSKG- 266
Query: 298 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
+++NQ+SV H + + GR P +FF YDL P+ + + + + FL
Sbjct: 267 VVKTNQYSVGSHMQHIQHGR--GTPVIFFKYDLEPLSLIVEQRTTTLIQFL 315
>gi|401416963|ref|XP_003872975.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322489202|emb|CBZ24457.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 368
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 175/380 (46%), Gaps = 48/380 (12%)
Query: 14 YPKINEDFY-SRTFSGGVITLVSSIVMLLLFFSELRLYLNA--VTETKLLVDTSRGETLR 70
+PK ED+ +T G V+++ + ++++L E YL +T + +D E +
Sbjct: 2 FPKPKEDYQREQTRWGAVLSVATVSIVIILVLWEGAAYLRGRDAYDTDISLDRGLSEDMP 61
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK-ID 129
++FDV FP +PC+ LS+D +D +G + + K G V+ G+ K +D
Sbjct: 62 VHFDVFFPFMPCNRLSIDVVDTTGMAKFNYTGTLHKLPTALDGRVLYK-----GSLKDLD 116
Query: 130 KPLQRHGGRLEHNETYCGSC-------YGAE---SSDEDCCNNCEEVREAYRKKGWALSN 179
++ R N T C C AE ++ CC+ CE V + Y++ G +
Sbjct: 117 NAMETEEAR---NGTKCRPCPPSAFDGVAAEVRSAAVSKCCDTCESVLDLYKELGKGIPG 173
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKS--FHQSGVHVHDIL 237
+ + QC + + ++ GCN+ G L++ KV F P ++ F+ + D++
Sbjct: 174 TEYLPQCLEQLY------QQASGCNVVGSLDLKKVHVTVIFGPRRTGRFYS----LKDVI 223
Query: 238 AFQRDSFNISHKINKLAFG----EHFP--GVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 291
+ SH I KL G E F GV PL G + +T S +Y +KVVPT Y
Sbjct: 224 -----RLDTSHSIRKLRIGDEAVERFSKNGVAEPLSGHKSFSKTYSET-RYLVKVVPTTY 277
Query: 292 TDVSGHTIQSNQFSVTEHF--RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
+++ + + + R+ G +P V F ++ +PI+V E F HF+
Sbjct: 278 RKTKKRNAKASTYEYSAQWSKRTIVVGFAGAVPAVLFEFEPAPIQVNNVFERQPFSHFVV 337
Query: 350 NVCAIVGGVFTVSGIIDAFI 369
+C IVGG+F V G ID +
Sbjct: 338 QLCGIVGGLFVVLGFIDNVV 357
>gi|260950511|ref|XP_002619552.1| hypothetical protein CLUG_00711 [Clavispora lusitaniae ATCC 42720]
gi|238847124|gb|EEQ36588.1| hypothetical protein CLUG_00711 [Clavispora lusitaniae ATCC 42720]
Length = 347
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 163/365 (44%), Gaps = 56/365 (15%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD +K+R DA+PK+ + R+ GG T+++ LL+ + ++ YL + +
Sbjct: 1 MDNFSSKVRVFDAFPKVAPEASVRSQRGGFSTILTVFCGLLIIWIQIGGYLGGYIDRQFS 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD + L IN D+ A+PC +S + MDI+ +++L G V+ +
Sbjct: 61 VDNETRKDLNINLDMVV-AMPCQFISTNVMDITSDRYL-------------AGEVLNFQG 106
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
G P+ E++D D E ++E R + + ++
Sbjct: 107 TGFYVPEF-------------------FALNRENNDYDTPELDEIMQETLRAE-YGIAGA 146
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ E+ C+I+G + VN V G F P S ++ D +
Sbjct: 147 RV--------------NEDAPACHIFGTIPVNHVRGEFFIVPKGSMYR------DRSSID 186
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
++N SH I++ +FG+ +P + NPLD E Y+YF K+VPT Y + G +
Sbjct: 187 PKAYNFSHVISEFSFGDFYPFITNPLDFTAKVTEENRQAYRYFAKLVPTHYEKL-GLVVD 245
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+ Q+S+TE + + R PG+FF Y PIK+T E+ + F F+ + ++ G+
Sbjct: 246 TYQYSLTE-IHNVDHNRGIPPPGIFFDYSFEPIKLTIREKRIGFFAFVARLMTVLSGLLI 304
Query: 361 VSGII 365
+G +
Sbjct: 305 AAGYL 309
>gi|400594740|gb|EJP62573.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Beauveria bassiana ARSEF 2860]
Length = 374
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 161/361 (44%), Gaps = 50/361 (13%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK ++ +RT GG T+ + L+L SE+ + V+
Sbjct: 21 VSAFDAFPKSKPEYVTRTAGGGKWTVAMIFISLVLMGSEVARWWRGEQTHNFAVEKGISH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
++IN D+ L C+ L ++ D SG++ L + + R +
Sbjct: 81 EMQINLDIVVNML-CADLHINVQDASGDRIL--------------ASAMLHRDPTKWSQW 125
Query: 128 IDKPLQRHG----GRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
+D + + G GR+ E + NN E E + D++
Sbjct: 126 VDNGVHKLGHDANGRVNTGEGWT-----------SLANNDEGFGEEHVH--------DIV 166
Query: 184 DQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQ 240
K+ + G + C IYG L++NKV G+FH A G + + G H+
Sbjct: 167 ALGKKRAKWSKTPRFWGTADSCRIYGSLDLNKVQGDFHITARGHGYMEFGQHL------D 220
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D FN SH I++L++G +P +VNPLD +QY++ VVPTVY+ V TIQ
Sbjct: 221 HDKFNFSHVISELSYGAFYPSLVNPLDRTVNVAAAHFHKFQYYLSVVPTVYS-VGRSTIQ 279
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+NQ++VTE +S E +PG+F YD+ PI + E SF+ FL + +V GV
Sbjct: 280 TNQYAVTE--QSKEIDEHSAVPGIFVKYDIEPILLAVHESRDSFIVFLLKLINVVSGVLV 337
Query: 361 V 361
Sbjct: 338 A 338
>gi|157874469|ref|XP_001685717.1| hypothetical protein LMJF_32_3910 [Leishmania major strain
Friedlin]
gi|68128789|emb|CAJ08922.1| hypothetical protein LMJF_32_3910 [Leishmania major strain
Friedlin]
Length = 309
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 170/392 (43%), Gaps = 93/392 (23%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M A N R+ D + I D T +G +I++ +VM+LLF E+ Y+ ++ ++
Sbjct: 1 MRAARNWQRA-DFFRHIPRDLTESTTAGSIISVACVVVMVLLFAGEVIAYVFPRIQSDMI 59
Query: 61 V--DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES 118
+ D T++++ D+TFP +PC++L++D +D+ + I + RLD+ G I
Sbjct: 60 IMPDLDDRNTIKVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPI-- 117
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
SD ++ V E
Sbjct: 118 ------------------------------------SDGRSSDDFVSVAEG--------- 132
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
C+ EG+++ V KV GNFH + H H
Sbjct: 133 -------CRLEGYIK-----------------VAKVPGNFHISSHGRQHLLAQHF----- 163
Query: 239 FQRDSFNISHKINKLAFGE------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
+ N+ H I+ L+FG ++PLDG E P +YQYF+ +VPT+Y
Sbjct: 164 --PNGINVEHSIHHLSFGTIDVKKLAKKAALHPLDGKEHRSEMPM-VYQYFLDIVPTIY- 219
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+ S T+ + QF+ T SS + + V F Y LSPI V ++ VS HFLT VC
Sbjct: 220 ESSFSTVYTYQFTGTS---SSTPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVC 276
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
AI+GGV+TV+G++ F++ ++ + +GK
Sbjct: 277 AIIGGVYTVAGLLSRFVHSSAAQFQRHV-LGK 307
>gi|261193579|ref|XP_002623195.1| COPII-coated vesicle protein [Ajellomyces dermatitidis SLH14081]
gi|239588800|gb|EEQ71443.1| COPII-coated vesicle protein [Ajellomyces dermatitidis SLH14081]
gi|239613876|gb|EEQ90863.1| COPII-coated vesicle protein [Ajellomyces dermatitidis ER-3]
gi|327349942|gb|EGE78799.1| COPII-coated vesicle protein [Ajellomyces dermatitidis ATCC 18188]
Length = 401
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 111/407 (27%), Positives = 170/407 (41%), Gaps = 72/407 (17%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I + +R+ DA+PK + S T GG T++ + L +ELR + V V+
Sbjct: 19 GIGSGLRTFDAFPKTKPTYTSSTVRGGQWTIIVFALCAFLSINELRTWYRGVENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG------NVI 116
L++N D+ A+PC L V+ D G++ L D+ K+ S NV+
Sbjct: 79 KGISRELQMNLDIVV-AMPCDALRVNVQDAVGDRIL--ASDLLDKQPTSWAAWNRELNVV 135
Query: 117 ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDED--CCNNCEEVREAYRKKG 174
S GG E+ +ED + E + +Y++K
Sbjct: 136 SS-----------------GGSREYQTLNEEDAVRLMEQEEDVHVGHALGEAQRSYKRK- 177
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHV 233
P L + E + C IYG L NKV G+FH A G + + G H+
Sbjct: 178 -FPKGPKL------------KRGENADSCRIYGSLVGNKVQGDFHITARGHGYFEFGEHL 224
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
DSFN SH I +L+FG H+ ++NPLD T YQY++ +VPT+YT
Sbjct: 225 ------SHDSFNFSHMITELSFGPHYSTLLNPLDKTISTTPAHFHKYQYYMSIVPTIYTR 278
Query: 294 VS--------------------GHTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSP 332
G+TI +NQ++VT RS E + +PG+FF Y + P
Sbjct: 279 AGVVDPYSQALPDPSTITPSQRGNTIFTNQYAVTS--RSHELPDAEYDVPGIFFKYTIEP 336
Query: 333 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
I + +EE S L L + ++ GV G + +KK+
Sbjct: 337 ILLVVSEERGSLLALLVRLVNVLAGVVVAGGWLFQIFTWAMDNLKKR 383
>gi|344229081|gb|EGV60967.1| DUF1692-domain-containing protein [Candida tenuis ATCC 10573]
gi|344229082|gb|EGV60968.1| hypothetical protein CANTEDRAFT_115996 [Candida tenuis ATCC 10573]
Length = 352
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 165/379 (43%), Gaps = 77/379 (20%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD ++R+ DA+PK++ + R+ G + T+ + L++ + E+ +L + + +
Sbjct: 1 MDGFATRVRTFDAFPKVDSEHTVRSLRGALSTIATYFFALVILWVEVGGFLGGYVDHQFV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK----------RLD 110
VD L IN D+T +PC ++ + +DI+ ++ L + F+ R++
Sbjct: 61 VDDQIRTNLSINIDMTV-TMPCELIHTNVVDITDDRFLAAELLNFEGVHFFAPPQFFRIN 119
Query: 111 SQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAY 170
SQ E+ P +D ++ E +R +
Sbjct: 120 SQNKEYET-------PDLDHVMR------------------------------ENIRAEF 142
Query: 171 RKKGWALSNPDLIDQCKREGFLQRIKEEEGE-GCNIYGFLEVNKVAGNFHFAPGKSFHQS 229
G Q+I + G C+I+G + VN V G FH
Sbjct: 143 YISG------------------QKINQVAGAPACHIFGTIPVNHVQGEFHIT------AK 178
Query: 230 GVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT 289
GV D L + N SH I + +FG +P + NPLD Y+Y+ VVPT
Sbjct: 179 GVGYQDSLHTPWERMNFSHVIQEFSFGTFYPMIDNPLDMSGKITHESLQSYKYYSNVVPT 238
Query: 290 VYTDVSGHTIQSNQFSVTEH---FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
+Y + G + +NQ+S++E R GR+ + PG+FF Y+ PIK+T E+ + F+
Sbjct: 239 LYERL-GIVVDTNQYSISEQHLVIRKDSNGRIYSPPGIFFKYEFEPIKLTIVEKRLPFIQ 297
Query: 347 FLTNVCAIVGGVFTVSGII 365
F+ + I+GG+ ++G +
Sbjct: 298 FVARLGTILGGLLILAGYV 316
>gi|448521200|ref|XP_003868450.1| Erv41 protein [Candida orthopsilosis Co 90-125]
gi|380352790|emb|CCG25546.1| Erv41 protein [Candida orthopsilosis]
Length = 352
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 164/371 (44%), Gaps = 63/371 (16%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD+ ++++ DA+PK++ R+ GG+ TL++ LL+ + E+ Y+ + + +
Sbjct: 1 MDSFSKRVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFFGLLILWVEIGGYIGGYVDRQFI 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK--KRLDSQGNVIES 118
VD L IN D+ A+PC L +A+DI+G++ L + F+ K G I +
Sbjct: 61 VDDVLRSDLTINLDMIV-AMPCEFLHTNAVDIAGDRFLAGETLNFEGLKFFIPSGFSINN 119
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
D P +D+ +Q E +R + + G
Sbjct: 120 PNDFHETPDLDEVMQ------------------------------ESLRAEFSQLG---- 145
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
R E C+I+G + VN+V G F G+ D
Sbjct: 146 ---------------RRVNEGAPACHIFGSIPVNQVKGEFRIT------AKGLGYKDRSF 184
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
++ N SH I + ++G+ FP + NPLD E +Y Y KVVPT+Y + G
Sbjct: 185 VPVEALNFSHVIQEFSYGDFFPFLNNPLDATGKVTEENLQIYLYHSKVVPTLYEKL-GLE 243
Query: 299 IQSNQFSVTEHFR----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+ + Q+S+TE+ + + Q +PG++F Y+ PIK+ E+ + FL F+ + I
Sbjct: 244 VDTTQYSLTENHHIVKVNPHSKKPQGIPGIYFAYEFEPIKLIIREKRIPFLQFIAKLGTI 303
Query: 355 VGGVFTVSGII 365
VGG+ +G +
Sbjct: 304 VGGIIVAAGYL 314
>gi|195439332|ref|XP_002067585.1| GK16119 [Drosophila willistoni]
gi|194163670|gb|EDW78571.1| GK16119 [Drosophila willistoni]
Length = 443
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 185/377 (49%), Gaps = 25/377 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + T GG ++L+S ++++ L ++EL+ Y + ET+++ D +
Sbjct: 19 KNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELQYYWH---ETQIIYQFEPDIA 75
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
E + ++ D+T A+PC+ LS VD MD + + D+F + V D
Sbjct: 76 LEEQVPMHVDITV-AMPCASLSGVDLMD-------ETQQDVFAYGTLQREGVWWEMSDAD 127
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEE-VREAYRKKGWALSNPDL 182
L H R E + + D + + A G S P +
Sbjct: 128 RMQFQSAQLTNHYLR-EQYHSVADILFKDIMRDGILKGRSDSSAKPAAPPPG---SLPAV 183
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+D ++ LQ+ E + + C ++G L +NKVAG H G H ++ F+R
Sbjct: 184 LD-LHQDTHLQQ-PEAKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFQDHWMIEFRRM 241
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
N +H+IN+L+FG++ +V PL+G + + QYF+K+VPT + + TI +
Sbjct: 242 PANFTHRINRLSFGQYSRRIVQPLEGDETIIQEEATTVQYFLKIVPT-EIEQTFSTINTF 300
Query: 303 QFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
Q+SVTE+ R + R PG++F YD S +K+ + + L F+ +C+I+ G+ +
Sbjct: 301 QYSVTENVRKLDSERNSYGSPGIYFKYDWSALKIVVSNDRDHILTFVIRLCSIISGIIVL 360
Query: 362 SGIIDAFIYHGQRAIKK 378
SG I++ + QR + +
Sbjct: 361 SGAINSLLLGMQRRLLR 377
>gi|389749487|gb|EIM90658.1| DUF1692-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 533
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 157/363 (43%), Gaps = 54/363 (14%)
Query: 1 MDAIMNK-IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKL 59
+DA+ + I+S DA+PK+ + SR+ S G +T+ + + LL +++ ++ + +
Sbjct: 12 LDALAPESIKSFDAFPKLPATYKSRSESRGFLTIFVAFLAFLLVLNDIGEFIWGWPDHEF 71
Query: 60 LVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESR 119
VD + +N D+ +PC LSVD D+ G++
Sbjct: 72 AVDRDDSSFMNVNVDLVV-NMPCRWLSVDLRDVVGDRLF--------------------- 109
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
+ K +R G + + A + + VR++ + +G
Sbjct: 110 --------LSKGFRRDGTLFDIGQAT------ALKEHAKALSTRQAVRQSRKSRG----- 150
Query: 180 PDLIDQCKREGFLQRIK---EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
D +R + + + +G C +YG LEV KV N H + S VHV
Sbjct: 151 --FFDLFRRSQDIYKPTYNYQADGSACRVYGSLEVKKVTANLHITSLGHGYASKVHV--- 205
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
N+SH I + +FG HFP +V PLD YQYF++VVPT Y
Sbjct: 206 ---DHTKINMSHVITEFSFGPHFPDIVQPLDNSFEITHDHFTAYQYFMRVVPTTYVAPRS 262
Query: 297 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+ +NQ+SVT + R+ EQ PG+FF +++ P+++ + +F F +VG
Sbjct: 263 APLNTNQYSVTHYTRTFEQ-HSGLAPGIFFKFEIEPVRLIQHQRTTTFAQFFVRWAGVVG 321
Query: 357 GVF 359
GVF
Sbjct: 322 GVF 324
>gi|154343635|ref|XP_001567763.1| hypothetical protein, unknown function [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065095|emb|CAM43209.1| hypothetical protein, unknown function [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 309
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 100/381 (26%), Positives = 166/381 (43%), Gaps = 92/381 (24%)
Query: 12 DAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV--DTSRGETL 69
D + I +D T SG +I++ VM+LLF E+ Y++ ++ +++ D T+
Sbjct: 11 DFFRHIPKDLTESTTSGAIISIACVTVMVLLFVGEVISYVSPRIQSDMIILPDLDETSTI 70
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID 129
+++ D+TFP +PC+IL++D +D+ + I + RLD G I DGI
Sbjct: 71 KVSMDITFPKMPCAILTLDILDVLHNHMFNSMDHITRTRLDPAGKPIS---DGIS----- 122
Query: 130 KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKRE 189
++ + + G C+ E
Sbjct: 123 ------------SDLFVSAAEG----------------------------------CRLE 136
Query: 190 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 249
G+++ V KV GNFH + S H ++ + N H
Sbjct: 137 GYIK-----------------VGKVPGNFHIS-------SHGRQHLLMTHFPNGTNAEHS 172
Query: 250 INKLAFGE------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
I+ L+FG ++PLDG E P +YQYF+ +VPT+Y + S T + Q
Sbjct: 173 IHHLSFGTLDVKKLDKKAQLHPLDGKEHRSEVPK-IYQYFLDIVPTIY-ESSFSTAHTYQ 230
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
F+ T ++ V F Y +SPI V ++ VS HFLT VCAI+GGV+TV+G
Sbjct: 231 FTGTSSSSPVPSSQMA---AVVFQYQMSPITVRYSSARVSLTHFLTYVCAIIGGVYTVAG 287
Query: 364 IIDAFIYHGQRAIKKKIEIGK 384
++ F++ +++I +GK
Sbjct: 288 LLSRFVHSSAAQFQRRI-LGK 307
>gi|348529156|ref|XP_003452080.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oreochromis niloticus]
Length = 379
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 65/173 (37%), Positives = 96/173 (55%), Gaps = 2/173 (1%)
Query: 198 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 257
E C I+G + VNKVAGN H GK H H H +++N SH+I+ L+FGE
Sbjct: 164 EPLNACRIHGHVYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHETYNFSHRIDHLSFGE 223
Query: 258 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQG 316
PG++NPLDG + M+QYFI VVPT + + ++QFSVTE R +
Sbjct: 224 ELPGIINPLDGTEKITYNNNQMFQYFITVVPT-KLNTYKISADTHQFSVTERERVINHAA 282
Query: 317 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
+ G+F YD S + VT +E+H+ FL +C I+GG+F+ +G++ +
Sbjct: 283 GSHGVSGIFVKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIFSTTGMLHGLV 335
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 49/89 (55%), Gaps = 1/89 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK++E + + SGG ++L++ M LL E +Y + + VD
Sbjct: 10 LSLVKELDAFPKVSESYVETSASGGTVSLLAFSAMALLAVLEFFVYRETWMKYEYSVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDIS 93
LRIN D+T A+ C + D +D++
Sbjct: 70 FSSKLRINIDITV-AMKCQHVGADILDLA 97
>gi|146079597|ref|XP_001463805.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398011570|ref|XP_003858980.1| hypothetical protein, conserved [Leishmania donovani]
gi|134067893|emb|CAM66174.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322497192|emb|CBZ32265.1| hypothetical protein, conserved [Leishmania donovani]
Length = 368
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 105/375 (28%), Positives = 169/375 (45%), Gaps = 38/375 (10%)
Query: 14 YPKINEDFY-SRTFSGGVITLVSSIVMLLLFFSELRLYLNA--VTETKLLVDTSRGETLR 70
+PK ED+ +T G ++++ + ++ L E YL +T + +D E +
Sbjct: 2 FPKPKEDYQREQTRWGALLSVFTVFFVIFLVLWEGAAYLRGRDAYDTDVSLDKGLSEDMP 61
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK-ID 129
++FDV FP +PC+ LS+D +D +G + + K G V+ G+ K +D
Sbjct: 62 VHFDVLFPFMPCNRLSIDVVDTTGMAKFNYTGRLHKLPTALDGEVVYK-----GSLKDLD 116
Query: 130 KPLQRHGGRLEHNETYC------GSCYGAESSDE-DCCNNCEEVREAYRKKGWALSNPDL 182
++ GR C G S+ E CC+ CE V + Y++ G + +
Sbjct: 117 NEMETREGRAGKKCRPCPPSAFDGVPAEVRSAAELKCCDTCESVLDLYKELGKGIPGTEY 176
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
I QC E QR GC + G L++ KV F P ++ H + D++
Sbjct: 177 IPQC-LEQLYQR-----ASGCTVMGSLDLKKVPVTVIFGPRRTGHF--YSLKDVI----- 223
Query: 243 SFNISHKINKLAFG----EHFP--GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
+ SH I KL G E F GV PL G + + +T S +Y +KVVPT Y
Sbjct: 224 RLDTSHFIRKLRIGDETVERFSKNGVAEPLSGHKSSSKTYSET-RYLVKVVPTTYRKTKT 282
Query: 297 HTIQSNQFSVTEHF--RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+++ + + + R+ G +P V F ++ +PI+V E F HFL +C I
Sbjct: 283 KNAKASTYEYSAQWSRRTIVVGFAGAVPAVLFEFEPAPIQVNNVFERQPFSHFLVQLCGI 342
Query: 355 VGGVFTVSGIIDAFI 369
VGG+F V G ID +
Sbjct: 343 VGGLFVVLGFIDNVV 357
>gi|302422316|ref|XP_003008988.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
gi|261352134|gb|EEY14562.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
Length = 374
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 161/363 (44%), Gaps = 52/363 (14%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSEL-RLYLNAVTETKLLVDTSRG 66
+ + DA+PK + RT GG T+ +++ ++LF+ EL R + T+L +
Sbjct: 20 VSAFDAFPKSKPQYVQRTSGGGKWTVAMAVISVMLFWPELGRGGRGSREPTRLRSRRASA 79
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
TL++N D+ + C L ++ D SG+ L +L + D G
Sbjct: 80 TTLQVNLDIVV-KMRCEDLHINVQDASGDLILAAT------KLREEITSWHQWADITGNH 132
Query: 127 KIDKPLQRHGGRLEHNETY-CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
K+ + GR+E N Y +G E D++ Q
Sbjct: 133 KLGRSPS---GRIETNSGYHLDEGFGEEHVH------------------------DIVAQ 165
Query: 186 CKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRD 242
K+ R G + C I+G L++NKV G+FH A G + +G H+
Sbjct: 166 SKKRQKWARTPRLRGPPDSCRIFGSLDLNKVQGDFHITARGHGYQGAGQHL------DHT 219
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGHT 298
SFN SH +N+L+FG +P + NPLD +QY++ +VPTVYT +T
Sbjct: 220 SFNFSHIVNELSFGAFYPNLENPLDRTVNLASANFHKFQYYLSIVPTVYTVGRSASKANT 279
Query: 299 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+ +NQF+VTE +S E G ++PGVF YD+ PI + E F+ F V ++ GV
Sbjct: 280 VYTNQFAVTE--QSKEVGD-HSVPGVFVKYDIEPILLLVEETRPGFVQFWLKVINVLSGV 336
Query: 359 FTV 361
Sbjct: 337 LVA 339
>gi|315054535|ref|XP_003176642.1| hypothetical protein MGYG_00729 [Arthroderma gypseum CBS 118893]
gi|311338488|gb|EFQ97690.1| hypothetical protein MGYG_00729 [Arthroderma gypseum CBS 118893]
Length = 399
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 103/391 (26%), Positives = 168/391 (42%), Gaps = 66/391 (16%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D I K+++ DA+PK + S + GG+ T+ +I+ LL SEL + V
Sbjct: 18 DGIATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAILCTLLTCSELITWYRGHENHHFSV 77
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
+ + +++N D T A+PC + ++ D +G+ L G+++
Sbjct: 78 ERGVSQEMQLNID-TVVAMPCDDVRINIQDAAGDHIL-------------AGDLLTQEPT 123
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCC--NNCEEVREAYRKK---GWA 176
A + +R GG E+ E +ED + EVR + +KK
Sbjct: 124 SWAAWNREMNKRRSGGSPEYQTLNKEDTLRLEEQEEDLHVEHVLGEVRRSRKKKFPKAPK 183
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHD 235
+ D++D C+ ++G LE NKV GN H A G + + G
Sbjct: 184 MKKSDVVDSCR-----------------VFGSLEGNKVQGNLHITARGFGYFEWG----- 221
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
A S N +H I +L+FG H+ ++NPLD T YQY + VVPT+YT S
Sbjct: 222 -RATNPHSLNFTHLITELSFGPHYGRLLNPLDKTVSTTSVNFYKYQYHLSVVPTIYTK-S 279
Query: 296 GH---------------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIK 334
GH T+ +NQ++VT + Q R+ + PG+FF Y++ PI
Sbjct: 280 GHMDPSRRSLPDSSTITAKDSKTTVSTNQYAVTS-YSQPIQPRIDSTPGIFFKYNIEPIL 338
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ ++E S L + + +V GV G +
Sbjct: 339 LIVSQERDSLLGLMIRLVNVVSGVLVTGGWL 369
>gi|123499008|ref|XP_001327531.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121910461|gb|EAY15308.1| hypothetical protein TVAG_394520 [Trichomonas vaginalis G3]
Length = 357
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 171/382 (44%), Gaps = 57/382 (14%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD----- 62
+R D +PK++ + T GG++++ S V ++LFFSE+ YLN + +VD
Sbjct: 3 LRKFDVFPKLDRQYRVSTSFGGILSIASITVTIILFFSEIHTYLNPPIRQRFIVDNTKPM 62
Query: 63 -----TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK---RLDSQGN 114
+S L +N D+ FP +PC +L +D +D + LD+ + RLD G
Sbjct: 63 GISGKSSNQRKLSVNLDIEFPNVPCYLLHIDVVDPISQ--LDLPMESISNNFARLDKTGK 120
Query: 115 VIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 174
IG +K L+ + + SCY A ++ C C++V +A++ +
Sbjct: 121 -------NIGDFHPEKFLEPDNAKTSDST----SCYAANNT--KVCKTCKDVVQAHKNQE 167
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
I QC + I+E + EGC + + ++A FH APG ++ G H H
Sbjct: 168 LLPPPLSTIAQCASTAAI--IQEMKDEGCKLTSAFQTVRLASEFHVAPGYNYLYKGWHSH 225
Query: 235 D--ILAFQRDSFNISHKINKLAFGE---HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT 289
+ IL + N++H I F F PLD V Q T G ++
Sbjct: 226 NTTILGSESKDLNLTHIIRSFRFNRVDGKF-----PLDNVTSIQ-TGKGSWR-------V 272
Query: 290 VYT-DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
VY+ D+ +T +N++ + + + S GV+F Y ++P+ + FLH
Sbjct: 273 VYSADIMDNTYTANKYELMDPPKFSS--------GVYFRYAINPVSAIDYYDTEPFLHLC 324
Query: 349 TNVCAIVGGVFTVSGIIDAFIY 370
T + ++G V ++D+F++
Sbjct: 325 TRLLTVIGAVLAAFRLLDSFLF 346
>gi|346322712|gb|EGX92310.1| COPII-coated vesicle protein (Erv41), putative [Cordyceps militaris
CM01]
Length = 376
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 158/355 (44%), Gaps = 37/355 (10%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK ++ +RT GG T+V + L+L SE+ + V+
Sbjct: 21 VSAFDAFPKSKPEYVTRTAGGGKWTVVIVFISLVLMGSEVGRWWRGSETHNFAVEKGISH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
++IN D+ L C+ L ++ D SG++ L L + D G K
Sbjct: 81 DMQINLDIVVHML-CNDLHINVQDASGDRILAAS------MLHRDPTMWSHWVDQAGVHK 133
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ GR+ E + + E E+ ++ V ++ W+ P
Sbjct: 134 LGHDAN---GRVNTGEGWTSLAHNDEGFGEEHVHDI--VALGKKRAKWS-KTPRFWGTA- 186
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
+ C +YG L++NKV G+FH A G + + G H+ + FN
Sbjct: 187 -------------DSCRVYGSLDLNKVQGDFHITARGHGYMEFGQHL------DHNQFNF 227
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I++L++G +P +VNPLD +QY++ VVPT+Y+ V TIQ+NQ++V
Sbjct: 228 SHVISELSYGAFYPSLVNPLDRTVNLAAAHFHKFQYYLSVVPTIYS-VGSSTIQTNQYAV 286
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
TE +S E +PG+F YD+ PI + E SF FL + IV GV
Sbjct: 287 TE--QSKEIDEHSAVPGIFVKYDIEPILLAVHESRDSFPVFLLKLINIVSGVLVA 339
>gi|71480113|ref|NP_001025133.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Danio rerio]
gi|78099248|sp|Q4V8Y6.1|ERGI1_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|66911928|gb|AAH97146.1| Zgc:114085 [Danio rerio]
Length = 290
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 79/200 (39%), Positives = 107/200 (53%), Gaps = 22/200 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
++ G GC G +NKV GNFH V H A Q S +++H I+KL
Sbjct: 106 KVPLNNGHGCRFEGEFSINKVPGNFH-----------VSTHSATA-QPQSPDMTHIIHKL 153
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
AFG +H G N L G Q + Y +K+VPTVY ++ G S Q++V
Sbjct: 154 AFGAKLQVQHVQGAFNALGGADRLQSNALASHDYILKIVPTVYEELGGKQRFSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F F+T +CAI+GG FTV+GIID
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRRPFYRFITTICAIIGGTFTVAGIID 271
Query: 367 AFIYHGQRAIKKKIEIGKFS 386
+ I+ A KKI+IGK S
Sbjct: 272 SCIFTASEAW-KKIQIGKMS 290
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 46/94 (48%), Gaps = 3/94 (3%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+R D Y K+ +D T++G I++ + ML LF SEL ++ +L V D
Sbjct: 5 VRRFDIYRKVPKDLTQPTYTGAFISICCCVFMLFLFLSELTGFIATEIVNELYVDDPDKD 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
G + ++ +++ P L C ++ +D D G +
Sbjct: 65 SGGKIDVSLNISLPNLHCDLVGLDIQDEMGRHEV 98
>gi|410914052|ref|XP_003970502.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Takifugu rubripes]
Length = 290
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 108/200 (54%), Gaps = 22/200 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I +G GC G +NKV GNFH S H + Q + +++H I+KL
Sbjct: 106 KIPLNQGAGCRFEGEFIINKVPGNFHI----STHSASA--------QPQNPDMTHFIHKL 153
Query: 254 AFGEHF-----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
AFG+ G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 154 AFGDKLQMHQEKGAFNALGGADRLASNPLASHDYILKIVPTVYEDLSGKQKFSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F F+T +CAIVGG FTV+GIID
Sbjct: 214 KEYVAYSHTGRI--VPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGIID 271
Query: 367 AFIYHGQRAIKKKIEIGKFS 386
+ I+ A KKI+IGK S
Sbjct: 272 SCIFTASEAW-KKIQIGKMS 290
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 3/94 (3%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+R D Y K+ +D T++G I+++ + +L LF SEL ++ +L V D
Sbjct: 5 VRRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATEIVNELYVDDPDKD 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
G + ++ ++T P L C ++ +D D G +
Sbjct: 65 SGGKIEVSLNITLPNLHCDLVGLDIQDEMGRHEV 98
>gi|47222972|emb|CAF99128.1| unnamed protein product [Tetraodon nigroviridis]
Length = 288
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 108/200 (54%), Gaps = 22/200 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I +G GC G +NKV GNFH S H + Q + +++H I+KL
Sbjct: 104 KIPLNQGGGCRFEGEFNINKVPGNFHI----STHSASA--------QPQNPDMTHFIHKL 151
Query: 254 AFGEHF-----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
AFG+ G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 152 AFGDKLQMHQVKGAFNALGGADRLASNPLASHDYILKIVPTVYEDLSGKQKFSYQYTVAN 211
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F F+T +CAIVGG FTV+GIID
Sbjct: 212 KEYVAYSHTGRI--VPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGIID 269
Query: 367 AFIYHGQRAIKKKIEIGKFS 386
+ I+ A KKI+IGK S
Sbjct: 270 SCIFTASEAW-KKIQIGKMS 288
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 51/109 (46%), Gaps = 3/109 (2%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+ D Y K+ +D T++G I+++ + +L LF SEL ++ +L V D
Sbjct: 3 LHRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATEIVNELYVDDPDKD 62
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG 113
G + ++ ++T P L C ++ +D D G + + K L+ G
Sbjct: 63 SGGKIEVSLNITLPNLHCDLVGLDIQDEMGRHEVGHIENSMKIPLNQGG 111
>gi|391872305|gb|EIT81439.1| COPII vesicle protein [Aspergillus oryzae 3.042]
Length = 390
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 161/382 (42%), Gaps = 59/382 (15%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +++ DA+PK D+ + + GG T++ ++ + SE + + V+
Sbjct: 19 GLQGGLKTFDAFPKTKPDYTAPSRRGGQWTVLILLICSVFSISEFKTWFKGSENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
L++N D+ +PC L V+ D SG++ L ++ KK S + R
Sbjct: 79 KGVSHDLQLNLDIVV-QMPCDALHVNIQDASGDRIL--AGELLKKDPTSWKLWTDKRNYD 135
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWALSN 179
+ + RLE A+ D + EVR R+K G L
Sbjct: 136 HEYQTLSR---EEPSRLE-----------AQEEDAHVRHVLGEVRHNPRRKFPKGPKLRR 181
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILA 238
D +D C+ IYG LE NKV G+FH A G + G H+
Sbjct: 182 GDAVDSCR-----------------IYGSLEGNKVQGDFHITARGHGYRDMGGHL----- 219
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
+FN SH I +L+FG H+P ++NPLD E+ YQYF+ VVPT+Y+
Sbjct: 220 -DHSTFNFSHMITELSFGPHYPTLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAA 278
Query: 299 IQSNQFS----------VTEHFRSSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVS 343
+ S ++ T + ++ QG +PG+FF Y++ PI + +EE S
Sbjct: 279 LDSTLYTSKPSHSKNVIFTNQYAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSS 338
Query: 344 FLHFLTNVCAIVGGVFTVSGII 365
FL L + V GV G +
Sbjct: 339 FLSLLIRLVNTVSGVMVTGGWL 360
>gi|238495520|ref|XP_002378996.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus flavus
NRRL3357]
gi|220695646|gb|EED51989.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus flavus
NRRL3357]
Length = 390
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 106/402 (26%), Positives = 168/402 (41%), Gaps = 63/402 (15%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +++ DA+PK D+ + + GG T++ ++ + SE + + V+
Sbjct: 19 GLQGGLKTFDAFPKTKPDYTAPSRRGGQWTVLILLICSVFSISEFKTWFKGSENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
L++N D+ +PC L V+ D SG++ L ++ KK S + R
Sbjct: 79 KGVSHDLQLNLDIVV-QMPCDALHVNIQDASGDRIL--AGELLKKDPTSWKLWTDKRNYD 135
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWALSN 179
+ + RLE A+ D + EVR R+K G L
Sbjct: 136 HEYQTLSR---EEPSRLE-----------AQEEDAHVRHVLGEVRHNPRRKFPKGPKLRR 181
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILA 238
D +D C+ IYG LE NKV G+FH A G + G H+
Sbjct: 182 GDAVDSCR-----------------IYGSLEGNKVQGDFHITARGHGYRDMGGHL----- 219
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
+FN SH I +L+FG H+P ++NPLD E+ YQYF+ VVPT+Y+
Sbjct: 220 -DHSTFNFSHMITELSFGPHYPTLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAA 278
Query: 299 IQSNQFS----------VTEHFRSSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVS 343
+ S ++ T + ++ QG +PG+FF Y++ PI + +EE S
Sbjct: 279 LDSTLYTSKPSHSKNVIFTNQYAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSS 338
Query: 344 FLHFLTNVCAIVGGVFTVSGIIDAFIYHG----QRAIKKKIE 381
FL L + V GV G + G +R KK+ E
Sbjct: 339 FLSLLIRLVNTVSGVMVTGGWLYQIAGWGGELLRRGRKKRSE 380
>gi|358058634|dbj|GAA95597.1| hypothetical protein E5Q_02253 [Mixia osmundae IAM 14324]
Length = 682
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 158/363 (43%), Gaps = 65/363 (17%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ +R+ DA+PK + S + GGV T++ ++ +L+L + E YL + VD
Sbjct: 26 LPPLRTFDAFPKTLPTYRSTSSRGGVYTVLLAVAILVLVWYEATEYLFGEPLYEFSVDKG 85
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
G+ L+IN D+T A+PC L+ +D++ +
Sbjct: 86 IGKMLQINVDMTV-AMPCHYLT-----------VDIRDAV-------------------- 113
Query: 125 APKIDKPLQRHGGRLEHNETYC--GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
G RL ++ + G+ + + E EAY+ +
Sbjct: 114 -----------GDRLHVSDEFVKDGTTFEIGQAQRLVTMAFESDPEAYK----------V 152
Query: 183 IDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ + +R ++ E G C IYG + V KV GN H + S H L
Sbjct: 153 VQEARRPRAFEQTYHIVENGPACRIYGTMAVKKVTGNLHITTLGHGYLSWEHTDHKL--- 209
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
N+SH I++ +FG FPG+ PLD E+ ++QYF+ +V T Y D + ++
Sbjct: 210 ---MNLSHVIHEFSFGPLFPGISQPLDNTLEVTESSFHIFQYFMSIVSTTYVDHHRNVLE 266
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+ Q+SVT+ R++ GR +PG+F YD P+ +T E + FL + IVGGV
Sbjct: 267 TAQYSVTDMSRATVHGR--GVPGIFLKYDPEPMMLTLRERTTTLGQFLIRLAGIVGGVIV 324
Query: 361 VSG 363
SG
Sbjct: 325 CSG 327
>gi|308198100|ref|XP_001386838.2| predicted protein [Scheffersomyces stipitis CBS 6054]
gi|149388859|gb|EAZ62815.2| putative ER to golgi transport [Scheffersomyces stipitis CBS 6054]
Length = 352
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 160/357 (44%), Gaps = 57/357 (15%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD+ K+R+ DA+PK++ R+ GG TL+++ LL+ + E+ +L + + +
Sbjct: 1 MDSFAKKVRTFDAFPKVDSQHTVRSQRGGFSTLMTAFCGLLIVWVEIGGFLGGYVDHQFI 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD +L IN D+ A+PC L + DI+ +++ + + L+ QG
Sbjct: 61 VDNEIKSSLVINVDMLV-AMPCEFLHTNVEDITKDRY------LAGETLNFQGT------ 107
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+ I P + +D+ + +E+ + + +++S
Sbjct: 108 NFITPPTFNI---------------------NNINDKHDTPDLDEIMQDSLRAEFSVSGA 146
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ E C+I+G + V+ V G+FH + HV
Sbjct: 147 RI--------------NEGAPACHIFGSIPVSHVKGDFHITAKGLGYSDRSHV------P 186
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
++ N SH I + +FG+ +P + NPLD E P Y YF KVVPT+Y + G +
Sbjct: 187 LEALNFSHVIQEFSFGDFYPFINNPLDASGKLTEEPLISYSYFAKVVPTLYQRL-GLVVD 245
Query: 301 SNQFSVTE--HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+NQ+S+TE H E R +PG+FF YD PIK+ E + F+ F+ + IV
Sbjct: 246 TNQYSLTENNHVFKLEHKRPTGIPGIFFKYDFEPIKLIIIERRLPFIQFVARLATIV 302
>gi|169778245|ref|XP_001823588.1| COPII-coated vesicle protein (Erv41) [Aspergillus oryzae RIB40]
gi|83772325|dbj|BAE62455.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 390
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 161/382 (42%), Gaps = 59/382 (15%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +++ DA+PK D+ + + GG T++ ++ + SE + + V+
Sbjct: 19 GLQGGLKTFDAFPKTKPDYTAPSRRGGQWTVLILLICSVFSISEFKTWFKGSENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
L++N D+ +PC L V+ D SG++ L ++ KK S + R
Sbjct: 79 KGVSHDLQLNLDIVV-QMPCDALHVNIQDASGDRIL--AGELLKKDPTSWKLWTDKRNYD 135
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWALSN 179
+ + RLE A+ D + EVR R+K G L
Sbjct: 136 HEYQTLSR---EEPSRLE-----------AQEEDAHVRHVLGEVRHNPRRKFPKGPKLRR 181
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILA 238
D +D C+ IYG LE NKV G+FH A G + G H+
Sbjct: 182 GDAVDSCR-----------------IYGSLEGNKVQGDFHITARGHGYRDMGGHL----- 219
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
+FN SH I +L+FG H+P ++NPLD E+ YQYF+ VVPT+Y+
Sbjct: 220 -DHSTFNFSHMITELSFGTHYPTLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAA 278
Query: 299 IQSNQFS----------VTEHFRSSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVS 343
+ S ++ T + ++ QG +PG+FF Y++ PI + +EE S
Sbjct: 279 LDSTLYTSKPSHSKNVIFTNQYAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSS 338
Query: 344 FLHFLTNVCAIVGGVFTVSGII 365
FL L + V GV G +
Sbjct: 339 FLSLLIRLVNTVSGVMVTGGWL 360
>gi|326470603|gb|EGD94612.1| COPII-coated vesicle protein [Trichophyton tonsurans CBS 112818]
Length = 399
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 167/391 (42%), Gaps = 66/391 (16%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D I K+++ DA+PK + S + GG+ T+ +I+ +L SEL + V
Sbjct: 18 DGIATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSV 77
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
+ + +++N D T A+PC + ++ D +G+ L G+++
Sbjct: 78 ERGVSQEMQLNID-TVVAMPCDDVRINIQDAAGDHIL-------------AGDLLTQEPT 123
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCC--NNCEEVREAYRK---KGWA 176
A + +R GG E+ E ED + EVR + +K K
Sbjct: 124 SWAAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQAEDLHVEHVLGEVRRSRKKKFPKAPK 183
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHD 235
L D +D C+ ++G LE NKV GN H A G + + G
Sbjct: 184 LKKSDAVDSCR-----------------VFGSLEGNKVQGNLHITARGFGYFEWG----- 221
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
A S N +H I +L+FG H+ ++NPLD + YQY++ VVPT+YT S
Sbjct: 222 -RATNPHSLNFTHLITELSFGPHYGRLLNPLDKTVSSTSINFYKYQYYLSVVPTIYTK-S 279
Query: 296 GH---------------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIK 334
GH T+ +NQ++VT + Q R+ + PG+FF Y++ PI
Sbjct: 280 GHIDPNRRSLPDASTITAKDSKTTVSTNQYAVTS-YSQPIQPRIDSTPGIFFKYNIEPIL 338
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ ++E S L + + +V GV G +
Sbjct: 339 LIVSQERDSLLALMVRLVNVVSGVLVTGGWL 369
>gi|301093181|ref|XP_002997439.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
gi|262110695|gb|EEY68747.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
Length = 278
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/198 (40%), Positives = 112/198 (56%), Gaps = 15/198 (7%)
Query: 188 REGFLQRIKEEE--GE-GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
+E LQ+ +EE GE GC +YG ++V KVAG+ FA H+ + V F +F
Sbjct: 92 KEIMLQKDIQEEPYGENGCRLYGTVQVQKVAGDLSFA-----HEGSLTVFSFFDFL--NF 144
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N SH +N L FG P + PL V Y+YF+ VVP+ Y ++G ++ + Q+
Sbjct: 145 NSSHVVNHLRFGPQIPDMETPLIDVSKILTKNLATYKYFVSVVPSRYVYLNGRSVTTFQY 204
Query: 305 SVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
SVTEH SS Q + PGV F Y+ SPI V + E +S LHFLT+ AIVGGVF V+
Sbjct: 205 SVTEHETSSRGPNGQVSFPGVIFSYEFSPIAVEYIESKLSVLHFLTSTSAIVGGVFAVAR 264
Query: 364 IIDAFIYHGQRAIKKKIE 381
+ID IY ++ KK++
Sbjct: 265 MIDGAIY----SVSKKVD 278
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 47/97 (48%), Gaps = 1/97 (1%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE- 67
R D K E RT GGV+TL+S + + L SEL ++ ++ VDT +
Sbjct: 4 RRFDLNAKGVEGIQERTIGGGVVTLMSCVAVAFLLLSELSVWWTVSVTHRMHVDTDPQDF 63
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI 104
+ I DV+F C +++D D G + + ++ DI
Sbjct: 64 PINIEVDVSFLHEACKEVAMDVSDSKGHKEIMLQKDI 100
>gi|57208595|emb|CAI42844.1| ERGIC and golgi 3 [Homo sapiens]
Length = 156
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 71/157 (45%), Positives = 89/157 (56%), Gaps = 35/157 (22%)
Query: 248 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--------- 298
H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G
Sbjct: 1 HYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEAQQERGRSRG 60
Query: 299 -----------------------IQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPI 333
+++NQFSVT H + + G L Q LPGVF Y+LSP+
Sbjct: 61 GADGGWSQVLALALAQAPLPPQVLRTNQFSVTRHEKVAN-GLLGDQGLPGVFVLYELSPM 119
Query: 334 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IY
Sbjct: 120 MVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIY 156
>gi|336370998|gb|EGN99338.1| hypothetical protein SERLA73DRAFT_108802 [Serpula lacrymans var.
lacrymans S7.3]
gi|336383753|gb|EGO24902.1| hypothetical protein SERLADRAFT_449635 [Serpula lacrymans var.
lacrymans S7.9]
Length = 503
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 149/354 (42%), Gaps = 47/354 (13%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+ DA+PK+ + SR+ S G IT+ + + LL ++ Y+ + + VD+
Sbjct: 15 PLAQFDAFPKLPSTYKSRSESRGFITIFITFLAFLLVLNDFGEYIWGWPDYEFSVDSQSN 74
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ IN D+ +PC +LSVD D+ G++ K G + + Q
Sbjct: 75 SFMSINVDMAV-NMPCHLLSVDLRDVVGDRLY------LSKGFRRDGTLFDVGQA----- 122
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
L+ H L A + + + +R+ S PD
Sbjct: 123 ---TSLKEHAAMLS-----------ARQALSQSRKSRGLLSSVFRR-----SQPDYRPTY 163
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
+ +G C IYG L+V KV N H + S VHV N+
Sbjct: 164 N--------YQADGSACRIYGTLQVKKVTANLHITTLGHGYTSNVHV------DHTKMNL 209
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I + +FG +FP + PLD + P YQYF+ VVPT + + +NQ+SV
Sbjct: 210 SHVITEFSFGPYFPDITQPLDYSFEVAKDPFVAYQYFLHVVPTTFIAPRSEPLHTNQYSV 269
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
T H+ +G T PG+FF +DL P+ +T + SFL ++GGVFT
Sbjct: 270 T-HYTRVLKGHHGT-PGIFFKFDLDPMVITIHQRTTSFLQLFIRCVGVIGGVFT 321
>gi|225558748|gb|EEH07032.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 401
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 101/386 (26%), Positives = 165/386 (42%), Gaps = 58/386 (15%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I + +R+ DA+PK + + T GG T++ + L +ELR + V V+
Sbjct: 19 GIGSGLRTFDAFPKTKPTYTTSTRRGGQWTIIVFALCAFLSLNELRTWYRGVENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
L++N D+ A+ C L V+ D +G++ L D+ K
Sbjct: 79 KGVSRELQMNLDIVV-AMSCDALRVNVQDAAGDRIL--ASDLLDK--------------- 120
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+P E N G ++ +E+ + E +EA G AL
Sbjct: 121 -------QPTSWAAWNRELNGVTSGGGREYQTLNEEDSSRLME-QEADAHVGHALGEAKR 172
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQR 241
+ K + + E+ + C IYG LE NKV G+FH A G + + G H+
Sbjct: 173 SYKRKFPKGPKLKRGEKADSCRIYGSLEGNKVQGDFHITARGHGYPEFGEHL------SH 226
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVS---- 295
D+FN SH + +L+FG H+P ++NPLD + TP+ +QY++ VVPT+YT
Sbjct: 227 DAFNFSHMVTELSFGPHYPSLLNPLD--KTISVTPARFFKFQYYLSVVPTIYTRAGIVDP 284
Query: 296 ----------------GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTE 339
G TI +NQ++ T + +PG+FF Y++ PI + +E
Sbjct: 285 YNHVLPDPTTIRPSERGSTIFTNQYAATSQSHEVPDPQYH-IPGIFFKYNIEPILLVVSE 343
Query: 340 EHVSFLHFLTNVCAIVGGVFTVSGII 365
E L L + ++ GV G +
Sbjct: 344 ERGGLLALLVRLVNVLAGVVVAGGWL 369
>gi|393231429|gb|EJD39021.1| DUF1692-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 518
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 90/352 (25%), Positives = 156/352 (44%), Gaps = 45/352 (12%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
+DAI ++ DA+PK+ + SR GG++TL + ++ ++L +++ Y+ + +
Sbjct: 14 LDAIA-PLKQFDAFPKVPATYKSRRGEGGLLTLFACLLSVVLVLNDIAEYMWGWPDHEFS 72
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD SR + IN D+ +PC LSVD D G+ RL NV
Sbjct: 73 VDKSRQSYMPINVDLIV-NMPCHYLSVDIRDAVGD------------RLHLSDNV----- 114
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG-WALSN 179
+R G + G + + + E VR++ + +G +++
Sbjct: 115 ------------KREGTVWD-----VGQATRMANHSQTMMSATEVVRQSRKSRGLFSIFQ 157
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
Q K + + G C ++G + V KV N H + S H +
Sbjct: 158 RSSKPQFKPTYNHPNMGKAVGSACRVFGSMFVKKVTANLHITTAGHGYSSNAHTDHTM-- 215
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
N+SH I++ +FG P + PLD + + P YQYF+ VVPT Y + +
Sbjct: 216 ----MNLSHIISEFSFGPFMPDISQPLDNLFEVAKEPFTAYQYFLTVVPTTYVAPRSYPM 271
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
++NQ+SVT + R E GR PG+FF +D+ P+++T + +F + +
Sbjct: 272 RTNQYSVTNYKRVFEHGR--ATPGIFFKFDIDPMQLTVIQRTTTFTQLIIRI 321
>gi|322792513|gb|EFZ16471.1| hypothetical protein SINV_10123 [Solenopsis invicta]
Length = 141
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 52/109 (47%), Positives = 75/109 (68%)
Query: 159 CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 218
CCN CE+V EAYR+K WA +P + QC+ + ++++K +GC IYG++EVN+V G+F
Sbjct: 12 CCNTCEDVWEAYRRKKWAPPDPADVKQCQNDKSMEKLKHAFTQGCQIYGYMEVNRVGGSF 71
Query: 219 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 267
H APG SF + VHVHD+ + FN++HKI L+FG + PG NP+D
Sbjct: 72 HIAPGVSFSVNHVHVHDVQPYTSSHFNMTHKIRHLSFGLNIPGKTNPMD 120
>gi|326928384|ref|XP_003210360.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Meleagris gallopavo]
Length = 321
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G+GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 137 KIPLNNGDGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 184
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L+G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 185 SFGDKLQVQNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVAN 244
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T++CAI+GG FTV+GI+D
Sbjct: 245 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILD 302
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 303 SCIFTASEA-WKKIQLGKM 320
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 48/98 (48%), Gaps = 6/98 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+ D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 36 VVGFDIYRKVPKDLTQPTYTGALISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKD 95
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + +N +++ P L C ++ +D D G H+D
Sbjct: 96 SGGKIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHID 133
>gi|405119686|gb|AFR94458.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Cryptococcus neoformans var. grubii H99]
Length = 431
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 98/188 (52%), Gaps = 14/188 (7%)
Query: 196 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINK 252
K E+G C IYG +EV KV N H H ++FQ N+SH +++
Sbjct: 202 KVEDGPACRIYGSVEVKKVTANLHIT---------TLGHGYMSFQHTDHHLMNLSHVVHE 252
Query: 253 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 312
+FG FP + PLD E P ++QYF++VVPT Y D S + ++Q++VT++ RS
Sbjct: 253 FSFGPFFPAIAQPLDQSYEITEQPFTIFQYFLRVVPTTYIDASRRKLITSQYAVTDYSRS 312
Query: 313 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 372
E G+ +PG+FF YDL P+ V E S FL + +VGGV+TV+
Sbjct: 313 FEHGK--GVPGLFFKYDLEPMSVVIRERTTSLYQFLIRLAGVVGGVWTVAAFALRVFNRA 370
Query: 373 QRAIKKKI 380
QR + K +
Sbjct: 371 QREVSKAV 378
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 52/89 (58%), Gaps = 1/89 (1%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I+S DA+PK+ + ++ GGV+T V +++ LL ++L YL + VD+ +
Sbjct: 32 IKSFDAFPKVESTYTIKSRRGGVLTAVVGLIIFLLVLNDLGEYLYGAPDYAFQVDSDIQK 91
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ 96
L++N D+T A+PC L++D D G++
Sbjct: 92 DLQLNVDLTV-AMPCRYLTIDLRDAVGDR 119
>gi|224067439|ref|XP_002195791.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Taeniopygia guttata]
Length = 290
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G+GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGDGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L+G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 154 SFGDKLQVHNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T++CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + +N +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|70988875|ref|XP_749289.1| COPII-coated vesicle protein (Erv41) [Aspergillus fumigatus Af293]
gi|66846920|gb|EAL87251.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
fumigatus Af293]
gi|159128703|gb|EDP53817.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
fumigatus A1163]
Length = 379
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 159/371 (42%), Gaps = 58/371 (15%)
Query: 16 KINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDV 75
K + + + GG T++ +V L SE R +L + V+ L++N D+
Sbjct: 14 KTKPSYTAPSPRGGQWTVLVLLVCTFLSISEFRTWLKGTEKQHFSVEKGISHDLQLNLDI 73
Query: 76 TFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI--GAPKIDKPLQ 133
+ C +L V+ D SG++ L + + K+ S ++ R GA + Q
Sbjct: 74 VV-HMSCDMLDVNIQDASGDRILAGQ--LLKREPTSWQLWMDKRNYETYGGAHEYQTLSQ 130
Query: 134 RHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREG 190
H RL E +D + EVR RKK G L D +D C+
Sbjct: 131 EHADRLSEQE-----------ADAHVHHVLGEVRRNPRKKFAKGPKLRRGDAVDSCR--- 176
Query: 191 FLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHK 249
IYG LE NKV G+FH A G +H + H+ + +FN SH
Sbjct: 177 --------------IYGSLEGNKVQGDFHITARGHGYHNNAPHL------EHKTFNFSHM 216
Query: 250 INKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT------DVSGHTIQSNQ 303
I +L+FG H+P ++NPLD T E YQYF+ +VPT+Y+ D + SN+
Sbjct: 217 ITELSFGPHYPTLLNPLDKTIATTEDHYYKYQYFLSIVPTIYSKGNLALDTYANAPPSNR 276
Query: 304 ----FSVTEHFRSSEQGRLQT-----LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
T + + Q + +PG+FF Y++ PI + +EE SFL L +
Sbjct: 277 RGKNLVFTNQYAVTSQSSVIPESPYFIPGLFFKYNIEPILLLISEERTSFLSLLVRLVNT 336
Query: 355 VGGVFTVSGII 365
V GV G +
Sbjct: 337 VSGVMVTGGWL 347
>gi|145524934|ref|XP_001448289.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415833|emb|CAK80892.1| unnamed protein product [Paramecium tetraurelia]
Length = 324
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 111/201 (55%), Gaps = 28/201 (13%)
Query: 203 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISH----------K 249
I G++ VNKV GNFH S H G +H + FQR + ++SH K
Sbjct: 135 VKIAGYIIVNKVPGNFHV----SAHAFGGILHQV--FQRSQISTLDLSHTYQSYSHLVKK 188
Query: 250 INKLAFGEHF-PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFS 305
+ + + F GV+NPLD + + G M+QY+I VVPT Y DVSG N++
Sbjct: 189 DDLVKIKKQFQKGVLNPLDNTKKIAQPQGGTGMMFQYYISVVPTTYIDVSG-----NEYY 243
Query: 306 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
V + +S + + LP V+F YDLSP+ V F + SFLHFL +CAI+GGVFT++ II
Sbjct: 244 VHQFTANSNEVQTDHLPAVYFRYDLSPVTVKFLQYRESFLHFLVQICAILGGVFTIASII 303
Query: 366 DAFIYHGQRAIKKKIEIGKFS 386
D I+ A+ KK E+GK S
Sbjct: 304 DGMIHKSVVALLKKYEMGKLS 324
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 70/111 (63%), Gaps = 5/111 (4%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+ +++R LD Y K+ D T +G +I+++S+IV+++LF +EL+ Y+ +++ VD
Sbjct: 4 GVQSRLRKLDIYRKLPADLTEPTTAGALISVISTIVIVILFTTELQAYIEVDNSSEMFVD 63
Query: 63 TSR-GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
+R GE +R+N D+ F PC ILS+D DI G ++V+ ++R++ Q
Sbjct: 64 INRGGEQIRVNLDIEFHKFPCDILSLDVQDIMGSHVVNVE----EQRMERQ 110
>gi|170108190|ref|XP_001885304.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
gi|164639780|gb|EDR04049.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
Length = 398
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/361 (25%), Positives = 155/361 (42%), Gaps = 47/361 (13%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A+ + DA+PK+ + +RT S G +T+ ++ LL +++ Y+ + + VD
Sbjct: 14 AVPAPLAKFDAFPKLPSTYKTRTESRGFMTIFVILLAFLLMLNDIGEYIWGWPDFEFSVD 73
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
++ L +N D+ +PC +SVD D G+ RL G + R+DG
Sbjct: 74 DNKSSFLDVNVDLVV-NMPCKFISVDLRDAMGD------------RLYLSGGL---RRDG 117
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
E G + E + + V ++ + +G +L
Sbjct: 118 -------------------TEFNVGQATALKEHSE-ALSARQAVSQSRKSRGLF---ANL 154
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+ K + G C ++G L+V +V N H + S HV +
Sbjct: 155 FRRNKSNFKPTYNYQPHGNACRVWGSLQVKRVTANLHITTLGHGYASYEHV------DHN 208
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
N+SH I + +FG HFP + PLD + + YQYF+ VVPT Y +Q++
Sbjct: 209 QMNLSHVITEFSFGPHFPDITQPLDNSFESTDERFVAYQYFLHVVPTTYIAPRSAPLQTH 268
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
Q+SVT + R + Q PG+FF +DL P+ +T + +FL L ++GGVF
Sbjct: 269 QYSVTHYTRVMQHN--QGTPGIFFKFDLDPLAITQHQRTTTFLQLLIRCVGVIGGVFVCM 326
Query: 363 G 363
G
Sbjct: 327 G 327
>gi|392564830|gb|EIW58008.1| DUF1692-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 539
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 153/375 (40%), Gaps = 47/375 (12%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+ + + DA+PK+ + +R+ S G +TL + V LL +++ Y+ + + V
Sbjct: 19 EMVPAPLAQFDAFPKVPSSYKTRSESRGFLTLFVAFVAFLLVLNDIGEYIWGWPDYEFGV 78
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
DT + L IN D+ +PC LSVD D G++ D F R+D
Sbjct: 79 DTDQTNALDINVDMVI-NMPCQFLSVDLRDAVGDRLF--LSDGF-------------RRD 122
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
G K D + + EH E ++S + R A R K PD
Sbjct: 123 GT---KFD--IGQATSLKEHAEALSARQAVSQSRSSRGFFDVLLRRAAVRYKPTYNYQPD 177
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
G C ++G + +V N H + S HV L
Sbjct: 178 ------------------GSACRVFGTITAKRVTANLHITTLGHGYASQTHVDHKL---- 215
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N+SH I + +FG +FP + PLD P YQY++ VVPT Y + +
Sbjct: 216 --MNLSHVITEFSFGPYFPDITQPLDNSFELTSEPFVAYQYYLHVVPTTYIAPRTKPLNT 273
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQ+SVT + R + R PG+FF +DL P+K+T + SF+ ++GGVF
Sbjct: 274 NQYSVTHYTRVLDHHR--GTPGIFFKFDLEPMKLTIHQRTTSFVQLFIRTVGVIGGVFVC 331
Query: 362 SGIIDAFIYHGQRAI 376
G H A+
Sbjct: 332 MGYAVKITGHAVDAV 346
>gi|19112857|ref|NP_596065.1| COPII-coated vesicle component Erv41 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74582843|sp|O94283.1|ERV41_SCHPO RecName: Full=ER-derived vesicles protein 41
gi|3850069|emb|CAA21880.1| COPII-coated vesicle component Erv41 (predicted)
[Schizosaccharomyces pombe]
Length = 333
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 168/365 (46%), Gaps = 73/365 (20%)
Query: 8 IRSLDAYPKINEDFYSRTFS-GGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
IR+ DA+PK ++++ ++ S GG T++ S+++++L FS+ Y+ + E +L + S
Sbjct: 10 IRAFDAFPKFSKEYRRQSSSRGGFFTILLSVLIVVLVFSQCVQYIRGIREQELFIYDSVS 69
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVK----HDIFKKRLDSQGNVIESRQDG 122
E + +N D+T A+PCS L +D +D + + L + + F K + + + +
Sbjct: 70 ELMDLNIDITI-AMPCSNLRIDVVDRTKDLVLATEALTLEEAFIKDMPTSSTIYK----- 123
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
N+ Y G + E +RKK A
Sbjct: 124 -------------------NDRYAGLRWART--------------EKFRKKNNA------ 144
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQR 241
+ G C IYG L VN+V G H APG + +S + H
Sbjct: 145 -------------EPGSGTACRIYGQLVVNRVNGQLHITAPGWGYGRSNIPFH------- 184
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
S N +H I +L+FGE++P +VN LDG +QY++ V+PT Y S + ++
Sbjct: 185 -SLNFTHYIEELSFGEYYPALVNALDGHYGHANDHPFAFQYYLSVLPTSYKS-SFRSFET 242
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQ+S+TE+ + G PG+F YDL P+ V ++H + L + AI GG+ TV
Sbjct: 243 NQYSLTENSVVRQLGFGSLPPGIFIDYDLEPLAVRVVDKHPNVASTLLRILAISGGLITV 302
Query: 362 SGIID 366
+ I+
Sbjct: 303 ASWIE 307
>gi|358390077|gb|EHK39483.1| hypothetical protein TRIATDRAFT_302881 [Trichoderma atroviride IMI
206040]
Length = 372
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 167/378 (44%), Gaps = 51/378 (13%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + ++T GG T+ ++ + ++EL + + V+ G
Sbjct: 21 VSAFDAFPKSKPQYVTQTSGGGKWTVAMLLISSIFMWTELGRWWRGIEAHTFAVERGVGH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
++IN D+ + C L V+ D SG++ L +L + D G K
Sbjct: 81 DMQINLDIVV-KMHCDDLHVNVQDASGDRILAAD------KLAREATTWSQWVDEKGMHK 133
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ G+ E+ + G + S D E V D+I +
Sbjct: 134 L--------GKNENGQLDTGLGW---HSKHDEGFGEEHVH-------------DIIALTQ 169
Query: 188 REGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 244
R R G + C ++G +++NKV G+FH A G + G H+ D F
Sbjct: 170 RRAKWARTPRPRGKPDSCRMFGSMDLNKVQGDFHITARGHGYMGMGQHL------DHDKF 223
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N SH I+++++G ++P +VNPLD + +QY++ VVPTVY + + +NQ+
Sbjct: 224 NFSHIISEMSYGPYYPSLVNPLDRTVNSAIVHFHKFQYYLSVVPTVYL-ANRRIVNTNQY 282
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV------ 358
+VTEH ++ +PG+FF YD+ PI ++ E FL F+ + I GV
Sbjct: 283 AVTEHSKTISD---HQIPGIFFKYDIEPILLSVEESRDGFLSFVIKIVNIFSGVMVAGHW 339
Query: 359 -FTVSGIIDAFIYHGQRA 375
FT+S I I +R+
Sbjct: 340 GFTLSDWIREVIGKRRRS 357
>gi|344301277|gb|EGW31589.1| hypothetical protein SPAPADRAFT_62204 [Spathaspora passalidarum
NRRL Y-27907]
Length = 353
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 163/369 (44%), Gaps = 65/369 (17%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + +R+ DA+PK+N R+ GG+ +L++ I L+ + E+ +L + +
Sbjct: 1 MSNPVRSLRTFDAFPKVNSQNTVRSQRGGLSSLMTYIFGLMFLWVEIGGFLGGYIDRQFS 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKR---LDSQGNVIE 117
VD L IN D+ A+PC + D++ +++L + F+ + + N I
Sbjct: 61 VDDVIKPGLSINIDMIV-AMPCEFIHATVEDVTLDRYLAGETLNFEGMHFFIPASFN-IN 118
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
+ D P++D+ +Q E +R +R +G
Sbjct: 119 NANDAHDTPELDEIMQ------------------------------ESLRAEFRVQG--- 145
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
QR+ E C+I+G + +N+V G+F G D++
Sbjct: 146 ---------------QRVNEN-APACHIFGSIPINQVKGDFRIT------AKGYGYRDVI 183
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
A D N SH I + ++GE +P + NPLD E Y Y KVVPT Y + G
Sbjct: 184 AAPIDKLNFSHVIQEFSYGEFYPFINNPLDATGKVTEEKFQKYMYSAKVVPTSYEKL-GL 242
Query: 298 TIQSNQFSVTEHF----RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
+++NQ+SVTE+ ++S+ G +PG++ YD PIK+ E+ + F+ F+ +
Sbjct: 243 IVETNQYSVTENHQVLQKNSQTGVPIGVPGIYIKYDFEPIKMVIKEKRMPFMQFVAKLAT 302
Query: 354 IVGGVFTVS 362
I GG+ +
Sbjct: 303 IAGGILITA 311
>gi|326479518|gb|EGE03528.1| COPII-coated vesicle protein [Trichophyton equinum CBS 127.97]
Length = 399
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 166/391 (42%), Gaps = 66/391 (16%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D I K+++ DA+PK + S + GG+ T+ +I+ +L SEL + V
Sbjct: 18 DGIATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSV 77
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
+ + +++N D T A+PC + ++ D +G+ L G+++
Sbjct: 78 ERGVSQEMQLNID-TVVAMPCDDVRINIQDAAGDHIL-------------AGDLLTQEPT 123
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCC--NNCEEVREAYRK---KGWA 176
A + +R GG E+ E ED + EVR + +K K
Sbjct: 124 SWAAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQAEDLHVEHVLGEVRRSRKKKFPKAPK 183
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHD 235
L D +D C+ ++G LE NKV GN H A G + + G
Sbjct: 184 LKKSDAVDSCR-----------------VFGSLEGNKVQGNLHITARGFGYFEWG----- 221
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
A S N +H I +L+FG H+ ++NPLD + YQY + VVPT+YT S
Sbjct: 222 -RATNPHSLNFTHLITELSFGPHYGRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYTK-S 279
Query: 296 GH---------------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIK 334
GH T+ +NQ++VT + Q R+ + PG+FF Y++ PI
Sbjct: 280 GHIDPNRRSLPDASTITAKDSKTTVSTNQYAVTS-YSQPIQPRIDSTPGIFFKYNIEPIL 338
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ ++E S L + + +V GV G +
Sbjct: 339 LIVSQERDSLLALMVRLVNVVSGVLVTGGWL 369
>gi|326427137|gb|EGD72707.1| hypothetical protein PTSG_04435 [Salpingoeca sp. ATCC 50818]
Length = 357
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 165/374 (44%), Gaps = 55/374 (14%)
Query: 4 IMNKIRSLDAYPKINED--FYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+ +++SLD + K+ D + SG ++TLV++ ++ +L +SE+ Y + V
Sbjct: 10 LQEQVKSLDVFSKVEPDTGITQSSTSGALVTLVTAAIVCVLVWSEISEYNTLKIKYDYFV 69
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
DT + + D+T A+ C + D +++SGE D
Sbjct: 70 DTDLRRDMNMTVDMTV-AMQCDHIGADYINLSGES-----------------------TD 105
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDC--CNNCEEVREAYRKKGWALSN 179
G K L+ L N+ + S+E ++ ++ +
Sbjct: 106 GSKYLK----LEPAHFELSPNQLEWLEAWAKVKSEEGSRGLDSLSRFLHGSMREPMPTAA 161
Query: 180 PDL---IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
P++ D C+ G L K VA NFH GKS H S H H
Sbjct: 162 PEIDSEPDACRLHGVLPVAK-----------------VAANFHITAGKSVHHSRGHSHVN 204
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
D+ N SH+I++ +F E G + LDG T + P ++QYF++VVP+ +
Sbjct: 205 SMVPPDAVNFSHRIDRFSFSEEPRGAMA-LDGDLRTTDQPRQVFQYFLEVVPSTTQRLGQ 263
Query: 297 -HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+SNQ+SVTE R ++G + +PG++F +D+ I V+ +EEH L +C IV
Sbjct: 264 RQPFRSNQYSVTEQHRVLKEG-ARGIPGIYFKFDIESIGVSVSEEHPPLSRLLIRLCGIV 322
Query: 356 GGVFTVSGIIDAFI 369
GG+ SG++ +FI
Sbjct: 323 GGIVAASGMLHSFI 336
>gi|334311203|ref|XP_001380577.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Monodelphis domestica]
Length = 321
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 76/199 (38%), Positives = 106/199 (53%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I GEGC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 137 KIPLNNGEGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 184
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 185 SFGDTLQVQNIHGAFNALGGADKLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 244
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 245 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 302
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 303 SCIFTASEAW-KKIQLGKM 320
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 49/99 (49%), Gaps = 6/99 (6%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DT 63
++ D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 35 RLTRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEVVNELYVDDPDK 94
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 95 DSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 133
>gi|169860063|ref|XP_001836668.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Coprinopsis cinerea okayama7#130]
gi|116502344|gb|EAU85239.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Coprinopsis cinerea okayama7#130]
Length = 516
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 153/359 (42%), Gaps = 49/359 (13%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
DAI + DA+PK+ + +R+ S G + + I+ LL +++ ++ + + V
Sbjct: 13 DAIPASLTKFDAFPKLPSTYKARSESRGFLMVFVIILAFLLMLNDIGEFIWGWPDFEFGV 72
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D +G TL IN D+T +PC L+VD D G++ G + + Q
Sbjct: 73 DNDKGSTLPINLDMTV-NMPCKYLTVDLRDAMGDRLF------LSNGFRRDGTIFDVGQA 125
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
L+ H L E V ++ + +G+ +
Sbjct: 126 TA--------LKEHAAALSAQE---------------------AVAQSRKSRGFFAT--- 153
Query: 182 LIDQCKREGFLQRIKEE-EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ + K+ F + + C I+G + V KV N H + S HV L
Sbjct: 154 -LFRSKKSKFKPTYNHQADASACRIWGTMYVKKVTANLHVTTLGHGYASYEHVDHHL--- 209
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
N+SH I + +FG HFP +V PLD YQYF+ VVPT Y ++
Sbjct: 210 ---MNLSHVIQEFSFGPHFPEIVQPLDNSFEATHEHFIAYQYFLHVVPTTYVAPRTAPLE 266
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+NQ+SVT + R E R PG+FF ++L P+K+T + + L + ++GGVF
Sbjct: 267 TNQYSVTHYTRVLEHNR--GTPGIFFKFELDPLKITQYQRTTTLLQLMIRCVGVIGGVF 323
>gi|327307836|ref|XP_003238609.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
gi|326458865|gb|EGD84318.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
Length = 399
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 167/391 (42%), Gaps = 66/391 (16%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D I K+++ DA+PK + S + GG+ T+ +I+ +L SEL + V
Sbjct: 18 DGIATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSV 77
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
+ + +++N D T A+PC + ++ D +G+ L G+++
Sbjct: 78 ERGVSQEMQLNID-TVVAMPCDDVRINIQDAAGDHIL-------------AGDLLTQEPT 123
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCC--NNCEEVREAYRK---KGWA 176
A + +R GG E+ + E +ED + EVR + +K K
Sbjct: 124 SWTAWNREMNQRRSGGSPEYQTLNKEDTFRLEEQEEDLHVEHVLGEVRRSRKKKFPKAPK 183
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHD 235
L D +D C+ ++G LE NKV GN H A G + + G +
Sbjct: 184 LKRSDAVDSCR-----------------VFGSLEGNKVQGNLHITARGFGYFEWGRTTNP 226
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
S N +H I +L+FG H+ ++NPLD + YQY + VVPT+YT S
Sbjct: 227 ------HSLNFTHLITELSFGPHYGRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYTK-S 279
Query: 296 GH---------------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIK 334
GH T+ +NQ++VT + Q R+ PG+FF Y++ PI
Sbjct: 280 GHIDPNRRSLPDASTITAKDSKTTVSTNQYAVTS-YSQPIQPRIDATPGIFFKYNIEPIL 338
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ ++E S L + + +V GV G +
Sbjct: 339 LIVSQEWDSLLALMVRLVNVVSGVLVTGGWL 369
>gi|395505103|ref|XP_003756885.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Sarcophilus harrisii]
Length = 290
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 76/199 (38%), Positives = 107/199 (53%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I +GEGC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNDGEGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVQNIHGAFNALGGADKLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|322792514|gb|EFZ16472.1| hypothetical protein SINV_10246 [Solenopsis invicta]
Length = 153
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 91/154 (59%), Gaps = 9/154 (5%)
Query: 5 MNKIRSLDAYPKINE--DFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
M +R LD +PK+ E D RTFSG ++T++S+I+M +LF SE+ YL +L VD
Sbjct: 1 MQMLRQLDVHPKVREEADILVRTFSGAIVTIISTIIMGILFLSEVNYYLTPSMSEELFVD 60
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE--SRQ 120
TSRG LRIN D+ PA+ C AMD +GEQHL ++H+IFK+RLD G IE R
Sbjct: 61 TSRGSKLRINLDIIVPAVSCD----HAMDTTGEQHLHIEHNIFKRRLDLNGKPIEDPQRT 116
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAES 154
+ A + K ++ ET CG CYGA +
Sbjct: 117 NITDAKAVSKTTEKAVEIGSTTET-CGDCYGAAT 149
>gi|156841160|ref|XP_001643955.1| hypothetical protein Kpol_1001p9 [Vanderwaltozyma polyspora DSM
70294]
gi|156114586|gb|EDO16097.1| hypothetical protein Kpol_1001p9 [Vanderwaltozyma polyspora DSM
70294]
Length = 349
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 90/366 (24%), Positives = 181/366 (49%), Gaps = 59/366 (16%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +++ DA+PK E ++ +GG+ ++++ ++LL+ ++E Y + + VD +
Sbjct: 1 MAGLKTFDAFPKTEERHVKKSVNGGLSSILTYFMLLLIAWTEFGSYFGGYIDEQYSVDPT 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKR--LDSQGNVIESRQDG 122
ET++IN D+ + +PC ++ V+AMD + ++ IF+ G + ++ D
Sbjct: 61 IRETVQINMDM-YIKMPCQLIHVNAMDETMDRKFVSNELIFEDMPFFVPYGTKVNNKND- 118
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
I +P +D+ + E + +R+K D
Sbjct: 119 IVSPGLDEII------------------------------GEAIPAEFREK------LDF 142
Query: 183 IDQCKREGF-LQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQ 240
Q +G L ++ +GC+IYG +++N+VAG F A G + +G D + F
Sbjct: 143 KSQVDADGNPLFKV-----DGCHIYGSVKLNRVAGELQFTAKGWGYRDNGRAPLDQIDF- 196
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPS-GMYQYFIKVVPTVYTDVSGHTI 299
+H IN+ +FG+ +P + NPLDG ++ S Y Y VVPT++ + G +
Sbjct: 197 ------NHVINEFSFGDFYPYIDNPLDGTAKIEKQKSISRYIYSTSVVPTIFQKL-GAEV 249
Query: 300 QSNQFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+NQ+S+ E+ + + G+++ ++PG+FF YD P+ + +++ +SF+ F+ + AI+
Sbjct: 250 DTNQYSLAEYHTAPKDGKIKLTTSIPGIFFRYDFEPLSIVISDKRLSFVQFIVRLVAILS 309
Query: 357 GVFTVS 362
+ ++
Sbjct: 310 FILYMA 315
>gi|390594538|gb|EIN03948.1| DUF1692-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 551
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 148/358 (41%), Gaps = 49/358 (13%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
++ DA+PK+ + +R+ S G+ T + + + L ++L ++ + + VD
Sbjct: 22 SLKHFDAFPKLPASYKARSESRGLFTALVAFIAFFLVLNDLGEFIWGWPDYEFSVDNEAR 81
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ IN D+ +PC LSVD D G+ RL
Sbjct: 82 SHMNINVDMVV-KMPCQYLSVDLRDAVGD------------RL----------------- 111
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ +R G + + + A+ S + R + D++ +
Sbjct: 112 YLSSAFRRDGTLFDIGQATALKEHAAQLSARKAVAQSRQSRGLF----------DVLLRR 161
Query: 187 KREGFLQRIK-EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
+G+ + +G C IYG L+V KV N H + S HV D N
Sbjct: 162 SGQGYKPTYNHQPDGGACRIYGTLQVKKVTANLHITTAGHGYASVQHV------PHDQMN 215
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 305
+SH I + +FG +FP + PLD P YQYF+ VVPT Y +++ Q+S
Sbjct: 216 LSHVITEFSFGPYFPDITQPLDDSFEITTDPFIAYQYFLHVVPTTYVAPRSSPLKTAQYS 275
Query: 306 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
VT + R E GR PG+FF ++L P+ +T + + V +VGG+F +G
Sbjct: 276 VTHYTRVLEHGR--GTPGIFFKFELDPLSITVNQRTTTLAQLFIRVIGVVGGIFVCAG 331
>gi|387015778|gb|AFJ50008.1| ER Golgi intermediate [Crotalus adamanteus]
Length = 290
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 106/199 (53%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G+GC G +NKV GNFH + H A Q + +++H I+KL
Sbjct: 106 KIPLNNGDGCRFEGHFSINKVPGNFH-----------ISTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 154 SFGDKLQVPNIHGAFNALGGTDRLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEA-WKKIQLGKM 289
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D TF+G +I++ +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTFTGAIISICCCFFILFLFLSELTGFIATEIVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + +N +++ P+L C ++ +D D G H+D
Sbjct: 66 GGKIEVNLNISLPSLHCELIGLDIQDEMGRHEVGHID 102
>gi|440632946|gb|ELR02865.1| hypothetical protein GMDG_05797 [Geomyces destructans 20631-21]
Length = 384
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 105/366 (28%), Positives = 166/366 (45%), Gaps = 53/366 (14%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK ++ ++T GG T++ I+ LL SEL + + V+
Sbjct: 21 VSAFDAFPKSKPEYVTKTSGGGKWTVLMLIISALLTMSELGRWWRGNEDHTFEVEKFVSR 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L++N D+ A+ C + ++ D SG++ L K + K L + + +
Sbjct: 81 DLQVNLDMVV-AMRCPDIHINVQDASGDRILASK--VLKTELTNWLQWVNMKG------- 130
Query: 128 IDKPLQRHGGRLEHNETYCGSCY---GAESSDEDCCNNCEEVRE----AYRKKGWALSNP 180
+H +L HN GS G ES D E V + A R WA P
Sbjct: 131 ------QH--QLGHNAD--GSVITDEGWESDGHDEGFEEEHVHDIIYTAMRSNKWA-KTP 179
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAF 239
+ + +G+ C I+G + +NKV G+FH A G + ++ H
Sbjct: 180 KIKGHPR-----------DGDSCRIFGSMMLNKVQGDFHITARGHGYQEAFGTKH----L 224
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-- 297
SFN SH +++ +FG +P ++NPLD T QYF+ VVPT+YT S +
Sbjct: 225 DHSSFNFSHIVSEFSFGAFYPKLINPLDQTITTTANQFYKSQYFMSVVPTIYTVSSPNPL 284
Query: 298 ----TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
TI +NQ++VT R + +T+PG+FF YD+ P+ +T E SFL F V
Sbjct: 285 SSKSTIFTNQYAVTHEDRKINE---RTVPGIFFKYDIEPLMLTIEERRDSFLRFAIKVVN 341
Query: 354 IVGGVF 359
I+ GV
Sbjct: 342 ILSGVL 347
>gi|345320110|ref|XP_001521132.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like, partial [Ornithorhynchus anatinus]
Length = 283
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G+GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 99 KIPLNNGDGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 146
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P Y Y +K+VPTVY D +G S Q++V
Sbjct: 147 SFGDKLQVQNIHGAFNALGGADKRSSNPLASYDYILKIVPTVYEDKNGKQRYSYQYTVAN 206
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 207 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 264
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 265 SCIFTASEAW-KKIQLGKM 282
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSRGE 67
D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D G
Sbjct: 1 FDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKDSGG 60
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
+ ++ +++ P L C ++ +D D G H+D
Sbjct: 61 KIDVSLNISLPNLHCDLVGLDIQDEMGRHEVGHID 95
>gi|340514865|gb|EGR45124.1| predicted protein [Trichoderma reesei QM6a]
Length = 372
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 156/357 (43%), Gaps = 44/357 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + ++T GG T+ +V + +SE+ + V+ G
Sbjct: 21 VSAFDAFPKAKPQYVTKTAGGGKWTVAMLLVSSIFLWSEIGRWWRGSEHHTFAVEKGIGH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
++IN D+ + C L V+ D SG++ L +L E D G +
Sbjct: 81 DMQINLDIVVK-MSCGDLHVNVQDASGDRILA------GDKLTRDATNWEQWVDAKGVHR 133
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ K G+L+ + G+ D E V D++ +
Sbjct: 134 LGK---NENGKLDTGAGWHGA--------HDEGFGEEHVH-------------DIVSLSR 169
Query: 188 REGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 244
++ + + G + C +YG L++NKV G+FH A G + G H+ D F
Sbjct: 170 KKAKWAKTPKPRGRTDSCRMYGSLDLNKVQGDFHITARGHGYSGIGGHL------DHDKF 223
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N SH I++L++G +P ++NPLD T +QY++ VVPTVY S + +NQ+
Sbjct: 224 NFSHIISELSYGPFYPSLINPLDRTVNTAIVHFHKFQYYLSVVPTVYI-ASHRIVNTNQY 282
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
+VTE ++ +PG+FF YD+ PI ++ E F FL + + GV
Sbjct: 283 AVTEQSKTISD---HQVPGIFFKYDIEPIMLSVEETRDGFFAFLLKLVNVFSGVMVA 336
>gi|255944653|ref|XP_002563094.1| Pc20g05600 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211587829|emb|CAP85889.1| Pc20g05600 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 396
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 165/383 (43%), Gaps = 58/383 (15%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ +++ DA+PK + + T SGG T++ I+ + +SE + + V+
Sbjct: 20 LAALKTFDAFPKTKAAYTTPTRSGGQWTVLILIICTIFSWSEFKTWWRGTENYHFSVEKG 79
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLD---VKHDIFKKRLDSQGNVIESRQD 121
L++N D+ +PC L V+ D +G++ L +K D L Q E+ D
Sbjct: 80 VSHELQLNLDMVV-HMPCDQLRVNIQDAAGDRILAGELLKRDDTNWLLWMQKRNHET-SD 137
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
G+ + L H E + +D + EVR R+K P
Sbjct: 138 GVHEYQT----------LSHEE---ADRLAEQEADAHVGHVLGEVRRNPRRK--FEKGPR 182
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQ 240
L R G + + C IYG LE NKV G+FH A G + ++ H+
Sbjct: 183 L-----RRGVV-------ADACRIYGSLEGNKVQGDFHITARGHGYRENAPHL------D 224
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG---- 296
SF+ SH I +L+FG H+P + NPLD E +QYF+ VVPT+Y+ G
Sbjct: 225 HSSFDFSHMITELSFGPHYPTLQNPLDKTIAETEEHYYKFQYFLSVVPTLYSRGKGALDA 284
Query: 297 --------------HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 342
T+ +NQ++ T + + + +PG+FF Y++ PI + +EE
Sbjct: 285 YTRSPDAAASRYGRDTVFTNQYAATSQSSAIPESPM-VVPGIFFKYNIEPILLLVSEERA 343
Query: 343 SFLHFLTNVCAIVGGVFTVSGII 365
SFL L V + GV G +
Sbjct: 344 SFLSLLVRVINTISGVLVTGGWL 366
>gi|425765498|gb|EKV04175.1| COPII-coated vesicle protein (Erv41), putative [Penicillium
digitatum PHI26]
gi|425783511|gb|EKV21358.1| COPII-coated vesicle protein (Erv41), putative [Penicillium
digitatum Pd1]
Length = 396
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 164/383 (42%), Gaps = 58/383 (15%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ +++ DA+PK + + T SGG T++ ++ + +SEL+ + V+
Sbjct: 20 LTALKTFDAFPKTKASYTTPTRSGGQWTVLILLICTVFSWSELKTWWRGTENYHFSVEKG 79
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLD---VKHDIFKKRLDSQGNVIESRQD 121
L++N D+ +PC L V+ D +G++ L +K D L Q E+
Sbjct: 80 VSHELQLNLDMVV-HMPCDQLRVNIQDAAGDRILAGELLKRDDTNWLLWMQKRNYETND- 137
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
GA + RL E +D + EVR R+K P
Sbjct: 138 --GAHEYQTLSHEESDRLAEQE-----------ADAHVGHVLGEVRHNPRRK--FPKGPR 182
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQ 240
+ R G + + C IYG LE NKV G+FH A G + ++ H+
Sbjct: 183 M-----RRGVVP-------DACRIYGSLEGNKVQGDFHITARGHGYRENAPHL------D 224
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY--------- 291
+FN SH I +L+FG H+P + NPLD E +QYF+ +VPT+Y
Sbjct: 225 HSAFNFSHMITELSFGPHYPTLQNPLDKTIAETEEHYYKFQYFLSIVPTLYSRGKSALDL 284
Query: 292 ------TDVSGH---TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 342
T + H T+ +NQ++ T + + + +PG+FF YD+ PI + +EE
Sbjct: 285 YTRSPETLAARHGRNTVFTNQYAATSQSSAIPESPM-VVPGIFFKYDIEPILLLVSEERA 343
Query: 343 SFLHFLTNVCAIVGGVFTVSGII 365
FL L V V GV G +
Sbjct: 344 GFLSLLIRVINTVSGVLVTGGWL 366
>gi|449272958|gb|EMC82607.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Columba livia]
Length = 297
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 107/200 (53%), Gaps = 22/200 (11%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
+I G+GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 112 MKIPLNNGDGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 159
Query: 253 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
L+FG+ G N L+G P + Y +K+VPTVY D+ G S Q++V
Sbjct: 160 LSFGDKLQVHNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMGGKQRYSYQYTVA 219
Query: 308 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
E+ S GR+ +P ++F YDLSPI V +TE F+T++CAI+GG FTV+GI+
Sbjct: 220 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGIL 277
Query: 366 DAFIYHGQRAIKKKIEIGKF 385
D+ I+ A KKI++GK
Sbjct: 278 DSCIFTASEA-WKKIQLGKM 296
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSRGE 67
D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D G
Sbjct: 15 FDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKDSGG 74
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
+ +N +++ P L C ++ +D D G H+D
Sbjct: 75 KIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHID 109
>gi|213408569|ref|XP_002175055.1| COPII-coated vesicle component Erv41 [Schizosaccharomyces japonicus
yFS275]
gi|212003102|gb|EEB08762.1| COPII-coated vesicle component Erv41 [Schizosaccharomyces japonicus
yFS275]
Length = 331
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 164/369 (44%), Gaps = 62/369 (16%)
Query: 2 DAIMNKIRSLDAYPKINEDFY-SRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
D I IR DA+PK+ + + R+ GG+++++ +I + + E Y E +
Sbjct: 4 DKIPEGIRVFDAFPKVAKTYRKQRSSQGGLLSIILAICITCISIMEFFFYFQGTREQQFF 63
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
V + E + IN D+T A+PC L VD +D Q +D H + Q +E +
Sbjct: 64 VYETISEHMNINLDMTI-AMPCKFLQVDVLD----QTMD--HVFATEVFTKQETTVEDMR 116
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+PL T GS D + R+ + KK L P
Sbjct: 117 H--------EPLP---------VTSTGSF--------DAADLRRTRRKKFNKKSKTL--P 149
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAF 239
D G C YG + V++ G H APG + S + +
Sbjct: 150 D-----------------GGSACRFYGAVTVHRTQGLLHITAPGWGYGMSNIPL------ 186
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
++ N +H I++L+FG+++P +VN LDG + + +QY+ ++PT YT + +
Sbjct: 187 --NALNFTHAIDELSFGDYYPSLVNALDGSYGFTDEHAFAFQYYTSIIPTTYTS-TFRNV 243
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
Q+NQ++VTE+ + G PG+F YD+ P+ + E + S + + + AI GG+
Sbjct: 244 QTNQYAVTENSVRRQTGFRSDPPGIFISYDIEPLGIHIRETYPSLGNTILRILAISGGLV 303
Query: 360 TVSGIIDAF 368
TV+ ++ F
Sbjct: 304 TVTTWVERF 312
>gi|73953406|ref|XP_852891.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 isoform 1 [Canis lupus familiaris]
Length = 290
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/199 (38%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
RI G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 RIPVNNGAGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|353236810|emb|CCA68797.1| related to ERV41-component of copii vesicles involved in transport
between the ER and golgi complex [Piriformospora indica
DSM 11827]
Length = 559
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/361 (26%), Positives = 154/361 (42%), Gaps = 51/361 (14%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ I+ DA+PK+ + SRT GG +TL + LL +++ ++ ++ + +DT
Sbjct: 44 IAPIKQFDAFPKLPASYKSRTKFGGFMTLFVVTLSFLLVLNDIGEFIWGWSDYEFAIDTD 103
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+ L IN D+ PCSILSVD D G+ RL ++
Sbjct: 104 QHRLLEINVDLVV-NTPCSILSVDLRDAVGD------------RLHLSDTIV-------- 142
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
R G + ++ + E + + E+ A R+ S +
Sbjct: 143 ---------RDGTLFDISQAH-------EFKEHQRVLSTREIVAASRRSRGFFS----MF 182
Query: 185 QCKREGFLQRIKE-EEGEGCNIYGFLEVNKVAGNFHFAP-GKSFHQSGVHVHDILAFQRD 242
+ R F +G C +YG V K+ GNFH G + H D
Sbjct: 183 KASRPQFRPTWNHTPDGGACRVYGSFAVRKLTGNFHITTLGHGYGGHNAHA------SHD 236
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
+ N+SH I + +FG ++P +V PLD T + +QYFI VVPT Y + ++
Sbjct: 237 NINMSHVITEFSFGPYYPDIVQPLDYSFETTQEHFVAFQYFITVVPTTYVAPRSKPLHTH 296
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
Q+SVT + + E Q PG+FF YD+ P+ + + + FL + ++GGV+
Sbjct: 297 QYSVTHYVK--ELPHSQGTPGIFFKYDIDPVALEIHQRTTTLTQFLVRIVGVIGGVWVCF 354
Query: 363 G 363
G
Sbjct: 355 G 355
>gi|389640739|ref|XP_003718002.1| hypothetical protein MGG_00949 [Magnaporthe oryzae 70-15]
gi|351640555|gb|EHA48418.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Magnaporthe oryzae 70-15]
gi|440464580|gb|ELQ33987.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Magnaporthe oryzae Y34]
gi|440481695|gb|ELQ62250.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Magnaporthe oryzae P131]
Length = 376
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 171/382 (44%), Gaps = 55/382 (14%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + +RT GG T+ +V +L +SEL + V V+ G+
Sbjct: 22 VSAFDAFPKSKPQYVTRTSGGGKWTVAMLLVSAILTWSELARWWRGVETHTFAVEKGVGQ 81
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+++IN D T + C + V+ D +G++ + RL DG G +
Sbjct: 82 SMQINMD-TVVHMRCQDIHVNVQDAAGDRIMAAA------RLKMDDTTWAQWVDGSGVHR 134
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ G +H + G + + ++ +K+ P
Sbjct: 135 L--------GHDQHGKVVTGEGHEEGFG----EEHIHDIVALGKKRARWSKTP------- 175
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
R+ + C I+G L++NKV G+FH A G + + G H+ +FN
Sbjct: 176 ------RLWGATPDSCRIFGSLDLNKVQGDFHITARGHGYIEFGDHL------DHSAFNF 223
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG------HTIQ 300
SH +N+ +FG+ +P +VNPLD T E +QYF+ VVPT+Y+ S TI
Sbjct: 224 SHIVNEFSFGDFYPSLVNPLDKTVNTCEKNFHKFQYFLSVVPTLYSVKSSTGAFGYSTIF 283
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-- 358
+NQ++VTE +SSE + +PG+FF YD+ PI + E + L FL V I+ G
Sbjct: 284 TNQYAVTE--QSSEISEMN-VPGIFFKYDIEPILLDIEESRDTILVFLIKVINILSGAMV 340
Query: 359 -----FTVSGIIDAFIYHGQRA 375
FT+S I + +RA
Sbjct: 341 AGHWGFTMSEWIKEVLGKRRRA 362
>gi|114603487|ref|XP_001145588.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Pan troglodytes]
Length = 424
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 239 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 286
Query: 253 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
L+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 287 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 346
Query: 308 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 347 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 404
Query: 366 DAFIYHGQRAIKKKIEIGKF 385
D+ I+ A KKI++GK
Sbjct: 405 DSCIFTASEA-WKKIQLGKM 423
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 51/102 (50%), Gaps = 7/102 (6%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV-- 61
I+ +R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V
Sbjct: 136 ILTPVR-FDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDD 194
Query: 62 -DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
D G + ++ +++ P L C ++ +D D G H+D
Sbjct: 195 PDKDSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 236
>gi|109079798|ref|XP_001099287.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Macaca mulatta]
Length = 379
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 195 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 242
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 243 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 302
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 303 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 360
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 361 SCIFTASEAW-KKIQLGKM 378
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 95 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 154
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 155 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 191
>gi|303313533|ref|XP_003066778.1| hypothetical protein CPC735_060030 [Coccidioides posadasii C735
delta SOWgp]
gi|240106440|gb|EER24633.1| hypothetical protein CPC735_060030 [Coccidioides posadasii C735
delta SOWgp]
gi|320036232|gb|EFW18171.1| COPII-coated vesicle protein [Coccidioides posadasii str. Silveira]
Length = 399
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 169/393 (43%), Gaps = 66/393 (16%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I +R+ DA+PK + + + GG T+ + + +L SEL + V+
Sbjct: 19 GIAAGLRTFDAFPKTKPTYTTASRRGGQWTVFTFLFCGILVLSELISWHGGTENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
E +++N D+ +PC L V+ D +G+ L + + K E G
Sbjct: 79 KGVSEEIQLNLDLVV-RMPCDSLRVNMQDAAGDFILAAEL-LHKTPTSWDAWNREMNFAG 136
Query: 123 IGAPKIDKPLQRHGG-RLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWALS 178
G + + L RL E D+ + EVR +++++ G L
Sbjct: 137 KGGSRQYQTLSAEDNVRLAEQE-----------EDQHVGHVLGEVRRSWKRQFPPGPKLK 185
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSG--VHVHD 235
D++D C+ IYG LE NKV GNFH A G ++ V+V+D
Sbjct: 186 RKDVVDSCR-----------------IYGSLEGNKVQGNFHITAKGLGYYDPTGMVNVND 228
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ N +H I +L+FG H+P ++NPLD + YQY++ VVPT+YT
Sbjct: 229 M--------NFTHLITELSFGPHYPTLLNPLDKTVAATKDKFYKYQYYLSVVPTIYTRAG 280
Query: 296 G--------------------HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKV 335
+TI +NQ++VT R+ QG ++PG+FF +D+ PI +
Sbjct: 281 TVDPYSQRLPDPSTITPSQRKNTIFTNQYAVTSQSRTISQGPY-SVPGIFFKFDIEPILL 339
Query: 336 TFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 368
+EE S L L + +V GV G + F
Sbjct: 340 VVSEERGSLLALLVRLVNVVSGVLVAGGWVFNF 372
>gi|345325542|ref|XP_001508860.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Ornithorhynchus anatinus]
Length = 372
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 142/311 (45%), Gaps = 47/311 (15%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M L E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMAFLTVMEFLVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D++ ++ S +
Sbjct: 70 FASKLRINIDITV-AMKCQYIGADVLDLAE-------------------TMVASADGLVY 109
Query: 125 APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
P I P QR R+ + E S +D + A++ AL P
Sbjct: 110 EPVIFDLSPQQREWQRMLQ---MIQNRLQEEHSLQDVI-----FKSAFKSASTAL--PPR 159
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
D + + + C I+G L VNKVAGNFH GK+ H H D
Sbjct: 160 GD----------LSLQPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHD 209
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQ 300
S+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 210 SYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISAET-- 267
Query: 301 SNQFSVTEHFR 311
+QFSVTE R
Sbjct: 268 -HQFSVTERER 277
>gi|6330243|dbj|BAA86495.1| KIAA1181 protein [Homo sapiens]
Length = 336
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 152 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 199
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 200 SFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 259
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 260 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 317
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 318 SCIFTASEAW-KKIQLGKM 335
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 52 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 111
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 112 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 148
>gi|194382656|dbj|BAG64498.1| unnamed protein product [Homo sapiens]
Length = 235
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 51 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 98
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 99 SFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 158
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 159 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 216
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 217 SCIFTASEAW-KKIQLGKM 234
>gi|322710423|gb|EFZ01998.1| COPII-coated vesicle protein (Erv41), putative [Metarhizium
anisopliae ARSEF 23]
Length = 372
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 161/357 (45%), Gaps = 44/357 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK ++ +RT GG T+ ++V + L ++E+ + V+
Sbjct: 21 VSAFDAFPKSKPEYVTRTEGGGKWTVAMAVVSIFLLWAEIARWWRGAESHTFAVEKGVSH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+++IN D T + C L ++ D +G++ L K +D Q G+
Sbjct: 81 SMQINLD-TVILMKCGDLHINVQDAAGDRILAGS----KLNMDETSWSQWVNQKGV---- 131
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
GR G+ G ++ D++ E V D++ +
Sbjct: 132 ------HKLGRDSEGRVITGA--GWQNLDDEGFGE-EHVH-------------DIVALGQ 169
Query: 188 REGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 244
R + +G + C IYG L++NKV G+FH A G + G H+ + F
Sbjct: 170 RRAKWAKTPRVKGPPDSCRIYGSLDLNKVQGDFHITARGHGYRGQGSHL------DHEQF 223
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N SH I++L+FG ++P +VNPLD E +QY++ VVPT Y+ V +I +NQ+
Sbjct: 224 NFSHIISELSFGSYYPSLVNPLDRTLNIAENHFHKFQYYVSVVPTRYS-VGSSSIFTNQY 282
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
+VTE + + +PGVF YD+ PI ++ E+ L F+ + ++ GV
Sbjct: 283 AVTEQSKGVSE---YNVPGVFVKYDIEPILLSVNEDRDGILMFVVKLINVLSGVLVA 336
>gi|148678795|gb|EDL10742.1| ERGIC and golgi 2, isoform CRA_b [Mus musculus]
Length = 310
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 140/309 (45%), Gaps = 49/309 (15%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 18 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 77
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D+ + + DG+
Sbjct: 78 FSSKLRINIDITV-AMKCHYVGADVLDL--------------------AETMVASADGLA 116
Query: 125 -APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
P + P QR R+ S E S +D + A++ AL P
Sbjct: 117 YEPALFDLSPQQREWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PP 166
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
D + C I+G L VNKVAGNFH GK+ H H
Sbjct: 167 REDDSSLTP----------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNH 216
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTI 299
DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 217 DSYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT- 275
Query: 300 QSNQFSVTE 308
+QFSVTE
Sbjct: 276 --HQFSVTE 282
>gi|66773206|ref|NP_080631.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
isoform 2 [Mus musculus]
gi|12854944|dbj|BAB30175.1| unnamed protein product [Mus musculus]
Length = 302
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 140/309 (45%), Gaps = 49/309 (15%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LSLVKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
LRIN D+T A+ C + D +D+ + + DG+
Sbjct: 70 FSSKLRINIDITV-AMKCHYVGADVLDL--------------------AETMVASADGLA 108
Query: 125 -APKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
P + P QR R+ S E S +D + A++ AL P
Sbjct: 109 YEPALFDLSPQQREWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL--PP 158
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
D + C I+G L VNKVAGNFH GK+ H H
Sbjct: 159 REDDSSLTP----------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNH 208
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTI 299
DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S T
Sbjct: 209 DSYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT- 267
Query: 300 QSNQFSVTE 308
+QFSVTE
Sbjct: 268 --HQFSVTE 274
>gi|410949214|ref|XP_003981318.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Felis catus]
Length = 398
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 214 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 261
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 262 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 321
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 322 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 379
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 380 SCIFTASEAW-KKIQLGKM 397
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSRGE 67
D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D G
Sbjct: 116 FDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDSGG 175
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
+ ++ +++ P L C ++ +D D G H+D
Sbjct: 176 KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 210
>gi|313247758|emb|CBY15879.1| unnamed protein product [Oikopleura dioica]
Length = 285
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/191 (40%), Positives = 102/191 (53%), Gaps = 22/191 (11%)
Query: 196 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 255
K ++ GC +G VNKV GNFH + S Q H HD FN HKINKL F
Sbjct: 108 KNQQKSGCRFHGEFYVNKVPGNFHVSTHASKKQP--HKHD--------FN--HKINKLFF 155
Query: 256 GE-----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 310
GE PG L G T E PS Y Y +K+VPTV+ D T Q++VT
Sbjct: 156 GEDLSALELPGNQTSLAGQATTNE-PSLSYDYTLKIVPTVHNDNKRRTTFGYQYTVTSKT 214
Query: 311 RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
+ +G P ++F Y+++PI V +T + F H LT +CAIVGG FTV+G+ID+ I+
Sbjct: 215 FKNTRGT----PAIWFRYEIAPITVKYTHKKKPFYHLLTTICAIVGGTFTVAGMIDSMIF 270
Query: 371 HGQRAIKKKIE 381
+A+KK E
Sbjct: 271 SAHQAVKKASE 281
>gi|115452719|ref|NP_001049960.1| Os03g0321400 [Oryza sativa Japonica Group]
gi|113548431|dbj|BAF11874.1| Os03g0321400, partial [Oryza sativa Japonica Group]
Length = 83
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 59/82 (71%), Positives = 70/82 (85%), Gaps = 1/82 (1%)
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
QFSVTEHFR + G + PGV+FFY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+
Sbjct: 1 QFSVTEHFREA-IGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVA 59
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
GIID+F+YHG RAIKKK+EIGK
Sbjct: 60 GIIDSFVYHGHRAIKKKMEIGK 81
>gi|395817675|ref|XP_003782285.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Otolemur garnettii]
Length = 356
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 172 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 219
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 220 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVAN 279
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 280 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 337
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 338 SCIFTASEAW-KKIQLGKM 355
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 72 RRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 131
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 132 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 168
>gi|392351111|ref|XP_001066818.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Rattus norvegicus]
Length = 497
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 313 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 360
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 361 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 420
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 421 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 478
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 479 SCIFTASEAW-KKIQLGKI 496
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 49/99 (49%), Gaps = 6/99 (6%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DT 63
K+ D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 211 KVERFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDK 270
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 271 DSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 309
>gi|417409674|gb|JAA51332.1| Putative endoplasmic reticulum-golgi intermediate compartment
protein, partial [Desmodus rotundus]
Length = 318
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 134 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 181
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 182 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 241
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 242 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 299
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 300 SCIFTASEAW-KKIQLGKM 317
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 34 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 93
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 94 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 130
>gi|50510831|dbj|BAD32401.1| mKIAA1181 protein [Mus musculus]
Length = 320
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/198 (37%), Positives = 105/198 (53%), Gaps = 22/198 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 136 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHTIHKL 183
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 184 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 243
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 244 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 301
Query: 367 AFIYHGQRAIKKKIEIGK 384
+ I+ A KKI++GK
Sbjct: 302 SCIFTASEAW-KKIQLGK 318
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 36 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 95
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 96 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 132
>gi|390459630|ref|XP_002744599.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Callithrix jacchus]
Length = 342
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 157 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 204
Query: 253 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 205 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVA 264
Query: 308 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 265 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 322
Query: 366 DAFIYHGQRAIKKKIEIGKF 385
D+ I+ A KKI++GK
Sbjct: 323 DSCIFTASEA-WKKIQLGKM 341
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 6/98 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+ D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 57 LHRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 116
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 117 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 154
>gi|119191516|ref|XP_001246364.1| hypothetical protein CIMG_00135 [Coccidioides immitis RS]
gi|392864406|gb|EAS34753.2| COPII-coated vesicle protein [Coccidioides immitis RS]
Length = 399
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/400 (26%), Positives = 170/400 (42%), Gaps = 80/400 (20%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I +R+ DA+PK + + + GG T+ + +L SEL + V+
Sbjct: 19 GIAAGLRTFDAFPKTKPTYTTASRRGGQWTVFIFLFCGMLVLSELISWHGGTENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGE--------QHLDVKHDIFKKRLDSQGN 114
E +++N D+ +PC L V+ D +G+ D + + ++ G
Sbjct: 79 KGVSEEIQLNLDLVV-RMPCDSLRVNMQDAAGDFILAAELLHKTPTSWDAWNREMNFAGK 137
Query: 115 VIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK- 173
SRQ + + D RL E D+ + EVR +++++
Sbjct: 138 G-GSRQYQTLSAEDDV-------RLAEQE-----------EDQHVGHVLGEVRRSWKRQF 178
Query: 174 --GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSG 230
G L D++D C+ IYG LE NKV GNFH A G ++
Sbjct: 179 PPGPKLKRKDVVDSCR-----------------IYGSLEGNKVQGNFHITAKGLGYYDPT 221
Query: 231 --VHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVP 288
V+V+D+ N +H I +L+FG H+P ++NPLD + YQY++ VVP
Sbjct: 222 GMVNVNDM--------NFTHLITELSFGPHYPTLLNPLDKTVAATKDKFYKYQYYLSVVP 273
Query: 289 TVYTDVS--------------------GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 328
T+YT +TI +NQ++VT R+ QG ++PG+FF +
Sbjct: 274 TIYTRAGTVDPYSQRLPDPSTITVSQRKNTIFTNQYAVTSQSRTISQGPY-SVPGIFFKF 332
Query: 329 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 368
D+ PI + +EE S L L + +V GV G + F
Sbjct: 333 DIEPILLVVSEERGSLLALLVRLVNVVSGVLVAGGWVFNF 372
>gi|123408947|ref|XP_001303296.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121884664|gb|EAX90366.1| hypothetical protein TVAG_036780 [Trichomonas vaginalis G3]
Length = 364
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 107/395 (27%), Positives = 174/395 (44%), Gaps = 62/395 (15%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR- 65
+ LD + K++ + + T +GG+++L++ +++ LF E++ +LN +L V R
Sbjct: 2 RFSKLDLFEKLDNNHRTGTTTGGILSLITIGLIISLFVIEIKSFLNPPLRQRLSVVNKRP 61
Query: 66 ------------GETLRINFDVTFPALPCSILSVDAMDISGEQHL-DVKHDIFKKRLDSQ 112
E ++NFD+ FP PC +L D +D + L +I R S
Sbjct: 62 TEADGVTITKESQEKTKVNFDIFFPNAPCYLLHFDLIDAVSQLDLFTYNQNITYTRFSSD 121
Query: 113 GNVIESRQDGIGAPKIDKPLQRHGGRLEHNE-TYCGSCYGAESSDE--DCCNNCEEVREA 169
G +I H R ++ T CG C + + CCN C++V E
Sbjct: 122 GKIIGDFD--------------HSARFNTSKVTECGFCNATKGLKDKYKCCNTCQQVLEV 167
Query: 170 YRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKS-FHQ 228
D I QC + ++ +K+ + EGC I G E K+ FH +PG S +
Sbjct: 168 ----AQVFRVVD-IPQCSDK--VKELKKMQNEGCRIKGNFETIKIKAEFHISPGYSVIDE 220
Query: 229 SGVHVHDILAFQRD--SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKV 286
GVH HD+ +F D N+S+K+N FG+ + LDG Q+ Y
Sbjct: 221 DGVHAHDVSSFIDDVSELNLSYKLNHCRFGDQNH---SQLDGFSTIQKQIGYFY------ 271
Query: 287 VPTVYT-DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFL 345
VYT DVS ++N +S T + + G L +PG+ F YD I + +
Sbjct: 272 --AVYTIDVS----ENNDYS-TAYMEQVDNGTL--VPGIVFKYDFGIITAKSFPDRPPLI 322
Query: 346 HFLTNVCAIVGGVFTVSGIIDAFIYHG--QRAIKK 378
H +N+ ++ GGV + I+D ++ QR I K
Sbjct: 323 HLFSNLVSMAGGVAMIFYILDYALFSSIKQRKIHK 357
>gi|410349413|gb|JAA41310.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
gi|410349417|gb|JAA41312.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
Length = 290
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/97 (25%), Positives = 47/97 (48%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNSQCRLVGLDIQDEMGRHEVGHID 102
>gi|355686511|gb|AER98080.1| endoplasmic reticulum-golgi intermediate compartment 1 [Mustela
putorius furo]
Length = 312
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 129 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 176
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 177 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 236
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 237 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 294
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 295 SCIFTASEAW-KKIQLGKM 312
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 29 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 88
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 89 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 125
>gi|301763094|ref|XP_002916978.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Ailuropoda melanoleuca]
Length = 306
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 122 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 169
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 170 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 229
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 230 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 287
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 288 SCIFTASEAW-KKIQLGKM 305
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSRGE 67
D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D G
Sbjct: 24 FDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDSGG 83
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
+ ++ +++ P L C ++ +D D G H+D
Sbjct: 84 KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 118
>gi|281351238|gb|EFB26822.1| hypothetical protein PANDA_005115 [Ailuropoda melanoleuca]
Length = 238
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 54 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 101
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 102 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 161
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 162 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 219
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 220 SCIFTASEAW-KKIQLGKM 237
>gi|154305556|ref|XP_001553180.1| hypothetical protein BC1G_08547 [Botryotinia fuckeliana B05.10]
Length = 381
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 100/353 (28%), Positives = 158/353 (44%), Gaps = 54/353 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ DA+PK + ++T GG T+ +V L SE + +V+ G
Sbjct: 21 VKAFDAFPKAKPQYITQTSGGGKWTVAMMLVSFALLVSEFMRWWTGHETHTFVVEKGVGH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+L++N D+ + CS L ++ D +G++ L I K + N D G +
Sbjct: 81 SLQVNMDMVV-KMKCSELHINVQDAAGDRILA---GIMLKEDATNWN---QWVDAKGMHQ 133
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ K GR+ E Y +G E D + R + K P
Sbjct: 134 LGKDAH---GRVITGEEYHEEGFGEEHV-HDIVTLGGKKRAKFAKTPRVKGGP------- 182
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
+ G+ C +YG LEVNKV G+FH A G + + G H+ +FN
Sbjct: 183 ----------KGGDSCRVYGSLEVNKVQGDFHLTARGHGYPEMGHHL------DHSAFNF 226
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHT------ 298
SH IN+L+FG +P ++NPLD R TP+ YQYF+ VVPT+Y+
Sbjct: 227 SHIINELSFGPFYPSLLNPLD--RTIAGTPNHFHKYQYFLSVVPTLYSLSPSTFSPSSSP 284
Query: 299 --IQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
+++NQ++VT EH +++PG+FF YD+ P+ +T E FL F
Sbjct: 285 TLLRTNQYAVTSQEHIVGE-----RSVPGIFFKYDIEPLLLTVEESRDGFLRF 332
>gi|13385678|ref|NP_080446.1| endoplasmic reticulum-Golgi intermediate compartment protein 1 [Mus
musculus]
gi|52000733|sp|Q9DC16.1|ERGI1_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|12835932|dbj|BAB23423.1| unnamed protein product [Mus musculus]
gi|13529617|gb|AAH05516.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Mus
musculus]
gi|26351067|dbj|BAC39170.1| unnamed protein product [Mus musculus]
gi|26353098|dbj|BAC40179.1| unnamed protein product [Mus musculus]
gi|53236959|gb|AAH83144.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Mus
musculus]
gi|71059789|emb|CAJ18438.1| 1200007D18Rik [Mus musculus]
gi|74185526|dbj|BAE30231.1| unnamed protein product [Mus musculus]
gi|148690563|gb|EDL22510.1| RIKEN cDNA 1200007D18 [Mus musculus]
gi|158148953|dbj|BAF82010.1| MAA-136 protein [Mus musculus]
Length = 290
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/198 (37%), Positives = 105/198 (53%), Gaps = 22/198 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHTIHKL 153
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGK 384
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGK 288
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|351705474|gb|EHB08393.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Heterocephalus glaber]
Length = 305
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 121 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 168
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 169 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQWYSYQYTVAN 228
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 229 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 286
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 287 SCIFTASEAW-KKIQLGKM 304
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 6/98 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+ D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 20 VEGFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 79
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 80 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 117
>gi|403290258|ref|XP_003936243.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Saimiri boliviensis boliviensis]
Length = 415
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 231 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 278
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 279 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGRQQYSYQYTVAN 338
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 339 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 396
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 397 SCIFTASEAW-KKIQLGKM 414
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 6/98 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+ D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 130 LHRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 189
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 190 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 227
>gi|302508773|ref|XP_003016347.1| hypothetical protein ARB_05746 [Arthroderma benhamiae CBS 112371]
gi|291179916|gb|EFE35702.1| hypothetical protein ARB_05746 [Arthroderma benhamiae CBS 112371]
Length = 427
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 103/413 (24%), Positives = 169/413 (40%), Gaps = 82/413 (19%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D I K+++ DA+PK + S + GG+ T+ +I+ +L SEL + V
Sbjct: 18 DGIATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSV 77
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
+ + +++N D T A+PC + ++ D +G+ L G+++
Sbjct: 78 ERGVSQEMQLNID-TVVAMPCDDVRINIQDAAGDHIL-------------AGDLLTQEPT 123
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCC--NNCEEVREAYRK---KGWA 176
GA + +R GG E+ E +ED + EVR + +K K
Sbjct: 124 SWGAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQEEDLHVEHVLGEVRRSRKKKFPKSPK 183
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFH--------FAPGKS--- 225
L D +D C+ ++G LE NKV GN H F G++
Sbjct: 184 LKKSDAVDSCR-----------------VFGSLEGNKVQGNLHITARGFGYFEWGRATNP 226
Query: 226 ------------FHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQ 273
H ++ D L N +H I +L+FG H+ ++NPLD +
Sbjct: 227 HSMSLLQPIITCIHGDAKNLTDQLTKLFPGLNFTHLITELSFGPHYGRLLNPLDKTVSST 286
Query: 274 ETPSGMYQYFIKVVPTVYTDVSGH---------------------TIQSNQFSVTEHFRS 312
YQY + VVPT+YT SGH T+ +NQ++VT +
Sbjct: 287 SINFYKYQYHLSVVPTIYTK-SGHIDPNRRSLPDTSTITAKDSKTTVSTNQYAVTS-YSQ 344
Query: 313 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
Q R+ PG+FF Y++ PI + ++E S L + + +V GV G +
Sbjct: 345 PIQPRIDATPGIFFKYNIEPILLIVSQERDSLLALMVRLVNVVSGVLVTGGWL 397
>gi|403413226|emb|CCL99926.1| predicted protein [Fibroporia radiculosa]
Length = 546
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 165/391 (42%), Gaps = 66/391 (16%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D + + DA+PK+ + +R+ S G +T+ ++V LL ++L YL ++ + V
Sbjct: 21 DIVPAPLAQFDAFPKLPSTYKARSESRGFLTIFVALVAFLLILNDLGEYLWGWSDHEFSV 80
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D+ L +N D+ +PC LSVD D G++ +F R R+D
Sbjct: 81 DSDTTNGLNLNVDLMV-NMPCQYLSVDLRDAVGDR-------LFLSR--------GFRRD 124
Query: 122 GIGAPKID----KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW-- 175
GI K D L+ H L + + ++ + +G+
Sbjct: 125 GI---KFDVGHATALKEHAAALSAQQA---------------------IAQSRKSRGFFS 160
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
L D+ + +++G C IYG + K N H + S HV
Sbjct: 161 TLFRKDVAQYRPTHNY-----QKDGSACRIYGTITAKKATANLHITTIGHGYASRDHV-- 213
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
N+SH IN+ +FG FP +V PLD P YQY++ VVPT Y
Sbjct: 214 ----DHKYMNLSHVINEFSFGPFFPEIVQPLDNSFELALDPFVAYQYYLHVVPTTYIAPR 269
Query: 296 GHTIQSNQFSVTEHFR--SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
+ ++Q+SVT + R S+ QG PG+FF +DL P+ +T + + FL
Sbjct: 270 STPLHTHQYSVTHYTRTMSTHQG----TPGIFFKFDLEPMHLTIHQRTTTLAQFLIRCVG 325
Query: 354 IVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+VGG+F G + G RA++ + +
Sbjct: 326 VVGGIFVCMGYA---VRVGTRAVEAATGVDR 353
>gi|397485838|ref|XP_003814045.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Pan paniscus]
Length = 290
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDMLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|350594414|ref|XP_003134100.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Sus scrofa]
Length = 313
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 106/200 (53%), Gaps = 22/200 (11%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
+I +G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 128 MKIPLNDGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPPNPDMTHVIHK 175
Query: 253 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
L+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 176 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 235
Query: 308 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 236 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 293
Query: 366 DAFIYHGQRAIKKKIEIGKF 385
D+ I+ A KKI++GK
Sbjct: 294 DSCIFTASEA-WKKIQLGKM 312
Score = 45.8 bits (107), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 24/95 (25%), Positives = 46/95 (48%), Gaps = 6/95 (6%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSRGE 67
D Y K+ +D T++G +I++ + + LF SEL ++ +L V D G
Sbjct: 31 FDIYRKVPKDLTQPTYTGAIISICCCLFIFFLFLSELTGFITTEIVNELYVDDPDKDSGG 90
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
+ ++ +++ P L C ++ +D D G H+D
Sbjct: 91 KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 125
>gi|72534712|ref|NP_001026881.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Homo sapiens]
gi|332248275|ref|XP_003273290.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Nomascus leucogenys]
gi|426351000|ref|XP_004043047.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Gorilla gorilla gorilla]
gi|51701446|sp|Q969X5.1|ERGI1_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|15215343|gb|AAH12766.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
[Homo sapiens]
gi|15680269|gb|AAH14490.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
[Homo sapiens]
gi|119581826|gb|EAW61422.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1,
isoform CRA_a [Homo sapiens]
gi|208966210|dbj|BAG73119.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
[synthetic construct]
gi|410301142|gb|JAA29171.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
gi|410349415|gb|JAA41311.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
Length = 290
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|354477345|ref|XP_003500881.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Cricetulus griseus]
Length = 333
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 149 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 196
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 197 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 256
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 257 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 314
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 315 SCIFTASEAW-KKIQLGKI 332
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 6/98 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+ D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 48 VHRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 107
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 108 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 145
>gi|432100023|gb|ELK28916.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Myotis davidii]
Length = 298
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 114 KIPLNSGAGCRFEGQFSINKVPGNFH-----------VSTHSASA-QPQNPDMTHVIHKL 161
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 162 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 221
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 222 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 279
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 280 SCIFTASEA-WKKIQLGKM 297
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/102 (24%), Positives = 50/102 (49%), Gaps = 6/102 (5%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV-- 61
++ + D Y K+ +D T++G +I++ + +L LF SEL ++ +L V
Sbjct: 9 LITQTCRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDD 68
Query: 62 -DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
D G + ++ +++ P L C ++ +D D G H+D
Sbjct: 69 PDKDSGGKIDVSLNISLPNLHCDLVGLDIQDEMGRHEVGHID 110
>gi|395736490|ref|XP_002816264.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Pongo abelii]
Length = 290
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPHLHCELVGLDIQDEMGRHEVGHID 102
>gi|355691849|gb|EHH27034.1| hypothetical protein EGK_17136, partial [Macaca mulatta]
gi|355750428|gb|EHH54766.1| hypothetical protein EGM_15664, partial [Macaca fascicularis]
Length = 290
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 6/98 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 5 LRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 65 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|347828541|emb|CCD44238.1| similar to endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Botryotinia fuckeliana]
Length = 381
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 99/353 (28%), Positives = 158/353 (44%), Gaps = 54/353 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ DA+PK + ++T GG T+ +V L SE + +V+ G
Sbjct: 21 VKAFDAFPKAKPQYITQTSGGGKWTVAMMLVSFALLVSEFMRWWTGHETHTFVVEKGVGH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+L++N D+ + CS L ++ D +G++ L I K + N D G +
Sbjct: 81 SLQVNMDMVV-KMKCSELHINVQDAAGDRILA---GIMLKEDATNWN---QWVDAKGMHQ 133
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ K GR+ E Y +G E D + R + K P
Sbjct: 134 LGKDAH---GRVITGEEYHEEGFGEEHV-HDIVTLGGKKRAKFAKTPRVKGGP------- 182
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
+ G+ C +YG LEVNKV G+FH A G + + G H+ +FN
Sbjct: 183 ----------KGGDSCRVYGSLEVNKVQGDFHLTARGHGYPEMGHHL------DHSAFNF 226
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHT------ 298
SH IN+L+FG +P ++NPLD R TP+ YQYF+ +VPT+Y+
Sbjct: 227 SHIINELSFGPFYPSLLNPLD--RTIAGTPNHFHKYQYFLSIVPTLYSLSPSTFSPSSSP 284
Query: 299 --IQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
+++NQ++VT EH +++PG+FF YD+ P+ +T E FL F
Sbjct: 285 TLLRTNQYAVTSQEHIVGE-----RSVPGIFFKYDIEPLLLTVEESRDGFLRF 332
>gi|338713524|ref|XP_001499596.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Equus caballus]
Length = 356
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
++ G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 172 KVPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 219
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 220 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 279
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 280 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 337
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 338 SCIFTASEAW-KKIQLGKM 355
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 6/98 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 71 LRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 130
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 131 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 168
>gi|402873423|ref|XP_003900575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Papio anubis]
gi|380784387|gb|AFE64069.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Macaca mulatta]
gi|383408185|gb|AFH27306.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Macaca mulatta]
gi|384941372|gb|AFI34291.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Macaca mulatta]
Length = 290
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|410082748|ref|XP_003958952.1| hypothetical protein KAFR_0I00360 [Kazachstania africana CBS 2517]
gi|372465542|emb|CCF59817.1| hypothetical protein KAFR_0I00360 [Kazachstania africana CBS 2517]
Length = 354
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 91/368 (24%), Positives = 175/368 (47%), Gaps = 56/368 (15%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +R+ DA+PK E++ ++ GG+ +L++ ++ + ++E Y + + VD
Sbjct: 1 MAGLRTFDAFPKTEEEYQKKSSKGGLSSLLTYFFLIFIAWTEFGNYFGGYIDEQYTVDPE 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGE-----QHLDVKHDIFKKRLDSQGNVIESR 119
E ++IN D+ F +PC L ++A D++ + + L ++ F D++ N I
Sbjct: 61 VKEDIQINMDI-FVNIPCKWLHINARDMTLDRKLAGEELKLEDMPFFIPFDTRVNDITE- 118
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
I P++D+ L G AE ++ ++R+ Y +
Sbjct: 119 ---IVTPELDRIL--------------GEAIPAEFREKI------DMRQFYDENNH---- 151
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
D+ K F+ E GC+++G + VN+V G G+ D
Sbjct: 152 ----DETKH--FVP-----EFNGCHVFGSIPVNRVTGELQIT------AKGMGYPDREKA 194
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
D N +H IN+L+FG+ +P + NPLD ++ QE P Y Y + V+PT+Y + G
Sbjct: 195 PIDEVNFAHVINELSFGDFYPYIDNPLDNSAKFDQENPISAYVYHMNVIPTIYQKL-GAE 253
Query: 299 IQSNQFSVTE-HFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+ +NQ+SV+E H+ ++ + +PG+F Y+ P+ + T++ +SF+ F+ + AI+
Sbjct: 254 VDTNQYSVSEYHYTEADNAIRKAGRVPGIFLKYNFEPLSIVVTDKRLSFIQFVIRLVAIL 313
Query: 356 GGVFTVSG 363
+ ++
Sbjct: 314 SFIVYIAS 321
>gi|149241719|ref|XP_001526345.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450468|gb|EDK44724.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 353
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 167/369 (45%), Gaps = 59/369 (15%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M ++ ++++ DA+PK++ R+ GG+ TL++ LL+ + E+ ++ + +
Sbjct: 1 MSSLSKRVKTFDAFPKVDPQHQVRSERGGLSTLLTYFFGLLILWVEVGGFIGGYVDRQFE 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD L IN D+ A+PC + + DI+ ++ L + L+ +G Q
Sbjct: 61 VDRVVRSDLSINVDMIV-AMPCEFIHTNVEDITRDRFLA------GETLNFEGIHFFIPQ 113
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+ KI+ P H E+ D D E +R +R+ G
Sbjct: 114 NF----KINNPNDFH-----------------ETPDLDEVMQ-ESLRAEFRQGG------ 145
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
QRI E C+I+G + VN+V G+F GK F S D L
Sbjct: 146 ------------QRINEG-APACHIFGSIPVNQVKGDFRIT-GKGFGYS-----DRLHVP 186
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+ N +H I + ++GE FP + NPLD E Y Y +VVPT+Y + G +
Sbjct: 187 LAALNFTHVIQEFSYGEFFPFLNNPLDATGKVTEEKLQAYIYNAQVVPTLYEKL-GLEVD 245
Query: 301 SNQFSVTEHFRSSE----QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+NQ+S+TE+ + R Q +PG++F Y+ PIK+T E+ + F F+ + I G
Sbjct: 246 TNQYSLTENHHVIKLDEISNRPQGVPGIYFRYEFEPIKLTIREKRIPFFQFVARLGTICG 305
Query: 357 GVFTVSGII 365
G+ +G +
Sbjct: 306 GLLVAAGYL 314
>gi|348575225|ref|XP_003473390.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Cavia porcellus]
Length = 345
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 161 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 208
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 209 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 268
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 269 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 326
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 327 SCIFTASEAW-KKIQLGKM 344
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 61 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 120
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 121 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 157
>gi|366998832|ref|XP_003684152.1| hypothetical protein TPHA_0B00460 [Tetrapisispora phaffii CBS 4417]
gi|357522448|emb|CCE61718.1| hypothetical protein TPHA_0B00460 [Tetrapisispora phaffii CBS 4417]
Length = 349
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 163/367 (44%), Gaps = 54/367 (14%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +++ DA+PK E ++ GG+ ++++ +LL+ ++E Y + + VD
Sbjct: 1 MAGLKTFDAFPKTEERHVKKSKKGGLSSILTYAFLLLIAWTEFGSYFGGYIDKQYSVDKD 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+ ++IN D+ + +PC L V+ +D + ++ + + IF+
Sbjct: 61 IRKVVQINMDI-YVKMPCEWLHVNVLDDTNDRKIVSEELIFE------------------ 101
Query: 125 APKIDKPL-QRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
D P HG ++ + E D E RE K L PD
Sbjct: 102 ----DMPFFVPHGSKVNN----LNKVVTPELDDILAEAIPAEFREKIETK--PLLGPD-- 149
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
+ F E GC++YG + VN+VAG G D +D
Sbjct: 150 ---GKPIF-------ELTGCHVYGSVTVNRVAGEMQIT------AKGYGYRDRKRAPKDL 193
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
+ +H +N+ +FG+ +P + NPLDG + +P Y YF+ VVPT Y + G I +N
Sbjct: 194 IDFNHVVNEFSFGDFYPYIENPLDGTCKMYPNSPFSSYNYFMSVVPTFYQKL-GAEIDTN 252
Query: 303 QFSVTEHF----RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
Q+S+ E+ S+ +L T+PG+F YD P+ + ++ ++FL F+ + AI+ V
Sbjct: 253 QYSIREYHVDLKNSNVNAKLSTIPGIFLKYDFEPLAIIISDVRLTFLQFIVRLVAILSFV 312
Query: 359 FTVSGII 365
++ I
Sbjct: 313 LYIASWI 319
>gi|157865526|ref|XP_001681470.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68124767|emb|CAJ02321.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 365
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 101/372 (27%), Positives = 163/372 (43%), Gaps = 32/372 (8%)
Query: 14 YPKINEDFYSRTFSGGVITLVSSI-VMLLLFFSELRLYLNA--VTETKLLVDTSRGETLR 70
+PK ED+ G + VS++ +++LL E YL T + +D E +
Sbjct: 2 FPKPKEDYQREQTRWGAVLSVSTVSIVILLVLWEGAAYLRGRDAYSTDVSLDKGLSEDMP 61
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK 130
++FDV FP +PC+ LS+D +D +G + + K G V+ +++
Sbjct: 62 VHFDVLFPFMPCNRLSIDVVDTTGMAKFNYTGRLHKLPTALDGEVLYKGSLKDLDNEMET 121
Query: 131 PLQRHGGRLEHNETYCGSCYGAE---SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
R G + AE ++ CC+ CE V Y++ G + + I QC
Sbjct: 122 EEVRTGKKCRQCPPSAFDGVAAEVRSAAASKCCDTCESVLGLYKELGRGVPGTEYIPQC- 180
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKS--FHQSGVHVHDILAFQRDSFN 245
E QR GC + G L++ KV F P ++ F+ + D++ +
Sbjct: 181 LEQLYQR-----ASGCAVMGSLDLKKVPVTVIFGPRRTGQFYS----LKDVI-----RLD 226
Query: 246 ISHKINKLAFG----EHFP--GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
SH I KL G E F GV L G + + +T S +Y +KVVPT Y
Sbjct: 227 TSHFIRKLRIGDETVERFSKNGVAERLSGHKSSSKTYSET-RYLVKVVPTTYRKTKTKNA 285
Query: 300 QSNQFSVTEHF--RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+++ + + + R+ G +P V F ++ +PI+V E F HFL +C IVGG
Sbjct: 286 KASTYEYSAQWSRRTILVGFAGAVPAVLFEFEPAPIQVNNVFERQPFSHFLVQLCGIVGG 345
Query: 358 VFTVSGIIDAFI 369
+F V G ID +
Sbjct: 346 LFVVLGFIDNVV 357
>gi|67901384|ref|XP_680948.1| hypothetical protein AN7679.2 [Aspergillus nidulans FGSC A4]
gi|40742675|gb|EAA61865.1| hypothetical protein AN7679.2 [Aspergillus nidulans FGSC A4]
gi|259484020|tpe|CBF79887.1| TPA: COPII-coated vesicle protein (Erv41), putative
(AFU_orthologue; AFUA_2G01530) [Aspergillus nidulans
FGSC A4]
Length = 394
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 158/380 (41%), Gaps = 62/380 (16%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R+ DA+PK + + + GG T++ I+ + +E R +L V+
Sbjct: 24 LRTFDAFPKTKPSYTTPSRRGGQWTVLILIICTIFSITEFRTWLKGHETHHFTVEKGVSH 83
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L++NFD +PC L ++ D +G++ L ++ KK S ++ R
Sbjct: 84 DLQLNFDAVI-HMPCDALHINIQDAAGDRVL--ASEMLKKEPTSWKLWMDKRN------- 133
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNN---CEEVREAYRK--KGWALSNPDL 182
H + G + +ED E R RK KG L D+
Sbjct: 134 ------YHSSEYQTLSDSRGDEERVAAMEEDVHAGHVLNELRRNGKRKFAKGPKLRRGDV 187
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQR 241
+D C+ IYG LE NKV G+FH A G + H+
Sbjct: 188 VDSCR-----------------IYGSLEGNKVQGDFHITARGHGYRDGREHL------DH 224
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT--------- 292
+FN SH I +L+FG H+P + NPLD T E YQYF+ +VPT+Y+
Sbjct: 225 SAFNFSHIITELSFGPHYPSLHNPLDKTIATTEFHYYKYQYFLSIVPTIYSRNQNLRLDA 284
Query: 293 -------DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFL 345
+ + I +NQ++ T + + +PG+FF Y++ PI + +EE FL
Sbjct: 285 LPSSSSARSNKNLIFTNQYAATSQSDAIPESPY-VIPGIFFKYNIEPIMLLISEERTGFL 343
Query: 346 HFLTNVCAIVGGVFTVSGII 365
+ L + V GV G +
Sbjct: 344 NLLIRIVNTVSGVLVTGGWV 363
>gi|440902711|gb|ELR53466.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1,
partial [Bos grunniens mutus]
Length = 290
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEA-WKKIQLGKM 289
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/102 (25%), Positives = 51/102 (50%), Gaps = 6/102 (5%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV-- 61
+ + +R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V
Sbjct: 1 VPSALRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDD 60
Query: 62 -DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
D G + ++ +++ P L C ++ +D D G H+D
Sbjct: 61 PDKDSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|156065931|ref|XP_001598887.1| hypothetical protein SS1G_00976 [Sclerotinia sclerotiorum 1980]
gi|154691835|gb|EDN91573.1| hypothetical protein SS1G_00976 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 421
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 155/353 (43%), Gaps = 54/353 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ DA+PK + ++T GG T+ I+ L SE + +V+ G
Sbjct: 21 VQAFDAFPKAKPQYITQTSGGGKWTVAMLIISFALLLSEFSRWWTGYETHTFVVEKGIGH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+L+IN D+ + CS L ++ D +G++ L ++ +
Sbjct: 81 SLQINMDMVV-KMKCSGLHINVQDAAGDRIL--------------AGIMLKEDPTNWSQW 125
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+D G+ H G Y E E+ ++ V +K+ P L
Sbjct: 126 VDAKGVHQLGKDAHGRVVTGEEYHEEGFGEEHVHDI--VALGGKKRAKFAKTPRL----- 178
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
+ G+ C +YG LEVNKV G+FH A G + + G H+ ++FN
Sbjct: 179 ------KGGPRGGDSCRVYGSLEVNKVQGDFHITAKGHGYPELGQHL------DHNAFNF 226
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHTI----- 299
SH IN+L+FG +P ++NPLD R TP+ YQYF+ +VPT+Y+
Sbjct: 227 SHIINELSFGPFYPSLLNPLD--RTIAGTPNHFHKYQYFLSIVPTLYSLSPSTFSPSSSP 284
Query: 300 ---QSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
++NQ++VT EH + +PG+FF YD+ P+ +T E FL F
Sbjct: 285 SLLRTNQYAVTSQEHIVGE-----RNVPGIFFKYDIEPLLLTVEESRDGFLRF 332
>gi|146163751|ref|XP_001012240.2| hypothetical protein TTHERM_00103890 [Tetrahymena thermophila]
gi|146145943|gb|EAR91995.2| hypothetical protein TTHERM_00103890 [Tetrahymena thermophila
SB210]
Length = 331
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 174/384 (45%), Gaps = 88/384 (22%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R LD + K+N+D + T +GGV ++++ +V +LF++EL+ Y K+ V E
Sbjct: 1 MRGLDFFQKVNQDIDTSTATGGVYSIIAFVVGFILFWNELKDYRTDQMIYKMRVQQLEVE 60
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+++ N D+ PC++L++D D G LD I K R+ G +ES G G
Sbjct: 61 SVKANIDLHIYGSPCTLLALDLQDEVGNHTLDYTDTIKKIRVLKDGTELES---GFG--- 114
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ N Y GS +E+ EA ID
Sbjct: 115 ------------DGNPNYRGSS--------------QEIDEA-------------IDAVN 135
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--- 244
E EGC I G++ + KV GNFH S+H ++ I + + D++
Sbjct: 136 NE-----------EGCRINGYINLKKVPGNFHI----SYHAKMDVMNRIASTKPDTYSKI 180
Query: 245 NISHKINKLAFGEH--FPGVVNPLDGVRWTQETPSGMYQY---------------FIKVV 287
N+++KIN L FGE+ + + G QET + Y + ++K++
Sbjct: 181 NLNYKINHLGFGENTNHMATIFKIMGRTLFQETNTNDYPHDDTKYINPGKNDYDNYLKIL 240
Query: 288 PTVYTDVSGH-TIQSNQFSV--TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 344
P Y H ++ ++++ T +SS + +P +FF Y++SPI V ++ + SF
Sbjct: 241 PCRYDSNKLHMSVSRYKYAMYSTHTPKSSTE-----IPTIFFRYEISPINVYYSTKSKSF 295
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAF 368
HFL + AIVGG+F V GI ++
Sbjct: 296 YHFLVQIFAIVGGIFAVMGIFNSL 319
>gi|426246271|ref|XP_004016918.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Ovis aries]
Length = 290
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|322697212|gb|EFY88994.1| COPII-coated vesicle protein (Erv41), putative [Metarhizium acridum
CQMa 102]
Length = 372
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 160/357 (44%), Gaps = 44/357 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK ++ +RT GG T+ ++V + L ++E+ + V+
Sbjct: 21 VSAFDAFPKSKPEYVTRTEGGGKWTVAMAVVSIFLLWAEIARWWRGSESHTFAVEKGISH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+++IN D T + C L ++ D +G++ L K +D Q G+
Sbjct: 81 SMQINLD-TVILMKCGDLHINVQDAAGDRILAGA----KLNMDETSWSQWVNQKGV---- 131
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
GR G+ G ++ D++ E V D++ +
Sbjct: 132 ------HKLGRDSEGRVVTGA--GWQNLDDEGFGE-EHVH-------------DIVALGQ 169
Query: 188 REGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 244
R + +G + C IYG L++NKV G+FH A G + G H+ F
Sbjct: 170 RRAKWAKTPRVKGPPDSCRIYGSLDLNKVQGDFHITARGHGYRGQGSHL------DHSQF 223
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N SH I++L+FG ++P +VNPLD E +QY++ VVPT Y+ V +I +NQ+
Sbjct: 224 NFSHIISELSFGSYYPSLVNPLDRTINIAENHFHKFQYYVSVVPTRYS-VGSSSIFTNQY 282
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
+VTE + + +PG+F YD+ PI ++ E+ L F+ + ++ GV
Sbjct: 283 AVTEQSKGVSE---YNVPGIFVKYDIEPILLSVNEDRDGILMFVVKLINVLSGVLVA 336
>gi|392331685|ref|XP_003752358.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Rattus norvegicus]
Length = 290
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 75/198 (37%), Positives = 104/198 (52%), Gaps = 22/198 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 153
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGK 384
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEA-WKKIQLGK 288
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|195130281|ref|XP_002009580.1| GI15435 [Drosophila mojavensis]
gi|193908030|gb|EDW06897.1| GI15435 [Drosophila mojavensis]
Length = 433
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 191/380 (50%), Gaps = 36/380 (9%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + T GG ++L+S ++++ L ++ELR Y N ET+++ D S
Sbjct: 19 KNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELRYYWN---ETEIIYQFEPDIS 75
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
E ++++ D+T A+PC+ LS VD MD + + D+F + G + +++G+
Sbjct: 76 LDEQVQMHVDITV-AMPCASLSGVDLMD-------ETQLDVF-----AYGTL---QREGV 119
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYG--AESSDEDCCNNCEEVREAYRKKGWALSNPD 181
D +RH ++ Y Y A+ +D +E+ + S+
Sbjct: 120 WWQMSDAD-RRHFQSMQMTNHYLREEYHSVADILFKDILRERSPPKESDTQ-----SDAA 173
Query: 182 LIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
LQ+I + E + C ++G L +NKVAG H G H ++ F
Sbjct: 174 APPPPGALQQLQQISQMESKYDACRLHGTLGINKVAGVLHLVGGAQPVVGMFEDHWMIEF 233
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
+R N +H+IN+L+FG++ +V PL+G + QYFIKVVPT + TI
Sbjct: 234 RRMPANFTHRINRLSFGQYSRRIVQPLEGDETIIREEATTVQYFIKVVPTEIRH-TFSTI 292
Query: 300 QSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
+ Q++VTE+ R + R PG++F YD S +K+ + + + + F+ +C+I+ G+
Sbjct: 293 STFQYAVTENVRKLDAERNSYGSPGIYFKYDWSALKIVVSHDRDNLVTFVIRLCSIISGI 352
Query: 359 FTVSGIIDAFIYHGQRAIKK 378
+SG ++A + QR + +
Sbjct: 353 IVISGAVNALLVAIQRRLLR 372
>gi|344265732|ref|XP_003404936.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Loxodonta africana]
Length = 338
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 154 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 201
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D +G S Q++V
Sbjct: 202 SFGDTLQVQNVQGAFNALGGADRLHSNPLASHDYILKIVPTVYEDKNGKQRYSYQYTVAN 261
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 262 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 319
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 320 SCIFTASEAW-KKIQLGKM 337
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 47/97 (48%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 54 RRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 113
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + + +++ P L C ++ +D D G H+D
Sbjct: 114 GGKIDVTLNISLPNLHCELVGLDIQDEMGRHEVGHID 150
>gi|149052230|gb|EDM04047.1| rCG34297 [Rattus norvegicus]
Length = 283
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 75/198 (37%), Positives = 105/198 (53%), Gaps = 22/198 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 99 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 146
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 147 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 206
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 207 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 264
Query: 367 AFIYHGQRAIKKKIEIGK 384
+ I+ A KKI++GK
Sbjct: 265 SCIFTASEAW-KKIQLGK 281
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 3/77 (3%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPC 82
G + ++ +++ P L C
Sbjct: 66 GGKIDVSLNISLPNLHC 82
>gi|322792517|gb|EFZ16475.1| hypothetical protein SINV_13267 [Solenopsis invicta]
Length = 110
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 58/110 (52%), Positives = 80/110 (72%), Gaps = 7/110 (6%)
Query: 279 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----LPGVFFFYDLSPIK 334
M+ ++IK+VPT Y G T+ +NQFSVT H ++Q L T +PG+FF Y+LSP+
Sbjct: 4 MFYHYIKIVPTTYVRADGSTLLTNQFSVTRH---AKQVSLLTGESGMPGIFFSYELSPLM 60
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
V +TE+ SF HF TN CAI+GGVFTV+G+ID+ +YH RAI++KIE+GK
Sbjct: 61 VKYTEKAKSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQRKIELGK 110
>gi|392594239|gb|EIW83563.1| DUF1692-domain-containing protein [Coniophora puteana RWD-64-598
SS2]
Length = 506
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 163/384 (42%), Gaps = 62/384 (16%)
Query: 2 DAIMNKIRS------LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVT 55
D+I++K+ + DA+PK+ + SR+ S G +T+ + LL ++L Y+
Sbjct: 8 DSILSKLDAAVPLAKFDAFPKLPSSYKSRSESRGFLTIFVGFLCFLLILNDLSEYIWGWP 67
Query: 56 ETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNV 115
+ + VD + +N D+ +PC LSVD D+SG++ K G +
Sbjct: 68 DYEFGVDKQSKSFMDVNVDMVV-NMPCQFLSVDLRDVSGDRLY------LSKGFRRDGTL 120
Query: 116 IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW 175
+ Q L+ H L + V ++ + +G+
Sbjct: 121 FDIGQA--------TSLKEHAKMLSAQQA---------------------VSQSRKSRGF 151
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
+ K E + +G C IYG L V KV N H + S +HV
Sbjct: 152 F----SWFKRSKAEFRPTYNHQPDGSACRIYGTLAVKKVTANLHVTTLGHGYTSHMHV-- 205
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
N+SH I + +FG +FP + PLD + P +QY++ VVPT Y
Sbjct: 206 ----DHTKMNLSHVITEFSFGPYFPDISQPLDYSFEVAKDPYTAFQYYMHVVPTNYIAPR 261
Query: 296 GHTIQSNQFSVTEH---FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+++NQ+SVT + +++ +G +PG+FF +DL P+ ++ + S +
Sbjct: 262 SKPLETNQYSVTHYTHIYKTPHEG----IPGIFFKFDLDPMVLSIHQRTTSLTALIIRCV 317
Query: 353 AIVGGVFTVSGIIDAFIYHGQRAI 376
++GGVFT + F+ RA+
Sbjct: 318 GVIGGVFTCA---TYFVRASMRAV 338
>gi|258573091|ref|XP_002540727.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237900993|gb|EEP75394.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 398
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 101/396 (25%), Positives = 162/396 (40%), Gaps = 79/396 (19%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I +R+ DA+PK + + + GG T+ + L FSEL + V+
Sbjct: 19 GIAAGLRTFDAFPKTKPTYTTASRRGGQWTVFIFLFCGSLVFSELVSWYRGTENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHL--------DVKHDIFKKRLD--SQ 112
+ ++IN D+ +PC L ++ D G+ L D D + + L+ S+
Sbjct: 79 KGVSQEIQINLDMVV-HMPCEALRMNMQDAVGDFILAAELLHKDDTSWDAWNRELNYASK 137
Query: 113 GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRK 172
G G+P+ RL E D+ + EVR ++++
Sbjct: 138 G----------GSPQYQTLNAEDDTRLAEQE-----------EDQHVGHVLGEVRRSWKR 176
Query: 173 K---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS 229
K G L + D +D C+ IYG LE NKV GNFH
Sbjct: 177 KFPKGPKLKSKDAMDSCR-----------------IYGSLEGNKVQGNFHIT------AR 213
Query: 230 GVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT 289
G+ D F + N +H I +L+FG + ++NPLD + YQY++ VVPT
Sbjct: 214 GLGYWDPSGFHLEGLNFTHLITELSFGPRYSTLLNPLDKTVAGTKDAFYKYQYYLSVVPT 273
Query: 290 VYTDVS--------------------GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 329
+YT +TI +NQ++VT + Q ++ +PG+FF +D
Sbjct: 274 IYTRAGTVDPYNQELPDPSTITSRQRKNTIFTNQYAVTSQSHAIPQ-NVRAVPGIFFKFD 332
Query: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ PI + +EE S L L + +V GV G +
Sbjct: 333 IEPILLVVSEERGSLLALLVRLVNVVSGVLVAGGWV 368
>gi|225712696|gb|ACO12194.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Lepeophtheirus salmonis]
Length = 372
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 99/178 (55%), Gaps = 6/178 (3%)
Query: 195 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 254
I +E + C I+G L +NKVAGNFH +PGK+ HVH + +N +H+I++ +
Sbjct: 166 IPDEPHDACRIHGSLTLNKVAGNFHISPGKTLPLFRAHVHFATFGGDEVYNFTHRIDRFS 225
Query: 255 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT---IQSNQFSVTEHFR 311
FG G+V PL+G S YQY I+VVP TD+ G+T + Q+SV EH R
Sbjct: 226 FGTPHGGIVQPLEGEEKIAMQDSMHYQYLIQVVP---TDIQGYTDLIWSTYQYSVKEHKR 282
Query: 312 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
++++ PG++F YD+S +KV +++ FL + A VGG S I+ FI
Sbjct: 283 ATKERGSGDTPGIYFKYDMSALKVLASQDREPIFKFLVRLLAAVGGRIATSQIVCVFI 340
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 51/86 (59%), Gaps = 1/86 (1%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ LDA+PK+ E + +T SG I+++++I++++L SE +++ + + DT
Sbjct: 16 VKELDAFPKVPETYVEKTASGAAISIITTILVIVLLCSETSYFMDPGINFRFIPDTDFKS 75
Query: 68 TLRINFDVTFPALPCSILSVDAMDIS 93
L IN D+T A PC + D +D++
Sbjct: 76 KLEINVDITI-ATPCKAIGADVLDVT 100
>gi|115497382|ref|NP_001069885.1| endoplasmic reticulum-Golgi intermediate compartment protein 1 [Bos
taurus]
gi|111308658|gb|AAI20358.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Bos
taurus]
Length = 290
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 153
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQFSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEAW-KKIQLGKM 289
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|302659461|ref|XP_003021421.1| hypothetical protein TRV_04495 [Trichophyton verrucosum HKI 0517]
gi|291185318|gb|EFE40803.1| hypothetical protein TRV_04495 [Trichophyton verrucosum HKI 0517]
Length = 427
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/413 (24%), Positives = 168/413 (40%), Gaps = 82/413 (19%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D I K+++ DA+PK + S + GG+ T+ +I+ +L SEL + V
Sbjct: 18 DGIATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSV 77
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
+ + +++N D T A+PC + ++ D +G+ L G+++
Sbjct: 78 ERGVSQEMQLNID-TVVAMPCDDVRINIQDAAGDHIL-------------AGDLLTQEPT 123
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCC--NNCEEVREAYRK---KGWA 176
A + +R GG E+ E +ED + EVR + +K K
Sbjct: 124 SWAAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQEEDLHVEHVLGEVRRSRKKKFPKSPK 183
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFH--------FAPGKS--- 225
L D +D C+ ++G LE NKV GN H F G++
Sbjct: 184 LKKSDAVDSCR-----------------VFGSLEGNKVQGNLHITARGFGYFEWGRATNP 226
Query: 226 ------------FHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQ 273
H ++ D L N +H I +L+FG H+ ++NPLD +
Sbjct: 227 HSMSLLQPIITCIHGDAKNLTDQLTKLFPGLNFTHLITELSFGPHYGRLLNPLDKTVSST 286
Query: 274 ETPSGMYQYFIKVVPTVYTDVSGH---------------------TIQSNQFSVTEHFRS 312
YQY + VVPT+YT SGH T+ +NQ++VT +
Sbjct: 287 SINFYKYQYHLSVVPTIYTK-SGHIDPNRRSLPDASTITAKDSKTTVSTNQYAVTS-YSQ 344
Query: 313 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
Q R+ PG+FF Y++ PI + ++E S L + + +V GV G +
Sbjct: 345 PIQPRIDATPGIFFKYNIEPILLIVSQERDSLLALMVRLVNVVSGVLVTGGWL 397
>gi|296475934|tpg|DAA18049.1| TPA: endoplasmic reticulum-golgi intermediate compartment 32 kDa
protein [Bos taurus]
Length = 290
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 153
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 154 SFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQFSYQYTVAN 213
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 214 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 271
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 272 SCIFTASEA-WKKIQLGKM 289
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|158292441|ref|XP_001688474.1| AGAP005044-PB [Anopheles gambiae str. PEST]
gi|157016994|gb|EDO64057.1| AGAP005044-PB [Anopheles gambiae str. PEST]
Length = 287
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 102/172 (59%), Gaps = 2/172 (1%)
Query: 195 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 254
I E+ + C I+G L +NKVAGNFH GK+ H S H+H F N SH+IN+ +
Sbjct: 79 IPEKPHDACRIHGVLTLNKVAGNFHITVGKTIHFSRGHIHLNSIFANTQTNFSHRINRFS 138
Query: 255 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 314
FG+H G+++PL+G + M QYFI+VVPT H+ ++ Q++V E+ + +
Sbjct: 139 FGDHTAGIIHPLEGDEKLFDNGQVMMQYFIEVVPTDVQKFYSHS-KTYQYTVRENLQLID 197
Query: 315 QGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ +Q + G++F YD+S ++V ++ S HF+ + +I+ G+ +SG++
Sbjct: 198 IDKGMQGVAGIYFKYDMSALRVLVRQDRDSIAHFIVRLSSIIAGIVVISGML 249
>gi|195042004|ref|XP_001991346.1| GH12601 [Drosophila grimshawi]
gi|193901104|gb|EDV99970.1| GH12601 [Drosophila grimshawi]
Length = 434
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 99/352 (28%), Positives = 177/352 (50%), Gaps = 33/352 (9%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + T GG ++L+S ++++ L ++ELR Y +ET ++ D S
Sbjct: 19 KNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELRYYW---SETNIIYQFEPDMS 75
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
E ++++ D+T A+PC+ LS VD MD + + D+F + G++ +++G+
Sbjct: 76 LDEQVQMHVDITV-AMPCASLSGVDLMD-------ETQQDVF-----AYGSL---QREGV 119
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSD--EDCCNNCEEVREAYRKKGWALSNPD 181
D +RH ++ Y Y + ++ +D + +E+ A P
Sbjct: 120 WWQMADAD-RRHFQSMQMTNHYLREEYHSVANILFKDILRERTQPKESEAHSVPAQPAPG 178
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
+ Q ++ E + + C ++G L +NKVAG H G H ++ F+R
Sbjct: 179 PLQQLQQHPQF----EAKYDACRLHGTLGINKVAGVLHLVGGAQPVVGMFDDHWMIEFRR 234
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N +H+IN+L+FG++ +V PL+G T + QYFIKVVPT + T+ +
Sbjct: 235 MPANFTHRINRLSFGQYSRRIVQPLEGDETTITEEATTVQYFIKVVPTEIQQ-TFSTVST 293
Query: 302 NQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
Q++VTE+ R + R PG++F YD S +KV + + FL F+ +C
Sbjct: 294 FQYAVTENVRKLDSERNSYGSPGIYFKYDWSALKVVISHDRDYFLTFVIRLC 345
>gi|301626814|ref|XP_002942582.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like, partial [Xenopus (Silurana) tropicalis]
Length = 298
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 105/199 (52%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I GC GF +NKV GNFH V H +A Q + ++ H I+KL
Sbjct: 114 KIPINNAHGCRFEGFFSINKVPGNFH-----------VSTHSAMA-QPANPDMRHIIHKL 161
Query: 254 AFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 308
+FG E+ G N L G + Y +K+VPTVY D++G S Q++V
Sbjct: 162 SFGNTLQVENIHGAFNALGGADKLASQALESHDYVLKIVPTVYEDMNGEQQFSYQYTVAN 221
Query: 309 --HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
+ S GR+ +P ++F YDLSPI V +TE F+T VCAI+GG FTV+GI+D
Sbjct: 222 KAYVAYSHTGRV--VPAIWFRYDLSPITVKYTERRQPIYRFITTVCAIIGGTFTVAGILD 279
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+FI+ A KKI++GK
Sbjct: 280 SFIFTASEA-WKKIQLGKM 297
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 6/95 (6%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSRGE 67
D Y K+ +D T++G +I++ + + LF SEL ++ +L V D + G
Sbjct: 16 FDIYRKVPKDLTQPTYTGAIISICCCLFITFLFLSELTGFIANEIVNELYVDDPDKNSGG 75
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
+ + +V+ P L C ++ +D D G H+D
Sbjct: 76 KIEVTLNVSLPNLACEVVGLDIQDEMGRHEVGHID 110
>gi|358388143|gb|EHK25737.1| hypothetical protein TRIVIDRAFT_33251 [Trichoderma virens Gv29-8]
Length = 370
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/357 (24%), Positives = 157/357 (43%), Gaps = 46/357 (12%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ + DA+PK + ++T GG T+ ++ + ++E+ + V+ G
Sbjct: 21 VSAFDAFPKSKPQYVTKTSGGGKWTVAMLLISSIFLWTEIGRWWRGAEHHTFAVEKGIGH 80
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+++N D+ + C L ++ D SG++ L +L+ DG G +
Sbjct: 81 DMQVNLDIVV-KMDCDDLHINVQDASGDRILA------GDKLNRDATTWHQWVDGKGMHR 133
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
+ K G+L+ E + + D E V D++ +
Sbjct: 134 LGKS---ENGKLDTGEGWLAA--------HDEGFGEEHVH-------------DIVALSR 169
Query: 188 REGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 244
++ + +G + C +YG L++N+V G+FH A G + G H+ D F
Sbjct: 170 KKAKWAKTPSPKGRPDSCRMYGSLDLNRVQGDFHITARGHGY--GGQHL------DHDKF 221
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
N SH I+++++G +P +VNPLD + +QY++ VVPTVY + + +NQ+
Sbjct: 222 NFSHIISEMSYGPFYPSLVNPLDRTVNSAIVHFHKFQYYLSVVPTVYL-ANNRIVNTNQY 280
Query: 305 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
+VTE ++ +PG+FF YD+ PI ++ E F FL + I GV
Sbjct: 281 AVTEQSKTISD---HQVPGIFFKYDIEPIMLSVEESRDGFFTFLVKIVNIFSGVMVA 334
>gi|295663046|ref|XP_002792076.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279251|gb|EEH34817.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 392
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 103/390 (26%), Positives = 162/390 (41%), Gaps = 75/390 (19%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I + +R+ DA+PK + S T GG T+V ++ LL SELR + V V+
Sbjct: 19 GIGSGLRTFDAFPKTKPTYTSSTRRGGQWTVVVFVLCALLSISELRTWYKGVENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
L++N D+ A+ C L ++ D +G++ L D+ K S
Sbjct: 79 KGISRELQLNLDIVV-AMTCDALRINVQDAAGDRIL--ASDMLNKEPTSWAAWNRELNVA 135
Query: 123 I--GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRK---KGWAL 177
+ G + + H GRL E D + E R ++++ KG L
Sbjct: 136 LSGGGREYQTLTEEHAGRLMEQE-----------EDMHVGHALGEARRSHKRKFPKGPKL 184
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDI 236
++ D C+ IYG LE NKV G+FH A G + + G H+
Sbjct: 185 KRGEMPDSCR-----------------IYGSLEGNKVQGDFHITARGHGYFEYGEHLDH- 226
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS- 295
++L+FG H+ ++NPLD T YQY++ +VPT+YT
Sbjct: 227 --------------HELSFGPHYSTLLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRTGT 272
Query: 296 -------------------GHTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKV 335
+TI +NQ++VT RS E +Q +PG+FF Y + PI +
Sbjct: 273 IDPYSQVLPDPSTISPSQRKNTIFTNQYAVTS--RSHELPDVQFYVPGIFFKYSIEPILL 330
Query: 336 TFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+EE S L L + ++ GV G +
Sbjct: 331 IISEERGSLLALLVRLVNVMAGVVVAGGWL 360
>gi|395326723|gb|EJF59129.1| hypothetical protein DICSQDRAFT_156384 [Dichomitus squalens
LYAD-421 SS1]
Length = 559
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 100/366 (27%), Positives = 153/366 (41%), Gaps = 59/366 (16%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ + DA+PK+ E + + + S G +TL + V LL ++L ++ + + VD
Sbjct: 24 VPAPLAQFDAFPKLPETYKTHSESRGFLTLFVAFVAFLLILNDLGEFIWGWPDFEFGVDK 83
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQ-HLDVKHDIFKKRLDSQGNVIESRQDG 122
L IN D+ +PC LS+D D G++ +L D F R+DG
Sbjct: 84 MPSANLDINVDMVV-NMPCQYLSIDLRDAVGDRLYLS---DGF-------------RRDG 126
Query: 123 IGAPKID----KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
K D L+ H L + V ++ R +G+ +
Sbjct: 127 T---KFDIGQATSLKEHAAMLSARQA---------------------VSQSRRSRGFFDT 162
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-HDIL 237
L+ + K + +G C IYG + +V N H + S HV H +
Sbjct: 163 ---LLHRTKSSFKPTYNYQPDGSACRIYGTITAKRVTANLHVTTLGHGYASHEHVDHKFM 219
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 297
N+SH I + +FG +FP + PLD P YQYF+ VVPT Y
Sbjct: 220 -------NLSHVITEFSFGPYFPDITQPLDNSFEMAHDPFVAYQYFLHVVPTTYIAPRSK 272
Query: 298 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
+ +NQ+SVT + R + R PG+FF +DL PI +T + S FL +VGG
Sbjct: 273 PLHTNQYSVTHYTRVLDHHR--GTPGIFFKFDLEPIHMTIHQRTTSLAAFLLRCAGVVGG 330
Query: 358 VFTVSG 363
VF G
Sbjct: 331 VFVCMG 336
>gi|212527292|ref|XP_002143803.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
marneffei ATCC 18224]
gi|210073201|gb|EEA27288.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
marneffei ATCC 18224]
Length = 402
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 159/377 (42%), Gaps = 53/377 (14%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R+ DA+PK ++ + + GG T++ + L F E + V+
Sbjct: 24 LRTFDAFPKTKPNYTTASRRGGQWTVIIFAICTFLTFGEFVNWYRGTENQHFSVEKGVSR 83
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ-GNVIESRQDGIGAP 126
L++N D+ + C+ L V+ D SG+ H+ + K + + N ++Q G P
Sbjct: 84 QLQMNIDMVV-KMHCNDLRVNVQDASGD-HIMAGMLLMKDGTNWELWNEKLNQQSSSGVP 141
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ RL E D + R ++K P L +
Sbjct: 142 EYQTLNAEDVKRLMDQE-----------DDAHARHVLSHTRRNPKRK--FPKTPRLSSKY 188
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFN 245
+ C IYG LE NKV G+FH A G +++ G H+ +FN
Sbjct: 189 PTDS------------CRIYGSLESNKVHGDFHITARGHGYNEVGQHL------DHSNFN 230
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT------------- 292
+H + +L+FG H+P ++NPLD + ET +QYFI VVPT+Y
Sbjct: 231 FTHMVTELSFGPHYPSLLNPLDKTVASTETHYYKFQYFINVVPTIYAKGNNAVEKYTANP 290
Query: 293 ----DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
+ S +TI +NQ+S T + T PG+FF Y++ PI + +EE SFL L
Sbjct: 291 AKAFEKSRNTIFTNQYSATSQSHPLPESPFNT-PGIFFKYNIEPILLFVSEERGSFLALL 349
Query: 349 TNVCAIVGGVFTVSGII 365
+ +V GV G +
Sbjct: 350 VRLVNVVSGVIVTGGWL 366
>gi|164661257|ref|XP_001731751.1| hypothetical protein MGL_1019 [Malassezia globosa CBS 7966]
gi|159105652|gb|EDP44537.1| hypothetical protein MGL_1019 [Malassezia globosa CBS 7966]
Length = 454
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 169/365 (46%), Gaps = 63/365 (17%)
Query: 21 FYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDV----- 75
+ RT GG +TL I +++ + E++ YL +D+ G ++IN DV
Sbjct: 53 YQKRTSYGGFVTLAVFIATMVVIWYEIQHYLMLKPTYSFDIDSHVGGFMQINLDVVVATP 112
Query: 76 ---TFP---ALPC----SILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
T+P PC S +S+D D SG+ + DI K +D +++
Sbjct: 113 CGRTYPYDVRFPCILTLSGVSIDLRDASGDTLHFSEDDIVKDPVDFNKERQRAQK----- 167
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
+ L ++ ++ H++ + E D+ +R G+ S+P
Sbjct: 168 ----RSLTQYFLKMLHSQ--YRNMKKIERKDKKIVAGGPR----HRDSGFDFSDP----- 212
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFH---FAP---GKSFHQSGVHVHDILAF 239
EE C +YG + V KV GN H F P + H++G+ +
Sbjct: 213 --------MENAEEARACRVYGSILVKKVTGNLHISTFVPTFMAVNAHENGMGI------ 258
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
++SH I++ +FG++FP + PLD + P+ +QYF+ VVPT + I
Sbjct: 259 -----DMSHIIHEFSFGDYFPNIAEPLDASLELTDDPAAAFQYFLSVVPTHFIH-GRRVI 312
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
++NQ+SV + ++ + QG L T PG++F YD+ P+ + T + VS + F+ VC+++GG++
Sbjct: 313 KTNQYSVHD-YKRNPQGSL-TFPGLYFKYDIEPLTMKVTHKSVSLVAFIVRVCSVLGGLW 370
Query: 360 TVSGI 364
+ +
Sbjct: 371 ICTDL 375
>gi|449542382|gb|EMD33361.1| hypothetical protein CERSUDRAFT_117979 [Ceriporiopsis subvermispora
B]
Length = 530
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 144/370 (38%), Gaps = 53/370 (14%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLR 70
DA+PK+ + +R+ S G +TL + LL ++L Y+ VD+ L+
Sbjct: 27 FDAFPKLPTTYKARSESRGFLTLFVAFAAFLLVLNDLGEYIWGWPVYDFTVDSDPSSDLK 86
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID- 129
IN D+ +PC+ LSVD D G++ L + + R+DG K D
Sbjct: 87 INVDMMV-NMPCAYLSVDLRDAMGDR-LYLSNAF--------------RRDGT---KFDI 127
Query: 130 ---KPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
LQ H L + +V RK SN L +
Sbjct: 128 GQATTLQEHAAAL----------------------SARQVIAQSRKSRGFFSN--LFRRT 163
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
+ +G C ++G + KV N H H H H N+
Sbjct: 164 NGGYKATYNHQPDGSACRVFGSITAKKVTANLHIT--TLGHGYATHSH----VDHSKMNL 217
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I + +FG HFP + PLD P YQYF+ VVPT Y + ++Q+SV
Sbjct: 218 SHVITEFSFGPHFPDITQPLDNSFEVAHDPFVAYQYFLHVVPTTYIAPRSSPLHTHQYSV 277
Query: 307 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
T + R + + PG+FF +DL P+ + + S + ++GGVF G
Sbjct: 278 THYTRILDPSHHRHTPGIFFKFDLDPLAIKIEQRTTSLVQLAIRCVGVIGGVFVCMGYAV 337
Query: 367 AFIYHGQRAI 376
H A+
Sbjct: 338 KITTHAVDAV 347
>gi|358333955|dbj|GAA52416.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Clonorchis sinensis]
Length = 306
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 66/168 (39%), Positives = 90/168 (53%), Gaps = 6/168 (3%)
Query: 201 EGCNIYGFLEVNKVAGNFHFAPGKSFH-QSGVHVHDILAFQRDSFNISHKINKLAFGEHF 259
+ CNI G V KVAGN H PG+ F G HVH + FN SH+IN L+FG
Sbjct: 86 DACNIVGTFHVQKVAGNMHVLPGRPFDGPGGSHVHIAPFVRLADFNFSHRINHLSFGAQV 145
Query: 260 PGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRSSEQGR 317
VNPLD V P ++Y+I +VPT VY S + + Q+++T R++E +
Sbjct: 146 ANRVNPLDAVEEISYNPMETFRYYISIVPTRVVYAFSS---LDTYQYAITVKNRTAEGNK 202
Query: 318 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
++PG+FF YD P+ V TE F FL + A+VGG+F G I
Sbjct: 203 SDSIPGIFFSYDTFPLLVQVTESRELFGTFLARLAALVGGLFATVGFI 250
>gi|321465392|gb|EFX76393.1| hypothetical protein DAPPUDRAFT_306117 [Daphnia pulex]
Length = 289
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 78/218 (35%), Positives = 115/218 (52%), Gaps = 27/218 (12%)
Query: 181 DLIDQCKRE--GFLQ---RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
D+ D R GF++ + +G+GC +N+V GNFH + H D
Sbjct: 87 DIQDDLGRHDVGFIENTLKTPWNKGKGCIFESRFHINRVPGNFHVS---------THSAD 137
Query: 236 ILAFQRDSFNISHKINKLAFGE-----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTV 290
Q DS +++H I L FGE + PG NPL +Q P+ + Y +K+VPT+
Sbjct: 138 K---QPDSADMAHYITSLTFGEMLDNKNLPGNFNPLARRDRSQADPAESHDYTMKIVPTI 194
Query: 291 YTDVSGHTIQSNQFSV--TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
Y D +G T+ S Q++ + + S GR + ++F YDL+PI V + E FL
Sbjct: 195 YEDSAGTTLVSYQYTYAYSNYVSFSLGGR--SPAAIWFRYDLNPITVKYHERRQPIYAFL 252
Query: 349 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
T+VCAI+GG FTV+GIID+F++ I KK E+GK S
Sbjct: 253 TSVCAIIGGTFTVAGIIDSFVFTASE-IFKKFELGKLS 289
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 55/115 (47%), Gaps = 3/115 (2%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT--SR 65
+R LD Y K+ +D T +G VI++ M LFFSE +++ ++L VD +
Sbjct: 5 LRRLDIYRKVPKDLTQPTVTGAVISICCCAFMTFLFFSEFFHFISPEVVSELFVDNPGNT 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDS-QGNVIESR 119
E + + ++T P L C + +D D G + + K + +G + ESR
Sbjct: 65 DEKIPVQINITLPRLACEYVGIDIQDDLGRHDVGFIENTLKTPWNKGKGCIFESR 119
>gi|354545468|emb|CCE42196.1| hypothetical protein CPAR2_807450 [Candida parapsilosis]
Length = 351
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 157/371 (42%), Gaps = 63/371 (16%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MD+ ++++ DA+PK++ R+ GG+ TL++ + LL+ + E+ Y+ + + L
Sbjct: 1 MDSFSKRVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFLGLLILWVEVGGYIGGYVDRQFL 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK--KRLDSQGNVIES 118
VD L IN D+ A+PC L + DI+ ++ L + F+ K I +
Sbjct: 61 VDDVLRSDLTINLDMIV-AMPCEYLHTNVEDITRDRFLAGETLNFEGVKFFIPPNFSINN 119
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
D P +D+ +Q E +R + + G
Sbjct: 120 PNDFHETPDLDEVMQ------------------------------ESLRAEFSQLG---- 145
Query: 179 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 238
R E C+I+G + VN+V G+F G D
Sbjct: 146 ---------------RRVNEGAPACHIFGSIPVNQVKGDFRIT------AKGFGYRDRSF 184
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
++ N SH I + ++G+ +P + NPLD E Y Y KVVPT+Y + G
Sbjct: 185 VPLEALNFSHVIQEFSYGDFYPFLNNPLDATGKVTEENLQTYLYHAKVVPTLYEKL-GLE 243
Query: 299 IQSNQFSVTEHFR----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+ + Q+S+TE+ R Q + G++F Y+ PIK+ E+ + FL F+ + I
Sbjct: 244 VDTTQYSLTENHHVVKVDPHSKRPQEISGIYFAYEFEPIKLIIREKRIPFLQFIAKLGTI 303
Query: 355 VGGVFTVSGII 365
GGV +G +
Sbjct: 304 AGGVVVAAGYL 314
>gi|45190741|ref|NP_984995.1| AER136Wp [Ashbya gossypii ATCC 10895]
gi|44983720|gb|AAS52819.1| AER136Wp [Ashbya gossypii ATCC 10895]
gi|374108218|gb|AEY97125.1| FAER136Wp [Ashbya gossypii FDAG1]
Length = 340
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 161/362 (44%), Gaps = 59/362 (16%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +R+ DA+PK ++ R+ GG+++++ + +L + + E Y + + ++D
Sbjct: 1 MASLRTFDAFPKTDQQHVRRSSRGGIMSIMMYLFLLFIAWGEFGSYFGGYLDEQYIIDPE 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK--RLDSQGNVIESRQDG 122
+T +IN DV +PC L V A DI+ + + K +FK G +S +
Sbjct: 61 LRQTTQINMDVMV-QMPCKYLDVKATDITRDINDVSKRLVFKNIPFFVPYGTTFDSVNE- 118
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
+ P ID L + + +R+ + + DL
Sbjct: 119 VRTPDIDGML------------------------------ADAIPLKFREN---IPDADL 145
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+ E E GC+IYG + VN+V G H P + S V D
Sbjct: 146 PE------------EFEFNGCHIYGSIPVNRVKGELHITPKGWRYSSRQRV------PHD 187
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N++H N+ +FGE FP + N LD V R+ Q+ + + YF+ V+PT+Y + G + +
Sbjct: 188 EINLTHIFNEFSFGEFFPYIDNTLDQVGRYAQQRLT-RFHYFVSVLPTIYRKM-GAVVDT 245
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQ+SV+ + + RL T PG+F Y+ + V ++ +SF FL + ++ + +
Sbjct: 246 NQYSVSHNDITYTSSRLYT-PGIFILYNFEALTVVVQDKRISFWAFLIRLVTMLSFIVYI 304
Query: 362 SG 363
+
Sbjct: 305 AA 306
>gi|225685292|gb|EEH23576.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 386
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 158/377 (41%), Gaps = 66/377 (17%)
Query: 16 KINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDV 75
K + S T GG T+V ++ LL SELR + V V+ L++N D+
Sbjct: 17 KTKPTYTSSTRRGGQWTVVVFVLCALLSISELRTWYKGVENHHFSVEKGISRELQLNLDI 76
Query: 76 TFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI--GAPKIDKPLQ 133
A+ C L ++ D +G++ L D+ K S + G + +
Sbjct: 77 VV-AMTCDALRINVQDAAGDRIL--ASDMLNKEPTSWAAWNRELNVALSGGGREYQTLAE 133
Query: 134 RHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQ 193
GRL E D + E R ++++K F +
Sbjct: 134 EDAGRLMEQE-----------EDMHVGHALGEARRSHKRK-----------------FPK 165
Query: 194 RIKEEEGE---GCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHK 249
K + GE C IYG LE NKV G+FH A G + + G H+ +FN SH
Sbjct: 166 GPKLKRGEMPDSCRIYGSLEGNKVQGDFHITARGHGYFEFGEHL------DHHAFNFSHM 219
Query: 250 INKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-------------- 295
I +L+FG H+ ++NPLD T YQY++ +VPT+YT
Sbjct: 220 ITELSFGPHYSTLLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRAGTIDPYSQVLPDPST 279
Query: 296 ------GHTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
+TI +NQ++VT RS E +Q +PG+FF Y++ PI + +EE S L L
Sbjct: 280 ISPSQRKNTIFTNQYAVTS--RSHELPDVQFHVPGIFFKYNIEPILLIISEERGSLLALL 337
Query: 349 TNVCAIVGGVFTVSGII 365
+ ++ GV G +
Sbjct: 338 VRLVNVMSGVVVAGGWL 354
>gi|426200953|gb|EKV50876.1| hypothetical protein AGABI2DRAFT_113626 [Agaricus bisporus var.
bisporus H97]
Length = 542
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 153/368 (41%), Gaps = 56/368 (15%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
+DA + DA+PK+ F +R+ S G +T+ +V LLL +++ Y+ E K
Sbjct: 13 LDAAAAPLAKFDAFPKVPSAFKARSESRGFMTIFVMLVALLLMLNDIGEYIWGWPEFKFA 72
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD + +N D+ + C LSVD D+ G++ L
Sbjct: 73 VDQDNAPYMFVNLDMVV-NMQCRYLSVDLRDVVGDRLL---------------------- 109
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+ LQR G + E A + + + ++ + +G+
Sbjct: 110 -------LSGGLQRDGVKFNIGEAT------ALKEHSKGLSARQALSQSRKSRGF----- 151
Query: 181 DLIDQCKREGFLQRIKE-----EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
D R + K +G C IYG + V +V N H + S HV
Sbjct: 152 --FDSLLRRNSEPKFKPTYNHVPDGGACRIYGTMPVKRVTANLHITTVGHGYSSYQHV-- 207
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ N+SH I + +FG +FP +V PLD + YQYF+ VVPT Y
Sbjct: 208 ----DHNQMNLSHVITEFSFGPYFPEIVQPLDESFEVTQDHFTAYQYFLHVVPTTYIAPR 263
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+++NQ+SVT + R E + PG+FF +DL P+ +T ++ + + L ++
Sbjct: 264 TSPLRTNQYSVTHYTRQVEHNK--GTPGIFFKFDLDPLNITIHQKTTTLIQLLIRCVGVI 321
Query: 356 GGVFTVSG 363
GGVF G
Sbjct: 322 GGVFVCMG 329
>gi|409083992|gb|EKM84349.1| hypothetical protein AGABI1DRAFT_32491 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 542
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 153/368 (41%), Gaps = 56/368 (15%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
+DA + DA+PK+ F +R+ S G +T+ +V LLL +++ Y+ E K
Sbjct: 13 LDAAAAPLAKFDAFPKVPSAFKARSESRGFMTIFVMLVALLLMLNDIGEYIWGWPEFKFA 72
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
VD + +N D+ + C LSVD D+ G++ L
Sbjct: 73 VDQDNAPYMFVNLDMVV-NMQCRYLSVDLRDVVGDRLL---------------------- 109
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
+ LQR G + E A + + + ++ + +G+
Sbjct: 110 -------LSGGLQRDGVKFNIGEAT------ALKEHSKGLSARQALSQSRKSRGF----- 151
Query: 181 DLIDQCKREGFLQRIKE-----EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
D R + K +G C IYG + V +V N H + S HV
Sbjct: 152 --FDSLLRRNSEPKFKPTYNHVPDGGACRIYGTMPVKRVTANLHITTVGHGYSSYQHV-- 207
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ N+SH I + +FG +FP +V PLD + YQYF+ VVPT Y
Sbjct: 208 ----DHNQMNLSHVITEFSFGPYFPEIVQPLDESFEVTQDHFTAYQYFLHVVPTTYIAPR 263
Query: 296 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+++NQ+SVT + R E + PG+FF +DL P+ +T ++ + + L ++
Sbjct: 264 TSPLRTNQYSVTHYTRQVEHNK--GTPGIFFKFDLDPLNITIHQKTTTLIQLLIRCVGVI 321
Query: 356 GGVFTVSG 363
GGVF G
Sbjct: 322 GGVFVCMG 329
>gi|406868300|gb|EKD21337.1| copii-coated vesicle protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 382
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/350 (28%), Positives = 150/350 (42%), Gaps = 42/350 (12%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
N +++ DA+PK + +RT GG T+ IV +L +SE + V+ +
Sbjct: 20 NIVQAFDAFPKAKPQYVTRTSGGGKWTVAMLIVSFMLIYSEFSRWWRGHETHTFTVEKAV 79
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
L+IN D+ P + C + ++ D +G++ L +F + + R G+
Sbjct: 80 ERGLQINLDIVVP-MKCEDIHINVQDAAGDRIL--AGVMFTRNPTQWAQWVHER--GVHR 134
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
D G++ E Y D D E V + G
Sbjct: 135 LGTDA-----NGKIITGEEYL---------DHDEGFGEEHVHDIVAAAGKLKKAKFAKTP 180
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
R K E + C I+G LEVNKV G H +Q H +FN
Sbjct: 181 RSR-------KSAEMDSCRIFGNLEVNKVQGELHITARGHGYQELAAGH----LDHHAFN 229
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVY-----TDVSGHT 298
SH +++L+FG +P + NPLD R TP+ +QYF+ VVPTVY T S T
Sbjct: 230 FSHVVSELSFGPFYPSLHNPLD--RTVSTTPNNFHKFQYFLSVVPTVYSVDSSTTYSSQT 287
Query: 299 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
+ +NQ++VTE + ++PG+FF YD P+ +T E SFL FL
Sbjct: 288 LFTNQYAVTEQSHVVSE---FSVPGIFFKYDFEPMLLTVQESRDSFLRFL 334
>gi|324516732|gb|ADY46617.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Ascaris suum]
Length = 286
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 76/217 (35%), Positives = 114/217 (52%), Gaps = 26/217 (11%)
Query: 181 DLIDQCKRE--GFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
D+ D+ R GF+ + + E GC E+NKV GNFH S H +
Sbjct: 85 DIQDENGRHEVGFITDVTKVPTEENGCRFEANFEINKVPGNFHL----STHSA------- 133
Query: 237 LAFQRDSFNISHKINKLAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 291
A Q +S+++ H +N + FG+ G NPL Q P ++Y +KVVP+VY
Sbjct: 134 -ASQPESYDMRHIVNSVKFGDDLQEKAQIGSFNPLQDRTALQGDPLNTHEYILKVVPSVY 192
Query: 292 TDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
D++G T S Q++ E+ GR+ +P V+F Y+L PI V +TE F+T
Sbjct: 193 EDIAGRTKYSYQYTYAHKEYIAYHHSGRI--IPAVWFKYELQPITVKYTERRQPLYAFIT 250
Query: 350 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
+VCA+VGG FTV+GIID+ ++ + KK ++GK S
Sbjct: 251 SVCAVVGGTFTVAGIIDSSLF-SLSELYKKHQLGKLS 286
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 59/115 (51%), Gaps = 1/115 (0%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV-DT 63
M IR LD Y K+ +D T +G VI+++ + + F++LR++L+ ++L V D
Sbjct: 1 MFDIRRLDIYRKVPKDLTQPTRTGAVISIICVCFIAFMLFNDLRMFLSVDLHSELFVDDP 60
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES 118
R ++++ + T P LPC L VD D +G + D+ K + G E+
Sbjct: 61 GREGRIKVHLNATLPYLPCEYLGVDIQDENGRHEVGFITDVTKVPTEENGCRFEA 115
>gi|115623567|ref|XP_794044.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Strongylocentrotus purpuratus]
Length = 289
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 110/201 (54%), Gaps = 21/201 (10%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
++I G+GC Y +NKV GNFH V H + Q S + +H I++
Sbjct: 103 KKIPLNNGQGCLFYSAFTINKVPGNFH-----------VSTHAVGMNQPQSTDFAHIIHE 151
Query: 253 LAFGEHFP-----GVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSV 306
++FG+ NPL+G R +++ S + + Y++K+VPTVY D+ G S Q++
Sbjct: 152 VSFGDDIQNKTLGASFNPLEG-RDKRDSKSDLSHDYYMKIVPTVYEDLWGTKNVSYQYTY 210
Query: 307 T-EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ + S GR + LP ++F YD+SPI V + E+ F F+T VCAIVGG FTV+GI
Sbjct: 211 AYKDYGSQGHGR-RVLPAIWFRYDISPITVKYHEKRAPFYTFITTVCAIVGGTFTVAGIF 269
Query: 366 DAFIYHGQRAIKKKIEIGKFS 386
D+ I+ KK E+GK S
Sbjct: 270 DSIIFTAAEVFKKA-ELGKLS 289
>gi|159464951|ref|XP_001690702.1| hypothetical protein CHLREDRAFT_180779 [Chlamydomonas reinhardtii]
gi|158270379|gb|EDO96229.1| predicted protein [Chlamydomonas reinhardtii]
Length = 656
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/190 (37%), Positives = 100/190 (52%), Gaps = 27/190 (14%)
Query: 206 YGFLEVNKVAGNFHFAPGKSFHQSGV-----------HVHDILAFQRDSFNISHKINKLA 254
Y +V +VAG H S HQ+ V H+ IL N+SH I L
Sbjct: 84 YHTPQVKRVAGRLHL----SVHQNMVFQMLPQLLGTHHIPKIL-------NMSHVIKHLG 132
Query: 255 FGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 313
FG H+PG +NPLDG VR P Y+YF+KVVPT Y + G +++Q+SVTE+ +
Sbjct: 133 FGPHYPGQLNPLDGYVRMVGREPFS-YKYFLKVVPTEYYNRLGRATETHQYSVTEYAQPL 191
Query: 314 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 373
++G P V YDLSPI +T E S LHF+ +CA+VGGVF ++ + D ++
Sbjct: 192 QRG---YAPAVDVHYDLSPIVMTINERPPSLLHFVVRLCAVVGGVFAITRLTDRWVDWLV 248
Query: 374 RAIKKKIEIG 383
R + K G
Sbjct: 249 RLVNKAAARG 258
>gi|242783317|ref|XP_002480163.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
stipitatus ATCC 10500]
gi|218720310|gb|EED19729.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
stipitatus ATCC 10500]
Length = 400
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 160/384 (41%), Gaps = 68/384 (17%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+++ DA+PK ++ + + GG T++ + L EL + V+
Sbjct: 24 LKTFDAFPKTKPNYTTPSRRGGQWTVIIIAICTFLSIGELITWYRGTENQHFSVEKGVSR 83
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHL--------DVKHDIFKKRLDSQGNVIESR 119
L++N D+ +PC+ + V+ D SG+ + +++ ++L+ Q + +
Sbjct: 84 QLQMNIDMVV-KMPCNDIRVNVQDASGDHIMAGMLLMKDSTNWEMWNEKLNQQSSGVTEY 142
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
Q + A + L+ + D + R R+K
Sbjct: 143 QT-LNAEDTKRLLE-------------------QEEDMHAHHVLSHTRRNPRRK--FPKT 180
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILA 238
P L + + C IYG LE NKV G+FH A G +++ G H+
Sbjct: 181 PRLSAKYPTDS------------CRIYGSLESNKVHGDFHITARGHGYNELGEHL----- 223
Query: 239 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT------ 292
+FN +H I +L+FG H+P ++NPLD E +QYF+ VVPT+Y
Sbjct: 224 -DHKTFNFTHMITELSFGPHYPSLLNPLDKTVAYTEDHYYKFQYFLNVVPTIYAKGNNAV 282
Query: 293 -----------DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEH 341
S +TI +NQ+S T + + T PG+FF Y++ PI + +EE
Sbjct: 283 EKYTANPALAFKKSRNTIFTNQYSATSQSHALPENPYNT-PGIFFKYNIEPILLFVSEER 341
Query: 342 VSFLHFLTNVCAIVGGVFTVSGII 365
SFL L + +V GV G +
Sbjct: 342 GSFLALLVRLVNVVSGVIVTGGWL 365
>gi|50293697|ref|XP_449260.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528573|emb|CAG62234.1| unnamed protein product [Candida glabrata]
Length = 352
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 174/376 (46%), Gaps = 71/376 (18%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +R+ DA+PK +E + ++ GGV ++++ I +L + ++E + + + VD
Sbjct: 1 MAGLRTFDAFPKTDETYKKKSTKGGVTSILTYIFLLFIAWTEFGKFFGGYIDQQYTVDKV 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGE-----QHLDVKHDIFKKRLDSQGNVIESR 119
ET +IN D+ + + C + ++ D + + Q L ++ F DS+ N + S
Sbjct: 61 VRETAQINMDL-YVNIKCENIHINVRDQTQDRKLVIQDLKLEDMPFFIPYDSKVNGVNS- 118
Query: 120 QDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
I P ID+ L G AE ++ + R+ Y +
Sbjct: 119 ---IVTPDIDEIL--------------GEAIPAEFREK------LDTRQFYDE------- 148
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFA------PGKSFHQSGVHV 233
+ + E +L + GC+I+G + VN+V G PGK
Sbjct: 149 ----NDPESEKYLPKFN-----GCHIFGSVPVNRVKGELQITASGYGYPGKRA------- 192
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYT 292
++ + +H IN+L+FG+ +P + NPLD R+ +E P Y Y+I VPT+Y
Sbjct: 193 ------PKEEIDFAHAINELSFGDFYPYIDNPLDKTARFDKEHPLSAYMYYISAVPTMYK 246
Query: 293 DVSGHTIQSNQFSVTEHFRS---SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
+ G I++ Q+SV ++ S ++ ++ +PG+FF Y P+ + T+ +SFL F+
Sbjct: 247 KL-GVEIETFQYSVNDYKYSMTDADPATVRKIPGIFFRYGFEPLSIEITDVRISFLQFIV 305
Query: 350 NVCAIVG-GVFTVSGI 364
+ AI+ +F VS I
Sbjct: 306 RLVAILSFFMFVVSWI 321
>gi|148223633|ref|NP_001084786.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Xenopus laevis]
gi|78099249|sp|Q6NS19.1|ERGI1_XENLA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|47125098|gb|AAH70532.1| MGC78834 protein [Xenopus laevis]
Length = 290
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 73/200 (36%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
+I GC G +NKV GNFH V H +A Q + ++ H I+K
Sbjct: 105 MKIPINNAYGCRFEGLFSINKVPGNFH-----------VSTHSAIA-QPANPDMRHIIHK 152
Query: 253 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
L+FG ++ G N L G + Y +K+VPTVY D++G S Q++V
Sbjct: 153 LSFGNTLQVDNIHGAFNALGGADKLASKALESHDYVLKIVPTVYEDLNGKQQFSYQYTVA 212
Query: 308 E--HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ S GR+ +P ++F YDLSPI V +TE F+T VCAI+GG FTV+GI+
Sbjct: 213 NKAYVAYSHTGRV--VPAIWFRYDLSPITVKYTERRQPMYRFITTVCAIIGGTFTVAGIL 270
Query: 366 DAFIYHGQRAIKKKIEIGKF 385
D+FI+ A KKI++GK
Sbjct: 271 DSFIFTASEA-WKKIQLGKM 289
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 47/97 (48%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + + LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISICCCLFITFLFLSELTGFIANEIVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + + +VT P LPC ++ +D D G H+D
Sbjct: 66 GGKIDVTLNVTLPNLPCEVVGLDIQDEMGRHEVGHID 102
>gi|296415728|ref|XP_002837538.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633410|emb|CAZ81729.1| unnamed protein product [Tuber melanosporum]
Length = 341
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 94/165 (56%), Gaps = 11/165 (6%)
Query: 203 CNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
C IYG + VN++ G+FH A G + + G H+ SFN SH I +L+FG+++P
Sbjct: 155 CRIYGSMGVNRILGDFHITAKGHGYWEDGAHI------DHRSFNFSHVITELSFGDYYPK 208
Query: 262 VVNPLDGVRWTQETPSGMYQYFIKVVPTVY-TDVSGHTIQSNQFSVTEHFRSSEQGRLQT 320
+VNPLDGV + +QYF+ +VPT Y + SG ++ +NQ++VTE R +
Sbjct: 209 LVNPLDGVVSKTDENFHKFQYFLSIVPTTYESQTSGKSLLTNQYAVTEQSRKISS---HS 265
Query: 321 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+PG++F YD+ PI + ++ + L F+ + IV G+ G +
Sbjct: 266 VPGIYFKYDIEPISLKISDRRTALLAFVVRLVNIVSGILVGGGWV 310
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 48/89 (53%), Gaps = 1/89 (1%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
++ +R+ DA+PK + +RT GG ITL+ + L +ELR YL +V+
Sbjct: 12 SLGESVRTFDAFPKTRATYTTRTPRGGAITLLLLLTSACLTLTELRNYLTGSESHTFMVE 71
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMD 91
G ++IN D+T A+PCS L ++ D
Sbjct: 72 PGIGHDMQINLDITV-AMPCSSLHLNVQD 99
>gi|226294628|gb|EEH50048.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides brasiliensis Pb18]
Length = 392
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 102/390 (26%), Positives = 162/390 (41%), Gaps = 75/390 (19%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I + +R+ DA+PK + S T GG T+V ++ LL SELR + V V+
Sbjct: 19 GIGSGLRTFDAFPKTKPTYTSSTRRGGQWTVVVFVLCALLSISELRTWYKGVENHHFSVE 78
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
L++N D+ A+ C L ++ D +G++ L D+ K S
Sbjct: 79 KGISRELQLNLDIVV-AMTCDALRINVQDAAGDRIL--ASDMLNKEPTSWAAWNRELNVA 135
Query: 123 I--GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRK---KGWAL 177
+ G + + GRL E D + E R ++++ KG L
Sbjct: 136 LSGGGREYQTLAEEDAGRLMEQE-----------EDMHVGHALGEARRSHKRKFPKGPKL 184
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDI 236
++ D C+ IYG LE NKV G+FH A G + + G H+
Sbjct: 185 KRGEMPDSCR-----------------IYGSLEGNKVQGDFHITARGHGYFEFGEHLDH- 226
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS- 295
++L+FG H+ ++NPLD T YQY++ +VPT+YT
Sbjct: 227 --------------HELSFGPHYSTLLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRAGT 272
Query: 296 -------------------GHTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKV 335
+TI +NQ++VT RS E +Q +PG+FF Y++ PI +
Sbjct: 273 VDPYSQVLPDPSTISPSQRKNTIFTNQYAVTS--RSHELPDVQFHVPGIFFKYNIEPILL 330
Query: 336 TFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+EE S L L + ++ GV G +
Sbjct: 331 IISEERGSLLALLVRLVNVMAGVVVAGGWL 360
>gi|363738942|ref|XP_414530.3| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 1 [Gallus gallus]
Length = 291
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 106/199 (53%), Gaps = 21/199 (10%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G+GC G +NKV+ H V H A Q + +++H I+KL
Sbjct: 106 KIPLNNGDGCRFEGHFSINKVSP-------WXLH---VSTHSATA-QPQNPDMTHIIHKL 154
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L+G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 155 SFGDKLQVQNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVAN 214
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T++CAI+GG FTV+GI+D
Sbjct: 215 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILD 272
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 273 SCIFTASEAW-KKIQLGKM 290
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 48/97 (49%), Gaps = 6/97 (6%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSR 65
R D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 6 RRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKDS 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + +N +++ P L C ++ +D D G H+D
Sbjct: 66 GGKIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHID 102
>gi|396485364|ref|XP_003842153.1| hypothetical protein LEMA_P079130.1 [Leptosphaeria maculans JN3]
gi|312218729|emb|CBX98674.1| hypothetical protein LEMA_P079130.1 [Leptosphaeria maculans JN3]
Length = 486
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 160/383 (41%), Gaps = 79/383 (20%)
Query: 8 IRSLDAYPKINEDFY--SRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+ S DA+PK + + R SG TL+ + L L +SE+ ++ T V+
Sbjct: 113 VSSFDAFPKTKKTYLVQGRNSSGWTATLI--LTCLYLSWSEISRWMAGTTTQTFSVEKGV 170
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
+++N D+ + C+ L V+ D +G++ L D+ +K S
Sbjct: 171 SHDMQLNLDIIV-HMRCADLHVNMQDAAGDRTL--AGDLLRKDPTSWS------------ 215
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
Q G LE G ED EE + + + G +
Sbjct: 216 -------QWTGKNLEWGTHELGK------GKEDRAPGWEEEFDVHEQLG----------K 252
Query: 186 CKREGFLQ--RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRD 242
K+ F + R++ E + C I+G +E NKV G+FH A G + + GVH+
Sbjct: 253 AKKRKFSKTPRVRGET-DSCRIFGSIEGNKVQGDFHITARGHGYIEYGVHL------DHK 305
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTD------ 293
+FN SH I +L+FG ++P + NPLD TP +QYF+ +VPT+YTD
Sbjct: 306 TFNFSHIIRELSFGPYYPSLTNPLDNTIAITPTPDDHFYKFQYFLSIVPTIYTDDPSLIP 365
Query: 294 ---------------VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFT 338
S H +++NQ++VT + +PGVF +D+ PI +
Sbjct: 366 YLDILNRYGKNPDLFNSAHAVKTNQYAVTSQSHPVSE---YYVPGVFVKFDIEPIMLNVV 422
Query: 339 EEHVSFLHFLTNVCAIVGGVFTV 361
EE F L + ++ GV
Sbjct: 423 EEWGGFWRLLVRLVNVISGVMVA 445
>gi|403337257|gb|EJY67839.1| hypothetical protein OXYTRI_11647 [Oxytricha trifallax]
Length = 279
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 98/346 (28%), Positives = 154/346 (44%), Gaps = 85/346 (24%)
Query: 59 LLVDTSR-GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE 117
+ VD S + L IN D+ FP +PC +L++D MDI G +D+ ++KK L G +
Sbjct: 1 MFVDASHHDDRLNINIDIVFPKMPCEVLTLDIMDIMGTHIVDIGGSLYKKGLSQNGEFV- 59
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
+ET S G + +D ++ E +K+G
Sbjct: 60 ------------------------SET---SMLGGIQTRQDLLKRIKD--EMDQKQG--- 87
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
C+ +GF +N+V GNFH + S Q + V+ L
Sbjct: 88 --------CQLKGF-----------------FNINRVPGNFHIS---SHSQKDLIVN--L 117
Query: 238 AFQRDSFNISHKINKLAFG--EHFP---------GVVNPLDGVRWTQE-----TPSGM-Y 280
Q +F+ +HKIN ++FG E F GV+NPLDG+ ++ P +
Sbjct: 118 EMQGYTFDFTHKINHVSFGRQEDFKVIQKNFKQQGVLNPLDGLEFSANQDNKGKPQALAT 177
Query: 281 QYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEE 340
+F+ V + Y D + +T Q + T +S+ L F Y+LSPIKV F +E
Sbjct: 178 NFFMVAVSSYYMDTNRNTYNMYQLTSTHKSQSNANVNENML---VFSYELSPIKVLFNQE 234
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
+ + F+ +CAI+GGVFT+S ++D I H ++ K IGK S
Sbjct: 235 KENIVDFMIQLCAIIGGVFTISSVVDTII-HRSVSLLFKQRIGKLS 279
>gi|50303625|ref|XP_451754.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640886|emb|CAH02147.1| KLLA0B04950p [Kluyveromyces lactis]
Length = 341
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 166/372 (44%), Gaps = 76/372 (20%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +R+ DA+PK +E + GG+ ++++ + +L + +SE + + + +VD
Sbjct: 1 MAGLRTFDAFPKTDEQHVKTSSKGGLSSILTYLFLLFIAWSEFGSFFGGYIDQQYVVDDQ 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI----------FKKRLDSQGN 114
ET+ IN D+ + + C + ++A DI+G++ L + +I R++ N
Sbjct: 61 IKETVTINLDL-YVNMACKNIRINARDITGDRGL-ISENIQMEGMPFYIPVGTRVNEMNN 118
Query: 115 VIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 174
++ +P +D+ L G A+ REA
Sbjct: 119 IV--------SPDLDEIL--------------GEAIPAQ------------FREA----- 139
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHV 233
ID + G ++ GC+I+G + VNKV G H A G + +
Sbjct: 140 --------IDTSELTG------RDDFNGCHIFGSVPVNKVKGELHITAHGWGYRSAS--- 182
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
A +D N +H IN+L+FG+ +P + NPLD + Y YF +VPT+Y
Sbjct: 183 ----AIPKDQINFNHVINELSFGDFYPYIDNPLDNTAKFSDEKIKAYYYFTSIVPTLYKK 238
Query: 294 VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
+ G + +NQ++++E E + +PG+F Y P+K+ ++ + F F+ + A
Sbjct: 239 M-GAEVDTNQYALSET-EYGESSKATGVPGIFIRYQFEPMKIIISDMRIGFFQFIIRLVA 296
Query: 354 IVGG-VFTVSGI 364
I+ V+T S I
Sbjct: 297 ILSFIVYTASWI 308
>gi|325187435|emb|CCA21973.1| endoplasmic reticulumGolgi intermediate compartment protein
putative [Albugo laibachii Nc14]
Length = 283
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 94/181 (51%), Gaps = 8/181 (4%)
Query: 196 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 255
++ EGC G L + K+ G+ F G S + + +++ R FN SH I KL F
Sbjct: 110 EDPHNEGCRYKGTLTIQKLQGDIFFCHGGS-----LSIFNLMEMFR--FNSSHVITKLNF 162
Query: 256 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 315
G P + PL V T Y+YF KVVP+ Y + G + + Q+SVTEH +
Sbjct: 163 GLSIPKMQTPLTDVHKTVLAQVATYKYFAKVVPSRYVYLDGKSTMTYQYSVTEHLLKMD- 221
Query: 316 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 375
G + +PGV YD SPI V + E + HF+TN CAI+GGV V+ I DA +Y +
Sbjct: 222 GFVTNIPGVIISYDFSPIAVDYIETKPNIFHFITNTCAILGGVIAVARIFDAALYSMSKK 281
Query: 376 I 376
+
Sbjct: 282 L 282
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 52/101 (51%), Gaps = 1/101 (0%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M R DAY K E RT GG+ITL+S + + LF SE+ ++ ++ VDT+
Sbjct: 1 MRGWRRFDAYAKAVEGIQERTIGGGIITLLSCVFVCFLFISEISVWWTVNVVHRMHVDTA 60
Query: 65 RGET-LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI 104
E+ + ++ D++ C + VD D G+ + + +++
Sbjct: 61 PQESPITLDVDISMLHETCRDIKVDVSDSQGDGSILIANNL 101
>gi|260800124|ref|XP_002594986.1| hypothetical protein BRAFLDRAFT_128968 [Branchiostoma floridae]
gi|229280225|gb|EEN50997.1| hypothetical protein BRAFLDRAFT_128968 [Branchiostoma floridae]
Length = 292
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 76/220 (34%), Positives = 114/220 (51%), Gaps = 28/220 (12%)
Query: 181 DLIDQCKRE--GFLQ---RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
D+ D+ R GF++ ++ G GC G +NKV GNFH S H + V
Sbjct: 87 DIQDEMGRHEVGFVEDTEKVPVNNGLGCRFEGRFWINKVPGNFHM----STHSAHV---- 138
Query: 236 ILAFQRDSFNISHKINKLAFGE--------HFPGVVNPLDGVRWTQETPSGMYQYFIKVV 287
Q S +++H ++ L FGE H G NPLD V + YF+K+V
Sbjct: 139 ----QPASPDMTHVVHDLRFGEDLAAFLPDHIKGSFNPLDEVERLHANALSSHDYFLKIV 194
Query: 288 PTVYTDVSGHTIQSNQFSVT-EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
PT++ + S + Q++ + + S G + +P ++F YDLSPI V +T++ F H
Sbjct: 195 PTIFENRSDKKSFAFQYTYAYKDYISFGHGN-RVMPAIWFRYDLSPITVKYTDKRKPFYH 253
Query: 347 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
F+T +CA+VGG FTV+GIID+ I+ KK E+GK S
Sbjct: 254 FITTICAVVGGTFTVAGIIDSVIFTAAEVFKKA-ELGKLS 292
>gi|327265232|ref|XP_003217412.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Anolis carolinensis]
Length = 291
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 69/199 (34%), Positives = 102/199 (51%), Gaps = 22/199 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G+GC +NK+ GNFH V H A Q + +++H I+KL
Sbjct: 107 KIPLNNGDGCRFESHFSINKIPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 154
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L+G P + Y +K+VPTVY D+SG Q++V
Sbjct: 155 SFGDQLQAQKIRGSFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQQYPFQYTVAN 214
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ P ++F YDL+PI + + E F+T +CAI+GG FTV+GI D
Sbjct: 215 KEYVVYSHTGRIT--PAIWFRYDLTPITLKYIERRQPLYRFITTICAIIGGTFTVAGIFD 272
Query: 367 AFIYHGQRAIKKKIEIGKF 385
+ I+ A KKI++GK
Sbjct: 273 SCIFTASEAW-KKIQLGKM 290
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 22/91 (24%), Positives = 41/91 (45%), Gaps = 3/91 (3%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSRGE 67
D Y K+ +D TF+G +I++ +L L SEL ++ +L V D
Sbjct: 9 FDIYRKVPKDLTQPTFTGAIISVCCCFFILFLLLSELTGFIATEVVNELYVEDPDKDSSG 68
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHL 98
+ + +++ P L C ++ +D D G +
Sbjct: 69 KIEVTLNISLPNLHCELIGLDIQDEMGRHEI 99
>gi|189207969|ref|XP_001940318.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187976411|gb|EDU43037.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 394
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 101/402 (25%), Positives = 166/402 (41%), Gaps = 79/402 (19%)
Query: 8 IRSLDAYPKINEDFY--SRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+ S DA+PK + + R S +TL+ + + L +SE+ +L T V+
Sbjct: 21 VSSFDAFPKTKKTYLVQGRNSSAWTVTLI--LTCIYLSWSEISRWLAGSTSQSFSVEKGI 78
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
+++N DV A+ C+ L V+ D +G++ L ++ +K S
Sbjct: 79 SHDMQLNLDVIV-AMRCADLHVNMQDAAGDRTL--AGELLRKDPTSWS------------ 123
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
Q G LE G G E+ + E++ +A+++K P
Sbjct: 124 -------QWTGRNLERGTHELGIDAGKAQPWEEVWDVHEQLGKAHKRK--FSKTP----- 169
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 244
RI+ E + C IYG L+ NKV G+FH A G + + G H+ SF
Sbjct: 170 --------RIRGET-DSCRIYGSLDGNKVQGDFHITARGHGYIEFGQHL------DHSSF 214
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDV------- 294
N SH I +++FG ++P + NPLD TP +QY++ +VPT+YTD
Sbjct: 215 NFSHIIREMSFGPYYPSLTNPLDATIAVTPTPDDKFYKFQYYLSIVPTIYTDDPSLIPLL 274
Query: 295 -----------------SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTF 337
H I++NQ++VT S + +PG+F +D+ PI +
Sbjct: 275 ELVGSTSNHPGAASMFHGAHAIKTNQYAVTSQ---SHKVPENYVPGIFVKFDIEPIVLRV 331
Query: 338 TEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
EE F + + +V GV G G + K+
Sbjct: 332 VEEWGGFWRLIVTLINVVSGVMVAGGWAWQMFEWGCEVLGKR 373
>gi|156406959|ref|XP_001641312.1| predicted protein [Nematostella vectensis]
gi|156228450|gb|EDO49249.1| predicted protein [Nematostella vectensis]
Length = 287
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 78/217 (35%), Positives = 110/217 (50%), Gaps = 26/217 (11%)
Query: 181 DLIDQCKRE--GFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
D+ D+ R GF + ++ E GEGC I +NKV GNFH S H +G
Sbjct: 86 DIQDEMGRHEVGFKENVERREINNGEGCFISTRFTINKVPGNFHV----STHGAGK---- 137
Query: 236 ILAFQRDSFNISHKINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 291
Q DS +++H IN + FG + PG L + + Y +K+VPT+Y
Sbjct: 138 ----QPDSPDMNHIINAVNFGSRIMDKLPGAFTALKDRKRHDTNGLASHDYILKIVPTIY 193
Query: 292 TDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
+ G T S Q++ E+ S G Q LP ++F YDLSPI V + E HF+T
Sbjct: 194 QKLDGTTTFSYQYTWAYKEYVSYSHGG--QMLPAIWFRYDLSPITVKYIERRQPLYHFIT 251
Query: 350 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
VCAIVGG FTV+GIID+ ++ +K ++GK S
Sbjct: 252 TVCAIVGGTFTVAGIIDSAVFTASEMWRKH-QLGKLS 287
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 58/114 (50%), Gaps = 2/114 (1%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT-SRG 66
+R D Y K+ +D TF+G VI++ S + + LF SE ++ ++L VD +
Sbjct: 5 VRRFDIYRKVPKDLTEPTFAGAVISICSCLFITFLFLSEFYGFIGTEIASELFVDNPTED 64
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDS-QGNVIESR 119
+ + + ++T P + C +D D G + K ++ ++ +++ +G I +R
Sbjct: 65 DKIPVILNITLPRMKCEFPGLDIQDEMGRHEVGFKENVERREINNGEGCFISTR 118
>gi|291244956|ref|XP_002742359.1| PREDICTED: endoplasmic reticulum-golgi intermediate compartment
(ERGIC) 1-like [Saccoglossus kowalevskii]
Length = 318
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 69/199 (34%), Positives = 97/199 (48%), Gaps = 17/199 (8%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
+I GC + ++NKV GNFH S H +G + Q + H I++
Sbjct: 132 NKIPLNNNAGCRFEAYFKINKVPGNFHV----STHAAG-------SRQPQKADFVHTIHE 180
Query: 253 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
+ G+ NPL G + + Y++KVVPTVY DV G S Q++
Sbjct: 181 IIIGDDIQNKSINAAFNPLAGYDRSDAAAESSHDYYMKVVPTVYEDVWGRVNLSYQYTYA 240
Query: 308 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
S + +P ++F YD+SPI V + E+ F F+T +CAIVGG FTV+GIID+
Sbjct: 241 YKDYVSYGHGHRVMPAIWFRYDISPITVKYHEKRAPFYTFITTICAIVGGTFTVAGIIDS 300
Query: 368 FIYHGQRAIKKKIEIGKFS 386
IY KK EIGK S
Sbjct: 301 MIYSASEVFKKA-EIGKLS 318
>gi|393221326|gb|EJD06811.1| DUF1692-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 537
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 146/344 (42%), Gaps = 53/344 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ DA+PK+ + +R+ G +T++ + + LL +++ Y+ K +D G
Sbjct: 22 LNQFDAFPKLPSTYKARSGGRGFLTVLVAFISFLLVVNDIGEYIFGWPTYKFGLDNRPGH 81
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ-HLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
L IN D+ +PC LSVD D G++ +L D FK+ G + + Q
Sbjct: 82 YLAINVDLVV-NMPCKHLSVDLRDAVGDRLYLS---DGFKR----DGTLFDIGQA----- 128
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
+ LQ H L+ + D N ++ R Y K PD
Sbjct: 129 ---QALQSHTQALDARLAVAQARKSRGFFDTILRRNKDKFRPTYNYK------PD----- 174
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
G C +YG ++ KV N H ++S HV N+
Sbjct: 175 -------------GGACRVYGSIQAKKVTANLHITTAGHGYRSMHHV------DHSQMNL 215
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
SH I +FG +FP + PL P YQYF+ VVPT Y +G + ++Q+SV
Sbjct: 216 SHVITDFSFGPYFPDMAQPLKNTFELTHEPFIAYQYFLSVVPTTYIASNGKQVHTSQYSV 275
Query: 307 TEHFR--SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
T + R EQG PG+FF YDL P+++T ++ + + FL
Sbjct: 276 THYTRVLQHEQG----TPGIFFKYDLEPLQMTIHQKTTTLVQFL 315
>gi|367012766|ref|XP_003680883.1| hypothetical protein TDEL_0D00880 [Torulaspora delbrueckii]
gi|359748543|emb|CCE91672.1| hypothetical protein TDEL_0D00880 [Torulaspora delbrueckii]
Length = 348
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 165/370 (44%), Gaps = 69/370 (18%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +RS DA+PK +E R+F GG+ ++++ + +L + ++E Y + + VD
Sbjct: 1 MAGLRSFDAFPKTDETHQQRSFKGGLSSVMTYLFLLFMCWTEFGSYFGGYVDQQYKVDGE 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ--------GNVI 116
ET +IN D+ + +PC++L ++ D + +D K + K L Q G ++
Sbjct: 61 VRETFQINMDM-YVNMPCNLLHINVRD----KTMDRK--VVSKELSMQNMPFFVPYGTMV 113
Query: 117 ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWA 176
+ I P +D+ L E + +R++
Sbjct: 114 NDMKK-IATPDLDEIL------------------------------GEAIPAQFRER--- 139
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+D E L + +GC+IYG + VN+VAG G D
Sbjct: 140 ------MDPSVLEASLG--SDVTFDGCHIYGSVPVNRVAGELQIT------AKGWGYQDF 185
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVS 295
N SH IN+ ++G+ FP + NPLD M Y Y +VPTVY +
Sbjct: 186 EKAPVSEINFSHVINEFSYGDFFPYIDNPLDNTAKISIVDRLMGYLYDTSIVPTVYEKL- 244
Query: 296 GHTIQSNQFSVTEHF---RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
G + +NQ++V+E +S+++G T+PG+FF YD P+ ++ + +SF+ F+ +
Sbjct: 245 GAYVDTNQYAVSERQFDQKSTKRGS-TTVPGIFFRYDFEPLSISIKDRRLSFIQFIIRLV 303
Query: 353 AIVGGVFTVS 362
A++ V ++
Sbjct: 304 ALLSFVVYIA 313
>gi|348667045|gb|EGZ06871.1| hypothetical protein PHYSODRAFT_319561 [Phytophthora sojae]
Length = 469
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 110/228 (48%), Gaps = 44/228 (19%)
Query: 181 DLIDQCKREGFLQRIK--EEEG----------EGCNIYGFLEVNKVAGNFHFAPGKSFHQ 228
D ++ K+E F Q K E+G EGC +YG L V +V GNFH
Sbjct: 257 DAVEARKKELFEQDKKNAREQGKAIARSAVGPEGCRLYGHLYVKRVPGNFH--------- 307
Query: 229 SGVHVHDILAFQRDS--FNISHKINKLAFGEHFPG--------------VVNPLDGVRWT 272
VH+ + A+ DS N SH +N+L FGEH + LD +T
Sbjct: 308 --VHLANP-AYSMDSSLVNASHTVNELWFGEHLTSGEMSMLPRDAQMQLYTHRLDNQDYT 364
Query: 273 QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSP 332
+ Y ++IKVV Y I N + T H S+E LP + F YDLSP
Sbjct: 365 SFYKNHTYVHYIKVVTNSYVQSDAADI--NVYKYTAH--SNEYLETDDLPSIMFRYDLSP 420
Query: 333 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 380
+ V +E+ V F HFLT+ CAI+GGVFTV GI+D I+ RA+ KK+
Sbjct: 421 MSVRISEDSVPFYHFLTSACAIIGGVFTVIGILDQIIHQTARALNKKV 468
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 66/123 (53%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M+ ++ D Y KI ED T G +++ +M LLF E YL + +++D
Sbjct: 4 MDVLKKWDFYKKIPEDLTVSTLPGVSLSIAGCFIMFLLFILEFNSYLTVDYKYDIVMDEG 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
+T+RINF++T P LPC +VD D++G + ++ +I+K RLD +G + Q+
Sbjct: 64 LDQTMRINFNITVPDLPCEFATVDVSDMTGTRKHNMTSNIYKIRLDQKGRSVGLAQEKQI 123
Query: 125 APK 127
P+
Sbjct: 124 MPQ 126
>gi|313230728|emb|CBY08126.1| unnamed protein product [Oikopleura dioica]
Length = 289
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 79/219 (36%), Positives = 121/219 (55%), Gaps = 28/219 (12%)
Query: 181 DLIDQCKRE--GFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
D+ D+ R G+L+ +++ G+GC G VNKV GNFH S H S V
Sbjct: 86 DIQDEHGRHEVGYLENTRKDPINGGKGCIFGGTFHVNKVPGNFHV----STHSSQV---- 137
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVN-------PLDGVRWTQETPSGMYQYFIKVVP 288
Q + +++H+I++L+FGE G+ + PL+G + E + + Y +KVVP
Sbjct: 138 ----QPQNPDMNHEIHELSFGESMKGINSNLPANFIPLNGKKTGAEKMAS-HDYTLKVVP 192
Query: 289 TVYTDVSGHTIQSNQFS-VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
TVY D+ T QF+ V + F + G + +P ++F Y++SPI V +TE+ HF
Sbjct: 193 TVYQDIKKRTKFGYQFTAVYKDFVAFGHGH-RVMPAIWFRYEVSPITVKYTEKSKPLYHF 251
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
LT CAI+GG FTV+G+ID+ I+ + +KK E GK S
Sbjct: 252 LTTFCAIIGGTFTVAGMIDSMIFSAHQMVKKAGE-GKLS 289
>gi|313220803|emb|CBY31643.1| unnamed protein product [Oikopleura dioica]
Length = 289
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 79/219 (36%), Positives = 121/219 (55%), Gaps = 28/219 (12%)
Query: 181 DLIDQCKRE--GFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
D+ D+ R G+L+ +++ G+GC G VNKV GNFH S H S V
Sbjct: 86 DIQDEHGRHEVGYLENTRKDPINGGKGCIFGGTFHVNKVPGNFHV----STHSSQV---- 137
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVN-------PLDGVRWTQETPSGMYQYFIKVVP 288
Q + +++H+I++L+FGE G+ + PL+G + E + + Y +KVVP
Sbjct: 138 ----QPQNPDMNHEIHELSFGESMKGINSNLPANFIPLNGKKTGAEKMAS-HDYTLKVVP 192
Query: 289 TVYTDVSGHTIQSNQFS-VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
TVY D+ T QF+ V + F + G + +P ++F Y++SPI V +TE+ HF
Sbjct: 193 TVYQDIKKRTKFGYQFTAVYKDFVAFGHGH-RVMPAIWFRYEVSPITVKYTEKSKPLYHF 251
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
LT CAI+GG FTV+G+ID+ I+ + +KK E GK S
Sbjct: 252 LTTFCAIIGGTFTVAGMIDSMIFSAHQMVKKAGE-GKLS 289
>gi|301100294|ref|XP_002899237.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
gi|262104154|gb|EEY62206.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
Length = 469
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 83/228 (36%), Positives = 116/228 (50%), Gaps = 44/228 (19%)
Query: 181 DLIDQCKREGFLQRIKE--EEG----------EGCNIYGFLEVNKVAGNFHFAPGKSFHQ 228
D+++ K+E F Q K+ E+G EGC ++G L V +V GNFH
Sbjct: 257 DVVEARKKELFEQDKKDAREQGRAIARSAVGPEGCRLFGHLYVKRVPGNFH--------- 307
Query: 229 SGVHVHDILAFQRDS--FNISHKINKLAFGEHF-PG-------------VVNPLDGVRWT 272
VH+ + A+ DS N SH +N+L FGEH PG + L+ +T
Sbjct: 308 --VHLANP-AYSMDSSLVNASHTVNELWFGEHLAPGDMSRLPREAQTQLYTHRLENQDFT 364
Query: 273 QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSP 332
+ Y ++IKVV Y V G + N + T H S+E LP V F YDLSP
Sbjct: 365 SLYKNHTYVHYIKVVTNSY--VQGDGSEINVYKYTAH--SNEYLETDDLPSVMFRYDLSP 420
Query: 333 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 380
+ V +E+ V F HF+T+ CAI+GGVFTV GI+D I+ RA+ KK+
Sbjct: 421 MSVRISEDTVPFYHFVTSACAIIGGVFTVIGIVDQIIHQTARALNKKV 468
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 63/112 (56%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ D Y KI ED T G +++ +M LLF E YL + +++D
Sbjct: 4 VDVLKKWDFYKKIPEDLTVSTLPGVSLSIAGCFIMFLLFILEFNSYLTVDYKYDIVMDEG 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI 116
+T+RINF++T P LPC SVD D++G + ++ DIFK RLD +G ++
Sbjct: 64 LDQTMRINFNITVPDLPCEFASVDVSDMTGTRKHNMTSDIFKIRLDQKGRMV 115
>gi|330935325|ref|XP_003304912.1| hypothetical protein PTT_17645 [Pyrenophora teres f. teres 0-1]
gi|311318248|gb|EFQ86993.1| hypothetical protein PTT_17645 [Pyrenophora teres f. teres 0-1]
Length = 395
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 163/387 (42%), Gaps = 80/387 (20%)
Query: 8 IRSLDAYPKINEDFY--SRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+ S DA+PK + + R S +TL+ + + L +SE+ + T V+
Sbjct: 21 VSSFDAFPKTKKTYLVQGRNSSAWTVTLI--LTCIYLSWSEISRWYAGSTWQSFAVEKGV 78
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
++IN D+ A+ C+ L V+ D +G++ L ++ +K S
Sbjct: 79 SHDMQINLDIIV-AMRCADLHVNMQDAAGDRTL--AGELLRKDPTSWS------------ 123
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
Q G LE G+ G S E+ + E++ +A+++K S
Sbjct: 124 -------QWTGRNLERGTHELGTEAGDAPSWEEAWDVREQLGKAHKRK---FSK------ 167
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 244
RI+ + C IYG L+ NKV G+FH A G + + G H+ SF
Sbjct: 168 ------TPRIRGNP-DSCRIYGSLDGNKVQGDFHITARGHGYMEFGEHL------DHSSF 214
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMY---QYFIKVVPTVYTD-------- 293
N SH I +++FG ++P + NPLD TP + QY++ +VPT+YTD
Sbjct: 215 NFSHIIREMSFGPYYPSLTNPLDATIAVTPTPDDKFYKFQYYLSIVPTIYTDDPTLIPYL 274
Query: 294 --VS---------------GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 336
VS I++NQ++VT S + +PGVF +D+ PI +
Sbjct: 275 EAVSSTAGNHPGAASIFHGARAIKTNQYAVTSQ---SHKVPENYVPGVFVKFDIEPIMLA 331
Query: 337 FTEEHVSFLHFLTNVCAIVGGVFTVSG 363
EE F + + +V GV G
Sbjct: 332 VVEEWSGFWRLIVTLVNVVSGVMVAGG 358
>gi|195402035|ref|XP_002059616.1| GJ14724 [Drosophila virilis]
gi|194147323|gb|EDW63038.1| GJ14724 [Drosophila virilis]
Length = 434
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 169/353 (47%), Gaps = 32/353 (9%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + T GG ++L+S ++++ L ++ELR Y N ET+++ D +
Sbjct: 19 KNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELRYYWN---ETEIIYQFEPDMA 75
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
E ++++ D+T A+PC+ LS VD MD + + D+F + G + +++G+
Sbjct: 76 LDEQVQMHLDITV-AMPCASLSGVDLMD-------ETQQDVF-----AYGTL---QREGV 119
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYG--AESSDEDCCNNCEEVREA--YRKKGWALSN 179
D +RH ++ Y Y A+ +D +E+ + A +
Sbjct: 120 WWQMSDAD-RRHFKSMQMTNHYLREEYHSVADILFKDILRERTPTKESETHAATAAAAAA 178
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
++ E + + C ++G L +NKVAG H G H ++ F
Sbjct: 179 APPPPGALQQPQQLAQLESKYDACRLHGTLGINKVAGVLHLVGGAQPVVGMFEDHWMIEF 238
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
+R N +H+IN+L+FG++ +V PL+G S QYF+KVVPT TI
Sbjct: 239 RRMPANFTHRINRLSFGQYSRRIVQPLEGDETIIHEESTTVQYFLKVVPTEIQHTFS-TI 297
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+ Q++VTE+ S PG++F YD S +K+ + + L F+ +C
Sbjct: 298 STFQYAVTENVHSERNSYGS--PGIYFKYDWSALKIVVSHDRDYLLTFVIRLC 348
>gi|302675040|ref|XP_003027204.1| hypothetical protein SCHCODRAFT_70909 [Schizophyllum commune H4-8]
gi|300100890|gb|EFI92301.1| hypothetical protein SCHCODRAFT_70909 [Schizophyllum commune H4-8]
Length = 528
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 154/358 (43%), Gaps = 57/358 (15%)
Query: 1 MDAIMNKIRS--------LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLN 52
M A+++K+ + LDA+PK+ + +R+ S G +TL + + +L F+++ Y+
Sbjct: 1 MAALIDKLEAVLPPGLAKLDAFPKLPGTYKARSESRGFLTLFVAFICFILVFNDISEYIW 60
Query: 53 AVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
+ + VD + IN D+ +PC +SVD D G++ H + +R ++
Sbjct: 61 GWPDYEFSVDRHSSSFMNINVDMVV-NMPCRFISVDLRDAVGDRLFLSNHGL--RRDGTK 117
Query: 113 GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRK 172
+V ++ + L+ H L E V + +
Sbjct: 118 FDVGQATK-----------LKEHARALSAREA---------------------VAQGRKN 145
Query: 173 KGWALSNPDLIDQCKREGFLQRIK-EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 231
+G L ++ F E G C ++G LEV KV N H + S
Sbjct: 146 RGLFSG---LFGGKSKDLFPPTYNYEPHGSACRVWGSLEVKKVTANLHITTAGHGYASRE 202
Query: 232 HV-HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTV 290
H H ++ N++H I++ +FG HFP +V PLD + P YQY++ VVPT
Sbjct: 203 HADHKVM-------NLTHVISEFSFGPHFPDIVQPLDYTFEVAKDPFVAYQYYLHVVPTT 255
Query: 291 YTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
Y + +NQ+SVT + + E Q PG+FF +D+ P+ + + SF
Sbjct: 256 YIAPRSAPLSTNQYSVTHYKKVFEHN--QATPGIFFKFDIDPLAIQIHQRTTSFARLF 311
>gi|123483410|ref|XP_001324018.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121906894|gb|EAY11795.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 384
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 167/387 (43%), Gaps = 55/387 (14%)
Query: 8 IRSLDAYPK-INEDFYSRTFSGGVITLV-----SSIVMLLLFFSELRLYLNAVTETKLLV 61
I+ +D + K N+DF T S +++ + ++IV++ +F + + KL+
Sbjct: 5 IQYIDIFDKSTNDDFKLDTKSSAILSTILTAFGATIVLIHIF---------GLIQPKLVR 55
Query: 62 DTS-------RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN 114
D + + E ++ DV +PC L +D +D G L++ RL +Q
Sbjct: 56 DLNLEIQGLDQQELANVSLDVKV-NMPCYFLHLDVIDNLGFNQLNINTTAKFIRLSAQ-- 112
Query: 115 VIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 174
+K L G E + C SCYG + CCN+CE+ + G
Sbjct: 113 --------------EKEL---GYANETISSICHSCYGL-LPEGSCCNSCEQTLLLHIMNG 154
Query: 175 WALSNPDLIDQC--KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
A + D QC K G K E E C I G + +NK GNFH APG + + H
Sbjct: 155 KAANTKDW-PQCQGKNPG-----KVYENEKCRIKGKVCLNKAQGNFHIAPGTNMKERYGH 208
Query: 233 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG-MYQYFIKVVPTVY 291
VHD L+ Q +F++SH I + G P NPL V+ Q +Y+Y + V P VY
Sbjct: 209 VHD-LSGQLPNFDLSHVIQGMRVGPKIPLTYNPLRYVQQIQNPNQPVVYRYDLVVTPAVY 267
Query: 292 TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
SG+ I + T G PG++F Y +P VT +++ T++
Sbjct: 268 K--SGNRILGKGYDYTAMINRFFVGNSGGAPGIYFHYSFTPYGVTVNATYLTIAQIFTSI 325
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKK 378
+ G + + IID ++ + + K
Sbjct: 326 FGFMSGAYAIFSIIDESMFKDDKRMAK 352
>gi|323310251|gb|EGA63441.1| Erv46p [Saccharomyces cerevisiae FostersO]
Length = 189
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 94/183 (51%), Gaps = 27/183 (14%)
Query: 10 SLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETL 69
SLDA+ K ED RT +GG+ITL + L L +E + + VT +L+VD R L
Sbjct: 8 SLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWXQFNSVVTRPQLVVDRDRHAKL 67
Query: 70 RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-KRLDSQGNVIESRQDGIGAPKI 128
+N DVTFP++PC ++++D MD SGE LD+ F RL+S+G P
Sbjct: 68 ELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGR-----------PVG 116
Query: 129 DKPLQRHGGR------LEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKK 173
D GG + ++ YCG CYGA+ ++ CC +C+ VR AY +
Sbjct: 117 DATELHVGGNGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176
Query: 174 GWA 176
GWA
Sbjct: 177 GWA 179
>gi|226479782|emb|CAX73187.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Schistosoma japonicum]
Length = 410
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 155/365 (42%), Gaps = 49/365 (13%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I I LD +PK+ ++ T+ GG++T+++ + L +E R YL+ + +D
Sbjct: 18 TITKLINELDVFPKLPKECKKSTWGGGLLTILTFCCISWLLVNEFRDYLDPPVKYSYEID 77
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISG-----EQHLDVKHDIFKKRLDSQGNV-I 116
+++N D+ A PC +S+D +D +G E+ ++ +F L V
Sbjct: 78 KDISGKIKVNIDIVV-ASPCHAISMDVVDTTGSPLFGEEKIEYISTVFD--LSPPARVAF 134
Query: 117 ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWA 176
+ RQ GA L+ ++H +SD + N E
Sbjct: 135 KKRQYVAGA------LREKHHAIQH-------WLWKYASDTNVFTNFNE----------- 170
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG-VHVHD 235
PD Q + C I G L V KV GN H GK G +H+H
Sbjct: 171 ---PDT----------QVSGGRNPDACRIVGTLFVKKVEGNIHILLGKPLEGLGNLHLHV 217
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS 295
+ + N SH+IN +FG+ G ++PL+ + S +QYF+ +VPT +
Sbjct: 218 APFLSKTNLNFSHRINHFSFGDLVNGQIHPLEAIESITAVASTSFQYFVTMVPTKVVN-Q 276
Query: 296 GHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
H ++ Q++ T R+ + +PG+FF YD P+ V T + F T + A+
Sbjct: 277 FHVTETYQYAATVQNRTIDHASDSHGIPGIFFIYDTFPLVVKITYDRELLGTFFTRLAAL 336
Query: 355 VGGVF 359
GG+F
Sbjct: 337 AGGIF 341
>gi|361126303|gb|EHK98312.1| putative ER-derived vesicles protein 41 [Glarea lozoyensis 74030]
Length = 343
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 71/176 (40%), Positives = 99/176 (56%), Gaps = 18/176 (10%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
R++ G+ C IYG LEVNKV G+FH A G + + G D AF N SH +N+
Sbjct: 142 RLRGNVGDSCRIYGNLEVNKVQGDFHLTARGHGYQEWGAGHLDHTAF-----NFSHIVNE 196
Query: 253 LAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYT-DVSGH----TIQSNQFS 305
L+FG +P ++NPLD R TP+ +QYF+ VVPT YT D S TI +NQ++
Sbjct: 197 LSFGAFYPSLLNPLD--RTVSTTPNHFHKFQYFLSVVPTAYTVDSSSRSARDTIFTNQYA 254
Query: 306 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
VTE S + +++PG+FF YD+ P+ +T E SFL F+ V + GV
Sbjct: 255 VTEQ---SHEVNERSVPGIFFKYDIEPMLLTVEESRDSFLRFVVKVVNVFSGVLVA 307
>gi|348690307|gb|EGZ30121.1| COPII vesicle trafficking protein [Phytophthora sojae]
Length = 306
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 82/227 (36%), Positives = 115/227 (50%), Gaps = 44/227 (19%)
Query: 188 REGFLQRIKEEE--GE-GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
+E L++ +EE GE GC ++G ++V KVAG+ FA H+ + V F +F
Sbjct: 91 KEILLKKDIQEEPFGENGCRLFGTVQVQKVAGDLSFA-----HEGSLTVFSFFDFL--NF 143
Query: 245 NISHKINKLAFGEHFPGVVNPLDGV------RWTQET----------------------- 275
N SH +N L FG P + PL V TQE+
Sbjct: 144 NSSHVVNHLRFGPQIPDMETPLIDVSKILERNCTQESCWLARSWDSVAALLTSFIALLLF 203
Query: 276 PSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIK 334
Y+YF+ VVP+ Y ++G ++ + Q+SVTEH SS Q + PGV F Y+ SPI
Sbjct: 204 TVATYKYFVNVVPSRYVYLNGRSVTTFQYSVTEHETSSRGPNGQVSFPGVIFSYEFSPIA 263
Query: 335 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 381
V + E S LHFLT+ AIVGGVF V+ +ID IY ++ KKI+
Sbjct: 264 VEYIESKPSVLHFLTSTSAIVGGVFAVARMIDGAIY----SVSKKID 306
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 49/105 (46%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGET 68
R D K E RT GGV+TL+S +V+ L SE ++ ++ VDT
Sbjct: 4 RRFDLNVKGVEGIQERTIGGGVVTLLSCVVVAFLLLSEFSVWWTVSVTHRMHVDTDPDYP 63
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG 113
+ I DV+F C +++D D G + + +K DI ++ G
Sbjct: 64 INIEVDVSFLHEACKEVALDVSDSKGHKEILLKKDIQEEPFGENG 108
>gi|443683891|gb|ELT87978.1| hypothetical protein CAPTEDRAFT_224400 [Capitella teleta]
Length = 292
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 106/202 (52%), Gaps = 23/202 (11%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
++ EGC ++NKV GNFH + S Q N+ H +++L
Sbjct: 105 KVPINNNEGCRFKSSFKINKVPGNFHISTHASKEQP------------PQPNMKHIVHEL 152
Query: 254 AFGE------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 307
FG+ H PG NPL ++ + Y++K+VP V+ D SG T+ + + T
Sbjct: 153 IFGDRVPQTIHIPGSFNPLLEKDKSESNALSSHDYYLKIVPAVFNDYSGKTLM-HPYQYT 211
Query: 308 EHFRSS--EQGRLQTLPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGI 364
+R S ++G +P ++F Y L+P+ V ++E+ + F HFLT VCAIVGG FTV+GI
Sbjct: 212 FAYRHSIRQRGGQVVIPAIWFKYKLNPMCVKYSEQRPIPFYHFLTAVCAIVGGTFTVAGI 271
Query: 365 IDAFIYHGQRAIKKKIEIGKFS 386
D+F++ I KK E+GK S
Sbjct: 272 FDSFLFTAAE-IFKKAELGKLS 292
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 47/93 (50%), Gaps = 2/93 (2%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD--TSR 65
IR LD Y KI +D T +G I++ S + + LF SEL YL++ T++ VD +
Sbjct: 5 IRRLDIYRKIPKDLTQPTKTGACISVGSVLFIAYLFISELTSYLSSEIVTEMYVDDPATN 64
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
E + + D++ + C + +D D G +
Sbjct: 65 SERIPVKLDISLLNMECKYIGLDIQDDLGRHEV 97
>gi|194911936|ref|XP_001982403.1| GG12755 [Drosophila erecta]
gi|190648079|gb|EDV45372.1| GG12755 [Drosophila erecta]
Length = 441
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 182/378 (48%), Gaps = 29/378 (7%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + T GG ++L+S ++++ L ++EL Y + ET ++ D +
Sbjct: 19 KNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELYYYWH---ETAIVYQFEPDIA 75
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKK-RLDSQGNVIE-SRQD 121
E ++++ D+T A+PC+ LS VD MD + + D+F L +G E S+ D
Sbjct: 76 LDEQVQMHVDITV-AMPCASLSGVDLMD-------ETQQDVFAYGTLQREGVWWEMSKHD 127
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ I +Q H R + + A+ +D + VRE + A
Sbjct: 128 RLQFEAIQ--MQNHYLREQFHSV-------ADVLFKDIMRDPHPVREGASQVPAAPPPGA 178
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
L G E + + C ++G L +NKVAG H G H ++ +R
Sbjct: 179 LALAVDLMGQHNVQPESKYDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRR 238
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N +H+IN+L+FG++ +V PL+G + QYF+KVVPT + TI +
Sbjct: 239 MPANFTHRINRLSFGQYSGRIVQPLEGDEIVIHEEATTIQYFLKVVPTEIHQ-TFTTINA 297
Query: 302 NQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Q++VTE+ R + R PG++F YD S +K+ + L F +C+I+ G+
Sbjct: 298 FQYAVTENVRKLDSERNSYGSPGIYFKYDWSALKIVVDNDRDHLLTFAIRLCSIISGIIV 357
Query: 361 VSGIIDAFIYHGQRAIKK 378
+SG I+A + QR + +
Sbjct: 358 ISGAINALLLGIQRRLLR 375
>gi|443921357|gb|ELU41041.1| endoplasmic reticulum-derived transport vesicle ERV46 [Rhizoctonia
solani AG-1 IA]
Length = 579
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 165/390 (42%), Gaps = 100/390 (25%)
Query: 15 PKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFD 74
P Y+ FS +TL+S ++L+ E+ Y + ++VD SRGE + +N +
Sbjct: 173 PLRENTLYANRFS---VTLISMGIILIFTIIEIIDYRRIGMASDIIVDVSRGEQISVNMN 229
Query: 75 VTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQR 134
+TFP +PC +LS+D D+SG+ DV H I K RL+ G +I
Sbjct: 230 ITFPRVPCYLLSLDITDVSGDIQQDVSHHILKTRLEPSGAMIH----------------- 272
Query: 135 HGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREG--FL 192
E+ N + +G L P + R G L
Sbjct: 273 ----------------------ENTLNYRIKSETGISHQGMELRRP----EHDRAGMLLL 306
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKI 250
+ I +E + FL +NKV GNFHF+PG+SF H +D++ + +D + H I
Sbjct: 307 ELIPFKEP-----HPFLRINKVTGNFHFSPGRSFLSQRGHAYDLVPYLKDGNHHDFGHYI 361
Query: 251 NKLAF---------------GEHFPGVV----NPLDGVRWTQETPSG-MYQYFIKVVPTV 290
++ F G + V PLDG+ E PS M QYF+KVV T
Sbjct: 362 HEFHFEGDREIEDRWREGNRGTEWRARVGSDKQPLDGL----EQPSNWMIQYFLKVVSTE 417
Query: 291 YTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF--FYDLSPIKVTFTEEHVSFLHFL 348
+ G ++++Q+SVT + R PG F D + IK T
Sbjct: 418 VRHLDGDLVRAHQYSVTNYERDIR-------PGHEFDPLRDANGIKTTH----------- 459
Query: 349 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
+CAIVGGV T++ I D+ + I++
Sbjct: 460 -GLCAIVGGVLTLASIADSVAFASLNKIEE 488
>gi|198421328|ref|XP_002120997.1| PREDICTED: similar to Endoplasmic reticulum-Golgi intermediate
compartment protein 1 (ER-Golgi intermediate compartment
32 kDa protein) (ERGIC-32) [Ciona intestinalis]
Length = 289
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 110/201 (54%), Gaps = 21/201 (10%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
+++ +G GC ++NKV GNFH V H + Q D+ +++H+I +
Sbjct: 103 EKVPTHDGNGCLFTSRFQINKVPGNFH-----------VSTHSARS-QPDNPDMTHEIKE 150
Query: 253 LAFGEHF--PGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS- 305
L G++ PGV N L+G + P + Y +K+VPTVY + G+ Q++
Sbjct: 151 LRIGDNMVIPGVKSQSFNALEGKTTFDKHPLSSHDYIMKIVPTVYESIDGNLRYLYQYTN 210
Query: 306 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ + + G+ + +P ++F Y+++PI V +TE F HF+T VCAI+GG FTV+GII
Sbjct: 211 AYKDYIAYGHGQ-RVMPAIWFRYEMTPITVKYTERRKPFYHFITMVCAIIGGTFTVAGII 269
Query: 366 DAFIYHGQRAIKKKIEIGKFS 386
D+ I+ + KK+ IGK S
Sbjct: 270 DSMIFSATE-MYKKLTIGKLS 289
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 42/92 (45%), Gaps = 1/92 (1%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR-G 66
IR D Y K+ +D T +G I++ + L SEL +L ++L VD + G
Sbjct: 5 IRRFDIYRKVPKDLTQPTTTGAAISVGCCFFISYLLISELLGFLTIDVASELYVDDPQSG 64
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
+ + + ++ P + C L +D D G +
Sbjct: 65 DKIPVQIIISLPKMKCEYLGMDIQDSMGRHEV 96
>gi|62319241|dbj|BAD94459.1| hypothetical protein [Arabidopsis thaliana]
Length = 56
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 53/56 (94%), Positives = 56/56 (100%)
Query: 331 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
SPIKVTFTEEH+SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 1 SPIKVTFTEEHISFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQKAIKKKMEIGKFS 56
>gi|325184531|emb|CCA19024.1| endoplasmic reticulumGolgi intermediate compartment protein
putative [Albugo laibachii Nc14]
Length = 466
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 72/192 (37%), Positives = 96/192 (50%), Gaps = 26/192 (13%)
Query: 201 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 260
EGC +YG L V +V GNFH H S H + N SH +N+L FGE
Sbjct: 288 EGCQLYGHLIVKRVPGNFHI------HLS----HPFYSMNSSLVNASHTVNELWFGEVLS 337
Query: 261 GVV-------NPLDGVRWT-QETPSGM----YQYFIKVVPTVYTDVSGHTIQSNQFSVTE 308
LD R QE + M Y ++IKVV Y +G I + +++
Sbjct: 338 ASALAKLPPNTRLDSHRLARQEFTAYMQNYTYVHYIKVVTNTYVQRNGEVISAYRYTA-- 395
Query: 309 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 368
S+E + LP V F YDLSP+ V TE + F HF+T+ CAI+GGVFTV GIID
Sbjct: 396 --HSNEYLETEDLPSVMFRYDLSPMSVRITERSMPFYHFVTSACAIIGGVFTVIGIIDQL 453
Query: 369 IYHGQRAIKKKI 380
++ RA+ KK+
Sbjct: 454 VHQTVRAMNKKV 465
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 66/123 (53%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M+ ++ D Y KI ED T G +++V +ML+LF E YL+ +++D
Sbjct: 4 MDVLKKWDFYKKIPEDLTVSTLPGVSLSIVGCFIMLILFILEFNAYLSVNHAYDIVIDEG 63
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
E INF++T P LPC S+D D++G + ++ ++ K R+D++G ++ D +
Sbjct: 64 LDEKFEINFNITIPDLPCEFASIDVSDMTGTRKHNMTKNVSKFRIDTKGRLVGFASDEVT 123
Query: 125 APK 127
PK
Sbjct: 124 HPK 126
>gi|195564437|ref|XP_002105825.1| GD16474 [Drosophila simulans]
gi|194203186|gb|EDX16762.1| GD16474 [Drosophila simulans]
Length = 441
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 180/370 (48%), Gaps = 31/370 (8%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + + GG ++L+S ++++ L ++EL Y + ET+++ D +
Sbjct: 19 KNLDAFKKVPEKYTETSEIGGTLSLLSRLLIVYLVYTELHYYWH---ETEIVYQFEPDIA 75
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKK-RLDSQGNVIE-SRQD 121
E ++++ D+T A+PC+ LS VD MD + + D+F L +G E S D
Sbjct: 76 LDEQVQMHVDITV-AMPCASLSGVDLMD-------ETQQDVFAYGTLQREGVWWEMSEHD 127
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ I +Q H R E + A+ +D + RE+ K A
Sbjct: 128 RLQFQAIQ--IQNHYLREEFHSV-------ADVLFKDIMRDPHPARESASKTHAAPPPGA 178
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
L G E + + C ++G L +NKVAG H G H ++ +R
Sbjct: 179 LPLSVDLHGQHNVQPESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRR 238
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQ 300
N +H+IN+L+FG++ +V PL+G + QYF+KVVPT ++ + TI
Sbjct: 239 MPANFTHRINRLSFGQYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFT--TIN 296
Query: 301 SNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+ Q++VTE+ R + R PG++F YD S +K+ + + F +C+I+ G+
Sbjct: 297 AFQYAVTENVRKLDSERNSYGSPGIYFKYDWSALKIMVRNDRDHLVTFAIRLCSIISGII 356
Query: 360 TVSGIIDAFI 369
+SG I+A +
Sbjct: 357 VISGAINALL 366
>gi|50545267|ref|XP_500171.1| YALI0A17600p [Yarrowia lipolytica]
gi|49646036|emb|CAG84103.1| YALI0A17600p [Yarrowia lipolytica CLIB122]
Length = 337
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 169/381 (44%), Gaps = 71/381 (18%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+RS DA+PK+N + ++ GG+ TLV ++ SELR Y N E V E
Sbjct: 7 LRSFDAFPKVNTAYKRQSTRGGLATLVIGVLCFYFLCSELRGYSNGHEEHIYTVTKDLAE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
T+++N DVT A+PC + V A D S + H++ L+ QG + D
Sbjct: 67 TIQLNVDVTV-AMPCKSIKVIAQDYSEDTFF--AHEL----LNMQGLTYDFGTD------ 113
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
R++H E + Y S ++ + + + G ++P
Sbjct: 114 ----------RMQH-EIHSHKAYEMNS------KTLKKSKFKHTRVGSHSTDPH------ 150
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGN---FHFAPGKSFHQSGVHVHDILAFQRDSF 244
C I G + +N V G F+ + F ++ + A D
Sbjct: 151 ---------------CRISGSVPINHVEGALQIFNLPDNQYF------INPMKA--SDGL 187
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-TIQSNQ 303
N++H I++L+FG++FP V+NPLDGV + P YQYF+ VP Y+ SG I + Q
Sbjct: 188 NLTHAIHELSFGDYFPKVLNPLDGVSTVTDEPLMSYQYFLSAVPVEYS--SGRKKIHTYQ 245
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
++V + ++ Q T P +FF Y P+ + + + F+ + +I+GG F V G
Sbjct: 246 YAVKKQ-TTNLQEHFVTRPAIFFHYKYEPVTLKIQDSRETLTVFVVKLLSILGG-FVVCG 303
Query: 364 IIDAFIYHGQRAIKKKIEIGK 384
++I G +KI +GK
Sbjct: 304 ---SWIVRGGEKAYEKI-VGK 320
>gi|241560364|ref|XP_002401002.1| COPII vesicle protein, putative [Ixodes scapularis]
gi|215501827|gb|EEC11321.1| COPII vesicle protein, putative [Ixodes scapularis]
gi|442749161|gb|JAA66740.1| Putative copii vesicle protein [Ixodes ricinus]
Length = 285
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 76/215 (35%), Positives = 110/215 (51%), Gaps = 24/215 (11%)
Query: 181 DLIDQCKRE--GFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
D+ D R GF++ K G GC G ++KV GNFH S H +
Sbjct: 86 DIQDDMGRHEVGFVENTEKTPVGAGCRFEGKFYIHKVPGNFHM----STHAA-------- 133
Query: 238 AFQRDSFNISHKINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
A Q D +++H I+ L FG E G N LD + ++ + Y +K+VPTV+
Sbjct: 134 AKQPDKIDMTHIIHDLTFGNKMVEGVRGSFNSLDEMDKSEANGLESHDYVMKIVPTVFEK 193
Query: 294 VSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
I+S Q++ + S GR+ +P ++F YDL+PI V +T V FLT+V
Sbjct: 194 SPSERIESYQYTYAYKSYVSISHSGRI--MPAIWFRYDLTPITVKYTRRSVPLYSFLTSV 251
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
CAIVGG FTV+GI+D+ ++ I KK E+GK S
Sbjct: 252 CAIVGGTFTVAGIVDSLVFTASE-IFKKYEMGKLS 285
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 48/92 (52%), Gaps = 1/92 (1%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT-SRG 66
+R D Y KI +D T +G VI+++S + +LF SE Y++ ++L VD S
Sbjct: 5 VRRFDIYRKIPKDLTQPTVTGAVISILSCFFISILFLSEFISYMSPELASELFVDNPSSA 64
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
+ + ++ ++T L CS + +D D G +
Sbjct: 65 DKIPVSINITLLKLDCSAVGLDIQDDMGRHEV 96
>gi|254579156|ref|XP_002495564.1| ZYRO0B14344p [Zygosaccharomyces rouxii]
gi|238938454|emb|CAR26631.1| ZYRO0B14344p [Zygosaccharomyces rouxii]
Length = 353
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 167/363 (46%), Gaps = 63/363 (17%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +RS DA+PK +E ++ +GG+ ++ + + +L + ++E Y + VD
Sbjct: 1 MAGLRSFDAFPKTDETHVKKSSNGGLSSIFTYLFLLFIAWTEFGSYFGGYVDEHYEVDDQ 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
ET +IN D+ + PC L ++ D + ++ K L+ +
Sbjct: 61 LRETFQINMDL-YVKTPCQYLDINVRDTTMDRKF------VSKELNLE------------ 101
Query: 125 APKIDKPL-QRHGGRL-EHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
D P +G R+ + NE ++ +N + +R+K + ++
Sbjct: 102 ----DMPFFIPYGSRVNDMNEIVTPDL-------DNVLSNA--IPAQFREK---IDTNNM 145
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR- 241
D+ +R+ F C+I+G ++VN+VAG Q H +F R
Sbjct: 146 FDEEERDAF---------NSCHIFGSVQVNRVAGEL---------QITAKGHGYSSFMRA 187
Query: 242 --DSFNISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
+ + SH IN+L++GE +P + NPLD ++ + P + Y +VPT+Y + G
Sbjct: 188 PPEEIDFSHVINELSYGEFYPYIDNPLDSTAKFVPDAPRTTFVYDTAIVPTIYEKL-GAK 246
Query: 299 IQSNQFSVTEHFRSSE--QGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
I +NQ++V+E+ + E QG+ PG+F YD P+ + ++ +SF+ F+ + AI+
Sbjct: 247 IDTNQYAVSEYHINPEAQQGKGPIRFPGIFLRYDFEPLSIHISDVRLSFIQFVVRLVAIL 306
Query: 356 GGV 358
V
Sbjct: 307 SFV 309
>gi|312081872|ref|XP_003143209.1| HT034 [Loa loa]
gi|307761627|gb|EFO20861.1| hypothetical protein LOAG_07628 [Loa loa]
Length = 292
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 108/204 (52%), Gaps = 20/204 (9%)
Query: 190 GFLQRIKEEE--GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 247
GF+Q ++ GC G E++KV GNFH + H D Q +++++
Sbjct: 102 GFVQNTEKIPIGTSGCRFEGKFEISKVPGNFHLS---------THAADT---QPETYDMR 149
Query: 248 HKINKLAFGEHF-----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
H I+ + FG++ G NPL Q S + Y +K+VP+VY D++G+T S
Sbjct: 150 HTIHSVVFGDNIITSQNLGSFNPLKNREALQTDGSFTHDYVLKIVPSVYEDINGNTKYSY 209
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
Q++ + + +P ++F Y+L PI + +TE F F+T++CA+VGG FTV+
Sbjct: 210 QYTYAHKEYVTYHYSGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVA 269
Query: 363 GIIDAFIYHGQRAIKKKIEIGKFS 386
GIIDA ++ + +K +IGK S
Sbjct: 270 GIIDASLF-SLTELYRKHQIGKLS 292
>gi|427788003|gb|JAA59453.1| Putative copii vesicle protein [Rhipicephalus pulchellus]
Length = 285
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/215 (34%), Positives = 109/215 (50%), Gaps = 24/215 (11%)
Query: 181 DLIDQCKRE--GFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
D+ D R GF++ K G GC G ++KV GNFH S H +
Sbjct: 86 DIQDDMGRHEVGFVENTEKTPVGSGCRFEGKFFIHKVPGNFHV----STHAA-------- 133
Query: 238 AFQRDSFNISHKINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
A Q D +++H I+ L FG + G N LD + + + Y +K+VPTVY
Sbjct: 134 AKQPDKIDMTHIIHDLTFGVKMTDEVRGSFNSLDEMDKSGANGIESHDYVMKIVPTVYEK 193
Query: 294 VSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
G I+S Q++ + S GR+ +P ++F YDL+PI V +T + FLT+V
Sbjct: 194 SKGERIESYQYTYAYKSYVSISHSGRI--MPAIWFRYDLTPITVKYTRRGIPLYSFLTSV 251
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
CAIVGG FTV+GI+D+ ++ +K E+GK S
Sbjct: 252 CAIVGGTFTVAGIVDSLVFTASEVF-RKFEMGKLS 285
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 49/92 (53%), Gaps = 1/92 (1%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT-SRG 66
+R D Y KI +D T +G VI+++S + +LF SE Y++ ++L VD S
Sbjct: 5 VRRFDIYRKIPKDLTQPTVTGAVISILSCFFISILFLSEFISYMSPELVSELYVDNPSSA 64
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
+ + ++ ++T L CS++ +D D G +
Sbjct: 65 DKIPVSINITLLKLDCSVVGLDIQDDMGRHEV 96
>gi|308494873|ref|XP_003109625.1| hypothetical protein CRE_07522 [Caenorhabditis remanei]
gi|308245815|gb|EFO89767.1| hypothetical protein CRE_07522 [Caenorhabditis remanei]
Length = 286
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 103/190 (54%), Gaps = 18/190 (9%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 257
GC E+NKV GNFH + + A Q D++++ H I+ + FG+
Sbjct: 110 GCRFESRFEINKVPGNFHLSTHSA------------ATQPDNYDMRHTIHSIKFGDDVSH 157
Query: 258 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 316
+ G +PL +QE ++Y +K+VP+V+ D SG+ + S Q++ +
Sbjct: 158 KNLKGSFDPLANRDTSQENGLNTHEYILKIVPSVHEDYSGNILNSYQYTFGHKSYITYHH 217
Query: 317 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+ +P V+F Y+L PI + TE+ SF FLT++CA+VGG FTV+GIID+ + +
Sbjct: 218 SGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTISELV 277
Query: 377 KKKIEIGKFS 386
KK+ ++GK +
Sbjct: 278 KKQ-QMGKLT 286
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 53/116 (45%), Gaps = 1/116 (0%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV-DT 63
M IR D YPKI +D T +G VI+++ + + F+++ Y+ ++ + D
Sbjct: 1 MLDIRRFDIYPKIPKDLTQPTTAGAVISMLCVAFIAFMIFNDVLAYIFIDLRSEFFIDDP 60
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESR 119
R + + +V+FP + C L VD D +G + K + G ESR
Sbjct: 61 GREGKIDVQVNVSFPHMACEYLGVDIQDENGRHEVGFIDHTNKVPIGDGGCRFESR 116
>gi|167383125|ref|XP_001736415.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165901233|gb|EDR27345.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 116
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 75/115 (65%), Gaps = 9/115 (7%)
Query: 262 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQG 316
+VNP+DG+ T + MYQYF++VVP YT + I +N +SVTEH+R S EQG
Sbjct: 1 MVNPMDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNRIINTNGYSVTEHYRPGNLKSPEQG 60
Query: 317 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 371
+PGVF YD+S I+V + EE SF H LT++C I+GGVF + ++D FI+H
Sbjct: 61 ----IPGVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFSLLDYFIFH 111
>gi|451847161|gb|EMD60469.1| hypothetical protein COCSADRAFT_98785 [Cochliobolus sativus ND90Pr]
Length = 395
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 163/385 (42%), Gaps = 77/385 (20%)
Query: 8 IRSLDAYPKINEDFY--SRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+ S DA+PK + + R S +TL+ I + L +SE+ + T ++
Sbjct: 22 VSSFDAFPKTKKTYLVQGRNSSAWTVTLI--ITCIYLTWSEIARWYAGTTTQSFTIEKGV 79
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
++IN D+ A+ C+ L V+ D +G++ L ++ +K S S+ G
Sbjct: 80 SHDMQINLDIIV-AMKCADLHVNMQDAAGDRTL--AGELLRKDPTSW-----SQWTGKNT 131
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
K L G+ E + YG + E + +A +KK P L
Sbjct: 132 EKGTHEL----GKDETTQIPEWEEYG---------DVHEHLGKATKKK--FSKTPKL--- 173
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 244
+ C IYG L NKV G+FH A G + + G H+ + SF
Sbjct: 174 -----------RGPTDSCRIYGNLVGNKVQGDFHITARGHGYMEFGEHL------EHSSF 216
Query: 245 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTD-------- 293
N SH I +++FG ++P + NPLD TP+ +QY++ +VPT+YTD
Sbjct: 217 NFSHIIREMSFGPYYPSLTNPLDSTIAVTPTPADHFYKFQYYLSIVPTIYTDDPALMPIM 276
Query: 294 ---VS------------GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFT 338
VS H I++NQ++VT S + +PG+F +D+ PI +
Sbjct: 277 ESMVSTNDQPSSNMFRMAHAIKTNQYAVTSQ---SHKVDDSYVPGIFVKFDIEPIMLAIV 333
Query: 339 EEHVSFLHFLTNVCAIVGGVFTVSG 363
EE SF + + +V GV G
Sbjct: 334 EESKSFWKLVITLVNVVSGVMVAGG 358
>gi|169614774|ref|XP_001800803.1| hypothetical protein SNOG_10535 [Phaeosphaeria nodorum SN15]
gi|111060809|gb|EAT81929.1| hypothetical protein SNOG_10535 [Phaeosphaeria nodorum SN15]
Length = 404
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 95/387 (24%), Positives = 158/387 (40%), Gaps = 81/387 (20%)
Query: 8 IRSLDAYPKINEDFY--SRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+ S DA+PK + + R S +TL+ + + L +SE+ + T V+
Sbjct: 22 VSSFDAFPKTKKTYLVQGRNSSAWTVTLI--LTCIYLSWSEITRWYAGSTTQSFSVEKGV 79
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
++IN D+ A+ C L V+ D +G++ L G+++ R D
Sbjct: 80 SHDMQINLDIIV-AMNCHDLRVNMQDAAGDRTL-------------AGDLL--RNDPTNW 123
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
Q G ++E G G E+ + E++ +A ++K
Sbjct: 124 S------QWTGRKMEKGMHELGKDDGVNPGWEELWDVHEQLGKAKKRK------------ 165
Query: 186 CKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRD 242
+ G + C I+G L+ NKV G+FH A G + + G D
Sbjct: 166 ------FSKTPRVRGAPDACRIFGSLDGNKVQGDFHITARGHGYQEFGEQHLD-----HK 214
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSG--- 296
+FN SH I +++FG ++P + NPLD T T +QY++ +VPT+YTD G
Sbjct: 215 TFNFSHIIREMSFGPYYPSLTNPLDNTIATTPTDQDHFYKFQYYLSIVPTIYTDNPGLLP 274
Query: 297 --------------------HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 336
H I++NQ++VT + + +PGVF +D+ PI +
Sbjct: 275 LLESVNRDPSAHPAKSIFSTHAIKTNQYAVTSQSHTVPE---NYVPGVFVKFDIEPIMLA 331
Query: 337 FTEEHVSFLHFLTNVCAIVGGVFTVSG 363
EE F L + +V GV G
Sbjct: 332 VVEEWGGFWRLLVRIVNVVSGVMVAGG 358
>gi|403216157|emb|CCK70655.1| hypothetical protein KNAG_0E04020 [Kazachstania naganishii CBS
8797]
Length = 351
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 87/364 (23%), Positives = 167/364 (45%), Gaps = 68/364 (18%)
Query: 16 KINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDV 75
K E + ++ GG+ ++++ + ++ + +SE Y + + +VD+ E + +N DV
Sbjct: 7 KTEEQYKQKSSKGGLTSILTYLFLIFIAYSEFGSYFGGYLDQQYIVDSELREDVELNLDV 66
Query: 76 TFPALPCSILSVDAMDISGE-----QHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK 130
F +PC + V+ D + + + L + F D++ N I I P++D+
Sbjct: 67 -FVHMPCDFIHVNVRDSTFDRKIVSEELKFEDMPFFIPYDTKVNDIPE----IITPEMDE 121
Query: 131 PLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK-----GWALSNPDLIDQ 185
L E + ++R+K + ++PD
Sbjct: 122 IL------------------------------GEAIPASFREKVDMRLYYDENDPDTHHH 151
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
E GC+I+G + VN+V G F G+ D+ A ++ N
Sbjct: 152 LP-----------EFNGCHIFGSIPVNRVRGEFQIT------AKGLGYRDMNAAPKEKIN 194
Query: 246 ISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
+H IN+ +FG+ +P + NPLD ++ ++ P + Y++ VVPT+Y + G + +NQ+
Sbjct: 195 FAHVINEWSFGDFYPYIDNPLDATAKFDKDDPLTAFVYYLSVVPTIYQKL-GAEVDTNQY 253
Query: 305 SVTEH-FRSSEQGRLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG-GVFT 360
SV+E+ F S+++ T +PG+FF Y+ + + T+ +SFL F+ + AI+ V+
Sbjct: 254 SVSEYRFNSTDKTFRDTGYVPGIFFRYNFESLSIVMTDRRLSFLQFIVRLVAIMSFAVYI 313
Query: 361 VSGI 364
S I
Sbjct: 314 ASWI 317
>gi|195469521|ref|XP_002099686.1| GE16580 [Drosophila yakuba]
gi|194187210|gb|EDX00794.1| GE16580 [Drosophila yakuba]
Length = 430
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 180/378 (47%), Gaps = 29/378 (7%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + T GG ++L+S ++++ L ++EL Y + ET ++ D +
Sbjct: 19 KNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELHYYWH---ETDIVYQFEPDIA 75
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKKRLDSQGNV--IESRQD 121
E ++++ D+T A+PC+ LS VD MD + + D+F + V S D
Sbjct: 76 LDEQVQMHVDITV-AMPCASLSGVDLMD-------ETQQDVFAYGTLQREGVWWTMSEHD 127
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ I +Q H R + + A+ +D + VRE+ + A
Sbjct: 128 RLQFEAIQ--MQNHYLREQFHSV-------ADVLFKDIMRDPHPVRESASQMPAAPPPGA 178
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
L G E + + C ++G L +NKVAG H G H ++ +R
Sbjct: 179 LPLAVDLLGQHNVQPESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRR 238
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N +H+IN+L+FG++ +V PL+G + QYF+KVVPT + TI +
Sbjct: 239 MPANFTHRINRLSFGQYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTINA 297
Query: 302 NQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Q++VTE+ R + R PG++F YD S +K+ + + F +C+I+ G+
Sbjct: 298 FQYAVTENVRKLDSERNSYGSPGIYFKYDWSALKIMVDNDRDHLVTFAIRLCSIISGIIV 357
Query: 361 VSGIIDAFIYHGQRAIKK 378
+SG I+A + QR + +
Sbjct: 358 ISGAINALLLGIQRRLLR 375
>gi|442614645|ref|NP_001259099.1| CG4293, isoform E [Drosophila melanogaster]
gi|440216271|gb|AGB94945.1| CG4293, isoform E [Drosophila melanogaster]
Length = 439
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 168/352 (47%), Gaps = 31/352 (8%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + + GG ++L+S ++++ L ++EL Y + ET ++ D +
Sbjct: 19 KNLDAFKKVPEKYTETSEIGGTLSLLSRLLIVYLVYTELHYYWH---ETDIVYQFEPDIA 75
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKK-RLDSQGNVIE-SRQD 121
E ++++ D+T A+PC+ LS VD MD + + D+F L +G E S D
Sbjct: 76 LDEQVQMHVDITV-AMPCASLSGVDLMD-------ETQQDVFAYGTLQREGVWWEMSEHD 127
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ I +Q H R E + A+ +D + RE+ K A
Sbjct: 128 RLQFQAIQ--IQNHYLREEFHSV-------ADVLFKDIMRDNHPARESASKAPAAPPPGA 178
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
L G E + + C ++G L +NKVAG H G H ++ +R
Sbjct: 179 LPLSVDLHGRHNVQPESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRR 238
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQ 300
N +H+IN+L+FG++ +V PL+G + QYF+KVVPT ++ + TI
Sbjct: 239 MPANFTHRINRLSFGQYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFT--TIY 296
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
+ Q++VTE+ R E+ + PG++F YD S +K+ + + F +C
Sbjct: 297 AFQYAVTENVRKLERNSYGS-PGIYFKYDWSALKIIVRNDRDHLVTFAIRLC 347
>gi|281206876|gb|EFA81060.1| DUF1692 family protein [Polysphondylium pallidum PN500]
Length = 344
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 104/221 (47%), Gaps = 19/221 (8%)
Query: 5 MNKIRSLDAYPKINEDF-YSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ ++ D YPK+++ +T GGVIT + I + L SEL Y + + L VD
Sbjct: 97 LETMKLFDFYPKLDDSVPMQKTVYGGVITAICMIFTMFLLCSELYYYTFPIRDHSLKVDV 156
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMD-ISGEQHLDVKHDIFKKRLDSQGNVIESRQDG 122
+RG L IN D+ FP+L CS ++V+++D I G D + I ++RLD G VI+
Sbjct: 157 TRGNRLLINIDIHFPSLICSDINVESIDGIDGRPIKDASYQIVRERLDRNGVVIDPSNPP 216
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
G + RL N Y A + CCN C+++RE YR D
Sbjct: 217 PGF------FECVSCRLPANSKY------AVLYPQRCCNKCDDLREFYRTNKIPQHYADQ 264
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPG 223
QC + E E EGC IYG L V K+ G+ H G
Sbjct: 265 SPQC-----MISDPEAEDEGCRIYGTLWVQKMKGDIHILAG 300
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 24/37 (64%)
Query: 322 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
PG++F YDLSP+ + + F+ +T+VCAI G +
Sbjct: 308 PGIYFKYDLSPLMIEVDQSSKPFVELVTSVCAIGGDI 344
>gi|313227239|emb|CBY22386.1| unnamed protein product [Oikopleura dioica]
Length = 380
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 104/398 (26%), Positives = 175/398 (43%), Gaps = 84/398 (21%)
Query: 5 MNKIRSLDAYPKINEDFYS-RTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL--- 60
+ K R LDA+ KI E+ S +T GGV T+V+ +MLLL E+ ++ T TK+
Sbjct: 22 LEKFRELDAFTKITEEAESPQTSHGGVCTMVTFTIMLLLLLGEMTVWF---TTTKIKYEF 78
Query: 61 -VDTSRGETLRINFDVTFPALPCSILSVDAMDISGE--------QHLDVKHDIFKKRLDS 111
VD+ + +N D+TF + PC ++S + +D SG+ Q ++ K++
Sbjct: 79 DVDSEYESKMHLNMDITFNS-PCHMISAEIVDSSGDAWGYSFQLQEDAADFELTKEKALE 137
Query: 112 QGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYR 171
+ +++ ++ + P + L R G ++H E R
Sbjct: 138 RAKLLKMKE-SMTDPNMRDQLLREGHDVKH-------------------------LEFSR 171
Query: 172 KKGWALSNPDLIDQCKREGFLQ-RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG 230
KK N +++Q +Q + E +GC ++G +E+ K+AG ++ G
Sbjct: 172 KK-----NKKMMEQGMMHKVVQINLDPNEPQGCRVWGSVELQKIAGTIKI---QAGGFGG 223
Query: 231 VHVHDILAFQRDSF---------------------NISHKINKLAFGEHFPGVVNPLDGV 269
+ L+ D+ N SH+I+ +FG+ G+V LDG
Sbjct: 224 MGGIPGLSGGLDAIMGMFMMPMMGMGAQIQDGKKANFSHRIDHFSFGDPSSGLVYGLDGD 283
Query: 270 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN--QFSVTEHFRSSEQGRLQTLPGVFFF 327
QE + Y +KVVP TD+ Q Q++VT+H S++ P V
Sbjct: 284 IQIQEKENDDTTYVVKVVP---TDLKTFKFQQKAYQYAVTQHVGKSDK------PAVTIK 334
Query: 328 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
YD S + V+ TE SF+ LT + I+GG+ SGI+
Sbjct: 335 YDFSGLGVSITEYRESFVGLLTRLAGILGGIAASSGIL 372
>gi|162852511|emb|CAO03348.2| ERGIC and golgi 3 [Homo sapiens]
Length = 118
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 58/111 (52%), Positives = 79/111 (71%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ DAYPK EDF +T G +T+VS ++MLLLF SEL+ YL +L VD SRG+
Sbjct: 1 LKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGD 60
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIES 118
L+IN DV FP +PC+ LS+DAMD++GEQ LDV+H++FK+RLD G + S
Sbjct: 61 KLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSS 111
>gi|154415829|ref|XP_001580938.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121915161|gb|EAY19952.1| hypothetical protein TVAG_402060 [Trichomonas vaginalis G3]
Length = 359
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 154/362 (42%), Gaps = 37/362 (10%)
Query: 6 NKIRSLDAYPKI-NEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
N ++ LD + K + +F T G ++ + SI+ ++L F+EL Y + LL
Sbjct: 3 NLLKELDIFDKFADAEFALHTIGGKFMSAIFSIIAVILIFAELFNYTKPIVYRDLLNIPQ 62
Query: 65 RGETLRINFDVTFP-ALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
+ +NF + ALPC L DA+D G + LDV +DI KR+ I+ + +
Sbjct: 63 LDKDNTVNFTFSIQVALPCFFLHFDALDSIGVEMLDVSNDIKFKRMSVDNRFIDYSNESL 122
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
+ C C+G + E CCN C+EV+ + +G NP
Sbjct: 123 -------------------KDICLPCHGLKPEGE-CCNTCDEVKAIFEARGEDF-NPLPF 161
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKS--FHQSGVHVHDILAFQR 241
DQC K++ E C I G + K G FH APG++ F ++G H HD
Sbjct: 162 DQCMGN---VNFKKDMSESCLIEGTIHTFKSPGQFHIAPGRNTKFRRTG-HQHDTGLSPE 217
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDG--VRWTQETPS-GMYQYFIKVVPTVYTDVSGHT 298
S H I++ G+ + V +P+ G R P +Y FI V + D +T
Sbjct: 218 AS--CPHTIHEFYVGQKYDNVRSPIRGKIFRDRDSLPRIYLYDLFITKVLHTFNDALQYT 275
Query: 299 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 358
S ++S + G PG++F Y SP+ + + + FL ++ G+
Sbjct: 276 --SYEYSYNLGAKIFNPGSFYQ-PGIYFKYMFSPMTIVERSISKNPMRFLVTSVGVLAGI 332
Query: 359 FT 360
F
Sbjct: 333 FA 334
>gi|451997913|gb|EMD90378.1| hypothetical protein COCHEDRAFT_27091 [Cochliobolus heterostrophus
C5]
Length = 395
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/403 (25%), Positives = 164/403 (40%), Gaps = 80/403 (19%)
Query: 8 IRSLDAYPKINEDFY--SRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+ S DA+PK + + R S +TL+ I + L +SE+ + T ++
Sbjct: 22 VSSFDAFPKTKKTYLVQGRNSSAWTVTLI--ITCIYLTWSEIARWYAGTTTQSFTIEKGV 79
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
++IN D+ A+ C+ L V+ D +G++ L ++ +K S
Sbjct: 80 SHDMQINLDIIV-AMKCADLHVNMQDAAGDRTL--AGELLRKDPTSWS------------ 124
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
Q G E G D EE + + G +
Sbjct: 125 -------QWTGKNTEKGTHELGK------DDTTQIPEWEEYGDVHEHLG----------K 161
Query: 186 CKREGFLQRIK-EEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDS 243
++ F + K + C IYG L NKV G+FH A G + + G H+ S
Sbjct: 162 ATKKKFSKTPKLRGPTDSCRIYGNLVGNKVQGDFHITARGHGYMEFGEHL------DHSS 215
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTD------- 293
FN SH I +++FG ++P + NPLD TP+ +QY++ +VPT+YTD
Sbjct: 216 FNFSHIIREMSFGPYYPSLTNPLDSTIAVTPTPADHFYKFQYYLSIVPTIYTDDPSLMPL 275
Query: 294 ----VS------------GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTF 337
VS H I++NQ++VT S + +PG+F +D+ PI +
Sbjct: 276 MESVVSTNDQPSSNMFRMAHAIKTNQYAVTSQ---SHKVDDTYVPGIFVKFDIEPIMLAI 332
Query: 338 TEEHVSFLHFLTNVCAIVGGVFTV-SGIIDAFIYHGQRAIKKK 379
EE SF L + +V GV S + F + + K+K
Sbjct: 333 VEESKSFWKLLITLVNVVSGVMVAGSWVWQMFDWASEFVGKRK 375
>gi|346469653|gb|AEO34671.1| hypothetical protein [Amblyomma maculatum]
Length = 285
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/215 (34%), Positives = 109/215 (50%), Gaps = 24/215 (11%)
Query: 181 DLIDQCKRE--GFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
D+ D R GF++ K G GC G ++KV GNFH S H +
Sbjct: 86 DIQDDMGRHEVGFVENTEKTPVGSGCRFEGKFFIHKVPGNFHV----STHAA-------- 133
Query: 238 AFQRDSFNISHKINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 293
A Q + +++H I+ L FG + G N LD + + + Y +K+VPTVY
Sbjct: 134 AKQPEKIDMTHIIHDLTFGVKMTDEVKGSFNSLDEMDKSGGNGIESHDYVMKIVPTVYEK 193
Query: 294 VSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
G I+S Q++ + S GR+ +P ++F YDL+PI V +T V FLT+V
Sbjct: 194 SRGERIESYQYTYAYKSYVSISHTGRI--MPAIWFRYDLTPITVKYTRRGVPLYSFLTSV 251
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
CAIVGG FTV+GI+D+ I+ +K E+GK S
Sbjct: 252 CAIVGGTFTVAGIVDSLIFTASEVF-RKFEMGKLS 285
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 49/92 (53%), Gaps = 1/92 (1%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT-SRG 66
+R D Y KI +D T +G VI+++S + +LF SE Y++ ++L VD S
Sbjct: 5 VRRFDIYRKIPKDLTQPTVTGAVISILSCFFISILFLSEFISYMSPELVSELYVDNPSSA 64
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
E + ++ ++T L CS++ +D D G +
Sbjct: 65 EKIPVSINITLLKLDCSVVGLDIQDDMGRHEV 96
>gi|443920575|gb|ELU40475.1| endoplasmic reticulum-derived transport vesicle ERV46 [Rhizoctonia
solani AG-1 IA]
Length = 506
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/321 (26%), Positives = 135/321 (42%), Gaps = 49/321 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R DA+PK+ ++ +RT GG++T++ +++ +L ++L YL E + VD +
Sbjct: 15 VRQFDAFPKVRPNYKARTTGGGLMTVLVAVISFILVLNDLGDYLWGWREYEFTVDNNLAT 74
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ-HLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ +N D+ +PC LSVD D +G++ L +H F R+DG
Sbjct: 75 VMYVNVDLVV-NMPCHFLSVDLRDAAGDRLFLTDEHGGF-------------RRDG---- 116
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
S Y D + +EV A ++ L + +
Sbjct: 117 -------------------ATSAYALNFRDSKVSVSPQEVVSASKRSQRGLFS--SFKKP 155
Query: 187 KREGFLQRIKE-EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
K F + C ++G + V KV N H ++S H L N
Sbjct: 156 KDPTFRPTYNHIPDASACRVFGTVAVKKVTANLHITTLGHGYRSAEHTDHTL------MN 209
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 305
++H IN+ +FG P + PLD +QYFI VVPT Y + +NQ+S
Sbjct: 210 LTHVINEFSFGPFIPDLSQPLDYSFEVTHEHFTAFQYFITVVPTTYQVPGQDPLHTNQYS 269
Query: 306 VTEHFRSSEQGRLQTLPGVFF 326
VT + R+ E GR PG+FF
Sbjct: 270 VTHYTRNIEHGR--GTPGIFF 288
>gi|17570549|ref|NP_508375.1| Protein Y102A11A.6 [Caenorhabditis elegans]
gi|351063407|emb|CCD71590.1| Protein Y102A11A.6 [Caenorhabditis elegans]
Length = 286
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 101/190 (53%), Gaps = 18/190 (9%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 257
GC E+NKV GNFH + + A Q +S+++ H I+ + FG+
Sbjct: 110 GCRFESRFEINKVPGNFHLSTHSA------------ATQPESYDMRHLIHSIKFGDDVSH 157
Query: 258 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 316
+ G +PL +QE ++Y +K+VP+V+ D SG + S Q++ +
Sbjct: 158 KNLKGSFDPLAKRNTSQENGLNTHEYILKIVPSVHEDYSGTILNSYQYTFGHKSYITYHH 217
Query: 317 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+ +P V+F Y+L PI + TE+ SF FLT++CA+VGG FTV+GIID+ + +
Sbjct: 218 SGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTISELV 277
Query: 377 KKKIEIGKFS 386
KK+ +GK +
Sbjct: 278 KKQ-RLGKLT 286
>gi|406607484|emb|CCH41148.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Wickerhamomyces ciferrii]
Length = 359
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/344 (24%), Positives = 146/344 (42%), Gaps = 65/344 (18%)
Query: 24 RTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFPALPCS 83
R+ G T++ + +L L + E+ + + + VD LRIN D+ A+PC+
Sbjct: 40 RSTKGSYSTIMMGLFILFLTWVEVGQFFGGEVDHQFRVDNKLQRDLRINLDIVV-AMPCN 98
Query: 84 ILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE 143
+ + D++ ++ L L H E
Sbjct: 99 FIHTNVKDLTDDRFL-------------------------------------ASELLHYE 121
Query: 144 TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 203
+ +DE+ +N ++ E + +I + + G K+ C
Sbjct: 122 GFSFFIPPGYKTDENYDSNTPDLDEVMAQ--------GIIAEFRDRG---DAKDSGAPAC 170
Query: 204 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 263
+IYG + VNKV+G+FH ++ H + D N +H I++ +FGE +P +
Sbjct: 171 HIYGSIPVNKVSGDFHITAQGYGYRGNSRSHVGI----DGLNFTHIISEFSFGEFYPYIH 226
Query: 264 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT--- 320
NPLD + YQY++ VVPTVY + G I++NQ+S +S Q +L +
Sbjct: 227 NPLDATVQITKEHLQSYQYYLSVVPTVYKKL-GVEIETNQYS------TSLQKKLYSFEN 279
Query: 321 --LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
+PG+FF YD PI + ++ + F FL + I GG+ V+
Sbjct: 280 KGVPGLFFKYDFEPISLIVEDKRIPFSTFLVRLATIYGGIIVVA 323
>gi|115433364|ref|XP_001216819.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114189671|gb|EAU31371.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 449
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 139/323 (43%), Gaps = 64/323 (19%)
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESR--QDGIGAP 126
L++N D+ +PC L V+ D +G++ L ++ K+ S ++ R + G+
Sbjct: 133 LQLNLDIVV-EMPCDTLDVNIQDAAGDRVL--AGELLKREPTSWQLWMDKRNYESYGGSH 189
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWALSNPDLI 183
+ Q GRLE A+ D + EVR RKK L D +
Sbjct: 190 EYQTLSQEDAGRLE-----------AQDEDAHVHHVLGEVRRNPRKKFPKSPKLRRGDAV 238
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRD 242
D C+ IYG LE NKV G+FH A G + H+
Sbjct: 239 DSCR-----------------IYGSLEGNKVQGDFHITARGHGYRDFAPHL------DHQ 275
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT---------- 292
+FN SH I +L+FG H+P ++NPLD ET +QYF+ VVPT+Y+
Sbjct: 276 TFNFSHMITELSFGPHYPTLLNPLDKTIAETETHYYKFQYFLSVVPTIYSKGNRVLDTYS 335
Query: 293 -------DVSGHT---IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 342
D S H + +NQ++ T + + +PG+FF Y++ PI + +EE
Sbjct: 336 IAPPTLHDNSRHNKNLVFTNQYAATSQSDALPESPF-FVPGIFFKYNIEPILLLISEERG 394
Query: 343 SFLHFLTNVCAIVGGVFTVSGII 365
SFL L + V GV G +
Sbjct: 395 SFLSLLIRLVNTVSGVMVTGGWL 417
>gi|366987569|ref|XP_003673551.1| hypothetical protein NCAS_0A06100 [Naumovozyma castellii CBS 4309]
gi|342299414|emb|CCC67168.1| hypothetical protein NCAS_0A06100 [Naumovozyma castellii CBS 4309]
Length = 355
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 157/372 (42%), Gaps = 67/372 (18%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R DA+PK E ++ GGV T++ I + + +SE Y + +VD E
Sbjct: 6 LRVFDAFPKTEEQHEKKSTKGGVSTILIYIFAIFIAWSEFGSYFGGFVGERYVVDGDVKE 65
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK---------RLDSQGNVIES 118
T+ IN D+ F +PC ++V+ D + ++ L + F++ R++ +I
Sbjct: 66 TVSINMDL-FVNIPCKWITVNVRDQTMDRKLASEELNFEEMPFFIPFDVRINDIAEIITP 124
Query: 119 RQDGIGAPKIDKPL-QRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
+ D I I ++ R+ ++E +D + NN
Sbjct: 125 QLDEILGEAIPAEFREKLDTRMYYDE-----------NDPETYNNL-------------- 159
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
PD GC+I+G L VN+VAG G D
Sbjct: 160 --PDF------------------NGCHIFGSLPVNRVAGELQIT------AKGYGYADRE 193
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
D +H IN+ +FG+ +P + NPLD ++ ETP Y Y + V+PT + + G
Sbjct: 194 RTPMDQIKFNHVINEFSFGDFYPYIDNPLDKSAKFDLETPKTAYSYDLSVIPTTFRKL-G 252
Query: 297 HTIQSNQFSVTEHF---RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
+ + Q+SV E+ + S R +PG+FF Y+ + + ++ ++F+ F+ + A
Sbjct: 253 TEVNTFQYSVAEYHYKGKDSPVPRSGRVPGIFFDYNFESLSIIVSDSRLNFIQFIIRLIA 312
Query: 354 IVGGVFTVSGII 365
I+ ++ I
Sbjct: 313 ILSFALYIASWI 324
>gi|402591333|gb|EJW85263.1| hypothetical protein WUBG_03826, partial [Wuchereria bancrofti]
Length = 244
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 101/190 (53%), Gaps = 18/190 (9%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP- 260
GC + G E++KV GNFH + H D Q +++++ H I+ + FG+
Sbjct: 68 GCRLEGKFEISKVPGNFHIS---------THAADT---QPETYDMRHTIHSVVFGDDIST 115
Query: 261 ----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 316
G NPL + S + Y +K+VP+VY D++G+ S Q++ +
Sbjct: 116 SQNLGSFNPLKNREALESDGSFTHDYVLKIVPSVYEDITGNKKYSYQYTYAHKEYVTYHY 175
Query: 317 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+ +P ++F Y+L PI + +TE F F+T++CA+VGG FTV+GIIDA ++ +
Sbjct: 176 SGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLF-SLTEL 234
Query: 377 KKKIEIGKFS 386
+K ++GK S
Sbjct: 235 YRKHQMGKLS 244
>gi|154286632|ref|XP_001544111.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407752|gb|EDN03293.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 315
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 145/335 (43%), Gaps = 64/335 (19%)
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK 130
+N D+ A+PC L V+ D +G++ L D+ K+ S +G+ +
Sbjct: 1 MNLDIVV-AMPCDALRVNVQDAAGDRIL--ASDLLDKQQTSWA-AWNRELNGVTS----- 51
Query: 131 PLQRHGGRLEH---NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
GG E+ NE E+ D + E + +Y++K P L
Sbjct: 52 -----GGGREYQTLNEEDLSRLMEQEA-DAHVGHALGEAKRSYKRK--FPKGPKLK---- 99
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 246
+ E+ + C IYG LE NKV G+FH A G + + G H+ D+FN
Sbjct: 100 --------RGEKADSCRIYGSLEGNKVQGDFHITARGHGYFEFGEHL------SHDAFNF 145
Query: 247 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVS--------- 295
SH + +L+FG H+P ++NPLD + TP+ +QY++ VVPT+YT
Sbjct: 146 SHMVTELSFGPHYPSLLNPLD--KTISVTPARFFKFQYYLSVVPTIYTRAGIVDPYNHVL 203
Query: 296 -----------GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 344
G TI +NQ++ T + +PG+FF Y++ PI + +EE S
Sbjct: 204 PDPTTIRPSERGSTIFTNQYAATSQSHEVPDPQYH-IPGIFFKYNIEPILLVVSEERGSL 262
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
L L + ++ GV G + +KK+
Sbjct: 263 LALLVRLVNVLAGVVVAGGWLFQISTWAMENLKKR 297
>gi|409048375|gb|EKM57853.1| hypothetical protein PHACADRAFT_116248 [Phanerochaete carnosa
HHB-10118-sp]
Length = 546
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 141/347 (40%), Gaps = 51/347 (14%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+ I DA+PK+ + +R+ G +T+ + + LL ++L Y+ + + VD
Sbjct: 20 VSTPIAEFDAFPKLPSTYKARSEGRGFLTVFVTFMAFLLVLNDLGEYIWGWPDHEFSVDR 79
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQ-HLDVKHDIFKKRLDSQGNVIESRQDG 122
R LRIN D+ +PC LSVD D G++ +L D F++ G + + Q
Sbjct: 80 DRSSDLRINVDMLV-NMPCQYLSVDLRDAVGDRLYLS---DSFRR----DGTLFDIGQAT 131
Query: 123 IGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL 182
L+ H L + S N R Y K
Sbjct: 132 A--------LKEHAAALSARQVVTQSRKSRGLFATLFRRNSGGFRPTYNYK--------- 174
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-HDILAFQR 241
G C +YG + V KV N H + S HV H+++
Sbjct: 175 ---------------PSGSACRVYGSVAVKKVTANLHVTTLGHGYASRQHVDHNLM---- 215
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N+SH I + +FG +FP + PLD E YQY++ VVPT Y + +
Sbjct: 216 ---NLSHVITEFSFGPYFPDITQPLDNSFELTEDSFVSYQYYLHVVPTTYIAPRSRPLHT 272
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
+Q+SVT + R + +PG+FF +D+ P+ +T + S L L
Sbjct: 273 HQYSVTHYTRVLKHN--NGIPGIFFKFDVDPMSLTIHQRTTSLLQLL 317
>gi|313241668|emb|CBY33893.1| unnamed protein product [Oikopleura dioica]
Length = 380
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/398 (25%), Positives = 174/398 (43%), Gaps = 84/398 (21%)
Query: 5 MNKIRSLDAYPKINEDFYS-RTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL--- 60
+ K R LDA+ KI E+ S +T GGV T+ + +MLLL E+ ++ T TK+
Sbjct: 22 LEKFRELDAFTKITEEAESPQTSHGGVCTMFTFTIMLLLLLGEMTVWF---TTTKIKYEF 78
Query: 61 -VDTSRGETLRINFDVTFPALPCSILSVDAMDISGE--------QHLDVKHDIFKKRLDS 111
VD+ + +N D+TF + PC ++S + +D SG+ Q ++ K++
Sbjct: 79 DVDSEYESKMHLNMDITFNS-PCHMISAEIVDSSGDAWGYSFQLQEDAADFELTKEKALE 137
Query: 112 QGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYR 171
+ +++ ++ + P + L R G ++H E R
Sbjct: 138 RAKLLKMKE-SMTDPNMRDQLLREGHDVKH-------------------------LEFSR 171
Query: 172 KKGWALSNPDLIDQCKREGFLQ-RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG 230
KK N +++Q +Q + E +GC ++G +E+ K+AG ++ G
Sbjct: 172 KK-----NKKMMEQGMMHKVVQINLDPNEPQGCRVWGSVELQKIAGTIKI---QAGGFGG 223
Query: 231 VHVHDILAFQRDSF---------------------NISHKINKLAFGEHFPGVVNPLDGV 269
+ L+ D+ N SH+I+ +FG+ G+V LDG
Sbjct: 224 MGGIPGLSGGLDAIMGMFMMPMMGMGAQIQDGKKANFSHRIDHFSFGDPSSGLVYGLDGD 283
Query: 270 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN--QFSVTEHFRSSEQGRLQTLPGVFFF 327
QE + Y +KVVP TD+ Q Q++VT+H S++ P V
Sbjct: 284 IQIQEKENDDTTYVVKVVP---TDLKTFKFQQKAYQYAVTQHVGKSDK------PAVTIK 334
Query: 328 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
YD S + V+ TE SF+ LT + I+GG+ SGI+
Sbjct: 335 YDFSGLGVSITEYRESFVGLLTRLAGILGGIAASSGIL 372
>gi|363748002|ref|XP_003644219.1| hypothetical protein Ecym_1151 [Eremothecium cymbalariae
DBVPG#7215]
gi|356887851|gb|AET37402.1| hypothetical protein Ecym_1151 [Eremothecium cymbalariae
DBVPG#7215]
Length = 340
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/363 (24%), Positives = 154/363 (42%), Gaps = 59/363 (16%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +R+ DA+PK + ++ GG+ ++V + +L + +SE Y + + +VD
Sbjct: 1 MPSLRTFDAFPKTEQQHVKKSSKGGLTSIVIYLFLLFIAWSEFGSYFGGYIDEQYIVDDE 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
T +IN ++ + +PC L V A D +G+
Sbjct: 61 IRTTAQINMNI-YVKMPCKYLEVTARDQTGD----------------------------- 90
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSD--EDCCNNCEEVREAYRKKGWALSNPDL 182
LQ RL + + YG + ++ + + +++ + P+L
Sbjct: 91 -------LQIVSERLNFQDIHFRVPYGTKMTEFNDVISPDLDDILADAIPAQFTSDMPEL 143
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQR 241
I+ +GC+IYG + VNKV+G A G ++ + +L
Sbjct: 144 ----------PMIEGINFDGCSIYGSVPVNKVSGELQITAKGWTYMSTRRTPFSVL---- 189
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N SH IN+L+FG+ FP + N LDGV + P Y YF V+PT Y + G + +
Sbjct: 190 ---NFSHVINELSFGDFFPYIDNTLDGVGRIADEPLKAYYYFTSVLPTAYKKM-GAEVHT 245
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
NQ+SV +SS L G+ Y+ +KV +E + F F+ + AI+ V +
Sbjct: 246 NQYSVDAIEKSSSSHALGP-TGITISYNFEALKVIIKDERIGFTQFIVRLVAILSFVVYL 304
Query: 362 SGI 364
+ +
Sbjct: 305 ASL 307
>gi|225712562|gb|ACO12127.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Lepeophtheirus salmonis]
Length = 290
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 71/218 (32%), Positives = 109/218 (50%), Gaps = 25/218 (11%)
Query: 181 DLIDQCKRE--GFLQRIKE---EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
D+ D R GF++ + +G GC +NKV GNFH + H D
Sbjct: 86 DIQDDMGRHEVGFVENTAKTPIHDGVGCLFEAHFHINKVPGNFHVS---------THSVD 136
Query: 236 ILAFQRDSFNISHKINKLAFGEHFP-------GVVNPLDGVRWTQETPSGMYQYFIKVVP 288
+ Q D +N SH+I++++FG G N L G ++ ++Y +K+VP
Sbjct: 137 V---QPDEYNFSHEIHEVSFGSKIKKISSKNIGTFNSLSGRDSSESGALDSHEYVMKIVP 193
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
T Y + G + + Q++ S + +P ++F YDL+PI V + E HFL
Sbjct: 194 TTYESLGGAKLFAYQYTYAYRSYVSFGHGGRVVPALWFRYDLNPITVKYHETRPPIYHFL 253
Query: 349 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
T VCAIVGG FTV+GIID+ ++ + + KK E+GK S
Sbjct: 254 TTVCAIVGGTFTVAGIIDSTLFTATQ-LFKKFELGKLS 290
>gi|18921097|ref|NP_569847.1| CG4293, isoform A [Drosophila melanogaster]
gi|24638890|ref|NP_726677.1| CG4293, isoform B [Drosophila melanogaster]
gi|85724768|ref|NP_001033816.1| CG4293, isoform D [Drosophila melanogaster]
gi|85724770|ref|NP_001033817.1| CG4293, isoform C [Drosophila melanogaster]
gi|2961397|emb|CAA18090.1| EG:65F1.1 [Drosophila melanogaster]
gi|7290051|gb|AAF45518.1| CG4293, isoform A [Drosophila melanogaster]
gi|7290052|gb|AAF45519.1| CG4293, isoform B [Drosophila melanogaster]
gi|15292011|gb|AAK93274.1| LD35174p [Drosophila melanogaster]
gi|84798360|gb|ABC67159.1| CG4293, isoform C [Drosophila melanogaster]
gi|84798361|gb|ABC67160.1| CG4293, isoform D [Drosophila melanogaster]
gi|220955778|gb|ACL90432.1| CG4293-PA [synthetic construct]
Length = 441
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 165/352 (46%), Gaps = 29/352 (8%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + + GG ++L+S ++++ L ++EL Y + ET ++ D +
Sbjct: 19 KNLDAFKKVPEKYTETSEIGGTLSLLSRLLIVYLVYTELHYYWH---ETDIVYQFEPDIA 75
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKK-RLDSQGNVIE-SRQD 121
E ++++ D+T A+PC+ LS VD MD + + D+F L +G E S D
Sbjct: 76 LDEQVQMHVDITV-AMPCASLSGVDLMD-------ETQQDVFAYGTLQREGVWWEMSEHD 127
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ I +Q H R E + A+ +D + RE+ K A
Sbjct: 128 RLQFQAIQ--IQNHYLREEFHSV-------ADVLFKDIMRDNHPARESASKAPAAPPPGA 178
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
L G E + + C ++G L +NKVAG H G H ++ +R
Sbjct: 179 LPLSVDLHGRHNVQPESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRR 238
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
N +H+IN+L+FG++ +V PL+G + QYF+KVVPT + TI +
Sbjct: 239 MPANFTHRINRLSFGQYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTIYA 297
Query: 302 NQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
Q++VTE+ R + R PG++F YD S +K+ + + F +C
Sbjct: 298 FQYAVTENVRKLDSERNSYGSPGIYFKYDWSALKIIVRNDRDHLVTFAIRLC 349
>gi|341874049|gb|EGT29984.1| hypothetical protein CAEBREN_24080 [Caenorhabditis brenneri]
Length = 286
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 102/190 (53%), Gaps = 18/190 (9%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 257
GC E+NKV GNFH + + A Q +++++ H I+ + FG+
Sbjct: 110 GCRFESRFEINKVPGNFHLSTHSA------------ASQPENYDMKHIIHSIKFGDDVSH 157
Query: 258 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 316
+ G +PL QE ++Y +K+VP+V+ D SG+ + S Q++ +
Sbjct: 158 KNLKGSFDPLANRDSLQENGLSTHEYILKIVPSVHEDYSGNILNSYQYTFGHKSYITYHH 217
Query: 317 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+ +P V+F Y+L PI + TE+ SF FLT++CA+VGG FTV+GIID+ + +
Sbjct: 218 SGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTISELV 277
Query: 377 KKKIEIGKFS 386
KK+ ++GK +
Sbjct: 278 KKQ-QMGKLT 286
>gi|320580226|gb|EFW94449.1| COPii-coated vesicle-associated protein, putative [Ogataea
parapolymorpha DL-1]
Length = 901
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 148/360 (41%), Gaps = 74/360 (20%)
Query: 12 DAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRI 71
D+ KI R+ G T+++ + +L L + E+ Y++ + + VD + L I
Sbjct: 572 DSAAKIAPSQQVRSTRGSYSTIITYLFLLFLIWVEVGGYIDGAIDHQFTVDELVRKDLVI 631
Query: 72 NFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI------ESRQDGIGA 125
N D+ A+PC+ + + D++ ++ L + L+ QG E I
Sbjct: 632 NLDLVV-AMPCNYIHTNVRDLTDDRFLAAE------LLNYQGTTFNIPRWYEQSAKKIVT 684
Query: 126 PKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 185
P+++ L+R L+ Y G +
Sbjct: 685 PELEAVLERS---LQARFQYQGEHH----------------------------------- 706
Query: 186 CKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 245
+E C I+G + VN+V G H G D + N
Sbjct: 707 -----------DEGAPACRIFGAIPVNRVKGELHIT------AKGYGYRDRTRIPAEGLN 749
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 305
+H I++ +FGE FP + NPLD T + ++Y I VVPT+Y + G I +NQ+S
Sbjct: 750 FTHAISEFSFGEFFPYLDNPLDMTLKTTDAHLHTFKYHINVVPTLYRKL-GVEIDTNQYS 808
Query: 306 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+ S + + +PG+FF Y+ PIK+ E +SF F+ + I+GG+ V+G +
Sbjct: 809 L-----SLTESSGKYVPGIFFQYEFEPIKLVVEETRLSFWQFVVRLATIMGGILVVAGWL 863
>gi|403330686|gb|EJY64240.1| hypothetical protein OXYTRI_24846 [Oxytricha trifallax]
Length = 345
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 95/380 (25%), Positives = 167/380 (43%), Gaps = 77/380 (20%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+I+ D Y + +D ++SG +++ +M+ L S+ ++ +++L+D + G
Sbjct: 12 RIKFFDFYKDLPQDLAEPSWSGATVSMFVMGLMVALIISQTYSFMQFQRTSEILIDVNSG 71
Query: 67 ET-LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGA 125
+ L IN ++T PC +LS+D +D++G +DV + K LD
Sbjct: 72 NSKLNININITMHKAPCHVLSLDIVDVTGVHVMDVGGKLHKHSLD--------------- 116
Query: 126 PKIDKPLQRHGGRLEHNETYC-GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ G L H++T G + SSD V + YR A+
Sbjct: 117 --------KDGFYLGHHDTMDEGPEFKQASSD---------VNDIYRDTIKAM------- 152
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
++ EGC + G + +NKV GNFH S H G V I +
Sbjct: 153 -------------DDQEGCMVEGTVIINKVPGNFHL----STHSFGEVVQKIYMNGK-KL 194
Query: 245 NISHKINKLAFGE----------HFPGVVNPLDG--VRWTQETPSG--MYQYFIKVVPTV 290
+ +H +N L+FG+ + +DG V Q G + Y++ +
Sbjct: 195 DFTHTVNHLSFGDDKQMKSIQSKYNEKYTFDMDGTYVDQNQHLYQGQLLANYYLDINQVD 254
Query: 291 YTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
Y D +G + Q ++SS+ Q LP +FF Y+LSP+K+ +T + S+ F
Sbjct: 255 YLDATGIFYKLLQ---GFKYKSSKSIMAQMGLPAIFFRYELSPVKLQYTMTYKSWSEFFI 311
Query: 350 NVCAIVGGVFTVSGIIDAFI 369
+ AI+GG++ V+GII++F+
Sbjct: 312 EISAIIGGMYVVAGIIESFL 331
>gi|254572003|ref|XP_002493111.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv46p [Komagataella pastoris GS115]
gi|238032909|emb|CAY70932.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv46p [Komagataella pastoris GS115]
Length = 333
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 155/365 (42%), Gaps = 73/365 (20%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
IR DA+PK R+ G T++ +L L + E+ Y++ + + ++D +
Sbjct: 7 IRVFDAFPKTEPVNTVRSTKGSYSTILMGFFILFLIWVEIGGYVDGYIDRQFMLDRNIQR 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L IN D+ F A PC+ L H +VK DI + R +Q
Sbjct: 67 VLNINLDM-FVATPCNYL-----------HTNVK-DITQDRFLAQ--------------- 98
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQC 186
E + + N + +++R G L +D+
Sbjct: 99 -------------------------EQLNFEGVNFF--IPDSFRVNGDESQGSTLDLDEV 131
Query: 187 KREGFLQRIKEE------EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
RE L +E+ + C+I+G + VNKV G FH GK G D
Sbjct: 132 MRESALAEFREKKSFTHGDAPACHIFGSIPVNKVHGFFHIT-GK-----GYGYRDRSIVP 185
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
+++ N +H I++ +FGE +P + NPLD T + Y++ VVPT Y + G I
Sbjct: 186 KEALNFTHVISEFSFGEFYPYMNNPLDFTARTTNDHIHTFNYYLDVVPTEYKKL-GIVID 244
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+ Q+S+T +E L PG+FF Y PI ++ E+ +SF+ FL + I GG+
Sbjct: 245 TTQYSMT----VTELPGLSRPPGLFFNYQFEPIILSIEEKRISFVRFLVRLVTICGGIMV 300
Query: 361 VSGII 365
V+ I
Sbjct: 301 VAKWI 305
>gi|300123494|emb|CBK24766.2| unnamed protein product [Blastocystis hominis]
Length = 235
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 109/235 (46%), Gaps = 24/235 (10%)
Query: 84 ILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI---ESRQDGIGAPKIDKPLQ--RHGGR 138
++ V D G D++++I K LD GN I + Q + P ++ L+ +H
Sbjct: 1 MIQVGYRDALGNDRADIENEILKTNLDVNGNPIGKTDKSQVTVTVPTKEEVLENTKHDDD 60
Query: 139 LEHN---ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN-PDLIDQCKREGFLQR 194
+ CG C+GA+ E CCN CEE+ AYRKK W + QC +LQ+
Sbjct: 61 EIVVIDDKKECGDCFGAKEKSE-CCNTCEELIAAYRKKNWDVDRIKAQAPQCAGFNYLQK 119
Query: 195 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-----DSFNISHK 249
K GC + G L + KV G+ PG+ ++D+L+ +S N++H
Sbjct: 120 WKNGVERGCRLEGKLSITKVQGHVFIIPGR--------INDLLSNSEIRQIANSLNVTHT 171
Query: 250 INKLAFGEHFPGVVNPLDGVRWTQETP-SGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
I+ + GE P NP R + MYQYF+ +PT Y + SG ++S Q
Sbjct: 172 IHHFSLGEAIPEQKNPFVDHRGVMAVDHASMYQYFVNAIPTTYINKSGKELKSYQ 226
>gi|145510182|ref|XP_001441024.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408263|emb|CAK73627.1| unnamed protein product [Paramecium tetraurelia]
Length = 320
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 82/384 (21%), Positives = 159/384 (41%), Gaps = 76/384 (19%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M ++++D Y K+ + T SG V+++++ I + L+ SE+ Y+ +++++VD
Sbjct: 1 MKLLKAIDLYGKVPKGLAEPTSSGAVVSVLTLIFLGLMVMSEVIEYITIDVQSEIIVDQQ 60
Query: 65 RG-ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
+ ++++FD+ F PC L +D +QD +
Sbjct: 61 LSKDRVQVSFDIKFVRAPCDFLEID------------------------------QQDAM 90
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G + ++ R++ +E G + NN + +A
Sbjct: 91 GQSLSQQFMEFKYYRMDSSERRIGEYIRNQ-------NNWIVIEDA-------------- 129
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKS---------FHQSGVHVH 234
R E +GC + G L++N+V G F P +S H + H
Sbjct: 130 ----------RTAVAEKQGCEVVGSLKINRVKGKISFGPHRSHTYIGAVGNLHLPLDYSH 179
Query: 235 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV 294
++F N K+ + + ++ + S +++FI ++PT YT +
Sbjct: 180 KFVSFTFGDENALKKVKSMFKQGQLESLAGSQRIKKYELASQSMQHEHFIHIIPTHYTLL 239
Query: 295 SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+ T +SV ++ + + R V YD +P VT+ + LHFL +CA+
Sbjct: 240 NKQT-----YSVYQYTANHNEVRSHNYANVQLRYDFAPTTVTYWQTKEDILHFLVQICAV 294
Query: 355 VGGVFTVSGIIDAFIYHGQRAIKK 378
+GG+FTVS +I+A +Y R++ K
Sbjct: 295 IGGIFTVSSMIEASVYKVMRSVLK 318
>gi|449704125|gb|EMD44426.1| endoplasmic reticulumgolgi intermediate compartment protein,
putative [Entamoeba histolytica KU27]
Length = 185
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/172 (32%), Positives = 96/172 (55%), Gaps = 12/172 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ D Y K+ ED +R GG +T++ +++++L +E YL +LLVD R
Sbjct: 1 MKRFDTYGKVPEDLRTRHCFGGFLTIICVVIIIVLSIAEFAFYLQREVVPQLLVDRERSS 60
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAP 126
+ ++FD+TFP C I SVD + SGE + ++ ++ K R+ G+++ E+ I +
Sbjct: 61 KIPVHFDITFPYSSCPITSVDILTKSGESMIGIEQNVTKIRIHHDGSLVTENEMKAIQSK 120
Query: 127 -KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
I+ P + C SCYGAE+ ++ CC C++V+EAY+K+GW L
Sbjct: 121 LSIETP----------DPKECRSCYGAETPEKKCCFTCDDVKEAYKKRGWRL 162
>gi|351707253|gb|EHB10172.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Heterocephalus glaber]
Length = 211
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 78/137 (56%), Gaps = 10/137 (7%)
Query: 232 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 291
H H DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT
Sbjct: 80 HAHLAALVNHDSYNFSHRIDHLSFGELVPGIINPLDGTEKIAIDHNQMFQYFITVVPT-- 137
Query: 292 TDVSGHTIQ----SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
HT + ++QFSVTE R + + G+F YDLS + VT TEEH+ F
Sbjct: 138 ---KLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQ 194
Query: 347 FLTNVCAIVGGVFTVSG 363
F +C IVGG+F+ +G
Sbjct: 195 FFVRLCGIVGGIFSTTG 211
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 17/58 (29%), Positives = 33/58 (56%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+N ++ LDA+PK+ + + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 10 LNLVKELDAFPKVPQSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVD 67
>gi|123361353|ref|XP_001295947.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121875215|gb|EAX83017.1| hypothetical protein TVAG_111750 [Trichomonas vaginalis G3]
Length = 338
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 152/359 (42%), Gaps = 49/359 (13%)
Query: 31 ITLVSSIVMLLLFFSELRLYLNAVTETKLLVD---TSRGETLRINFDVTFPALPCSILSV 87
I+ S + L F+++ L + L D + R + + ++ + PC +L +
Sbjct: 4 ISQAMSFLSTFLIFAQIILMVTPKIHRDLSTDHIYSLRTDLVNVSLNFLINQ-PCEVLHL 62
Query: 88 DAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 147
D +D G + L V + +R++ + +E L + + C
Sbjct: 63 DILDSIGHKQLLVNDTLKWRRVNQEKGFME---------------------LYNKKKQCH 101
Query: 148 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 207
SCY + CCN CE+++E Y + P+ QCK E + K + E C++ G
Sbjct: 102 SCYDF-YDNRFCCNGCEKLKEIYHSNN-KTATPENWTQCKPEN---KQKFDPNEKCHVKG 156
Query: 208 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 267
+ VN+V G+FH A G+S G H H IL + H I L FG + P +PL
Sbjct: 157 KISVNRVPGSFHLAIGQSIEDYG-HQH-ILLDDYQTITFDHDIIDLRFGANIPMTSHPLR 214
Query: 268 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQ-----FSVTEHFRSSEQGRLQTLP 322
G +Y + + P V+ G I+ +S+T H +P
Sbjct: 215 GTHIKSTGEPLATEYNLIITPIVFY-ADGQYIEKGFEYVYFYSMTYHL----------VP 263
Query: 323 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 381
G++F+Y +P + T + SF FL + ++ G++ + ++ F+ + KKK+E
Sbjct: 264 GIYFYYSFTPYTIAVTWQSRSFRSFLISTGGLLSGIYAIFSMVSTFLEKSDQK-KKKVE 321
>gi|145549492|ref|XP_001460425.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428255|emb|CAK93028.1| unnamed protein product [Paramecium tetraurelia]
Length = 320
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 89/390 (22%), Positives = 162/390 (41%), Gaps = 88/390 (22%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M ++S+D Y K+ + T SG V+++++ I++ L+ +E Y+ +++++VD
Sbjct: 1 MKLLKSIDLYGKVPKGLAEPTSSGAVVSIITLILLALMIINEGIEYITIDVQSEIIVDQK 60
Query: 65 RG-ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
+ +++N D+ F PC L +D +QD +
Sbjct: 61 LSKDRVQVNLDIKFIKAPCDFLEID------------------------------QQDAM 90
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G + ++ RL+ NE S Y S NN E+ +A
Sbjct: 91 GQSLSQQFMELKYYRLDSNERRI-SEYTRNS------NNWVEIEDA-------------- 129
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
R E +GC + G L+VN+V G F +S+ G +
Sbjct: 130 ----------RTAINEKQGCEVIGNLKVNRVRGKISFGAHRSYSYIGA-----VGNLNLP 174
Query: 244 FNISHKINKLAFGEHFP----------GVVNPLDGVRWTQE----TPSGMYQYFIKVVPT 289
+ SHK +FG+ G ++ G + ++ + S +++FI ++PT
Sbjct: 175 LDYSHKFVSFSFGDEDALKKVKSLFQQGQLDSFAGTQRIKKPELASQSMQHEHFISIIPT 234
Query: 290 VYTDVSGHTIQSNQFSVTEH-FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
YT ++ Q++ + RS+ G +Q YD +P VT+ + LHF
Sbjct: 235 HYTLLNKQVYSVYQYTANHNEVRSNNYGNVQ------LRYDFAPTTVTYWQTKEDILHFY 288
Query: 349 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
+CA++GG+FTVS +I+A +Y R + K
Sbjct: 289 VQICAVIGGIFTVSSMIEACVYKVMRMLLK 318
>gi|312374049|gb|EFR21698.1| hypothetical protein AND_16520 [Anopheles darlingi]
Length = 252
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/268 (27%), Positives = 123/268 (45%), Gaps = 46/268 (17%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ + LDA+PK+ E+F T GG ++L+S +V++ L + E+ YL++ DT
Sbjct: 11 LEAVSQLDAFPKVKEEFVEATRVGGTLSLISRLVIIFLIYHEVTYYLDSRLVFTFKPDTD 70
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-KRLDSQGNVIESRQDGI 123
L+++ D+T A+PC + D +D + + ++F L + E
Sbjct: 71 LHSKLKVHIDLTV-AMPCKSIGADILDSTNQ-------NVFSFGVLQEEDTWFEL----- 117
Query: 124 GAPKIDKPLQR-HGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL--SNP 180
P QR H ++H+ +Y Y + E K A+ S P
Sbjct: 118 ------CPSQRVHFDYMQHHNSYLRQEY-------------HSIAEILYKSDHAVVYSMP 158
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ + I + + C I+G L +NKVAGNFH GK+ H + H+H F
Sbjct: 159 ERV----------IIPQRPHDACRIHGVLTLNKVAGNFHITVGKTIHFARGHIHLNSIFA 208
Query: 241 RDSFNISHKINKLAFGEHFPGVVNPLDG 268
N SH+IN+ +FG+H G+++PL+G
Sbjct: 209 NTQTNFSHRINRFSFGDHTAGIIHPLEG 236
>gi|151946097|gb|EDN64328.1| ER vesicle protein [Saccharomyces cerevisiae YJM789]
gi|190408176|gb|EDV11441.1| hypothetical protein SCRG_01831 [Saccharomyces cerevisiae RM11-1a]
gi|259148509|emb|CAY81754.1| Erv41p [Saccharomyces cerevisiae EC1118]
Length = 352
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/362 (24%), Positives = 160/362 (44%), Gaps = 65/362 (17%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +++ DA+PK E + ++ GG+ +L++ + +L + ++E Y + + +VD+
Sbjct: 1 MAGLKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQ 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI---------FKKRLDSQGNV 115
+T++IN D+ + C L ++ D Q +D K + F D++ N
Sbjct: 61 VRDTVQINMDI-YVNTKCDWLQINVRD----QTMDRKLVLEELQLEEMPFFIPYDTKVND 115
Query: 116 IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW 175
I I P++D+ L G AE +R+K
Sbjct: 116 INE----IITPELDEIL--------------GEAIPAE----------------FREK-- 139
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
L D+ R E GC+I+G + VN+V+G KS G
Sbjct: 140 -LDTRSFFDESDP----NRAHLPEFNGCHIFGSIPVNRVSGELQIT-AKSL---GYVASR 190
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDV 294
+ FN H IN+ +FG+ +P + NPLD ++ Q+ P Y Y+ VVPT++ +
Sbjct: 191 KAPLEELKFN--HVINEFSFGDFYPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL 248
Query: 295 SGHTIQSNQFSVTEH--FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
G + +NQ+SV ++ + +PG+FF Y+ P+ + ++ +SF+ FL +
Sbjct: 249 -GAEVDTNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLV 307
Query: 353 AI 354
AI
Sbjct: 308 AI 309
>gi|170587366|ref|XP_001898447.1| HT034 [Brugia malayi]
gi|158594071|gb|EDP32661.1| HT034, putative [Brugia malayi]
Length = 286
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 100/190 (52%), Gaps = 18/190 (9%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP- 260
GC G +++KV GNFH + H D Q +++++ H I+ + FG+
Sbjct: 110 GCRFEGKFDISKVPGNFHIS---------THAADT---QPETYDMRHTIHSVVFGDDVST 157
Query: 261 ----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 316
G NPL + S + Y +K+VP+VY D++G+ S Q++ +
Sbjct: 158 SQNLGSFNPLKNREALESDGSFTHDYVLKIVPSVYEDITGNKKYSYQYTYAHKEYVTYHY 217
Query: 317 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+ +P ++F Y+L PI + +TE F F+T++CA+VGG FTV+GIIDA ++ +
Sbjct: 218 SGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLF-SLTEL 276
Query: 377 KKKIEIGKFS 386
+K ++GK S
Sbjct: 277 YRKHQMGKLS 286
>gi|426372082|ref|XP_004052960.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Gorilla gorilla
gorilla]
Length = 354
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/147 (39%), Positives = 84/147 (57%), Gaps = 6/147 (4%)
Query: 222 PGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQ 281
P ++ H H +S+N SH+I+ L+FGE P ++NPLDG + M+Q
Sbjct: 166 PPRAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQ 225
Query: 282 YFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFT 338
YFI VVPT ++T +S T +QFSVTE R + + G+F YDLS + VT T
Sbjct: 226 YFITVVPTKLHTYKISADT---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVT 282
Query: 339 EEHVSFLHFLTNVCAIVGGVFTVSGII 365
EEH+ F F +C IVGG+F+ +G++
Sbjct: 283 EEHMPFWQFFVRLCGIVGGIFSTTGML 309
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 49/89 (55%), Gaps = 1/89 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + VD
Sbjct: 19 LSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKD 78
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDIS 93
LRIN D+T A+ C + D +D++
Sbjct: 79 FSSKLRINIDITV-AMKCQYVGADVLDLA 106
>gi|198468706|ref|XP_001354796.2| GA18088 [Drosophila pseudoobscura pseudoobscura]
gi|198146533|gb|EAL31851.2| GA18088 [Drosophila pseudoobscura pseudoobscura]
Length = 445
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 171/366 (46%), Gaps = 42/366 (11%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV-- 61
+++ R+LDA+ K+ E + T GG ++L+S ++++ L ++EL Y + ET ++
Sbjct: 14 LLDFARNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELHYYWH---ETDIVYQF 70
Query: 62 --DTSRGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKKRLDSQGNVIES 118
D S E ++++ D+T A+PC LS VD MD + + D+F + G +
Sbjct: 71 EPDISLDEQVQMHVDITV-AMPCVALSGVDLMD-------ETQQDVF-----AYGTL--- 114
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG---- 174
+++G+ D Q H ++ Y + S D + +R+ Y KG
Sbjct: 115 QREGVWWKMSDNDRQ-HFQSIQMTNHYLREEF---HSVADVFFK-DIMRDPYPMKGDPTA 169
Query: 175 -WALSN------PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH 227
A+S P + E + + C ++G L +NKVAG H G
Sbjct: 170 GSAISPAIVAPPPGALPASLELHLPNGQPETKFDACRLHGTLGINKVAGVLHLVGGAQPV 229
Query: 228 QSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVV 287
H ++ +R N +H+IN+L+FG++ +V PL+G + QYF+KVV
Sbjct: 230 VGLFEDHWVIELRRMPANFTHRINRLSFGQYSRRIVQPLEGDESIIHEEATTVQYFLKVV 289
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLH 346
PT + TI + Q++VTE+ R + R PG++F YD S +K+ + + +
Sbjct: 290 PTEIHQ-TFTTINTFQYAVTENVRKLDSERNSYGSPGIYFKYDWSALKIVVSNDRDHLVT 348
Query: 347 FLTNVC 352
F +C
Sbjct: 349 FAIRLC 354
>gi|224000371|ref|XP_002289858.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975066|gb|EED93395.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 338
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/392 (26%), Positives = 163/392 (41%), Gaps = 74/392 (18%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+R+ DA+ K + R+ SGG ITL++SI LLF S++ LY+ T + + S
Sbjct: 3 LRTYDAFAKPIDGIRERSVSGGFITLLASITAALLFLSQIILYIQVDTRHSMHLAESVPS 62
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
L F P +ILS I H+ H K ++ QDG
Sbjct: 63 AL-------FNKSPQNILS--GHQIPLRVHVTFPHLPCK--------ALDYSQDGN---- 101
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
G+ EH Y + Y + K+ P +ID K
Sbjct: 102 -----SESTGKFEH---YHSAPY------------------TFTKR-----VPTVIDYKK 130
Query: 188 R--EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------ 239
GF + + +GC + G ++V +V G + + IL+F
Sbjct: 131 AAVSGF-KDVNTARRQGCTLVGTIKVPRVGGTMSISVSPEAWRRAT---SILSFGVDLGK 186
Query: 240 QRDSF-----NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG--MYQYFIKVVPTVYT 292
+D F N++H ++ + FG+ FP NPL GV + SG + +K+VPT Y
Sbjct: 187 DQDMFHGKLPNVTHYVHDITFGDPFPPGSNPLKGVHHVMDNGSGVALANVAVKLVPTTYK 246
Query: 293 DVSGHTIQSNQFSVTEHFRSSE---QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 349
++ Q SV+ H E R LPG+ YD +P+ V E ++L FL+
Sbjct: 247 RTIYSAKETYQASVSRHIVQPETLAAQRSTLLPGLMLTYDFTPLAVRHVESRENWLVFLS 306
Query: 350 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 381
++ IVGGVF G++ + + +A+ KK++
Sbjct: 307 SLVGIVGGVFVTVGLVSGCLVNSAQAVAKKMD 338
>gi|195165324|ref|XP_002023489.1| GL20164 [Drosophila persimilis]
gi|194105594|gb|EDW27637.1| GL20164 [Drosophila persimilis]
Length = 445
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 169/366 (46%), Gaps = 42/366 (11%)
Query: 4 IMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV-- 61
+++ R+LDA+ K+ E + T GG ++L+S ++++ L ++EL Y + ET ++
Sbjct: 14 LLDFARNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELHYYWH---ETDIVYQF 70
Query: 62 --DTSRGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKKRLDSQGNVIES 118
D S E ++++ D+T A+PC LS VD MD + + D+F + G +
Sbjct: 71 EPDISLDEQVQMHVDITV-AMPCVALSGVDLMD-------ETQQDVF-----AYGTL--- 114
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
+++G+ D Q H ++ Y + S D + +R+ Y KG +
Sbjct: 115 QREGVWWKMSDNDRQ-HFQSIQMTNHYLREEF---HSVADVFFK-DIMRDPYPMKGDPTA 169
Query: 179 N-----------PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH 227
P + E + + C ++G L +NKVAG H G
Sbjct: 170 GSAIAPAIVAPPPGALPASLELHLPNGQPETKFDACRLHGTLGINKVAGVLHLVGGAQPV 229
Query: 228 QSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVV 287
H ++ +R N +H+IN+L+FG++ +V PL+G + QYF+KVV
Sbjct: 230 VGLFEDHWVIELRRMPANFTHRINRLSFGQYSRRIVQPLEGDESIIHEEATTVQYFLKVV 289
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLH 346
PT + TI + Q++VTE+ R + R PG++F YD S +K+ + + +
Sbjct: 290 PTEIHQ-TFTTINTFQYAVTENVRKLDSERNSYGSPGIYFKYDWSALKIVVSNDRDHLVT 348
Query: 347 FLTNVC 352
F +C
Sbjct: 349 FAIRLC 354
>gi|12006037|gb|AAG44724.1|AF267855_1 HT034 [Homo sapiens]
Length = 199
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 82/143 (57%), Gaps = 10/143 (6%)
Query: 250 INKLAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 304
I+KL+FG ++ G N L G P + Y +K+VPTVY D SG S Q+
Sbjct: 59 IHKLSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQY 118
Query: 305 SVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
+V E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+
Sbjct: 119 TVANKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVA 176
Query: 363 GIIDAFIYHGQRAIKKKIEIGKF 385
GI+D+ I+ A KKI++GK
Sbjct: 177 GILDSCIFTASEAW-KKIQLGKM 198
>gi|339233696|ref|XP_003381965.1| conserved hypothetical protein [Trichinella spiralis]
gi|316979152|gb|EFV61980.1| conserved hypothetical protein [Trichinella spiralis]
Length = 331
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 89/172 (51%), Gaps = 4/172 (2%)
Query: 201 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF-QRDSFNISHKINKLAFGEHF 259
+ C I+G+ +NK+ G ++ V I A Q + FN SH+I K FG
Sbjct: 144 DACRIHGYFLMNKLRGKLRIKFKETVRLEAVSNFIIFARRQNEGFNFSHRIEKFGFGPRI 203
Query: 260 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSEQGR 317
G++NPLDG + M+ Y+I+VVPT TD++G ++Q+SVT R +QG
Sbjct: 204 AGIINPLDGFQKESFDRRDMFYYYIQVVPTKITDLNGMETFTSQYSVTHKRRIIDHDQGS 263
Query: 318 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
+ G+F ++D +P+ V + S F +CAIVGG+F + I A +
Sbjct: 264 HGSC-GIFIYFDFAPMMVLIRKSKTSLFVFALRICAIVGGIFACTDFIIALM 314
>gi|268577857|ref|XP_002643911.1| Hypothetical protein CBG02175 [Caenorhabditis briggsae]
Length = 282
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 102/190 (53%), Gaps = 19/190 (10%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 257
GC E+NKV GNFH S H + Q D +++ H I+ + FG+
Sbjct: 107 GCRFESRFEINKVPGNFHL----STHSATT--------QPDGYDMRHIIHSIKFGDDVSH 154
Query: 258 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 316
+ G +PL R +E+ ++Y +K+VP+V+ D SG+ + S Q++ +
Sbjct: 155 KNLKGSFDPLAN-REAKESGLNTHEYILKIVPSVHEDYSGNILNSYQYTYGHKSYVTYHH 213
Query: 317 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+ +P V+F Y+L PI + TE SF FLT++CA+VGG FTV+GIID+ + +
Sbjct: 214 SGKIIPAVWFKYELQPITLKQTEHRQSFYIFLTSICAVVGGTFTVAGIIDSTFFTISEMV 273
Query: 377 KKKIEIGKFS 386
KK+ ++GK +
Sbjct: 274 KKQ-QMGKLT 282
>gi|300121843|emb|CBK22417.2| unnamed protein product [Blastocystis hominis]
Length = 251
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 108/225 (48%), Gaps = 23/225 (10%)
Query: 149 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ----CK--REGFLQRIKEEEGEG 202
CYGA ++ CCN C + EAY +GW+ P + Q C+ R L G
Sbjct: 35 CYGA-GAEGQCCNTCSAIVEAYNSRGWS---PHFVLQFSPLCRNSRPSVLSF-----KSG 85
Query: 203 CNIYGFLEVNKVAGNFHFAPGKSFHQ-SGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
C I+G ++V++VAG+ H G V+D + SH I +FG+H PG
Sbjct: 86 CMIWGAIDVHQVAGDIHIQTTTGMIDILGAPVYDAEIISK--LKSSHFIEHFSFGKHIPG 143
Query: 262 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS---SEQGRL 318
V NPL+G R+ + + Y I+++P +Y + G I+SN+ SV E + G
Sbjct: 144 VENPLNGRRFLANQLTS-HAYQIEILPAIY-ERGGVEIRSNEISVYETDKVVTVEPSGTA 201
Query: 319 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
PG+FF Y +SP + E+ F + +C ++GG+ V G
Sbjct: 202 DVEPGLFFKYRISPFEHVIREDRKEFWSLVVRLCGVMGGMMAVGG 246
>gi|118357982|ref|XP_001012239.1| hypothetical protein TTHERM_00103880 [Tetrahymena thermophila]
gi|89294006|gb|EAR91994.1| hypothetical protein TTHERM_00103880 [Tetrahymena thermophila
SB210]
Length = 323
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 95/389 (24%), Positives = 162/389 (41%), Gaps = 80/389 (20%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M R DA+ K+N+D S + GG+ ++++ + +LF E + + KL V +
Sbjct: 1 MQSFRKFDAFQKVNQDIDSSSSVGGLFSIIALAIGFILFCHEFQEWNKYTIVRKLEVQSL 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
++ N D+TF +PCS++S+D + G+Q V++ +
Sbjct: 61 NQAIIKANIDLTFFNVPCSLISLDVLYQDGQQ------------------VLQDYSSTLT 102
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
K+D R + TY E E+ EEV E + K
Sbjct: 103 RIKLD----RQNKEIGTETTY------VEVEQENSQQKIEEVLEQIKNK----------- 141
Query: 185 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
E C I+G L +N + G+F F + G+ +
Sbjct: 142 ----------------EQCRIHGQLLLNTIPGSFKF---RILQMKGLDEQLL-----KQL 177
Query: 245 NISHKINKLAFG--------EHFPGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
NI+HKINKL+FG E G+ D R+ E Y +IK++P
Sbjct: 178 NINHKINKLSFGDTIKTKKIEKVLGLDKSDSEAFDESRYNYEYRCS-YDNYIKILPLNAE 236
Query: 293 DVS--GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 350
++ G+ I++N F T + + + + + V F Y +SPI + + ++ SF F+
Sbjct: 237 NIKELGY-IRTNSFRFTMYQQVIPKEQTDIIE-VSFNYQVSPINIVYQTKNKSFYSFVVQ 294
Query: 351 VCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
VCAI+GG+F V G+I+ + + +I K
Sbjct: 295 VCAIIGGIFCVFGVINTLVLNIISSINSK 323
>gi|392297516|gb|EIW08616.1| Erv41p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 352
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 88/362 (24%), Positives = 160/362 (44%), Gaps = 65/362 (17%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +++ DA+PK E + ++ GG+ +L++ + +L + ++E Y + + +VD+
Sbjct: 1 MAGLKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQ 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI---------FKKRLDSQGNV 115
+T++IN D+ + C L ++ D Q +D K + F D++ N
Sbjct: 61 VRDTVQINMDI-YVNTKCDWLQINVRD----QTMDRKLVLEELQLEEMPFFIPYDTKVND 115
Query: 116 IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW 175
I I P++D+ L G AE +R+K
Sbjct: 116 INE----IITPELDEIL--------------GEAIPAE----------------FREK-- 139
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
L D+ + E GC+I+G + VN+V+G KS G
Sbjct: 140 -LDTRSFFDESDP----NKAHLPEFNGCHIFGSIPVNRVSGELQII-AKSL---GYVASR 190
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDV 294
+ FN H IN+ +FG+ +P + NPLD ++ Q+ P Y Y+ VVPT++ +
Sbjct: 191 KAPLEELKFN--HVINEFSFGDFYPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL 248
Query: 295 SGHTIQSNQFSVTEH--FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
G + +NQ+SV ++ + +PG+FF Y+ P+ + ++ +SF+ FL +
Sbjct: 249 -GAEVDTNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLV 307
Query: 353 AI 354
AI
Sbjct: 308 AI 309
>gi|349580221|dbj|GAA25381.1| K7_Erv41p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 352
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 87/362 (24%), Positives = 160/362 (44%), Gaps = 65/362 (17%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +++ DA+PK E + ++ GG+ +L++ + +L + ++E Y + + +VD+
Sbjct: 1 MAGLKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQ 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI---------FKKRLDSQGNV 115
+T++IN D+ + C L ++ D Q +D K + F D++ N
Sbjct: 61 VRDTVQINMDI-YVNTKCDWLQINVRD----QTMDRKLVLEELQLEEMPFFIPYDTKVND 115
Query: 116 IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW 175
I I P++D+ L G AE +R+K
Sbjct: 116 INE----IITPELDEIL--------------GEAIPAE----------------FREK-- 139
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
L D+ + E GC+++G + VN+V+G KS G
Sbjct: 140 -LDTRSFFDESDP----NKAHLPEFNGCHVFGSIPVNRVSGELQIT-AKSL---GYVASR 190
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDV 294
+ FN H IN+ +FG+ +P + NPLD ++ Q+ P Y Y+ VVPT++ +
Sbjct: 191 KAPLEELKFN--HVINEFSFGDFYPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL 248
Query: 295 SGHTIQSNQFSVTEH--FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
G + +NQ+SV ++ + +PG+FF Y+ P+ + ++ +SF+ FL +
Sbjct: 249 -GAEVDTNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLV 307
Query: 353 AI 354
AI
Sbjct: 308 AI 309
>gi|6323573|ref|NP_013644.1| Erv41p [Saccharomyces cerevisiae S288c]
gi|2497084|sp|Q04651.1|ERV41_YEAST RecName: Full=ER-derived vesicles protein ERV41
gi|558408|emb|CAA86254.1| unnamed protein product [Saccharomyces cerevisiae]
gi|285813935|tpg|DAA09830.1| TPA: Erv41p [Saccharomyces cerevisiae S288c]
Length = 352
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 87/362 (24%), Positives = 160/362 (44%), Gaps = 65/362 (17%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +++ DA+PK E + ++ GG+ +L++ + +L + ++E Y + + +VD+
Sbjct: 1 MAGLKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQ 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI---------FKKRLDSQGNV 115
+T++IN D+ + C L ++ D Q +D K + F D++ N
Sbjct: 61 VRDTVQINMDI-YVNTKCDWLQINVRD----QTMDRKLVLEELQLEEMPFFIPYDTKVND 115
Query: 116 IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGW 175
I I P++D+ L G AE +R+K
Sbjct: 116 INE----IITPELDEIL--------------GEAIPAE----------------FREK-- 139
Query: 176 ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 235
L D+ + E GC+++G + VN+V+G KS G
Sbjct: 140 -LDTRSFFDESDP----NKAHLPEFNGCHVFGSIPVNRVSGELQIT-AKSL---GYVASR 190
Query: 236 ILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDV 294
+ FN H IN+ +FG+ +P + NPLD ++ Q+ P Y Y+ VVPT++ +
Sbjct: 191 KAPLEELKFN--HVINEFSFGDFYPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL 248
Query: 295 SGHTIQSNQFSVTEH--FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 352
G + +NQ+SV ++ + +PG+FF Y+ P+ + ++ +SF+ FL +
Sbjct: 249 -GAEVDTNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLV 307
Query: 353 AI 354
AI
Sbjct: 308 AI 309
>gi|452822342|gb|EME29362.1| hypothetical protein Gasu_31910 [Galdieria sulphuraria]
Length = 170
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 92/177 (51%), Gaps = 9/177 (5%)
Query: 39 MLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
M+LL SE+ Y T L+VD +R E+ I D+TFP + C L +D MD +G+ L
Sbjct: 1 MILLIISEVGRYWKPQVTTHLVVDYNREESFEIYLDITFPHIGCGALGLDTMDATGDSQL 60
Query: 99 DVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE 157
+V + K R+ G+ + Q ++K + H LE T C SCYGA+ S +
Sbjct: 61 EVVNSKLSKFRVFQNGSQVLWNQ-----SIVEKDGKVHSFVLE-EATNCKSCYGAQISTD 114
Query: 158 DCCNNC-EEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNK 213
CCN C EEV AY GW+ + +QC EG +Q ++ +GC+ G +EV K
Sbjct: 115 QCCNTCEEEVLLAYEWIGWSY-QVEQFEQCHMEGVVQWVQSVLSQGCHFQGTIEVAK 170
>gi|123437985|ref|XP_001309782.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121891523|gb|EAX96852.1| hypothetical protein TVAG_470170 [Trichomonas vaginalis G3]
Length = 344
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 167/383 (43%), Gaps = 51/383 (13%)
Query: 11 LDAYPK-INEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKL-----LVDTS 64
D +PK I+ +T G +I+++S + L F E+ ++ +++L L D
Sbjct: 2 FDFFPKFIDASMVHKTTCGAIISIISIAAVAALSFFEIYSFVYPPIKSELVSLSELSDAL 61
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
T+ NF V LPC ++S+D D+ G I+K RLD+ N I Q
Sbjct: 62 SDFTISFNFSVD---LPCILVSIDIYDVLGTLTDPNSKSIYKLRLDNNRNPIPYSQ---- 114
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSD-EDCCNNCEEVREAYRKKGWALSNPDLI 183
+ N CGSCYG E ++ CCN CE+V + K G L+N
Sbjct: 115 --------------VSQN---CGSCYGTEFAEGSRCCNTCEDVVSHHIKAGRPLTNVTTW 157
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
QC E + KE+ C I+G V+ + G P S ++ F +
Sbjct: 158 QQCINEKYDFTGKEK----CQIFGNHHVSAIDGGIRILPRFSSNEE--------PFTK-L 204
Query: 244 FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSN 302
N++H I+ + FG F PLD Q P Y+Y +K VPTV + G
Sbjct: 205 LNLTHYIDHITFGTSFGP--QPLDDALIVQSEPGQFHYRYDLKAVPTVMHNQDGSITHGF 262
Query: 303 QFSV-TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
Q++V + +++ RL G+FF Y + + V + + ++ + I GG F +
Sbjct: 263 QYAVDSAKIPITDRTRLGE--GIFFNYYFATVAVVGKPDRFTIYILISRLFCIFGGGFFL 320
Query: 362 SGIIDAFIYHGQRAIKKKIEIGK 384
+ +ID+F Y ++ K+ IGK
Sbjct: 321 ARLIDSFGYR-IHTMEGKMRIGK 342
>gi|195347402|ref|XP_002040242.1| GM19035 [Drosophila sechellia]
gi|194121670|gb|EDW43713.1| GM19035 [Drosophila sechellia]
Length = 437
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 176/370 (47%), Gaps = 35/370 (9%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + + GG +++++ L ++EL Y + ET+++ D +
Sbjct: 19 KNLDAFKKVPEKYTETSEIGGT----PALMIVYLVYTELHYYWH---ETEIVYQFEPDIA 71
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFKK-RLDSQGNVIE-SRQD 121
E ++++ D+T A+PC+ LS VD MD + + D+F L +G E S D
Sbjct: 72 LDEQVQMHVDITV-AMPCASLSGVDLMD-------ETQQDVFAYGTLQREGVWWEMSEHD 123
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
+ I +Q H R E + A+ +D + RE+ K A
Sbjct: 124 RLQFQAIQ--IQNHYLREEFHSV-------ADVLFKDIMRDPHPARESASKAHAAPPPGA 174
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
L G E + + C ++G L +NKVAG H G H ++ +R
Sbjct: 175 LPLSVDLHGQHNVQPESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRR 234
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQ 300
N +H+IN+L+FG++ +V PL+G + QYF+KVVPT ++ + TI
Sbjct: 235 MPANFTHRINRLSFGQYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFT--TIN 292
Query: 301 SNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+ Q++VTE+ R + R PG++F YD S +K+ + + F +C+I+ G+
Sbjct: 293 AFQYAVTENVRKLDSERNSYGSPGIYFKYDWSALKIMVRNDRDHLVTFAIRLCSIISGII 352
Query: 360 TVSGIIDAFI 369
+SG I+A +
Sbjct: 353 VISGAINALL 362
>gi|328700149|ref|XP_003241164.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 2 [Acyrthosiphon pisum]
gi|328700151|ref|XP_001951220.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 1 [Acyrthosiphon pisum]
gi|328700153|ref|XP_003241165.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 3 [Acyrthosiphon pisum]
Length = 289
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 138/309 (44%), Gaps = 45/309 (14%)
Query: 5 MNKIRSLDAYPKINEDFYS-RTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
+N ++ LD++PK+ E+ Y T+S ++T++ S+ L L SE++ +L + + DT
Sbjct: 11 LNIVKELDSFPKVQEEIYEPSTYSNVILTVLISVFGLWLLISEIQYFLQEHYIYRFVPDT 70
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
L IN D+T A C + D +D +G+ + +F
Sbjct: 71 DYESKLPINIDITV-ASTCDSIGADIVDTTGQNMM-----LF------------------ 106
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G L+ ++T+ + E +RE Y L D
Sbjct: 107 -------------GELKTDDTWWEMTKEQQQHFEKMRKFNAYLREEYHSMKDILWMFDDY 153
Query: 184 DQCKREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA-FQR 241
+ K + F++ K + C I+G L +NKV GNFH PGKS G HVH F
Sbjct: 154 NTLKNKIFVRTDKPNTLPDACRIHGSLILNKVIGNFHITPGKSLIVPGGHVHLTGPFFGS 213
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--I 299
++ N SH+IN+ +FG G++ PL+G + + Y+YFI VV TDV + I
Sbjct: 214 EATNFSHRINQFSFGVPTKGIIYPLEGELYETNENAVSYKYFIDVVA---TDVKSRSNEI 270
Query: 300 QSNQFSVTE 308
++ Q+S +
Sbjct: 271 KTYQYSAKD 279
>gi|324499844|gb|ADY39943.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Ascaris suum]
Length = 429
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 165/379 (43%), Gaps = 30/379 (7%)
Query: 8 IRSLDAYPKINEDFYS-RTFSGGVITLVSSIVMLLLFFSELRLYLNAVTE--TKLLVDTS 64
++SLDA+ K ++ + SG +I++V V+ +L F EL+ Y+ TE K VDT+
Sbjct: 19 VQSLDAFDKTTDEIKEEKKTSGAIISVVCFTVIGVLVFGELKTYIYGDTEFEYKFTVDTA 78
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
E + D+ A PC+ L + E+ + KR ++ E Q
Sbjct: 79 FDEQPELELDMIV-ATPCTNLVAQLSGTAAEEFFLLNQF---KRDPTRFEFTEREQKYWD 134
Query: 125 APKIDKPLQRHGGR----LEHNETYCGSCYGAESSDEDCCNNCEEVR-EAYRKK------ 173
K + + GG LE E G ++ + E + E RK
Sbjct: 135 ELKRVHGVTKPGGMVFKGLEKMEFVSGHVEEGLKAEAEVKQREEAIAIEKERKNNKQEDT 194
Query: 174 --GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSG 230
G L + I+ +++EG C ++G + VNKV G+ GK G
Sbjct: 195 FGGAILLIGNGINVFHI--LASDSQKDEGTACRVHGRVRVNKVKGDSVIITAGKGAGIDG 252
Query: 231 VHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT- 289
+ H + ++ NISH+I +L FG G++ PL G E+ Y+YF+KVVPT
Sbjct: 253 LFAH--VDGASNAGNISHRIARLHFGPWIGGLLTPLAGTEQISESGIDEYRYFLKVVPTR 310
Query: 290 -VYTDVSGHTIQSNQFSVTE-HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
++ G + Q+SVT+ H R S GR P + Y+ + + V E S
Sbjct: 311 IFHSGFFGGSTMRYQYSVTKTHKRPS--GREHMHPAIAIHYEFAALVVEVRETQTSLFQL 368
Query: 348 LTNVCAIVGGVFTVSGIID 366
+C++VGGVF S I++
Sbjct: 369 FVRLCSVVGGVFATSSILN 387
>gi|145543941|ref|XP_001457656.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124425473|emb|CAK90259.1| unnamed protein product [Paramecium tetraurelia]
Length = 322
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/379 (24%), Positives = 162/379 (42%), Gaps = 80/379 (21%)
Query: 8 IRSLDAYPKINEDFYSRTFS-GGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+R LD + K+N D + S GG +T+++ ++ + +E RL+ + + ++D
Sbjct: 3 LRQLDFFRKLNTDIGDTSSSLGGFLTMIAFALVTIFTMNECRLFFSTELNYQTVIDNDTE 62
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ +++ D A PC +LS+D D G +DV ++ K LD + +V+ P
Sbjct: 63 QFIKVYLDAIVGA-PCMVLSLDQQDEVGVHVMDVSGNLKKIALDKERHVL---------P 112
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
ID +NE R YR S+ +L+D
Sbjct: 113 TID-----------NNE-----------------------RPNYRG-----SDQELVDAI 133
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQ-SGVHVHDILAFQRDSFN 245
+ +GE C GF VNKV GNFH + H +H D+ +++
Sbjct: 134 EAIN--------QGEQCQFKGFFSVNKVPGNFHISYHAHHHLIQRIHQRDLSTYRK--LK 183
Query: 246 ISHKINKLAFGEH--------FPGVVNPLDGVRW---TQETPSGM---YQYFIKVVPTVY 291
+ H I +L FG++ +P + W + P G Y+Y+I +P +
Sbjct: 184 LDHTIYELRFGDNSSSFKMKKYPKSLQKFQS-SWNSIAKTAPEGEKQDYEYYINALPVRF 242
Query: 292 TDVSGHTIQS-NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 350
D Q+ ++S+ E + + ++F Y +SP+ + ++ + S HF+
Sbjct: 243 YDDKERNYQTLYKYSINE---AQMTRSFTEIDSIYFKYQISPVNMVYSIQKKSVYHFIVQ 299
Query: 351 VCAIVGGVFTVSGIIDAFI 369
+ AIVGGVF V GI+++ I
Sbjct: 300 LLAIVGGVFAVIGIVNSII 318
>gi|414586932|tpg|DAA37503.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 63
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 53/54 (98%)
Query: 333 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
++VTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YH QRAIKKK+EIGKF+
Sbjct: 10 LQVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHSQRAIKKKMEIGKFN 63
>gi|195629654|gb|ACG36468.1| hypothetical protein [Zea mays]
Length = 76
Score = 100 bits (248), Expect = 2e-18, Method: Composition-based stats.
Identities = 44/72 (61%), Positives = 57/72 (79%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
MDA + +++ LDAYPK+NEDFY T GG++TLV+++VMLLLF SE R Y + TETKL+
Sbjct: 1 MDAFLQRLKRLDAYPKVNEDFYKWTLFGGIVTLVAAVVMLLLFISETRSYFYSATETKLV 60
Query: 61 VDTSRGETLRIN 72
VDTSR E LR+N
Sbjct: 61 VDTSRRERLRVN 72
>gi|440794754|gb|ELR15909.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
Length = 306
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 93/188 (49%), Gaps = 16/188 (8%)
Query: 203 CNIYGFLEVNKVAGNFHFAPGK----SFHQSGVHVHDIL-----AFQRDS--FNISHKIN 251
C + G + V K+ G F + + S + S ++ H DS FN++H+I
Sbjct: 121 CLLTGHMAVRKIRGQFQISSRRFNPFSIYGSSLNKHTPTEDHPHPHPEDSLPFNVTHRIR 180
Query: 252 KLAFGEHFPGVVNPLDGVRWT-QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 310
+L+FG V PLDG+ T +E Y YF+++VP Y G ++S F+ T H
Sbjct: 181 ELSFGPKVLPDVGPLDGIVQTMREGERSQYSYFLQIVPASYHYADGRVVESYSFAFTMH- 239
Query: 311 RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
+ R + PGVF+ YD SP + E SF HF+T CA++GG F V G++ A
Sbjct: 240 ---TESRSELAPGVFWKYDFSPYATSLREVPKSFSHFITRCCAVIGGTFVVFGLLSALAS 296
Query: 371 HGQRAIKK 378
+ A KK
Sbjct: 297 RLETAAKK 304
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 54/228 (23%), Positives = 95/228 (41%), Gaps = 19/228 (8%)
Query: 8 IRSLDAYPKINEDFYS-RTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS-R 65
+R D + K++ +T S G ++++ ++ LF E+ Y A +++ VDT+ R
Sbjct: 8 LREFDIFSKVDPTAPRVKTVSSGAVSILCFFLLGYLFLQEVAEYQKAEVTSQVSVDTTIR 67
Query: 66 GE--TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHD---IFKKRLDSQGNVIESRQ 120
E +L ++ V FP L C VDA D +G D + K+ L + ++
Sbjct: 68 NEFDSLLVSLTVEFPNLGCEDFGVDAADYTGHLLGDATGPGGTLVKRPLTADRCLLTGH- 126
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEE----VREAYRKKGWA 176
+ KI Q R Y GS + ED + E +R + +
Sbjct: 127 --MAVRKIRGQFQISSRRFNPFSIY-GSSLNKHTPTEDHPHPHPEDSLPFNVTHRIRELS 183
Query: 177 LSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGK 224
L D +G +Q ++ EGE FL++ V ++H+A G+
Sbjct: 184 FGPKVLPDVGPLDGIVQTMR--EGERSQYSYFLQI--VPASYHYADGR 227
>gi|226497610|ref|NP_001145501.1| uncharacterized protein LOC100278902 [Zea mays]
gi|195657145|gb|ACG48040.1| hypothetical protein [Zea mays]
Length = 110
Score = 99.0 bits (245), Expect = 4e-18, Method: Composition-based stats.
Identities = 42/99 (42%), Positives = 70/99 (70%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++SL+A+P E +T+SG V+T++ ++M+ LF EL+ YL T ++ VD RGE
Sbjct: 7 LKSLNAFPHAEEHLLKKTYSGAVVTILGLLIMITLFVHELQFYLTTYTVHQMSVDLKRGE 66
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK 106
TL I+ +++FP+LPC +LSVDA+D+SG+ +D+ +I+K
Sbjct: 67 TLPIHVNMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWK 105
>gi|255714272|ref|XP_002553418.1| KLTH0D16324p [Lachancea thermotolerans]
gi|238934798|emb|CAR22980.1| KLTH0D16324p [Lachancea thermotolerans CBS 6340]
Length = 340
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 84/355 (23%), Positives = 155/355 (43%), Gaps = 61/355 (17%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +R+ DA+PK E ++ GG ++++ + ++ + +SE + + + V
Sbjct: 1 MAGLRTFDAFPKTEEQHVRKSSKGGYTSILTYVFLIFIAWSEFGSFFGGYVDEQYGVSKD 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD---SQGNVIESRQD 121
E ++IN D+ F +PC L V D +G++ L V+ ++ + + G + R +
Sbjct: 61 LREAVQINMDM-FVHMPCQWLDVIVQDHTGDRKL-VREELKMESIPFFLPFGTAVNERNE 118
Query: 122 GIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD 181
I + +D+ L E + +R D
Sbjct: 119 -IASLGLDEVL------------------------------AEAIPGQFR---------D 138
Query: 182 LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR 241
ID F + +E GC+++G + VN V G+ P V D
Sbjct: 139 QID------FGSEDESKEFNGCHVFGTITVNMVKGDLIIIPRSQ------SVRDFGRMPP 186
Query: 242 DSFNISHKINKLAFGEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
D+ N+SH IN+ +FG+ +P + NPLD R T E + + Y VVPT++ + G +
Sbjct: 187 DAINLSHVINEFSFGDFYPYIDNPLDRSARITAEHTTS-FHYHTSVVPTIFQKL-GAEVN 244
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+NQ+S++E + L+ +P + F Y + +T +E +SF F+ + AI+
Sbjct: 245 TNQYSLSETKHETPPSGLR-VPAIIFSYSFEALTITIRDERISFWQFIVRLVAIL 298
>gi|145540599|ref|XP_001455989.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423798|emb|CAK88592.1| unnamed protein product [Paramecium tetraurelia]
Length = 322
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 93/391 (23%), Positives = 168/391 (42%), Gaps = 93/391 (23%)
Query: 8 IRSLDAYPKINEDFYSRTFS-GGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRG 66
+R LD + K+N D + + GG +T ++ ++ +L +E RL+ + + ++D
Sbjct: 3 LRQLDFFRKLNTDIGDTSSALGGFLTTIAFALVTILTMNECRLFFSTELNYQTVIDNDTE 62
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP 126
+ ++++ D+ A PC +LS+D D G +DV + K LD +V+ P
Sbjct: 63 QFIKVHLDMIVGA-PCMVLSLDQQDEVGVHVMDVSGTLKKISLDKDRHVL---------P 112
Query: 127 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 186
ID NE E S+++ + E + +
Sbjct: 113 SIDS-----------NERP-----NYEGSEQELLDAIEAINQ------------------ 138
Query: 187 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFA-PGKSFHQSGVHVHDILAFQRDSFN 245
GE C + GF +VNKV GNFH + + +H D+ F++
Sbjct: 139 -------------GEQCQLKGFFQVNKVPGNFHVSYHAHHYLLQRIHQRDLSVFRK--MK 183
Query: 246 ISHKINKLAFGEHFPGVVNPLDGVR------------WTQ---ETPSGM---YQYFIKVV 287
+ H I +L FGE + +R W Q P G Y+Y+I +
Sbjct: 184 LDHSIYELRFGE-----ITTTSKMRKYSKSLQKFQNSWKQIVKSAPEGEKQDYEYYIDAL 238
Query: 288 PTVYTDVSGHTIQS-NQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFL 345
P + D + Q+ ++S+ E ++ R T + ++F Y +SP+ + ++ + S
Sbjct: 239 PVRFYDENERNYQTLYKYSINE----AQMPRTFTEIDSIYFKYQISPVNMVYSIQKKSVY 294
Query: 346 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
HF+ + AI+GGVF V GI+++ + Q+AI
Sbjct: 295 HFIVQLLAIIGGVFAVIGILNSIV---QKAI 322
>gi|256052432|ref|XP_002569774.1| ptx1 protein [Schistosoma mansoni]
gi|353229921|emb|CCD76092.1| putative ptx1 protein [Schistosoma mansoni]
Length = 460
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 88/364 (24%), Positives = 154/364 (42%), Gaps = 55/364 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ LD +PK+ + T+SGG++T+++ + L E R YL+ +D S
Sbjct: 71 VNELDVFPKLPRECKKSTWSGGLVTILTFGCISWLLIMEFRSYLDPPVNYSYELDKSTTG 130
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
+++N D+ A PC +S MD+ +D+ G+ +
Sbjct: 131 KVKVNIDIVV-ASPCHAVS---MDV----------------VDTSGSSLSD--------- 161
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG-----WAL---SN 179
E N Y + + S + + E R K W S
Sbjct: 162 ------------EENIQYLPTSFELTPSARAAFKYRQYIAETLRAKHHTIQHWLWKYTSG 209
Query: 180 PDLIDQCKREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG-VHVHDIL 237
++ + +++ ++ + C I G L V KV GN H GK + G +H+H ++
Sbjct: 210 TNVFTIFEVPVADEKVSDDRNSDACRIVGTLFVKKVGGNIHILFGKPLNGFGNLHLH-VV 268
Query: 238 AFQRDSF-NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
F S N SH+IN +FG+ G ++PL+ V + +QYF+ +VPT +
Sbjct: 269 PFSGQSLQNFSHRINHFSFGDLVNGQIHPLEAVESVTDIAFTSFQYFVTMVPTKVVN-HF 327
Query: 297 HTIQSNQFSVTEHFRSSEQ-GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
H ++ Q++ T R+ + +PG+FF YD+ P+ V T + F T + A+
Sbjct: 328 HITETYQYAATLQNRTIDHDAGSHGIPGIFFVYDIFPLVVKITYDRELLGTFFTRLAALA 387
Query: 356 GGVF 359
GG+F
Sbjct: 388 GGIF 391
>gi|47219772|emb|CAG03399.1| unnamed protein product [Tetraodon nigroviridis]
Length = 378
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 98/224 (43%), Gaps = 62/224 (27%)
Query: 201 EGCNIYGFLEVNKVAGNFHFAPGK-----------SFHQSGV-----------------H 232
C I+G L VNKVAGNFH GK S H + H
Sbjct: 130 RACRIHGHLYVNKVAGNFHITVGKYVTSLLGYSVVSLHSIPIGVTLFLLLSRSIPHPRGH 189
Query: 233 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE--------TP-------- 276
H DS+N SH+I+ L+FGE PG+++PLDG TP
Sbjct: 190 AHLAALVSHDSYNFSHRIDHLSFGEDLPGIISPLDGTEKVSADCTAVLSLTPLHRCDFFL 249
Query: 277 ----------------SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQ 319
+ ++QYFI +VPT + + +++Q+SVTE R+ +
Sbjct: 250 PRLFFKMCDFRFSLLANHIFQYFITIVPT-KLNTYKVSAETHQYSVTEQDRAINHAAGSH 308
Query: 320 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
+ G+F YD+S + V TE+H+ FL +C IVGG+F+ +
Sbjct: 309 GVSGIFMKYDISSLMVKVTEQHMPLWQFLVRLCGIVGGIFSTTA 352
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 55/101 (54%), Gaps = 8/101 (7%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LDA+PK+ E + T SGG ++L++ +M +L F E +Y + + + VD
Sbjct: 10 LTLVKELDAFPKVPESYVESTASGGTVSLIAFSLMAILAFLEFFVYRDTWMKYEYEVDKD 69
Query: 65 RGETLRINFDVTFP-ALPCSILSVDAMDISGEQHLDVKHDI 104
G LRIN D+T +P ++L + ++ L V+H +
Sbjct: 70 FGSKLRINVDITVADEMPMTLLHI-------QERLKVEHSL 103
>gi|391338468|ref|XP_003743580.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Metaseiulus occidentalis]
Length = 292
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 98/202 (48%), Gaps = 32/202 (15%)
Query: 199 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 258
+G+GCN +NKV GNFH S H + Q D ++SH+I+ L FGE
Sbjct: 109 DGKGCNFVSKFTINKVPGNFHV----STHAAKT--------QPDDIDMSHEIHSLTFGEQ 156
Query: 259 F--------PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS----- 305
G N L + + Y +K+VPTVY SG ++ Q++
Sbjct: 157 LIYELGDDIKGSFNALQNHDRLKADGKESHDYVMKIVPTVYELSSGDSLVGYQYTHAHKS 216
Query: 306 -VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
+T F + GR+ +P ++F YDL+PI V + FLTNVCAIVGG FTV GI
Sbjct: 217 YITLSFSA---GRI--IPAIWFKYDLNPITVRYHRRTQPLYSFLTNVCAIVGGTFTVVGI 271
Query: 365 IDAFIYHGQRAIKKKIEIGKFS 386
I++ + +K E+GK S
Sbjct: 272 INSICFTAGEVF-RKFEMGKLS 292
>gi|297803392|ref|XP_002869580.1| hypothetical protein ARALYDRAFT_492089 [Arabidopsis lyrata subsp.
lyrata]
gi|297315416|gb|EFH45839.1| hypothetical protein ARALYDRAFT_492089 [Arabidopsis lyrata subsp.
lyrata]
Length = 480
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 103/201 (51%), Gaps = 33/201 (16%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 258
GC I G++ V KV GN + +SG H +F N+SH +N L+FG+
Sbjct: 293 GCRIEGYIRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGQRIMP 342
Query: 259 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
+ G+ + LDG + + P+ ++++++V T +G +
Sbjct: 343 QKFSELKRLSPYLGLSHDRLDGRPFINQRDLGPNVTIEHYLQIVKTEVVKSNGQAL-VEA 401
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
+ T H S LP F ++LSP++V TE SF HF+TNVCAI+GGVFTV+G
Sbjct: 402 YEYTAH---SSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGVFTVAG 458
Query: 364 IIDAFIYHGQRAIKKKIEIGK 384
I+D+ ++H + KKIE+GK
Sbjct: 459 ILDSILHHSM-TLMKKIELGK 478
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 65/108 (60%), Gaps = 1/108 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+KI+S+D Y KI D + SG +++++++ M+ LF EL YL T T ++VD S
Sbjct: 5 SKIKSVDFYRKIPRDLTEASLSGAGLSIIAALSMIFLFGMELNNYLAVSTSTSVIVDRSA 64
Query: 66 -GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
G+ LR++F+++FP+L C SVD D+ G L+V I K +DS
Sbjct: 65 DGDFLRLDFNISFPSLSCEFASVDVSDVLGTNRLNVTKTIRKFSIDSN 112
>gi|154335780|ref|XP_001564126.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061160|emb|CAM38182.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 309
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 135/323 (41%), Gaps = 47/323 (14%)
Query: 69 LRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI 128
+ ++FDV FP + C+ LS+D +D +G + I K + G V +
Sbjct: 1 MPVHFDVLFPYMSCNRLSIDVVDATGTAKFNCTGTIHKLPISGDGEV-----------QY 49
Query: 129 DKPLQRHGGRLEHNET----YCGSC--YGAESSDED--------CCNNCEEVREAYRKKG 174
++ G +E ++T C C + E D CC++C+ V E Y+
Sbjct: 50 KGTMKDLGNDIEMDDTGGDKKCRRCPSFAFEGVAADVRNAAASKCCDSCDSVFELYKDLE 109
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
+ QC + + E GCN+ G L++ KV F P ++ + +
Sbjct: 110 KEFPGIEYFPQCLEQLY------ERARGCNVIGSLDLKKVPVTVIFGPRRTGRR--YSLK 161
Query: 235 DILAFQRDSFNISHKINKLAFG----EHFP--GVVNPLDGVRWTQETPSGMYQYFIKVVP 288
D++ + SH I KL G E F GV PL G +T S +Y +KVVP
Sbjct: 162 DVI-----RLDTSHVIKKLRIGDEAVERFSKHGVAEPLCGHERFSKTYSET-RYLVKVVP 215
Query: 289 TVYTDVSGHTIQSNQFSVTEHFRSSE--QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
T Y +++ + + S G +P V F ++ + I+V E H
Sbjct: 216 TTYRKTRTRDAKASTYEYSAQCSSQAIVVGFSGVVPAVLFAFEPAAIQVNNVFERQPVSH 275
Query: 347 FLTNVCAIVGGVFTVSGIIDAFI 369
FL +C IVGG+F V G ID+ +
Sbjct: 276 FLVQLCGIVGGLFVVLGFIDSTV 298
>gi|22328963|ref|NP_567765.2| protein PDI-like 5-4 [Arabidopsis thaliana]
gi|75213708|sp|Q9T042.1|PDI54_ARATH RecName: Full=Protein disulfide-isomerase 5-4; Short=AtPDIL5-4;
AltName: Full=Protein disulfide-isomerase 7; Short=PDI7;
AltName: Full=Protein disulfide-isomerase 8-2;
Short=AtPDIL8-2; Flags: Precursor
gi|4490704|emb|CAB38838.1| putative protein [Arabidopsis thaliana]
gi|7269561|emb|CAB79563.1| putative protein [Arabidopsis thaliana]
gi|15450832|gb|AAK96687.1| putative protein [Arabidopsis thaliana]
gi|20259836|gb|AAM13265.1| putative protein [Arabidopsis thaliana]
gi|332659897|gb|AEE85297.1| protein PDI-like 5-4 [Arabidopsis thaliana]
Length = 480
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 102/201 (50%), Gaps = 33/201 (16%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 258
GC + G++ V KV GN + +SG H +F N+SH +N L+FG
Sbjct: 293 GCRVEGYMRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGRRIMP 342
Query: 259 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
+ G+ + LDG + + P+ ++++++V T +G +
Sbjct: 343 QKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIVKTEVVKSNGQAL-VEA 401
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
+ T H S LP F ++LSP++V TE SF HF+TNVCAI+GGVFTV+G
Sbjct: 402 YEYTAH---SSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGVFTVAG 458
Query: 364 IIDAFIYHGQRAIKKKIEIGK 384
I+D+ ++H + KKIE+GK
Sbjct: 459 ILDSILHHSM-TLMKKIELGK 478
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 65/108 (60%), Gaps = 1/108 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+KI+S+D Y KI D + SG +++++++ M+ LF EL YL T T ++VD S
Sbjct: 5 SKIKSVDFYRKIPRDLTEASLSGAGLSIIAALSMIFLFGMELNNYLAVSTSTSVIVDRSA 64
Query: 66 -GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
G+ LR++F+++FP+L C SVD D+ G L+V I K +DS
Sbjct: 65 DGDFLRLDFNISFPSLSCEFASVDVSDVLGTNRLNVTKTIRKFSIDSN 112
>gi|238480964|ref|NP_680742.2| protein PDI-like 5-4 [Arabidopsis thaliana]
gi|332659898|gb|AEE85298.1| protein PDI-like 5-4 [Arabidopsis thaliana]
Length = 532
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 102/201 (50%), Gaps = 33/201 (16%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 258
GC + G++ V KV GN + +SG H +F N+SH +N L+FG
Sbjct: 345 GCRVEGYMRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGRRIMP 394
Query: 259 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
+ G+ + LDG + + P+ ++++++V T +G +
Sbjct: 395 QKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIVKTEVVKSNGQALV-EA 453
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
+ T H S LP F ++LSP++V TE SF HF+TNVCAI+GGVFTV+G
Sbjct: 454 YEYTAH---SSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGVFTVAG 510
Query: 364 IIDAFIYHGQRAIKKKIEIGK 384
I+D+ ++H + KKIE+GK
Sbjct: 511 ILDSILHHSM-TLMKKIELGK 530
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 58/101 (57%), Gaps = 1/101 (0%)
Query: 13 AYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR-GETLRI 71
A KI D + SG +++++++ M+ LF EL YL T T ++VD S G+ LR+
Sbjct: 64 ASKKIPRDLTEASLSGAGLSIIAALSMIFLFGMELNNYLAVSTSTSVIVDRSADGDFLRL 123
Query: 72 NFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
+F+++FP+L C SVD D+ G L+V I K +DS
Sbjct: 124 DFNISFPSLSCEFASVDVSDVLGTNRLNVTKTIRKFSIDSN 164
>gi|167523643|ref|XP_001746158.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775429|gb|EDQ89053.1| predicted protein [Monosiga brevicollis MX1]
Length = 1400
Score = 96.3 bits (238), Expect = 2e-17, Method: Composition-based stats.
Identities = 51/149 (34%), Positives = 84/149 (56%), Gaps = 7/149 (4%)
Query: 197 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 256
+ E +GC ++G + V +V+ NFHF+ GKS H + H H + + + N SH+I++ +F
Sbjct: 165 DAEPDGCRVHGTMPVARVSSNFHFSAGKSVHHASGHAHVPIDPNQKTINFSHRIDRFSFS 224
Query: 257 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTE--HFRSS 313
G + LDG ++ ++QYF+KVVPT + +SNQ+SVTE H ++
Sbjct: 225 SEQRGAM-ALDGDMKVSDSNKQLFQYFLKVVPTTTKRMDEAEPFRSNQYSVTEQHHILAA 283
Query: 314 EQGRLQTLPGVFFFYDLSPIKVTFTEEHV 342
+ + LPG+ F Y++ PI V E+ V
Sbjct: 284 NE---RKLPGIHFKYEIEPIGVLVHEQAV 309
Score = 53.5 bits (127), Expect = 2e-04, Method: Composition-based stats.
Identities = 27/95 (28%), Positives = 51/95 (53%), Gaps = 3/95 (3%)
Query: 2 DAIMNKIRSLDAYPKI--NEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKL 59
+ + +++ LD +PK+ + D + + SG V+T++ + ++ L F+EL Y +
Sbjct: 8 ERLQEQVKQLDVFPKVEPDMDIQTTSISGAVVTIIVGLAIVGLIFTELMYYRTVDVVYEY 67
Query: 60 LVDTSRGETLRINFDVTFPALPCSILSVDAMDISG 94
VDT + + D+T A+PC VD +D+SG
Sbjct: 68 AVDTDLDPHMNLTVDMTI-AMPCENFGVDYIDVSG 101
>gi|410046954|ref|XP_003952285.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Pan troglodytes]
Length = 333
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 144/369 (39%), Gaps = 101/369 (27%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
+ ++ ++ LDA+PK+ E + + SGG ++L++ M LL E +Y + + + V
Sbjct: 16 EKTLSLVKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEV 75
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQD 121
D LRIN D+T A+ C + D +D++ ++ S
Sbjct: 76 DKDFSSKLRINIDITV-AMKCQYVGADVLDLA-------------------ETMVASADG 115
Query: 122 GIGAPKI--DKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN 179
+ P + P Q+ R+ S E S +D + A++ AL
Sbjct: 116 LVYEPTVFDLSPQQKEWQRMLQ---LIQSRLQEEHSLQDVI-----FKSAFKSASTAL-- 165
Query: 180 PDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
P D + + C I+G L VNKVAGNFH Q
Sbjct: 166 PPREDDSS----------QSPDACRIHGHLYVNKVAGNFHITVDNQMFQ----------- 204
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGH 297
YFI VVPT ++T +S
Sbjct: 205 ------------------------------------------YFITVVPTKLHTYKISAD 222
Query: 298 TIQSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
T +QFSVTE R + + G+F YDLS + VT TEEH+ F F +C IVG
Sbjct: 223 T---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVG 279
Query: 357 GVFTVSGII 365
G+F+ +G++
Sbjct: 280 GIFSTTGML 288
>gi|328352874|emb|CCA39272.1| Peroxisomal membrane protein PEX28 [Komagataella pastoris CBS 7435]
Length = 849
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 95/189 (50%), Gaps = 17/189 (8%)
Query: 183 IDQCKREGFLQRIKEE------EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 236
+D+ RE L +E+ + C+I+G + VNKV G FH GK G D
Sbjct: 644 LDEVMRESALAEFREKKSFTHGDAPACHIFGSIPVNKVHGFFHIT-GK-----GYGYRDR 697
Query: 237 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
+++ N +H I++ +FGE +P + NPLD T + Y++ VVPT Y + G
Sbjct: 698 SIVPKEALNFTHVISEFSFGEFYPYMNNPLDFTARTTNDHIHTFNYYLDVVPTEYKKL-G 756
Query: 297 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
I + Q+S+T +E L PG+FF Y PI ++ E+ +SF+ FL + I G
Sbjct: 757 IVIDTTQYSMT----VTELPGLSRPPGLFFNYQFEPIILSIEEKRISFVRFLVRLVTICG 812
Query: 357 GVFTVSGII 365
G+ V+ I
Sbjct: 813 GIMVVAKWI 821
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 1/91 (1%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
IR DA+PK R+ G T++ +L L + E+ Y++ + + ++D +
Sbjct: 523 IRVFDAFPKTEPVNTVRSTKGSYSTILMGFFILFLIWVEIGGYVDGYIDRQFMLDRNIQR 582
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHL 98
L IN D+ F A PC+ L + DI+ ++ L
Sbjct: 583 VLNINLDM-FVATPCNYLHTNVKDITQDRFL 612
>gi|378726952|gb|EHY53411.1| hypothetical protein HMPREF1120_01605 [Exophiala dermatitidis
NIH/UT8656]
Length = 326
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 94/208 (45%), Gaps = 47/208 (22%)
Query: 201 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 259
+ C IYG LE NKV G+FH A G + + G+ H FN SH IN+L+FG H+
Sbjct: 86 DSCRIYGSLEGNKVQGDFHITARGHGYMEFGMQQH----LDHSRFNFSHHINELSFGPHY 141
Query: 260 PGVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYT-------------------------- 292
PG++NPLD T + YQY++ +VPT++T
Sbjct: 142 PGLLNPLDKTSAVTTDVHFMRYQYYLSIVPTIFTKRRVSTSSGALDPAAIPQPPTLDLTP 201
Query: 293 ----DVSG--------HTIQSNQFSVTEHFRSSEQGRL---QTLPGVFFFYDLSPIKVTF 337
D G H + ++ T + ++ Q R T+PGVFF YD+ PI +
Sbjct: 202 NDHRDKDGVVRHVPNPHAGRDSKSVFTNQYAATSQSREVPGNTVPGVFFKYDIEPILLIV 261
Query: 338 TEEHVSFLHFLTNVCAIVGGVFTVSGII 365
+E SFL + + ++ GV G +
Sbjct: 262 SERRSSFLGLIVRLVNVISGVLVAGGWM 289
>gi|21618302|gb|AAM67352.1| unknown [Arabidopsis thaliana]
Length = 317
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 100/201 (49%), Gaps = 33/201 (16%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 258
GC + G++ V KV GN + +SG H +F N+SH +N L+FG
Sbjct: 130 GCRVEGYMRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGRRIMP 179
Query: 259 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
+ G+ + LDG + + P+ ++++++V T +G +
Sbjct: 180 QKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIVKTEVVKSNGQAL---- 235
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
E+ S LP F ++LSP++V TE SF HF+TNVCAI+GG FTV+G
Sbjct: 236 VEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGAFTVAG 295
Query: 364 IIDAFIYHGQRAIKKKIEIGK 384
I+D+ ++H + KKIE+GK
Sbjct: 296 ILDSILHHSM-TLMKKIELGK 315
>gi|302808800|ref|XP_002986094.1| hypothetical protein SELMODRAFT_123299 [Selaginella moellendorffii]
gi|300146242|gb|EFJ12913.1| hypothetical protein SELMODRAFT_123299 [Selaginella moellendorffii]
Length = 475
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 130/283 (45%), Gaps = 48/283 (16%)
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G P I + H + EH S YG +D + + EA K L+ D
Sbjct: 217 GFPSIRIFRKGHDLKDEHGHHEHDSYYGERDTD-----SLVKAMEALVPKETTLALED-- 269
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
K G ++R G GC I GF+ KV GN + SG H +F +
Sbjct: 270 ---KTNGTVKRPAPRAG-GCRIEGFIRAKKVPGNIIISA-----HSGSH-----SFDASA 315
Query: 244 FNISHKINKLAFGEH------------FPGVVNPLDGVR-------WTQETPSGMYQYFI 284
N++H +++ +FG +P + + D V + + + + +++
Sbjct: 316 MNMTHYVSQFSFGRELNFWMRRELYRIYPHLASVYDTVEANLTGRIYVSQHENITHDHYL 375
Query: 285 KVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQT--LPGVFFFYDLSPIKVTFTEEH 341
+VV T + + +FS+ E + +S +Q +P F Y+LSP++V E
Sbjct: 376 QVVKTEVVSLQ----KRKEFSLLEQYDYTSHSNTVQNTNVPVAKFHYELSPMQVLVKENP 431
Query: 342 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF HF+TNVCAI+GGVFTV+GI+D+ + HG + KKIE+GK
Sbjct: 432 KSFSHFITNVCAIIGGVFTVAGIVDSML-HGAMRMVKKIELGK 473
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 63/112 (56%), Gaps = 1/112 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+KI+S+D Y KI D + SG ++L+++ M+ LF EL YL + T ++VD S+
Sbjct: 5 SKIKSIDFYRKIPRDLTEASLSGAGLSLIAAFAMIFLFGMELNNYLTVSSTTNVVVDRSK 64
Query: 66 -GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI 116
GE LRI F+++FPAL C SVD D G ++ + K +D ++
Sbjct: 65 DGEYLRIQFNMSFPALSCEFASVDVSDALGTNRYNLTKTVRKYPIDPNLKIV 116
>gi|444706692|gb|ELW48018.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Tupaia chinensis]
Length = 821
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 68/108 (62%), Gaps = 5/108 (4%)
Query: 280 YQYFIKVVPTVYTDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTF 337
+ Y +K+VPTVY D SG S Q++V E+ S GR+ +P ++F YDLSPI V +
Sbjct: 716 HDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSHTGRI--IPAIWFRYDLSPITVKY 773
Query: 338 TEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
TE F+T +CAI+GG FTV+GI+D+ I+ A KK+++GK
Sbjct: 774 TERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW-KKVQLGKM 820
>gi|225461068|ref|XP_002281649.1| PREDICTED: protein disulfide isomerase-like 5-4 [Vitis vinifera]
gi|297735969|emb|CBI23943.3| unnamed protein product [Vitis vinifera]
Length = 482
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 104/204 (50%), Gaps = 38/204 (18%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 260
GC I GF+ V KV GN + +SG H +F N+SH I+ L+FG P
Sbjct: 294 GCRIEGFVRVKKVPGNLVISA-----RSGSH-----SFDPSQMNMSHVISHLSFGRKIAP 343
Query: 261 GVVNPLDGV-------------RWTQETPSG-----MYQYFIKVVPTVYTDVSGHTIQSN 302
V++ + V R PS +++++VV T H +
Sbjct: 344 RVMSDMKRVLPYIGGSHDRLNGRSYISHPSDSNANVTIEHYLQVVKTEVITTRDHKL--- 400
Query: 303 QFSVTEHFRSSEQGRLQTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
V E+ ++ +Q+L P F ++LSP++V TE SF HF+TNVCAI+GGVFT
Sbjct: 401 ---VEEYEYTAHSSLVQSLYIPVAKFHFELSPMQVLVTENRKSFWHFITNVCAIIGGVFT 457
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
V+GI+D+ +++ R + KKIE+GK
Sbjct: 458 VAGILDSVLHNTMR-LMKKIELGK 480
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 65/106 (61%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+KI+S+D Y KI D + SG +++++++ M+ LF EL YL+ T T ++VD +S
Sbjct: 5 SKIKSVDFYRKIPRDLTEASLSGAGLSVIAALSMMFLFGMELSNYLSVSTSTSVIVDQSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
G+ LRI F+++FPAL C SVD D+ G L++ I K +D
Sbjct: 65 DGDFLRIEFNISFPALSCEFASVDVSDVLGTNRLNITKTIRKYSID 110
>gi|256269733|gb|EEU05000.1| Erv41p [Saccharomyces cerevisiae JAY291]
Length = 353
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/363 (23%), Positives = 158/363 (43%), Gaps = 66/363 (18%)
Query: 5 MNKIRSLDAY-PKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDT 63
M +++ DA+ K E + ++ GG+ +L++ + +L + ++E Y + + +VD+
Sbjct: 1 MAGLKTFDAFRTKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDS 60
Query: 64 SRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI---------FKKRLDSQGN 114
+T++IN D+ + C L ++ D Q +D K + F D++ N
Sbjct: 61 QVRDTVQINMDI-YVNTKCDWLQINVRD----QTMDRKLVLEELQLEEMPFFIPYDTKVN 115
Query: 115 VIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 174
I I P++D+ L G AE +R+K
Sbjct: 116 DINE----IITPELDEIL--------------GEAIPAE----------------FREK- 140
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
L D+ + E GC+I+G + VN+V+G + G
Sbjct: 141 --LDTRSFFDESDP----NKAHLPEFNGCHIFGSIPVNRVSGELQITA----NSLGYVAS 190
Query: 235 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTD 293
+ FN H IN+ +FG+ +P + NPLD ++ Q+ P Y Y+ VVPT++
Sbjct: 191 RKAPLEELKFN--HVINEFSFGDFYPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKK 248
Query: 294 VSGHTIQSNQFSVTEH--FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
+ G + +NQ+SV ++ + +PG+FF Y+ P+ + ++ +SF+ FL +
Sbjct: 249 L-GAEVDTNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRL 307
Query: 352 CAI 354
AI
Sbjct: 308 VAI 310
>gi|224117462|ref|XP_002317580.1| predicted protein [Populus trichocarpa]
gi|222860645|gb|EEE98192.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 104/217 (47%), Gaps = 30/217 (13%)
Query: 187 KREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 244
K E Q +K GC I G++ V KV GN + SG H +F
Sbjct: 277 KPENATQHVKRPAPSAGGCRIEGYVRVKKVPGNLMISA-----LSGAH-----SFDSKQM 326
Query: 245 NISHKINKLAFG-EHFPGVV--------------NPLDGVRWTQETPSGMYQYFIKVVPT 289
N+SH I+ +FG + P V+ + L+G + G +
Sbjct: 327 NLSHVISHFSFGMKVLPRVMSDVKRLLPYIGRSHDKLNGRSFINHRDVGANVTIEHYLQV 386
Query: 290 VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHF 347
V T+V S + + E+ ++ QT +P F ++LSP++V TE SF HF
Sbjct: 387 VKTEVVTRRSSSERKLIEEYEYTAHSSLSQTVYMPTAKFHFELSPMQVLITENSKSFSHF 446
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+TNVCAI+GGVFTV+GI+D+ ++H R + KK+E+GK
Sbjct: 447 ITNVCAIIGGVFTVAGILDSILHHTVRMM-KKVELGK 482
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 65/106 (61%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
NK++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 5 NKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMMFLFGMELNNYLTVNTSTTVIVDNSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LRI+F+++FP+L C SVD D+ G L++ I K +D
Sbjct: 65 DGEFLRIDFNISFPSLSCEFASVDVSDVLGTNRLNITKTIRKFSID 110
>gi|255563725|ref|XP_002522864.1| thioredoxin domain-containing protein, putative [Ricinus communis]
gi|223537948|gb|EEF39562.1| thioredoxin domain-containing protein, putative [Ricinus communis]
Length = 478
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/241 (31%), Positives = 116/241 (48%), Gaps = 40/241 (16%)
Query: 168 EAYRKKGWALSNPDLIDQCKREGFLQRIKEEE--GEGCNIYGFLEVNKVAGNFHFAPGKS 225
E+ K +L P ++ K E Q K GC I G++ V KV GN +
Sbjct: 252 ESLVKTMESLVAPIQLESLKSENATQSTKRPAPLTGGCRIEGYVRVKKVPGNLIISA--- 308
Query: 226 FHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-PGVVN--------------PLDG-- 268
+SG H +F N+SH I+ L+FG P V+N L+G
Sbjct: 309 --RSGAH-----SFDPSQMNMSHVISHLSFGLKVSPKVMNEAKRLVPYIGGSHDKLNGRS 361
Query: 269 -VRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT---LPG 323
V + ++++++V T V T S S + + E + + L +P
Sbjct: 362 FVNHRDVDANVTIEHYLQIVKTEVVTRRS-----SREHKLLEEYEYTAHSSLVQSVYIPA 416
Query: 324 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 383
F ++LSP++V TE SF HF+TNVCAI+GGVFTV+GI+D+ ++H R + KK+E+G
Sbjct: 417 AKFHFELSPMQVLITENPKSFSHFITNVCAIIGGVFTVAGILDSILHHTVR-LMKKVELG 475
Query: 384 K 384
K
Sbjct: 476 K 476
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 66/110 (60%), Gaps = 1/110 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++++++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSIIAALSMVFLFGMELSNYLTVNTSTSVIVDKSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN 114
G+ LRI+F+++FPAL C SVD D+ G L++ I K +D N
Sbjct: 65 DGDFLRIDFNLSFPALSCEFASVDVSDVLGTNRLNITKTIRKFSIDHDLN 114
>gi|384244593|gb|EIE18093.1| protein disulfide isomerase [Coccomyxa subellipsoidea C-169]
Length = 479
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 98/213 (46%), Gaps = 56/213 (26%)
Query: 202 GCNIYGFLEVNKVAGNFHF---APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE- 257
GC + GF+ V KV G HF +PG SF + N+SH +N L FG
Sbjct: 291 GCALSGFVLVKKVPGALHFLAKSPGHSF-------------DYQAMNMSHVVNYLYFGNK 337
Query: 258 ------------HFPGV----VNPLDGVRWTQETPSGMYQYFIKVV----------PTVY 291
H G+ + L G + ++++++VV P +
Sbjct: 338 PSPRRHQSLAKLHPAGLSDDWADKLAGQDFFSRAAKATFEHYMQVVLTTIEPSKHRPELS 397
Query: 292 TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
D +T+ S+ + + +P F YDLSPI++ +E+ ++ HF+T
Sbjct: 398 YDAYEYTVHSHTYDTAD------------IPAAKFTYDLSPIQILVSEKRRAWYHFVTTT 445
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
CAI+GGVFTV+GI+D ++ G R KK+E+GK
Sbjct: 446 CAIIGGVFTVAGIVDGLVHTGAR-FAKKVELGK 477
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 68/115 (59%), Gaps = 1/115 (0%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M ++ K+RS+D Y KI D T +G I+LV++ +++L +EL +L T+ +L+
Sbjct: 1 MARVLQKLRSVDFYRKIPNDLTEATLAGAGISLVAAFTIVVLLTAELSSFLAIETKEELI 60
Query: 61 VDTS-RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGN 114
VD S G+ LRINF+++FP+L C ++D D G + +++ I K +D G
Sbjct: 61 VDRSAHGDLLRINFNISFPSLSCEFATLDVSDALGTKRMNLTKTIRKLPIDEDGQ 115
>gi|365986066|ref|XP_003669865.1| hypothetical protein NDAI_0D03080 [Naumovozyma dairenensis CBS 421]
gi|343768634|emb|CCD24622.1| hypothetical protein NDAI_0D03080 [Naumovozyma dairenensis CBS 421]
Length = 353
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 164/373 (43%), Gaps = 58/373 (15%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ DA+PKI + ++ GG+ ++++ ++++ + +SE Y + + +VD E
Sbjct: 5 LKVFDAFPKIEDQNKKKSTKGGITSILTYVLIIFIAWSEFGSYFGGFVDQQYIVDGMLRE 64
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK 127
T+ IN D+ + +PC + V+ D Q LD K + + + I P+
Sbjct: 65 TVPINLDL-YVNVPCEWVHVNVRD----QTLDRKFASQELKFEEMPFFIPFDVRLNDNPE 119
Query: 128 IDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK 187
I P E +E G AE ++ + R + + +NPD
Sbjct: 120 IVTP--------ELDEI-LGEAIPAEFREK------LDTRMFFDE-----NNPD------ 153
Query: 188 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR---DSF 244
+ + GC+I+G + VN+VAG Q H + R +
Sbjct: 154 ------KSHLPDFNGCHIFGSVNVNQVAGEL---------QVTAKGHGYADYHRAPLEKV 198
Query: 245 NISHKINKLAFGEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQ 303
N +H IN+ +FGE FP + NPLD ++ + P Y Y V+P +Y + G + + Q
Sbjct: 199 NFAHVINEFSFGEFFPYIDNPLDNSAKFNMDDPLTAYVYDTSVIPMIYRKM-GAEVDTFQ 257
Query: 304 FSVTEHFRSSEQGRLQT---LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG-GVF 359
+SV EH S++ +PG+FF Y+ + + ++ + F+ F+ + AI+ V+
Sbjct: 258 YSVAEHQYKSKESSSSNSFRVPGIFFQYNFENLSIVVSDRRLGFIQFIVRLVAILSFAVY 317
Query: 360 TVSGII---DAFI 369
S + D FI
Sbjct: 318 IASWLFILADMFI 330
>gi|255082155|ref|XP_002508296.1| predicted protein [Micromonas sp. RCC299]
gi|226523572|gb|ACO69554.1| predicted protein [Micromonas sp. RCC299]
Length = 507
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 118/257 (45%), Gaps = 37/257 (14%)
Query: 148 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 207
+ Y + + E EE+ A++ A + D ++ Q +K+ +G GC++ G
Sbjct: 268 TSYHGDRTVEAITTFAEELLPAWK----ATDHKDTELAIRQPVETQTVKKIDGPGCSVTG 323
Query: 208 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-PGVVNPL 266
F+ V KV G+ H +F +S N+SH ++ FG+ P L
Sbjct: 324 FVLVKKVPGHLWVTATSKSH----------SFHAESMNMSHVVHHFYFGQQLTPQRKRYL 373
Query: 267 DGVRWTQETPSG------------------MYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 308
D ++ P G ++++++ V T SG N + T+
Sbjct: 374 DRFHSREKDPKGDWHDKLAGGTFTSEEDNVTHEHYLQTVLTTIKP-SGSPAPFNVYEYTQ 432
Query: 309 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 368
H S + LP F +D SP++++ +EE F HF+T + AIVGGV++V GI D F
Sbjct: 433 HSHSLRSEK--ELPRAKFHFDPSPVQISVSEERQKFYHFITTLMAIVGGVYSVMGIADGF 490
Query: 369 IYHGQRAIKKKIEIGKF 385
+++ +A KKK E+GKF
Sbjct: 491 VHNSIQAWKKK-ELGKF 506
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 69/111 (62%), Gaps = 1/111 (0%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR- 65
K +++D Y KI +D T G VI++++++V+ LL SE+ YL +T++++D S
Sbjct: 8 KFKNVDFYRKIPKDMTEGTIPGSVISMLAALVIGLLLVSEVGSYLTPKFDTRVVIDRSAD 67
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI 116
GE +RINF+V+FPAL C SVD D G ++ +FK+ +D++ N +
Sbjct: 68 GEMMRINFNVSFPALSCEFASVDVGDAMGLNRFNLTKTVFKRAIDAKLNPL 118
>gi|296086862|emb|CBI33029.3| unnamed protein product [Vitis vinifera]
Length = 139
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 40/66 (60%), Positives = 52/66 (78%)
Query: 109 LDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVRE 168
+D+ GN + +QD IG P+I+K LQRHGGRLE N YCGSCYGAE +D+DC N+C+E RE
Sbjct: 73 IDAHGNEVAVKQDEIGGPQIEKLLQRHGGRLERNGKYCGSCYGAEVTDDDCGNSCDEDRE 132
Query: 169 AYRKKG 174
Y+K+G
Sbjct: 133 TYKKRG 138
>gi|343473351|emb|CCD14737.1| hypothetical protein, unlikely [Trypanosoma congolense IL3000]
Length = 141
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 77/134 (57%), Gaps = 17/134 (12%)
Query: 268 GVRWTQETPSGMYQYFIKVVPTVY---TDVS-GHTIQSNQFSVTEHFRSS---------- 313
GV E G + YF+KVVPT+Y T +S G ++SNQ+SVT HF +S
Sbjct: 6 GVENPSEDLIGRFAYFVKVVPTLYQVRTLMSLGRVVESNQYSVTHHFTASWDAADQNNQT 65
Query: 314 -EQGRLQTLPGVFFFYDLSPIKVTFTEEHV--SFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
+ +PGVF YD+SPI+V+ H S +H + +CA+ GGV+TV G+ID+ +
Sbjct: 66 NRDANPRVVPGVFVSYDISPIRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVMGLIDSMFF 125
Query: 371 HGQRAIKKKIEIGK 384
H R +++KI GK
Sbjct: 126 HSIRRVQEKINRGK 139
>gi|440798302|gb|ELR19370.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
Length = 328
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 100/201 (49%), Gaps = 38/201 (18%)
Query: 197 EEEGEGCNIYGFLEVNKVAGNFHFA------------------------PGKSFHQSGVH 232
+ E GC+I G++ V KV GNFH + + F+ SGV
Sbjct: 116 DSELSGCSIAGYINVPKVPGNFHLSTHGRNVQAQDIDMQHNINSFFFTDSPRVFYPSGVS 175
Query: 233 VHDILAFQRDSFNISHKINKLA----FGEHFPGVVNPLDGV-RWTQETPSGM---YQYFI 284
V A++ N+ ++N A + G+ PLDG+ + + +G+ Y+Y+I
Sbjct: 176 VP---AWRNWHSNVVAELNAQARDQDTDDDVVGLFRPLDGITKANSQRKNGVGVSYEYYI 232
Query: 285 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 344
++VPT+ G T + QF+ + ++ +G+ P V+F YD+SPI V T S
Sbjct: 233 QIVPTILEFPDGRTKHTYQFTYNFNDVATPEGKT---PSVYFKYDISPITVKITRGRGSL 289
Query: 345 LHFLTNVCAIVGGVFTVSGII 365
HFL +CAIVGG+FTVSG+I
Sbjct: 290 GHFLLQLCAIVGGIFTVSGLI 310
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 57/93 (61%), Gaps = 1/93 (1%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
+++ ++S D Y ++ +D + G +++LV +M +L E+ Y + TET++LVD
Sbjct: 4 SMLGLLKSFDLYRRVPKDLTKGSVPGAIVSLVCLTIMAMLISWEVYCYASIKTETQMLVD 63
Query: 63 TSRG-ETLRINFDVTFPALPCSILSVDAMDISG 94
T R E +RIN +VT P +PC ++++D D+ G
Sbjct: 64 TPRNLEKIRININVTVPRIPCYVIALDTEDVLG 96
>gi|356517290|ref|XP_003527321.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 480
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 100/201 (49%), Gaps = 33/201 (16%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 260
GC + G++ V KV GN + H +F N+SH IN L+FG+ P
Sbjct: 293 GCRVEGYVRVKKVPGNLIISARSDAH----------SFDASQMNMSHFINNLSFGKKVTP 342
Query: 261 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 303
+ + L+G +T G +++I++V T +G+ + +
Sbjct: 343 RAMSDVKLLIPYIGSSHDRLNGRSFTNTHDLGANVTIEHYIQIVKTEVVTRNGYKLI-EE 401
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
+ T H S +P F +LSP++V TE SF HF+TNVCAI+GGVFTV+G
Sbjct: 402 YEYTAH---SSVAHSVDIPAAKFHLELSPMQVLITENQRSFSHFITNVCAIIGGVFTVAG 458
Query: 364 IIDAFIYHGQRAIKKKIEIGK 384
I+D+ +++ R + KK+E+GK
Sbjct: 459 ILDSILHNTIRMM-KKVELGK 478
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 66/107 (61%), Gaps = 1/107 (0%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TSR 65
K++S+D Y KI D + SG +++V+++ M+ LF EL YL+ T T ++VD +S
Sbjct: 6 KLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMMFLFGMELSSYLSVSTSTSVIVDKSSD 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
G+ LRI+F+++FPAL C SVD D+ G L++ + K +DS
Sbjct: 66 GDYLRIDFNISFPALSCEFASVDVSDVLGTNRLNITKTVRKFSIDSN 112
>gi|449468488|ref|XP_004151953.1| PREDICTED: protein disulfide-isomerase 5-4-like [Cucumis sativus]
Length = 481
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 72/223 (32%), Positives = 113/223 (50%), Gaps = 37/223 (16%)
Query: 182 LIDQCKRE-GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
L D+ E G ++R G GC I G++ V KV G+ A H +F
Sbjct: 274 LEDKSNNETGNVKRPAPSAG-GCRIEGYVRVKKVPGSLVIAARSESH----------SFD 322
Query: 241 RDSFNISHKINKLAFGEH--------------FPGVV-NPLDGVRWTQETPSG---MYQY 282
N+SH I+ L+FG + G+ + L+G + + G ++
Sbjct: 323 ASQMNMSHIISHLSFGRKISPKAFSDAKQLIPYIGISHDRLNGRSFINQRDLGANVTIEH 382
Query: 283 FIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEH 341
++++V T V T SG ++ ++ T H S+ +P V F + LSP++V TE
Sbjct: 383 YLQIVKTEVLTRRSGKLLE--EYEYTAHSSVSQS---LYIPVVKFHFVLSPMQVVITENQ 437
Query: 342 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF HF+TNVCAI+GGVFTV+GI+DA +++ R + KK+E+GK
Sbjct: 438 KSFSHFITNVCAIIGGVFTVAGILDALLHNTIR-LMKKVELGK 479
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 65/107 (60%), Gaps = 1/107 (0%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR- 65
K++S+D Y KI D T SG +++V+++ M+ LF EL YL+ T T ++VD S
Sbjct: 6 KLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSTD 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
G+ LR++F+++FPAL C +VD D+ G L++ I K +DS
Sbjct: 66 GDFLRMDFNISFPALSCEFAAVDVNDVLGTNRLNITKTIRKFSIDSN 112
>gi|224013160|ref|XP_002295232.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969194|gb|EED87536.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 488
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 110/449 (24%), Positives = 179/449 (39%), Gaps = 105/449 (23%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLR 70
+ A+ + T G V++L+S +M+LLFF E + + + + VD + + LR
Sbjct: 42 MHAFSWFKDALRDATKIGVVMSLLSIFIMILLFFCETYAFSRSTISSTIAVDPNSEQLLR 101
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIE-----SRQDGIGA 125
+NF+VT L C SVD D G ++ DI K LD QG + + Q +
Sbjct: 102 LNFNVTLYDLHCDYASVDIWDTLGTNQQNITKDIVKWNLDDQGQRKKFAGRNAEQRAVTH 161
Query: 126 PKIDKPLQ-------------------------RHGGR--LEHNETYCGSC--------- 149
+ D+ LQ RH G+ ++ +C C
Sbjct: 162 EEHDETLQDLADALGGELHAVALDPESIVEFHKRHNGQAIIDFYAPWCIWCQRLEPTWEK 221
Query: 150 YGAESSDE---------DCCNN---CEEVR-EAYRKKGW----ALSNPD---------LI 183
+ + SDE DC + C++ R A+ W PD L+
Sbjct: 222 FARQVSDERINLGVGKVDCVTHAQLCKDQRVMAFPTLRWFENGKAVMPDYRGDRTVDALV 281
Query: 184 DQCKR-----EGFL-QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
D KR EG + +E+ GC I G L VN+V G F H+ +H +
Sbjct: 282 DYAKRRVGSNEGSNDEEFEEDHHPGCLISGHLMVNRVPGRFQIEARSVNHE----LHSAM 337
Query: 238 AFQRDSFNISHKINKLAFGE------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTV- 290
N++H+++ L FG H V+ D V + + M K PT
Sbjct: 338 T------NLTHRVHDLTFGALSGPPGHMLHVLPFFDTVPEKYKHTNPMQD---KYYPTYE 388
Query: 291 YTDVSGHTIQSNQFSVTEHFRSS-------EQGRL-----QTLPGVFFFYDLSPIKVTFT 338
+ H ++ + F S EQ +L +P + F +DLSP+ V +
Sbjct: 389 FHQAFHHHLKIISTHIDYLFSRSTVLYQILEQSQLVFYEEVNVPEIQFSFDLSPMSVNVS 448
Query: 339 EEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
+E + ++T++CAI+GG +T G+I+A
Sbjct: 449 KEGRKWYEYVTSLCAIIGGTYTTLGLINA 477
>gi|119928709|ref|XP_001256294.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Bos taurus]
Length = 144
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 48/105 (45%), Positives = 67/105 (63%), Gaps = 5/105 (4%)
Query: 282 YFIKVVPTVYTDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTE 339
Y +K+VPTVY D SG S Q++V E+ S GR+ +P ++F YDLSPI V +TE
Sbjct: 41 YILKIVPTVYEDKSGKQQFSYQYTVANKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTE 98
Query: 340 EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
F+T +CAI+GG FTV+GI+D+ I+ A KKI++GK
Sbjct: 99 RRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW-KKIQLGK 142
>gi|356549839|ref|XP_003543298.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 480
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 101/204 (49%), Gaps = 39/204 (19%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 260
GC I G++ V KV GN F+ + H +F N+SH IN L+FG P
Sbjct: 293 GCRIDGYVRVKKVPGNLIFSARSNAH----------SFDASQMNMSHVINHLSFGRKVSP 342
Query: 261 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 303
V+ + L+G + G ++++++V T I
Sbjct: 343 RVMSDVKRLIPYVGSSHDRLNGRSFINTHDLGANVTMEHYLQIVKT-------EVITRKD 395
Query: 304 FSVTEHFRSSEQGRL-QTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+ + E + + + Q+L P F +LSP++V TE SF HF+TNVCAIVGG+FT
Sbjct: 396 YKLVEEYEYTAHSSVAQSLHIPVAKFHLELSPMQVLITENQKSFSHFITNVCAIVGGIFT 455
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
V+GI+DA +++ R + KK+E+GK
Sbjct: 456 VAGIMDAILHNTIR-LMKKVELGK 478
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 68/108 (62%), Gaps = 1/108 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+KI+S+D Y KI D + SG +++V+++ M+ LF EL YL+ T T+++VD +S
Sbjct: 5 SKIKSVDFYRKIPRDLTEASLSGAGLSIVAALAMIFLFGMELNSYLSVTTSTQVIVDKSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
G+ LRI+F+++FPAL C +VD D+ G L++ + K +DS
Sbjct: 65 DGDYLRIDFNISFPALSCEFAAVDVSDVLGTNRLNLTKTVRKFSIDSN 112
>gi|449489976|ref|XP_004158474.1| PREDICTED: protein disulfide-isomerase 5-3-like [Cucumis sativus]
Length = 224
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 72/223 (32%), Positives = 113/223 (50%), Gaps = 37/223 (16%)
Query: 182 LIDQCKRE-GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
L D+ E G ++R G GC I G++ V KV G+ A H +F
Sbjct: 17 LEDKSNNETGNVKRPAPSAG-GCRIEGYVRVKKVPGSLVIAARSESH----------SFD 65
Query: 241 RDSFNISHKINKLAFGEH--------------FPGVV-NPLDGVRWTQETPSG---MYQY 282
N+SH I+ L+FG + G+ + L+G + + G ++
Sbjct: 66 ASQMNMSHIISHLSFGRKISPKAFSDAKQLIPYIGISHDRLNGRSFINQRDLGANVTIEH 125
Query: 283 FIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEH 341
++++V T V T SG ++ ++ T H S+ +P V F + LSP++V TE
Sbjct: 126 YLQIVKTEVLTRRSGKLLE--EYEYTAHSSVSQS---LYIPVVKFHFVLSPMQVVITENQ 180
Query: 342 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF HF+TNVCAI+GGVFTV+GI+DA +++ R + KK+E+GK
Sbjct: 181 KSFSHFITNVCAIIGGVFTVAGILDALLHNTIR-LMKKVELGK 222
>gi|344250048|gb|EGW06152.1| UPF0474 protein C5orf41-like [Cricetulus griseus]
Length = 745
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 87/186 (46%), Gaps = 25/186 (13%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 125 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 172
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 173 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 232
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
E+ S GR+ +P ++F YDLSPI V +TE F+T A VF +G+
Sbjct: 233 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTREAAEWFVFWGTGM-- 288
Query: 367 AFIYHG 372
YHG
Sbjct: 289 --AYHG 292
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 6/98 (6%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTS 64
+ D Y K+ +D T++G +I++ + +L LF SEL ++ +L V D
Sbjct: 24 LHRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 83
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
G + ++ +++ P L C ++ +D D G H+D
Sbjct: 84 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 121
>gi|440293957|gb|ELP87004.1| hypothetical protein EIN_318630 [Entamoeba invadens IP1]
Length = 316
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 93/188 (49%), Gaps = 22/188 (11%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGK-SFHQ-------------SGVHVHDILAFQRDSFNIS 247
GC ++G ++V++V+G FH A GK ++ Q + +H H + SFN +
Sbjct: 117 GCRMHGTMKVSRVSGEFHVAFGKIAYRQQRTNQVITATQKHTQMHTHQFTMQEMKSFNPT 176
Query: 248 HKINKLAFGEHFPGVVN-----PLDGVRWT-QETPSGMYQYFIKVVPTVYTDVSGHTIQS 301
H IN LAF P PL+G +T + + Y Y+I V+PT+ HT +S
Sbjct: 177 HFINNLAFSNT-PSYTTHAGETPLNGKEYTLKGYDNARYTYYINVIPTL-NKYPTHTTRS 234
Query: 302 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 361
Q S+ E F G T PGVFF Y+LSP V SF H + + AI+GGV+ +
Sbjct: 235 YQLSINERFVPVTYGPTFTQPGVFFKYELSPYIVINEMMDHSFAHSIASTAAIIGGVWII 294
Query: 362 SGIIDAFI 369
G I F+
Sbjct: 295 FGWISRFL 302
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 56/106 (52%), Gaps = 4/106 (3%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
++ D +PK+ + ++ + G+++++S ++ +L F+E ++N + + VDT +
Sbjct: 9 LKQFDMFPKVPNNVKIKSNATGILSIISYAIIGILIFNEAYNFMNPNWVSHVDVDTVKAG 68
Query: 68 TL---RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI-FKKRL 109
L IN D+TFP + C+ D +I+G L V I F RL
Sbjct: 69 VLPNIYINVDITFPNMKCADFGFDVTEITGSLQLGVTEGIKFDDRL 114
>gi|444316650|ref|XP_004178982.1| hypothetical protein TBLA_0B06400 [Tetrapisispora blattae CBS 6284]
gi|387512022|emb|CCH59463.1| hypothetical protein TBLA_0B06400 [Tetrapisispora blattae CBS 6284]
Length = 355
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/370 (22%), Positives = 162/370 (43%), Gaps = 69/370 (18%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M +++ DA+PK ++ ++ G+ ++++ +LL+ ++E + + + +++
Sbjct: 1 MAGLKTFDAFPKTDDQHIKKSKKVGLTSILTYFFLLLITWTEFGNFFGGYIDQQYIINND 60
Query: 65 R-----GETLRINFDVTFPALPCSILSVDAMDISGEQ-----HLDVKHDIFKKRLDSQGN 114
+ E + IN D+ + LPC L V++ DI+G+ +L + F S+ N
Sbjct: 61 KLQDQVHELVHINLDI-YIKLPCKWLDVNSRDITGDHTFVSNYLTFEDMPFFIPYGSKLN 119
Query: 115 VIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 174
++ I P ID+ L E + +R+K
Sbjct: 120 ILHD----IVTPNIDQIL------------------------------GEAIPAEFREK- 144
Query: 175 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHV 233
L +D+ + + E +GC+++G + VN+V G F A G +
Sbjct: 145 --LDTIIPLDENGKPLY-------ELDGCHVFGQIPVNRVQGELQFTAKGYGYMNWERTP 195
Query: 234 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYT 292
++++ N H IN+ +FG FP + NPLD + + P + Y VVP+ Y
Sbjct: 196 YELI-------NFDHVINEFSFGNFFPYIDNPLDNTAKINLDDPVTSWIYDTSVVPSYYR 248
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRLQT----LPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
+ G + + Q+SV+++ + + T +PG+FF YD + + T+ +SF FL
Sbjct: 249 KL-GAEVDTFQYSVSQYSYNGTSLQKMTSSTSVPGIFFKYDFEALSLVLTDHRISFFQFL 307
Query: 349 TNVCAIVGGV 358
+ AI+ V
Sbjct: 308 IRLVAILSFV 317
>gi|308807242|ref|XP_003080932.1| Thioredoxin/protein disulfide isomerase (ISS) [Ostreococcus tauri]
gi|116059393|emb|CAL55100.1| Thioredoxin/protein disulfide isomerase (ISS) [Ostreococcus tauri]
Length = 533
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 70/114 (61%), Gaps = 1/114 (0%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS-RG 66
IR +D Y K+ +F T G +I+++S+++ML LF SEL Y + ETK++VD S G
Sbjct: 26 IRGMDFYRKVPREFSEGTLGGSIISILSAVLMLYLFLSELGKYSTSSFETKVVVDRSVDG 85
Query: 67 ETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
E LRINF+++FPAL C SVD D G ++ +FK+ +D++ N I Q
Sbjct: 86 ELLRINFNLSFPALSCEFASVDVGDALGLNRFNLTKTVFKRAIDAEMNPIGPLQ 139
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 112/228 (49%), Gaps = 28/228 (12%)
Query: 168 EAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFA---PGK 224
EA R+ + L P +D +R G GC I GF+ V KV G+ + P
Sbjct: 321 EAAREANFNLQLPASVDVQRRI---------MGPGCAITGFVLVKKVPGHLWISASSPDH 371
Query: 225 SFHQSGVHVHDILAFQRDSFNISHKIN--------KLAFGEHFPGVVNPLDGVRWTQETP 276
SFH +++ ++ + F H+++ K GE + L G + E+
Sbjct: 372 SFHGQNMNMTHVV----NHFYFGHQLSDDRRRYLEKFHAGEKAGDWHDRLAGQTFVSESA 427
Query: 277 SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 336
++++++ TV T ++ + FSV E+ + + + LP F Y SP+++
Sbjct: 428 HISHEHYLQ---TVLTSIAPRGRFALPFSVYEYTQHAHAVH-EPLPKAKFHYQPSPMQIA 483
Query: 337 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+EE ++F F+T++ AI+GGV++V GI D +++ ++KK+E+GK
Sbjct: 484 VSEERMAFYSFITSLMAIIGGVYSVMGIADGVLFNSIALVRKKLELGK 531
>gi|168012320|ref|XP_001758850.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689987|gb|EDQ76356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 487
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 100/202 (49%), Gaps = 35/202 (17%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG----- 256
GC + GF+ V KV G + SG H +F S N++H + +FG
Sbjct: 300 GCRVEGFVRVKKVPGELMISA-----HSGSH-----SFDATSMNMTHYVGFFSFGRKTSW 349
Query: 257 -------EHFPGV---VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS----N 302
E P + ++ L G + E + + ++++VV T ++ H Q
Sbjct: 350 RSVHWVNEMLPALDSNIDRLTGQVFPSEYENITHDHYLQVVKTEV--ITLHRKQDLRVLE 407
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
Q+ T H S + +P V F Y+LSP++V E SF HFLTN+CAI+GGVFTV+
Sbjct: 408 QYDYTAH---SNMIQSTKVPVVKFHYELSPMQVLVKENPKSFSHFLTNLCAIIGGVFTVA 464
Query: 363 GIIDAFIYHGQRAIKKKIEIGK 384
GIID+ + H I KK+E+GK
Sbjct: 465 GIIDSML-HNAMHIMKKVELGK 485
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 65/106 (61%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+K++S+D Y KI D + SG +++++++ M+ LF EL YL+ T T ++VD SR
Sbjct: 5 SKLKSIDFYRKIPRDLTEASLSGAGLSIIAALTMVFLFGMELSAYLSTTTSTSVVVDRSR 64
Query: 66 -GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LRI+F+++FPAL C SVD D+ G ++ + K +D
Sbjct: 65 DGEYLRIDFNLSFPALSCEFASVDVSDVLGTHRFNLTKTVRKYPID 110
>gi|194768867|ref|XP_001966532.1| GF22223 [Drosophila ananassae]
gi|190617296|gb|EDV32820.1| GF22223 [Drosophila ananassae]
Length = 448
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 101/393 (25%), Positives = 178/393 (45%), Gaps = 53/393 (13%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV----DTS 64
++LDA+ K+ E + T GG ++L+S ++++ L ++EL Y + ET ++ D S
Sbjct: 19 KNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELHYYWH---ETDIVYQFQPDMS 75
Query: 65 RGETLRINFDVTFPALPCSILS-VDAMDISGEQHLDVKHDIFK----------------K 107
+ ++++ D+T A+PC+ LS VD MD + + D+F
Sbjct: 76 LDDQVQMHVDITV-AMPCASLSGVDLMD-------ETQQDVFAYGTLQREGVWWEMNDHD 127
Query: 108 RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVR 167
RL Q I+++ + L + R H + S + A +
Sbjct: 128 RLQFQAIQIQNQYLREEFHSLADVLFKDIMRDTHPQRESPSTFPAAPPPPGALPVALDFH 187
Query: 168 EAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSF 226
+ ++ A +NP+ D C+ G L G N KVAG H G
Sbjct: 188 MS-QQAAAAAANPETKYDACRLHGTL---------GIN--------KVAGVLHLVGGAQP 229
Query: 227 HQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKV 286
H ++ +R N +H+IN+L+FG++ +V PL+G + + QYF+KV
Sbjct: 230 VVGLFEDHWMIELRRMPANFTHRINRLSFGQYSRRIVQPLEGDETIIQEEATTVQYFLKV 289
Query: 287 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFL 345
VPT + TI + Q+SVTE+ R + R PG++F YD S +K+ +
Sbjct: 290 VPTEIRQ-TFSTINTFQYSVTENVRKLDSERNSYGSPGIYFKYDWSALKIVVDNDRDHLA 348
Query: 346 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 378
F+ +C+I+ G+ +SG I++ + QR + +
Sbjct: 349 TFVIRLCSIISGIIVISGAINSLLIAIQRRLLR 381
>gi|67482091|ref|XP_656395.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56473591|gb|EAL51010.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449705171|gb|EMD45274.1| Hypothetical protein EHI5A_018710 [Entamoeba histolytica KU27]
Length = 315
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/183 (34%), Positives = 94/183 (51%), Gaps = 20/183 (10%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGK-SFHQSGV-------------HVHDILAFQRDSFNIS 247
GC +YG ++V++V+G FH A GK SF Q + H+H + SFN +
Sbjct: 116 GCRMYGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175
Query: 248 HKINKLAFGEHFPGVVN----PLDGVRWTQET-PSGMYQYFIKVVPTVYTDVSGHTIQSN 302
H IN L+F V+ PL+G ++T + Y+I V+PT++ S +T+++
Sbjct: 176 HYINHLSFSNTLGSTVHSGETPLNGKKFTLSGFDNARKTYYINVIPTLFKYPS-YTLRTY 234
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
Q SV E G T PGVFF Y+LSP V SF H L +V AI+GGV +
Sbjct: 235 QLSVNERDVPVTYGASFTQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIIGGVLIIM 294
Query: 363 GII 365
G++
Sbjct: 295 GLL 297
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 55/113 (48%), Gaps = 4/113 (3%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M I ++ D + K+ E +T + + +++S +++ LL SE + N + +
Sbjct: 1 MKKIQQFLKECDIFLKVPEKLKIKTNTTKLFSIISYVIIGLLILSETYNFFNPQWVSHVD 60
Query: 61 VDTSRGETL---RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI-FKKRL 109
VDT + L IN D++FP + C +D +I+G L V I F KRL
Sbjct: 61 VDTVKAGVLPNMYINIDMSFPKMNCDDFGLDVTEITGSLQLGVTDGIKFDKRL 113
>gi|302800507|ref|XP_002982011.1| hypothetical protein SELMODRAFT_115492 [Selaginella moellendorffii]
gi|300150453|gb|EFJ17104.1| hypothetical protein SELMODRAFT_115492 [Selaginella moellendorffii]
Length = 476
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 129/284 (45%), Gaps = 49/284 (17%)
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G P I + H + EH S YG +D + + EA K L+ D
Sbjct: 217 GFPSIRIFHKGHDLKDEHGHHEHDSYYGERDTD-----SLVKAMEALVPKETTLALED-- 269
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVA-GNFHFAPGKSFHQSGVHVHDILAFQRD 242
K G ++R G GC I GF+ KV GN + SG H +F
Sbjct: 270 ---KTNGTVKRPAPRAG-GCRIEGFIRAKKVVPGNIIISA-----HSGSH-----SFDAS 315
Query: 243 SFNISHKINKLAFGEH------------FPGVVNPLDGVR-------WTQETPSGMYQYF 283
+ N++H +++ FG +P + + D V + + + + ++
Sbjct: 316 AMNMTHYVSQFTFGRELNFWMRRELYRIYPHLASVYDTVEANLTGRIYVSQHENITHDHY 375
Query: 284 IKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQT--LPGVFFFYDLSPIKVTFTEE 340
++VV T + + +FS+ E + +S +Q +P F Y+LSP++V E
Sbjct: 376 LQVVKTEVVSLR----KRKEFSLLEQYDYTSHSNTIQNTNVPVAKFHYELSPMQVLVKEN 431
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF HF+TNVCAI+GGVFTV+GI+D+ + HG + KKIE+GK
Sbjct: 432 PKSFSHFITNVCAIIGGVFTVAGIVDSML-HGAMRMVKKIELGK 474
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 63/112 (56%), Gaps = 1/112 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR 65
+KI+S+D Y KI D + SG ++L+++ M+ LF EL YL + T ++VD S+
Sbjct: 5 SKIKSIDFYRKIPRDLTEASLSGAGLSLIAAFAMIFLFGMELNNYLTVSSTTNVVVDRSK 64
Query: 66 -GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI 116
GE LRI F+++FPAL C SVD D G ++ + K +D ++
Sbjct: 65 DGEYLRIQFNMSFPALSCEFASVDVSDALGTNRYNLTKTVRKYPIDPNLKIV 116
>gi|356543934|ref|XP_003540413.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 480
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 98/204 (48%), Gaps = 39/204 (19%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
GC I G++ V KV GN + + H +F N+SH IN L+FG
Sbjct: 293 GCRIDGYVRVKKVPGNLIISARSNAH----------SFDASQMNMSHVINHLSFGRKVSL 342
Query: 262 VV---------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 303
V + L+G + G ++++++V T I +
Sbjct: 343 RVMSDVKRLIPYVGSSHDRLNGRSFINTHDLGANVTIEHYLQIVKT-------EVITRKE 395
Query: 304 FSVTEHFRSSEQGRL-QTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
+ + E + + + Q+L P F +LSP++V TE SF HF+TNVCAI+GG+FT
Sbjct: 396 YKLVEEYEYTAHSSVAQSLHIPVAKFHLELSPMQVLITENQKSFSHFITNVCAIIGGIFT 455
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
V+GI+DA I+H + KK+E+GK
Sbjct: 456 VAGIMDA-IFHNTIRLMKKVELGK 478
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 68/108 (62%), Gaps = 1/108 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+KI+S+D Y KI D + SG +++V+++ M+ LF EL YL+ T T+++VD +S
Sbjct: 5 SKIKSVDFYRKIPRDLTEASLSGAGLSIVAALAMIFLFGMELNSYLSVSTSTQVIVDKSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
G+ LRI+F+++FPAL C +VD D+ G L++ + K +DS
Sbjct: 65 DGDYLRIDFNISFPALSCEFAAVDVSDVLGTNRLNLTKTVRKFSIDSN 112
>gi|356545151|ref|XP_003541008.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 453
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 98/201 (48%), Gaps = 33/201 (16%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 260
GC + G++ V KV GN + H +F N+SH IN L+FG+ P
Sbjct: 266 GCRVEGYVRVKKVPGNLIISARSDAH----------SFDASQMNMSHVINNLSFGKKVTP 315
Query: 261 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 303
+ + L+G + G +++I++V T G+ + +
Sbjct: 316 RAMSDVKLLIPYIGSSHDRLNGRSFINTRDLGANVTIEHYIQIVKTEVVTRKGYKLI-EE 374
Query: 304 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 363
+ T H S +P F +LSP++V TE SF HF+TNVCAI+GGVFTV+G
Sbjct: 375 YEYTAH---SSVAHSLDIPVAKFHLELSPMQVLITENQRSFSHFITNVCAIIGGVFTVAG 431
Query: 364 IIDAFIYHGQRAIKKKIEIGK 384
I+D+ +++ R + KKIE+GK
Sbjct: 432 ILDSILHNTIRMV-KKIELGK 451
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 47/70 (67%), Gaps = 1/70 (1%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TSR 65
K++S+D Y KI D + SG +++V+++VM+ LF EL Y++ T T ++VD +S
Sbjct: 6 KLKSVDFYRKIPRDLTEASLSGAGLSIVAALVMMFLFGMELSSYMSVSTSTSVIVDKSSD 65
Query: 66 GETLRINFDV 75
G+ LRI+F++
Sbjct: 66 GDYLRIDFNI 75
>gi|308487907|ref|XP_003106148.1| hypothetical protein CRE_15417 [Caenorhabditis remanei]
gi|308254138|gb|EFO98090.1| hypothetical protein CRE_15417 [Caenorhabditis remanei]
Length = 427
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 102/407 (25%), Positives = 168/407 (41%), Gaps = 61/407 (14%)
Query: 1 MDAIMNKIRSLDAYPKINEDF-----------YSRTFSGGVITLVSSIVMLLLFFSELRL 49
MD ++IR KI EDF + S G I+ V ++ LF +E
Sbjct: 1 MDLGTSEIRQRKGITKIVEDFDIFEKVVENVKEEKKASSGAISFVCFTIIFCLFCTETYT 60
Query: 50 YL-NAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAM--DISGEQHLDVKHDIFK 106
+L + + + VDT E ++ D+ PCS+L V + + SG L
Sbjct: 61 FLFHKKYDYRFAVDTEMDEMPLLDLDIVINT-PCSVLQVASSSDEYSGGDGL-------- 111
Query: 107 KRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEV 166
R Q N +R D ++ + RH ++N + E D+D N E +
Sbjct: 112 LRQTIQKN--PTRFDFTDEEQMYWTILRHAHD-QYNRRGMRALEELEYVDDDIETNLEHL 168
Query: 167 ------REAYRKKGWALSNPDLIDQCKRE---------GFLQRIKE------EEGEGCNI 205
EA K + N + + G Q + + E+G+ C +
Sbjct: 169 ANEKQEEEAAHIKEQRMKNKQTKHRGTGQIMFLVSNGMGMFQLVADNGGADGEDGKACRL 228
Query: 206 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ---RDSFNISHKINKLAFGEHFPGV 262
+G +V K GK + +L F+ + NISH+I K FG PG+
Sbjct: 229 HGKFKVRK---------GKEEKIVMSISNPLLMFEHQEKQPGNISHRIEKFNFGPRIPGL 279
Query: 263 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 322
V PL G E+ +Y+YFIK+VPT HT+ + Q+SVT + ++G +
Sbjct: 280 VTPLAGAEHISESGQDIYRYFIKIVPTKIYGYFTHTL-AYQYSVTFLKKQLKEGE-HSHG 337
Query: 323 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
G+ F Y+ + + + V+ +L +C+I+GGV+ S II+ +
Sbjct: 338 GILFEYEFTANVIEVHKTSVTLFSYLIRICSILGGVYATSTIINNVV 384
>gi|365759132|gb|EHN00939.1| Erv41p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|401842937|gb|EJT44934.1| ERV41-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 285
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 88/158 (55%), Gaps = 13/158 (8%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
GC+I+G + VN+V+G K F + H + + N +H IN+ +FG+ +P
Sbjct: 93 GCHIFGSVPVNRVSGVLQIT-AKGFGYADSHRASL-----EDLNFAHVINEFSFGDFYPY 146
Query: 262 VVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF----RSSEQG 316
+ NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++ SS +G
Sbjct: 147 IDNPLDNTAQFDQDEPLTTYLYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLNKDSSVKG 205
Query: 317 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 206 N-RRVPGIFFKYNFEPLSIVVSDVRISFIQFLVRLVAI 242
>gi|297830752|ref|XP_002883258.1| hypothetical protein ARALYDRAFT_479582 [Arabidopsis lyrata subsp.
lyrata]
gi|297329098|gb|EFH59517.1| hypothetical protein ARALYDRAFT_479582 [Arabidopsis lyrata subsp.
lyrata]
Length = 483
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 106/204 (51%), Gaps = 36/204 (17%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 260
GC + G++ V KV GN + SG H +F N+SH ++ L+FG P
Sbjct: 293 GCRVEGYVRVKKVPGNLVISA-----HSGAH-----SFDSSQMNMSHVVSHLSFGRMISP 342
Query: 261 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPT-VYTDVSG--HTIQ 300
++ + LDG + + G ++++++V T V T SG H++
Sbjct: 343 RLLTDMKRLLPYLGLSHDRLDGKAFINQHEFGANVTIEHYLQIVKTEVITRRSGQEHSLI 402
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
++ T H S + LP F ++LSP+++ TE SF HF+TN+CAI+GGVFT
Sbjct: 403 -EEYEYTAH---SSVAQTYYLPVAKFHFELSPMQILITENPKSFSHFITNLCAIIGGVFT 458
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
V+GI+D+ I+H + KK+E+GK
Sbjct: 459 VAGILDS-IFHNTVRLIKKVELGK 481
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 64/105 (60%), Gaps = 1/105 (0%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TSR 65
K++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 6 KLKSVDFYRKIPRDLTEASLSGAGLSIVAALFMMFLFGMELSSYLEVNTTTAVIVDKSSD 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
G+ LRI+F+++FPAL C SVD D+ G L++ I K +D
Sbjct: 66 GDFLRIDFNISFPALSCEFASVDVSDVLGTNRLNITKTIRKFPID 110
>gi|444732203|gb|ELW72509.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Tupaia chinensis]
Length = 250
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 64/172 (37%), Positives = 87/172 (50%), Gaps = 8/172 (4%)
Query: 181 DLIDQCKR-EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 239
DL Q K + LQ I+ E ++ + + G P H G H H
Sbjct: 63 DLSPQQKEWQRMLQVIQSRLQEEHSLQDVIFKSAFKGTTALPPRAIPHPRG-HAHLAALV 121
Query: 240 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGH 297
DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S
Sbjct: 122 NHDSYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISAD 181
Query: 298 TIQSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 348
T +QFSVTE R + + G+F YDLS + VT TEEH+ F F
Sbjct: 182 T---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFF 230
>gi|431918151|gb|ELK17379.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Pteropus alecto]
Length = 313
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 79/165 (47%), Gaps = 21/165 (12%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 136 KIPLNGGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 183
Query: 254 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 307
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 184 SFGDTLQVRNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVAN 243
Query: 308 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
E+ S GR+ +P ++F YDLSPI V +TE F+T V
Sbjct: 244 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTV 286
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 11 LDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV---DTSRGE 67
D Y K+ +D T++G +I++ + +L LF SEL +L +L V D G
Sbjct: 38 FDIYRKVPKDLTQPTYTGAIISICCCVFILFLFLSELTGFLTTEVVNELYVDDPDKDSGG 97
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQ---HLD 99
+ ++ +++ P L C ++ +D D G H+D
Sbjct: 98 KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHID 132
>gi|357474735|ref|XP_003607653.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355508708|gb|AES89850.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 477
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 100/205 (48%), Gaps = 41/205 (20%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
GC + G++ V KV G+ + H +F N+SH IN L+FG+
Sbjct: 290 GCRVEGYVRVKKVPGSLVVSARSDAH----------SFDASQMNMSHVINHLSFGKK--- 336
Query: 262 VVNP---LDGVRW------------------TQETPSGM-YQYFIKVVPTVYTDVSGHTI 299
V P +D W T++ + +++I+VV T G+ +
Sbjct: 337 -VTPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIEHYIQVVKTEVITRKGYKL 395
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
++ T H S +P F +LSP++V TE SF HF+TNVCAI+GGVF
Sbjct: 396 I-EEYEYTAH---SSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGVF 451
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGK 384
TV+GI+D+ +++ +A+ KKIEIGK
Sbjct: 452 TVAGILDSILHNTIKAM-KKIEIGK 475
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 65/108 (60%), Gaps = 1/108 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++++++ M+ LF EL Y T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSILAALAMMFLFGMELSNYFAVTTSTSVIVDKSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
G+ LRI+F+ +FPAL C SVD D+ G L++ + K +DS+
Sbjct: 65 DGDFLRIDFNFSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSIDSK 112
>gi|224126339|ref|XP_002319814.1| predicted protein [Populus trichocarpa]
gi|222858190|gb|EEE95737.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 64/200 (32%), Positives = 97/200 (48%), Gaps = 28/200 (14%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG-EHFP 260
GC I G++ V KV GN + +SG H +F N+SH I+ +FG + P
Sbjct: 294 GCRIEGYVRVKKVPGNLVISA-----RSGAH-----SFDSAQMNLSHVISHFSFGMKVLP 343
Query: 261 GVV--------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
V+ + L+G + G + V T+V + +
Sbjct: 344 RVMSDVKRLIPHIGRSHDKLNGRSFINHRDVGANVTIEHYLQVVKTEVVTRRSSAEHKLI 403
Query: 307 TEHFRSSEQGRLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
E+ ++ QT +P F ++LSP++V TE SF HF+TNVCAI+GGVFTV+GI
Sbjct: 404 EEYEYTAHSSLAQTVYMPTAKFHFELSPMQVLITENPKSFSHFITNVCAIIGGVFTVAGI 463
Query: 365 IDAFIYHGQRAIKKKIEIGK 384
+D+ I H + KK+E+GK
Sbjct: 464 LDS-ILHNTFRMMKKVELGK 482
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 65/106 (61%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
NK++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 5 NKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELNNYLTVNTSTSVIVDNSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LRI+F+++FP+L C SVD D+ G L++ I K +D
Sbjct: 65 DGEFLRIDFNLSFPSLSCEFASVDVSDVLGTNRLNITKTIRKFSID 110
>gi|365991164|ref|XP_003672411.1| hypothetical protein NDAI_0J02760 [Naumovozyma dairenensis CBS 421]
gi|343771186|emb|CCD27168.1| hypothetical protein NDAI_0J02760 [Naumovozyma dairenensis CBS 421]
Length = 341
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 83/376 (22%), Positives = 159/376 (42%), Gaps = 72/376 (19%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M+K+ + DA+PK E+ ++ GG+ ++++ + +L + ++E+ Y E + +VD
Sbjct: 1 MSKLGAFDAFPKTEEEHVKKSTRGGLSSILTYLFLLFMIYNEVGRYFGGFIEQQYIVDIE 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
E +INFD+ F C ++ V +D+ + N+ S D I
Sbjct: 61 IQERAQINFDI-FLNTTCDLIDVRIVDL------------------TSDNMKRSVSDEIS 101
Query: 125 APKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID 184
+ + +G R+ Y E + ++
Sbjct: 102 FEDLTFYIP-YGTRI----NILNGIYTTEFDE-------------------------VLT 131
Query: 185 QCKREGFLQRIKEEEGE-------GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
Q F RI E E C+++G ++VN++ G + S +++D
Sbjct: 132 QAIPYEFGMRIDERPPEDDMPNINACHLFGSVDVNRLPGILEISTN-----STGNIND-- 184
Query: 238 AFQRDSFNISHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSG 296
+ + +H IN+L+FGE FP + NPLD + + P Y Y++ V+PT+Y + G
Sbjct: 185 ----NGKSFAHVINELSFGEFFPFIDNPLDNTAKVLPDQPLTTYSYYLTVIPTIYEKL-G 239
Query: 297 HTIQSNQFSVTEH-FRSSEQGRLQTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 353
+ +NQ+S+ E F+ + QT + YD + + + + F+ FL + A
Sbjct: 240 KRVNTNQYSLNEFIFKHIYNVKSQTQYDEAIRIHYDFDALSIFMHDTRLDFIQFLVRLVA 299
Query: 354 IVGGVFTVSGIIDAFI 369
I+ V ++ + FI
Sbjct: 300 ILSFVVYIASWVFRFI 315
>gi|159483443|ref|XP_001699770.1| protein disulfide isomerase [Chlamydomonas reinhardtii]
gi|158281712|gb|EDP07466.1| protein disulfide isomerase [Chlamydomonas reinhardtii]
Length = 474
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 202 GCNIYGFLEVNKVAGNFHF---APGKSFHQSGVHV-HDILAFQ---RDSFNISHKINKLA 254
GCN+ GF+ V KV G HF + G SF + +++ H I +F R S ++ +L
Sbjct: 286 GCNLAGFVMVKKVPGTVHFVARSEGHSFDHTWMNMTHMIHSFHVGTRPSPRKYQQLKRLH 345
Query: 255 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVV-PTVYTDVSGHTIQSNQFSVTEHFRSS 313
+ L + E ++++++VV T+ S HT + + T H S
Sbjct: 346 PAGLTADWADKLHDQLFVSEHTQSTHEHYLQVVLTTIEPRHSRHTGNYDAYEYTAHSHSY 405
Query: 314 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 373
+ ++P F YDLSPI++ E + FLT CAI+GGVFTV+GI+DA +Y
Sbjct: 406 QS---DSIPSARFTYDLSPIQILVHETSKPWYQFLTTSCAIIGGVFTVAGILDALLYQSF 462
Query: 374 RAIKKKIEIGK 384
+ + KK+ +GK
Sbjct: 463 KVV-KKLNLGK 472
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 80/134 (59%), Gaps = 6/134 (4%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + ++++++D + KI D T +G +++V++++M+LLF +EL +L+ T ++L+
Sbjct: 1 MVRLFSRLKAIDFFKKIPSDLTEATLTGAWLSIVAAVLMILLFVAELSAFLSTTTSSQLV 60
Query: 61 VDTS-RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKR----LDSQGNV 115
VD S + E L++NF+++FPAL C +VD D G + +++ + K ++ QG
Sbjct: 61 VDRSPQNELLKLNFNISFPALSCEFATVDVSDSLGTKRMNLTKTVRKVPITLDMERQGAA 120
Query: 116 IESRQDGIGAPKID 129
+E +G PK D
Sbjct: 121 VEDTAHKVG-PKYD 133
>gi|428175103|gb|EKX43995.1| hypothetical protein GUITHDRAFT_159761 [Guillardia theta CCMP2712]
Length = 475
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 99/206 (48%), Gaps = 28/206 (13%)
Query: 195 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 254
+ G GC + G L V + APG Q+ V D F ++ ++SH +N L+
Sbjct: 282 VDSHNGVGCMVSGLLHVQR-------APGMLKVQA---VSDSHEFNWETMDVSHTVNHLS 331
Query: 255 FGE------------HFPGVVNPLDGVRWT--QETPSGMYQYFIKVVPTVYTDVSGHTI- 299
FG H V LD +T Q P+ +++++KVV T S +
Sbjct: 332 FGPFLSETAWMVLPPHIAASVGSLDDRSFTSDQHVPT-THEHYVKVVRHEVTPPSSWKVA 390
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
Q + H S+ + +P V YD+ PI V F E+ +F HF+TN+CAIVGGVF
Sbjct: 391 QITSYGYVVH--SNNIQKAGEVPTVRINYDILPIIVQFHEKKQAFYHFVTNLCAIVGGVF 448
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGKF 385
TV+GII + + ++KK E+GK
Sbjct: 449 TVAGIIASLMDKSINLMRKKQELGKL 474
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 70/128 (54%), Gaps = 7/128 (5%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSR----TFSGGVITLVSSIVMLLLFFSELRLYLNAVTE 56
M + ++S+D Y K+ D + SG ++++++++M+ L +EL YL +E
Sbjct: 1 MSGFLQGLKSVDFYRKLKRDLQQELTEASVSGAALSIIAAVIMIGLVAAELTAYLTVQSE 60
Query: 57 TKLLVD---TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG 113
+++++D +S +TL++NF+ TFP L C SVDA + G + + K RLD G
Sbjct: 61 SRVVLDHFESSSDDTLQVNFNFTFPHLKCDYASVDATNFMGTHDAGLAARVSKIRLDKNG 120
Query: 114 NVIESRQD 121
N++ D
Sbjct: 121 NLVGRHDD 128
>gi|217072996|gb|ACJ84858.1| unknown [Medicago truncatula]
gi|388501234|gb|AFK38683.1| unknown [Medicago truncatula]
Length = 243
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 100/205 (48%), Gaps = 41/205 (20%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
GC + G++ V KV G+ + H +F N+SH IN L+FG+
Sbjct: 56 GCRVEGYVRVKKVPGSLVVSARSDAH----------SFDASQMNMSHVINHLSFGKK--- 102
Query: 262 VVNP---LDGVRW------------------TQETPSGM-YQYFIKVVPTVYTDVSGHTI 299
V P +D W T++ + +++I+VV T G+ +
Sbjct: 103 -VTPRAMIDVKHWIPYLGINHDRLNGRSFVNTRDLEGNVTIEHYIQVVKTEVITRKGYKL 161
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
++ T H S +P F +LSP++V TE SF HF+TNVCAI+GGVF
Sbjct: 162 -IEEYEYTAH---SSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGVF 217
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGK 384
TV+GI+D+ +++ +A+ KKIEIGK
Sbjct: 218 TVAGILDSILHNTIKAM-KKIEIGK 241
>gi|325185550|emb|CCA20033.1| thioredoxinlike protein putative [Albugo laibachii Nc14]
Length = 503
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 96/193 (49%), Gaps = 12/193 (6%)
Query: 201 EGCNIYGFLEVNKVAGNFHFAPGK---SFHQSGVHVHDI---LAFQRDSFNISHKINKLA 254
EGC + G L VN+V F SF G++V + L+F + + S K +L+
Sbjct: 316 EGCEVSGSLNVNRVPSRLVFTARSKDLSFDLRGINVTHVVHHLSFGQVTRKQSTKSTQLS 375
Query: 255 FG-EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 313
+HFP LDG + E + ++F+ V+ + + + + + RS+
Sbjct: 376 MSFDHFP-----LDGKTFRTENENITVEHFLSVIGVDHMEAKSKHMGLVERTYQIVARSN 430
Query: 314 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 373
+ LP F +D+SP+ + + + F FLT++CAIVGG+ T+ G +DA YH
Sbjct: 431 QYNATDMLPAALFTFDISPLVIQMSSDSTPFYRFLTSLCAIVGGMVTIIGFVDAGAYHAM 490
Query: 374 RAIKKKIEIGKFS 386
+IK+K ++GK +
Sbjct: 491 NSIKRKRQLGKLN 503
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 6/141 (4%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + D + K+ E R+ G V T+++ ++ + L R Y + + ++
Sbjct: 1 MTMVPKSFSKFDLFRKVPEHLSERSSLGTVFTVLTLVLSVYLITVNFRSYQDTSIHSIVV 60
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDV----KHDIFKKRLDSQGNVI 116
+D + + LRINF+++ A+PC SVD D G Q +++ +H S GNV
Sbjct: 61 MDDHQEDQLRINFNISLLAIPCQFASVDVSDYIGMQLINITRHLRHFQLATTAHSPGNV- 119
Query: 117 ESRQDGIGAPKIDKPLQRHGG 137
R I DK L GG
Sbjct: 120 -QRVQEIVIHDGDKGLPTWGG 139
>gi|18402672|ref|NP_566664.1| protein PDI-like 5-3 [Arabidopsis thaliana]
gi|75273652|sp|Q9LJU2.1|PDI53_ARATH RecName: Full=Protein disulfide-isomerase 5-3; Short=AtPDIL5-3;
AltName: Full=Protein disulfide-isomerase 12;
Short=PDI12; AltName: Full=Protein disulfide-isomerase
8-1; Short=AtPDIL8-1; Flags: Precursor
gi|11994143|dbj|BAB01164.1| unnamed protein product [Arabidopsis thaliana]
gi|15215847|gb|AAK91468.1| AT3g20560/K10D20_9 [Arabidopsis thaliana]
gi|332642877|gb|AEE76398.1| protein PDI-like 5-3 [Arabidopsis thaliana]
Length = 483
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 97/200 (48%), Gaps = 28/200 (14%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 260
GC + G++ V KV GN + SG H +F N+SH ++ +FG P
Sbjct: 293 GCRVEGYVRVKKVPGNLVISA-----HSGAH-----SFDSSQMNMSHVVSHFSFGRMISP 342
Query: 261 GVV--------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 306
++ + LDG + + G + TV T+V +
Sbjct: 343 RLLTDMKRLLPYLGLSHDRLDGKAFINQHEFGANVTIEHYLQTVKTEVITRRSGQEHSLI 402
Query: 307 TEHFRSSEQGRLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 364
E+ ++ QT LP F ++LSP+++ TE SF HF+TN+CAI+GGVFTV+GI
Sbjct: 403 EEYEYTAHSSVAQTYYLPVAKFHFELSPMQILITENPKSFSHFITNLCAIIGGVFTVAGI 462
Query: 365 IDAFIYHGQRAIKKKIEIGK 384
+D+ I+H + KK+E+GK
Sbjct: 463 LDS-IFHNTVRLVKKVELGK 481
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 64/105 (60%), Gaps = 1/105 (0%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TSR 65
K++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 6 KLKSVDFYRKIPRDLTEASLSGAGLSIVAALFMMFLFGMELSSYLEVNTTTAVIVDKSSD 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
G+ LRI+F+++FPAL C SVD D+ G L++ + K +D
Sbjct: 66 GDFLRIDFNISFPALSCEFASVDVSDVLGTNRLNITKTVRKFPID 110
>gi|403372594|gb|EJY86197.1| hypothetical protein OXYTRI_15812 [Oxytricha trifallax]
Length = 349
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/390 (24%), Positives = 163/390 (41%), Gaps = 78/390 (20%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLY--LNAVTETKL-LV 61
M ++LD + K+ + T GG++++ S V+L+LF E+ Y LN +T + +
Sbjct: 1 MRVFKNLDYFRKVAPEHTKPTVIGGLVSICSLSVILMLFCYEINDYLKLNIKKDTYIGAL 60
Query: 62 DTSRG---ETLRINFDVTFPALPCSILSVDAMD-ISGEQHLDVKHDIFKKRLDSQGNVIE 117
D G E + +N D+TFP +PC ++ VD +S ++ +IF++R+ + G V++
Sbjct: 61 DRQPGVDVEFINMNLDITFPHVPCFMIDVDQRSTVSQSDKEEINKNIFRRRIGADGQVLD 120
Query: 118 SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWAL 177
S P + A S E C N + + R G +
Sbjct: 121 SVTPDFNNPSV----------------VVKDLADALISGESC--NIKGRIKLERVTGQII 162
Query: 178 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 237
N R GF+Q ++ + + VA F HV + L
Sbjct: 163 MNFQ-----NRVGFVQELQRSKPD------------VAAKLSFG----------HVINSL 195
Query: 238 AFQRDSFNISHKIN--KLAFGEHFPGVVNPLDGVR---WTQETPSGMYQYFIKVVPTVYT 292
F H+ N K FG + +D V + + S Y YF K+VP V+
Sbjct: 196 TFGE-----PHQQNAIKKRFGNTDHTQFDMMDFVEDSLYENDKGSRDYFYFFKLVPHVFI 250
Query: 293 D-VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
D ++ QS +S+ + ++S+ +Q P + YD +P+ + T++ FL NV
Sbjct: 251 DEINLEQYQSFSYSLNHNSKASQ---VQNFPQITMIYDFAPVNMKITKQQRDLSRFLVNV 307
Query: 352 ------------CAIVGGVFTVSGIIDAFI 369
CAI+GG+F + G+I+ +
Sbjct: 308 SQYDLFISYMQLCAIIGGIFVIFGLINRLL 337
>gi|123435131|ref|XP_001308935.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121890639|gb|EAX96005.1| hypothetical protein TVAG_369150 [Trichomonas vaginalis G3]
Length = 353
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/224 (25%), Positives = 100/224 (44%), Gaps = 15/224 (6%)
Query: 146 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 205
C C+ + + CCN C+ ++E Y+ P+ QC+ R E C +
Sbjct: 127 CYPCFKVQFHNYTCCNGCDRLKENYKLNNLT-PEPEKWPQCQTNA---RPDINSSEKCLV 182
Query: 206 YGFLEVNKVAGNFHFAPGKSFH-QSGVHVHDIL-AFQRDSFNISHKINKLAFGEHFPGVV 263
G + VN+V G+FH A G++ + G H+H++L F +F SH I + FG
Sbjct: 183 KGKVSVNRVRGSFHIAAGRNIYLNDGSHIHELLDDFPNLAF--SHAIEHIRFGPRIITAK 240
Query: 264 NPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 322
PL V +E + + Y + V P ++ + +S +++V H + P
Sbjct: 241 QPLQNLVMRAKENLTVTHDYSLLVTPVIFVADNQFIEKSFEYTVYLHPVQDKD------P 294
Query: 323 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 366
G++F Y +P + T SF FL + G++ ++ IID
Sbjct: 295 GIYFDYQFTPYTIQITWISRSFRGFLISTAGFTAGLYAIASIID 338
>gi|339244785|ref|XP_003378318.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Trichinella spiralis]
gi|316972786|gb|EFV56437.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Trichinella spiralis]
Length = 334
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 73/131 (55%), Gaps = 6/131 (4%)
Query: 258 HFPGVVNPLDGVRWTQETPSG----MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 313
+ PG NPL ++P Y Y +K+VPTVY +++G+ + Q++
Sbjct: 130 NLPGNFNPLMNAE-VLDSPVDNFPFSYDYILKIVPTVYENIAGNMKHAYQYTYARKTYIE 188
Query: 314 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 373
QT P ++F YD +PI V + E FLT++CAI+GG FTV+G+ID+F +
Sbjct: 189 MSFTGQTNPTLWFRYDFTPITVKYHERRQPLYIFLTSICAIIGGTFTVAGLIDSFFFTAS 248
Query: 374 RAIKKKIEIGK 384
+ + KK+E+GK
Sbjct: 249 Q-LYKKVELGK 258
>gi|357452761|ref|XP_003596657.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355485705|gb|AES66908.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 482
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 120/267 (44%), Gaps = 40/267 (14%)
Query: 138 RLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 197
R +H S YG +D E + ++ + + L+ D ++ + +R
Sbjct: 234 RSDHGHHEHESYYGDRDTDS-LVKTMENILASFPSEYYKLALEDKLNVTEDS---KRPAP 289
Query: 198 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 257
G GC I G++ V KV GN + H +F N+SH ++ L+FG+
Sbjct: 290 SSG-GCRIEGYVRVKKVPGNLIISARSDAH----------SFDASQMNMSHAVHHLSFGK 338
Query: 258 HF------------PGVVNP---LDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTI 299
P V N LDG+ + G ++++++V T G+ +
Sbjct: 339 KLSPKLMSDVQRLIPYVGNSHDRLDGLSFINSHDFGANVTLEHYLQIVKTEVITRQGYQL 398
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT--FTEEHVSFLHFLTNVCAIVGG 357
++ T H S +P F LSP++V TE+H SF HF+TNVCAIVGG
Sbjct: 399 -VEEYEYTAH---SSLAHSLHVPVARFHLQLSPMQVCVLITEDHKSFSHFITNVCAIVGG 454
Query: 358 VFTVSGIIDAFIYHGQRAIKKKIEIGK 384
VFTV+GI ++ I H + +K+E+GK
Sbjct: 455 VFTVAGITES-ILHNTIRLMRKVELGK 480
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 66/108 (61%), Gaps = 1/108 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+KI+S+D Y KI D + SG +++V+++ M+ LF EL YL+ T T +++D +S
Sbjct: 5 SKIKSVDFYRKIPRDLTEASLSGAGLSIVAALAMMFLFGMELNEYLSVHTSTSVIIDKSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
GE LRI+F+++F AL C SVD D+ G +++ + K +DS
Sbjct: 65 DGEFLRIDFNLSFHALSCEFASVDVSDVLGTNRMNLTKTVRKFSIDSN 112
>gi|145479237|ref|XP_001425641.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392712|emb|CAK58243.1| unnamed protein product [Paramecium tetraurelia]
Length = 326
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 82/306 (26%), Positives = 126/306 (41%), Gaps = 56/306 (18%)
Query: 24 RTFSGGVITLVSSIVMLLLFFSEL-RLYLNAVTETKLLVDTSR-GETLRINFDVTFPALP 81
+T GG++ LV+ + L E+ R + V T +DT+ E +R+N ++T +
Sbjct: 16 KTTCGGILALVTIFSVGFLIIGEIIRSFQLEVLST---IDTTNVDERIRVNLNITVHDMT 72
Query: 82 CSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH 141
C LS+D D++G D+++ I K R+ G I +++ ++ L H
Sbjct: 73 CFALSLDQQDVTGTHLEDMEYTIHKLRI-RDGRFI-NKEYAENVKLFEQSLYHWNW---H 127
Query: 142 NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK---------REGFL 192
N CYGA+ + C C++V AY + W L + I QCK R F
Sbjct: 128 NANEVNDCYGAQLFEGQKCITCQDVLLAYASRDWPLPRKESIQQCKYSYIQQNGRRVLFT 187
Query: 193 QRIKEEE------------------GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 234
+ EE GE C I+G + ++ GNFH SFH G V
Sbjct: 188 EDFGEERRGQQYIDMNDLTAMAFTYGESCQIFGHFYIKRIPGNFHI----SFHGKGQAVS 243
Query: 235 DILAFQRDSFNISHKINKL---------AFGEHFPGVVNPLDGVRWTQETPSGMYQYFIK 285
I +SH IN L FG +F N LDG + QY++K
Sbjct: 244 LI----SQDIQLSHTINWLEFTPQKQGPTFGRYFK-TTNTLDGTTHQLKQKEDT-QYYLK 297
Query: 286 VVPTVY 291
+V + Y
Sbjct: 298 LVESHY 303
>gi|219130117|ref|XP_002185219.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403398|gb|EEC43351.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 421
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 95/405 (23%), Positives = 164/405 (40%), Gaps = 81/405 (20%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A M +R LDA+ K + S++ GG+ITLV++ V LF ++ Y+ + LL+
Sbjct: 72 AAMVGLRKLDAFVKTRPELRSQSAVGGMITLVAATVSAFLFVGQIIHYIIGNPKDSLLLS 131
Query: 63 TSRGETLRINFDVTFPALPCS--ILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
S V+ P +P + L+ ++ + + LD+
Sbjct: 132 KS----------VSIPLIPLTSNYLTTKILERAAKLPLDML------------------- 162
Query: 121 DGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNP 180
I P + H +L+ N + E ++ + K + P
Sbjct: 163 --ITFPYL------HCSQLDFNH-------------DGASLATSEFQKLHPKHSLTMRTP 201
Query: 181 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 240
+ E + + ++G+GC I G + V VAG F K Q + +
Sbjct: 202 -----FQHELSTAKFETKKGQGCTIEGHIRVPVVAGKFEITLNKRTWQQAASILNRQMLM 256
Query: 241 R----------------DSFNISHKINKLAFGEHFP-GVVNPLDGVRWTQETPSG---MY 280
+ D +N +H I+ + FG+ FP + PL+ R G +
Sbjct: 257 QVLGATSEHTSSNDELGDRYNSTHFIHYIRFGDSFPLNIEKPLEKRRHIFRNKYGAMAVQ 316
Query: 281 QYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSE---QGRLQTLPGVFFFYDLSPIKVT 336
+ I++VPT T + + Q+ Q SV + E Q +LPG+ YD SP+ V
Sbjct: 317 EMKIELVPTYTSTWLPTSSRQTYQASVVDSTIEPEHMAQAGASSLPGLAVQYDFSPLTVY 376
Query: 337 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 381
T + L FL+++ +IVGGVF G++ + H +A+ KKI+
Sbjct: 377 HTGGRDNILVFLSSLVSIVGGVFVTVGLVSGCLVHSAQAVAKKID 421
>gi|323303637|gb|EGA57425.1| Erv41p [Saccharomyces cerevisiae FostersB]
Length = 284
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 82/159 (51%), Gaps = 10/159 (6%)
Query: 199 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 258
E GC+I+G + VN+V+G KS + + +H IN+ +FG+
Sbjct: 90 EFNGCHIFGSIPVNRVSGELQIT-AKSLXYVASRKAPL-----EELKFNHVINEFSFGDF 143
Query: 259 FPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 315
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202
Query: 316 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDXRLSFIQFLVRLVAI 241
>gi|167382848|ref|XP_001736294.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165901464|gb|EDR27547.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 315
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 64/200 (32%), Positives = 97/200 (48%), Gaps = 20/200 (10%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGK-SFHQSGV-------------HVHDILAFQRDSFNIS 247
GC ++G ++V++V+G FH A GK SF Q + H+H + SFN +
Sbjct: 116 GCRMHGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175
Query: 248 HKINKLAFGEHFPGVVN----PLDGVRWTQET-PSGMYQYFIKVVPTVYTDVSGHTIQSN 302
H IN L+F V+ PL+G +T + Y+I V+PT++ S +T+++
Sbjct: 176 HYINHLSFSNTLGSTVHSGETPLNGKEFTLNGFDNARKTYYINVIPTLFKYPS-YTLRTY 234
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
Q SV+E G PGVFF Y+LSP V SF H L +V AIVGGV +
Sbjct: 235 QLSVSERDIPVTYGASFAQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIVGGVLIII 294
Query: 363 GIIDAFIYHGQRAIKKKIEI 382
G + + + +E+
Sbjct: 295 GWLSKLFDSNRELVTSVVEM 314
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 55/113 (48%), Gaps = 4/113 (3%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M I ++ D + K+ E T + + +++S I++ LL FSE +LN + +
Sbjct: 1 MKKIQQVLKECDIFLKVPEKLKITTNTTKLFSVISYIIIGLLVFSETYNFLNPQWVSHVD 60
Query: 61 VDTSRGETL---RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI-FKKRL 109
VDT + L IN D+TFP + C +D +I+G L V I F RL
Sbjct: 61 VDTVKAGVLPNMYINIDITFPKMKCDDFGLDVTEITGSLQLGVTDGIKFDNRL 113
>gi|195639434|gb|ACG39185.1| PDIL5-4 - Zea mays protein disulfide isomerase [Zea mays]
Length = 485
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 87/310 (28%), Positives = 138/310 (44%), Gaps = 54/310 (17%)
Query: 104 IFKKRLDSQGNVIESRQDGI-GAPKIDKPLQRHGGRLEHNETYCG-SCYGAESSDEDCCN 161
I ++D V R++ I G P I + R G ++ N+ + Y E E
Sbjct: 199 ILLGKVDCTEEVELCRRNHIQGYPSIR--VFRKGSDIKENQGHHDHESYYGERDTESLVA 256
Query: 162 NCEEVREAYRKKGWALSNPD----LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGN 217
E K+ AL+ D +D KR + GC I GF+ V +V G+
Sbjct: 257 AMETYVANIPKEAHALALEDKSNKTVDPAKRPAPM-------ASGCRIEGFVRVKRVPGS 309
Query: 218 FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---------------HFPGV 262
+ +SG H +F N+SH + + +FG+ + G
Sbjct: 310 VVISA-----RSGSH-----SFDPSQINVSHYVTQFSFGKRLSPRMLHEFIRLTPYLRGY 359
Query: 263 VNPLDGVRWT----QETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR 317
+ L G +T + + +++++VV T + T S S + V E + +
Sbjct: 360 HDRLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRS-----SKELKVLEEYEYTAHSS 414
Query: 318 LQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 374
L +P V F ++ SP++V TE SF HF+TNVCAI+GGVFTV+GI+D+ I+H
Sbjct: 415 LVHSFYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVAGILDS-IFHNTL 473
Query: 375 AIKKKIEIGK 384
+ KKIE+GK
Sbjct: 474 RMVKKIELGK 483
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 65/106 (61%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIVDRSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LRI+F+++FPAL C SVD D+ G L++ + K +D
Sbjct: 65 DGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSID 110
>gi|207342541|gb|EDZ70277.1| YML067Cp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|323336174|gb|EGA77445.1| Erv41p [Saccharomyces cerevisiae Vin13]
gi|323347070|gb|EGA81345.1| Erv41p [Saccharomyces cerevisiae Lalvin QA23]
Length = 284
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)
Query: 199 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 258
E GC+I+G + VN+V+G KS G + FN H IN+ +FG+
Sbjct: 90 EFNGCHIFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 143
Query: 259 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 315
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202
Query: 316 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 241
>gi|407037175|gb|EKE38536.1| hypothetical protein ENU1_163530 [Entamoeba nuttalli P19]
Length = 315
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 96/200 (48%), Gaps = 20/200 (10%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGK-SFHQSGV-------------HVHDILAFQRDSFNIS 247
GC ++G ++V++V+G FH A GK SF Q + H+H + SFN +
Sbjct: 116 GCRMHGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175
Query: 248 HKINKLAFGEHFPGVVN----PLDGVRWTQET-PSGMYQYFIKVVPTVYTDVSGHTIQSN 302
H IN L+F V+ PL+G +T + Y+I V+PT++ S +T+++
Sbjct: 176 HYINHLSFSNILGSTVHSGETPLNGKEFTLNGFDNARKTYYINVIPTLFKYPS-YTLRTY 234
Query: 303 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 362
Q SV E G PGVFF Y+LSP V SF H L +V AI+GGV +
Sbjct: 235 QLSVNERDVPVTYGASFAQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIIGGVLIIM 294
Query: 363 GIIDAFIYHGQRAIKKKIEI 382
G++ + +E+
Sbjct: 295 GLLSRLFDSKHELVTSVVEM 314
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 4/113 (3%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M I ++ D + K+ E +T + + +++S I++ LL FSE + N + +
Sbjct: 1 MKKIQQFLKECDIFLKVPEKLKIKTNTTKLFSIISYIIIGLLIFSETYNFFNPQWVSHVD 60
Query: 61 VDTSRGETL---RINFDVTFPALPCSILSVDAMDISGEQHLDVKHDI-FKKRL 109
VDT + L IN D+TFP + C +D +I+G L V I F RL
Sbjct: 61 VDTVKAGVLPNMYINIDMTFPKMNCDDFGLDVTEITGSLQLGVTDGIKFDNRL 113
>gi|224030141|gb|ACN34146.1| unknown [Zea mays]
Length = 483
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 137/308 (44%), Gaps = 52/308 (16%)
Query: 104 IFKKRLDSQGNVIESRQDGI-GAPKIDKPLQRHGGRLEHNETYCG-SCYGAESSDEDCCN 161
I ++D V R++ I G P I + R G ++ N+ + Y E E
Sbjct: 199 ILLGKVDCTEEVELCRRNHIQGYPSIR--VFRKGSDIKENQGHHDHESYYGERDTESLVA 256
Query: 162 NCEEVREAYRKKGWALSNPD--LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFH 219
E K+ AL + +D KR + GC I GF+ V +V G+
Sbjct: 257 AMETYVANIPKEAHALEDKSNKTVDPAKRPAPM-------ASGCRIEGFVRVKRVPGSVV 309
Query: 220 FAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---------------HFPGVVN 264
+ +SG H +F N+SH + + +FG+ + G +
Sbjct: 310 ISA-----RSGSH-----SFDPSQINVSHYVTQFSFGKRLSPRMLHEFIRLTPYLRGYHD 359
Query: 265 PLDGVRWT----QETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 319
L G +T + + +++++VV T + T S S + V E + + L
Sbjct: 360 RLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRS-----SKELKVLEEYEYTAHSSLV 414
Query: 320 ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+P V F ++ SP++V TE SF HF+TNVCAI+GGVFTV+GI+D+ I+H +
Sbjct: 415 HSFYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVAGILDS-IFHNTLRM 473
Query: 377 KKKIEIGK 384
KKIE+GK
Sbjct: 474 VKKIELGK 481
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 65/106 (61%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEVSLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIVDRSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LRI+F+++FPAL C SVD D+ G L++ + K +D
Sbjct: 65 DGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSID 110
>gi|162462518|ref|NP_001105762.1| protein disulfide isomerase12 [Zea mays]
gi|59861281|gb|AAX09970.1| protein disulfide isomerase [Zea mays]
gi|414590455|tpg|DAA41026.1| TPA: putative thioredoxin superfamily protein [Zea mays]
Length = 483
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 137/308 (44%), Gaps = 52/308 (16%)
Query: 104 IFKKRLDSQGNVIESRQDGI-GAPKIDKPLQRHGGRLEHNETYCG-SCYGAESSDEDCCN 161
I ++D V R++ I G P I + R G ++ N+ + Y E E
Sbjct: 199 ILLGKVDCTEEVELCRRNHIQGYPSIR--VFRKGSDIKENQGHHDHESYYGERDTESLVA 256
Query: 162 NCEEVREAYRKKGWALSNPD--LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFH 219
E K+ AL + +D KR + GC I GF+ V +V G+
Sbjct: 257 AMETYVANIPKEAHALEDKSNKTVDPAKRPAPM-------ASGCRIEGFVRVKRVPGSVV 309
Query: 220 FAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---------------HFPGVVN 264
+ +SG H +F N+SH + + +FG+ + G +
Sbjct: 310 ISA-----RSGSH-----SFDPSQINVSHYVTQFSFGKRLSPRMLHEFIRLTPYLRGYHD 359
Query: 265 PLDGVRWT----QETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 319
L G +T + + +++++VV T + T S S + V E + + L
Sbjct: 360 RLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRS-----SKELKVLEEYEYTAHSSLV 414
Query: 320 ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 376
+P V F ++ SP++V TE SF HF+TNVCAI+GGVFTV+GI+D+ I+H +
Sbjct: 415 HSFYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVAGILDS-IFHNTLRM 473
Query: 377 KKKIEIGK 384
KKIE+GK
Sbjct: 474 VKKIELGK 481
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 65/106 (61%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIVDRSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LRI+F+++FPAL C SVD D+ G L++ + K +D
Sbjct: 65 DGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSID 110
>gi|323307814|gb|EGA61076.1| Erv41p [Saccharomyces cerevisiae FostersO]
Length = 284
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 82/159 (51%), Gaps = 10/159 (6%)
Query: 199 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 258
E GC+I+G + VN+V+G KS + + +H IN+ +FG+
Sbjct: 90 EFNGCHIFGSIPVNRVSGELQIT-AKSLXYVASRKAPL-----EELKFNHVINEFSFGDF 143
Query: 259 FPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 315
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202
Query: 316 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDIRLSFIQFLVRLVAI 241
>gi|323332255|gb|EGA73665.1| Erv41p [Saccharomyces cerevisiae AWRI796]
gi|323352959|gb|EGA85259.1| Erv41p [Saccharomyces cerevisiae VL3]
gi|365763687|gb|EHN05213.1| Erv41p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 250
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 84/165 (50%), Gaps = 10/165 (6%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
R E GC+I+G + VN+V+G KS G + FN H IN+
Sbjct: 50 NRAHLPEFNGCHIFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINE 103
Query: 253 LAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH-- 309
+FG+ +P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 104 FSFGDFYPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRY 162
Query: 310 FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 163 LYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 207
>gi|558407|emb|CAA86253.1| unnamed protein product [Saccharomyces cerevisiae]
Length = 284
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)
Query: 199 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 258
E GC+++G + VN+V+G KS G + FN H IN+ +FG+
Sbjct: 90 EFNGCHVFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 143
Query: 259 FPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 315
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202
Query: 316 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 241
>gi|171693749|ref|XP_001911799.1| hypothetical protein [Podospora anserina S mat+]
gi|170946823|emb|CAP73627.1| unnamed protein product [Podospora anserina S mat+]
Length = 180
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 69/127 (54%), Gaps = 8/127 (6%)
Query: 243 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYT-----DVS 295
SFN SH IN+L+FG + P ++NPLD + S +QYF+ +VPTVY+ S
Sbjct: 15 SFNFSHIINELSFGPYLPSLINPLDQTVNSAPEHSHFHRFQYFLSIVPTVYSLGHPDSYS 74
Query: 296 GHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 354
+I +NQ++VTE E +Q +PG+F YD+ PI + E+ SF FL V I
Sbjct: 75 SRSIFTNQYAVTEQSAPIPENMEMQMIPGIFVKYDIEPILLNIVEDRDSFFVFLIKVVNI 134
Query: 355 VGGVFTV 361
+ G
Sbjct: 135 LSGAMVA 141
>gi|299116076|emb|CBN74492.1| DEAD box helicase [Ectocarpus siliculosus]
Length = 865
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 67/112 (59%), Gaps = 1/112 (0%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
A M R LD YPKI D T GG + ++ ++MLLLF EL +++A E++++VD
Sbjct: 403 ATMGAWRLLDLYPKIPTDLSQSTAVGGWFSTLTGVIMLLLFQVELFSFMSAPIESQVVVD 462
Query: 63 TSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVK-HDIFKKRLDSQG 113
L+INF+++F LPC LSVDA+D+ G +++ ++ K LD QG
Sbjct: 463 NVLETKLQINFNMSFLDLPCEYLSVDALDVLGSNRVNITGKEVQKWHLDPQG 514
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 95/214 (44%), Gaps = 39/214 (18%)
Query: 188 REGFLQR-IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 246
R+GF + + +++ GC + G + VN+V GNFH H F + N+
Sbjct: 669 RKGFPEVGLHDDKWPGCMVTGHIMVNRVPGNFHIEAASKSH----------TFHGATTNL 718
Query: 247 SHKINKLAFGEHFPGVVN--------------PLDGVRWTQETPSGMYQYFIKVVPTVY- 291
SH ++ ++FG P PLDG + ++++VV ++Y
Sbjct: 719 SHIVHHMSFGNDPPRRTQTKINRLTEDLRQNAPLDGNVYVANAYHQAPHHYLRVVGSMYH 778
Query: 292 -----TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 346
T G+ I +N ++ E+ +P F Y++SP+ V E +
Sbjct: 779 LSPMKTPWHGYQIVAN----SQMMLYDEE----EVPEARFSYNISPMSVLVRSEKRPWYD 830
Query: 347 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 380
F+T V AIVGG F++ G++DA ++ R +++
Sbjct: 831 FVTKVLAIVGGTFSMVGLVDAAVFRASRKAGRQL 864
>gi|302841900|ref|XP_002952494.1| hypothetical protein VOLCADRAFT_75374 [Volvox carteri f.
nagariensis]
gi|300262133|gb|EFJ46341.1| hypothetical protein VOLCADRAFT_75374 [Volvox carteri f.
nagariensis]
Length = 478
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 81/138 (58%), Gaps = 9/138 (6%)
Query: 1 MDAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLL 60
M + +K++++D + KI D T +G I+++++++M+ LF +E+ +L+ T T+L+
Sbjct: 1 MARLFSKLKAIDFFKKIPSDLTEATLTGAWISILAAVIMVFLFTAEMMSFLSTTTTTQLI 60
Query: 61 VDTS-RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFK-------KRLDSQ 112
VD S + E L++NF+++FPAL C +VD D G + +++ + K +R+ +
Sbjct: 61 VDRSPQNELLKLNFNISFPALSCEFATVDVSDTLGTKRMNLTKTVRKMPITTELERMSEK 120
Query: 113 GNVIESRQDGIGAPKIDK 130
G+ +E G PK D+
Sbjct: 121 GSAVEDSSHKPG-PKYDE 137
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 93/197 (47%), Gaps = 23/197 (11%)
Query: 202 GCNIYGFLEVNKVAGNFHF---APGKSFHQSGVH----VHDILAFQRDSFNISHKINKLA 254
GCN+ GF+ V KV G + G SF + ++ VH R S ++ +L
Sbjct: 289 GCNLAGFVMVKKVPGTLTVVARSEGHSFDHTWMNMTHLVHTFHVGTRPSPRKYQQLKRL- 347
Query: 255 FGEHFPGVVNPLDGVRWTQ------ETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVT 307
P D W + E P +++++++V T + S H+ + + T
Sbjct: 348 ----HPAGEGEGDLFWWREKREKRGEHPQSTHEHYLQIVLTSIEPRRSRHSGNYDAYEYT 403
Query: 308 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
H S + +P F YDLSPI++ E + FLT CAI+GGVFTV+GI+DA
Sbjct: 404 AH---SHTYQSDAIPSARFTYDLSPIQILVQETARPWYQFLTTSCAIIGGVFTVAGILDA 460
Query: 368 FIYHGQRAIKKKIEIGK 384
+Y + + KK+ +GK
Sbjct: 461 LLYQSFKVV-KKLNLGK 476
>gi|412994089|emb|CCO14600.1| predicted protein [Bathycoccus prasinos]
Length = 528
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 65/109 (59%), Gaps = 1/109 (0%)
Query: 3 AIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD 62
I+ K + +D Y KI D T+ G +++++++ +++ L +E R YL ETK++VD
Sbjct: 2 GILTKAKGMDFYRKIPRDMTQGTYLGTILSILATSLIVFLLIAETRAYLKTTFETKVVVD 61
Query: 63 TS-RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
S GE LRINF+V+FPAL C SVD D G ++ +FK+ +D
Sbjct: 62 RSVDGELLRINFNVSFPALSCEFASVDVGDALGLTRYNLTKTVFKRPID 110
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 93/208 (44%), Gaps = 29/208 (13%)
Query: 195 IKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
++ GC+I GF+ V KV G+ F A K+ H +F D N++H+++
Sbjct: 330 VQTRASTGCSITGFVLVKKVPGHVFFTADAKNGH----------SFDVDKLNVTHQVHHF 379
Query: 254 AFGEHFPGVV-----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
FG+ + L + P ++++++ V T +
Sbjct: 380 YFGQQLSASRQKYMARFHRGEKEGDWHDKLANDFVVSKNPRTSHEHYLQTVLTTMQPLGP 439
Query: 297 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
N + T+H S + +T P F + SP+++ E+ F F+T + AIVG
Sbjct: 440 FAQPFNVYEYTQHTHSVKTPDGET-PRAKFHFTPSPVQILGVEKRREFYQFITTLMAIVG 498
Query: 357 GVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
GV++V GIID +++ K+K+++GK
Sbjct: 499 GVYSVVGIIDGLMHNTSLMFKRKMQLGK 526
>gi|402595088|gb|EJW89014.1| hypothetical protein WUBG_00081 [Wuchereria bancrofti]
Length = 578
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 87/175 (49%), Gaps = 6/175 (3%)
Query: 196 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 254
++ EG C I+G + VNKV G+ F + GK G+ H + N+SH+I +
Sbjct: 372 EKNEGTACRIHGRMRVNKVKGDSFVVSTGKGLGVDGIFAH--FGGLSNPGNVSHRIERFN 429
Query: 255 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS 312
FG G+V PL G+ ET ++YF+KVVPT ++ + G + + Q+SVT +
Sbjct: 430 FGPTIYGLVTPLAGIEQISETGMDEFRYFLKVVPTRIYHSGLFGGSTLTYQYSVT-FMKK 488
Query: 313 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
+ + + + Y+ + + S L L +C+ VGGVF S ++++
Sbjct: 489 TPKKDVHKHAAIIIHYEFAATVIEVRRIQSSLLQMLIRLCSAVGGVFATSVLLNS 543
>gi|393908149|gb|EJD74928.1| hypothetical protein LOAG_17836 [Loa loa]
Length = 430
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 86/175 (49%), Gaps = 6/175 (3%)
Query: 196 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 254
++ EG C I+G + VNKV G+ F + GK G+ H NISH+I +
Sbjct: 222 EKNEGTACRIHGRMRVNKVKGDSFIISTGKGLDVDGIFAH--FGGVSSPSNISHRIERFN 279
Query: 255 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS 312
FG G+V PL G+ ET ++YF+K+VPT ++ + G + + Q+SVT +
Sbjct: 280 FGPRIYGLVTPLAGIEQISETGVDEFRYFLKIVPTRIYHSGLFGGSTLTYQYSVT-FMKK 338
Query: 313 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
+ + + + Y+ + + S L L +C+ VGGVF S ++++
Sbjct: 339 TPKKDVHKHTAIIIHYEFAATVIEVRHVQSSLLQMLVRLCSAVGGVFATSILLNS 393
>gi|115472445|ref|NP_001059821.1| Os07g0524100 [Oryza sativa Japonica Group]
gi|75118816|sp|Q69SA9.1|PDI54_ORYSJ RecName: Full=Protein disulfide isomerase-like 5-4;
Short=OsPDIL5-4; AltName: Full=Protein disulfide
isomerase-like 8-1; Short=OsPDIL8-1; Flags: Precursor
gi|50508559|dbj|BAD30858.1| thioredoxin family-like protein [Oryza sativa Japonica Group]
gi|113611357|dbj|BAF21735.1| Os07g0524100 [Oryza sativa Japonica Group]
gi|215704615|dbj|BAG94243.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218199742|gb|EEC82169.1| hypothetical protein OsI_26259 [Oryza sativa Indica Group]
gi|222637167|gb|EEE67299.1| hypothetical protein OsJ_24505 [Oryza sativa Japonica Group]
Length = 485
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 64/224 (28%), Positives = 103/224 (45%), Gaps = 44/224 (19%)
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+D KR L GC I GF+ V KV G+ + +SG H +F
Sbjct: 282 VDPAKRPAPLT-------SGCRIEGFVRVKKVPGSVVISA-----RSGSH-----SFDPS 324
Query: 243 SFNISHKINKLAFGEHFPGV-------VNPLDG------------VRWTQETPSGMYQYF 283
N+SH + + +FG+ + P G V+ + +++
Sbjct: 325 QINVSHYVTQFSFGKRLSAKMFNELKRLTPYVGGHHDRLAGQSYIVKHGDVNANVTIEHY 384
Query: 284 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEE 340
+++V T + S + + E + + L +P V F ++ SP++V TE
Sbjct: 385 LQIVKTELVTLRS----SKELKLVEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTEL 440
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF HF+TNVCAI+GGVFTV+GI+D+ I+H + KK+E+GK
Sbjct: 441 PKSFSHFITNVCAIIGGVFTVAGILDS-IFHNTLRLVKKVELGK 483
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 78/138 (56%), Gaps = 5/138 (3%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSNYLAVNTSTSVIVDRSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG 124
GE LRI+F+++FPAL C SVD D+ G L++ + K +D N++ + +
Sbjct: 65 DGEFLRIDFNLSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDR--NLVPTGSEFHP 122
Query: 125 APKIDKPLQRHGGRLEHN 142
P + +HG +E N
Sbjct: 123 GPI--PTVSKHGDDVEEN 138
>gi|303275141|ref|XP_003056869.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461221|gb|EEH58514.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 604
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 99/209 (47%), Gaps = 40/209 (19%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
GC I G + VN+V G F+ + H G H+I D N++H + L+FG+ PG
Sbjct: 402 GCIIEGSVRVNRVPGAFYV----TAHSKG---HNI---NVDVVNMTHVLRHLSFGKTVPG 451
Query: 262 VVN-------------PLD-----GVRWTQET-----PSGMYQYFIKVVPTVYTDVSGHT 298
+ P D V +ET P ++++++KVV + + G
Sbjct: 452 RPSYVPRHMRRVWSKIPKDMGGRFAVAGAEETFASAEPYTVHEHYLKVVSHAFEPIDGDA 511
Query: 299 IQ-------SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
+Q SN+F + E P + F YD+SP++V EE L + +
Sbjct: 512 VQLYEYTFNSNRFKLAPAAYGDEDDAHVDGPMIKFSYDVSPMRVVLREETKPVLDWTLGM 571
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKI 380
CA++GGV+T SG+++AFI +G +K+++
Sbjct: 572 CALMGGVYTCSGLLEAFISNGVSVVKRRV 600
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 51/99 (51%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D + + D Y K+ + + GGV+++V V +LLF ++LR T T +LV
Sbjct: 19 DGVGGVFKRADMYAKLPRELAEGSVLGGVLSVVFLCVFVLLFAAQLRELWGVTTVTDVLV 78
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDV 100
D S +T ++N + PAL C ++ +D G +H ++
Sbjct: 79 DHSDDDTFQVNLKLELPALSCEWATIHVIDALGTRHFNI 117
>gi|301089326|ref|XP_002894975.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262104295|gb|EEY62347.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 102
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 54/86 (62%)
Query: 285 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 344
+VVPT YT +S I +NQFS TEHFR + LP V F Y SPI + V F
Sbjct: 5 QVVPTEYTFLSASRIITNQFSATEHFRQLTPVSDKGLPMVSFSYTFSPIMFRIEQYRVGF 64
Query: 345 LHFLTNVCAIVGGVFTVSGIIDAFIY 370
L FLT+VCAIVGGVFT+ GI+D+ +
Sbjct: 65 LQFLTSVCAIVGGVFTILGIMDSLAF 90
>gi|323448816|gb|EGB04710.1| hypothetical protein AURANDRAFT_55105 [Aureococcus anophagefferens]
Length = 324
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 97/194 (50%), Gaps = 27/194 (13%)
Query: 193 QRIKE-EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKIN 251
QR+ E +E GC + G + VN+V GNFH +S H H++ A N+SH +N
Sbjct: 133 QRMLEIKEHPGCMVSGHVLVNRVPGNFHIE-ARSIH------HNLNAAMT---NLSHVVN 182
Query: 252 KLAFG-----------EHFPGV--VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 298
L+FG +P V+PLDG + ++ ++ KVV T + +V G
Sbjct: 183 HLSFGTPLAKDMQRKVSKYPQFQSVHPLDGGIFVSRDYHQVHHHYSKVVSTHF-EVGGMM 241
Query: 299 IQSNQFSVTEHFRSSEQGRLQTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+S + + S+ + P F YDLSP+ V + + + F+T+VCAI+G
Sbjct: 242 TKSREIVGYQMLAQSQIMHYNEMDVPEAKFSYDLSPMAVLVSSKGRRWYDFVTSVCAIIG 301
Query: 357 GVFTVSGIIDAFIY 370
G FTV GI+DA +Y
Sbjct: 302 GTFTVVGIVDAVLY 315
>gi|366997520|ref|XP_003678522.1| hypothetical protein NCAS_0J02050 [Naumovozyma castellii CBS 4309]
gi|342304394|emb|CCC72184.1| hypothetical protein NCAS_0J02050 [Naumovozyma castellii CBS 4309]
Length = 347
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 81/159 (50%), Gaps = 14/159 (8%)
Query: 199 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 258
E C+I+G + VN+VAG F HQ +V D +H IN+ +FG+
Sbjct: 158 EYSACHIFGSIPVNRVAGEFQITTIDR-HQPIENVVDF----------THVINEFSFGDF 206
Query: 259 FPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE-HFRSSEQG 316
FP V NPLD ++ + YQY + VVPT+Y + G I +NQ+S++E H+++
Sbjct: 207 FPYVDNPLDSTAKYVPDEKLTSYQYHLSVVPTIYNKM-GVLINTNQYSLSEYHYKNITNA 265
Query: 317 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 355
+ PG+F Y+ + + + + F FL + AI+
Sbjct: 266 NDKNSPGIFIKYNFESLTIIVNDRRLGFTQFLIRLIAIL 304
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 49/94 (52%), Gaps = 1/94 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M+ ++S DA+PK +E++ ++ GG+ T+ + + +L + +SE Y E K +VD
Sbjct: 1 MSALKSFDAFPKTDEEYTKKSTKGGLSTIATYLFLLFIAWSEFGSYFGGFVEQKYVVDNQ 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHL 98
E IN D+ + C +L V D + + +
Sbjct: 61 VREVTEINLDI-YVNTTCRLLDVRVFDETKDMRM 93
>gi|170588701|ref|XP_001899112.1| hypothetical protein [Brugia malayi]
gi|158593325|gb|EDP31920.1| conserved hypothetical protein [Brugia malayi]
Length = 430
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 87/175 (49%), Gaps = 6/175 (3%)
Query: 196 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 254
++ EG C I+G + VNKV G+ F + GK G+ H + N+SH+I +
Sbjct: 223 EKNEGTACRIHGRMRVNKVKGDSFVVSTGKGLGVDGIFAH--FGGVSNPGNLSHRIERFN 280
Query: 255 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS 312
FG G+V PL G+ ET ++YF+KVVPT ++ + G + + Q+SVT +
Sbjct: 281 FGPTIYGLVTPLAGIEQISETGIDEFRYFLKVVPTRIYHSGLFGGSTLTYQYSVT-FMKK 339
Query: 313 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 367
+ + + + Y+ + + S L L +C+ VGGVF S ++++
Sbjct: 340 TPKKDVHKHAAIVIHYEFAATVIEVRRIQSSLLQMLIRLCSAVGGVFATSVLLNS 394
>gi|32566449|ref|NP_510494.2| Protein C18B12.6 [Caenorhabditis elegans]
gi|25809204|emb|CAA20929.2| Protein C18B12.6 [Caenorhabditis elegans]
Length = 428
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 94/401 (23%), Positives = 168/401 (41%), Gaps = 49/401 (12%)
Query: 1 MDAIMNKIRSLDAYPKINEDF-----------YSRTFSGGVITLVSSIVMLLLFFSELRL 49
M+ ++IR KI EDF + S G I+ + ++ LF +E
Sbjct: 1 MELGSSEIRQRKGISKIVEDFDIFEKVVENVKEEKKVSAGAISFICFTIIFCLFCTETYT 60
Query: 50 YL-NAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKR 108
+L + + + +DT E ++ D+ PCSIL V +S + ++
Sbjct: 61 FLFHKKYDYRFALDTEMDEMPLLDLDMVINT-PCSILQV----VSSSDEYSGGDGLLRQT 115
Query: 109 LDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEV-- 166
+ Q N +R D ++ + RH ++N+ + E D+D N E +
Sbjct: 116 I--QKN--PTRFDFTDEEQMYWTILRHAHD-QYNKKGLRALEELEYVDDDIETNLEHLAN 170
Query: 167 ----REAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVN---------- 212
EA K L N + + G Q I G ++ + N
Sbjct: 171 EKQDEEAAHIKELRLKNK----KTQHRGTGQ-IMFLVSNGMGMFQLVADNGGGDGDDGKA 225
Query: 213 -KVAGNFHFAPGKSFHQSGVHVHDILAF---QRDSFNISHKINKLAFGEHFPGVVNPLDG 268
++ G F GK + ++ F ++ S NISH+I K FG PG+V PL G
Sbjct: 226 CRLHGKFKVRKGKEEKIVMSISNPMMMFDHQEKQSGNISHRIEKFNFGPRIPGLVTPLAG 285
Query: 269 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 328
E+ +Y+YFIK+VPT +T+ + Q+SVT + ++G + G+ F Y
Sbjct: 286 AEHISESGQDIYRYFIKIVPTKIYGYFSYTM-AYQYSVTFLKKQLKEGE-HSHGGILFEY 343
Query: 329 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
+ + + + ++ + +L +C+I+GGV+ S I++ +
Sbjct: 344 EFTANVIEVHKTSITLISYLIRICSILGGVYATSTIVNNIL 384
>gi|357122608|ref|XP_003563007.1| PREDICTED: protein disulfide isomerase-like 5-4-like [Brachypodium
distachyon]
Length = 485
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 104/224 (46%), Gaps = 44/224 (19%)
Query: 183 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 242
+D KR + GC + GF+ V KV G+ + +SG H +F
Sbjct: 282 VDPAKRPAPMT-------SGCRVEGFVRVKKVPGSVIISA-----RSGSH-----SFDPS 324
Query: 243 SFNISHKINKLAFGEHF-PGVVNPLDG------------------VRWTQETPSGMYQYF 283
N+SH + + +FG P + + L V+ + +++
Sbjct: 325 QINVSHYVTQFSFGNRLSPNMFSELKRLIPYVGGHHDRLAGQSYIVKHGDNNANVTIEHY 384
Query: 284 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEE 340
+++V T + S + V E + + L +P V F ++ SP++V TE
Sbjct: 385 LQIVKTELVTLR----SSKELKVFEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTEL 440
Query: 341 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF HF+TNVCAI+GGVFTV+GI+D+ +++ R + KK+E+GK
Sbjct: 441 PKSFSHFITNVCAIIGGVFTVAGILDSILHNTLRLV-KKVELGK 483
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 65/106 (61%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIVDRSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LRI+F+++FPAL C SVD D+ G L++ + K +D
Sbjct: 65 DGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSID 110
>gi|341884627|gb|EGT40562.1| hypothetical protein CAEBREN_07459 [Caenorhabditis brenneri]
Length = 428
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/396 (23%), Positives = 165/396 (41%), Gaps = 52/396 (13%)
Query: 3 AIMNKIRSLDAYPKINEDFYS-RTFSGGVITLVSSIVMLLLFFSELRLYL-NAVTETKLL 60
I + LD + K+ E+ + S G I+ + V+ LF +E +L + + +
Sbjct: 13 GITKIVEDLDIFEKVVENVKEEKKASSGAISFICFTVIFCLFCTETYTFLFHKKYDYRFA 72
Query: 61 VDTSRGETLRINFDVTFPALPCSILSVDAM--DISGEQHLDVKHDIFKKRLDSQGNVIES 118
VDT E + D+ PCS++ V + + SG L R Q N +
Sbjct: 73 VDTEMDEMPLFDLDMVINT-PCSLMQVASSSDEYSGGDGL--------LRQTIQKN--PT 121
Query: 119 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 178
R + ++ + RH + N + E D+D N E + + +++ A
Sbjct: 122 RFEFTDEEQMYWTILRHAHD-QFNRKGLRALEELEYVDDDIETNLEHLADEKQQEEAAHL 180
Query: 179 NPDLIDQCKRE---------------GFLQRIKE------EEGEGCNIYGFLEVNKVAGN 217
+ K++ G Q + + E+G+ C ++G +V K
Sbjct: 181 KEQRMKNKKQQHKGTGQIMFLVSNGMGMFQLVADNGGADREDGKACRLHGKFKVRK---- 236
Query: 218 FHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGEHFPGVVNPLDGVRWTQ 273
GK + +L F + N ISH+I K FG PG+V PL G
Sbjct: 237 -----GKEEKIVMSISNPLLMFDHQAENQPGNISHRIEKFNFGPRIPGLVTPLAGAEHIS 291
Query: 274 ETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPI 333
E+ +Y+YFIK+VPT +T+ + Q+SVT + ++G + G+ F Y+ +
Sbjct: 292 ESGQDIYRYFIKIVPTKIYGYFTYTM-AYQYSVTFLKKQLKEGE-HSHGGILFEYEFNAN 349
Query: 334 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
+ + V+ +L +C+I+GGV+ S I++ +
Sbjct: 350 VIEVHKTSVTLFSYLIRICSILGGVYATSTIVNNIV 385
>gi|326503558|dbj|BAJ86285.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 101/205 (49%), Gaps = 37/205 (18%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 258
GC I GF+ V KV G+ + +SG H +F N+SH + +FG+
Sbjct: 294 GCRIEGFVRVKKVPGSVVISA-----RSGSH-----SFDPSQINVSHYVTTFSFGKRLSS 343
Query: 259 ---------FP---GVVNPLDG----VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
FP G + L G V+ + ++++++V T + S
Sbjct: 344 KMFNELKRLFPYVGGHHDRLAGQSYVVKHGDVNANVTIEHYLQIVKTELVTLR----YSK 399
Query: 303 QFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+ V E + + L +P V F ++ SP++V TE SF HF+TNVCAI+GGVF
Sbjct: 400 ELKVLEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVF 459
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGK 384
TV+GI+D+ +++ R + KK+E+GK
Sbjct: 460 TVAGILDSILHNTLRLV-KKVELGK 483
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 64/106 (60%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++ +++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSIFAALAMVFLFGMELSSYLAVNTTTSVIVDRSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LR++F+++FPAL C SVD D+ G L++ + K +D
Sbjct: 65 DGEFLRMDFNLSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSID 110
>gi|12321801|gb|AAG50943.1|AC079284_18 hypothetical protein [Arabidopsis thaliana]
Length = 451
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 123/282 (43%), Gaps = 41/282 (14%)
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G P I + G R +H S YG +D EE+ + +K+ L+ +
Sbjct: 188 GYPSIRIFRRGSGLREDHGNHEHESYYGDRDTDS-LVKMVEELLKPIKKEDHKLA----L 242
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
D K GC I G++ KV G + SG H +F
Sbjct: 243 DGKSDNAASTFKKAPVSGGCRIEGYVRAKKVPGELVISA-----HSGAH-----SFDASQ 292
Query: 244 FNISHKINKLAFGE---------------HFPGVVNPLDGVRWTQET---PSGMYQYFIK 285
N+SH + L FG + + L+G + E + +++++
Sbjct: 293 MNMSHIVTHLTFGTMVSERLWTDMKRLLPYLGQSYDRLNGKSFINERQLDANVTIEHYLQ 352
Query: 286 VVPT-VYTDVSG--HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 342
++ T V + SG H++ ++ T H S R P F ++LSP++V +E
Sbjct: 353 IIKTEVISRRSGQEHSLI-EEYEYTAH---SSVARSYHYPEAKFHFELSPMQVLISENPK 408
Query: 343 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF HF+TNVCAI+GGVFTV+GI+D+ + R + KKIE+GK
Sbjct: 409 SFSHFITNVCAIIGGVFTVAGILDSIFQNTVRMV-KKIELGK 449
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 64/106 (60%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+KI+S+D Y KI D + SG +++V+++ ML LF EL YL T T ++VD +S
Sbjct: 5 SKIKSVDFYRKIPRDLTEASLSGAGLSIVAALAMLFLFGMELSSYLAINTSTSVIVDKSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
G+ L I+F+++FPAL C SVD D+ G L++ I K +D
Sbjct: 65 DGDFLNIDFNISFPALSCEFASVDVSDVFGTHRLNISKTIRKVPID 110
>gi|413953324|gb|AFW85973.1| putative DUF1692 domain containing protein [Zea mays]
Length = 1070
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 54/83 (65%), Gaps = 7/83 (8%)
Query: 284 IKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQGRLQTLPGVFFFYDLSPIKVTFTEE 340
+KVVPT Y +S + +NQ SVTE+F R +E+ P V+F YDLSPI T EE
Sbjct: 515 LKVVPTEYKYLSKKILPTNQGSVTEYFLSIRPTERA----WPAVYFLYDLSPITFTIKEE 570
Query: 341 HVSFLHFLTNVCAIVGGVFTVSG 363
+FLHF+T +CA++GG F ++G
Sbjct: 571 RRNFLHFITRLCAVLGGTFAMTG 593
>gi|268581819|ref|XP_002645893.1| Hypothetical protein CBG07646 [Caenorhabditis briggsae]
Length = 426
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 93/398 (23%), Positives = 165/398 (41%), Gaps = 43/398 (10%)
Query: 1 MDAIMNKIRSLDAYPKINEDF-----------YSRTFSGGVITLVSSIVMLLLFFSELRL 49
MD ++IR KI EDF + S G I+ V ++ LF +E
Sbjct: 1 MDLGTSEIRQRKGITKIVEDFDIFEKVVENVKEEKKASSGAISFVCFTIIFCLFCTETYT 60
Query: 50 YL-NAVTETKLLVDTSRGETLRINFDVTFPALPCSILSVDAM--DISGEQHLDVKHDIFK 106
+L + + + VDT E ++ D+ PC+I+ V + + SGE L
Sbjct: 61 FLFHKKYDYRFAVDTEMDEMPLLDLDMVINT-PCNIMQVASSSDEYSGENGL-------- 111
Query: 107 KRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEV 166
R Q N +R D ++ + RH ++N+ + E D+D N E +
Sbjct: 112 LRQTIQKN--PTRFDFTDEEQMYWTILRHAHD-QYNKRGLRALEELEYVDDDIETNLEHL 168
Query: 167 -REAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVN-----------KV 214
E ++ + + ++ ++ +I G ++ + N ++
Sbjct: 169 ANEKQEEEAAHIKEQRMKNKKQQHRGNGQIMFLVSNGMGMFQLVADNGGGDGDDGKACRL 228
Query: 215 AGNFHFAPGKSFHQSGVHVHDILAFQR---DSFNISHKINKLAFGEHFPGVVNPLDGVRW 271
G F GK + ++ F NISH+I K FG PG+V PL G
Sbjct: 229 HGKFRVRKGKEEKIIMSISNPLIMFDHGGPQQGNISHRIEKFNFGPRIPGLVTPLAGAEH 288
Query: 272 TQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLS 331
E+ +Y+YFIK+VPT +T+ + Q+SVT + ++G + G+ F Y+ +
Sbjct: 289 ISESGQDIYRYFIKIVPTKIYGYFTYTL-AYQYSVTFLKKQLKEGE-HSHGGILFEYEFT 346
Query: 332 PIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 369
+ + + +L +C+I+GGV+ S II+ +
Sbjct: 347 ANVIEVHKTSTTLFSYLIRICSILGGVYATSTIINNIV 384
>gi|413951106|gb|AFW83755.1| hypothetical protein ZEAMMB73_317062 [Zea mays]
Length = 1594
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%)
Query: 284 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 343
+KVVPT Y +S + +NQ SVTE+F S + P V+F YDLSPI T EE +
Sbjct: 515 LKVVPTEYKYLSKKILPTNQGSVTEYFLSIRPTE-RAWPAVYFLYDLSPITFTIKEERRN 573
Query: 344 FLHFLTNVCAIVGGVFTVSG 363
FLHF+T +CA++GG F ++G
Sbjct: 574 FLHFITRLCAVLGGTFAMTG 593
>gi|413949740|gb|AFW82389.1| putative DUF1692 domain containing protein [Zea mays]
Length = 1061
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%)
Query: 284 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 343
+KVVPT Y +S + +NQ SVTE+F S + P V+F YDLSPI T EE +
Sbjct: 501 LKVVPTEYKYLSKKILPTNQGSVTEYFLSIRPTE-RAWPAVYFLYDLSPITFTIKEERRN 559
Query: 344 FLHFLTNVCAIVGGVFTVSG 363
FLHF+T +CA++GG F ++G
Sbjct: 560 FLHFITRLCAVLGGTFAMTG 579
>gi|223995687|ref|XP_002287517.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976633|gb|EED94960.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 457
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 92/193 (47%), Gaps = 36/193 (18%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-- 259
GC I GFL V++ GNFH H H+ N+SH IN L+FG+ F
Sbjct: 277 GCQISGFLLVDRAPGNFHIQAQSKGHDLAAHM----------TNVSHIINHLSFGKPFSK 326
Query: 260 -----------PG---VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 305
PG P DG + + + +++KV+ T + G Q+++++
Sbjct: 327 YFLKDGLKNTPPGFLETTKPFDGNVYITQNEHEAHHHYLKVITTEFEPEKG--AQNSKYN 384
Query: 306 VTEHFR------SSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 357
E R SS+ R +P F YDLSPI V++ +++ + + T++ AI+GG
Sbjct: 385 KKEPSRAYQILQSSQLSLYRSDIVPEAKFTYDLSPIAVSYNKKYRHWYDYFTSLMAIIGG 444
Query: 358 VFTVSGIIDAFIY 370
FTV G++++ I+
Sbjct: 445 TFTVVGMLESGIH 457
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 49/95 (51%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
I +LD Y K+ D T G +++ ++ M LFF E + Y ++ T L +D++
Sbjct: 1 IANLDMYRKVPVDLLEGTRRGSILSTIAIFTMTTLFFLETKAYFSSTLATSLALDSNSDP 60
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKH 102
+R+NF++T L C ++D + + G Q +H
Sbjct: 61 NIRVNFNITMMDLKCDYATIDVVSVLGTQQNVTQH 95
>gi|42562656|ref|NP_175508.2| protein Disulfide Isomerase (PDIa) family, redox active TRX
domain-containing protein [Arabidopsis thaliana]
gi|332194483|gb|AEE32604.1| protein Disulfide Isomerase (PDIa) family, redox active TRX
domain-containing protein [Arabidopsis thaliana]
Length = 484
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 123/282 (43%), Gaps = 41/282 (14%)
Query: 124 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 183
G P I + G R +H S YG +D EE+ + +K+ L+ +
Sbjct: 221 GYPSIRIFRRGSGLREDHGNHEHESYYGDRDTDS-LVKMVEELLKPIKKEDHKLA----L 275
Query: 184 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 243
D K GC I G++ KV G + SG H +F
Sbjct: 276 DGKSDNAASTFKKAPVSGGCRIEGYVRAKKVPGELVISA-----HSGAH-----SFDASQ 325
Query: 244 FNISHKINKLAFGE---------------HFPGVVNPLDGVRWTQET---PSGMYQYFIK 285
N+SH + L FG + + L+G + E + +++++
Sbjct: 326 MNMSHIVTHLTFGTMVSERLWTDMKRLLPYLGQSYDRLNGKSFINERQLDANVTIEHYLQ 385
Query: 286 VVPT-VYTDVSG--HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 342
++ T V + SG H++ ++ T H S R P F ++LSP++V +E
Sbjct: 386 IIKTEVISRRSGQEHSLI-EEYEYTAH---SSVARSYHYPEAKFHFELSPMQVLISENPK 441
Query: 343 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF HF+TNVCAI+GGVFTV+GI+D+ + R + KKIE+GK
Sbjct: 442 SFSHFITNVCAIIGGVFTVAGILDSIFQNTVRMV-KKIELGK 482
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 64/106 (60%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+KI+S+D Y KI D + SG +++V+++ ML LF EL YL T T ++VD +S
Sbjct: 5 SKIKSVDFYRKIPRDLTEASLSGAGLSIVAALAMLFLFGMELSSYLAINTSTSVIVDKSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
G+ L I+F+++FPAL C SVD D+ G L++ I K +D
Sbjct: 65 DGDFLNIDFNISFPALSCEFASVDVSDVFGTHRLNISKTIRKVPID 110
>gi|414590454|tpg|DAA41025.1| TPA: putative thioredoxin superfamily protein [Zea mays]
Length = 435
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 65/106 (61%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIVDRSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LRI+F+++FPAL C SVD D+ G L++ + K +D
Sbjct: 65 DGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSID 110
>gi|299469370|emb|CBG91903.1| putative PDI-like protein [Triticum aestivum]
gi|299469398|emb|CBG91917.1| putative PDI-like protein [Triticum aestivum]
Length = 485
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 62/205 (30%), Positives = 101/205 (49%), Gaps = 37/205 (18%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 258
GC I GF+ V KV G+ + +SG H +F N+SH + +FG+
Sbjct: 294 GCRIEGFVRVKKVPGSVVISA-----RSGSH-----SFDPSQINVSHYVTTFSFGKRLSS 343
Query: 259 ---------FP---GVVNPLDG----VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 302
FP G + L G V+ + ++++++V T + +
Sbjct: 344 KMFNELKRLFPYVGGHHDRLAGQSYIVKHGDVNANVTIEHYLQIVKTELVTLR----YAK 399
Query: 303 QFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
+ V E + + L +P V F ++ SP++V TE SF HF+TNVCAI+GGVF
Sbjct: 400 ELKVLEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVF 459
Query: 360 TVSGIIDAFIYHGQRAIKKKIEIGK 384
TV+GI+D+ +++ R + KK+E+GK
Sbjct: 460 TVAGILDSILHNTLRLV-KKVELGK 483
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 64/106 (60%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++ +++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSIFAALAMVFLFGMELSSYLAVNTTTSVIVDRSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LRI+F+++FPAL C SVD D+ G L++ + K +D
Sbjct: 65 DGEFLRIDFNLSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSID 110
>gi|397568633|gb|EJK46248.1| hypothetical protein THAOC_35093 [Thalassiosira oceanica]
Length = 601
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 97/220 (44%), Gaps = 60/220 (27%)
Query: 197 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 256
E+E GC I GFL V++ GNFH H H+ N+SH IN L+FG
Sbjct: 403 EDEHPGCQISGFLLVDRAPGNFHIQAQSKNHDLAAHM----------TNVSHIINHLSFG 452
Query: 257 EHFP------GVVN----------PLDGVRWTQETPSGMYQYFIKVVPTVY--------- 291
+ F G+ N P DG + + +++KV+ T +
Sbjct: 453 KPFSKYFIKEGLKNTPAGFLDTTRPFDGNVYVTHNEHEAHHHYLKVITTEFEPQRDTKKQ 512
Query: 292 ------------TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTE 339
+ +QS+Q S+ R +P F YDLSPI V++++
Sbjct: 513 YGKKKGFYKPPEPQRAYQILQSSQLSLY---------RNDIVPEAKFTYDLSPIAVSYSK 563
Query: 340 EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 379
++ ++ + T++ AI+GG FTV G++++ +Y A+ KK
Sbjct: 564 KYRAWYDYFTSLMAIIGGTFTVVGMVESSLY----AVSKK 599
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 56/106 (52%), Gaps = 1/106 (0%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ SLD Y K+ D T G +++ ++ + M LFF E R + ++ T L +D++ +
Sbjct: 80 LASLDMYRKVPVDLLEGTKRGSIMSTLAIMSMATLFFLETRAFFSSSLSTNLALDSNTDQ 139
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG 113
+R+NF++T L C ++D + + G Q +V + K +D G
Sbjct: 140 NVRVNFNITMMDLRCDYATIDVVSVLGTQQ-NVTQHVQKYPIDQYG 184
>gi|145350046|ref|XP_001419434.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579665|gb|ABO97727.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 513
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 109/228 (47%), Gaps = 28/228 (12%)
Query: 168 EAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFA---PGK 224
EA +++ L P +D KR G GC I GF+ V KV G+ + P
Sbjct: 301 EAAQEENMKLRLPASVDMQKRI---------IGPGCAITGFVLVKKVPGHLWISASSPDH 351
Query: 225 SFHQSGVHVHDILAFQRDSFNISHKIN--------KLAFGEHFPGVVNPLDGVRWTQETP 276
SFH +++ ++ + F H+++ K GE + L R+
Sbjct: 352 SFHGETMNMTHVV----NHFYFGHQLSDERRRYLEKFHAGEKAGDWHDRLASERFVSNAA 407
Query: 277 SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 336
++++++ V T T +T+ + + T+H + + LP F Y SP+++
Sbjct: 408 HVSHEHYLQTVLTTITPRGRYTLPFSVYEYTQHSHAVHE----PLPKAKFHYQPSPMQIV 463
Query: 337 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
+EE ++F F+T++ AI+GGV++V GI D +++ +++K+E+GK
Sbjct: 464 VSEEKMAFYSFITSLMAIIGGVYSVMGIADGVLFNSLALVRRKLELGK 511
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 67/113 (59%), Gaps = 1/113 (0%)
Query: 9 RSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS-RGE 67
RS+D Y K+ D T SG VI++ ++++M L SELR Y ++ +TK++VD S GE
Sbjct: 35 RSVDFYRKLPRDMTEGTVSGSVISIFAAVLMTFLLLSELRSYSSSSFDTKVVVDRSVDGE 94
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQ 120
LRINF+++FPAL C SVD D G ++ +FK+ +D+ I Q
Sbjct: 95 LLRINFNLSFPALSCEFASVDVGDALGLNRFNLTKTVFKRAIDADMRAIGPLQ 147
>gi|414590456|tpg|DAA41027.1| TPA: putative thioredoxin superfamily protein [Zea mays]
Length = 439
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 65/106 (61%), Gaps = 1/106 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++V+++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIVDRSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
GE LRI+F+++FPAL C SVD D+ G L++ + K +D
Sbjct: 65 DGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSID 110
>gi|388497088|gb|AFK36610.1| unknown [Medicago truncatula]
Length = 457
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 66/108 (61%), Gaps = 1/108 (0%)
Query: 6 NKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TS 64
+K++S+D Y KI D + SG +++++++ M+ LF EL YL T T ++VD +S
Sbjct: 5 SKLKSVDFYRKIPRDLTEASLSGAGLSILAALAMMFLFGMELSNYLAVTTSTSVIVDKSS 64
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
G+ LRI+F+ +FPAL C SVD D+ G L++ + K +DS+
Sbjct: 65 DGDFLRIDFNFSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSIDSK 112
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 78/180 (43%), Gaps = 40/180 (22%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
GC + G++ V KV G+ + H +F N+SH IN L+FG+
Sbjct: 290 GCRVEGYVRVKKVPGSLVVSARSDAH----------SFDASQMNMSHVINHLSFGKK--- 336
Query: 262 VVNP---LDGVRW------------------TQETPSGM-YQYFIKVVPTVYTDVSGHTI 299
V P +D W T++ + +++I+VV T G+ +
Sbjct: 337 -VTPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIEHYIQVVKTEVITRKGYKL 395
Query: 300 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 359
++ T H S +P F +LSP++V TE SF HF+TNVCAI+GG F
Sbjct: 396 I-EEYEYTAH---SSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGCF 451
>gi|255074657|ref|XP_002501003.1| predicted protein [Micromonas sp. RCC299]
gi|226516266|gb|ACO62261.1| predicted protein [Micromonas sp. RCC299]
Length = 515
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 97/215 (45%), Gaps = 42/215 (19%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
GC I G VN+V G F+ P H D N++H + L+FG+H PG
Sbjct: 313 GCIIDGSFRVNRVPGAFYVTPHSMGHN----------LNPDVINMTHTVKHLSFGKHVPG 362
Query: 262 -----------VVNPL-----------DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 299
V N + D + E P+ ++++++K+V + + G +
Sbjct: 363 RPSYVPRNLRRVWNRVPKDLGGRFAAGDEATFYSEEPNTVHEHYLKIVSRTFEPLEGQAV 422
Query: 300 Q-------SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 351
Q SN+F + + + + P + F YD+SP+ V E L ++ +
Sbjct: 423 QLYEYTFNSNRFRLNPPLAADGDPDQHVDGPMIKFSYDVSPMSVVLKEVKKPLLDWILGM 482
Query: 352 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 386
CA++GGV+T +G+++ F+ A+K++ +GK S
Sbjct: 483 CALLGGVYTCAGLLETFLQSSVCAVKRR--VGKIS 515
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 56/111 (50%), Gaps = 1/111 (0%)
Query: 2 DAIMNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLV 61
D + ++S+D Y K+ + T+ GGV +++ V + LF +LR T T + V
Sbjct: 22 DGVGGALKSVDLYAKMPRELAEGTYLGGVFSILLMFVFVSLFGMQLRALWTVGTRTDIAV 81
Query: 62 DTSRGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVK-HDIFKKRLDS 111
D S ++NF V PAL C +VD +D G +H ++ I+K + +
Sbjct: 82 DHSEDAKFQVNFKVELPALSCEWATVDVIDALGTRHFNISGESIYKHSMGA 132
>gi|303279378|ref|XP_003058982.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460142|gb|EEH57437.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 486
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 64/111 (57%), Gaps = 1/111 (0%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS-R 65
K+RS+D Y KI D T G VI++ S++++ LL SE+ Y +TK++VD S
Sbjct: 8 KLRSVDFYRKIPRDMSEGTVPGSVISIGSALLIALLLVSEIGRYATPTWKTKVVVDRSLD 67
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVI 116
G+ ++INF+V+FPAL C SVD D G ++ +FK+ L G +
Sbjct: 68 GDMMKINFNVSFPALSCEFASVDVGDAMGLNRYNLTKTVFKRALARDGTPL 118
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 95/208 (45%), Gaps = 30/208 (14%)
Query: 193 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 252
+ ++ +G GC++ GF+ KV G+ + H +F + N++H +N
Sbjct: 291 ESVRAVKGPGCSVTGFVLAKKVPGHVWITANSNSH----------SFHPEEMNMTHTVNH 340
Query: 253 LAFGEHF----------------PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 296
L FG + L GV + + ++++++ V T +G
Sbjct: 341 LFFGNQLGRNKLKALERRERGASSNWHDKLAGVTFRSLQTNVTHEHYLQTVLTTLRP-AG 399
Query: 297 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 356
+ + + T+H + R LP F ++ SP++V TEE F HF+T + AIVG
Sbjct: 400 SYVAYHAYEYTQHSHALVTTR--ELPRAKFHFNPSPVQVVVTEEREPFYHFITTLMAIVG 457
Query: 357 GVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
GV++V GI D F+ H + +K E+GK
Sbjct: 458 GVYSVCGIADGFV-HNTLNMMRKFELGK 484
>gi|299115405|emb|CBN74236.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 447
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 92/199 (46%), Gaps = 30/199 (15%)
Query: 194 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 253
R+K++ GC + GF+ VN+V GNFH + H + + NISH + L
Sbjct: 264 RLKQDY-PGCQLSGFIMVNRVPGNFHIEARSALH----------SIDPTAANISHVVKTL 312
Query: 254 AFGEHFP---------GV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300
FG P GV + L+ ++ ++ ++IKVV T ++
Sbjct: 313 KFGTQVPVRGRRVIESGVELEGLPALEDRVYSIDSLHTAPHHYIKVVSTFVGGLAKTDNL 372
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
Q V+ EQ ++ P F YDLSP+ V + + FLT+V AIVGG FT
Sbjct: 373 QYQMMVSSQTMPYEQDQV---PEAKFSYDLSPMSVHIKQRRRKWYDFLTSVLAIVGGTFT 429
Query: 361 VSGIIDAFIYHGQRAIKKK 379
V G++D ++ R +K+K
Sbjct: 430 VVGVLDNILF---RVVKQK 445
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 59/109 (54%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
M I++ D Y KI D T G V++ + ML+LF ELR +L T + +D++
Sbjct: 1 MPTIKTFDFYRKIPLDLTETTLQGAVMSGCALFCMLILFLCELRAFLTPEVYTTVAIDSN 60
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG 113
+ LRINF++T ALPC SVD +D+ G +++ +I K D G
Sbjct: 61 QDSKLRINFNITMLALPCDYASVDVLDLLGTNKVNMTQNIVKWHTDENG 109
>gi|449530722|ref|XP_004172342.1| PREDICTED: protein disulfide isomerase-like 5-4-like, partial
[Cucumis sativus]
Length = 176
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 41/107 (38%), Positives = 65/107 (60%), Gaps = 1/107 (0%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSR- 65
K++S+D Y KI D T SG +++V+++ M+ LF EL YL+ T T ++VD S
Sbjct: 6 KLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIVDNSTD 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQ 112
G+ LR++F+++FPAL C +VD D+ G L++ I K +DS
Sbjct: 66 GDFLRMDFNISFPALSCEFAAVDVNDVLGTNRLNITKTIRKFSIDSN 112
>gi|297847442|ref|XP_002891602.1| hypothetical protein ARALYDRAFT_474215 [Arabidopsis lyrata subsp.
lyrata]
gi|297337444|gb|EFH67861.1| hypothetical protein ARALYDRAFT_474215 [Arabidopsis lyrata subsp.
lyrata]
Length = 484
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 63/105 (60%), Gaps = 1/105 (0%)
Query: 7 KIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD-TSR 65
KI+S+D Y KI D + SG +++V+++ ML LF EL YL T T ++VD +S
Sbjct: 6 KIKSVDFYRKIPRDLTEASLSGAGLSIVAALAMLFLFGMELSSYLAINTSTSVIVDKSSD 65
Query: 66 GETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLD 110
G+ L I+F+++FPAL C SVD D+ G L++ I K +D
Sbjct: 66 GDFLDIDFNISFPALSCEFASVDVSDVFGTHRLNITKTIRKVPID 110
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 98/204 (48%), Gaps = 36/204 (17%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
GC I G++ KV G + SG H +F N+SH + L+FG
Sbjct: 294 GCRIEGYVRAKKVPGELVISA-----HSGAH-----SFDASQMNMSHIVTHLSFGTMVSE 343
Query: 262 VV---------------NPLDGVRWTQETP---SGMYQYFIKVVPT-VYTDVSG--HTIQ 300
+ + L+G + + + ++++++V T V + SG H++
Sbjct: 344 RLWTDMKRLLPYLGQSHDRLNGKSFINQRKFDVNVTIEHYLQIVKTEVISRRSGKEHSLI 403
Query: 301 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360
++ T H S P F ++LSP++V +E SF HF+TNVCAI+GGVFT
Sbjct: 404 -EEYEYTAH---SSVAHSYHYPEAKFHFELSPMQVLISENPKSFSHFITNVCAIIGGVFT 459
Query: 361 VSGIIDAFIYHGQRAIKKKIEIGK 384
V+GI+D+ + R + KKIE+GK
Sbjct: 460 VAGILDSIFQNTVRMV-KKIELGK 482
>gi|297793639|ref|XP_002864704.1| hypothetical protein ARALYDRAFT_919317 [Arabidopsis lyrata subsp.
lyrata]
gi|297800754|ref|XP_002868261.1| hypothetical protein ARALYDRAFT_915383 [Arabidopsis lyrata subsp.
lyrata]
gi|297310539|gb|EFH40963.1| hypothetical protein ARALYDRAFT_919317 [Arabidopsis lyrata subsp.
lyrata]
gi|297314097|gb|EFH44520.1| hypothetical protein ARALYDRAFT_915383 [Arabidopsis lyrata subsp.
lyrata]
Length = 53
Score = 77.4 bits (189), Expect = 1e-11, Method: Composition-based stats.
Identities = 34/47 (72%), Positives = 38/47 (80%)
Query: 73 FDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESR 119
FD+ FPALPCSILSVDAMDISGE DVKHDI K+RLDS GN + +
Sbjct: 6 FDIRFPALPCSILSVDAMDISGELLCDVKHDIIKRRLDSNGNTLRGK 52
>gi|397641928|gb|EJK74922.1| hypothetical protein THAOC_03372 [Thalassiosira oceanica]
Length = 583
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/203 (28%), Positives = 91/203 (44%), Gaps = 43/203 (21%)
Query: 199 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 258
E GC + G L VN+V GNFH KS + H++ A N++H++N ++FGE
Sbjct: 385 EHPGCQVSGHLMVNRVPGNFHIE-AKSVN------HNLNAAMT---NLTHRVNHISFGEP 434
Query: 259 FPGV--------------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
+ NP+D + + ++IKVV T
Sbjct: 435 ITKLPYHMENTPFMRKVKRVLKQVPEEHKQFNPMDDQEYITTQFHQAFHHYIKVVSTHLN 494
Query: 293 DVSGHTIQSNQFSVTEHFRSSEQGRLQ-----TLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
S T+ N + ++ EQ ++ +P F YD+SP+ V +E + +
Sbjct: 495 MGSSSTV--NDVNSITVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVVVQKEGRKWYDY 552
Query: 348 LTNVCAIVGGVFTVSGIIDAFIY 370
LT++CAI+GG FT G+IDA +Y
Sbjct: 553 LTSLCAIIGGTFTTLGLIDATLY 575
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 46/92 (50%)
Query: 22 YSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGETLRINFDVTFPALP 81
+ T G ++++ + VM +LF SE + T + +D + +R+NF++T L
Sbjct: 122 FQATSLGALMSICAISVMGILFLSETLAFARTTMRTAIALDENDQPQIRLNFNITLMDLH 181
Query: 82 CSILSVDAMDISGEQHLDVKHDIFKKRLDSQG 113
C +SVD D G +V +I K +LD G
Sbjct: 182 CDYVSVDVWDTLGTNRQNVTKNIEKWQLDESG 213
>gi|219125194|ref|XP_002182871.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405665|gb|EEC45607.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 467
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 94/222 (42%), Gaps = 40/222 (18%)
Query: 173 KGWALSNPDLIDQCKREGFLQRIKEEEGE--GCNIYGFLEVNKVAGNFHFAPGKSFHQSG 230
K W D D + E Q ++ + GC + G L VN+V GNFH H
Sbjct: 254 KEWHSKASDSADPAEVEKKRQLYQQNRPDHPGCQVSGHLMVNRVPGNFHLEAKSKSHNLN 313
Query: 231 VHVHDILAFQRDSFNISHKINKLAFGE--------------HFP---GVVNPLDGVRWTQ 273
+ N+SH +N L+FGE P P+DG +
Sbjct: 314 AAM----------TNLSHVVNHLSFGEPIDENNRKSKRILKQVPEEHRQFAPMDGQAFLT 363
Query: 274 ETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ-----TLPGVFFFY 328
+ + ++IKVV T + S+ + ++ EQ ++ +P F Y
Sbjct: 364 KAFHQAFHHYIKVVSTHLN------MGSSDANSMLTYQFLEQSQIVFYDDVNVPEARFSY 417
Query: 329 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 370
DLSP+ V +E + +LT++CAI+GG FT G+IDA +Y
Sbjct: 418 DLSPMSVVVEKEGRKWYDYLTSLCAIIGGTFTTLGLIDATLY 459
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 57/106 (53%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ S+D Y ++ +D T G ++++ + +VM +LF SE + T + +D +
Sbjct: 1 MSSVDFYRRVPKDLTEATSLGAIMSVCALVVMGVLFLSETAAFARTGIATSITLDENTSP 60
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG 113
+R+NF++T L C +S+D D G +V +I K +LD+QG
Sbjct: 61 QIRLNFNITLTDLQCDYVSIDVWDALGTNKQNVTKNIDKWQLDAQG 106
>gi|223646904|gb|ACN10210.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
gi|223672767|gb|ACN12565.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
Length = 238
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/67 (49%), Positives = 42/67 (62%)
Query: 202 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 261
C I+G L VNKVAGNFH GK+ H H D++N SH+I+ L+FGE PG
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDTYNFSHRIDHLSFGEEIPG 228
Query: 262 VVNPLDG 268
++NPLDG
Sbjct: 229 IINPLDG 235
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 49/89 (55%), Gaps = 1/89 (1%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ ++ LDA+PK+ E + T +GG ++L++ M LL F E +Y + + + VD
Sbjct: 11 LTLVKELDAFPKVPESYVETTATGGTVSLIAFTAMALLAFLEFFVYRDTWMQYEYEVDKD 70
Query: 65 RGETLRINFDVTFPALPCSILSVDAMDIS 93
LRIN D+T A+ C + D +D++
Sbjct: 71 FSSKLRINIDITV-AMRCQFVGADVLDLA 98
>gi|224013158|ref|XP_002295231.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969193|gb|EED87535.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 492
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 89/205 (43%), Gaps = 43/205 (20%)
Query: 199 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 258
E GC + G L VN+V GNFH KS + H++ A N++H++N L+FGE
Sbjct: 290 EHPGCQVSGHLMVNRVPGNFHIE-AKSVN------HNLNAAMT---NLTHRVNHLSFGEP 339
Query: 259 FPGV--------------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYT 292
+ NP+D + + ++IKVV T
Sbjct: 340 ITKLPPHMENTPFMRKVKRVLKQVPEEHKQFNPMDDTEYVTAQFHQAFHHYIKVVSTHLN 399
Query: 293 --DVSGHTIQSNQFSVTEHFRSSEQGRLQ-----TLPGVFFFYDLSPIKVTFTEEHVSFL 345
S N + ++ EQ ++ +P F YD+SP+ V +E +
Sbjct: 400 MGSSSKSEYSVNDVNAVTVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVVVQKEGRKWY 459
Query: 346 HFLTNVCAIVGGVFTVSGIIDAFIY 370
+LT++CAI+GG FT G+IDA +Y
Sbjct: 460 DYLTSLCAIIGGTFTTLGLIDATLY 484
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 55/106 (51%)
Query: 8 IRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTSRGE 67
+ S+D Y ++ +D T G ++++ + VM +LFFSE + T + +D +
Sbjct: 13 MSSVDFYRRVPKDLTEATSLGAIMSICAITVMAILFFSETLAFARTAMVTSIALDENDQP 72
Query: 68 TLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQG 113
+R+NF++T L C +SVD D G +V +I K +LD G
Sbjct: 73 QIRLNFNITLMDLHCDFVSVDVWDTLGTNRQNVTKNIEKWQLDEDG 118
>gi|357474783|ref|XP_003607677.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355508732|gb|AES89874.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 156
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 85/163 (52%), Gaps = 31/163 (19%)
Query: 244 FNISHKINKLAFGEHFPGVVNP---LDGVRW------------------TQETPSGM-YQ 281
N+SH IN L+FG+ V P +D W T++ + +
Sbjct: 1 MNMSHVINHLSFGKK----VTPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIE 56
Query: 282 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEH 341
++I+VV T G+ + ++ T H S +P F +LSP++V TE
Sbjct: 57 HYIQVVKTEVITRKGYKLIE-EYEYTAH---SSVAHSVNIPVARFHLELSPMQVLITENQ 112
Query: 342 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
SF HF+TNVCAI+GGVFTV+GI+D+ +++ +A+ KKIEIGK
Sbjct: 113 KSFSHFITNVCAIIGGVFTVAGILDSILHNTIKAM-KKIEIGK 154
>gi|428185569|gb|EKX54421.1| hypothetical protein GUITHDRAFT_99900 [Guillardia theta CCMP2712]
Length = 475
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 103/237 (43%), Gaps = 55/237 (23%)
Query: 182 LIDQCKREGFLQRI--KEEEGE-------GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 232
L+ Q + R+ KE++GE GC + G L V + APG Q+
Sbjct: 260 LMKQVNLQAPKSRVVDKEQDGEKESHNGVGCMVAGMLHVQR-------APGSIILQA--- 309
Query: 233 VHDILAFQRDSFNISHKINKLAFGEHF---PGVVNP---------LDGVRWTQE--TPSG 278
V D F + ++SH +N L+FG VV P LD ++ E TP+
Sbjct: 310 VSDGHEFNWATMDVSHTVNHLSFGPFLSETAWVVMPPDIAQAVGSLDDKKFLSEERTPT- 368
Query: 279 MYQYFIKVVPTVY----------TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 328
++++++KVV V + G+ + +N+ R +P Y
Sbjct: 369 VWEHYVKVVKNVVELPRSWGIPPVEAHGYVVHTNKVQ-----------RYAEVPTARINY 417
Query: 329 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 385
D+ PI V S HFLT +CAIVGGVFTVSGI + + G ++ K IGK
Sbjct: 418 DILPIIVHVKTSRESNYHFLTKLCAIVGGVFTVSGIFASMVEGGIASLTHKETIGKL 474
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 55/113 (48%), Gaps = 11/113 (9%)
Query: 17 INEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVD------TSRGETLR 70
+N + T +G +I++++ ++M+ L +++ + +ET +++D T L+
Sbjct: 18 LNAELTEGTITGSIISILTGVLMVYLIVAQIFAWRALNSETSVVLDHYSHMKTGADSLLQ 77
Query: 71 INFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI 123
INF+ TF L C SVDA + G + + K LD G RQ G+
Sbjct: 78 INFNFTFNHLSCEYASVDAANFMGTHDAGISSKVTKVHLDKNG-----RQLGV 125
>gi|307110923|gb|EFN59158.1| hypothetical protein CHLNCDRAFT_138016 [Chlorella variabilis]
Length = 360
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 68/121 (56%), Gaps = 5/121 (4%)
Query: 5 MNKIRSLDAYPKINEDFYSRTFSGGVITLVSSIVMLLLFFSELRLYLNAVTETKLLVDTS 64
+ +++S+D Y K+ D T SG I++ ++ ++L L +EL Y++ T T ++VD S
Sbjct: 6 LARLKSVDFYRKLPTDLTEATLSGAAISIATTFIILFLLGAELSSYMSTQTRTDMVVDRS 65
Query: 65 -RGETLRINFDVTFPALPCSILSVDAMDISGEQHLDVKHDIFKK----RLDSQGNVIESR 119
GE LR+NF+++FP L C ++D D G + L++ + K+ + G +E +
Sbjct: 66 AHGELLRVNFNISFPQLSCEFATLDVSDAMGLKRLNLTKTVRKQPITEEMQRAGQAVEDK 125
Query: 120 Q 120
+
Sbjct: 126 K 126
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 55/97 (56%), Gaps = 13/97 (13%)
Query: 288 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 347
P + D +T+QS++++ +H + F Y +SPI++ TE+ F
Sbjct: 275 PELQFDAYEYTVQSHKYNAEDHASAK------------FTYKMSPIQIVVTEQPKQLYKF 322
Query: 348 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 384
LT +CA++GGVFTV+GI+D + H I KK+++GK
Sbjct: 323 LTAICAVIGGVFTVAGILDGMV-HQVNKIAKKVDLGK 358
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.139 0.419
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,130,027,028
Number of Sequences: 23463169
Number of extensions: 265384921
Number of successful extensions: 554176
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1061
Number of HSP's successfully gapped in prelim test: 53
Number of HSP's that attempted gapping in prelim test: 549614
Number of HSP's gapped (non-prelim): 1645
length of query: 386
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 242
effective length of database: 8,980,499,031
effective search space: 2173280765502
effective search space used: 2173280765502
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)